Solr is a standalone enterprise search server with a REST-like API. You put documents in it (called "indexing") via JSON, XML, CSV or binary over HTTP. You query it via HTTP GET and receive JSON, XML, CSV or binary results. Solr uses the Lucene search library and extends it.
Here are some of the main features that solr provides:
- Advanced Full-Text Search Capabilities
Powered by Luceneā¢, Solr enables powerful matching capabilities including phrases, wildcards, joins, grouping and much more across any data type
Solr is proven at extremely large scales the world over
Solr uses the tools you use to make application building a snap
Solr ships with a built-in, responsive administrative user interface to make it easy to control your Solr instances
Need more insight into your instances? Solr publishes loads of metric data via JMX
Built on the battle-tested Apache Zookeeper, Solr makes it easy to scale up and down. Solr bakes in replication, distribution, rebalancing and fault tolerance out of the box.
Solr's is designed to adapt to your needs all while simplifying configuration
Want to see your updates now? Solr takes advantage of Lucene's Near Real-Time Indexing capabilities to make sure you see your content when you want to see it
Solr publishes many well-defined extension points that make it easy to plugin both index and query time plugins. Of course, since it is Apache-licensed open source, you can change any code you want!
Some solr cool features:
Use Solr's data-driven schemaless mode when getting started and then lock it down when it's time for production.
Solr ships with optional plugins for indexing rich content (e.g. PDFs, Word), language detection, search results clustering and more
Slice and dice your data as you see fit using a large array of faceting algorithms
Enabling location-based search is simple with Solr's built-in support for spatial search
Solr ships with advanced capabilites for auto-complete (typeahead search), spell checking and more
Solr ships with Apache Tika built-in, making it easy to index rich content such as Adobe PDF, Microsoft Word and more.