LucidWorks Search 2.5 Vs Lucene/Solr Matrix
|
System configuration and setup |
LucidWorks Search 2.5 |
Lucene/Solr 4.0 |
|
Built-in version-to-version migration tools |
|
|
|
Automated Cluster Provisioning (via "Chef") |
|
|
|
Hosted Option |
|
|
|
Monitoring |
|
|
|
REST API for configuration |
|
|
|
UI for configuration |
|
|
|
Wizard-based installation |
|
|
|
Security |
||
|
Access Control List integration for document access control |
|
|
|
HTTPS encryption between components |
|
|
|
LDAP or Active Directory for user authentication |
|
|
|
Solr filter queries (fq) for document access control |
|
|
|
Connectors |
||
|
Create custom connector via API |
|
|
|
HDFS |
|
|
|
Amazon Cloud S3 |
|
|
|
JDBC for Database Indexing |
|
|
|
Load CSV |
|
|
|
Local file systems |
|
|
|
Post in JSON format |
|
|
|
Post in XML format |
|
|
|
Java serialized objects (javabin) |
|
|
|
Index SharePoint Content |
|
|
|
SMB (Windows File Shares) |
|
|
|
SolrXML |
|
|
|
Websites and web content |
|
|
|
Indexing Content |
||
|
Batch Processing, i.e., forced full-site respidering with scheduling |
|
|
|
Embedded Luke to analyze indexes |
|
|
|
Near real-time |
|
|
|
New Lucene index format |
|
|
|
Scheduling content indexing |
|
|
|
Transform XML Solr documents via XSLT for indexing |
|
|
|
UI dashboard for monitoring crawl and index processes |
|
|
|
UI for configuration of data sources |
|
|
|
Improvements to Data Import Handler (DIH) |
|
|
|
Index Management |
||
|
Automaton-based auto-complete/suggest |
|
|
|
Data-specific indexing codecs |
|
|
|
Faster simultaneous indexing with lockless writes (DocumentsWriterPerThread) |
|
|
|
Per-field similarity definition |
|
|
|
Terms component for access to indexed terms |
|
|
|
UI for updating and adding fields |
|
|
|
Query Handling |
||
|
Joins |
|
|
|
Block Joins |
|
|
|
Pseudo-joins to select a set of documents based on their relationship to a second set of documents |
|
|
|
eDismax query parser |
|
|
|
Language Analysis: Hunspell Stemmer |
|
|
|
Language Analysis: KStemmer |
|
|
|
Language identifier |
|
|
|
Locale-sensitive range queries |
|
|
|
Lucid query parser |
|
|
|
Mixed-range operators (such as { and ]) are legal |
|
|
|
Open-ended ranges (such as [a TO *]) |
|
|
|
Real-time Get to retrieve the latest version of a document without a commit |
|
|
|
Spell check |
|
|
|
Spellcheck without separate index |
|
|
|
Unary operators (+,-,!) will not be operators if followed by whitespace |
|
|
|
Results and relevance |
||
|
Conditional function queries |
|
|
|
CSV response writer |
|
|
|
Grouping (field collapsing) |
|
|
|
Highlight a facet term instead of query term when browsing facet results |
|
|
|
Highlighting improvements (such as FastVectorHighlighter) |
|
|
|
JSON update request handler |
|
|
|
New ranking algorithms (BM25, LM, DFR, etc.) |
|
|
|
Numeric range faceting |
|
|
|
Numeric range faceting in distributed search |
|
|
|
Per-field/flexible scoring |
|
|
|
Pivot faceting |
|
|
|
Per-segment field faceting for improved performance with near real-time search |
|
|
|
Popularity-based ranking (Click Scoring Relevance Framework) |
|
|
|
Pseudo-field support to return extra data such as function query values, score explanation and field aliasing with stored fields |
|
|
|
Relevance function queries |
|
|
|
Ruby, JSON, XML, Python response writers |
|
|
|
Search Results Clustering |
|
|
|
SolrJ parses grouped and range responses |
|
|
|
Sort by output of a FunctionQuery |
|
|
|
Spatial search, filtering, boosting, sorting |
|
|
|
Suggester (auto complete) |
|
|
|
User alerts for new content |
|
|
|
Deployment and Monitoring |
||
|
Built-in QPS dashboard |
|
|
|
MBeans for crawling activity |
|
|
|
MBeans for filter, result and document caches |
|
|
|
MBeans for indexing activity |
|
|
|
MBeans for search activity |
|
|
|
Nagios Application Monitor integration |
|
|
|
Searchable log files |
|
|
|
StatsComponent support for dates and strings |
|
|
|
Zabbix Hardware Monitor integration |
|
|
|
Scaling and Performance |
||
|
Distributed search |
|
|
|
Distributed search grouping (distributed field collapsing) |
|
|
|
Fast fuzzy query |
|
|
|
Memory improvements for searching and sorting |
|
|
|
Post filters and filter cache controls |
|
|
|
SolrCloud Phase 1 (improved distributed search, cluster state, read-side fault tolerance and centralized configurations) |
|
|
|
SolrCloud Phase 2 (distributed indexing, real-time GET, read & write fault tolerance and cluster elasticity) |
|
|
|
Tiered merge policy |
|
|
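
Several Query Handling and Security rows above (filter queries for document access control, open-ended ranges, mixed-range operators, the eDismax parser) are easier to picture with a concrete request. The following is a minimal sketch, not taken from either product's documentation: it only builds the query string for a Solr select request. The field names `acl_groups` and `price` and the helper names are assumptions made for illustration; `q`, `fq`, `defType` and `wt` are standard Solr request parameters.

```python
from urllib.parse import urlencode, parse_qs


def range_clause(field, low=None, high=None, include_low=True, include_high=True):
    """Format a range clause; None becomes '*' (open-ended).

    Square brackets are inclusive and curly braces exclusive, and the two
    may be mixed, e.g. {10 TO 20].
    """
    left = "[" if include_low else "{"
    right = "]" if include_high else "}"
    lo = "*" if low is None else low
    hi = "*" if high is None else high
    return "{}:{}{} TO {}{}".format(field, left, lo, hi, right)


def build_select_params(user_query, allowed_groups, extra_filters=()):
    """Build a URL-encoded query string for a select request.

    'acl_groups' is a hypothetical multi-valued field holding the groups
    allowed to read each document. Group names are assumed not to contain
    double quotes; real code would need to escape them.
    """
    params = [("q", user_query), ("defType", "edismax"), ("wt", "json")]
    if allowed_groups:
        groups = " OR ".join('"{}"'.format(g) for g in allowed_groups)
        # The filter query restricts results without affecting relevance scores.
        params.append(("fq", "acl_groups:({})".format(groups)))
    for clause in extra_filters:
        params.append(("fq", clause))
    return urlencode(params)


if __name__ == "__main__":
    price_filter = range_clause("price", 10, None)  # open-ended upper bound
    print(build_select_params("lucene", ["staff", "eng"], [price_filter]))
```

Each `fq` value is sent as a separate parameter, so the access-control restriction and the range filter are applied independently of the main query.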


