LucidWorks Search 2.5 Vs Lucene/Solr Matrix

System configuration and setup

LucidWorks Search 2.5

 Lucene/Solr 4.0

Built in version-to-version migration tools

Automated Cluster Provisioning (via "Chef")

Hosted Option

Monitoring

ReST API for configuration

UI for configuration

Wizard-based installation

Security

Access Control List integration for document access control

HTTPS encryption between components

LDAP or Active Directory for user authentication

Solr filter queries (fq) for document access control

Connectors

Create custom connector via API

HDFS

Amazon Cloud S3

JDBC for Database Indexing

Load CSV

Local file systems

Post in JSON format

Post in XML format

Java serialized objects (javabin)

Index Sharepoint Content

SMB (Window File Shares)

SolrXML

Websites and web content

Indexing Content

Batch Processing ie forced full site respidering with scheduling

Embedded Luke to analyze indexes

Near real-time

New Lucene index format

Scheduling content indexing

Transform XML Solr documents via XSLT for indexing

UI dashboard for monitoring crawl and index processes

UI for configuration of data sources

Improvements to Data Import Handling (DIH)

Index Management

Automation-based auto-complete/suggest

Data-specific indexing codecs

Faster simultaneous indexing with Lockless writes (DocumentWriterPerThread)

Per-field similarity definition

Terms component for access to indexed terms

UI for updating and adding fields

Query Handling

Joins

Block Joins

Psuedo-Joins to select a set of documents based on their relationship to a second set of documents

eDismax query parser

Language Analysis: Hunspell Stemmer

Language Analysis: KStemmer

Language identifier

Locale-sensitive range queries

Lucid query parser

Mixed-range operators (such as { and ]) are legal

Open-ended ranges (such as [a TO *])

Real-time Get to retrieve the latest version of a document without a commit

Spell check

Spellcheck without separate index

Unary operators (+,-,!) will not be operators if followed by whitespace

Results and relevance

Conditional function queries

CSV response writer

Grouping (field collapsing)

Highlight a facet term instead of query term when browsing facet results

Highlighting improvements (such as fastvectorhighlight)

JSON update request handler

New ranking algorithms (BM25, LM, DFR, etc.)

Numeric range faceting

Numeric range faceting in distributed search

Per-field/flexible scoring

Pivot faceting

Per-segment field faceting for improved performance with near real-time search

Popularity-based ranking (Click Scoring Relevance Framework)

Pseudo-field support to return extra data such as function query values, score explanation and field aliasing with stored fields

Relevance function queries

Ruby, JSON, XML, Python response writers

Search Results Clustering

SolrJ parses grouped and range responses

Sort by output of a FunctionQuery

Spatial search, filtering, boosting, sorting

Suggester (auto complete)

User alerts for new content

Deployment and Monitoring

Built-in QPS dashboard

MBeans for crawling activity

MBeans for filter, result and document caches

MBeans for indexing activity

MBeans for search activity

Nagios Application Monitor integration

Searchable log files

StatsComponent support for dates and strings

Zabbix Hardware Monitor integration

Scaling and Performance

Distributed search

Distributed search grouping (distributed field collapsing)

Fast fuzzy query

Memory improvements for searching and sorting

Post filters and filter cache controls

SolrCloud Phase 1 (improved distributed search, cluster state, read-side fault tolerance and centralized configurations

SolrCloud Phase 2 (distributed indexing, real-time GET, read & write fault tolerance and cluster elasticity

Tiered merge policy