March 2013, Apache Lucene™ 4.2 available
The Lucene PMC is pleased to announce the release of Apache Lucene 4.2

Apache Lucene is a high-performance, full-featured text search engine
library written entirely in Java. It is a technology suitable for nearly
any application that requires full-text search, especially cross-platform.

This release contains numerous bug fixes, optimizations, and
improvements, some of which are highlighted below. The release
is available for immediate download at:
   http://lucene.apache.org/core/mirrors-core-latest-redir.html

See the CHANGES.txt file included with the release for a full list of
details.

Lucene 4.2 Release Highlights:

 * Lucene 4.2 has a new default codec (Lucene42Codec) with a more
   efficient docvalues format (sorted bytes in FST, less addressing
   overhead, improved numeric compression) and smaller term
   vectors (LZ4-compressed terms dictionaries and payloads, delta-encoded
   positions and offsets using blocks of packed integers).

 * The external and codec APIs for doc values, and their implementations,
   have been simplified: the codec is no longer responsible for buffering
   doc values; the numerous types have been consolidated down to only
   three (NUMERIC, BINARY, SORTED); PerFieldDocValuesFormat lets you
   set a different format for each field; and the doc values and FieldCache
   APIs were unified.

 * Significant refactoring and performance enhancements to the facet
   module, resulting in an overall ~3.8X speedup in one case (single Date
   field faceting).

 * DrillDownQuery in the facet module now supports multi-select.

 * A new DrillSideways class computes facet counts for both hits and
   near-misses in a single query.  See
   http://blog.mikemccandless.com/2013/02/drill-sideways-faceting-with-lucene.html

 * An additional docvalues type (SORTED_SET) was added that supports multiple
   values.

 * FSTs are a bit smaller, and the FST package supports FSTs over 2GB
   in size.

 * A new LiveFieldValues class lets you get live or real-time values
   for any indexed doc / field.  See
   http://blog.mikemccandless.com/2013/01/getting-real-time-field-values-in-lucene.html

 * Added a new classification module.

 * Various bugfixes and optimizations since the 4.1 release.
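
To illustrate the "delta-encoded positions and offsets using blocks of
packed integers" highlight, here is a minimal sketch of the general
technique in plain Java. It is not Lucene's actual term vectors format:
the class DeltaPackedSketch and its methods are hypothetical names, and
it encodes a single block of non-negative, non-decreasing values using
one fixed bit width.

```java
/** Illustrative sketch only; hypothetical names, not Lucene's on-disk format. */
final class DeltaPackedSketch {

    /** One encoded block: how many values, the bit width per delta, and the packed bits. */
    static final class Block {
        final int count;
        final int bitsPerValue;
        final long[] packed;

        Block(int count, int bitsPerValue, long[] packed) {
            this.count = count;
            this.bitsPerValue = bitsPerValue;
            this.packed = packed;
        }
    }

    /** Delta-encodes non-negative, non-decreasing values and packs the deltas. */
    static Block encode(int[] sorted) {
        int[] deltas = new int[sorted.length];
        int previous = 0;
        int max = 0;
        for (int i = 0; i < sorted.length; i++) {
            deltas[i] = sorted[i] - previous;
            if (deltas[i] < 0) {
                throw new IllegalArgumentException("values must be non-negative and non-decreasing");
            }
            previous = sorted[i];
            max = Math.max(max, deltas[i]);
        }
        // Smallest width that can hold the largest delta (at least 1 bit).
        int bits = Math.max(1, 32 - Integer.numberOfLeadingZeros(max));
        long[] packed = new long[(int) (((long) sorted.length * bits + 63) / 64)];
        for (int i = 0; i < deltas.length; i++) {
            long bitIndex = (long) i * bits;
            int word = (int) (bitIndex >>> 6);
            int shift = (int) (bitIndex & 63);
            packed[word] |= ((long) deltas[i]) << shift;
            if (shift + bits > 64) {
                // The value straddles two longs: store the high bits in the next word.
                packed[word + 1] |= ((long) deltas[i]) >>> (64 - shift);
            }
        }
        return new Block(sorted.length, bits, packed);
    }

    /** Unpacks the deltas and sums them to restore the original values. */
    static int[] decode(Block block) {
        int[] out = new int[block.count];
        long mask = (1L << block.bitsPerValue) - 1;
        int previous = 0;
        for (int i = 0; i < block.count; i++) {
            long bitIndex = (long) i * block.bitsPerValue;
            int word = (int) (bitIndex >>> 6);
            int shift = (int) (bitIndex & 63);
            long value = block.packed[word] >>> shift;
            if (shift + block.bitsPerValue > 64) {
                value |= block.packed[word + 1] << (64 - shift);
            }
            previous += (int) (value & mask);
            out[i] = previous;
        }
        return out;
    }

    public static void main(String[] args) {
        int[] positions = {3, 7, 8, 15, 16, 42};
        Block block = encode(positions);
        System.out.println(block.bitsPerValue + " bits per delta, " + block.packed.length + " long(s)");
        System.out.println(java.util.Arrays.toString(decode(block)));
    }
}
```

Because the deltas between neighbouring positions are usually much
smaller than the positions themselves, each one fits in a few bits. The
six sample positions above need 5 bits per delta and fit in a single
long.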

Please read CHANGES.txt for a full list of new features.
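
For readers unfamiliar with the drill sideways idea mentioned in the
highlights, the sketch below shows the counting rule in plain Java,
without the Lucene API. It assumes a near-miss is a document that
matches every selected dimension except one, and that such a document
contributes only to the counts of the dimension it failed. The class
DrillSidewaysSketch and its methods are hypothetical and say nothing
about how the real DrillSideways class is implemented.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Illustrative sketch only; hypothetical names, not the Lucene facet module API. */
final class DrillSidewaysSketch {

    /** Builds a document from alternating dimension/value pairs. */
    static Map<String, String> doc(String... dimValuePairs) {
        Map<String, String> fields = new HashMap<>();
        for (int i = 0; i + 1 < dimValuePairs.length; i += 2) {
            fields.put(dimValuePairs[i], dimValuePairs[i + 1]);
        }
        return fields;
    }

    /** Returns dimension -> (value -> count), counting hits and near-misses in one pass. */
    static Map<String, Map<String, Integer>> count(List<Map<String, String>> docs,
                                                   Map<String, String> selected) {
        Map<String, Map<String, Integer>> counts = new TreeMap<>();
        for (Map<String, String> document : docs) {
            // Collect the selected dimensions this document does not match.
            List<String> failed = new ArrayList<>();
            for (Map.Entry<String, String> selection : selected.entrySet()) {
                if (!selection.getValue().equals(document.get(selection.getKey()))) {
                    failed.add(selection.getKey());
                }
            }
            if (failed.size() > 1) {
                continue; // fails two or more dimensions: neither a hit nor a near-miss
            }
            for (Map.Entry<String, String> field : document.entrySet()) {
                String dim = field.getKey();
                // A hit counts toward every dimension; a near-miss counts only
                // toward the single dimension it failed.
                if (failed.isEmpty() || failed.get(0).equals(dim)) {
                    counts.computeIfAbsent(dim, k -> new TreeMap<>())
                          .merge(field.getValue(), 1, Integer::sum);
                }
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        List<Map<String, String>> docs = new ArrayList<>();
        docs.add(doc("color", "red", "size", "S"));
        docs.add(doc("color", "red", "size", "M"));
        docs.add(doc("color", "blue", "size", "S"));
        docs.add(doc("color", "blue", "size", "M"));
        System.out.println(count(docs, doc("color", "red", "size", "S")));
    }
}
```

With color=red and size=S selected, only the first document is a hit,
yet the "blue" and "M" values still receive counts from the two
near-misses, so a user interface can show what selecting them instead
would return.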

Please report any feedback to the mailing lists
(http://lucene.apache.org/core/discussion.html)

Note: The Apache Software Foundation uses an extensive mirroring network
for distributing releases.  It is possible that the mirror you are using
may not have replicated the release yet.  If that is the case, please
try another mirror.  This also goes for Maven access.

ReleaseNote42 (last edited 2013-03-11 18:35:40 by jpountz)