February 2014, Apache Lucene™ 4.7 available
The Lucene PMC is pleased to announce the release of Apache Lucene 4.7

Apache Lucene is a high-performance, full-featured text search engine
library written entirely in Java. It is a technology suitable for nearly
any application that requires full-text search, especially cross-platform.

This release contains numerous bug fixes, optimizations, and
improvements, some of which are highlighted below. The release
is available for immediate download at:
  http://lucene.apache.org/core/mirrors-core-latest-redir.html

See the CHANGES.txt file included with the release for a full list of
details.

Lucene 4.7 Release Highlights:

* When sorting by String (SortField.STRING), you can now specify whether
  missing values should be sorted first (the default), or last.

* Add two memory resident dictionaries (FST terms dictionary and FSTOrd
  terms dictionary) to improve primary key lookups. The
  PostingsBaseFormat API is also changed so that term dictionaries get
  the ability to block encode term metadata, and all dictionary
  implementations can now plug in any PostingsBaseFormat.  See
  http://blog.mikemccandless.com/2013/09/lucene-now-has-in-memory-terms.html

* NRT support for file systems that do not have delete on last close or
  cannot delete while referenced semantics.

* Add LongBitSet for managing more than 2.1B bits (otherwise use
  FixedBitSet).

* Speed up Lucene range faceting from O(N) per hit to O(log(N)) per hit
  using segment trees.  See
  http://blog.mikemccandless.com/2013/12/fast-range-faceting-using-segment-trees.html

* Add SearcherTaxonomyManager over search and taxonomy index directories
  (i.e. not only NRT).

* Drilling down or sideways on a Lucene facet range (using
  Range.getFilter()) is now faster for costly filters (uses random
  access, not iteration); range facet counts now accept a fast-match
  filter to avoid computing the value for documents that are out of
  bounds, e.g. using a bounding box filter with distance range faceting.

* Add Analyzer for Kurdish.

* Add Payload support to FileDictionary (Suggest) and make it more
  configurable.

* Add a new BlendedInfixSuggester, which is like AnalyzingInfixSuggester
  but boosts suggestions that matched tokens with lower positions.

* Add SimpleQueryParser: parser for human-entered queries.

* Add multitermquery (wildcards,prefix,etc) to PostingsHighlighter.

* Upgrade to Spatial4j 0.4.1: Parses WKT (including ENVELOPE) with
  extension "BUFFER"; buffering a point results in a Circle. JTS isn't
  needed for WKT any more but remains required for Polygons. New Shapes:
  ShapeCollection and BufferedLineString.  More info:
  https://github.com/spatial4j/spatial4j/blob/master/CHANGES.md

* Add spatial SerializedDVStrategy that serializes a binary
  representation of a shape into BinaryDocValues. It supports exact
  geometry relationship calculations.

* Various bugfixes and optimizations since the 4.6.1 release.

Please read CHANGES.txt for a full list of new features.

Please report any feedback to the mailing lists
(http://lucene.apache.org/core/discussion.html)

Note: The Apache Software Foundation uses an extensive mirroring network
for distributing releases.  It is possible that the mirror you are using
may not have replicated the release yet.  If that is the case, please
try another mirror.  This also goes for Maven access.
  • No labels