Plugin: German

!!! This plugin is based on old Nutch classes and does not run with the current Nutch version !!!

The plugin enables German-language stemming during indexing and searching. German stop words are removed from both the indexed content and the query.

The package contains:

Download at http://nutch.eventax.com/

Config File Options

german.stopword.file

Default filename: german-stopword.txt

german-stopword.txt must be placed in the CLASSPATH/conf directory.

Syntax:
#List of stopwords:
der die
das and
a ...
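
As an illustration of the syntax above, here is a minimal sketch of how such a file could be read. It is not the plugin's actual code. It assumes that lines starting with # are comments, that words on a line are separated by whitespace, and that words are compared in lower case. The class name StopwordFileSketch and the method parse are hypothetical.

```java
import java.util.LinkedHashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper, not part of the plugin: parses stop word file content.
class StopwordFileSketch {

    // Returns the stop words found in the given file content.
    // Assumptions: '#' starts a comment line, words are whitespace-separated,
    // and words are normalized to lower case.
    static Set<String> parse(String content) {
        Set<String> words = new LinkedHashSet<>();
        for (String rawLine : content.split("\\R")) {
            String line = rawLine.trim();
            if (line.isEmpty() || line.startsWith("#")) {
                continue; // skip blank lines and comments
            }
            for (String token : line.split("\\s+")) {
                words.add(token.toLowerCase(Locale.GERMAN));
            }
        }
        return words;
    }

    public static void main(String[] args) {
        String content = "#List of stopwords:\nder die\ndas\n";
        System.out.println(parse(content));
    }
}
```

Keeping the words in a set makes the later lookup during indexing and query parsing cheap, and duplicates in the file do no harm.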

Internal Documentation

The GermanAnalyzer from the Lucene package is used.

The GermanBasicIndexingFilter works approximately 10

Searching

Stop words may be used in the query. They are ignored for matching, but emphasized in the results like normal hits.
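
The behaviour described here can be sketched as follows. This is only an illustration under stated assumptions, not the plugin's implementation: the query is assumed to be split on whitespace, stop words are assumed to be compared in lower case, and stemming is left out. The class QueryStopwordSketch and its methods matchTerms and highlightTerms are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical sketch, not part of the plugin.
class QueryStopwordSketch {

    // All terms of the query; these are the ones emphasized in the results.
    static List<String> highlightTerms(String query) {
        List<String> terms = new ArrayList<>();
        for (String term : query.trim().split("\\s+")) {
            if (!term.isEmpty()) {
                terms.add(term);
            }
        }
        return terms;
    }

    // Terms actually used for matching: stop words are dropped.
    static List<String> matchTerms(String query, Set<String> stopwords) {
        List<String> terms = new ArrayList<>();
        for (String term : highlightTerms(query)) {
            if (!stopwords.contains(term.toLowerCase(Locale.GERMAN))) {
                terms.add(term);
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        Set<String> stopwords = new HashSet<>(Arrays.asList("der", "die", "das"));
        String query = "der Hund die Katze";
        System.out.println("match:     " + matchTerms(query, stopwords));
        System.out.println("highlight: " + highlightTerms(query));
    }
}
```

Keeping two separate term lists is one simple way to ignore stop words for matching while still emphasizing them in the displayed hits.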

HammoudaBouyedda - 28 Sep 2004
