The plugin enables German-language stemming during indexing and searching. German stop words are removed from both the indexed content and the query.
The package contains:
Download at http://nutch.eventax.com/
Default stop word filename: german-stopword.txt
german-stopword.txt must be placed in the CLASSPATH/conf directory.
Syntax:
#List of stopwords:
der die
das and
a ...
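The stop word file format above could be read with something like the following minimal sketch. It is not the plugin's own loader: the class name `StopwordFileSketch` and the method `parse` are hypothetical, and the sketch assumes that lines beginning with `#` are comments and that words on a line are separated by whitespace.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper for illustration only; not part of the plugin.
public class StopwordFileSketch {

    // Assumption: '#' starts a comment line, and words on a line
    // are separated by whitespace.
    public static Set<String> parse(Reader in) throws IOException {
        Set<String> words = new LinkedHashSet<>();
        BufferedReader reader = new BufferedReader(in);
        String line;
        while ((line = reader.readLine()) != null) {
            line = line.trim();
            if (line.isEmpty() || line.startsWith("#")) {
                continue; // skip blank lines and comment lines
            }
            for (String word : line.split("\\s+")) {
                words.add(word);
            }
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        // Sample content taken from the syntax example above.
        String sample = "#List of stopwords:\nder die\ndas and\n";
        System.out.println(parse(new StringReader(sample)));
    }
}
```

In a real setup the `Reader` would wrap german-stopword.txt loaded from the conf directory rather than an in-memory string.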
The plugin uses the GermanAnalyzer from the Lucene package.
The GermanBasicIndexingFilter works approximately 10
Stop words may be used in a query. They are ignored for matching, but are highlighted in the results like normal hits.
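The query behaviour described here can be pictured as keeping two term lists: one for matching and one for highlighting. The sketch below is only an illustration of that idea, not the plugin's code. The class `QueryStopwordSketch`, its methods, and the three-word stop list are all hypothetical, and the query is assumed to be split on whitespace.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; not part of the plugin.
public class QueryStopwordSketch {

    // Assumption: a tiny stop list standing in for german-stopword.txt.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("der", "die", "das"));

    // All query terms, lowercased; these are the terms to highlight.
    public static List<String> highlightTerms(String query) {
        List<String> terms = new ArrayList<>();
        String trimmed = query.trim().toLowerCase(Locale.GERMAN);
        if (trimmed.isEmpty()) {
            return terms;
        }
        terms.addAll(Arrays.asList(trimmed.split("\\s+")));
        return terms;
    }

    // Terms used for matching: the same list with stop words removed.
    public static List<String> searchTerms(String query) {
        List<String> terms = highlightTerms(query);
        terms.removeIf(STOP_WORDS::contains);
        return terms;
    }

    public static void main(String[] args) {
        System.out.println(searchTerms("die Katze"));    // [katze]
        System.out.println(highlightTerms("die Katze")); // [die, katze]
    }
}
```

Stemming is left out of this sketch to keep the two lists easy to compare.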
– HammoudaBouyedda - 28 Sep 2004