Architecture

TODO

NGram profile format

TODO

Generating some NGrams profiles

Generating a new language profile in Nutch is really easy. Simply launch the following command:

java org.apache.nutch.analysis.lang.NGramProfile -create <profile-name> <filename> <encoding>

where

Open Issues