
Identity

  • plugin name: languageidentifier
  • plugin version: none
  • provider: SamiSiren, JeromeCharron

  • plugin home url: LanguageIdentifierPlugin

  • plugin download url: Included with nutch source distribution
  • license: Same as Nutch
  • short description: Analyzer plugin that identifies the language of documents.
  • long description:
  • configureable parameters: lang.ngram.min.length, lang.ngram.max.length, lang.analyze.max.length
  • meta data added to index: lang
  • required jars:
  • plugin extension points:
  • plugin extension point interface:
  • plugin extension point xml snippet:
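
The three configurable parameters suggest an n-gram based approach, so a rough sketch of what they might control can help when tuning them. The class below is purely illustrative and is not the plugin's code: the name NGramSketch, the method ngrams, and the reading of lang.analyze.max.length as a cap on the number of characters examined are all assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not taken from the languageidentifier plugin.
final class NGramSketch {

    // Returns every character n-gram whose length lies between minLength and
    // maxLength (inclusive), looking only at the first analyzeMaxLength
    // characters of the text. The three arguments mirror, by assumption,
    // lang.ngram.min.length, lang.ngram.max.length and lang.analyze.max.length.
    static List<String> ngrams(String text, int minLength, int maxLength,
                               int analyzeMaxLength) {
        // Truncate the input so that long documents are only sampled.
        String sample = text.length() > analyzeMaxLength
                ? text.substring(0, analyzeMaxLength)
                : text;

        List<String> result = new ArrayList<>();
        for (int n = minLength; n <= maxLength; n++) {
            // Slide a window of width n over the sample.
            for (int i = 0; i + n <= sample.length(); i++) {
                result.add(sample.substring(i, i + n));
            }
        }
        return result;
    }
}
```

Under these assumptions, NGramSketch.ngrams("abcd", 1, 2, 3) examines only "abc" and yields a, b, c, ab, bc. A real identifier would then compare the frequencies of such n-grams against per-language profiles; that step is not shown here.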

Documentation

Implemented Languages and their ISO 639 Codes

  • da Danish
  • de German
  • el Greek
  • en English
  • es Spanish
  • fi Finnish
  • fr French
  • hu Hungarian
  • it Italian
  • nl Dutch
  • pl Polish
  • pt Portuguese
  • ru Russian
  • sv Swedish
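
Because the plugin adds a lang field to the index, client code may want to turn a stored code back into a readable name. The following is a minimal sketch using only the fourteen codes listed above; the class SupportedLanguages, the method nameFor, and the "unknown" fallback are hypothetical names chosen for this example and are not part of Nutch.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper; the table is copied from the list on this page.
final class SupportedLanguages {

    // Code-to-name table in the same order as the list above.
    static final Map<String, String> NAMES = new LinkedHashMap<>();
    static {
        NAMES.put("da", "Danish");
        NAMES.put("de", "German");
        NAMES.put("el", "Greek");
        NAMES.put("en", "English");
        NAMES.put("es", "Spanish");
        NAMES.put("fi", "Finnish");
        NAMES.put("fr", "French");
        NAMES.put("hu", "Hungarian");
        NAMES.put("it", "Italian");
        NAMES.put("nl", "Dutch");
        NAMES.put("pl", "Polish");
        NAMES.put("pt", "Portuguese");
        NAMES.put("ru", "Russian");
        NAMES.put("sv", "Swedish");
    }

    // Returns the language name for a code, or "unknown" (an assumed
    // fallback) when the code is null or not in the table.
    static String nameFor(String code) {
        if (code == null) {
            return "unknown";
        }
        return NAMES.getOrDefault(code.toLowerCase(Locale.ROOT), "unknown");
    }
}
```

For example, SupportedLanguages.nameFor("fi") returns "Finnish", while a code outside the list falls back to "unknown".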

LanguageIdentifierPlugin (last edited 2009-09-20 23:09:37 by localhost)