http://sourceforge.net/projects/normalizer/ "Contextual rule-based text normalization engine written in Java that can be used to implement stemming algorithms or phonetic normalizers. The project includes a French stemmer/phonetic normalizer and a Soundex implementation. Includes a Lucene Analyzer."
Nutch (http://www.nutch.org/) is a web search engine that uses Lucene as its search technology. It adds a scalable crawler, link database, distributed searching, and a web-based search interface.