This diagram depicts most of the dependencies among the Nutch modules (packages in Java parlance). Modules in the same box are interdependent (for example, fetcher and parse each depend on the other) and dependent on modules in boxes below them (for example, fs depends on ipc, io, and util, but not net or plugin).

tools

JSP UI

indexer searcher analysis

Lucene

fetcher parse

fs <!-- the width=20

ipc

io

util

(This text and diagram are up-to-date as of Nutch 0.5.)

There are a few things omitted from this diagram. The html, pagedb, and linkdb packages have been omitted entirely as uninteresting. MAXOUTLINKSPER_PAGE is in tools.UpdateDatabaseTool but is used by several modules further down the stack; and util.ScoreStats, which depends on db and therefore transitively on net and io (?), but upon which nothing depends, should probably be in tools instead of in util.

Also, plugins provide much of Nutch's functionality, and they are completely omitted from this diagram.

NutchLayerDiagram (last edited 2009-09-20 23:09:50 by localhost)