CommandLineOptions

Command Line Options of bin/nutch

See each entry for datails of the command arguments and options.

command

function

bin/nutch admin

Web page and link database administration, including creation

bin/nutch analyze

Adjust database link-analysis scoring

bin/nutch crawl

Perform complete crawling and indexing of a set of root urls

bin/nutch datanode

NDFS data node

bin/nutch dedup

Deletes duplicate documents in a set of segment indexes

bin/nutch fetch

Fetch a segment's pages

bin/nutch fetchlist

Print the fetchlist of a segment

bin/nutch generate

Generate new segments to fetch

bin/nutch index

Run the indexer on a segment's fetcher output

bin/nutch inject

Inject new urls into the web page and link database

bin/nutch merge

Merge several segment indexes

bin/nutch mergesegs

Merges multiple segments & removes duplicates

bin/nutch namenode

NDFS name node

bin/nutch ndfs

NDFS administrative access

bin/nutch parse

Parse contents in one segment

bin/nutch prune

Prunes existing Nutch indexes of unwanted content

bin/nutch readdb

Read data from the web page and link db

bin/nutch segread

Read data in an existing segment

bin/nutch segslice

Divide data from one segement into several segments

bin/nutch server

Run a search server of IPC connections

bin/nutch updatedb

Updates the web page and link db from the segment fetcher output

last edited 2005-07-17 01:06:51 by RobPettengill