datanode is an alias for org.apache.nutch.ndfs.NDFS

The NDFS class holds the NDFS client and server.

DataNode controls just one critical table: block -> BLOCK_SIZE stream of bytes. That is, it maps each block to the stream of bytes stored for that block.

This table is stored on disk (the NameNode is responsible for asking other machines to replicate the data). The DataNode reports the table's contents to the NameNode at startup and periodically afterwards.
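
To make the idea of the block table concrete, here is a minimal Java sketch. It is not the Nutch NDFS code: the class name BlockTableSketch, its methods, and the BLOCK_SIZE value of 16 are all hypothetical, and the table is held in memory rather than on disk to keep the example short. It also assumes that a block holds at most BLOCK_SIZE bytes, which this page does not state.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical sketch of a DataNode-style block table; not the actual NDFS implementation. */
class BlockTableSketch {
    // Assumed value for illustration only; the real BLOCK_SIZE is defined by NDFS.
    static final int BLOCK_SIZE = 16;

    // The one critical table: block id -> bytes stored for that block.
    // Held in memory here; the real DataNode keeps this data on disk.
    private final Map<Long, byte[]> blocks = new TreeMap<>();

    /** Stores a block, rejecting data larger than BLOCK_SIZE (an assumption of this sketch). */
    public void put(long blockId, byte[] data) {
        if (data.length > BLOCK_SIZE) {
            throw new IllegalArgumentException("block larger than BLOCK_SIZE");
        }
        blocks.put(blockId, data.clone());
    }

    /** Returns a copy of the bytes for a block, or null if the block is unknown. */
    public byte[] get(long blockId) {
        byte[] data = blocks.get(blockId);
        return data == null ? null : data.clone();
    }

    /** The table's contents as a list of block ids, as might be reported to a NameNode. */
    public List<Long> blockReport() {
        return new ArrayList<>(blocks.keySet());
    }

    public static void main(String[] args) {
        BlockTableSketch table = new BlockTableSketch();
        table.put(2L, new byte[] {1, 2});
        table.put(1L, new byte[] {3});
        // Report once at "startup"; a real node would repeat this periodically.
        System.out.println("block report: " + table.blockReport());
    }
}
```

The sketch returns only block ids from blockReport(), on the assumption that a NameNode needs to know which blocks a node holds and not the bytes themselves.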

Usage: bin/nutch org.apache.nutch.ndfs.NDFS <dataDir> <localMachine> <namenode:port>


bin/nutch_datanode (last edited 2009-09-20 23:10:09 by localhost)