"readlinkdb" is an alias for "org.apache.nutch.crawl.LinkDbReader"
Exports information on the Link Database or Returns information on an URL in the Link Database
Usage
nutch-0.8-dev/bin/nutch org.apache.nutch.crawl.LinkDbReader <linkdb> (-dump <out_dir> | -url <url>)
<linkdb>: Path to the linkdb directory.
[-dump <out_dir>]: Exports the linkdb to a file in <out_dir>
[-url <url>]: Prints statistics on <url> to System.out
Configuration Files
hadoop-default.xml
hadoop-site.xml
nutch-default.xml
nutch-site.xml
Other Files
- None.
Caveats and Notes
- None.