"merge" is an alias for "org.apache.nutch.indexer.IndexMerger"
Merges several segment indexes
Usage
nutch-0.8-dev/bin/nutch org.apache.nutch.indexer.IndexMerger [-workingdir <workingdir>] <outputIndex> <indexesDir> ...
[-workingdir <workingdir>]: Specifies a working directory for the merger located at <workingdir>.
<outputIndex>: Path to a directory where the merged index will be created.
<indexesDir>: Path to a directory containing indexes to merge. More then one directory may be specified.
Configuration Files
hadoop-default.xml
hadoop-site.xml
nutch-default.xml
nutch-site.xml
Other Files
- None.
Caveats and Notes
- index.done file is not created.