The configuration files specific to Nutch 2.x and Gora are listed below. They are present in the /conf/ directory once the source package is downloaded and extracted, and in the $NUTCH_HOME/runtime/local/conf/ directory once the source is built using ant.

1) gora-hbase-mapping.xml

2) gora-sql-mapping.xml

3) gora.properties

4) gora-accumulo-mapping.xml

5) gora-cassandra-mapping.xml

Check the mapping file for the data store you use to see how Nutch 2.x maps its data onto that store's schema.
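To confirm which of these files are actually present in a given checkout or build, a small script along the following lines may help. It is only a sketch: the helper name `find_gora_configs` is hypothetical, and it assumes `NUTCH_HOME` is set in the environment (falling back to the current directory if it is not).

```python
import os
from pathlib import Path

# File names taken from the list above.
GORA_CONFIG_FILES = [
    "gora-hbase-mapping.xml",
    "gora-sql-mapping.xml",
    "gora.properties",
    "gora-accumulo-mapping.xml",
    "gora-cassandra-mapping.xml",
]


def find_gora_configs(conf_dir):
    """Map each expected file name to True if it exists in conf_dir."""
    conf_dir = Path(conf_dir)
    return {name: (conf_dir / name).is_file() for name in GORA_CONFIG_FILES}


if __name__ == "__main__":
    # Assumption: NUTCH_HOME points at the extracted source tree.
    nutch_home = Path(os.environ.get("NUTCH_HOME", "."))
    # Check both locations mentioned above: the source conf directory
    # and the one produced by the ant build.
    for conf_dir in (nutch_home / "conf", nutch_home / "runtime" / "local" / "conf"):
        print(conf_dir)
        for name, present in find_gora_configs(conf_dir).items():
            status = "found" if present else "missing"
            print(f"  {status:8}{name}")
```

A file reported as missing under runtime/local/conf but found under conf usually just means the source has not been built with ant yet.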
