Working With Gora Snapshots

Apache Gora is released as source code only (as oppose to both binary and source) due to changing user requirements and the fact that code may need to be compiled and recompiled in an ad-hoc fashion. This however poses a bit of a problem for Nutch'ers when they need to go and get the code as they may need to work with Gora SNAPSHOT's.

For an official account of how to work with Gora SNAPSHOT's in your projects please consult the Gora Quickstart Guide

Since NUTCH-1696 was committed it is a piece of cake to work with Gora SNAPSHOT's within Nutch 2.x. Simply read the following

  1. Open $NUTCH_HOME/ivy/ivy.xml and look for the Gora dependencies 2. Change the dependency configuration from this

<dependency org="org.apache.gora" name="gora-core" rev="0.3" conf="*->default"/>

to this

<dependency org="org.apache.gora" name="gora-core" rev="0.4-SNAPSHOT" conf="*->default" changing="true"/>

You'll now be able to work with Gora SNAPSHOT's.

N.B. It should be noted that from time to time Avro schema's may change and require re-compilation of persistent classes. This is OK, simply run * ant generate-gora-src* from $NUTCH_HOME and watch the GoraCompiler do it's work.

See below for an additional schema resource.

[0] https://issues.apache.org/jira/secure/attachment/12559893/webpage.avsc

  • No labels