Differences between revisions 1 and 2
Revision 1 as of 2006-08-23 14:13:19
Size: 510
Editor: pool-68-160-34-54
Revision 2 as of 2009-09-20 23:10:16
Size: 512
Editor: localhost
Comment: converted to 1.6 markup
Deletions are marked like this. Additions are marked like this.
Line 5: Line 5:
 * see the [http://lucene.apache.org/nutch/tutorial8.html Tutorial]  * see the [[http://lucene.apache.org/nutch/tutorial8.html|Tutorial]]

Upgrade From Nutch 0.7 To Nutch 0.8

Configuration changes

  • see the Tutorial

    • put your root urls in urls/whatever_name instead of urls
    • make sure you set up http.agent.name

Index migration

Unfortunately, the data is not portable between these versions. The only thing you could do to preserve your webdb is to dump it into a text file, and then inject into a 0.8 crawldb. As for the segments, you will have to refetch them.

UpgradeFrom07To08 (last edited 2009-09-20 23:10:16 by localhost)