Differences between revisions 223 and 225 (spanning 2 versions)
Revision 223 as of 2011-08-26 15:50:43
Size: 4674
Revision 225 as of 2011-08-26 16:35:33
Size: 4832
Deletions are marked like this. Additions are marked like this.
Line 49: Line 49:
 * TaskList -- Tasks for Nutch developers.  * TaskList -- Tasks for Nutch developers. /!\ :Severe update required: /!\
Line 56: Line 56:
 * IndexStructure  * IndexStructure /!\ :This page needs a slight update to provide more information on plugins and the data they send to Solr for indexing: /!\

Welcome to the Apache Nutch Wiki


Please contribute your knowledge about Nutch here!

Nutch Version 1.3 Administration


  • RunningNutchAndSolr - How to configure Nutch 1.3 to crawl in local mode and post to Apache Solr for search/index.

  • RunningNutchInDeployMode - How to configure Nutch 1.3 to crawl in deploy mode. /!\ :TODO:This tutorial is in construction. /!\

  • Hadoop Tutorial Nutch being based Hadoop, it helps to have a better understanding of Hadoop.

  • RunNutchInEclipse - How to configure, build, crawl and debug Nutch 1.3 within Eclipse


General Information

Nutch Development

Nutch 2.0

Pre Nutch 1.3 and Archive

How to edit this Wiki

This Wiki is a collaborative site, anyone can contribute and share:

  • Create an account by clicking the "Login" link at the top of any page, and picking a username and password.
  • Edit any page by pressing Edit at the top or the bottom of the page

There are some conventions used on the Nutch wiki:

  • /!\ :TODO: /!\ (/!\ :TODO: /!\ ) is used to denote sections that definitely need to be cleaned up.

Some general info on using this Wiki Software:

FrontPage (last edited 2018-09-27 15:44:39 by RoannelFernandez)