Differences between revisions 227 and 228
Revision 227 as of 2011-09-02 19:55:27
Size: 4812
Comment:
Revision 228 as of 2011-09-02 20:13:32
Size: 4854
Comment:
Deletions are marked like this. Additions are marked like this.
Line 14: Line 14:
 * [[NutchHadoopTutorial|Nutch Hadoop Tutorial]] - How to setup and run Nutch in deploy mode over a Hadoop cluster.  * [[NutchHadoopTutorial|Nutch Hadoop Tutorial]] - How to setup and run Nutch in deploy mode over a Hadoop cluster. /!\ :This tutorial is in development: /!\

Welcome to the Apache Nutch Wiki

http://www.interadvertising.co.uk/files/nutch_logo_medium.gif

Please contribute your knowledge about Nutch here!

Nutch Version 1.3 Administration

Tutorials

  • NutchTutorial - How to configure Nutch 1.3 to crawl in local mode and post to Apache Solr for search/index.

  • Hadoop Tutorial Nutch being based Hadoop, it helps to have a better understanding of Hadoop.

  • Nutch Hadoop Tutorial - How to setup and run Nutch in deploy mode over a Hadoop cluster. /!\ :This tutorial is in development: /!\

  • RunNutchInEclipse - How to configure, build, crawl and debug Nutch 1.3 within Eclipse

Configuration

General Information

Nutch Development

Nutch 2.0

Pre Nutch 1.3 and Archive

How to edit this Wiki

This Wiki is a collaborative site, anyone can contribute and share:

  • Create an account by clicking the "Login" link at the top of any page, and picking a username and password.
  • Edit any page by pressing Edit at the top or the bottom of the page

There are some conventions used on the Nutch wiki:

  • /!\ :TODO: /!\ (/!\ :TODO: /!\ ) is used to denote sections that definitely need to be cleaned up.

Some general info on using this Wiki Software:

FrontPage (last edited 2018-09-27 15:44:39 by RoannelFernandez)