Differences between revisions 228 and 229
Revision 228 as of 2011-09-02 20:13:32
Size: 4854
Revision 229 as of 2011-09-08 20:08:41
Size: 4854
Line 24: Line 24:
 * IndexStructure /!\ :This page needs a slight update to provide more information on plugins and the data they send to Solr for indexing: /!\
Line 56: Line 57:
 * IndexStructure /!\ :This page needs a slight update to provide more information on plugins and the data they send to Solr for indexing: /!\

Welcome to the Apache Nutch Wiki

http://www.interadvertising.co.uk/files/nutch_logo_medium.gif

Please contribute your knowledge about Nutch here!

Nutch Version 1.3 Administration

Tutorials

  • NutchTutorial - How to configure Nutch 1.3 to crawl in local mode and post to Apache Solr for search/index.

  • Hadoop Tutorial - Nutch is based on Hadoop, so it helps to have a good understanding of Hadoop.

  • Nutch Hadoop Tutorial - How to set up and run Nutch in deploy mode over a Hadoop cluster. /!\ :This tutorial is in development: /!\

  • RunNutchInEclipse - How to configure, build, crawl and debug Nutch 1.3 within Eclipse.

Configuration

General Information

Nutch Development

Nutch 2.0

Pre Nutch 1.3 and Archive

How to edit this Wiki

This Wiki is a collaborative site; anyone can contribute and share:

  • Create an account by clicking the "Login" link at the top of any page, and picking a username and password.
  • Edit any page by pressing Edit at the top or the bottom of the page.

There are some conventions used on the Nutch wiki:

  • /!\ :TODO: /!\ is used to denote sections that definitely need to be cleaned up.

Some general info on using this Wiki Software:

FrontPage (last edited 2018-09-27 15:44:39 by RoannelFernandez)