Differences between revisions 258 and 260 (spanning 2 versions)
Revision 258 as of 2013-01-25 02:02:00
Size: 5951
Comment:
Revision 260 as of 2013-03-21 00:04:03
Size: 6093
Editor: 128
Comment:
Deletions are marked like this. Additions are marked like this.
Line 74: Line 74:
 * Nutch2Crawling - A description of the crawling jobs  * Nutch2Crawling - A description of the crawling jobs and field to database mappings.
Line 81: Line 81:
 * [[http:///nlp.solutions.asia/?p=232|Understanding the columns/fields in Nutch 2.0 Webpage]]  * [[NutchConfigurationFiles-2.x]] -- Configuration files that are specific to Nutch-2.x
* [[http:///nlp.solutions.asia/?p=232|Understanding the columns/fields in Nutch 2.0 Webpage - Detailed article]]

Welcome to the Apache Nutch Wiki

http://www.interadvertising.co.uk/files/nutch_logo_medium.gif

Please contribute your knowledge about Nutch here!

Nutch Version Administration

Tutorials

Nutch 1.X tutorial(s)

  • NutchTutorial - How to configure Nutch to crawl in local mode and post to Apache Solr for search/index.

Nutch 2.X tutorial(s)

Other Tutorial(s)

Configuration

General Information

Nutch Development

Nutch 2.x

Pre Nutch 1.3 and Archive

How to edit this Wiki

This Wiki is a collaborative site, anyone can contribute and share:

  • Create an account by clicking the "Login" link at the top of any page, and picking a username and password.
  • Edit any page by pressing Edit at the top or the bottom of the page

There are some conventions used on the Nutch wiki:

  • /!\ :TODO: /!\ (/!\ :TODO: /!\ ) is used to denote sections that definitely need to be cleaned up.

Some general info on using this Wiki Software:

FrontPage (last edited 2018-09-27 15:44:39 by RoannelFernandez)