Requirement

Steps

<property>
  <name>http.agent.name</name>
  <value>Value_name</value>
  <description>HTTP 'User-Agent' request header. MUST NOT be empty -
  please set this to a single word uniquely related to your organization.

  NOTE: You should also check other related properties:

    http.robots.agents
    http.agent.description
    http.agent.url
    http.agent.email
    http.agent.version

  and set their values appropriately.

  </description>
</property>
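
As a minimal sketch, assuming the property above is placed in conf/nutch-site.xml (the usual location for local overrides of the Nutch defaults), the complete file might look like the following. The agent name MyOrgCrawler is a hypothetical placeholder, not a value from this page; replace it with a single word tied to your own organization.

```xml
<?xml version="1.0"?>
<!-- conf/nutch-site.xml: local overrides (assumed location) -->
<configuration>
  <property>
    <name>http.agent.name</name>
    <!-- Hypothetical value; must not be left empty -->
    <value>MyOrgCrawler</value>
  </property>
</configuration>
```

With a non-empty agent name set, the parsechecker command below should be able to fetch and parse the page.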

./bin/nutch parsechecker -dumpText http://www.jpl.nasa.gov > jpl_out.txt

QuickStartparseChecker (last edited 2014-09-25 16:58:09 by 162)