ErrorMessages

Error messages, reasons and solutions

Please feel free to add error messages, reasons and solutions!

Please report bugs to the mailing list!

FileNotFoundException: 1

delay 1 fails crawltest and subdirectories are created; also ant compiles no probs; ROOT.war is installed and runs; urls file exists. Adding ./ or full path as x below changes nothing. Server runs squid on 80 and real Apache 1.3 on 81. Catalina is on 8080 and is up and running.

crawl test exists

ls -R crawl.test/ crawl.test/: . .. db

crawl.test/db: . .. dbreadlock dbwritelock webdb

crawl.test/db/webdb: . .. linksByMD5 linksByURL pagesByMD5 pagesByURL

crawl.test/db/webdb/linksByMD5: . .. data index

crawl.test/db/webdb/linksByURL: . .. data index

crawl.test/db/webdb/pagesByMD5: . .. data index

crawl.test/db/webdb/pagesByURL: . .. data index


It always fails with above error, while omitting the delay tag seems to work :\ ... I tried putting the -delay tag at several places above, it always fails

nutch 0.7 Apache Tomcat/5.0.19 jdsk 1.4.2-b28 Sun Microsystems Inc. Linux (Suse 8.2 1.5 years old but updated) Linux Kernel 2.4.21 i386

Well its working without the delay tag but I can't release it on other sites with no delay tag. What am I doing wrong?

Errors Fetching

Why do I get error "123456 104934 fetch of [WWW] http://mydomain/index.html failed with: net.nutch.net.protocols.http.HttpError: HTTP Error: 401" when crawling?

/etc/host.conf: line 1: cannot specify more then 4 services

Errors Updating

Until updating my DB I got a OutOfMemoryException or a 'to many files open' error.

Errors Searching

Tomcat reports root cause: java.lang.OutOfMemoryError and does not find anything.

Errors installing

See GettingNutchRunningWithUbuntu for some help.

Nutch on Debian (cont)

What is mentioned here

[WWW] http://nutch.sourceforge.net/cgi-bin/twiki/view/Main/GettingNutchRunningOnDebian

java.lang.NoClassDefFoundError: org/apache/coyote/http11/Http11Processor$1

can be avoided with permission java.io.FilePermission "*", "read,write,execute,delete";

pityfully the cache anchor option doesn't work still

java.security.AccessControlException: access denied (java.util.PropertyPermission * read,write)

this happens independent of putting

permission java.io.FilePermission "*", "read,write,execute,delete";

in

/etc/tomcat4/policy.d/04webapps.policy

:(

so if you are then entirely fed up trying to find what's up ... because bad stack trace + idiotic and unpenetrable security settings are selfdefeating..

you enter permission java.security.AllPermission;

in /etc/tomcat4/policy.d/04webapps.policy

and the thing works ... (but I am not even contemplating what security holes I have opened here :|)

Setup on a SUSE 8.1 system was no problem btw ...

last edited 2005-11-19 13:10:27 by NeilMcAndrew