Hadoop is a framework for running applications on large clusters built of commodity hardware. The Hadoop framework transparently provides applications both reliability and data motion. Hadoop implements a computational paradigm named Map/Reduce, where the application is divided into many small fragments of work, each of which may be executed or reexecuted on any node in the cluster. In addition, it provides a distributed file system (HDFS) that stores data on the compute nodes, providing very high aggregate bandwidth across the cluster. Both Map/Reduce and the distributed file system are designed so that node failures are automatically handled by the framework.
General Information
Overview of Hadoop
Official Hadoop Core Website : download, bug-tracking, mailing-lists, etc. Presentations and articles about Hadoop
PoweredBy, a list of sites and applications powered by Hadoop
Support
User Documentation
GettingStartedWithHadoop (lots of details and explanation)
QuickStart (for those who just want it to work now)
Running Hadoop On Ubuntu Linux (Single-Node Cluster) (tutorial on installing, configuring and running Hadoop on a single machine)
HowToConfigure Hadoop software
Command Line Options for hadoop shell script.
[TroubleShooting] What do when things go wrong
Map/Reduce
Examples
Amazon
Benchmarks
Contrib
HadoopStreaming (Useful for using Hadoop with other programming languages)
Hbase, a Bigtable-like structured storage system for Hadoop HDFS
DistributedLucene, a Proposal for a distributed Lucene index in Hadoop
MountableHDFS, Fuse-DFS & other Tools to mount HDFS as a standard filesystem on Linux (and some other Unix OSs)
Developer Documentation
Related Resources
Nutch Hadoop Tutorial (Useful for understanding Hadoop in an application context)
IBM MapReduce Tools for Eclipse (An Eclipse plug-in that simplifies the creation and deployment of MapReduce programs) Hadoop IRC channel is #hadoop at irc.freenode.net.
Using Spring and Hadoop (Discussion of possibilities to use Hadoop and Dependency Injection with Spring)
Heart, a Planet-Scale RDF Data Store and a Distributed Processing Engine
Hama, a Parallel Matrix Computational Package based on Hadoop Map/Reduce