Apache Hadoop Hackathon, May 18, 2011

Hosted at Cloudera's San Francisco and Palo Alto offices.

This page is aliased at: http://bit.ly/hadoop-hack-may18

Useful resources

Quick Start

Checking out Hadoop:
Git:

mkdir hadoop-git ; cd hadoop-git

git clone https://github.com/apache/hadoop-common.git
git clone https://github.com/apache/hadoop-hdfs.git
git clone https://github.com/apache/hadoop-mapreduce.git

(or if we fix ssh: #git clone git://git.apache.org/hadoop-common.git
#git clone git://git.apache.org/hadoop-mapreduce.git
#git clone git://git.apache.org/hadoop-hdfs.git
)

svn:

mkdir hadoop-svn ; cd hadoop-svn
svn co https://svn.apache.org/repos/asf/hadoop/common/trunk
svn co https://svn.apache.org/repos/asf/hadoop/mapreduce/trunk
svn co https://svn.apache.org/repos/asf/hadoop/hdfs/trunk
(for trunk -- for branches, use /repos/asf/hadoop/common/branches/branch-0.22 )

Running tests

ant test-core -Dtest.output=yes -Dtestcase=TestEditLog

test.output will print output to console, useful for hanging tests

Eclipse: see EclipseEnvironment

Submitting a patch

Open a jira
Make change
Run tests
git diff --no-prefix > /tmp/HADOOP-1234.txt

Review queues

Suggestions for what to work on

Infrastructure improvements

Make it easier for others to contribute

Help get 0.22 out the door

Try to use the release (or build from trunk)

Help get trunk in shape