Presentations about Hadoop
This is a list of presentations about Hadoop, by event and paper (newest first):
Public Presentations
A lot of these presentations are at local user groups. If there is not one in your area, start one! Take one of the existing talks and give it! Don't be afraid. The only thing to fear is trying to do live demos of MapReduce against a remote cluster. Most presenters avoid this.
DevHouse Berlin, October 2009
Larges scale data processing with Hadoop (Isabel Drost, Apache Mahout)
Lambda Lounge (St. Louis, USA), October 2009
Hadoop In 45 Minutes or Less (Tom Wheeler, OCI)
Hadoop World, October 2009
Security and Backward Compatibility (Owen O'Malley, Yahoo!)
Next Steps (Avro) (Doug Cutting, Cloudera)
Sunnyvale Hadoop User Group, September 2009
Moving to the new Map/Reduce API ( Owen O'Malley, Yahoo!)
Apache Hadoop Get Together, September 2009
Solving Puzzles with Map Reduce (Thorsten Schütt, ZIB)
An introduction to JAQL ( Thilo Götz, IBM)
Lucene 2.9 Developments (Uwe Schindler, Apache Lucene)
Bristol Hadoop Workshop, August 2009
The Bristol Hadoop Workshop was a small meeting; these presentations were intended to start discussion and thought
Hadoop Futures (Tom White, Cloudera)
Hadoop and High-Energy Physics (Simon Metson, Bristol University)
HDFS (Johan Oskarsson, Last.fm)
Graphs Paolo Castagna, HP
Long Haul Hadoop (Steve Loughran, HP)
Benchmarking Hadoop (Steve Loughran & Julio Guijarro, HP)
FrOSCon Sankt Augustin, August 2009
From data to information - An overview of the Hadoop ecosystem with a close-up on Mahout (Isabel Drost, Apache Mahout, video (starts with a "Hello FrOSCon visitors" round)
Hadoop Technical Discussion Presented at Machine Learning group TU Berlin, July 2009
Apache Hadoop - Large scale data processing (Isabel Drost, Apache Mahout)
Hadoop guest lecture at Beuth Hochschule Berlin, July 2009
Apache Hadoop - Large scale data processing (Isabel Drost, Apache Mahout)
Apache Hadoop Get Together Berlin, June 2009
Protocol Buffers vs. Apache Thrift (Thorsten Curdt, slides available from speaker)
Lucene for Life Science Knowledge Discovery (Dr. Christoph M. Friedrich, Fraunhofer SCAI)
Usenix, June 2009
Usenix is one of the big computing talks. The fact that Hadoop is now a subject of discussion is a measure of its success
Hadoop Cluster Management (Marco Nicosia, USENIX, June 2009)
Hadoop Summit, June 2009
This was the west coast summit, hosted by Yahoo!
Hadoop Sort Benchmarks 2009 (Arun C. Murthy and Owen O'Malley, Hadoop Summit, June 2009)
HUG UK, April 2009
London meeting of the UK Hadoop Users Group
Practical MapReduce (Tom White, Cloudera)
Introducing Apache Mahout (Isabel Drost, ASF)
The Terrier Project (Iadh Ounisand Craig Macdonald, University of Glasgow)
Apache HBase (Michael Stack, Powerset)
Having Fun with PageRank and MapReduce (Paolo Castagna, HP)
HADOOP-1722 and Typed Bytes (Klaas Bosteels, Last.fm)
Hypercubes in HBase - (Fredrik Möllerstrand, Last.fm)
Scalable reasoning on RDF documents with Hadoop and HBase (Michele Catasta, Last.fm)
Apache Hadoop Get Together Berlin, March 2009
HBase (Lars George, Worldlingo)
CouchDB in 20 minutes (Jan Lehnardt, couch.io)
ApacheCon EU 2009, March 2009
The main ASF get-together in Europe
Tuning and Debugging Hadoop Map-Reduce (Arun C. Murthy, ApacheCon EU, March 2009)
Running Hadoop in the Cloud (Tom White, ApacheCon EU, March 2009)
Hadoop 24/7 (Allen Wittenauer, ApacheCon EU, March 2009)
Dynamic Hadoop Clusters (Steve Loughran, ApacheCon EU, March 2009)
Application Architecture For The Cloud (Steve Loughran, ApacheCon EU, March 2009)
Apache Hadoop Get Together, December 2008
BI over Text on the Cloud (Alexander Löser, DIMA TU Berlin)
- Katta slides available from Stefan Groschupf.
ApacheCon 2008, November 2008
The annual Apache US conference
Introduction to Hadoop (Owen O'Malley, ApacheCon, Nov 2008)
Hadoop Usage at Facebook (Dhruba Borthakur, ApacheCon, Nov 2008)
Hadoop Technical Discussion Presented by Rapleaf, October 2008
Hadoop Map-Reduce: Tuning and Debugging Presentation to Hadoop Technical Discussion - Presented by Rapleaf, San Francisco, California, (Arun C. Murthy, October 2008)
East Bay Innovation Group, October 2008
Introduction to Hadoop Hadoop presentation to East Bay Innovation Group, Oakland, California, (Owen O'Malley, October 2008)
NY Hadoop User Group, October 2008
Hadoop Namenode High Availability, NY Hadoop User Group Meeting, New York, August 2008 (Paul George, ContextWeb)
Apache Hadoop Get Together Berlin, September 2008
Hadoop/ HBase as Webstore (Rasmus Hahn, neofonie)
UIMA on Hadoop (Marc Hofer, DIMA TU Berlin)
HUG UK Meeting, August 2008
Presentations from the Hadoop User Group UK Meeting, London, August 2008
Hadoop overview (Doug Cutting)
Hadoop Web Services on Amazon S3/EC2 Tom White
Hadoop usage at Last.fm (Martin Dittus)
Distributed Lucene for Hadoop (Mark Butler)
Dumbo: Hadoop streaming made elegant and easy (Klaas Bosteels)
Deploying Apache Hadoop with Smartfrog (Steve Loughran and Julio Guijarro)
Hadoop at Last.fm: Radio Log Analysis for A/B Tests (Elias Pampalk)
PostgreSQL to HBase Replication (Tim Sell)
Hadoop: Lessons learned at Last.fm (Johan Oskarsson)
Apache Hadoop Get Together Berlin, June 2008
Crawling the DNS (Gert Pfeifer, TU Dresden)
Hadoop @ Semgine - Einsatz im NLP Umfeld (Sascha Kohlmann, Semgine)
Mahout (Isabel Drost, Apache Mahout)
How we use Apache Pig (Stefan Groschupf, 101tec)
IBM Almaden Research, June 2008
Hadoop Distributed File System (HDFS) Architecture and Design (Dhruba Borthakur, IBM Almaden Research, June 2008)
ApacheCon EU 2008, April 2008
Deploying Grid Services Using Hadoop (Allen Wittenauer, ApacheCon EU, April 2008)
A Tour of Apache Hadoop (Tom White, ApacheCon EU, April 2008)
Programming with Hadoops Map Reduce (Owen O'Malley, ApacheCon EU, April 2008)
SPA 2008, March 2008
Understanding MapReduce with Hadoop (Tom White, SPA 2008, March 2008, feedback, answers)
Mailtrust Tech Talk, February 2008
MapReduce vs SQL (Stu Hood, Mailtrust Tech Talk, February 2008)
Older talks
Presentation to the RAD Lab at Berkeley (Owen O'Malley and Eric Baldeschwieler, October 2007)
Meet Hadoop (part 1, part 2) (Doug Cutting and Eric Baldeschwieler, OSCON, July 25 2007)
Hadoop Distributed File System (HDFS) (Dhruba Borthakur, June 2007)
Introduction To Hadoop (Owen O'Malley, May 2007)
Hadoop Map/Reduce Architecture (Owen O'Malley, July 2006)
Scalable Computing with Hadoop (Doug Cutting, May 2006)
Teaching
Here are some courses that have used Hadoop to teach distributed computing (newest first):
Presentations in other languages:
MapReduce & Apache Hadoop (Turkish) (Enis Söztutar, 1. Ulusal Yüksek Başarım ve Grid Konferansı, 04/2009)
Hadoop et MapReduce : traitement distribué à l’échelle du web (Jean-Daniel Cryans, École de technologie supérieure de Montréal, Juillet 2008)