Differences between revisions 10 and 11
Revision 10 as of 2013-03-10 17:30:23
Size: 5080
Comment: Rip back the marketing blurb on something clearly over the top.
Revision 11 as of 2013-03-10 17:38:57
Size: 5608
Comment: add commented out section telling editors to not go overboard on marketing hype
Deletions are marked like this. Additions are marked like this.
Line 8: Line 8:


{{{#!wiki comment/dotted
Attention people adding new entries.
# Please include publishing date and version of Hadoop the book is relevant to.
# Please write this in a neutral voice, not "this book will help you", as that implies that the ASF has
opinions on the matter. Someone will just edit the claims out.
# Please do not go overboard in exaggerating the outcome of reading a book, "readers of this book will become experts in advanced production-scale Hadoop MapReduce jobs". Such claims will be edited out.
}}}

Hadoop Books

These books are listed in order of publication, most recent first. The Apache Software Foundation does not endorse any specific book. The links to Amazon are affiliated with the specific author. That said, we also encourage you to support your local bookshops, by buying the book from any local outlet, especially independent ones.

Books in Print

Here are the books that are currently in print -in order of publishing-, along with the Hadoop version they were written against. One problem anyone writing a book will encounter is that Hadoop is a very fast-moving target, and that things can change fast. Usually this is for the better, when a book says "Hadoop can't" they really mean "the version of Hadoop we worked with couldn't", and that the situation may have improved since then. If you have any query about Hadoop, don't be afraid to ask on the relevant user mailing lists.

Hadoop Beginner's Guide

Name: Hadoop Beginner's Guide

Author: Garry Turkington

Hadoop Version: 1.0.x

Publisher: Packt Publishing

Date of Publishing: February 22, 2013

Sample Chapter: Chapter 4: Developing MapReduce Programs

Written for complete beginners to Hadoop, covers how to install and run Hadoop on a local Ubuntu host or create an on-demand Hadoop cluster on Amazon Web Services (EC2), before getting to grips with MapReduce.

Hadoop Real World Solutions Cookbook

Name: Hadoop Real World Solutions Cookbook

Author: Jonathan Owens, Brian Femiano, Jon Lentz

Hadoop Version: CDH3

Publisher: Packt Publishing

Date of Publishing: February 7, 2013

Sample Chapter: Chapter 6: Big Data Analysis

Collection of real world code analytics and design patterns using various tools from the Hadoop community. Each recipe walks the reader through the implementation, or in some cases debugging and configuration tuning. The book covers various tools including MapReduce, Hive, Pig, MRUnit, serialization using Avro/Thrift/ProtoBuffs, Giraph, Accumulo and several others.

Hadoop MapReduce Cookbook

Name: Hadoop MapReduce Cookbook

Author: Srinath Perera, Thilina Gunarathne

Hadoop Version: 1.0.x

Publisher: Packt Publishing

Date of Publishing: January 25, 2013

Sample Chapter: Chapter 6: Analytics

Hadoop MapReduce Cookbook is a one-stop guide to processing large and complex data sets using the Hadoop ecosystem. The book introduces simple examples and then dives deep to solve in-depth big data use cases.

Hadoop in Practice

Name: Hadoop in Practice

Author: Alex Holmes

Hadoop Version: 1.0

Publisher: Manning

Date of Publishing: Fall 2012.

Sample Chapter: Chapter 1

Hadoop in Action

Name: Hadoop in Action

Author: Chuck Lam

Hadoop Version: 0.19-0.20

Publisher: Manning

Date of Publishing: December, 2010

Sample Chapter: Chapter 1

Hadoop in Action introduces the subject and teaches you how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show Hadoop use in more complex data analysis tasks. Included are best practices and design patterns of MapReduce programming.

Hadoop: The Definitive Guide, 2nd Edition

Name: Hadoop: The Definitive Guide, 2nd Edition

Author: Tom White

Hadoop Version: 0.20-0.21

Publisher: O'Reilly

Date of Publishing: September 2010

Pro Hadoop

Name: Pro Hadoop

Author: Jason Venner

Hadoop Version: 0.20

Publisher: Apress

Date of Publishing: June 22, 2009

Jason says "This book is a step by step guide to writing, running and debugging Map/Reduce jobs using Hadoop, and to installing and managing Hadoop Clusters. It is ideal for training new Map/Reduce users and Cluster administrators and for polishing existing Hadoop skills."

Hadoop: The Definitive Guide

Name: Hadoop: The Definitive Guide

Author: Tom White

Hadoop Version: 0.20

Publisher: O'Reilly

Date of Publishing: June 19, 2009

Forthcoming Books

Books (last edited 2013-03-10 17:38:57 by SteveLoughran)