Differences between revisions 17 and 18
Revision 17 as of 2014-03-14 07:28:59
Size: 8515
Comment:
Revision 18 as of 2014-03-25 19:20:21
Size: 8154
Comment: usual trim of marketing hype on a new packtpub book
Deletions are marked like this. Additions are marked like this.
Line 28: Line 28:
'''Sample Chapter:''' [[http://www.packtpub.com/sites/default/files/9781783285655_Chapter-03.pdf?utm_source=packtpub&utm_medium=free&utm_campaign=pdf|Chapter 3: Detecting System Bottlenecks]]

Optimizing Hadoop for !MapReduce book is an example-based tutorial that deals with Optimizing Hadoop for !MapReduce job performance. This book introduces readers to advanced !MapReduce concepts and teaches them about topics ranging from identifying the factors that affect !MapReduce job performance to tuning the !MapReduce configuration. The book is a guide to utilizing user's cluster’s node resources to run !MapReduce jobs optimally.
'''Sample Chapter:''' [[http://www.packtpub.com/sites/default/files/9781783285655_Chapter-03.pdf|Chapter 3: Detecting System Bottlenecks]]

Optimizing Hadoop for !MapReduce book is an example-based tutorial that deals with Optimizing Hadoop for !MapReduce job performance.

Hadoop Books

These books are listed in order of publication, most recent first. The Apache Software Foundation does not endorse any specific book. The links to Amazon are affiliated with the specific author. That said, we also encourage you to support your local bookshops, by buying the book from any local outlet, especially independent ones.

Books in Print

Here are the books that are currently in print -in order of publishing-, along with the Hadoop version they were written against. One problem anyone writing a book will encounter is that Hadoop is a very fast-moving target, and that things can change fast. Usually this is for the better, when a book says "Hadoop can't" they really mean "the version of Hadoop we worked with couldn't", and that the situation may have improved since then. If you have any query about Hadoop, don't be afraid to ask on the relevant user mailing lists.

Optimizing Hadoop for MapReduce

Name: Optimizing Hadoop for MapReduce

Author: Khaled Tannir

Publisher: Packt Publishing

Date of Publishing: February 21, 2014

Sample Chapter: Chapter 3: Detecting System Bottlenecks

Optimizing Hadoop for MapReduce book is an example-based tutorial that deals with Optimizing Hadoop for MapReduce job performance.

Scaling Big Data with Hadoop and Solr

Name: Scaling Big Data with Hadoop and Solr

Author: Hrishikesh Karambelkar

Publisher: Packt Publishing

Date of Publishing: August 26, 2013

Sample Chapter: Chapter 2: Understanding Solr

Scaling Big Data with Hadoop and Solr is a step-by-step guide to building a search engine while scaling data. Starting with the basics of Apache Hadoop and Solr, this book then dives into advanced topics of optimizing search with some real-world use cases and sample Java code.

Hadoop Operations and Cluster Management Cookbook

Name: Hadoop Operations and Cluster Management Cookbook

Author: Shumin Guo

Hadoop Version: 2.x

Publisher: Packt Publishing

Date of Publishing: July 24, 2013

Sample Chapter: Chapter 3: Configuring a Hadoop Cluster

Hadoop Operations and Cluster Management Cookbook is a guide for designing and managing a Hadoop cluster.

Hadoop Beginner's Guide

Name: Hadoop Beginner's Guide

Author: Garry Turkington

Hadoop Version: 1.0.x

Publisher: Packt Publishing

Date of Publishing: February 22, 2013

Sample Chapter: Chapter 4: Developing MapReduce Programs

Written for complete beginners to Hadoop, covers how to install and run Hadoop on a local Ubuntu host or create an on-demand Hadoop cluster on Amazon Web Services (EC2), before getting to grips with MapReduce.

Hadoop Real World Solutions Cookbook

Name: Hadoop Real World Solutions Cookbook

Author: Jonathan Owens, Brian Femiano, Jon Lentz

Hadoop Version: CDH3

Publisher: Packt Publishing

Date of Publishing: February 7, 2013

Sample Chapter: Chapter 6: Big Data Analysis

Collection of real world code analytics and design patterns using various tools from the Hadoop community. Each recipe walks the reader through the implementation, or in some cases debugging and configuration tuning. The book covers various tools including MapReduce, Hive, Pig, MRUnit, serialization using Avro/Thrift/ProtoBuffs, Giraph, Accumulo and several others.

Hadoop MapReduce Cookbook

Name: Hadoop MapReduce Cookbook

Author: Srinath Perera, Thilina Gunarathne

Hadoop Version: 1.0.x

Publisher: Packt Publishing

Date of Publishing: January 25, 2013

Sample Chapter: Chapter 6: Analytics

Hadoop MapReduce Cookbook is a guide to processing large and complex data sets using Hadoop MapReduce.

Hadoop Operations

Name: Hadoop Operations

Author: Eric Sammers

Hadoop Version: 1.x, CDH3.x

Publisher: O'Reilly Press

Date of Publishing: September 2012.

A guide to running large-scale Hadoop clusters, written by someone who has practical experience in such deployments.

Hadoop in Practice

Name: Hadoop in Practice

Author: Alex Holmes

Hadoop Version: 1.0

Publisher: Manning

Date of Publishing: Fall 2012.

Sample Chapter: Chapter 1

Hadoop: The Definitive Guide, 3rd Edition

Name: Hadoop: The Definitive Guide, 3rd Edition

Author: Tom White

Hadoop Version: 1.x

Publisher: O'Reilly

Date of Publishing: May 2012

Sample Chapter: Sample Chapter

Hadoop in Action

Name: Hadoop in Action

Author: Chuck Lam

Hadoop Version: 0.19-0.20

Publisher: Manning

Date of Publishing: December, 2010

Sample Chapter: Chapter 1

Hadoop in Action introduces the subject and shows how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show Hadoop use in more complex data analysis tasks. Included are best practices and design patterns of MapReduce programming.

Hadoop: The Definitive Guide, 2nd Edition

Name: Hadoop: The Definitive Guide, 2nd Edition

Author: Tom White

Hadoop Version: 0.20-0.21

Publisher: O'Reilly

Date of Publishing: September 2010

Pro Hadoop

Name: Pro Hadoop

Author: Jason Venner

Hadoop Version: 0.20

Publisher: Apress

Date of Publishing: June 22, 2009

Jason says "This book is a step by step guide to writing, running and debugging Map/Reduce jobs using Hadoop, and to installing and managing Hadoop Clusters. It is ideal for training new Map/Reduce users and Cluster administrators and for polishing existing Hadoop skills."

Hadoop: The Definitive Guide

Name: Hadoop: The Definitive Guide

Author: Tom White

Hadoop Version: 0.20

Publisher: O'Reilly

Date of Publishing: June 19, 2009

Forthcoming Books

Books (last edited 2014-03-25 19:20:21 by SteveLoughran)