Sunday, April 26, 2015

Five Best Sources for learning Hadoop

Apart from Apache community,I would recommend the below sources/books for learning Hadoop:

1. Data Intensive Text Processing : This is available free of cost at this link

https://lintool.github.io/MapReduceAlgorithms/MapReduce-book-final.pdf

The book focuses on the theoritical aspects of Map reduce rather than programmatical aspect. It has algorithmic approach and dwelves into the various paradigms of Map reduce. This is a must book to start with Map Reduce. A 175 page book but serves as an excellent source to understand the roots of Map Reduce.

2. Yahoo Developer Website:

These are excellent tutorials that focus on the basics of hadoop. Very simple and concise these are a must for beginners.
https://developer.yahoo.com/hadoop/tutorial/

There is an option of downloading the tutorials for off line study.

3. Hadoop A Definitive Guide

Author :Tom White

This book doesnt touch the bare bones of Hadoop but emphasises on programming and vividly focuses on the echo system. The author takes up complex examples to explore the various nuances of Hadoop. Hence this is best studied once we are thorough with the Yahoo Developer tutorials.

4. Hadoop in action

The book contains several Hadoop examples in a problem solution format. However the book assumes some basic knowledge in Hadoop and Map Reduce. Also all the examples are Java based and hence readers need to be thorough in basic Java. But its an excellent source to explore Hadoop.

5. Kaggle.com

Want to be a champ? Then this site is an ultimate source to compete in Hadoop. Its  challenging and would generate profound interest in Big Data.

No comments:

Post a Comment