Hadoop: The Definitive Guide is a book by Tom White, published by O’Reilly Media, Inc.
- These notes are based on the 4th edition (2015), as read on Safari Books Online [https://www.safaribooksonline.com/library/view/hadoop-the-definitive/9781491901687/]
Table of Contents
Hadoop Fundamentals
- Meet Hadoop
- MapReduce
- The Hadoop Distributed File System
- YARN
- Hadoop I/O
Hadoop MapReduce
- Developing a MapReduce Application
- Writing a Unit Test with MRUnit
- Running Locally on Test Data
- Hadoop Pseudo-Distributed Mode
- MapReduce Workflows: Apache Oozie
- How MapReduce Works (see YARN)
- MapReduce Types and Formats
- MapReduce Features
Hadoop Operations
- Setting Up a Hadoop Cluster
- Administering Hadoop
Related Projects
Case Studies
- Composable Data at Cerner
- Biological Data Science: Saving Lives with Software
- Cascading