Chapter 1. Getting Started with Apache Hadoop
Apache Hadoop is a widely used open-source distributed computing framework for
efficiently processing large volumes of data on large clusters of inexpensive
commodity computers. In this chapter, we will learn more about Apache Hadoop by
covering the following topics:
• History of Apache Hadoop and its trends
• Components of Apache Hadoop
• Understanding the Apache Hadoop daemons
• Introducing Cloudera
• What is CDH?
• Responsibilities of a Hadoop administrator