Installing CDH
Now that we have a good background in CDH and its components, let's go ahead and install
CDH on a cluster. The remainder of this chapter is hands-on and covers everything
needed to get a fully functional cluster that runs CDH5.
CDH can be installed using one of the following two methods:
• Installation using the operating system's package manager (yum/rpm)
• Installation using Cloudera Manager
In this section, we will cover installation using the operating system's package manager.
Installation of CDH using Cloudera Manager will be covered in Chapter 5, Using Cloudera
Manager.
For the installation, we will be using four servers that run CentOS 6.4 (64-bit) as the
operating system.
The following diagram shows a simple four-node Hadoop cluster that runs MapReduce
Version 1 (MRv1):
By following the installation instructions below, we will configure our cluster to
look like the one shown in the preceding diagram:
1. The first step is to make sure that all our servers are running Oracle Java Version 7.
To install Oracle Java, you can download Oracle JDK from Oracle's website. The
Oracle Java installation file that I have downloaded is jdk-7u51-linux-x64.rpm.
After downloading the file, run the following commands as the
root user:
$ chmod +x jdk-7u51-linux-x64.rpm
$ rpm -ivh jdk-7u51-linux-x64.rpm
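Once the package is installed, it can be worth confirming on each server that the JVM found on the PATH really is a Java 7 release. The following is a minimal sketch of such a check, not part of the CDH installation itself; the helper name is_java7 is hypothetical, and the script assumes that the java binary is reachable through the PATH and reports its version in the usual java version "1.7.0_NN" form:

```shell
#!/bin/sh
# Sketch: check that the installed JVM reports a 1.7.x version.
# is_java7 is a hypothetical helper name, not part of the JDK or CDH.
is_java7() {
    # $1: the text printed by `java -version`
    printf '%s\n' "$1" | grep -q 'version "1\.7\.'
}

if command -v java >/dev/null 2>&1; then
    # `java -version` writes to stderr, so redirect it to stdout
    if is_java7 "$(java -version 2>&1)"; then
        echo "Java 7 detected"
    else
        echo "Java is installed, but it is not version 7"
    fi
else
    echo "java not found on PATH"
fi
```

Running the script on each of the four servers gives a quick way to spot a node that was missed or that still picks up a different Java installation.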