Figure 2-5. The Task Tracker user interface
Now that you have tasted the flavor of Hadoop V1, shut it down and get ready to install Hadoop V2.
Hadoop V2 Installation
For Hadoop V2, you will download and use the Cloudera stack. Specifically, you will install CDH 4,
because it is available for both 32-bit and 64-bit machines and it supports YARN. I have chosen to install the latest
manual CDH release available at the time of this writing.
In this section, you will not only learn how to obtain and install the Cloudera Hadoop packages; you'll also
find out how to install, run, and use ZooKeeper, as well as how to configure Hadoop V2. You'll tour the necessary
directories on the Linux file system and HDFS; lastly, you'll find out how to control the servers as Linux services.
To begin, you need to add a CDH repository file on every machine. As root, create a file named
cloudera-cdh4.repo under /etc/yum.repos.d/ on each server, with the following contents:
[root@hc1r1m1 yum.repos.d]# cat cloudera-cdh4.repo
[cloudera-cdh4]
name=Cloudera's Distribution for Hadoop, Version 4
baseurl=http://archive.cloudera.com/cdh4/redhat/6/x86_64/cdh/4/
gpgkey=http://archive.cloudera.com/cdh4/redhat/6/x86_64/cdh/RPM-GPG-KEY-cloudera
gpgcheck=1
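
If you would rather script this step than type the file by hand on every server, the following is a minimal sketch of one way to do it. The function name write_cdh4_repo and its optional directory argument are my own illustrative choices, not part of the Cloudera packages. The repository settings are the ones listed above, with the spacing around "=" normalized.

```shell
#!/bin/sh
# Sketch: write the CDH 4 repository definition into a yum repo directory.
# write_cdh4_repo is a hypothetical helper name used only for illustration.
# With no argument it targets /etc/yum.repos.d (which requires root);
# pass another directory as the first argument to try it without root.
write_cdh4_repo() {
    repo_dir="${1:-/etc/yum.repos.d}"
    # The quoted 'EOF' delimiter keeps the shell from expanding anything
    # inside the here-document, so the contents are written verbatim.
    cat > "${repo_dir}/cloudera-cdh4.repo" <<'EOF'
[cloudera-cdh4]
name=Cloudera's Distribution for Hadoop, Version 4
baseurl=http://archive.cloudera.com/cdh4/redhat/6/x86_64/cdh/4/
gpgkey=http://archive.cloudera.com/cdh4/redhat/6/x86_64/cdh/RPM-GPG-KEY-cloudera
gpgcheck=1
EOF
}
```

Calling write_cdh4_repo as root on each server should produce the same file as shown above. Running yum repolist afterward is one way to check that yum has picked up the new repository.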