Exploring HDFS Federation and Its High Availability - Cloudera Administration

Configuring jobtracker high availability

As the user hduser, use the following steps to configure an HA jobtracker for your cluster:

1. Stop all the tasktrackers by executing the following command on all the nodes that host tasktrackers:

$ sudo service hadoop-0.20-mapreduce-tasktracker stop

2. Stop the jobtracker by executing the following command on the node that hosts the jobtracker:

$ sudo service hadoop-0.20-mapreduce-jobtracker stop

3. Remove the installed jobtracker using the following command from node1.hcluster:

$ sudo yum remove hadoop-0.20-mapreduce-jobtracker

4. Install the following HA jobtracker package on two independent nodes, which in our case are node1.hcluster and node2.hcluster:

$ sudo yum install hadoop-0.20-mapreduce-jobtrackerha

5. If you intend to use automatic failover for the jobtracker, install the ZooKeeper failover controller using the following command on node1.hcluster and node2.hcluster:

$ sudo yum install hadoop-0.20-mapreduce-zkfc

6. Update the mapred-site.xml file to include the following properties and copy it to all the nodes in the cluster:

<property>
  <name>mapred.job.tracker</name>
  <value>myjobtracker</value>
</property>
<property>
  <name>mapred.jobtrackers.myjobtracker</name>
</property>
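The "copy it to all the nodes" part of step 6 could be sketched as a small shell loop. This is only an illustration under several assumptions that do not come from the text above: the node list, the configuration path /etc/hadoop/conf/mapred-site.xml, passwordless SSH for hduser, and write access to the target directory. The function name distribute_config is hypothetical. By default the sketch only prints the commands it would run.

```shell
#!/bin/sh
# Sketch: push the updated mapred-site.xml to every node in the cluster.
# Assumed values (adjust for your cluster): node names, config path,
# and passwordless SSH access for the user hduser.

NODES="${NODES:-node1.hcluster node2.hcluster}"
CONF="${CONF:-/etc/hadoop/conf/mapred-site.xml}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands; 0 = really copy

distribute_config() {
    for node in $NODES; do
        if [ "$DRY_RUN" = "1" ]; then
            # Show what would be executed without touching any node.
            echo "scp $CONF hduser@$node:$CONF"
        else
            # Stop at the first node that fails so the error is visible.
            scp "$CONF" "hduser@$node:$CONF" || return 1
        fi
    done
}

distribute_config
```

Running the script as is lists one scp command per node. Setting DRY_RUN=0 and extending NODES to cover every host performs the actual copy.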
