Database Reference
In-Depth Information
Configuring jobtracker high availability
Use the following steps from user
hduser
to configure and HA jobtracker for your
cluster:
1. Stop all the tasktrackers by executing the following command on all the nodes that
host tasktrackers:
$ sudo service hadoop-0.20-mapreduce-tasktracker stop
2. Stop the jobtracker by executing the following command on the node that hosts the
jobtracker:
$ sudo service hadoop-0.20-mapreduce-jobtracker stop
3. Remove the installed jobtracker using the following command from
node1.hcluster
:
$ sudo yum remove hadoop-0.20-mapreduce-jobtracker
4. Install the following HA jobtracker package on two independent nodes, which in
our case would be
node1.hcluster
and
node2.hcluster
:
$ sudo yum install hadoop-0.20-mapreduce-jobtrackerha
5. If you intend to use automatic failover for the jobtracker, install the ZooKeeper
failover controller using the following command on
node1.hcluster
and
node2.hcluster
:
$ sudo yum install hadoop-0.20-mapreduce-zkfc
6. Update the
mapred-site.xml
file to include the following properties and copy
it to all the nodes in the cluster:
<property>
<name>mapred.job.tracker</name>
<value>myjobtracker</value>
</property>
<property>
<name>mapred.jobtrackers.myjobtracker</name>
<value>jt1,jt2</value>
</property>