Database Reference
In-Depth Information
Configuring jobtracker high availability
Use the following steps from user hduser to configure and HA jobtracker for your
cluster:
1. Stop all the tasktrackers by executing the following command on all the nodes that
host tasktrackers:
$ sudo service hadoop-0.20-mapreduce-tasktracker stop
2. Stop the jobtracker by executing the following command on the node that hosts the
jobtracker:
$ sudo service hadoop-0.20-mapreduce-jobtracker stop
3. Remove the installed jobtracker using the following command from
node1.hcluster :
$ sudo yum remove hadoop-0.20-mapreduce-jobtracker
4. Install the following HA jobtracker package on two independent nodes, which in
our case would be node1.hcluster and node2.hcluster :
$ sudo yum install hadoop-0.20-mapreduce-jobtrackerha
5. If you intend to use automatic failover for the jobtracker, install the ZooKeeper
failover controller using the following command on node1.hcluster and
node2.hcluster :
$ sudo yum install hadoop-0.20-mapreduce-zkfc
6. Update the mapred-site.xml file to include the following properties and copy
it to all the nodes in the cluster:
<property>
<name>mapred.job.tracker</name>
<value>myjobtracker</value>
</property>
<property>
<name>mapred.jobtrackers.myjobtracker</name>
<value>jt1,jt2</value>
</property>
Search WWH ::




Custom Search