Databases Reference
In-Depth Information
If you are prompted to accept the authenticity of localhost, answer
yes
to the prompt.
If you are logged in successfully without having to enter your password, you are ready to proceed.
Otherwise, you will need to run the following commands to set up key-based authentication,
without the need of a password:
$ ssh-keygen -t rsa -P '' -f ~/.ssh/id_rsa
$ cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
Hadoop can be run on a single-node in pseudo-distributed mode. In the pseudo-distributed mode,
each Hadoop daemon runs as a separate Java process.
Here are the essential installation steps:
1.
Edit
conf/core-site.xml
by replacing the empty
<configuration></configuration>
tags with:
<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>
(This confi gures the HDFS daemon.)
2.
Edit
conf/hdfs-site.xml
by replacing the empty
<configuration></configuration>
tags with:
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
(This confi gures the replication factor. A replication factor of 1 means no replication. You
need more than a single node to have a higher replication factor.)
3.
Edit
conf/mapred-site.xml
by replacing the empty
<configuration></configuration>
tags with:
<configuration>
<property>
<name>mapred.job.tracker</name>
<value>localhost:9001</value>
</property>
</configuration>
(This confi gures the MapReduce daemon.)
4.
Test the pseudo-distributed setup by formatting the Hadoop Distributed File System
(HDFS) on this single system:
bin/hadoop namenode -format