Installing Apache ZooKeeper
To install Apache ZooKeeper, log in as hduser and execute the following command:
$ sudo yum install zookeeper-server
You can configure Apache ZooKeeper using the configuration files present under /etc/zookeeper/conf.
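To check which settings are in effect before editing them, you can read individual keys from the configuration file. The following is a minimal sketch, assuming the main file is zoo.cfg in that directory and uses the usual key=value format. The helper name zk_conf_get is hypothetical and is not part of ZooKeeper or CDH.

```shell
#!/bin/sh
# zk_conf_get KEY [FILE]
# Hypothetical helper: prints the value of KEY from a key=value config file.
# FILE defaults to /etc/zookeeper/conf/zoo.cfg (assumed location of the main config).
zk_conf_get() {
    key="$1"
    file="${2:-/etc/zookeeper/conf/zoo.cfg}"
    # Comment lines start with "#" and therefore never match the key pattern.
    # If the key appears more than once, the last occurrence is printed.
    sed -n "s/^[[:space:]]*${key}[[:space:]]*=[[:space:]]*//p" "$file" | tail -n 1
}

# Example use: print two common settings if the file is readable.
if [ -r /etc/zookeeper/conf/zoo.cfg ]; then
    echo "clientPort: $(zk_conf_get clientPort)"
    echo "dataDir:    $(zk_conf_get dataDir)"
fi
```

The helper treats the key as a basic regular expression, so it is best suited to simple keys such as clientPort, dataDir, or tickTime.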
With these components installed, you are now ready to use the cluster for data processing.
You could use Flume to ingest streaming data from external sources into HDFS, Sqoop or
Sqoop 2 to import data from external databases, Pig and Hive to write scripts and queries, and
Apache Oozie to schedule them as required.
Several other CDH components can be installed alongside the components covered so far.
We will skip them here and see how to install them when we work through
Cloudera Manager in Chapter 5, Using Cloudera Manager.