Database Reference
In-Depth Information
directory
"/etc/test_application/conf"
ddo
action
:create
end
ZooKeeper
License
Apache License, Version 2.0
Activity
Medium
Purpose
Coordination
Official Page
Hadoop Integration API Compatible
Hadoop and HDFS are effective tools for distributing work across many machines, but some-
times you need to quickly share little bits of information between a number of simultan-
eously running processes. ZooKeeper is built for exactly this sort of need: it's an effective
mechanism for storing and sharing small amounts of state and configuration data across
many machines.
For example, let's say you have a job that takes information from a large number of small
files, transforms that data, and puts the information into a database.
You could store the information in a file on a fileshare or in HDFS, but accessing that in-
formation from many machines can be very slow and attempting to update the information
can be difficult due to synchronization issues.
A slightly better approach would be to move the connection information into a MapReduce
job configuration file. Even then, you would need to update a file for every analytic every