Database Reference
In-Depth Information
Streaming systems like Storm, Spark Streaming, or Samza make it possible to run real-
time, distributed computations on unbounded streams of data and emit results to Ha-
doop storage or external systems.
Search
The Solr search platform can run on a Hadoop cluster, indexing documents as they are
added to HDFS, and serving search queries from indexes stored in HDFS.
Despite the emergence of different processing frameworks on Hadoop, MapReduce still
has a place for batch processing, and it is useful to understand how it works since it intro-
duces several concepts that apply more generally (like the idea of input formats, or how a
dataset is split into pieces).
Search WWH ::




Custom Search