Information Technology Reference
In-Depth Information
TABLE 17.1
MapReduce Cloud Implementations
Distribution
Model
Owner
Imp Name and Website
Start Time
Last Release
Google
Google MapReduce,
http://labs.google.com/
papers/inapreduce.html
2004
Internal use
by Google
Apache
Hadoop, http://hadoop.
apache.org/
2004
Hadoop0.20.0,
April 22, 2009
Open source
GridGain
GridGain http://www.
gridgain.com.
2005
GridGain 2.1.1,
February 26, 2009
Open source
Nokia
Disco ht, tp://
discoproject.org/
2008
Disco 0.2.3,
September 9, 2009
Open source
Geni.com
SkyNet, http://skynet.
rubyforge.org
2007
SkynetO.9.3,
May 31, 2008
Open source
Manjrasoft
MapReduce.net (optional
service of Aneka),
http://www.manjrasoft.
com products.html
2008
Aneka 1.0,
March 27, 2009
Commercial
TABLE 17.2
Comparison of MapReduce Implementations
Google MapReduce
Hadoop
Disco
Focus
Data intensive
Data intensive
Data intensive
Architecture
Master-slave
Master-slave
Master-slave
Platform
Linux
Cross-platform
Linux, Mac OS X
Storage system
GFS
HDFS, CloudStore, S3
GlusterFS
Implementation
technology
C++
J AVA
Erlang
Programming
environment
JAVA and Python
JAVA, shell utilities using
Hadoop streaming, C++
using Hadoop pipes
Python
Deployment
Deployed on Google
clusters
Private and public cloud
(EC2)
Private and public
cloud (EC2)
Some users and
applications
Google
Baidu, NetSeer, A9.com,
Facebook
Nokia Research
Center
with the existing Nutch project to create an open-source implementation of
the MapReduce architecture. It later became an independent subproject of
Lucene, was embraced by Yahoo! after the lead developer for Hadoop became
an employee, and became an official Apache top-level project in February
of 2006. Hadoop now encompasses multiple subprojects in addition to the
base core, MapReduce, and Hadoop distributed file system (HDFS). These
additional subprojects provide enhanced application processing capabilities
Search WWH ::




Custom Search