Database Reference
In-Depth Information
well-known agencies that use Hadoop to conduct distributed computation are listed
in [ 5 ]. In addition, many companies provide Hadoop commercial execution and
support, including Cloudera, IBM, MapR, EMC, and Oracle.
Among modern industrial machinery and systems, sensors are widely deployed
to collect information for environment monitoring and failure forecasting, etc.
Bahga and others in [ 6 ] proposed a framework for data organization and cloud
computing infrastructure, termed CloudView. CloudView uses mixed architectures,
local nodes, and remote clusters based on Hadoop to analyze machine-generated
data. Local nodes are used for the forecast of real-time failures; clusters based on
Hadoop are used for complex offline analysis, e.g., case-driven data analysis.
The exponential growth of the genome data and the sharp drop of sequencing
cost transform bio-science and bio-medicine to data-driven science. Gunarathne et
al. in [ 7 ] utilized cloud computing infrastructures, Amazon AWS, Microsoft Azune,
and data processing framework based on MapReduce, Hadoop, and Microsoft
DryadLINQ to run two parallel bio-medicine applications: (a) assembly of genome
segments; (b) dimension reduction in the analysis of chemical structure. In the
subsequent application, the 166-D datasets used include 26,000,000 data points.
The authors compared the performance of all the frameworks in terms of efficiency,
cost, and availability. According to the study, the authors concluded that the
loose coupling will be increasingly applied to research on electron cloud, and the
parallel programming technology (i.e., MapReduce) framework may provide users
an interface with more convenient services and reduce unnecessary costs.
References
1. Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash
Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, and Werner Vogels.
Dynamo: amazon's highly available key-value store. In SOSP , volume 7, pages 205-220, 2007.
2. Luigi Atzori, Antonio Iera, and Giacomo Morabito. The internet of things: A survey. Computer
Networks , 54(15):2787-2805, 2010.
3. Yantao Sun, Min Chen, Bin Liu, and Shiwen Mao. Far: A fault-avoidant routing method
for data center networks with regular topology. In Proceedings of ACM/IEEE Symposium on
Architectures for Networking and Communications Systems (ANCS'13) . ACM, 2013.
4. Tom White. Hadoop: the definitive guide . O'Reilly, 2012.
5. Wiki. Applications and organizations using hadoop. http://wiki.apache.org/hadoop/PoweredBy ,
2013.
6. Arshdeep Bahga and Vijay K Madisetti. Analyzing massive machine maintenance data in a
computing cloud. Parallel and Distributed Systems, IEEE Transactions on , 23(10):1831-1843,
2012.
7. Thilina Gunarathne, Tak-Lon Wu, Jong Youl Choi, Seung-Hee Bae, and Judy Qiu. Cloud
computing paradigms for pleasingly parallel biomedical applications. Concurrency and Com-
putation: Practice and Experience , 23(17):2338-2354, 2011.
Search WWH ::




Custom Search