Database Reference
57. Alan Gates, Olga Natkovich, Shubham Chopra, Pradeep Kamath, Shravan Narayanam,
Christopher Olston, Benjamin Reed, Santhosh Srinivasan, and Utkarsh Srivastava.
Building a High-Level Dataflow System on top of MapReduce: The Pig experience.
PVLDB, 2(2):1414-1425, 2009.
58. Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. The Google file system. In
SOSP, pp. 29-43, 2003.
59. Amol Ghoting, Prabhanjan Kambadur, Edwin P. D. Pednault, and Ramakrishnan
Kannan. NIMBLE: A toolkit for the implementation of parallel data mining and machine
learning algorithms on MapReduce. In KDD, pp. 334-342, 2011.
60. Amol Ghoting, Rajasekar Krishnamurthy, Edwin P. D. Pednault, Berthold Reinwald,
Vikas Sindhwani, Shirish Tatikonda, Yuanyuan Tian, and Shivakumar Vaithyanathan.
SystemML: Declarative machine learning on MapReduce. In ICDE, pp. 231-242, 2011.
61. Yunhong Gu and Robert L. Grossman. Lessons learned from a year's worth of
benchmarks of large data clouds. In SC-MTAGS, 2009.
62. Alon Y. Halevy. Answering queries using views: A survey. VLDB Journal,
10(4):270-294, 2001.
63. Yongqiang He, Rubao Lee, Yin Huai, Zheng Shao, Namit Jain, Xiaodong Zhang, and
Zhiwei Xu. RCFile: A fast and space-efficient data placement structure in
MapReduce-based warehouse systems. In ICDE, pp. 1199-1208, 2011.
64. Arvid Heise, Astrid Rheinlaender, Marcus Leich, Ulf Leser, and Felix Naumann.
Meteor/Sopremo: An extensible query language and operator model. In BigData, 2012.
65. Herodotos Herodotou. Hadoop performance models. CoRR, abs/1106.0940, 2011.
66. Herodotos Herodotou and Shivnath Babu. Profiling, What-if Analysis, and Cost-based
Optimization of MapReduce Programs. PVLDB, 4(11):1111-1122, 2011.
67. Herodotos Herodotou, Fei Dong, and Shivnath Babu. MapReduce programming and
cost-based optimization? Crossing this chasm with Starfish. PVLDB,
4(12):1446-1449, 2011.
68. Herodotos Herodotou, Harold Lim, Gang Luo, Nedyalko Borisov, Liang Dong, Fatma
Bilgen Cetin, and Shivnath Babu. Starfish: A self-tuning system for big data analytics.
In CIDR, pp. 261-272, 2011.
69. Tony Hey, Stewart Tansley, and Kristin Tolle, editors. The Fourth Paradigm:
Data-Intensive Scientific Discovery. Microsoft Research, October 2009.
70. Benjamin Hindman, Andy Konwinski, Matei Zaharia, and Ion Stoica. A common
substrate for cluster computing. In HotCloud, USENIX Workshop, 2009.
71. Jiewen Huang, Daniel J. Abadi, and Kun Ren. Scalable SPARQL querying of large RDF
Graphs. PVLDB, 4(11):1123-1134, 2011.
72. Mohammad Farhan Husain, James P. McGlothlin, Mohammad M. Masud, Latifur
R. Khan, and Bhavani M. Thuraisingham. Heuristics-based query processing for large
RDF graphs using cloud computing. IEEE TKDE, 23(9):1312-1327, 2011.
73. Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, and Dennis Fetterly. Dryad:
Distributed data-parallel programs from sequential building blocks. In EuroSys,
pp. 59-72, 2007.
74. Ming-Yee Iu and Willy Zwaenepoel. HadoopToSQL: A MapReduce query optimizer. In
EuroSys, pp. 251-264, 2010.
75. Eaman Jahani, Michael J. Cafarella, and Christopher Ré. Automatic optimization for
MapReduce programs. PVLDB, 4(6):385-396, 2011.
76. David Jiang, Anthony K. H. Tung, and Gang Chen. MAP-JOIN-REDUCE: Toward
scalable and efficient data analysis on large clusters. IEEE TKDE,
23(9):1299-1311, 2011.
77. Dawei Jiang, Beng Chin Ooi, Lei Shi, and Sai Wu. The performance of MapReduce: An
in-depth study. PVLDB, 3(1):472-483, 2010.
78. Alekh Jindal, Jorge-Arnulfo Quiane-Ruiz, and Jens Dittrich. Trojan data layouts: Right
shoes for a running elephant. In SoCC, 2011.