57. Alan Gates, Olga Natkovich, Shubham Chopra, Pradeep Kamath, Shravan Narayanam,
Christopher Olston, Benjamin Reed, Santhosh Srinivasan, and Utkarsh Srivastava.
Building a High-Level Dataflow System on top of MapReduce: The Pig experience.
PVLDB, 2(2):1414-1425, 2009.
58. Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. The Google file system. In
SOSP, pp. 29-43, 2003.
59. Amol Ghoting, Prabhanjan Kambadur, Edwin P. D. Pednault, and Ramakrishnan
Kannan. NIMBLE: A toolkit for the implementation of parallel data mining and machine
learning algorithms on MapReduce. In KDD, pp. 334-342, 2011.
60. Amol Ghoting, Rajasekar Krishnamurthy, Edwin P. D. Pednault, Berthold Reinwald,
Vikas Sindhwani, Shirish Tatikonda, Yuanyuan Tian, and Shivakumar Vaithyanathan.
SystemML: Declarative machine learning on MapReduce. In ICDE, pp. 231-242, 2011.
61. Yunhong Gu and Robert L. Grossman. Lessons learned from a year's worth of
benchmarks of large data clouds. In SC-MTAGS, 2009.
62. Alon Y. Halevy. Answering queries using views: A survey. VLDB Journal,
10(4):270-294, 2001.
63. Yongqiang He, Rubao Lee, Yin Huai, Zheng Shao, Namit Jain, Xiaodong Zhang, and
Zhiwei Xu. RCFile: A fast and space-efficient data placement structure in
MapReduce-based warehouse systems. In ICDE, pp. 1199-1208, 2011.
64. Arvid Heise, Astrid Rheinlaender, Marcus Leich, Ulf Leser, and Felix Naumann.
Meteor/Sopremo: An extensible query language and operator model. In BigData, 2012.
65. Herodotos Herodotou. Hadoop performance models. CoRR, abs/1106.0940, 2011.
66. Herodotos Herodotou and Shivnath Babu. Profiling, what-if analysis, and cost-based
optimization of MapReduce programs. PVLDB, 4(11):1111-1122, 2011.
67. Herodotos Herodotou, Fei Dong, and Shivnath Babu. MapReduce programming and
cost-based optimization? Crossing this chasm with Starfish. PVLDB,
4(12):1446-1449, 2011.
68. Herodotos Herodotou, Harold Lim, Gang Luo, Nedyalko Borisov, Liang Dong, Fatma
Bilgen Cetin, and Shivnath Babu. Starfish: A self-tuning system for big data analytics.
In CIDR, pp. 261-272, 2011.
69. Tony Hey, Stewart Tansley, and Kristin Tolle, editors. The Fourth Paradigm:
Data-Intensive Scientific Discovery. Microsoft Research, October 2009.
70. Benjamin Hindman, Andy Konwinski, Matei Zaharia, and Ion Stoica. A common
substrate for cluster computing. In HotCloud, USENIX Workshop, 2009.
71. Jiewen Huang, Daniel J. Abadi, and Kun Ren. Scalable SPARQL querying of large RDF
graphs. PVLDB, 4(11):1123-1134, 2011.
72. Mohammad Farhan Husain, James P. McGlothlin, Mohammad M. Masud, Latifur
R. Khan, and Bhavani M. Thuraisingham. Heuristics-based query processing for large
RDF graphs using cloud computing. IEEE TKDE, 23(9):1312-1327, 2011.
73. Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, and Dennis Fetterly. Dryad:
Distributed data-parallel programs from sequential building blocks. In EuroSys,
pp. 59-72, 2007.
74. Ming-Yee Iu and Willy Zwaenepoel. HadoopToSQL: A MapReduce query optimizer. In
EuroSys, pp. 251-264, 2010.
75. Eaman Jahani, Michael J. Cafarella, and Christopher Ré. Automatic optimization for
MapReduce programs. PVLDB, 4(6):385-396, 2011.
76. Dawei Jiang, Anthony K. H. Tung, and Gang Chen. MAP-JOIN-REDUCE: Toward
scalable and efficient data analysis on large clusters. IEEE TKDE, 23(9):1299-1311, 2011.
77. Dawei Jiang, Beng Chin Ooi, Lei Shi, and Sai Wu. The performance of MapReduce: An
in-depth study. PVLDB, 3(1):472-483, 2010.
78. Alekh Jindal, Jorge-Arnulfo Quiané-Ruiz, and Jens Dittrich. Trojan data layouts: Right
shoes for a running elephant. In SoCC, 2011.