Distributed Programming for the Cloud - Large Scale and Big Data: Processing and Management

Database Reference

In-Depth Information

57. Alan Gates, Olga Natkovich, Shubham Chopra, Pradeep Kamath, Shravan Narayanam,

Christopher Olston, Benjamin Reed, Santhosh Srinivasan, and Utkarsh Srivastava.

Building a HighLevel Dataflow System on top of MapReduce: The Pig experience.

PVLDB , 2(2):1414-1425, 2009.

58. Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. The Google file system. In

SOSP , pp. 29-43, 2003.

59. Amol Ghoting, Prabhanjan Kambadur, Edwin P. D. Pednault, and Ramakrishnan

Kannan. NIMBLE: A toolkit for the implementation of parallel data mining and machine

learning algorithms on mapreduce. In KDD , pp. 334-342, 2011.

60. Amol Ghoting, Rajasekar Krishnamurthy, Edwin P. D. Pednault, Berthold Reinwald,

Vikas Sindhwani, Shirish Tatikonda, Yuanyuan Tian, and Shivakumar Vaithyanathan.

SystemML: Declarative machine learning on MapReduce. In ICDE , pp. 231-242, 2011.

61. Yunhong Gu and Robert L. Grossman. Lessons learned from a year's worth of bench-

marks of large data clouds. In SC-MTAGS , 2009.

62. Alon Y. Halevy. Answering queries using views: A survey. VLDB Journal , 10(4):270-

294, 2001.

63. Yongqiang He, Rubao Lee, Yin Huai, Zheng Shao, Namit Jain, Xiaodong Zhang, and

Zhiwei Xu. RCFile: A fast and space-efficient data placement structure in MapReduce-

based warehouse systems. In ICDE , pp. 1199-1208, 2011.

64. Arvid Heise, Astrid Rheinlaender, Marcus Leich, Ulf Leser, and Felix Naumann.

Meteor/Sopremo: An extensible query language and operator model. In BigData , 2012.

65. Herodotos Herodotou. Hadoop performance models. CoRR , abs/1106.0940, 2011.

66. Herodotos Herodotou and Shivnath Babu. Profiling, What-if Analysis, and Cost-based

Optimization of MapReduce Programs. PVLDB , 4(11):1111-1122, 2011.

67. Herodotos Herodotou, Fei Dong, and Shivnath Babu. MapReduce programming and

cost-based optimization? Crossing THIS CHASM with Starfish. PVLDB , 4(12):1446-

1449, 2011.

68. Herodotos Herodotou, Harold Lim, Gang Luo, Nedyalko Borisov, Liang Dong, Fatma

Bilgen Cetin, and Shivnath Babu. Starfish: A self-tuning system for big data analytics.

In CIDR , pp. 261-272, 2011.

69. Tony Hey, Stewart Tansley, and Kristin Tolle, editors. The Fourth Paradigm: Data-

Intensive Scientific Discovery . Microsoft Research, October 2009.

70. Benjamin Hindman, Andy Konwinski, Matei Zaharia, and Ion Stoica. A common sub-

strate for cluster computing. In HotCloud, USENIX Workshop , 2009.

71. Jiewen Huang, Daniel J. Abadi, and Kun Ren. Scalable SPARQL querying of large RDF

Graphs. PVLDB , 4(11):1123-1134, 2011.

72. Mohammad Farhan Husain, James P. McGlothlin, Mohammad M. Masud, Latifur

R. Khan, and Bhavani M. Thuraisingham. Heuristics-based query processing for large

RDF graphs using cloud computing. IEEE TKDE , 23(9):1312-1327, 2011.

73. Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, and Dennis Fetterly. Dryad:

Distributed data-parallel programs from sequential building blocks. In EuroSys ,

pp. 59-72, 2007.

74. Ming-Yee Iu and Willy Zwaenepoel. HadoopToSQL: A MapReduce query optimizer. In

EuroSys , pp. 251-264, 2010.

75. Eaman Jahani, Michael J. Cafarella, and Christopher Ré. Automatic optimization for

MapReduce programs. PVLDB , 4(6):385-396, 2011.

76. David Jiang, Anthony K. H. Tung, and Gang Chen. MAP-JOIN-REDUCE: Toward scal-

able and efficient data analysis on large clusters. IEEE TKDE , 23(9):1299-1311, 2011.

77. Dawei Jiang, Beng Chin Ooi, Lei Shi, and Sai Wu. The performance of MapReduce: An

in-depth study. PVLDB , 3(1):472-483, 2010.

78. Alekh Jindal, Jorge-Arnulfo Quiane-Ruiz, and Jens Dittrich. Trojan data layouts: Right

shoes for a running elephant. In SoCC , 2011.

Large Scale and Big Data: Processing and Management

Search WWH ::

Custom Search

Home