Database Reference
In-Depth Information
Power Laws : Many phenomena obey a law that can be expressed as y = cx a for some power a , often around −2. Such
phenomena include the sales of the x th most popular topic, or the number of in-links to the x th most popular page.
1.6 References for Chapter 1
[ 7 ] is a clear introduction to the basics of data mining. [ 2 ] covers data mining principally
from the point of view of machine learning and statistics.
For construction of hash functions and hash tables, see [ 4 ] . Details of the TF.IDF meas-
ure and other matters regarding document processing can be found in [ 5 ] . See [ 3 ] for more
on managing indexes, hash tables, and data on disk.
Power laws pertaining to the Web were explored by [ 1 ] . The Matthew effect was first
observed in [ 6 ].
[1] A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. Weiner, “Graph struc-
ture in the web,” Computer Networks 33:1-6, pp. 309-320, 2000.
[2] M.M. Gaber, Scientific Data Mining and Knowledge Discovery - Principles and Foundations , Springer, New York,
2010.
[3] H. Garcia-Molina, J.D. Ullman, and J. Widom, Database Systems: The Complete Book Second Edition, Prentice-
Hall, Upper Saddle River, NJ, 2009.
[4] D.E. Knuth, The Art of Computer Programming Vol. 3 ( Sorting and Searching ), Second Edition, Addison-Wesley,
Upper Saddle River, NJ, 1998.
[5] C.P. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval , Cambridge University Press,
2008.
[6] R.K. Merton, “The Matthew effect in science,” Science 159:3810, pp. 56-63, Jan. 5, 1968.
[7] P.-N. Tan, M. Steinbach, and V. Kumar, Introduction to Data Mining , Addison-Wesley, Upper Saddle River, NJ,
2005.
1 This startup attempted to use machine learning to mine large-scale data, and hired many of the top machine-learning
people to do so. Unfortunately, it was not able to survive.
2 See http://en.wikipedia.org/wiki/1854_Broad_Street_cholera_outbreak .
3 That is, assume our hypothesis that terrorists will surely buy a set of 10 items in common at some time during the
year. We don't want to address the matter of whether or not terrorists would necessarily do so.
Search WWH ::




Custom Search