Database Reference
In-Depth Information
[15] W. N. Francis and H. Kucera, “Brown Corpus Manual,” 1979.
[16] “Critical Assessment of Information Extraction in Biology
[Accessed 2 April 2014].
[17] J. J. Godfrey and E. Holliman, “Switchboard-1 Release 2,” Linguistic
Data Consortium, Philadelphia, 1997. [Online]. Available:
http://catalog.ldc.upenn.edu/LDC97S62
. [Accessed 2 April
2014].
[18] P. Koehn, “Europarl: A Parallel Corpus for Statistical Machine
Translation,”
MT Summit,
2005.
[19] N. Seco, T. Veale, and J. Hayes, “An Intrinsic Information Content
Metric for Semantic Similarity in WordNet,”
ECAI,
vol.
16
, pp.
1089-1090, 2004.
[20] P. Resnik, “Using Information Content to Evaluate Semantic
Similarity in a Taxonomy,” In
Proceedings of the 14th International Joint
Conference on Artificial Intelligence (IJCAI'95),
vol.
1
, pp. 448-453, 1995.
[21] T. Pedersen, “Information Content Measures of Semantic Similarity
Perform Better Without Sense-Tagged Text,”
Human Language
Technologies: The 2010 Annual Conference of the North American
Chapter of the Association for Computational Linguistics,
pp. 329-332,
June 2010.
[22] C. D. Manning, P. Raghavan, and H. Schütze, “Document and Query
Weighting Schemes,” in
Introduction to Information Retrieval,
Cambridge, United Kingdom, Cambridge University Press, 2008, p. 128.
[23] M. Porter, “Porter's English Stop Word List,” 12 February 2007.
[Online]. Available:
http://snowball.tartarus.org/algorithms/
english/stop.txt
.
[Accessed 2 April 2014].
[24] M. Steinbach, G. Karypis, and V. Kumar, “A Comparison of
Document Clustering Techniques,”
KDD workshop on text mining,
vol.
400
, no. 1, 2000.
[25] T. Joachims, “Transductive Inference for Text Classification Using
Support Vector Machines,”
ICML,
vol.
99
, pp. 200-209, 1999.
[26] P. Soucy and G. W. Mineau, “A Simple KNN Algorithm for Text
Categorization,”
ICDM,
pp. 647-648, 2001.
[27] B. Liu, X. Li, W. S. Lee, and P. S. Yu, “Text Classification by Labeling
Words,”
AAAI,
vol.
4
, pp. 425-430, 2004.