52 References Bartell, B. T., et al. (1992). Latent Semantic Indexing is an Optimal Special Case of Multidimensional Scaling. In Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Copenhagen, Denmark, pp. 161-167. Cleverdon, C. W. (1967). The Cranfield Tests on Index Language Devices. Aslib Proceedings, 19, 173-192. ---. (1972). On the Inverse Relationship of Recall and Precision. Journal of Documentation, 23, 195-201. Ding, C. H. Q. (1999). A Similarity-based Probability Model for Latent Semantic Indexing. In In Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, California, pp. 58-65, New York, ACM Press. Dumais, S. T. (1995). Using LSI for information filtering: TREC-3 experiments. In D. Harman (Ed.), The Third Text Retrieval Conference (TREC3) National Institute of Standards and Technology Special Publication, 219-230. Berry, M. W., Dumais, S. T., and O'Brien, G. W. (1995). Using linear algebra for intelligent information retrieval. SIAM Review, 37(4), 1995, 573-595. Deerwester, S., et al. (1990). Indexing by Latent Semantic Analysis. Journal of the American Society for Information Science, 41(6), 391-407. Forsythe, G. E., Malcolm, M. A., & Moler, C. B. (1977). Computer Methods for Mathematical Computations. Prentice-Hall, Englewood Cliffs. Furnas, G. W., Landauer, T. K., Gomez, L. M., & Dumais, S. T. (1987). The Vocabulary problem in human-system communication. Communications of the ACM, 30(11), 964-971. Hofman, T. (1999). Probabilistic Latent Semantic Indexing. In Proceedings of the 22nd Annual International ACM SIGIR Conference on Research 53 and Development in Information Retrieval, Berkeley, California, pp. 50-57, New York, ACM Press. Jiang, F. & Littman, M. L. (Forthcoming). Approximate Dimension Equalization in Vector-based Information Retrieval. In Proceedings of the 17th Annual International Conference on Machine Learning, Stanford University. Landauer, T. K. & Dumais, S. T. (1997). A Solution to Plato’s Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. Psychological Review, 104(2), 211240. Landauer, T. K., Foltz, P. W., & Laham, D. (1998). Introduction to Latent Semantic Indexing. Discourse Processes, 25, 259-284. Lochbaum, K. E. & Streeter, L. A. (1989). Comparing and Combining the Effectiveness of Latent Semantic Indexing and the Ordinary Vector Space Model for Information Retrieval. Information Processing and Management, 25(6), 665-676. Losee, R. M. (1998). Text Retrieval and Filtering: Analytic Models of Performance. Kluwer, Boston. Manning, C. D. & Scheutze, H. (1999). Foundations of Statistical Natural Language Processing. MIT press, Cambridge. Oakes, M. P. (1998). Statistics for Corpus Linguistics. Edinburgh University Press, Edinburgh. Project Gutenberg. (2000, Retrieved 30 March 2000). http://www.promo.net/pg/. Salton, G. & McGill, M. (1983). An Introduction to Modern Information Retrieval. McGraw-Hill, New York. Salton, G., Wong, A., & Yang, C. S. (1975). A Vector Space Model for Automatic Indexing. Communications of the ACM, 18, 613-620. Strang, G. (1993). Introduction to Linear Algebra. Wellesley-Cambridge Press, Wellesley. 54 Story, R. E. (1996). An Explanation of the Effectiveness of Latent Semantic Indexing by means of a Bayesian Regression Model. Information Processing & Management, 32(3), 329-344. Tong, Y. L. (1990). The Multivariate Normal Distribution, Springer-Verlag. Van Rijsbergen, C. (1979). Information Retrieval (Second Edition). Butterworths, London. Wolfram, S. et al. (1999). Mathematica (version 4.0), Wolfram Research, Inc., http://www.wolfram.com. Yang, Y. (1995). Noise Reduction in a Statistical Approach to Text Categorization. In Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, Washington, pp. 256-263.