lecture0-CourseOverview

advertisement
Web Search and Mining
Course Overview
Web Search and Mining
Wu-Jun Li
Department of Computer Science and Engineering
Shanghai Jiao Tong University
Lecture 0: Course Overview
1
Web Search and Mining
Course Overview
General Information
 Instructor: Wu-Jun Li (李武军)




Email: liwujun@cs.sjtu.edu.cn
Homepage: http://www.cs.sjtu.edu.cn/~liwujun
Office: Rm 3-537, SEIEE Building
Office Hours: Thur 10:00am - 11:00am
 Course web site: http://www.cs.sjtu.edu.cn/~liwujun/course/wsm.html
 Teaching Assistant: TBD
 Lecture Time:
Wed 10:00 - 10:45 & 10:55 - 11:40
Fri 12:55 - 13:40 & 14:00 - 14:45
 Lecture Venue:
Rm 308, Rui-Qiu Chen Building(陈瑞球楼308)
2
Web Search and Mining
Course Overview
Textbook
 Christopher D. Manning, Prabhakar Raghavan, and Hinrich
Schütze. Introduction to Information Retrieval. Cambridge
University Press, 2008.
 The English reprint edition (英文影印版) can be bought
through China-Pub (http://www.china-pub.com/193197).
You can also download it from the book website
(http://nlp.stanford.edu/IR-book/information-retrievalbook.html).
3
Web Search and Mining
Course Overview
Reference Books

Bruce Croft, Donald Metzler, and Trevor Strohman. Search Engines: Information
Retrieval in Practice. Addison Wesley, 2009.
(The English reprint edition can be bought through China-Pub.)

Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents and Usage Data.
Springer, 2006.

Jiawei Han, and Micheline Kamber. Data Mining: Concepts and Techniques. Morgan
Kaufmann, Second Edition, 2006.
(The English reprint edition can be bought through China-Pub.)

Trevor Hastie, Robert Tibshirani, Jerome Friedman. The Elements of Statistical
Learning: Data Mining, Inference, and Prediction. Springer, Second Edition,2009.
(http://www-stat.stanford.edu/~tibs/ElemStatLearn/index.html)

Christopher M. Bishop. Pattern Recognition and Machine Learning. Springer, 2006.
4
Web Search and Mining
Course Overview
Course Topics
 Architecture of search engines
 The basics of information retrieval (IR)
 index construction and compression; Boolean retrieval;
vector space model; evaluation of IR systems; relevance
feedback and query expansion
 Probabilistic IR and language models
 Data mining and machine learning (ML) basics
 supervised learning; unsupervised learning; matrix
factorization
 Graph mining, social search and recommender
systems
5
Web Search and Mining
Course Overview
Prerequisites
 Data structure
 Design and analysis of algorithms
 Linear algebra
 Probability theory
6
Web Search and Mining
Course Overview
Grading Scheme
 In class quizzes (30%)
 Homework (30%)
 Project + presentation (40%)
7
Web Search and Mining
Course Overview
Late Assignments
 Assignments turned in late will be penalized 20% per
late day
8
Web Search and Mining
Course Overview
Academic Honor Code
 Honesty and integrity are central to the academic
work.
 All your submitted assignments must be entirely your
own (or your own group's).
 Any student found cheating or performing plagiarism
will receive a final score of zero for this course.
9
Web Search and Mining
Course Overview
Question?
10
Download