here - FAU Semantic Web Course

advertisement
COT 6930 - Web 2.0 Architectures and Algorithms - Spring 2015
Online and Live Course Sections
Course Syllabus: Web 2.0 architectures, models and patterns; and algorithms for searching, ranking, recommending,
social networking, clustering, and classification.
Text books: 1. Mining the Social Web, Matthew A Russel, O'Reilly, 2nd edition, 2013, 978-1-449-36761-9; 2.
Governor, J., Hinchcliffe, D., and Nickull, D., Web 2.0 Architectures, O’Reilly, 2009
Pre-requisite: Graduate student status in computer science, computer engineering, or ITOM; Consent of Instructor
for graduate majors in other fields; Programming background in Python. Python is easy to learn. A good book on
Python: McKinney, W., Python for Data Analytics, O’Reilly, 2013
Instructor: Ravi Shankar, Professor CEECS; (561) 297-3470, shankar@fau.edu; Office Hours: MWF 3 to 5 PM
Course Time and Place: MWF 2 to 2.50 PM, FL 401, Online and Live sections
Course Description: Web 2.0, or Social Web, is architecture of user participation, and goes beyond the page
metaphor of Web 1.0 to deliver rich user experiences (Tim O’Reilly, 2005). The course will discuss the architectural
models and patterns, and algorithms that make today’s web experience so unique from this social networking
perspective. The course is not about web-enabling technologies such as JSON, XUL and RESTful. The algorithms
discussed are optimized for social networking issues such as information retrieval across various formats, indexing,
link analysis, ranking, user click analysis, natural language processing, recommending, discovering user groups,
link-based clustering, and rule-based data classification. Practical examples of social web mining at Twitter,
Facebook, LinkedIn, Google+, and Github will be covered. Use of semantic web to enhance the experience will be
explored.
Grading:
 Five Assignments (in Python) with data at key social web sites: 40% - during the semester
 Mid-term and Final Exam 30% - during the 7th and final week
 Research Paper: 20% - during the second half of the semester; due at the end of the semester
 Web 2.0 community service (review advances in architecture, standard, algorithm, , etc.) 10% - anytime
Topics to cover: Number of 50 minute lectures in parentheses.
1. Web 2.0 Architecture - Intro, new client-server model, reference architecture, and patterns (4)
2. Web profile - Data and Meta data; Distributed, Large, Sparse, and Dynamic databases; and Social Linkages.
Embedding intelligent Web 2.0 behavior (4)
3. Trending with Frequency Analysis: An overview example with Twitter (4)
4. Social Graphs, Meta Tags, and Click Analysis: Example of Facebook (4)
5. Clustering Algorithms - Identification of user groups with traditional, clustering, link-based, and K-means
algorithms. Example of LinkedIn(4)
6. Searching Algorithms: Information retrieval, indexing and ranking; Precision and recall. Example of
Google+ (4)
7. Natural Language Processing - Web crawling with link analysis and Discovering Semantics - Examples on
WWW (4)
8. Algorithms for suggestions and recommendations: collaborative filtering and content-based recommendation.
Example of Github (4)
9. Time Series Trends and Analysis - Example of mailbox mining (4)
10. Classification Algorithms - structural, statistical and rule-based classifiers; life cycle of a classifier (4)
11. Research Issues – Performance and computing architectures, Semantic Web, Socio-Technical Systems,
Natural Language Processing, Visualization, and Domain specific constraints (4)
References:
 Marmanis, H., and Babenko, D., Algorithms of the Intelligent Web, Manning, 2009;
 O’Reilly, T., What is Web 2.0 - Design Patterns and Business Models for the Next Generation of Software,
http://oreilly.com/web2/archive/what-is-web-20.html, 2006
 McKinney, W., Python for Data Analytics, O’Reilly, 2013
 Contemporary topics in Web 2.0, senior student presentations, http://csi.fau.edu/courses/senior-seminar/,
2013
 McCandless, M., Hatcher, E., and Gospodnetic, O., Lucene in Action, Manning, 2nd edition, 2010
 Visualization and Analytics, special issues, IEEE Computer Magazine, May and July 2013
 Socio-Technical Systems Engineering Handbook, http://archive.cs.st-andrews.ac.uk/STSEHandbook/index.html , accessed in 2013.
 Medical domain infrastructure: http://www.nlm.nih.gov/research/umls/
Download