CPS 49S Google: The Computer Science Within and its Impact on Society

Shivnath Babu
Spring 2008
Teams
• Search Internals
– Bryan, Jason
• Non-Google Search
– Karima, Minerva
• Advertising
– Casey, Nick
• Fraud Detection
– A.J., Eddie
• Economy
– Dan, Danny, Sam
• Services
– Justin, Raquel
• Privacy
– Ashley, Kelly
• Future
– Kyle, Nolan, Taylor
Tentative Presentation Order
• Jason
• Bryan
• Karima
• Minerva
• Danny
• Nick
• Sam
• Casey
• A.J.
• Eddie
• Dan
• Emily
• Ashley
• Raquel
• Justin
• Taylor
• Kyle
• Nolan
Background
• HTML
– Tags
– Links (hyperlinks) and anchor text
– Meta tags
– Creating your web page at Duke
– Submitting your web site to Google
• Internet and Web
– Router
– URL
– IP address
– Domain name service
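To make the HTML and URL terms above concrete, here is a minimal Python sketch that pulls the hyperlinks, their anchor text, and the meta tags out of a small page, then splits each link's URL into its parts. The sample page, the class name PageParser, and the meta tag contents are made up for illustration; only the linked URL appears elsewhere in these slides.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class PageParser(HTMLParser):
    """Collects hyperlinks (with anchor text) and meta tags from one page."""

    def __init__(self):
        super().__init__()
        self.links = []    # list of (href, anchor text) pairs
        self.meta = {}     # meta tag name -> content
        self._href = None  # href of the <a> tag we are currently inside
        self._text = []    # pieces of anchor text seen so far

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and "href" in attrs:
            self._href = attrs["href"]
            self._text = []
        elif tag == "meta" and "name" in attrs:
            self.meta[attrs["name"]] = attrs.get("content", "")

    def handle_data(self, data):
        # Text between <a ...> and </a> is the anchor text.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


# Hypothetical page, invented for this example.
SAMPLE_PAGE = """
<html>
  <head>
    <title>My page</title>
    <meta name="description" content="A student page about web search">
    <meta name="keywords" content="search, crawling, indexing">
  </head>
  <body>
    <p>Read about
      <a href="http://www.google.com/intl/en/corporate/tech.html">Google technology</a>.
    </p>
  </body>
</html>
"""

parser = PageParser()
parser.feed(SAMPLE_PAGE)

for href, anchor_text in parser.links:
    parts = urlparse(href)  # split the URL into scheme, host name, and path
    print(anchor_text, "->", parts.scheme, parts.netloc, parts.path)
print(parser.meta)
```

The host name part of the URL (www.google.com here) is what the domain name service translates into an IP address; that lookup needs a network connection, so it is left out of the sketch.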
Overview of Reading Material
• Search technology
– Crawling, indexing, query processing
• Data about who and what of web searches
• The intent behind search
– Informational, navigational, transactional
• The monetary angle
• Improving current search
Search Technology
• Search as seen by the user
• Search as seen by the search engine
– http://www.google.com/intl/en/corporate/tech.html
– Crawling → document repositories
– Indexing
• What information can be extracted from a web page? (Analysis)
• How to organize this information
– Processing user queries
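The indexing and query-processing steps above can be sketched with a toy inverted index. This is a simplified illustration, not a description of how Google does it: the repository contents, the example.edu URLs, and the function names are all hypothetical, and a real engine would also rank the results.

```python
import re
from collections import defaultdict

# Hypothetical document repository (what a crawler would produce):
# URL -> page text. Contents are made up for illustration.
REPOSITORY = {
    "http://example.edu/a": "Duke basketball schedule and tickets",
    "http://example.edu/b": "Abraham Lincoln biography and speeches",
    "http://example.edu/c": "Raleigh hotel deals near Duke",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(repository):
    """Indexing: map each word to the set of URLs whose text contains it."""
    index = defaultdict(set)
    for url, text in repository.items():
        for word in tokenize(text):
            index[word].add(url)
    return index


def search(index, query):
    """Query processing: return the URLs that contain every query word."""
    words = tokenize(query)
    if not words:
        return []
    matches = set.intersection(*(index.get(word, set()) for word in words))
    return sorted(matches)


index = build_index(REPOSITORY)
print(search(index, "duke"))
print(search(index, "Raleigh hotel"))
```

The index is built once, ahead of time; answering a query then only needs a few set lookups instead of a scan over every page.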
Search Technology (contd.)
• Holy grail of a search engine: find the true intent of the searcher
– What makes this hard?
– Metadata, tags, cue words, concept
• Improving search
– First, second, and third generation search (more on this later)
– Tracking links that users follow
Who/What/Why
• Who is searching
• What/Why
– Informational
– Navigational
– Transactional
• Categorize the following searches
– Southwest airlines
– Raleigh hotel
– Abraham Lincoln
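One naive way to attempt the categorization above in code is to look for cue words. The sketch below is only a rough illustration: the cue lists and the function name classify_intent are invented here, the labels it assigns are one plausible reading rather than an answer key, and real queries are often ambiguous.

```python
# Hypothetical cue lists, invented for this sketch.
KNOWN_SITES = {"southwest airlines", "google", "duke university"}
TRANSACTIONAL_CUES = {"hotel", "buy", "tickets", "download", "cheap"}


def classify_intent(query):
    """Return 'navigational', 'transactional', or 'informational'."""
    q = query.lower().strip()
    # The whole query names a known site: the user likely wants to go there.
    if q in KNOWN_SITES:
        return "navigational"
    # A cue word suggests the user wants to buy, book, or obtain something.
    if any(word in TRANSACTIONAL_CUES for word in q.split()):
        return "transactional"
    # Otherwise assume the user is looking for information.
    return "informational"


for query in ("Southwest airlines", "Raleigh hotel", "Abraham Lincoln"):
    print(query, "->", classify_intent(query))
```

A fixed word list like this breaks down quickly (someone searching "Southwest airlines" may want to buy a ticket, not just reach the site), which is part of what makes finding the true intent hard.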
Web Search Vs. Information Retrieval
• What are the similarities?
• What are the dissimilarities?
The Monetary Angle
• What does the graph on Page 35 represent?
• Paid search
– Who is paying? (50 cents per click; let us do the math)
– What are the stakes?
– The local market
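The "50 cents per click" arithmetic can be worked through in a few lines. Only the 50-cent figure comes from the slide; the click volumes below are hypothetical values chosen to show how the total scales, not actual data.

```python
COST_PER_CLICK = 0.50  # dollars per click, the figure quoted on the slide


def paid_search_revenue(clicks, cost_per_click=COST_PER_CLICK):
    """Total amount advertisers pay for the given number of clicks."""
    return clicks * cost_per_click


# Hypothetical click volumes, for illustration only.
for clicks in (1_000, 1_000_000, 100_000_000):
    revenue = paid_search_revenue(clicks)
    print(f"{clicks:>11,} clicks -> ${revenue:,.2f}")
```

Each click is cheap, but the total grows linearly with volume, so the stakes depend almost entirely on how many clicks a search engine can deliver.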
Search Innovations
• First generation search engines
• Second generation search engines
• Third generation search engines
Quiz 1, HW1, and Assigned Readings
• For Tuesday (1/22) and Thursday (1/24) of next week
– Early paper by the Google guys
– Posted on the course readings web page
• Readings for Thursday (1/24)
– How Internet Search Engines Work
– Chapter 2 from textbook (continued)
• HW1 will be posted tomorrow (Wednesday)
– Is due next Thursday
– Has to be submitted as the URL of a web page you create at Duke
• Quiz 1 on Tuesday 1/29