CS4273 Distributed System Technologies and Programming Tutorial on Web Search

advertisement
CS4273
Distributed System Technologies and Programming
Tutorial on Web Search
1. Modify the HttpServer.java to make it multi-threading.
2. How does a web server collect back links information of web pages in HTTP/1.0?
3. The page ranking formula computes the ranks of page popularities. What are the other
factors that affect the rank of pages produced by Google for a query?
4. Construct a sorted inverted file for the following documents, with times (number of
times a term appears) and positions a term appears in a document:
Document
Text
1
Gold silver truck is a silver truck painted with gold
2
Shipment of gold damaged in a fire
3
Delivery of silver arrived in a silver truck
4
Shipment of gold arrived in a truck
Download