Searching the World Wide Web Introduction • Directories, Search Engines, and Metasearch Engines • Search Fundamentals • Search Strategies • How Does a Search Engine Work? From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 1 Searching the World Wide Web Directories, Search Engines, and Metasearch Engines • Directories • Popular Directories • Search Engines • Popular Search Engines • Metasearch Engines • Popular Metasearch Engines • White Pages From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 2 Searching the World Wide Web Directories Directories, Search Engines, and Metasearch Engines • Hierarchical representation of hyperlinks • Top level of general topics • Sublevels of more specialized subtopics • Easy to use • Not necessary to know exactly what looking for From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 3 Searching the World Wide Web Directories • General directory • • • • Web directory Web guide Subject directory Yahoo! --- www.yahoo.com • Specialized • Subject guides • Gateway pages • Financial aid resource center --- www.theoldschool.org From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 4 Searching the World Wide Web Popular Directories Directories, Search Engines, and Metasearch Engines • AOL NetFind • CNET Search.com • Excite • Infoseek • Looksmart • Lycos • Yahoo! From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 5 Searching the World Wide Web Search Engines Directories, Search Engines, and Metasearch Engines • Computer program: • • • • Accepts form containing query Searches database Returns URL Permits query revision • Specific • Query syntax From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 6 Searching the World Wide Web Search Engines • General search engine • Google --- www.google.com • Specialty search engine • Vertical search engine • Topic search engine • MySimon --- www.mysimon.com From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 7 Searching the World Wide Web Popular Search Engines • • • • • • • • Directories, Search Engines, and Metasearch Engines AOL NetFind AltaVista Excite HotBot InfoSeek Lycos WebCrawler Google From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 8 Searching the World Wide Web Metasearch Engines Directories, Search Engines, and Metasearch Engines • All-in-one search engine • Call other search engines • Use single query • More matches From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 9 Searching the World Wide Web Popular Metasearch Engines Directories, Search Engines, and Metasearch Engines • Metasearch • Metacrawler • MetaFind • SavvySearch From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 10 Searching the World Wide Web White Pages Directories, Search Engines, and Metasearch Engines • Information about individuals • Bigfoot • Four11 • WhoWhere • Yellow pages From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 11 Searching the World Wide Web Search Fundamentals • Introduction • Search Terminology • Pattern Matching Queries • Boolean Queries • Search Domain • Search Subjects From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 12 Searching the World Wide Web Introduction Search Fundamentals • Information bar • Search form area • Directory area • Links From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 13 Searching the World Wide Web Search Terminology Search Fundamentals • Search tool • Query • Query syntax • Query semantics • Hit • Match • Relevancy score From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 14 Searching the World Wide Web Pattern Matching Queries Search Fundamentals • Enter keyword(s) • Search engine returns URLs Boolean Queries • George Boole • AND, OR, and NOT From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 15 Searching the World Wide Web Search Fundamentals From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 16 Searching the World Wide Web Search Domain Search Fundamentals • Current web • Newsgroups • Specialized databases • Internet From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 17 Searching the World Wide Web Search Subjects Search Fundamentals • A way to view the search queries of anonymous users in real time • • • • • How busy “Spy” on other users “See” modifications Various interests Personal interests From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 18 Searching the World Wide Web Search Strategies Search Fundamentals • Introduction • Too Few Hits: Search Generalization • Too Many Hits: Search Specialization From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 19 Searching the World Wide Web Introduction Search Strategies • Determine which search engine to use: • • • • • User-friendly interface Documentation Conveniently accessible Database size Relevancy scores From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 20 Searching the World Wide Web Too Few Hits: Search Generalization Search Strategies • Eliminate keywords • Remove AND or NOT • Enlarge search domain • General keywords • Directory or metasearch engine From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 21 Searching the World Wide Web Too Many Hits: Search Specialization Search Strategies • Add keywords • Add AND or NOT • Capitalize proper nouns • Use first 20 URLs • Directory From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 22 Searching the World Wide Web How Does a Search Engine Work? • Search Engine Components • User Interface • Searcher • Evaluator • Gatherer • Indexer From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 23 Searching the World Wide Web Search Engine Components How Does a Search Engine Work? • User interface • Searcher • Evaluator • Gatherer • Indexer From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 24 Searching the World Wide Web User Interface How Does a Search Engine Work? • Submit queries • Display results • Relevancy scores • Matched page summaries Searcher • Program • Searches database for matches From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 25 Searching the World Wide Web Evaluator How Does a Search Engine Work? • Locates URLs • Result set • Factors affecting relevancy score: • • • • Times query words appear in page Query words in title Query words in CONTENT attribute Number of query words appearing in document From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 26 Searching the World Wide Web Gatherer How Does a Search Engine Work? • Traverses Web • Breadth-first search • In levels “across” pages • Depth-first search • Chain of hyperlinks “down” From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 27 Searching the World Wide Web How Does a Search Engine Work? From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 28 Searching the World Wide Web How Does a Search Engine Work? From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 29 Searching the World Wide Web Gatherer How Does a Search Engine Work? • Miscellaneous facts: • • • • • Heavy load on Web servers Restricted depth of searches Trouble with framed documents Dependent on collection of documents Full text indexing From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 30 Searching the World Wide Web Indexer How Does a Search Engine Work? • Organizes (indexes) gathered data • URL • Document title • Descriptive keywords From Greenlaw/Hepp, In-line/On-line: Fundamentals of the Internet and the World Wide Web 31