Search Engine Strategies . . . Beyond Yahoo and Google Presented by Linda J. Goff, Head, Instructional Services CSUS Library LJG 8/11/04 Fall, 2004 Today’s Agenda Web Structure, Jargon & definitions. How search engines think and work. Picking the right web search tool. Searching techniques & tips. Evaluating your sources thinking critically about information. Demonstration. CSUS Library Instruction Fall 2004 Glossary blogging browser cache cookies html http hypertext link CSUS Library Instruction Fall 2004 Metasearch Invisible Web phishing portal sites SacLink telnet URL What is the World Wide Web? The World Wide Web (WWW) is a global interactive, dynamic, cross-platform, graphical hypertext information system that runs on the Internet. CSUS Library Instruction Fall 2004 The Web is Growing Exponentially Over 10% of World is connected! Internet users estimated to be 605.60 million as of September 2002. CSUS Library Instruction Fall 2004 Source: http://www.nua.ie/surveys/how_many_online/ 3 Main Types of Search Tools:... Web Directory - Hierarchical organized in a classification system. Standard Search Engine – uses mathematical algorithms and boolean searches for keyword searches Expert Pages – reviewed list. CSUS Library Instruction Fall 2004 Expert Pages Infomine - Scholarly Internet Resource Collection http://infomine.ucr.edu/ Librarians Index to the Internet Information You Can Trust http://lii.org/ The WWW Virtual Library http://www.vlib.org CSUS Librarian Guides: http://library.csus.edu/guides/ CSUS Library Instruction Fall 2004 Now Web search tools can ... CSUS Library Instruction Fall 2004 Search multiple search engines simultaneously. Find sites that answer natural language questions. Ranks sites by how many links have been made to them. Sorts matches into folders by categories. Have advanced searching features Or - a combination of the above. Metasearch engines Search simultaneously across multiple search engines and displays top sites in each: – Dogpile.com – Vivisimo.com Warning: Some now charge for higher listings, e.g., Overture CSUS Library Instruction Fall 2004 Natural Language Search CSUS Library Instruction Fall 2004 Type your questions in Natural Language, e.g., AskJeeves.com Analyzes words, grammar and syntax, and uses "templatics" to look for patterns in the way questions are asked. Jeeves responds with one or more closely related questions that it already knows the answer to. Part 2 How Search Engines Think and Work Search Engines Most search engines and databases use Boolean Operators to create search statements, e.g. (domestic or family) and (violence not sexual abuse) CSUS Library Instruction Fall 2004 Boolean Operators AND requires both terms to appear in the items that are retrieved. OR requires either term to appear in the items that are retrieved. NOT excludes a term. Boolean Search Strategy a AND b family and violence a OR c family or domestic b NOT d violence not sexual abuse a a bb b c d What Search Engines Don’t Search ‘Bots only crawl the visible web which is only about 20% of everything that is on the Internet. They don’t look at the “Deep Web”, or “The Invisible Web.” CSUS Library Instruction Fall 2004 Invisible Web contains CSUS Library Instruction Fall 2004 Commercial databases that charge a fee, e.g., library research databases of periodical articles. Sites that require membership or a login. Searchable pages such as catalogs, phone books or directories, e.g.AMA Physician Search. Library Databases Access Authentication automatic for users with Web access via CSUS and SacLink. CSUS users with other Internet Service Providers (AOL, Prodigy etc.) must create a P.I.N. in EUREKA for authentication to access Library databases. To connect from off campus go to http://www.lib.csus.edu/databases/help/page. CSUS Library Instruction Fall 2004 Choose based on your Information Need Try Noodle Tools: http://www.noodletools.com/de bbie/literacies/information/5loc ate/adviceengine.html CSUS Library Instruction Fall 2004 Search Engine Comparisons Most have built-in search tips or help screens. Boolean operators, phrase searching and other limiters are often available. Be aware! Some now charge for higher page placement e.g, Overture. CSUS Library Instruction Fall 2004 Handout See “Searching the Web” handout of special search features and URLs for most popular search engines. http://libweb.uoregon.edu/guide s/searchweb/srchweb-info.html CSUS Library Instruction Fall 2004 Part 3 Search Tips & Strategies World Wide Web Reading Parts of the URL http://www.lib.csus.edu/databases/ The part before the colon is the access method or protocol, (hypertext transfer protocol). The part after the double slashes is the net address or domain name of the computer where the resource is located. The directory path and filename come after the next slash. Common Codes in Domain Names edu - higher education com - commercial firms (+22 million) gov - government agencies mil - military (US) org - general noncommercial organizations net - computer networks int - international organizations State or Country of origin: – uk (United Kingdom) – ca (Canada) – ca.us (California. United States) New Suffixes added by ICANN, effective Spring 2002 .info (anyone) .biz (business) .name (individuals) .pro (professionals) CSUS Library Instruction Fall 2004 .museum (accredited) .aero (Airtransport industry) .coop (business cooperatives) Kids.us (Content and technology restrictions) Think critically about the information you find on the Web... Anybody can publish anything on the Web. There are no editors and no central authorities. There are no guarantees that the site you find will be there next time you look. CSUS Library Instruction Fall 2004 Questions you should ask when evaluating a Web page: Who is the author or sponsor? What authority/expertise do they have? What is the purpose/scope of the page? Is it current? When was it last updated? How complete and accurate is the information? Does it have a bias? How usable is it? Do the the links work? CSUS Library Instruction Fall 2004 You must... Examine assumptions and possible biases. Distinguish between fact and opinion. Compare and contrast related pieces of information from other sources (print and online). CSUS Library Instruction Fall 2004 Bogus sites proliferate POP! the First Human Male Pregnancy – http://www.malepregnancy.com Dihydrogen Monoxide Research – http://www.dhmo.org/ Clones-R-Us – http://www.d-b.net/dti/ CSUS Library Instruction Fall 2004 Sites need to be examined carefully and compared Martin Luther King Jr. – A Historical Examination – http://www.mlking.org The King Center http://web.archive.org/web/200102081 60923/http://thekingcenter.org/ – http://www.thekingcenter.com/ CSUS Library Instruction Fall 2004 Web Searching Tips Use unique words or phrases. Check spelling ! Use synonyms or multiple spellings (e.g., marijuana marihuana) Try more than one search engine. Use words like “research” or “policy” to find more scholarly sites. Use domain limit feature e.g., Domain:edu or domain:gov CSUS Library Instruction Fall 2004 Citing Electronic Sources Look for it on the Library Home Page under Databases and Periodical Indexes. Look on the left for Guides For General & News and click on Citing Electronic Sources. The URL is... http://www.lib.csus.edu/guides/budge/eo graphy.htm. WARNING Con artists and scams are proliferating on the Web. Don’t use your credit card number unless you are assured of a secure system. Don’t download unfamiliar software. Don’t give out personal information. CSUS Library Instruction Fall 2004 Browser Configuration Tips Clear the memory cache before you begin a search session. It will speed up your response time. Use the following path for I.E.: Tools -> Internet Options -> Delete Files. For Netscape use : Edit -> Preferences -> Advanced -> Cache Delete Cookies at the same screen. CSUS Library Instruction Fall 2004 Shortcuts Use Bookmarks or Favorites Use Go from the pull-down menus instead of the Back button or use the History or right mouse button. Use the Stop and Reload buttons if loading a document takes too long. CTRL ALT DEL will bring up Windows 2000 Task Manager and you can close the browser if it is not responding. CSUS Library Instruction Fall 2004 Part 4 Popular Search Engines World Wide Web US Digital Media Universe Audience Reach Home & Work Users January 2003 KEY: GG=Google, YH=Yahoo, MSN=MSN, AOL=AOL, AJ=Ask Jeeves, OVR=Overture (GoTo), IS=InfoSpace, NS=Netscape, AV=AltaVista, LY=Lycos, ELINK=EarthLink.com, LS=LookSmart, CSUS Library Instruction Fall 2004 http://searchenginewatch.com/reports/netratings.html Billions Of Textual Documents Indexed as of Sept 2, 2003 KEY: GG=Google ATW=AllTheWeb, INK=Inktomi, AV=AltaVista, TMA=Teoma. CSUS Library Instruction Fall 2004 Source: http://searchenginewatch.com/reports/article.php/2156481 There are specialized search engines for almost every topic For a list of over 3,000 search engines go to Search Engine Guide: http://www.searchengineguide.com For detailed information aimed at search professionals try SearchEngineWatch: http://www.searchenginewatch.com CSUS Library Instruction Fall 2004 Most Popular Search Engines All the Web, AltaVista, Gigablast, Ask Jeeves, Dogpile, Google, HotBot, Metacrawler, LookSmart, Lycos, MSN Search, Netscape Search, Teoma, WiseNut and Yahoo! (Source: SearchEngineWatch.com) CSUS Library Instruction Fall 2004 Google.com Result rankings are based on the number of links made to the site from other web pages. Give you sites that web page creators have “voted” for with their links. An .edu link counts more than one from a .com page. CSUS Library Instruction Fall 2004 Google.com Most popular search engine & searches greatest number of pages (3.3 million) Special features include Advanced search, Image, Froogle, Blogger, Google Catalogs etc. CSUS Library Instruction Fall 2004 Vivisimo.com Queries one or more web search engines (Metasearch). Clusters Documents into groups based on this information. Groups the documents Orders the groups and the documents within each group. Displays the hierarchical categories. CSUS Library Instruction Fall 2004 Yahoo.com Originated “Directory” format to organize sites by subject and subheadings. Can personalize: “My Yahoo”. Geographic versions “Get Local.” CSUS Library Instruction Fall 2004 Teoma.com Results – ranked list. Refine – suggestions to narrow your search. Resources – link collections from experts to enthusiasts. Watch out for “Sponsored” page results – paid listings. CSUS Library Instruction Fall 2004 Hotbot.com Advanced searching in Hotbot and other search engines lets you limit by: – – – – – CSUS Library Instruction Fall 2004 Language Domain Region Date Content etc. Alltheweb.com Indexes 3.15 billion pages (almost as many as Google). You can customized your preferences. Language translator and language settings. CSUS Library Instruction Fall 2004 This PowerPoint presentation was prepared by: Linda J. Goff Head, Instructional Services University Library California State University, Sacramento. ljgoff@csus.edu http://www.lib.csus.edu/services/instruction/indiv/ CSUS Library Instruction Fall 2004 LJG:2/16/2004 Search Engine Comparison Always try more than one! http://www.llrx.com /features/searchen ginechart.htm CSUS Library Instruction Fall 2004 CSUS Library Instruction Fall 2004