Information Seeking Behavior Prof. Marti Hearst SIMS 202, Lecture 25 Today Information Seeking Behavior Combine tactics into strategies Two parts of a process: search and retrieval analysis and synthesis of search results This is a fuzzy area; we will look at several different working theories. Marti A. Hearst SIMS 202, Fall 1997 Finding Out About Three phases: Asking of a question Construction of an answer Assessment of the answer Part of an iterative process Examine tactics and strategies for each phase Marti A. Hearst SIMS 202, Fall 1997 Tactics vs. Strategies Tactic: short term goals and maneuvers operators, actions Strategy: overall planning link a sequence of operators together to achieve some end Marti A. Hearst SIMS 202, Fall 1997 Lexis-Nexis Interface What tactics did you use? What strategies did you use? Marti A. Hearst SIMS 202, Fall 1997 Search Tactics and Strategies Search Tactics Bates 79 Search Strategies Belkin et al. 93, 94 Bates 90 O’Day and Jeffries 93 Marti A. Hearst SIMS 202, Fall 1997 Information Search Tactics (after Bates 79) Monitoring tactics Source-level tactics keep search on track navigate to and within sources Term and Search Formulation tactics designing search forumulation selection and revision of specific terms within search formulation Marti A. Hearst SIMS 202, Fall 1997 Term Tactics Move around the thesaurus superordinate, subordinate, coordinate neighbor (semantic or alphabetic) trace -- pull out terms from information already seen as part of search (titles, etc) morphological and other spelling variants antonyms (contrary) Marti A. Hearst SIMS 202, Fall 1997 Search Formulation Tactics Include or exclude terms Boolean focus queries Marti A. Hearst SIMS 202, Fall 1997 Source-level Tactics “Bibble”: Survey: look for a pre-defined result set e.g., a good link page on web look ahead, review available options e.g., don’t simply use the first term or first source that comes to mind Cut: eliminate large proportion of search domain e.g., search on rarest term first Marti A. Hearst SIMS 202, Fall 1997 Source-level Tactics (cont.) Stretch Scaffold use source in unintended way e.g., use patents to find addresses take an indirect route to goal e.g., when looking for references to obscure poet, look up contemporaries Cleave binary search in an ordered file Marti A. Hearst SIMS 202, Fall 1997 Monitoring Tactics (strategy-level) Check Weigh make a cost/benefit analysis of current or anticipated actions Pattern compare original goal with current state recognize common strategies Correct Errors Record keep track of (incomplete) paths Marti A. Hearst SIMS 202, Fall 1997 Additional Considerations (Bates 79) Add a Sort tactic! More detail is needed about short-term cost/benefit decision rule strategies When to stop? How to judge when enough information has been gathered? How to decide when to give up an unsuccesful search? When to stop searching in one source and move to another? Marti A. Hearst SIMS 202, Fall 1997 Information Seeking Strategies (Belkin et al. 93, 94) A multi-dimensional space: very simple tactic types very simple goal types information vs. meta-information Create a strategy type by choosing a value from each dimension Marti A. Hearst SIMS 202, Fall 1997 ISS Dimensions Goal of interaction Method of interaction learning (browsing to get to know an area) selection (identifying useful items) scanning (looking for something interesting) searching (looking for a specific known item) Resource type information meta-information Marti A. Hearst SIMS 202, Fall 1997 Information Seeking Strategies (Modified from Belkin et al. 93) ISS Scan A B C D E F G H Search Learn X X X X X X X X X X X X Select Info Meta -Info X X X X X X X X X X X X Marti A. Hearst SIMS 202, Fall 1997 Example ISS’s ISS G: prototypical specific search search through a specific information source retrieve articles that match a keyword specification of the topic Marti A. Hearst SIMS 202, Fall 1997 Example ISS’s ISS B: prototypical undirected search user approaches system with some vague idea about a topic scans through a meta-information structure learns about general topic information Marti A. Hearst SIMS 202, Fall 1997 Example ISS’s ISS D: scan through a table-of-contents of a journal to select items on a particular topic ISS A: scan through a periodicals shelf to learn what journals are available on a given topic Marti A. Hearst SIMS 202, Fall 1997 New strategy types What happens if we place Bates’ tactic types into Belkin et al.’s strategy space? Marti A. Hearst SIMS 202, Fall 1997 “Berry-Picking” as an Information Seeking Strategy (Bates 90) Standard IR model assumes the information need remains the same throughout the search process Berry-picking model interesting information is scattered like berries among bushes the query is continually shifting Marti A. Hearst SIMS 202, Fall 1997 Berry-picking model (cont.) The query is continually shifting Users may move through a variety of sources New information may yield new ideas and new directions The query is not satisfied by a single, final retrieved set, but rather by a series of selections and bits of information found along the way Marti A. Hearst SIMS 202, Fall 1997 A sketch of a searcher… “moving through many actions towards a general goal of satisfactory completion of research related to an information need.” (after Bates 90) Q2 Q4 Q3 Q1 Q5 Q0 Marti A. Hearst SIMS 202, Fall 1997 Implications Interfaces should make it easy to store intermediate results Interfaces should make it easy to follow trails with unanticipated results Difficulties with evaluation Marti A. Hearst SIMS 202, Fall 1997 Orienteering (O’Day & Jeffries 93) Interconnected but diverse searches on a single, problem-based theme Focus on information delivery rather than search performance Classifications resulting from an extended observational study: 15 clients of professional intermediaries financial analyst, venture capitalist, product marketing engineer, statistician, etc. Marti A. Hearst SIMS 202, Fall 1997 Orienteering (O’Day & Jeffries 93) Defined three main search types monitoring a well-known topic over time e.g., research four competitors every quarter following a plan a typical approach to the task at hand e.g., improve business process X exploratory explore topic in an undirected fashion get to know an unfamiliar industry Marti A. Hearst SIMS 202, Fall 1997 Orienteering Trends: (O’Day & Jeffries 93) A series of interconnected but diverse searches on one problem-based theme This happened in all three search modes Each analyst did at least two search types Each stage followed by reading, assimilation, and analysis of resulting material Marti A. Hearst SIMS 202, Fall 1997 Orienteering (O’Day & Jeffries 93) *Searches tended to trigger new directions Overview, then detail, repeat Information need shifted between search requests Context of problem and previous searches were carried to next stage of search *The value was contained in the accumulation of search results, not the final result set *Observations verified Bates’ predictions Marti A. Hearst SIMS 202, Fall 1997 Orienteering (O’Day & Jeffries 93) Triggers: motivation to switch from one strategy to another next logical step in a plan encountering something interesting explaining change finding missing pieces Marti A. Hearst SIMS 202, Fall 1997 Stop Conditions Categories not as clear as for triggers People stopped searching when no more compelling triggers finished an appropriate amount of searching for the task specific inhibiting factor e.g., learning market was too small lack of increasing returns (O’Day & Jeffries 93) 80/20 rule Missing information/inferences ok business world different than scholarship Marti A. Hearst SIMS 202, Fall 1997 After the Search: Analyzing and Synthesizing Results Orienteering Study Sensemaking Work Marti A. Hearst SIMS 202, Fall 1997 Analyzing and Synthesizing Search Results Orienteering Post-Search Behaviors: Read and Annotate Analyze six main types 80% fell into six main types the rest: cross-reference summarize find evocative visualizations miscellaneous Marti A. Hearst SIMS 202, Fall 1997 Post-Search Analysis Types (O’Day & Jeffries 93) Trends Comparisons Aggregation and Scaling Identifying a Critical Subset Assessing Interpreting Marti A. Hearst SIMS 202, Fall 1997 SenseMaking The process of encoding retrieved information to answer task-specific questions Combine (Russell et al. 93) internal cognitive resources external retrieved resources Create a good representation an iterative process contend with a cost/benefit tradoff Marti A. Hearst SIMS 202, Fall 1997 “Sensemaking” in the Business Intelligence Analysis Task (Russell et al. 93) Established analysis scheme Select important documents Collect documents For each topic, instantiate schema Organize documents by topic Collect additional required documents Write report Evaluate responses to report Generate final report Marti A. Hearst SIMS 202, Fall 1997 Sensemaking (Russell et al. 93) An anytime activity at any point a workable solution available usually more time -> better solution usually more properties -> better solution Marti A. Hearst SIMS 202, Fall 1997 Sensemaking (Russell et al. 93) A good strategy maximizes long term rate of gain example: new technology brings more info faster uniform increase in useful and useless information best strategy: throw out bad stuff faster Marti A. Hearst SIMS 202, Fall 1997 Sensemaking (Russell et al. 93) Most of the effort is in synthesis of a good representation covers the data increase usability decrease cost-of-use Marti A. Hearst SIMS 202, Fall 1997 Coming Up User Interfaces for Information Access Using MetaData in Search Hypertext Navigation and Search Marti A. Hearst SIMS 202, Fall 1997