Evaluating ASSOCIATIVE BROWSING by Simulation
Jin Y. Kim / W. Bruce Croft / David Smith

* What do you remember about your documents?
[Screenshot: example personal documents labeled "Registration" and "James"]
Use search if you recall keywords!

* What if keyword search is not enough?
Associative browsing to the rescue!

* Probabilistic User Modeling
• Query generation model
  • Term selection from a target document [Kim & Croft 09]
• State transition model
  • Use browsing when a result looks marginally relevant
• Link selection model
  • Click on browsing suggestions based on perceived relevance

* Simulating Interaction using Probabilistic User Model
[Flow diagram: the target document yields an initial query (e.g. "James Registration"), which is searched.
If the result is not relevant (Rank_D > 50), the query is reformulated (e.g. "Two Dollar Registration");
if it is marginally relevant (11 < Rank_D < 50), the user clicks on a result and browses from it;
once the target document appears in the top 10, the session ends.]

* A User Model for Link Selection
• User's browsing behavior [Smucker & Allan 06]
  • Fan-out 1-3: the number of clicks per ranked list
  • BFS vs. DFS: the order in which documents are visited

* A User Model for Link Selection
• User's level of knowledge
  • Random: randomly click on items in a ranked list
  • Informed: more likely to click on a more relevant item
  • Oracle: always click on the most relevant item
• Relevance is estimated using the position of the target item
[Figure: example ranked lists illustrating the three click strategies]

* Evaluation Results
• Simulated interaction was generated using the CS collection
  • 63,260 known-item finding sessions in total
• The value of browsing
  • Browsing was used in 15% of all sessions
  • Browsing saved 42% of the sessions in which it was used
• Comparison with user study results
  • Roughly matches in terms of overall usage and success ratio

  Evaluation Type   Total     Browsing used     Successful
  Simulation        63,260    9,410 (14.8%)     3,957 (42.0%)
  User Study        290       42 (14.5%)        15 (35.7%)

* Evaluation Results
• Success Ratio of Browsing
[Chart: success ratio of browsing (roughly 0.30-0.48) for random, informed, and oracle user models
across fan-out levels FO1-FO3 (more exploration)]

* Summary
Associative Browsing Model Evaluation by Simulation
• Simulated evaluation showed very similar statistics to the user study in when, and how successfully, associative browsing is used
• Simulated evaluation reveals a subtle interaction between the level of knowledge and the degree of exploration

Any Questions?
Jin Y. Kim / W. Bruce Croft / David Smith
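To make the simulated session flow on the preceding slides concrete, here is a minimal Python sketch of the probabilistic user model (query generation, state transition, and link selection). The rank thresholds (target in the top 10 ends the session, ranks 11-50 trigger browsing, higher ranks trigger reformulation) and the random / informed / oracle click strategies follow the slides; the function names `search`, `browse`, and `generate_query`, the default fan-out, and the weighting used for the informed user are illustrative assumptions, not the authors' implementation.

```python
"""Sketch of one simulated known-item finding session (assumptions noted above)."""
import random

TOP_K = 10             # target found if it appears in the top 10 results
MARGINAL_CUTOFF = 50   # ranks 11..50 count as marginally relevant
MAX_STEPS = 10         # give up after this many query reformulations


def select_links(suggestions, target, knowledge="informed", fan_out=2):
    """Link selection model: which browsing suggestions the simulated user clicks.

    random   -- click uniformly at random
    informed -- more likely to click items nearer the target's position
    oracle   -- always click the most relevant (nearest-to-target) items
    """
    if not suggestions:
        return []
    k = min(fan_out, len(suggestions))
    if knowledge == "random":
        return random.sample(suggestions, k)
    # Perceived relevance is approximated by distance from the target's rank.
    pos = suggestions.index(target) if target in suggestions else len(suggestions)
    by_relevance = sorted(range(len(suggestions)), key=lambda i: abs(i - pos))
    if knowledge == "oracle":
        return [suggestions[i] for i in by_relevance[:k]]
    # Informed user: sample proportionally to closeness to the target's position.
    weights = [1.0 / (1 + abs(i - pos)) for i in range(len(suggestions))]
    return random.choices(suggestions, weights=weights, k=k)


def simulate_session(search, browse, generate_query, target,
                     knowledge="informed", fan_out=2):
    """State transition model: run one session, return True if the target is found."""
    for _ in range(MAX_STEPS):
        query = generate_query(target)           # query generation model
        results = search(query)                  # ranked list from the system
        rank = results.index(target) + 1 if target in results else float("inf")
        if rank <= TOP_K:
            return True                          # target in top 10: session ends
        if rank <= MARGINAL_CUTOFF:
            # Marginally relevant: click a top result and browse from it.
            suggestions = browse(results[0])     # associative browsing links
            if target in select_links(suggestions, target, knowledge, fan_out):
                return True
        # Not relevant (rank > 50): fall through and reformulate the query.
    return False
```

Running this loop over many target documents yields the kind of simulated interaction log summarized in the evaluation tables above.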
* Simulation of Known-item Finding using a Memory Model
[Figure: the user's memory as a graph of terms (t1-t5) and their associations]
• Build a model of the user's memory
  • Model how the memory degrades over time
• Generate search and browsing behavior from the model
  • Query-term selection from the memory model
  • Use information scent to guide browsing choices [Pirolli, Fu, Chi]
• Update the memory model during the interaction
  • New terms and associations are learned

* OPTIONAL SLIDES

* Evaluation Results
• Lengths of Successful Sessions
[Charts: average length of successful sessions (0-2.5) for random, informed, and oracle user models,
comparing FO1, FO2, and FO3 under BFS (left) and DFS (right) browsing orders]

* Summary of Previous Evaluation
• User study by the DocTrack Game [Kim & Croft 11]
  • Collect public documents in the UMass CS department
  • Build a web interface by which participants can find documents
  • Department members were asked to join and compete
• Limitations
  • Fixed collection, with a small set of target tasks
  • Hard to evaluate with varying system parameters
• Simulated evaluation as a solution
  • Build a model of user behavior
  • Generate simulated interaction logs
  • If search accuracy improves by X%, how will it affect user behavior?
  • How would its effectiveness vary for diverse groups of users?

* Building the Associative Browsing Model
1. Document Collection
2. Concept Extraction
3. Link Extraction
4. Link Refinement
• Link features: term similarity, temporal similarity, co-occurrence

* DocTrack Game
[Screenshot: the DocTrack Game interface showing a target item and a "Find It!" prompt]

* Community Efforts based on the Datasets
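As a companion to the "Building the Associative Browsing Model" slide, the sketch below shows one plausible way to score and refine links from the three named features (term similarity, temporal similarity, co-occurrence). The cosine and exponential-decay formulas, the feature weights, and the document/session data layout are assumptions made for illustration; the slide itself only lists the features.

```python
"""Sketch of link extraction and refinement for the browsing model (assumptions noted above)."""
import math
from collections import Counter


def term_similarity(doc_a, doc_b):
    """Cosine similarity between term-frequency vectors (assumed measure)."""
    ta, tb = Counter(doc_a["terms"]), Counter(doc_b["terms"])
    dot = sum(ta[t] * tb[t] for t in ta.keys() & tb.keys())
    norm = (math.sqrt(sum(v * v for v in ta.values()))
            * math.sqrt(sum(v * v for v in tb.values())))
    return dot / norm if norm else 0.0


def temporal_similarity(doc_a, doc_b, scale_days=30.0):
    """Closeness in time, decaying with the gap between timestamps (seconds)."""
    gap_days = abs(doc_a["time"] - doc_b["time"]) / 86400.0
    return math.exp(-gap_days / scale_days)


def cooccurrence(doc_a, doc_b, session_log):
    """Fraction of logged sessions in which both documents were accessed."""
    both = sum(1 for s in session_log if doc_a["id"] in s and doc_b["id"] in s)
    return both / len(session_log) if session_log else 0.0


def link_score(doc_a, doc_b, session_log, weights=(0.5, 0.3, 0.2)):
    """Combined score used to rank browsing suggestions (weights are assumed)."""
    w_term, w_time, w_cooc = weights
    return (w_term * term_similarity(doc_a, doc_b)
            + w_time * temporal_similarity(doc_a, doc_b)
            + w_cooc * cooccurrence(doc_a, doc_b, session_log))


def extract_links(docs, session_log, top_n=5):
    """Link refinement: keep only the top-n highest-scoring links per document."""
    links = {}
    for a in docs:
        scored = sorted((b for b in docs if b["id"] != a["id"]),
                        key=lambda b: link_score(a, b, session_log),
                        reverse=True)
        links[a["id"]] = [b["id"] for b in scored[:top_n]]
    return links
```

The ranked lists produced by `extract_links` are the browsing suggestions that the simulated (or real) user clicks through in the evaluation described earlier.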