HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Presentation
Cameron Marlow, Mor Naaman, danah boyd, Marc Davis
Yahoo! Research
Hypertext 2006, Cameron Marlow - Yahoo! Research

What Are Tags?
“A tag is a keyword or descriptive term associated with an item as means of classification by means of a folksonomy. Tags are usually chosen informally and personally by the author/creator of the item - i.e. not usually as part of some formally defined classification scheme. Tags are typically used in dynamic, flexible, automatically generated internet taxonomies for online resources such as computer files, web pages, digital images, and internet bookmarks.”
Wikipedia, 2006

Del.icio.us

Flickr

Why Yahoo?
Yahoo, circa 1996

Why Yahoo?
Yahoo, circa 2004

Why Yahoo?
+

Motivation
  - Introduce tagging for academic audiences
  - Create a common language for current practitioners
  - Point to potential and possible directions of further research
Method
  - Develop a model of tagging
  - Survey existing systems and features
  - Develop taxonomy

Outline
  Tagging Model | Taxonomy | Prelim. Study | Future Work

Tagging Systems: Simple Model

Tagging - Simple Model
  - Keywords
  - Describing connected resources
  - Sounds familiar? (Automatic resource compilation by analyzing hyperlink structure and associated text, Chakrabarti et al, 1998)

What About Anchor Text?
  - Democratization and personalization
  - Extent and scale
  - User-inclusive model (not site- or page-based)
  - Notion of connected/related users
  - Intent of action (e.g., description vs. navigation or reference)
  - Richness of context

Wait! Where Are We?
  Tagging Model | Taxonomy | Prelim. Study | Future Work

Taxonomy
  - Create a common language
  - Point out differences and generative factors
  - Two taxonomies
      - Systems
      - Incentives (see paper)

Systems Taxonomy
  - Who
  - How
  - What
  - Where from
  - …
  Structure and nature of resulting tags

Tagging Rights
Who is allowed to tag a resource?
  Self-tagging | Permission-based | Open

Tagging Support
Does the system “help” in tagging?
  Blind | Suggested | Viewable

Tag Aggregation
How tags for individual resources are aggregated
  Set | Bag

Object Type
What is the type of resource being tagged?
  Textual | Non-textual

Object Source
Where the object media originates from
  User-contributed | System | Global

Where are we now?
  Tagging Model | Taxonomy | Prelim. Study | Future Work

Case Study: Flickr and Del.icio.us
  Flickr
    Rights: Permission-based
    Support: Blind
    Aggregation: Set
    Type: Non-textual
    Source: User-contributed
  Del.icio.us
    Rights: Owner
    Support: Suggested
    Aggregation: Bag
    Type: Textual
    Source: Global

Growth of tags
Number of distinct tags in 10 user collections, over time
  [Figure. x-axis: index of photo; y-axis: total number of distinct tags]

Similar to del.icio.us?
Scales are different!
  [Figure from Golder et al, 2005. x-axis: index of bookmark (0 to 5000); y-axis: total number of distinct tags (0 to 1000)]

Together
  [Figure. x-axis: index of photo; y-axis: total number of distinct tags]
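
The growth plots chart the total number of distinct tags against the index of the photo (or bookmark) in a user's collection. A minimal sketch of how such a curve could be computed is given here in Python. The function name `distinct_tag_growth`, the lower-casing of tags, and the sample collection are assumptions made for illustration; they are not the method or the data used in the study.

```python
from typing import Iterable, List, Sequence


def distinct_tag_growth(items: Iterable[Sequence[str]]) -> List[int]:
    """Return the running count of distinct tags after each item.

    `items` is one user's photos or bookmarks in upload order, each given
    as the sequence of tags attached to it. Entry i of the result is the
    size of the user's tag vocabulary after item i (0-based).
    """
    seen = set()
    growth = []
    for tags in items:
        # Lower-casing is an assumption; real systems normalise differently.
        seen.update(tag.lower() for tag in tags)
        growth.append(len(seen))
    return growth


# Hypothetical collection, not data from the study.
photos = [
    ["sunset", "beach"],
    ["beach", "family"],
    [],
    ["Sunset", "sanfrancisco"],
]
print(distinct_tag_growth(photos))  # [2, 3, 3, 4]
```

Plotting one such list per user, with the list position on the x-axis, gives curves of the kind shown on the slides. The same function applies to bookmarks, which is what makes the comparison of scales between the two systems possible.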

Case Study: Flickr and Del.icio.us
  Flickr
    Rights: Permission-based
    Support: Blind
    Aggregation: Set
    Type: Non-textual
    Source: User-contributed
  Del.icio.us
    Rights: Owner
    Support: Suggested
    Aggregation: Bag
    Type: Textual
    Source: Global

Case Study: Flickr
Nobody tags other people’s content
Why? Tags on other people’s content are:
  - Not collected: in the user’s account
  - Not identified: as coming from the tagger
  - Not prominent: in the interface, as “opinion”
  - Not aggregated: can’t “vote” on a tag/item pair

Almost done
  Tagging Model | Taxonomy | Prelim. Study | Future Work

Future Research
  - Search / IR
  - Comparison of hypertext and tags
  - Spam detection
  - Linguistics / NLP
  - Taxonomy generation
  - Sociolinguistics
  - Collaborative Filtering
  - Identify trends (locally and globally)
  - Trust metrics
  - Identify influencers
  - …

Thank You
Cameron Marlow
cameronm@yahoo-inc.com
http://research.yahoo.com
Data?
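
As a supplement to the case study, the five taxonomy dimensions can be written down as a small record, with the Flickr and Del.icio.us values copied from the slide. This is a hedged sketch in Python: `SystemProfile`, `aggregate`, and the sample (user, tag) pairs are hypothetical names and data, and the set/bag behaviour shown is one plain reading of the two aggregation models (a set keeps each tag once per resource, a bag counts repeats from different users).

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, Iterable, Tuple


@dataclass(frozen=True)
class SystemProfile:
    """One tagging system described along the five design dimensions."""
    rights: str       # who may tag a resource
    support: str      # blind, suggested, or viewable
    aggregation: str  # "set" or "bag"
    object_type: str  # textual or non-textual
    source: str       # where the tagged objects come from


# Values as listed on the case-study slide.
FLICKR = SystemProfile("permission-based", "blind", "set",
                       "non-textual", "user-contributed")
DELICIOUS = SystemProfile("owner", "suggested", "bag", "textual", "global")


def aggregate(profile: SystemProfile,
              assignments: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Combine (user, tag) pairs for a single resource.

    Under "set" each tag is recorded once; under "bag" the same tag
    applied by several users is counted each time.
    """
    counts = Counter(tag for _user, tag in assignments)
    if profile.aggregation == "set":
        return {tag: 1 for tag in counts}
    if profile.aggregation == "bag":
        return dict(counts)
    raise ValueError(f"unknown aggregation model: {profile.aggregation!r}")


# Hypothetical tag assignments on one resource.
pairs = [("alice", "python"), ("bob", "python"), ("bob", "tutorial")]
print(aggregate(FLICKR, pairs))     # {'python': 1, 'tutorial': 1}
print(aggregate(DELICIOUS, pairs))  # {'python': 2, 'tutorial': 1}
```

Only the bag model retains how many users agreed on a tag, which is the "vote" on a tag/item pair that the Flickr slide notes is missing.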