How Many Bits Per Rating Daniel Kluver1 , Tien T. Nguyen1 , Michael Ekstrand1 , Shilad Sen2 , John Riedl1 1 GroupLens Research Dept. of Computer Science and Engineering University of Minnesota 2 Math, Stats, and Computer Science Dept. Macalester College September 11, 2012 Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 1 / 49 ? Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 2 / 49 Math Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 3 / 49 Math Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 4 / 49 Math Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 5 / 49 Math Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 6 / 49 Math Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 7 / 49 Better Math Item-Item, User-User, Matrix Factorization, Feature Weighted Linear Stacking Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 8 / 49 Math Predict, Recommend, Explain Predictions, Diversify Recomendations Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 9 / 49 Math Normalization, Re-rating based Denosing Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 10 / 49 Math User training, Surgery, Fraud detection, Intercrainial Preference Elicitation Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 11 / 49 Math Problem - ? Better - ? Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 12 / 49 Better Math Better - Prediction Accuracy Issue - Magic Barrier Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 13 / 49 Math Better - ? Problem - Users rate inconsistantly Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 14 / 49 Goal: Choose the best user interface. Define Best Looks good Makes users happy Ratings are fast Most information about user preferences Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 15 / 49 Goal: Choose the best user interface. Define Best Looks good Makes users happy Ratings are fast Most information about user preferences Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 15 / 49 Words to know Preference bits A measure of information about user preferences. Bits per rating Measures how much preference information is contained in ratings. Bits per second Measures the efficiency of an interface at capturing preference bits. Bits per prediction Measures how much preference information is contained in predictions. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 16 / 49 Goal: Measure information about user preferences. Define Preference Define Information Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 17 / 49 Goal: Measure information about user preferences. Define Preference The tendency to consistently behave as if you placed value (positive or negative) on something. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 18 / 49 Preferences n. - The tendency to consistently behave as if you placed value on something. πScience Figure: My value for science is rather large Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 19 / 49 Articulated Values Deterministic πBatman Kluver, Nguyen, Ekstrand, Sen, Riedl () RBatman How Many Bits Per Rating September 11, 2012 20 / 49 Basic Values πBat Memories ... πMan Brain Math RBatman πBombs Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 21 / 49 Partially Articulated Values Not Deterministic πBatman RBatman All noise in rating comes from completely hidden context. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 22 / 49 Goal: Measure information about user preferences. Define Information Thanks to Claude Shannon, Information is a solved problem. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 23 / 49 Mutual Information I (X ; Y ) = P(x,y ) P(x, y )log ( y P(x)P(y ) ) P P x Measurement of how much knowing X increases our certainty about Y on average. Normally given in bits Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 24 / 49 Bringing it all together We can use mutual information to measure how much information anything tells us about user preferences. We call this measurement Preference Bits. If something has a lot of preference bits then it is good at explaining user preference. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 25 / 49 Words to know Preference bits A measure of information about user preferences. Bits per rating Measures how much preference information is contained in ratings. Bits per second Measures the efficiency of an interface at capturing preference bits. Bits per prediction Measures how much preference information is contained in predictions. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 26 / 49 Goal: Measure information entering the recommender. Ratings(Ru,i ) enter the recommender. Ratings measure user preferences (πu,i ). Therefore we want to measure this: Measure This πu,i Kluver, Nguyen, Ekstrand, Sen, Riedl I(π,R) () How Many Bits Per Rating Ru,i September 11, 2012 27 / 49 Goal: Measure information entering the recommender. Observable πu,i Ru,i Not Observable Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 28 / 49 Measuring Input Preference Bits π R1 R2 Prior work solves this problem with two ratings. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 29 / 49 Information Processing Inequality X Y Z X and Z are conditionally independent given Y When this is true, I (X ; Z ) ≤ I (X ; Y ) Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 30 / 49 Measuring Input Preference Bits π R1 R2 For two conditionally independent re-ratings I (R1 ; R2 ) ≤ I (R; π) We will use this to measure input preference bits. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 31 / 49 Measuring Input Preference Bits 2 Big assumptions: R1 conditionally independent with R2 given π R1 and R2 are generated by the same π Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 32 / 49 Picking User Interfaces Split users between rating interfaces Have users rate a bunch of movies Some time later, have the users rate the same items Compare preference bits between conditions Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 33 / 49 Picking User Interfaces Split users between rating interfaces Have users rate a bunch of movies Some time later, have the users rate the same items Compare preference bits between conditions We haven’t run this (yet) No one else has either Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 33 / 49 Dataset Analysis From Cosley et. al. Seeing is believing. 2-point 6-point 10-point 0.423 Kluver, Nguyen, Ekstrand, Sen, Riedl () 0.825 How Many Bits Per Rating 0.813 September 11, 2012 34 / 49 Preference bits per rating Effect of Rating Scale on Input Preference Bits 1.8 1.6 1.4 1.2 1.0 0.8 0.6 10 20 30 40 mR 50 60 70 More rating choices, more bits Information hits a limit I I More noise less bits More preference options more bits Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 35 / 49 Words to know Preference bits A measure of information about user preferences. Bits per rating Measures how much preference information is contained in ratings. Bits per second Measures the efficiency of an interface at capturing preference bits. Bits per prediction Measures how much preference information is contained in predictions. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 36 / 49 A Problem More rating options ⇒ more information More rating options ⇒ more cognitive load More rating options ⇒ slower ratings 1 slower ratings ⇒ less ratings less ratings ⇒ less information. More rating options ⇒ less and more information? 1 Sparling et. al. Rating: How difficult is it? Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 37 / 49 Slow Rating High Information Fast Rating Low Information Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 38 / 49 Solution: Bits Per Second Measuring bits per rating is easy. Measuring ratings per second is also easy. It turns out measure bits per second is also easy. # Bits # Ratings # Bits × = 1 Rating 1 second 1 second Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 39 / 49 Bits per second: Does it matter? Using Sparling et. al. Rating: how difficult is it? and Cosley et. al. Seeing is Believing we can estimate. 2-point = 0.1082 10-point = 0.1878 Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 40 / 49 Words to know Preference bits A measure of information about user preferences. Bits per rating Measures how much preference information is contained in ratings. Bits per second Measures the efficiency of an interface at capturing preference bits. Bits per prediction Measures how much preference information is contained in predictions. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 41 / 49 Output Preference Bits Predictions (Pu,i ) leave the recommender. Predictions predict user preferences (πu,i ) The amount of preference information leaving the recommender with I (π; P) We measure this as I (R; P) ≤ I (π; P) Yes, its just a fancy accuracy measure, But it handles varying scales well Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 42 / 49 Suggested use: Choosing how many stars to use. From Jester dataset (Goldberg et. al.) Out Preference Bit 0.25 ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0.20 0.15 ● ● ● ● ● ● ● 0.10 ● 0.05 0.00 ● 2−point 5−point 10−point 20−point 100−point Output Scale Input Scale ● 2−point ● 5−point ● 10−point ● 20−point ● 100−point More prediction options, more bits Information hits a limit (again) I I input scale controls limit most bits at 10 point scale Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 43 / 49 Preference Bits Measure with: Mutual information Math A measure of information about user preferences. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 44 / 49 Bits Per Rating Mutual information between ratings and preferences I(π;R) Measure with: I(R1;R2) Math Measures how much preference information is contained in ratings. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 45 / 49 Bits Per Second Measure with: Bits per rating times Ratings per second Math Measures the efficiency of an interface at capturing preference bits. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 46 / 49 Bits Per Prediction Mutual information between prediction and preferences I(π;P) Measure with: I(R;P) Math Measures how much preference information is contained in Predictions. Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 47 / 49 The Next Steps How many stars should we use? What information helps users the most? What are the difference the preference bits of different domains? Does preference bits hold any relationship with user satisfaction? Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 48 / 49 Thank you Kluver, Nguyen, Ekstrand, Sen, Riedl () How Many Bits Per Rating September 11, 2012 49 / 49