pptx - Georgia Institute of Technology

Efﬁcient Retrieval of Recommendations in a Matrix Factorization Framework Noam Koenigstein Parikshit Ram Yuval Shavitt School of Electrical Engineering Computational Science & Engineering School of Electrical Engineering Tel Aviv University Georgia Institute of Technology Tel Aviv University Motivation • In the field of Recommender System, Matrix Factorization (MF) models have shown superior accuracy for recommendation tasks. E.g., The Netflix Prize, KDD-Cup’11, etc. • Training is fast. Computing test scores is fast. But… Retrieval of Recommendations (RoR) is s--l--o--w ! • This problem is well known in the industry, yet never been approached before in academia! ITEMS 2 Yahoo! Music: 2 1M Users 625K Items 4 5 U S E R S 3 3 Naïve Multithreading: High latency + wasteful 2 .... 4 1 2 2 5 6 Tera elements ~300 multiplications ~5 days CPU Reduction to Inner Product 𝑟𝑢𝑖 = 𝐩𝑇𝑢 𝐪𝑖 = 𝐩𝑢 𝐪𝑖 cos 𝜃𝑢𝑖 𝑞𝑖 = 𝑎𝑟𝑔𝑚𝑎𝑥𝑞∈𝑆 𝐪T 𝐩𝑢 𝐩𝑢 =1 Core problem: Given a user vector 𝐩𝑢 and a set 𝑆 of item, find an item vector 𝑞𝑖 that will maximize 𝐩𝑇𝑢 𝐪𝑖 𝐩𝒖 (𝟏) 𝐪𝐢 (𝟏) 𝐩𝒖 (𝟐) 𝐪𝐢 (𝟐) 𝐩𝒖 (𝟑) 𝐪𝐢 (𝟑) 𝐩𝒖 (𝟒) 𝐪𝐢 (𝟒) . . . . . . 𝐩𝒖 (𝒏) 𝐪𝐢 (𝒏) 1 𝒃𝒊 Best Matches Algorithms • Metric Space • Cosine Similarity • Locality Sensitive Hashing Metric Trees R R Branch-and-bound Algorithm Bounding Inner Product Similarity Approximate Solution Users vectors can be normalized  Users can be clustered based on their spherical angle! 𝑟𝑢𝑖 = 𝐩𝑇𝑢 𝐪𝑖 = 𝐪𝑖 cos 𝜃𝑢𝑖 𝐩𝑢 =1 Relative Error Bound What is the error when recommendations are retrieved based on an approximate user vector? 𝑝𝑐 𝑇 𝑞𝑖 − 𝑝𝑢 𝑇 𝑞𝑖 𝑒𝑟𝑟 = 𝑜𝑝𝑡 𝑝𝑐 𝑇 𝑞𝑖 ≤1− cos 𝜃𝑝𝑐 𝑞𝑖 + Δ cos 𝜃𝑝𝑐 𝑞𝑖 Adaptive Approximate Solution Experimentations Set-up MovieLens Netflix Yahoo! Music 1,000,206 100,480,507 252,800,275 Users 6,040 480,189 1,000,990 Items 3,952 17,770 624,961 95.81% 98.82% 99.96% Ratings Sparsity Yahoo! Music Recommendations: Modeling Music Ratings with Temporal Dynamics and Item Taxonomy Gideon Dror, Noam Koenigstein, Yehuda Koren (RecSys-11`) Exact Alg. Speedup Approximate Alg. Speedup Speedup vs. Precision Speedup vs. MedianRank Conclusions • We introduce a new and relevant research problem • An exact solution with limited speedup • An approximate solution with a tradeoff between accuracy and speedup • Much room for further research…

pptx - Georgia Institute of Technology

Related documents

Products

Support

pptx - Georgia Institute of Technology

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib