Similarity & Recommendation Arjen P. de Vries CWI Scientific Meeting September 27th 2013.




Recommendation
Informally:
– Search for information “without a query”
Three types:
– Content-based recommendation
– Collaborative filtering (CF)
    Memory-based
    Model-based
– Hybrid approaches
Today’s focus!

Collaborative Filtering
Collaborative filtering (originally introduced by Pattie Maes as “social information filtering”)
1. Compare user judgments
2. Recommend differences between similar users
Leading principle: people’s tastes are not randomly distributed
– A.k.a. “You are what you buy”

Collaborative Filtering
Benefits over content-based approach
– Overcomes problems with finding suitable features to represent e.g. art, music
– Serendipity
– Implicit mechanism for qualitative aspects like style
Problems: large groups, broad domains

Context
Recommender systems
– Users interact (rate, purchase, click) with items


Context
Nearest-neighbour recommendation methods
– The item prediction is based on “similar” users


Similarity

[Figure: a score s(·, ·) derived from sim(·, ·) and s(·, ·); the arguments were icons on the slide]
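The nearest-neighbour idea on the slides above can be sketched in a few lines. This is a minimal illustration and not the method from the talk: it assumes ratings are stored as one dict per user, uses cosine similarity, and predicts a score as the similarity-weighted average over the k most similar users who rated the item. The names `cosine`, `predict`, and the toy `ratings` data are hypothetical.

```python
import math

def cosine(a, b):
    """Cosine similarity of two rating profiles given as dicts item -> rating."""
    common = set(a) & set(b)
    dot = sum(a[i] * b[i] for i in common)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def predict(ratings, user, item, k=2, sim=cosine):
    """Similarity-weighted average rating of the k most similar users who rated `item`.

    Assumption for this sketch: only neighbours with positive similarity count.
    Returns None when no such neighbour exists.
    """
    candidates = []
    for other, profile in ratings.items():
        if other == user or item not in profile:
            continue
        s = sim(ratings[user], profile)
        if s > 0:
            candidates.append((s, profile[item]))
    neighbours = sorted(candidates, reverse=True)[:k]
    total = sum(s for s, _ in neighbours)
    if total <= 0:
        return None
    return sum(s * r for s, r in neighbours) / total

# Hypothetical toy data: three users, three items.
ratings = {
    "ann": {"a": 5, "b": 3},
    "bob": {"a": 5, "b": 3, "c": 4},
    "cat": {"a": 1, "b": 5, "c": 2},
}
score = predict(ratings, "ann", "c")
```

In this toy data "ann" agrees more with "bob" than with "cat", so the predicted score for item "c" is pulled towards bob's rating of 4.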

Research Question How does the choice of similarity measure determine the quality of the recommendations?

Sparseness
Too many items exist, so many ratings will be missing
A user’s neighbourhood is likely to extend to include “not-so-similar” users and/or items

“Best” similarity?
Consider cosine similarity vs. Pearson similarity
Most existing studies report that Pearson correlation leads to superior recommendation accuracy
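To make the comparison concrete, the sketch below computes both measures for two users over the same items. It relies on the standard identity that Pearson correlation is cosine similarity applied to mean-centred vectors; the function names and the example ratings are illustrative assumptions.

```python
import math

def cosine(x, y):
    """Cosine similarity of two equal-length rating vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y) if norm_x and norm_y else 0.0

def pearson(x, y):
    """Pearson correlation: cosine similarity after subtracting each vector's mean."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return cosine([a - mean_x for a in x], [b - mean_y for b in y])

# Hypothetical ratings: both users rate everything highly,
# but they disagree on which items are the better ones.
u = [5, 4, 5, 4]
v = [4, 5, 4, 5]
cos_uv = cosine(u, v)
pear_uv = pearson(u, v)
```

Here cosine similarity is close to 1 because both users rate every item highly, whereas Pearson correlation is -1 because their preferences are exactly opposed once each user's mean is removed. This is one reason the two measures can select very different neighbours.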

“Best” similarity?
Common variations to deal with sparse observations:
– Item selection: compare full profiles, or only on overlap
– Imputation: impute default value for unrated items
– Filtering: threshold on minimal similarity value
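The three variations listed above can be expressed as switches on a single similarity function. This is a sketch under stated assumptions, not the configuration used in the study: profiles are dicts, the base measure is cosine, and the parameter names `overlap_only`, `default`, and `min_sim` are invented for illustration.

```python
import math

def profile_similarity(a, b, overlap_only=True, default=0.0, min_sim=None):
    """Cosine similarity between two rating dicts with three optional variations.

    overlap_only: compare only co-rated items (True) or the full profiles (False)
    default:      value imputed for unrated items when comparing full profiles
    min_sim:      if set, similarities below this threshold are filtered (None)
    """
    items = set(a) & set(b) if overlap_only else set(a) | set(b)
    x = [a.get(i, default) for i in items]
    y = [b.get(i, default) for i in items]
    dot = sum(p * q for p, q in zip(x, y))
    norm_x = math.sqrt(sum(p * p for p in x))
    norm_y = math.sqrt(sum(q * q for q in y))
    sim = dot / (norm_x * norm_y) if norm_x and norm_y else 0.0
    if min_sim is not None and sim < min_sim:
        return None
    return sim

# Hypothetical profiles that share a single co-rated item.
a = {"a": 5, "b": 4}
b = {"a": 5, "c": 1}
on_overlap = profile_similarity(a, b, overlap_only=True)
full_zero = profile_similarity(a, b, overlap_only=False, default=0.0)
full_imputed = profile_similarity(a, b, overlap_only=False, default=3.0)
filtered = profile_similarity(a, b, overlap_only=False, min_sim=0.8)
```

With a single co-rated item the overlap-only similarity is a perfect 1.0, while the full-profile values are lower and depend on the imputed default. The same pair of users can therefore fall inside or outside a neighbourhood depending on these choices.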

“Best” similarity?
Cosine superior (!), but not for all settings
– No consistent results

Analysis

Distance Distribution
In high dimensions, nearest neighbour is unstable if the distance from the query point to most data points is less than (1 + ε) times the distance from the query point to its nearest neighbour
Beyer et al. When is “nearest neighbour” meaningful? ICDT 1999
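The instability condition can be observed with a small simulation. The sketch below is an assumption-laden illustration (uniformly random points, Euclidean distance, ε = 0.5, a fixed seed), not an experiment from the talk; `unstable_fraction` is a hypothetical helper that returns the fraction of data points lying within (1 + ε) times the nearest-neighbour distance of a random query point.

```python
import math
import random

def unstable_fraction(dim, n_points=200, eps=0.5, seed=0):
    """Fraction of points closer than (1 + eps) times the nearest-neighbour distance."""
    rng = random.Random(seed)
    query = [rng.random() for _ in range(dim)]
    dists = []
    for _ in range(n_points):
        point = [rng.random() for _ in range(dim)]
        # Euclidean distance between the query and this data point.
        dists.append(math.sqrt(sum((q - p) ** 2 for q, p in zip(query, point))))
    nearest = min(dists)
    within = sum(1 for d in dists if d < (1 + eps) * nearest)
    return within / n_points

low = unstable_fraction(dim=2)     # low-dimensional data
high = unstable_fraction(dim=500)  # high-dimensional data
```

In two dimensions only a handful of points fall inside the enlarged radius, whereas in 500 dimensions distances concentrate so strongly that nearly every point does, which is the situation the slide describes as unstable.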

Distance Distribution Beyer et al. When is “nearest neighbour” meaningful? ICDT 1999

Distance Distribution
Quality q(n, f): fraction of users for which the similarity function has ranked at least n percent of the user community within a factor f of the nearest neighbour’s similarity value (well... its corresponding distance)
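One possible reading of q(n, f) in code, assuming the similarities have already been converted to a symmetric user-by-user distance matrix (the conversion itself is not specified on the slide). The function name `quality` and the one-dimensional toy positions are hypothetical.

```python
def quality(dist, n, f):
    """Fraction of users with at least n percent of the other users
    within a factor f of their nearest neighbour's distance."""
    users = len(dist)
    hits = 0
    for u in range(users):
        others = [dist[u][v] for v in range(users) if v != u]
        nearest = min(others)
        close = sum(1 for d in others if d <= f * nearest)
        if 100.0 * close / len(others) >= n:
            hits += 1
    return hits / users

# Hypothetical users placed on a line; the distance is the absolute difference.
positions = [0.0, 1.0, 2.0, 10.0]
dist = [[abs(p - q) for q in positions] for p in positions]

q_half = quality(dist, n=50, f=2)     # every user has >= 50% of the others within 2x
q_all = quality(dist, n=100, f=2)     # only the outlier at 10.0 has everyone within 2x
q_tight = quality(dist, n=100, f=1.2)
```

A high q(n, f) for large n and small f means that, for many users, most of the community is almost as close as the nearest neighbour, so the measure discriminates poorly.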

Distance Distribution

NN_k Graph
Graph associated with the top k nearest neighbours
Analysis focusing on the binary relation of whether a user does or does not belong to a neighbourhood
– Ignore similarity values (already included in the distance distribution analysis)
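A minimal sketch of building such a graph from a similarity matrix: each user gets a directed edge to each of its k most similar users, and the similarity values are then discarded. The in-degree count is one illustrative graph feature chosen for this sketch, not necessarily one analysed in the talk; all names and the matrix are assumptions.

```python
def nn_k_graph(sim, k):
    """Map each user index to the set of its k most similar other users."""
    graph = {}
    for u, row in enumerate(sim):
        ranked = sorted(
            (v for v in range(len(row)) if v != u),
            key=lambda v: row[v],
            reverse=True,
        )
        graph[u] = set(ranked[:k])
    return graph

def in_degrees(graph):
    """Number of neighbourhoods each user appears in."""
    counts = {u: 0 for u in graph}
    for neighbours in graph.values():
        for v in neighbours:
            counts[v] += 1
    return counts

# Hypothetical symmetric similarity matrix for four users.
sim = [
    [1.0, 0.9, 0.2, 0.1],
    [0.9, 1.0, 0.3, 0.2],
    [0.2, 0.3, 1.0, 0.4],
    [0.1, 0.2, 0.4, 1.0],
]
graph_1 = nn_k_graph(sim, k=1)
degrees_2 = in_degrees(nn_k_graph(sim, k=2))
```

Because only membership is kept, two similarity measures that produce different values but the same neighbour sets yield the same graph.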

NN_k Graph

MRR vs. Features
Quality:
– If most of the user population is far away, high similarity correlates with effectiveness
– If most of the user population is close, high similarity correlates with ineffectiveness
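The slides do not spell out MRR; assuming it denotes mean reciprocal rank, the usual ranking-effectiveness measure, it can be computed as below. The function name, the data layout, and the example rankings are hypothetical.

```python
def mean_reciprocal_rank(rankings, relevant):
    """Average over users of 1 / rank of the first relevant recommended item.

    rankings: dict user -> list of recommended items, best first
    relevant: dict user -> set of items the user actually liked
    A user with no relevant item in the list contributes 0.
    """
    if not rankings:
        return 0.0
    total = 0.0
    for user, ranked_items in rankings.items():
        liked = relevant.get(user, set())
        for rank, item in enumerate(ranked_items, start=1):
            if item in liked:
                total += 1.0 / rank
                break
    return total / len(rankings)

# Hypothetical recommendation lists for three users.
rankings = {
    "ann": ["a", "b", "c"],
    "bob": ["b", "a", "c"],
    "cat": ["c", "b", "a"],
}
relevant = {"ann": {"b"}, "bob": {"b"}, "cat": {"x"}}
mrr = mean_reciprocal_rank(rankings, relevant)
```

The reciprocal ranks here are 1/2, 1, and 0, giving an MRR of 0.5.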

MRR vs. Features

Conclusions (so far)
“Similarity features” correlate with recommendation effectiveness
– “Stability” of a metric (as defined in database literature on k-NN search in high dimensions) is related to its ability to discriminate between good and bad neighbours

Future Work
How can this knowledge be exploited to improve recommendation systems?

News Recommendation Challenge

Thanks
Alejandro Bellogín, ERCIM fellow in the Information Access group
Details: Bellogín and De Vries, ICTIR 2013.