MovieMiner: A collaborative filtering system for predicting Netflix users' movie ratings
[ECS289G Data Mining]
Team Spelunker: Justin Becker, Philip Fisher-Ogden

The Problem
Given a set of entries, predict the rating values for unknown entries.
Example:
– X-Men, Philip, 5
– Spiderman 3, Philip, 4
– X-Men, Justin, 4
– Spiderman 3, Justin, ?
What rating do you predict Justin would give Spiderman 3?

Our Approach - Motivation
Motivating factors:
– Review the current approaches taken by the Netflix Prize top leaders
– Leverage and extend existing libraries to minimize the ramp-up time required to implement a working system
– Use the UC Davis elvis cluster to alleviate any scale problems

What - Our Approach
Collaborative Filtering (CF)
– Weighted average of predictions from the following recommenders:
   Slope One recommender
   Item-based recommender
   User-based recommender

What - Our Approach
Leveraging three CF recommenders
– Similarities: each uses prior preference information to predict values for unrated entries
– Differences:
   How is the similarity between two entries computed?
   How are the neighbors selected?
   How are the interpolation weights determined?

Why - Our Approach
Why Collaborative Filtering?
– "Those who agreed in the past tend to agree again in the future"
– Requires no external data sources
– Uses k-nearest-neighbor approaches to predict the class (rating) of an unknown entry
– A full-featured CF Java library, Taste, already exists
– CF is one of the two main approaches used by the Netflix Prize top leaders (the other being SVD)

How – Slope One Recommender
– Introduced by Daniel Lemire and Anna Maclachlan
– Simple and accurate predictor
– Based on the average difference between the ratings of two items
– Per-item predictions are combined with a weighted average to produce better results
– Weight = number of users who rated both items

Ex: Slope One Recommender
Average difference between X-Men and Spiderman 3 is 1. Justin's rating for Spiderman 3 is then 4+1=5

          X-Men  Spiderman 3  Batman Begins  Nacho Libre
Justin      4         ?             5             4
Philip      5         3             4             2
Dan         4         4             5             5
Ian         3         4             3             3
Michael     2         3             1             5
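The weighted Slope One scheme described above can be sketched in a few lines of Python. This is only an illustration, not the team's Taste-based code: the function name `slope_one_predict` and the dictionary layout are assumptions made for the example. It evaluates all three rated items on the ratings table as transcribed, where the pairwise average differences work out to 0, 0.25 and -0.25, so it yields about 4.33 and should not be read as reproducing the slide's simplified single-pair illustration.

```python
# Ratings table from the example slide; None marks the rating to predict.
RATINGS = {
    "Justin":  {"X-Men": 4, "Spiderman 3": None, "Batman Begins": 5, "Nacho Libre": 4},
    "Philip":  {"X-Men": 5, "Spiderman 3": 3, "Batman Begins": 4, "Nacho Libre": 2},
    "Dan":     {"X-Men": 4, "Spiderman 3": 4, "Batman Begins": 5, "Nacho Libre": 5},
    "Ian":     {"X-Men": 3, "Spiderman 3": 4, "Batman Begins": 3, "Nacho Libre": 3},
    "Michael": {"X-Men": 2, "Spiderman 3": 3, "Batman Begins": 1, "Nacho Libre": 5},
}


def slope_one_predict(ratings, user, target):
    """Weighted Slope One: hypothetical helper, not part of any library."""
    numerator = 0.0
    denominator = 0
    for item, user_rating in ratings[user].items():
        if item == target or user_rating is None:
            continue
        # Differences (target - item) over the other users who rated both items.
        diffs = [
            row[target] - row[item]
            for name, row in ratings.items()
            if name != user and row[target] is not None and row[item] is not None
        ]
        if not diffs:
            continue
        average_diff = sum(diffs) / len(diffs)
        # Each per-item prediction is weighted by the number of co-raters.
        numerator += (user_rating + average_diff) * len(diffs)
        denominator += len(diffs)
    return numerator / denominator if denominator else None


prediction = slope_one_predict(RATINGS, "Justin", "Spiderman 3")
print(round(prediction, 2))
```

Weighting by the co-rater count means item pairs backed by more users contribute more to the final value; in this table every pair has four co-raters, so the weights are equal.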

How – User-based Recommender
Predicts a user u's rating for an item i:
– Find the k nearest neighbors to the user u
   Similarity measure = Pearson correlation
   Missing preferences are inferred by using the user's average rating
– Interpolate between those in-common neighbors' ratings for item i
   Interpolation weights = Pearson correlation
   Neighbors are ignored if they did not rate i

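Ex: User-based Recommender
[Table: ratings of X-Men, Spiderman 3, Batman Begins and Nacho Libre by Justin, Philip, Dan, Ian and Michael, with each user's average (avg); Justin's row starts X-Men = 4, Spiderman 3 = ?]
[Table: centered data (user average) for the same users and movies, with each row's Euclidean norm (Eucl Norm); Justin's Spiderman 3 entry is ?]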

Ex: User-based Recommender
– Similarities are calculated using the Pearson correlation coefficient (on centered data)
– Interpolation between nearest neighbors produces the prediction
User-user similarities: Justin-Phil, Justin-Dan, Justin-Ian (a value on the order of E-16), Justin-Michael
Prediction using 2-nearest neighbors (Philip, Dan): round(prediction) = 4
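The user-based steps can be sketched as follows, assuming the same ratings table used in the other example slides. The helper names (`center_by_user`, `pearson`, `user_based_predict`) are hypothetical and the code is not taken from Taste. Each user's ratings are centered on that user's average, a missing rating is replaced by the average (so it centers to 0), similarity is the normalized dot product of the centered rows, and the prediction is the similarity-weighted average of the neighbors' ratings.

```python
import math

# Ratings table from the example slides; None marks the rating to predict.
RATINGS = {
    "Justin":  {"X-Men": 4, "Spiderman 3": None, "Batman Begins": 5, "Nacho Libre": 4},
    "Philip":  {"X-Men": 5, "Spiderman 3": 3, "Batman Begins": 4, "Nacho Libre": 2},
    "Dan":     {"X-Men": 4, "Spiderman 3": 4, "Batman Begins": 5, "Nacho Libre": 5},
    "Ian":     {"X-Men": 3, "Spiderman 3": 4, "Batman Begins": 3, "Nacho Libre": 3},
    "Michael": {"X-Men": 2, "Spiderman 3": 3, "Batman Begins": 1, "Nacho Libre": 5},
}


def center_by_user(row):
    """Subtract the user's average; a missing rating becomes 0 after centering."""
    known = [r for r in row.values() if r is not None]
    average = sum(known) / len(known)
    return {movie: (0.0 if r is None else r - average) for movie, r in row.items()}


def pearson(a, b):
    """Correlation of two already-centered rows (dot product over Euclidean norms)."""
    dot = sum(a[m] * b[m] for m in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def user_based_predict(ratings, user, target, k=2):
    centered = {name: center_by_user(row) for name, row in ratings.items()}
    sims = [(pearson(centered[user], centered[v]), v) for v in ratings if v != user]
    # Take the k most similar users, then ignore any who did not rate the target.
    neighbors = [(s, v) for s, v in sorted(sims, reverse=True)[:k]
                 if ratings[v][target] is not None]
    total = sum(abs(s) for s, _ in neighbors)
    if not total:
        return None, neighbors
    prediction = sum(s * ratings[v][target] for s, v in neighbors) / total
    return prediction, neighbors


prediction, neighbors = user_based_predict(RATINGS, "Justin", "Spiderman 3")
print([name for _, name in neighbors], round(prediction))
```

On this table the two nearest neighbors are Dan and Philip and the rounded prediction is 4, in line with the slide; Justin's similarity to Ian is zero up to floating-point error.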

How – Item-based Recommender
Predicts a user u's rating for an item i:
– Find the k most similar items to i
   Similarity measure = Pearson correlation
– Keep only similar items also rated by u
– Interpolate between the remaining items' ratings
   Interpolation weights = Pearson correlation
– Note: item-item similarities allow for more efficient computations, as cnt(items) << cnt(users) and, thus, the similarity matrix can be pre-computed and leveraged as needed.

Ex: Item-based Recommender

          X-Men  Spiderman 3  Batman Begins  Nacho Libre
Justin      4         ?             5             4
Philip      5         3             4             2
Dan         4         4             5             5
Ian         3         4             3             3
Michael     2         3             1             5

[The table ends with an avg row of per-item averages.]
[Table: centered data (item average) for the same users and movies, ending with a Eucl Norm row; Justin's row starts X-Men = 0.4, Spiderman 3 = ?]

Ex: Item-based Recommender
– Similarities are calculated using the Pearson correlation coefficient (on centered data)
– Interpolation between nearest neighbors produces the prediction
Item-item similarities: S-Xm = 0, S-BB, S-NL
Prediction from 2-nearest neighbors (BB, NL): round(prediction) = 5
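A comparable sketch for the item-based variant, again on the ratings table from the example slide and with hypothetical helper names (`item_vectors`, `pearson`, `item_based_predict`) rather than Taste's API. Ratings are centered on each item's average, a missing rating centers to 0, and the prediction interpolates the user's own ratings of the most similar items.

```python
import math

# Ratings table from the example slides; None marks the rating to predict.
RATINGS = {
    "Justin":  {"X-Men": 4, "Spiderman 3": None, "Batman Begins": 5, "Nacho Libre": 4},
    "Philip":  {"X-Men": 5, "Spiderman 3": 3, "Batman Begins": 4, "Nacho Libre": 2},
    "Dan":     {"X-Men": 4, "Spiderman 3": 4, "Batman Begins": 5, "Nacho Libre": 5},
    "Ian":     {"X-Men": 3, "Spiderman 3": 4, "Batman Begins": 3, "Nacho Libre": 3},
    "Michael": {"X-Men": 2, "Spiderman 3": 3, "Batman Begins": 1, "Nacho Libre": 5},
}


def item_vectors(ratings):
    """One centered vector per item (item average subtracted, missing -> 0)."""
    users = list(ratings)
    items = list(next(iter(ratings.values())))
    vectors = {}
    for item in items:
        known = [ratings[u][item] for u in users if ratings[u][item] is not None]
        average = sum(known) / len(known)
        vectors[item] = [
            0.0 if ratings[u][item] is None else ratings[u][item] - average
            for u in users
        ]
    return vectors


def pearson(a, b):
    """Correlation of two already-centered vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def item_based_predict(ratings, user, target, k=2):
    vectors = item_vectors(ratings)
    sims = [(pearson(vectors[target], vectors[i]), i) for i in vectors if i != target]
    # Take the k most similar items, then keep only those the user has rated.
    neighbors = [(s, i) for s, i in sorted(sims, reverse=True)[:k]
                 if ratings[user][i] is not None]
    total = sum(abs(s) for s, _ in neighbors)
    if not total:
        return None, neighbors
    prediction = sum(s * ratings[user][i] for s, i in neighbors) / total
    return prediction, neighbors


prediction, neighbors = item_based_predict(RATINGS, "Justin", "Spiderman 3")
print([item for _, item in neighbors], round(prediction))
```

On this table the two most similar items to Spiderman 3 are Batman Begins and Nacho Libre, and the rounded prediction is 5, in line with the slide. Because the item vectors depend only on the ratings matrix, the similarities could be computed once and cached, which is the efficiency point made in the item-based slide.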

Initial Results
Bottom line: correct=91,934, loss=319,710
Parameters used: 40% user, 60% item, 20 nearest neighbors
– ~97% scored with composite recommender (user, item)
– ~3% scored with random recommender
RMSE

Final Results
Bottom line: correct=106,253, loss=236,523
Parameters used: 25% user, 5% item, 70% slope one, 20 nearest neighbors
– ~97% scored with composite recommender (user, item, slope one)
– ~3% scored with weighted average
RMSE:
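The composite recommender is described as a weighted average of the individual predictions. A minimal sketch of such a blend, using the final weights quoted above, might look like this. The function name `blend`, the dictionary keys, and the choice to renormalize the weights when a recommender returns no prediction are all assumptions for illustration; the slides do not say how the team handled that case.

```python
# Final weights from the slide: 25% user, 5% item, 70% slope one.
FINAL_WEIGHTS = {"user": 0.25, "item": 0.05, "slope_one": 0.70}


def blend(predictions, weights=FINAL_WEIGHTS):
    """Weighted average of per-recommender predictions (hypothetical helper).

    Recommenders that returned None are skipped and the remaining weights
    are renormalized; this fallback is an assumption, not the slides' method.
    """
    available = {name: p for name, p in predictions.items() if p is not None}
    total_weight = sum(weights[name] for name in available)
    if not total_weight:
        return None
    return sum(weights[name] * p for name, p in available.items()) / total_weight


# Illustrative inputs only; these are not results reported in the slides.
print(blend({"user": 3.0, "item": 5.0, "slope_one": 4.0}))
```

With these weights the Slope One prediction dominates the blend, which is consistent with the shift from the initial 40/60 user/item split to the final parameters.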

Questions?

Conclusion