Q4 : How does Netflix recommend movies?

Slides:

Advertisements

Similar presentations

Recommender System A Brief Survey.

Advertisements

CS910: Foundations of Data Analytics Graham Cormode Recommender Systems.

Item Based Collaborative Filtering Recommendation Algorithms

Lirong Xia Reinforcement Learning (2) Tue, March 21, 2014.

Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.

Least squares CS1114

Jeff Howbert Introduction to Machine Learning Winter Collaborative Filtering Nearest Neighbor Approach.

Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University Note to other teachers and users of these.

G54DMT – Data Mining Techniques and Applications Dr. Jaume Bacardit

Intro to RecSys and CCF Brian Ackerman 1. Roadmap Introduction to Recommender Systems & Collaborative Filtering Collaborative Competitive Filtering 2.

Rubi’s Motivation for CF  Find a PhD problem  Find “real life” PhD problem  Find an interesting PhD problem  Make Money!

Customizable Bayesian Collaborative Filtering Denver Dash Big Data Reading Group 11/19/2007.

Discriminant Analysis Testing latent variables as predictors of groups.

Item-based Collaborative Filtering Recommendation Algorithms

Matrix Factorization Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.

Chapter 3 Data Exploration and Dimension Reduction 1.

Performance of Recommender Algorithms on Top-N Recommendation Tasks RecSys 2010 Intelligent Database Systems Lab. School of Computer Science & Engineering.

1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,

EMIS 8381 – Spring Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

Report #1 By Team: Green Ensemble AusDM 2009 ENSEMBLE Analytical Challenge: Rules, Objectives, and Our Approach.

Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.

Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression.

Netflix Netflix is a subscription-based movie and television show rental service that offers media to subscribers: Physically by mail Over the internet.

EigenRank: A Ranking-Oriented Approach to Collaborative Filtering IDS Lab. Seminar Spring 2009 강 민 석강 민 석 May 21 st, 2009 Nathan.

Investigation of Various Factorization Methods for Large Recommender Systems G. Takács, I. Pilászy, B. Németh and D. Tikk 10th International.

Evaluation of Recommender Systems Joonseok Lee Georgia Institute of Technology 2011/04/12 1.

EigenRank: A ranking oriented approach to collaborative filtering By Nathan N. Liu and Qiang Yang Presented by Zachary 1.

Collaborative Filtering with Temporal Dynamics Yehuda Koren Yahoo Research Israel KDD’09.

Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun

ICDCS 2014 Madrid, Spain 30 June-3 July 2014

Chapter1: Introduction Chapter2: Overview of Supervised Learning

Ensemble Methods Construct a set of classifiers from the training data Predict class label of previously unseen records by aggregating predictions made.

Collaborative Filtering via Euclidean Embedding M. Khoshneshin and W. Street Proc. of ACM RecSys, pp , 2010.

Online Evolutionary Collaborative Filtering RECSYS 2010 Intelligent Database Systems Lab. School of Computer Science & Engineering Seoul National University.

Matrix Factorization & Singular Value Decomposition Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.

Item-Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl GroupLens Research Group/ Army.

CPH Dr. Charnigo Chap. 11 Notes Figure 11.2 provides a diagram which shows, at a glance, what a neural network does. Inputs X 1, X 2,.., X P are.

ItemBased Collaborative Filtering Recommendation Algorithms 1.

1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.

Estimating standard error using bootstrap

Announcements Paper presentation Project meet with me ASAP

Matrix Factorization and Collaborative Filtering

Statistics 202: Statistical Aspects of Data Mining

Mining Utility Functions based on user ratings

Collaborative Filtering for Streaming data

Chapter 7. Classification and Prediction

Maximum Entropy Models and Feature Engineering CSCI-GA.2591

MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.

Multiple Regression.

Adopted from Bin UIC Recommender Systems Adopted from Bin UIC.

Collaborative Filtering Nearest Neighbor Approach

Advanced Artificial Intelligence

Collaborative Filtering Matrix Factorization Approach

CSCI B609: “Foundations of Data Science”

Word Embedding Word2Vec.

North Dakota State University Fargo, ND USA

An Introduction To The Backpropagation Algorithm

ITEM BASED COLLABORATIVE FILTERING RECOMMENDATION ALGORITHEMS

Contact: Machine Learning – (Linear) Regression Wilson Mckerrow (Fenyo lab postdoc) Contact:

Matrix Factorization & Singular Value Decomposition

CSE 491/891 Lecture 25 (Mahout).

North Dakota State University Fargo, ND USA

Neural networks (1) Traditional multi-layer perceptrons

Artificial Intelligence 10. Neural Networks

Statistical Models and Machine Learning Algorithms --Review

Recommendation Systems

Recommender Systems Group 6 Javier Velasco Anusha Sama

Semi-Supervised Learning

MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.

Presentation transcript:

Q4 : How does Netflix recommend movies? Networked Life: 20 Questions and Answers (M. Chiang, Princeton University) Q4 : How does Netflix recommend movies? Prof. Hongseok Kim

Netflix Business 1 : DVD Business 2 : Online streaming Rental business in 1997 Just wait for DVDs to arrive by mail Cannot receive a new DVD without returning the old one Sliding window Business 2 : Online streaming Streaming movies and TV programs Up to 23 million subscribers by April 2011

Examples Amazon: Content-based filtering YouTube: Co-visitation counts Pandora: Experts + Thumbs up or down Netflix: Collaborative-filtering

Input User ID Movie ID Rating Timing 𝒖 𝒊 {1, 2, 3, 4, 5} , 𝒓 𝒖𝒊 {1, 2, 3, 4, 5} , 𝒓 𝒖𝒊 Timing date of rating , 𝒕 𝒖𝒊

Output Predicted rating Example) Predicted rating : 4.2 User will rate 4 stars with 80% probability & 5 stars with 20% probability

Metric Customer satisfaction Prediction effectiveness Prediction error RMSE Hamming distance Hard to gather data C : (u,i) pairs The smaller the RMSE, the better the recommendation system

The Netflix Prize Objective October 2006 10% over Cinematch? Could recommendation accuracy be improved by 10% RMSE over what Netflix was using? October 2006 Open, online, international competition 10% over Cinematch? $1M and 100 Million data points 1999 ~ 2006(7 years) 480,000 users 17,770 movies Skewed, Sparse data

Data Sets Similar statistical properties Can be used by each competing team as often as they want At most once a day Final decision is based on comparison of RMSE on the test set

Timeline 5,000 teams 44,000 submissions

The problem Unknown ratings to be predicted (Only Netflix knows)

Challenges and solutions Large and sparse data Two main types of techniques for recommendation Content-based filter : Amazon Only looks at each row in isolation and attaches labels to the columns If you like a comedy with X, you will probably like another comedy with X Collaborative filter : Netflix Exploits all the data in the entire table Neighborhood method Compute a similarity score, Similar movies & users Latent factor method Hidden, low-dimensional structures

A few detours Least squares Convex optimization Implicit feedback Linear regressions Convex optimization Generalizes linear programs Implicit feedback Which movies she browsed, which ones she watched, and which ones she bothered to rate at all are all helpful hints Temporal dynamics Time-dependent parameters Allows the model to capture changes in a person’s taste and in trends of the movie market, as well as the mood of the day

Parameterized models

Baseline predictor Average predictor Baseline predictor RMSE = (𝑢,𝑖) 𝑟 𝑢,𝑖 𝐶 C= (u,i) pairs

RMSE minimization Condition : user1, two movie(A,B)

Least squares B에 대해 미분

Solution

Regularization Overfitting Regularization Least squares solutions often suffer from the overfitting problem Fits the known data in the training set so well that it loses the flexibility to adjust to a new data set Regularization A standard technique to avoid overfitting Minimize weight of parameters Original least square Trade-off parameter Penalty

After baseline predictor Error matrix Prediction matrix Actual rating matrix

Convex optimization Minimize convex objective function Least squares is a special case of convex optimization Subject to convex constraint set Easy in theory and in practice

Convex set (c) (d) (e) Which is a convex set?

Convex set Definition Most important property Separate by a line

Convex function Which is a convex function?

Convex function Second derivative test Hessian matrix of a function All eigenvalues of hessian matrix are non-negative Positive Semi Definite(PSD) 𝑓( 𝑥 1 , 𝑥 2 , … 𝑥 𝑛 ) ( 𝛻 2 𝑓) 𝑖𝑗 = 𝜕 2 𝑓 𝜕 𝑥 𝑖 𝜕 𝑦 𝑗

Neighborhood method From local to global structure Pairwise statistical correlation User-user Two similar people Movie-movie Two similar movie

Similarity metric Cosine coefficient

Neighborhood

Neighborhood predictor Baseline predictor + weighted sum of ratings from neighbor movies weight Similar movie Baseline predictor Normalize

Summary

Example test data training data

Baseline predictor min 30 training data 15 variables b 30x15

Prediction User Movie

Rating matrix(Estimated by the baseline predictor)

Prediction

Similarity Use the cosine coefficient to measure the similarity between movies represented in The entire similarity matrix

Neighborhood predictor

Prediction

Summary Netflix Prize is a special case of recommendation system Collaborative filter leverages similarities among users or among movies to make prediction Minimizing RMSE may lead to least squares A special case of convex optimization