Personalized Recommendation on Dynamic Content Using Predictive Bilinear Models Wei ChuSeung-Taek Park WWW 2009 Audience Science Yahoo! Labs.

Slides:



Advertisements
Similar presentations
1 ©2009 MeeMix MeeMix – A personalized Experience.
Advertisements

Recommender System A Brief Survey.
Predicting User Interests from Contextual Information
Effective Change Detection Using Sampling Junghoo John Cho Alexandros Ntoulas UCLA.
Google News Personalization: Scalable Online Collaborative Filtering
Google News Personalization Scalable Online Collaborative Filtering
Temporal Query Log Profiling to Improve Web Search Ranking Alexander Kotov (UIUC) Pranam Kolari, Yi Chang (Yahoo!) Lei Duan (Microsoft)
Inferring User Political Preferences from Streaming Communications Svitlana Volkova 1, Glen Coppersmith 2 and Benjamin Van Durme 1,2 1 Center for Language.
Prediction Modeling for Personalization & Recommender Systems Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Modelling Relevance and User Behaviour in Sponsored Search using Click-Data Adarsh Prasad, IIT Delhi Advisors: Dinesh Govindaraj SVN Vishwanathan* Group:
A Graph-based Recommender System Zan Huang, Wingyan Chung, Thian-Huat Ong, Hsinchun Chen Artificial Intelligence Lab The University of Arizona 07/15/2002.
SIGIR 2013 Recap September 25, 2013.
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
- 1 - Intro to Content Optimization Yury Lifshits. Yahoo! Research Largely based on slides by Bee-Chung Chen, Deepak Agarwal & Pradheep Elango.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
Yehuda Koren , Joe Sill Recsys’11 best paper award
Click Evidence Signals and Tasks Vishwa Vinay Microsoft Research, Cambridge.
1 The Four Dimensions of Search Engine Quality Jan Pedersen Chief Scientist, Yahoo! Search 19 September 2005.
Time-dependent Similarity Measure of Queries Using Historical Click- through Data Qiankun Zhao*, Steven C. H. Hoi*, Tie-Yan Liu, et al. Presented by: Tie-Yan.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Recommender systems Ram Akella November 26 th 2008.
1 Collaborative Filtering: Latent Variable Model LIU Tengfei Computer Science and Engineering Department April 13, 2011.
Measuring and Monitoring Social Media Presence Measuring and Monitoring Social Media Presence Rim Dakelbab.
Cohort Modeling for Enhanced Personalized Search Jinyun YanWei ChuRyen White Rutgers University Microsoft BingMicrosoft Research.
EVENT IDENTIFICATION IN SOCIAL MEDIA Hila Becker, Luis Gravano Mor Naaman Columbia University Rutgers University.
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Content Recommendation on Y! sites Deepak Agarwal Stanford Info Seminar 17 th Feb, 2012.
ICML’11 Tutorial: Recommender Problems for Web Applications Deepak Agarwal and Bee-Chung Chen Yahoo! Research.
Anindya Ghose Sha Yang Stern School of Business New York University An Empirical Analysis of Sponsored Search Performance in Search Engine Advertising.
1 Pengjie Ren, Zhumin Chen and Jun Ma Information Retrieval Lab. Shandong University 报告人:任鹏杰 2013 年 11 月 18 日 Understanding Temporal Intent of User Query.
Gradual Adaption Model for Estimation of User Information Access Behavior J. Chen, R.Y. Shtykh and Q. Jin Graduate School of Human Sciences, Waseda University,
David Stern Ralf Herbrich Thore Graepel Microsoft Research Cambridge, UK Horst Samulowitz National ICT Australia University of Melbourne Melbourne, Australia.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
RecSys 2011 Review Qi Zhao Outline Overview Sessions – Algorithms – Recommenders and the Social Web – Multi-dimensional Recommendation, Context-
+ Recommending Branded Products from Social Media Jessica CHOW Yuet Tsz Yongzheng Zhang, Marco Pennacchiotti eBay Inc. eBay Inc.
Evaluation Methods and Challenges. 2 Deepak Agarwal & Bee-Chung ICML’11 Evaluation Methods Ideal method –Experimental Design: Run side-by-side.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Hao Wu Nov Outline Introduction Related Work Experiment Methods Results Conclusions & Next Steps.
©2015 Apigee Corp. All Rights Reserved. Preserving signal in customer journeys Joy Thomas, Apigee Jagdish Chand, Visa.
Social Media & Indian Beer Industry: A Research Report Picture Source:
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Google News Personalization: Scalable Online Collaborative Filtering
- 1 - Recommender Problems for Content Optimization Deepak Agarwal Yahoo! Research MMDS, June 15 th, 2010 Stanford, CA.
Content Optimization at Yahoo! Todd Beaupré. #1: 34 MM #1: 19 MM #1: 97 MM Re #1: 45 MM #1: 33 MM #1: 20 MM #1: 22 MM #1: 31 MM #1: 17.3 MM.
David Stern, Thore Graepel, Ralf Herbrich Online Services and Advertising Group MSR Cambridge.
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
Google News Personalization Big Data reading group November 12, 2007 Presented by Babu Pillai.
TO Each His Own: Personalized Content Selection Based on Text Comprehensibility Date: 2013/01/24 Author: Chenhao Tan, Evgeniy Gabrilovich, Bo Pang Source:
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Unsupervised Streaming Feature Selection in Social Media
1 Raghu Ramakrishnan Research Fellow Chief Scientist, Audience and Cloud Computing Yahoo! Purple Clouds:
Regression Based Latent Factor Models Deepak Agarwal Bee-Chung Chen Yahoo! Research KDD 2009, Paris 6/29/2009.
E-commerce in Your Inbox Product Recommendations at Scale
Saving Bitrate vs. Users: Where is the Break-Even Point in Mobile Video Quality? ACM MM’11 Presenter: Piggy Date:
Item-Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl GroupLens Research Group/ Army.
Portrait of an Online Shopper: Understanding and Predicting Consumer Behavior Farshad Kooti* Kristina Lerman* Luca Aiello † Mihajlo Grbovic † Nemanja Djuric.
Collaborative Deep Learning for Recommender Systems
PREDICTION ON TWEET FROM DYNAMIC INTERACTION Group 19 Chan Pui Yee Wong Tsz Wing Yeung Chun Kit.
Algorithms for Position Bias Correction in Ranking Anirban Majumder Machine Learning, Amazon.
A Case Study of Behavior-driven Conjoint Analysis on Yahoo
He Xiangnan Research Fellow National University of Singapore
Data Mining: Concepts and Techniques
Matchbox Large Scale Online Bayesian Recommendations
Make Predictions Using Azure Machine Learning Studio
Intro to Content Optimization
Methods and Metrics for Cold-Start Recommendations
The Four Dimensions of Search Engine Quality
Author: Kazunari Sugiyama, etc. (WWW2004)
Comparative Evaluation of SOM-Ward Clustering and Decision Tree for Conducting Customer-Portfolio Analysis By 1Oloyede Ayodele, 2Ogunlana Deborah, 1Adeyemi.
Presentation transcript:

Personalized Recommendation on Dynamic Content Using Predictive Bilinear Models Wei ChuSeung-Taek Park WWW 2009 Audience Science Yahoo! Labs.

Outline Dynamic content –Yahoo! Front Page Today Module –Difficulties on new users and new items Personalized recommendation –Global level, one-size-fits-all / most popular –Segmented level, segmentation –Individual level, personalization Methodology –Predictive bilinear models Findings in the case study Conclusions WWW 2009

Dynamic Content WWW 2009 Yahoo! Front Page

At default, the article at F1 is highlighted at the Story position. Articles are selected from a hourly-refreshed article pool. Replacement on out-of-date articles happens every a few hours. GOAL: select the most attractive article for the Story position to draw users attention and then increase users retention. Dynamic Content WWW 2009 Today Module

Dynamic Content WWW 2009 Today Module a)Click-through rate (CTR) is decaying temporally, e.g. breaking news. b)About 40% clickers are first-time clickers. c)Lifetime of an article is usually short, only a few hours; d)The universe of content pool is dynamic. 9 days data

Difficulties on Dynamic Content WWW 2009 Collaborative filtering provides good solution to a closed world –Overlaps in feedback across users are relatively high –The universe of content items is almost static CTR is decaying temporally –Difficult to compare users feedback on the same article received at different time slots Lifetime of an article is usually short, only a few hours –Reduce overlaps in feedback across users The universe of content pool is dynamic –Have to wait for clicks on new items (content-based filtering helps) –Storage and retrieval of historical ratings of retired items are demanding About 40% clickers are first-time clickers –Hard on new users without historical ratings (most popular is baseline) Cold-Start Recommendation existing items new items existing users Collaborative filtering Content-based filtering new users most popular WAIT

Solution: Feature-based modeling WWW 2009 Users with open profiles –Demographical information, age, gender, location –Property usage over Yahoo! networks –Search logs, purchase history etc. Content profile management –Static descriptors: categories, title, bags of words of textual content etc. –Temporal characteristics: popularity, CTR, freshness etc. Feature-based regression models for personalization at individual level –New items: initialize popularity based on static meta features –New users: estimate preferences on items based on user features

Methodology WWW 2009 Data representation –User features ( D features per user) –Content features ( C features per article) –Historical feedback (story click or not) Predictive bilinear models –Bilinear score for a user/article pair the b -th feature of user the a -th feature of item affinity between and affinitysportsfinance age age male C 100 D 1000

Model fitting on historical feedback –Regression on continuous targets –Logistic regression on binary targets –Probabilistic framework Optimal affinities at maximum a posteriori (MAP) estimate Prediction Offline Optimization affinitysportsfinance age 50?? age 20?? male?? WWW 2009 affinitysportsfinance age age male C 100 D 1000

Data collection –Random serving policy –Temporal partition –About 40 million events for training –About 5 million distinct users –Test events (about 0.6 million story clicks) Offline performance metric –Click Portion: the fraction of clicks at rank position r Case Study WWW 2009 Application: Front Page Today Module

Data collection –Random serving policy –Temporal partition –About 40 million events for training –About 5 million distinct users –Test events (about 0.6 million story clicks) Offline performance metric –Click Portion: the fraction of clicks at rank position r Case Study WWW 2009 Application: Front Page Today Module Click Rank : 2 at the moment of the click event in test LikeDislike

Data collection –Random serving policy –Temporal partition –About 40 million events for training –About 5 million distinct users –Test events (about 0.6 million story clicks) Offline performance metric –Click Portion: the fraction of test clicks at rank r Case Study WWW 2009 Application: Front Page Today Module at the moment of click events in testClick Rank : 1 LikeDislike

Case Study WWW 2009 Baseline: select the article with the highest CTR (EMP) –One-size-fits-all approach by online CTR tracking (Agarwal et al. NIPS 2009; Agarwal et al. WWW 2009) Segmentation –Age/gender-based segmentation with 6 clusters (GM) –Conjoint analysis with 5 clusters (Chu et al. KDD 2009) (SEG5) Collaborative filtering –Item-based collaborative filtering (IBCF) –Content-based filtering (CB) –Hybrid CB with CTR (CB+EMP) : Feature-based personalized models –Bilinear regression (RG) –Logistic bilinear regression (LRG) –LRG without article CTR feature (LRG-CTR) Matchbox: Large Scale Bayesian Recommendations Stern, Herbrich and Graepel (WWW2009) Microsoft Res. Thursday XL-2, Statistical Methods

Case Study WWW 2009 Lift over the baseline EMP one-size-fits-all SEG5: tensor conjoint analysis with 5 clusters CB+EMP: Logistic Bilinear Models

Case Study WWW 2009 A utility function (overall performance at top 4 positions) where is Click Portion at rank r

Bucket Test WWW 2009 Lift on offline metric (click portion) of three segmentation models Gender: –male, female, unknown AgeGender: –11 segments Tensor-5 (SEG5): –5 clusters Method: Tensor-5 > AgeGender > Gender Lift at rank 1: 0.08 > 0.65 > 0.55

Bucket Test WWW 2009 Method: Tensor-5 > AgeGender > Gender Lift on offline metric : 8% > 6.5% > 5.5% Lift in online bucket test : 3.24 > 2.45% > 1.49%

Summary WWW 2009 Feature-based bilinear regression models for personalized recommendation on cold-start situation of dynamic content. The affinities between user attributes and content features are optimized by learning from historical user feedback. Alleviate cold-start difficulties by leveraging available information at both user and item sides. Significantly outperform six competitive approaches at global, segmented or individual levels on an offline metric. We validated our offline metric by bucket test on segmentation models.

Acknowledgment WWW 2009 We thank our colleague: Raghu Ramakrishnan Scott Roy Deepak Agarwal Bee-Chung Chen Pradheep Elango Ajoy Sojan Todd Beaupre Nitin Motgi Amit Phadke Seinjuti Chakraborty Joe Zachariah

Questions?