Adaptive, Personalized Diversity for Visual Discovery

Slides:



Advertisements
Similar presentations
Multichannel Personalization At Scale: Building Incremental Value With #dunnhumby.
Advertisements

Modelling Relevance and User Behaviour in Sponsored Search using Click-Data Adarsh Prasad, IIT Delhi Advisors: Dinesh Govindaraj SVN Vishwanathan* Group:
INTRODUCTION TO MACHINE LEARNING Bayesian Estimation.
Part II – TIME SERIES ANALYSIS C5 ARIMA (Box-Jenkins) Models
Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.
Recall: Query Reformulation Approaches 1. Relevance feedback based vector model (Rocchio …) probabilistic model (Robertson & Sparck Jones, Croft…) 2. Cluster.
1 LM Approaches to Filtering Richard Schwartz, BBN LM/IR ARDA 2002 September 11-12, 2002 UMASS.
"Consumers and New Technologies: A Marketing Perspective" Course 3 Product management on the Internet: Personalisation and customisation Jacques Nantel.
Cohort Modeling for Enhanced Personalized Search Jinyun YanWei ChuRyen White Rutgers University Microsoft BingMicrosoft Research.
Analyzing Business Introductory Training Course for Store Managers.
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Airstream use case Houssam Nassif AISTATS’15.
1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,
Focused Matrix Factorization for Audience Selection in Display Advertising BHARGAV KANAGAL, AMR AHMED, SANDEEP PANDEY, VANJA JOSIFOVSKI, LLUIS GARCIA-PUEYO,
Your New FSU EMarket “Before and After” Guide Shopping, Favorites, and More...
Privacy risks of collaborative filtering Yuval Madar, June 2012 Based on a paper by J.A. Calandrino, A. Kilzer, A. Narayanan, E. W. Felten & V. Shmatikov.
RecSys 2011 Review Qi Zhao Outline Overview Sessions – Algorithms – Recommenders and the Social Web – Multi-dimensional Recommendation, Context-
Tag Ranking Present by Jie Xiao Dept. of Computer Science Univ. of Texas at San Antonio.
Bayesian Extension to the Language Model for Ad Hoc Information Retrieval Hugo Zaragoza, Djoerd Hiemstra, Michael Tipping Presented by Chen Yi-Ting.
Karthik Raman, Pannaga Shivaswamy & Thorsten Joachims Cornell University 1.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Evaluation of Recommender Systems Joonseok Lee Georgia Institute of Technology 2011/04/12 1.
Post-Ranking query suggestion by diversifying search Chao Wang.
Bayesian Optimization. Problem Formulation Goal  Discover the X that maximizes Y  Global optimization Active experimentation  We can choose which values.
1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.
Topic Modeling for Short Texts with Auxiliary Word Embeddings
Recommender Systems 11/04/2017
5 Ways to Optimize eCommerce Search Performance Presented by:
Mining Utility Functions based on user ratings
Data Mining: Concepts and Techniques
Recommender Systems & Collaborative Filtering
DATA COLLECTION METHODS IN NURSING RESEARCH
Figure 5: Change in Blackjack Posterior Distributions over Time.
Chapter 7. Classification and Prediction
Particle Swarm Optimization (2)
Adaptive, Personalized Diversity for
Improving your list of results in EBSCO Discovery Service (EDS)
US Consumer Analysis: Apparel and Footwear February, 2017.
New Trends Drive Menswear Sales
Preface to the special issue on context-aware recommender systems
CH 5: Multivariate Methods
Personalized Social Image Recommendation
Martin Rajman, Martin Vesely
Tingdan Luo 05/02/2016 Interactively Optimizing Information Retrieval Systems as a Dueling Bandits Problem Tingdan Luo
Segmentation Strategies: An Application to Nanostores in Bogota
Information Retrieval Models: Probabilistic Models
Machine Learning Feature Creation and Selection
Partial Credit Scoring for Technology Enhanced Items
Detecting Online Commercial Intention (OCI)
Compact Query Term Selection Using Topically Related Text
How to Run a DataOnDemand Report
Bandit’s Paradise: The Next Generation of Test-and-Learn Marketing
Lecture 12: Data Wrangling
Q4 : How does Netflix recommend movies?
Postdoc, School of Information, University of Arizona
How to quantify beautiful? How to engage users?
Automating Profitable Growth™
Chapter 2: Evaluative Feedback
Recommender Systems Copyright: Dietmar Jannah, Markus Zanker and Gerhard Friedrich (slides based on their IJCAI talk „Tutorial: Recommender Systems”)
Chapter Two Marcom’s Challenges: Enhancing Brand Equity, Influencing Behavior, and Being Accountable  2007 Thomson South-Western.
Marketing Communications Challenges: Enhancing Brand Equity, Influencing Behavior, and Being Accountable © 2010 South-Western, a part of Cengage Learning.
CS590I: Information Retrieval
Learning to Rank Typed Graph Walks: Local and Global Approaches
INF 141: Information Retrieval
Chapter Two Marcom’s Challenges: Enhancing Brand Equity, Influencing Behavior, and Being Accountable  2007 Thomson South-Western.
Get ready for trending fashion shopping with savings...
Chapter 2: Evaluative Feedback
Interactive Information Retrieval
Presentation transcript:

Adaptive, Personalized Diversity for Visual Discovery RecSys’16 Best Paper Award Houssam Nassif Be presenting a paper describing our solutions for overcoming the challenges we faced for launching a novel amazon product.

Amazon Stream: www.amazon.com/stream Stream is a new product discovery experience designed to show customer a large collection of products in an endless beautiful imagery. No customer intent. Window shopping.

Challenges Need for product scoring function Need for diverse stream Explore new products and exploit popular ones Need for diverse stream Hedge against the uncertainty in customer intent Need for season-aware stream Customers can engage with relevant products all year round Need for personalized stream Customers can focus more on products that they like The first challenge is to come up with a method for scoring products. Since new products get added to our catalog everyday, we need trade-off between exploring new products and exploiting the popular ones.

Explore-exploit in product scoring Product ID Brand Price Category 100352463 Jacket Calvin Klein $98 Bayesian Linear Probit Regression Click or No-click Product attributes, x Attribute weight distributions, w Customer actions Thompson Sampling Bandit Scoring function based on Probit Regression. Model maintains a weight distribution for each product attribute. Use multi-arm bandit to sample from the weights. Balances between exploring new products and exploiting popular ones. Product Score = F( Σi wi xi ) Wi ~ N(0,1)

Ranking by Product Scores If we rank products only by their scores, we might end up with a stream of very similar products.

Challenges Need for product scoring function Need for diverse stream Explore new products and exploit popular ones Need for diverse stream Hedge against the uncertainty in customer intent Need for season-aware stream Customers can engage with relevant products all year round Need for personalized stream Customers can focus more on products that they like So, there is a real need for diverse stream. It hedges against the uncertainly in customer intent and provides more exciting experience.

Diversity Diverse stream consists of a large variety of categories Stream Category: (Department, Product Type, Price Range) womens-dress-high A stream is considered diverse if it consists of products from a large variety of categories. we define a category as the combination of three product attributes, namely, department, product type, and price range.

Submodular diversity mix +3 +5 +11 +2 +4 Two submodular functions with different utilities. Create a mix. At some point the incremental utility of adding a circle is higher. Let us formalize Diversified list:

Submodular Utility Define utility of a product set D: “weighted sum of per-category product counts and scores” utility(D) = Σk wk log[1 + c(k, D) + s(k,D)] Utility of product set is defined as the weighted sum of per-category product counts and scores. the larger the weight for a category, the more products from that category will appear in the stream. If W of category A is twice that of B, we will have twice as much A than B. Weight for category k Number of products in category k Total score of products in category k

Results: Diversified Ranking Submodular vs multinomial diversity Submodular ranker put more relevant products at the top Metric Result Time spent +0.05% product view count -1.32% Product clicks +9.82% The results suggests that submodular diversified ranking is better at putting more relevant product at the top of stream.

Challenges Need for item scoring function Need for diverse stream Explore new products and exploit popular ones Need for diverse stream Hedge against the uncertainty in customer intent Need for season-aware stream Customers can engage with relevant products all year round Need for personalized stream Customers can focus more on products that they like Next, we need a season-aware stream that puts more weights on in-season products and less on off-season ones.

Adaptive Category Weights Fashion trends are seasonal Estimate weight for a category by its smoothed click-through-rate over a rolling-window wk = (clicksk + αk) / (viewsk + αk + βk) estimate category weights by smoothed category CTR over a rolling-window.

Results: Seasonality Metric Result Time spent +5.39% Item view count Adaptive vs static global weights Showing seasonally relevant products increases customer engagement Metric Result Time spent +5.39% Item view count +1.08% Product clicks +8.29% The results showed significant improvement across all metrics.

Challenges Need for product scoring function Need for diverse stream Explore new products and exploit popular ones Need for diverse stream Hedge against the uncertainty in customer intent Need for season-aware stream Customers can engage with relevant products all year round Need for personalized stream Customers can focus more on products that they like Finally we consider a personalized stream so that our customers can focus more on the products that they like.

Recall Define utility of a product set D: “weighted sum of per-category product counts and scores” utility(D) = Σk wk log[1 + c(k, D) + s(k,D)] The highlighted weights are what we are personalizing for the customers Weight for category k Number of products in category k Total score of products in category k

Personalized weights, Ep(w|c,d)[w] = (c + d) / |c + d|1 Personalization Category weights are personalized using past actions Agnostic of the diversification Easy to incorporate: searches, purchases, or returns Modeling: clicks, c ~ Multinomial(w) weights, w ~ Dirichlet(d) where d are default weights Personalize category weights using customer past actions. See clicks as drawn from a Multinomial distribution with the parameter vector w as Dirichlet prior. Generative model. Can incorporate external signals such as searches, purchases, or returns. Personalized weights, Ep(w|c,d)[w] = (c + d) / |c + d|1

Personalized Category Weights Distribution over categories might be sparse (i.e., filter bubble) personalized category weight vector eg. Mostly sparse. There is issue of filter bubble in which customers may not see interesting products in other categories. So, we need a way to encourage discovery in such cases.

Category-category Correlations Diffuse preference for category A to category B based on the correlation between A and B “clicked A then clicked B” diffuse customer preferences for a category to another category based on their correlation. Matrix of conditional probabilities. probability of customer interacting with product from column B given that customer had interacted with raw A high intensity: see text Customers who interacted with any women’s categories tend to also interact with women’s accessory, low-price dress and low-price handbag.

 × Customer preference vector Category-category correlations We diffuse preferences by multiplying the correlation matrix and the customer’s preference vector. The resulting customer preference vector is smoother and exposes more categories for the purpose of discovery. Smoothed customer preference vector

Results: Personalized Diversifier Personalized vs global weights Personalizing weights: Decreases scroll through Increases customer interactions Metric Result Time spent +1.10% Item view count -4.95% Product clicks +12.58% This results suggested that personalization exposed more relevant products at the top of stream.

Takeaways Explore/exploit strategy is important for product discovery Modeling diminishing returns is essential for diversity Seasonal and fashion trends can be learned from customer behavior Personalization enhances customer experience Informational pitch

Thank you! Questions? houssamn@amazon.com We are hiring!