Focused Matrix Factorization for Audience Selection in Display Advertising BHARGAV KANAGAL, AMR AHMED, SANDEEP PANDEY, VANJA JOSIFOVSKI, LLUIS GARCIA-PUEYO,


Focused Matrix Factorization for Audience Selection in Display Advertising
BHARGAV KANAGAL, AMR AHMED, SANDEEP PANDEY, VANJA JOSIFOVSKI, LLUIS GARCIA-PUEYO, JEFF YUAN
PRESENTER: I GDE DHARMA NUGRAHA, CHONNAM NATIONAL UNIVERSITY

Outline
• Introduction
• Problem in Matrix Factorization
• Focused Matrix Factorization Model
• Model Learning and Inference
• Implementation
• Experimental Evaluation
• Conclusion

Introduction
• Audience selection (or audience retrieval) is the problem in display advertising of showing ads to the users who are most likely to be interested in, and respond positively to, a campaign.
• A user's past feedback on the campaign can be leveraged to construct such a list of users with collaborative filtering techniques such as matrix factorization.
• However, user-campaign interaction data is typically extremely sparse, so conventional matrix factorization does not perform well.
• Moreover, simply combining user feedback from all campaigns does not solve the problem, since it dilutes the focus on the target campaign under consideration.
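As a rough illustration of how matrix factorization can produce such a user list, the following minimal sketch ranks users by their predicted affinity to a campaign's items. The toy factor values, the averaging rule and the names `campaign_affinity` and `select_audience` are assumptions made for this example; they are not taken from the paper.

```python
# Hypothetical toy factors; in practice these would be learned from feedback logs.
user_factors = {
    "u1": [0.9, 0.1],
    "u2": [0.2, 0.8],
    "u3": [0.6, 0.5],
}
campaign_item_factors = [[1.0, 0.0], [0.8, 0.2]]  # items of the target campaign


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def campaign_affinity(user_vec, item_vecs):
    """Average predicted affinity of one user towards the campaign's items."""
    return sum(dot(user_vec, v) for v in item_vecs) / len(item_vecs)


def select_audience(user_factors, item_vecs, top_n):
    """Return the top_n user ids ranked by predicted campaign affinity."""
    ranked = sorted(
        user_factors,
        key=lambda u: campaign_affinity(user_factors[u], item_vecs),
        reverse=True,
    )
    return ranked[:top_n]


audience = select_audience(user_factors, campaign_item_factors, top_n=2)
```

With sparse feedback the learned factors themselves become unreliable, which is the weakness the slides go on to address.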

Introduction
• To resolve these issues, this paper proposes a novel focused matrix factorization (FMF) model, which
◦ learns users' preferences towards the specific campaign products, while also exploiting information about related products;
◦ exploits the product taxonomy to discover related campaigns, and designs models that discriminate between users' interest in campaign products and in non-campaign products.

Introduction
• The different approaches considered in this paper are illustrated in the figure.

Problem in Matrix Factorization

Focused Matrix Factorization Model

Model Learning and Inference
• To use SGD, a term from the summation is sampled, denoted by (i, u1, u2). Depending on whether the item is from the target campaign (i.e., i ∈ T) or from some non-target campaign j (i.e., i ∈ N_j), the model obtains two sets of gradients, which are shown in the figure.
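Since the gradient figure is not part of this transcript, the sketch below shows only a generic pairwise SGD step on a sampled triple (i, u1, u2). It assumes that u1 responded to item i and u2 did not, and it uses a logistic pairwise objective with L2 regularization, which is a swapped-in standard technique. It does not reproduce the two FMF-specific gradient sets for target and non-target items; `pairwise_sgd_step`, `lr` and `reg` are hypothetical names.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def pairwise_sgd_step(item_vec, pos_user, neg_user, lr=0.1, reg=0.01):
    """One ascent step on ln sigmoid(score(u1, i) - score(u2, i)).

    Assumption: pos_user (u1) responded to the item, neg_user (u2) did not.
    Returns the updated (item_vec, pos_user, neg_user) as new lists.
    """
    diff = [p - n for p, n in zip(pos_user, neg_user)]
    margin = dot(item_vec, diff)
    g = 1.0 - sigmoid(margin)  # derivative of ln sigmoid(margin) w.r.t. margin
    new_item = [v + lr * (g * d - reg * v) for v, d in zip(item_vec, diff)]
    new_pos = [p + lr * (g * v - reg * p) for p, v in zip(pos_user, item_vec)]
    new_neg = [n + lr * (-g * v - reg * n) for n, v in zip(neg_user, item_vec)]
    return new_item, new_pos, new_neg


# Toy usage: repeated steps on one triple push u1's score above u2's.
item, u1, u2 = [0.1, 0.2], [0.3, -0.1], [0.2, 0.4]
before = dot(item, u1) - dot(item, u2)
for _ in range(50):
    item, u1, u2 = pairwise_sgd_step(item, u1, u2)
after = dot(item, u1) - dot(item, u2)
```

In a model that distinguishes target from non-target items, the same step would branch on which set the sampled item belongs to and update different factor matrices accordingly.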

Implementation
• The model is implemented in C++ as a multi-core program, using the Boost library to store the factor matrices.
• The global state maintained by the SGD algorithm consists of the 3 factor matrices {v S, v N, v I} and the α vector. A lock is introduced for each row of the factor matrices.
• Each training iteration of the SGD algorithm executes 3 steps:
◦ First, sample a 3-tuple (i, u1, u2).
◦ Second, read the corresponding user and item factors and compute the gradients with respect to them.
◦ Third, update the factor matrices based on the gradients.
• Locking such small vectors can significantly increase the processing time. To alleviate this problem, a caching technique is proposed.
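The three steps and the per-row locking can be sketched as follows. This is an illustrative Python analogue, not the authors' C++ code: it keeps only one user matrix and one item matrix, samples the users at random instead of from feedback logs, and uses a plain pairwise logistic gradient. All names and sizes are hypothetical.

```python
import math
import random
import threading

K = 4                       # number of latent factors (hypothetical)
NUM_USERS, NUM_ITEMS = 6, 3

init = random.Random(0)
user_rows = [[init.uniform(-0.1, 0.1) for _ in range(K)] for _ in range(NUM_USERS)]
item_rows = [[init.uniform(-0.1, 0.1) for _ in range(K)] for _ in range(NUM_ITEMS)]
user_locks = [threading.Lock() for _ in range(NUM_USERS)]   # one lock per row
item_locks = [threading.Lock() for _ in range(NUM_ITEMS)]


def read_row(rows, locks, idx):
    """Return a private copy of one factor row, taken under that row's lock."""
    with locks[idx]:
        return list(rows[idx])


def add_to_row(rows, locks, idx, delta):
    """Add delta to one factor row while holding only that row's lock."""
    with locks[idx]:
        rows[idx] = [x + d for x, d in zip(rows[idx], delta)]


def worker(seed, steps, lr=0.05):
    rng = random.Random(seed)
    for _ in range(steps):
        # Step 1: sample a 3-tuple (i, u1, u2); random here, from logs in practice.
        i = rng.randrange(NUM_ITEMS)
        u1, u2 = rng.sample(range(NUM_USERS), 2)
        # Step 2: read the factors and compute the gradients.
        v = read_row(item_rows, item_locks, i)
        p1 = read_row(user_rows, user_locks, u1)
        p2 = read_row(user_rows, user_locks, u2)
        diff = [a - b for a, b in zip(p1, p2)]
        margin = sum(x * d for x, d in zip(v, diff))
        g = 1.0 / (1.0 + math.exp(margin))  # equals 1 - sigmoid(margin)
        # Step 3: update the factor rows; only one lock is held at a time.
        add_to_row(item_rows, item_locks, i, [lr * g * d for d in diff])
        add_to_row(user_rows, user_locks, u1, [lr * g * x for x in v])
        add_to_row(user_rows, user_locks, u2, [-lr * g * x for x in v])


threads = [threading.Thread(target=worker, args=(seed, 200)) for seed in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

Every read and write acquires a lock even though each row holds only a handful of numbers, so the locking overhead can easily dominate the arithmetic.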


Experimental Evaluation
• Experimental Setup
◦ The evaluation dataset is the log of previous advertising campaigns obtained from a major advertising network.
◦ The dataset contains information about the items corresponding to the various advertising campaigns, and an anonymized list of users who actually responded to each campaign by purchasing a campaign item.
◦ In addition, the dataset contains a taxonomy over the various items in the campaigns.
◦ It contains users and around a million items in the taxonomy.
◦ The taxonomy is 3 levels deep, with around 1500 nodes at the lowest level, 270 at the middle level and 23 top-level categories.
◦ Overall, the dataset contains 23 campaigns.

Experimental Evaluation
• Cross-validation/Parameter Sweep
◦ For each experiment, a parameter sweep was run on a MapReduce cluster.
◦ The swept parameters included U, I, N and K, the number of factors.
◦ Each parameter setting was evaluated with 4 different initializations, and the best initialization for each configuration was picked based on performance on the validation dataset.
◦ AUC on the test set, for a given number of factors, was chosen as the metric for reporting the experiments.
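The sweep procedure and the AUC metric can be sketched as below. The `auc` function is the standard pairwise definition. `fake_validation_auc` is a made-up stand-in for training and validating one model, the single `reg` parameter stands in for the swept model parameters, and the grid values are hypothetical. The sketch runs serially rather than on a MapReduce cluster.

```python
import itertools


def auc(scores, labels):
    """Probability that a random positive outranks a random negative (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def fake_validation_auc(reg, num_factors, seed):
    """Hypothetical stand-in: a real version would train a model and score it."""
    return 0.70 + 0.01 * num_factors - abs(reg - 0.1) + 0.001 * seed


def sweep(train_fn, regs, factor_counts, num_inits=4):
    """For each configuration, keep the best of num_inits initializations."""
    best = {}
    for reg, k in itertools.product(regs, factor_counts):
        runs = [(train_fn(reg, k, seed), seed) for seed in range(num_inits)]
        best[(reg, k)] = max(runs)  # (validation score, seed of that run)
    return best


best = sweep(fake_validation_auc, regs=[0.01, 0.1], factor_counts=[4, 8])
example_auc = auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0])
```

Selecting the initialization on validation data and reporting AUC on a separate test set keeps the reported numbers from being biased by the selection step.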

Experimental Evaluation
• Experimental Results
◦ In the first experiment, the GMF, MF and FMF2 techniques were compared across different campaigns.
◦ The figure shows the result.

Experimental Evaluation
• Experimental Results
◦ In the second experiment, performance on the individual campaigns was examined.
◦ The figure shows the result.

Experimental Evaluation
• Experimental Results
◦ In the third experiment, the best-performing model across all factor sizes was examined for each campaign.
◦ The figure shows the result.

Experimental Evaluation
• Experimental Results
◦ In the fourth experiment, the influence of the taxonomy on each model was examined.
◦ The figure shows the result.

Experimental Evaluation
• Experimental Results
◦ In the last experiment, the FMF models were compared with one another.
◦ For each of the four campaigns, FMF1, FMF2 and FMF3 models were trained.
◦ In the figure, FMF1 and FMF2 perform much better than FMF3.
◦ The reason is that FMF3 is much more constrained than the other two models.

Experimental Evaluation
• Experimental Results
◦ Effect of Campaign Size
◦ The performance of the FMF2 model was examined as a function of the target campaign size, i.e., the number of items in the target campaign.
◦ The figure shows the result for different campaign sizes.
◦ The figure shows that the performance of FMF2 is robust and largely unaffected by the campaign size.

Experimental Evaluation
• Experimental Results
◦ Effect of Intra-campaign Relationship (Campaign Homogeneity)
◦ In this experiment, the performance of the FMF2 model was explored as a function of the homogeneity of the target campaign.
◦ The figure shows that the AUC scores increase with the homogeneity of the campaign.

Experimental Evaluation
• Experimental Results
◦ Effect of Inter-campaign Relationship (Information Transfer)
◦ This experiment explores the effect of the inter-campaign relationship on information transfer in the FMF2 model.
◦ A fairly homogeneous campaign X is picked and split into two parts, X1 and X2. Another campaign Y is then picked, and two configurations are constructed from X1, X2 and Y. X1 is the target campaign in both. In config 1, X2 is the non-target campaign; in config 2, Y is the non-target campaign.
◦ The figure shows that config 1 has a higher AUC score than config 2, since its non-target campaign X2 is highly similar to X1.

Experimental Evaluation
• Experimental Results
◦ Efficiency
◦ This experiment demonstrates the trade-offs obtained by using the caching technique.
◦ The result is shown in the figure. When the threshold is set to 0, there is complete synchronization. As the threshold is increased, synchronization with the global copy is performed less often, resulting in a faster runtime but lower accuracy.
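The threshold behaviour can be illustrated with a small sketch of a worker-local cache. The slides do not say what the threshold measures, so this example assumes it is the number of local updates a worker may accumulate before pushing them to the global copy. The class `CachedRow` and the helper `run` are hypothetical and are not the authors' implementation.

```python
import threading


class CachedRow:
    """Worker-local cache of one factor row, synced lazily with the global copy."""

    def __init__(self, global_rows, locks, idx, threshold):
        self.global_rows, self.locks, self.idx = global_rows, locks, idx
        self.threshold = threshold           # assumed: allowed unsynced updates
        self.local = list(global_rows[idx])  # private copy, read without locking
        self.pending = [0.0] * len(self.local)
        self.unsynced = 0
        self.sync_count = 0

    def update(self, delta):
        """Apply delta locally; sync once more than `threshold` updates are pending."""
        self.local = [x + d for x, d in zip(self.local, delta)]
        self.pending = [p + d for p, d in zip(self.pending, delta)]
        self.unsynced += 1
        if self.unsynced > self.threshold:
            self.sync()

    def sync(self):
        """Push pending deltas to the global row under its lock, then refresh."""
        with self.locks[self.idx]:
            row = [x + p for x, p in zip(self.global_rows[self.idx], self.pending)]
            self.global_rows[self.idx] = row
            self.local = list(row)
        self.pending = [0.0] * len(self.local)
        self.unsynced = 0
        self.sync_count += 1


def run(threshold, n_updates=10):
    """Apply n_updates identical deltas; return (number of syncs, global row)."""
    rows, locks = [[0.0, 0.0]], [threading.Lock()]
    cache = CachedRow(rows, locks, 0, threshold)
    for _ in range(n_updates):
        cache.update([1.0, -1.0])
    return cache.sync_count, rows[0]
```

Under this assumption, `run(0)` takes the lock on every update, while `run(3)` takes it only twice in ten updates and leaves the last two updates unsynced, so other workers would read a stale global row. That is the runtime-versus-accuracy trade-off the slide describes.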

Conclusion
• This paper proposes the Focused Matrix Factorization (FMF) model, which borrows relevant information from other campaigns while retaining focus on the target campaign.
• The experimental results show that the FMF model consistently outperforms traditional matrix factorization techniques over all kinds of campaigns.
• In addition, the experiments characterize the conditions under which the approach obtains significant improvements.