A Search-based Method for Forecasting Ad Impression in Contextual Advertising Defense.

Slides:



Advertisements
Similar presentations
Answering Approximate Queries over Autonomous Web Databases Xiangfu Meng, Z. M. Ma, and Li Yan College of Information Science and Engineering, Northeastern.
Advertisements

Google News Personalization: Scalable Online Collaborative Filtering
Alex Cheung and Hans-Arno Jacobsen August, 14 th 2009 MIDDLEWARE SYSTEMS RESEARCH GROUP.
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
1 VLDB 2006, Seoul Mapping a Moving Landscape by Mining Mountains of Logs Automated Generation of a Dependency Model for HUG’s Clinical System Mirko Steinle,
Learning Influence Probabilities in Social Networks 1 2 Amit Goyal 1 Francesco Bonchi 2 Laks V. S. Lakshmanan 1 U. of British Columbia Yahoo! Research.
LEARNING INFLUENCE PROBABILITIES IN SOCIAL NETWORKS Amit Goyal Francesco Bonchi Laks V. S. Lakshmanan University of British Columbia Yahoo! Research University.
Qinqing Gan Torsten Suel Improved Techniques for Result Caching in Web Search Engines Presenter: Arghyadip ● Konark.
TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.
Split Questionnaire Designs for Consumer Expenditure Survey Trivellore Raghunathan (Raghu) University of Michigan BLS Workshop December 8-9, 2010.
Discrete Choice Model of Bidder Behavior in Sponsored Search Quang Duong University of Michigan Sebastien Lahaie
Catching the Drift: Learning Broad Matches from Clickthrough Data Sonal Gupta, Mikhail Bilenko, Matthew Richardson University of Texas at Austin, Microsoft.
Contextual Advertising by Combining Relevance with Click Feedback D. Chakrabarti D. Agarwal V. Josifovski.
1 Learning User Interaction Models for Predicting Web Search Result Preferences Eugene Agichtein Eric Brill Susan Dumais Robert Ragno Microsoft Research.
Search Engines & Search Engine Optimization (SEO) Presentation by Saeed El-Darahali 7 th World Congress on the Management of e-Business.
Information Retrieval in Practice
Search Engines and Information Retrieval
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Efficient Processing of Top-k Spatial Keyword Queries João B. Rocha-Junior, Orestis Gkorgkas, Simon Jonassen, and Kjetil Nørvåg 1 SSTD 2011.
Ph.D. DefenceUniversity of Alberta1 Approximation Algorithms for Frequency Related Query Processing on Streaming Data Presented by Fan Deng Supervisor:
Scaling Personalized Web Search Glen Jeh, Jennfier Widom Stanford University Presented by Li-Tal Mashiach Search Engine Technology course (236620) Technion.
Online Expansion of Rare Queries for Sponsored Search Defended by Mykell Miller.
1 Internet Advertising Ramana Yerneni, Yahoo! Labs August 17, 2010.
Preference Analysis Joachim Giesen and Eva Schuberth May 24, 2006.
1 Measurement-based Characterization of a Collection of On-line Games Chris Chambers Wu-chang Feng Portland State University Sambit Sahu Debanjan Saha.
Abstract Shortest distance query is a fundamental operation in large-scale networks. Many existing methods in the literature take a landmark embedding.
Search Engine Optimization (SEO)
Overview of Search Engines
Niranjan Balasubramanian Aruna Balasubramanian Arun Venkataramani University of Massachusetts Amherst Energy Consumption in Mobile Phones: A Measurement.
Search Engine Marketing Pay Per Click. Distinctions: SEM vs. SEO Search Engine Marketing (SEM), aka –“paid search” –“pay per click” (PPC) Search Engine.
“ The Initiative's focus is to dramatically advance the means to collect,store,and organize information in digital forms,and make it available for searching,retrieval,and.
Stream Clustering CSE 902. Big Data Stream analysis Stream: Continuous flow of data Challenges ◦Volume: Not possible to store all the data ◦One-time.
Navigating and Browsing 3D Models in 3DLIB Hesham Anan, Kurt Maly, Mohammad Zubair Computer Science Dept. Old Dominion University, Norfolk, VA, (anan,
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Search Engines and Information Retrieval Chapter 1.
Search Engines & Search Engine Optimization (SEO).
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
Ramakrishnan Srikant Sugato Basu Ni Wang Daryl Pregibon 1.
Evaluation Methods and Challenges. 2 Deepak Agarwal & Bee-Chung ICML’11 Evaluation Methods Ideal method –Experimental Design: Run side-by-side.
1 Discovering Authorities in Question Answer Communities by Using Link Analysis Pawel Jurczyk, Eugene Agichtein (CIKM 2007)
Pairwise Document Similarity in Large Collections with MapReduce Tamer Elsayed, Jimmy Lin, and Douglas W. Oard Association for Computational Linguistics,
Search - on the Web and Locally Related directly to Web Search Engines: Part 1 and Part 2. IEEE Computer. June & August 2006.
Qingqing Gan Torsten Suel CSE Department Polytechnic Institute of NYU Improved Techniques for Result Caching in Web Search Engines.
CISC Machine Learning for Solving Systems Problems Presented by: Alparslan SARI Dept of Computer & Information Sciences University of Delaware
인지구조기반 마이닝 소프트컴퓨팅 연구실 박사 2 학기 박 한 샘 2006 지식기반시스템 응용.
BEHAVIORAL TARGETING IN ON-LINE ADVERTISING: AN EMPIRICAL STUDY AUTHORS: JOANNA JAWORSKA MARCIN SYDOW IN DEFENSE: XILING SUN & ARINDAM PAUL.
LOGO A comparison of two web-based document management systems ShaoxinYu Columbia University March 31, 2009.
Web Search Module 6 INST 734 Doug Oard. Agenda The Web Crawling  Web search.
Ads Jim Jansen College of Information Sciences and Technology The Pennsylvania State University
Contextual Advertising by Combining Relevance with Click Feedback Deepak Agarwal Joint work with Deepayan Chakrabarti & Vanja Josifovski Yahoo! Research.
Building a Distributed Full-Text Index for the Web by Sergey Melnik, Sriram Raghavan, Beverly Yang and Hector Garcia-Molina from Stanford University Presented.
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
Empirical Analysis of Search Advertising Strategies BHANU C. VATTIKONDA, VACHA DAVE, SAIKAT GUHA, ALEX C. SNOEREN.
Concept-based P2P Search How to find more relevant documents Ingmar Weber Max-Planck-Institute for Computer Science Joint work with Holger Bast Torino,
ApproxHadoop Bringing Approximations to MapReduce Frameworks
Introduction to Information Retrieval Introduction to Information Retrieval CS276 Information Retrieval and Web Search Chris Manning and Pandu Nayak Efficient.
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
1 Learning to Impress in Sponsored Search Xin Supervisors: Prof. King and Prof. Lyu.
1 A latent information function to extend domain attributes to improve the accuracy of small-data-set forecasting Reporter : Zhao-Wei Luo Che-Jung Chang,Der-Chiang.
1 VLDB, Background What is important for the user.
Does Internet media traffic really follow the Zipf-like distribution? Lei Guo 1, Enhua Tan 1, Songqing Chen 2, Zhen Xiao 3, and Xiaodong Zhang 1 1 Ohio.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Information Retrieval in Practice
Evaluation Anisio Lacerda.
Information Retrieval in Practice
Map Reduce.
Measurement-based Characterization of a Collection of On-line Games
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
Supporting Online Analytics with User-Defined Estimation and Early Termination in a MapReduce-Like Framework Yi Wang, Linchuan Chen, Gagan Agrawal The.
Presentation transcript:

A Search-based Method for Forecasting Ad Impression in Contextual Advertising Defense

Overview Background: Web and contextual advertising Motivation: importance of volume forecasting in contextual advertising Methodology: forecasting volume as an inverse of the ad retrieval Experiments

Web Advertising Huge impact on the Web and beyond $21 billion industry Main textual advertising channels: Search advertising Contextual advertising

Contextual Advertising (CA)

CA Basics Supports a variety of the web ecosystem Selects ads based on the “context”: Web page where the ads are placed Users that are viewing this page Interplay of three participants: Publisher Advertiser Ad network Advertiser’s goal is to obtain web traffic

Importance of Impression Volume Critical in planning and budgeting advertising campaigns Common questions for advertisers and intermediaries: Bid value Impact of ad variations Timing of the campaign

A Challenging Problem of Impression forecasting CA platforms are complex systems Have hundreds of contributing features A moving target, dynamic Publisher‘s content and traffic vary over time Large scale computation: billions of page views, hundreds of millions of distinct pages, and hundreds of millions of ads Dynamic bid landscape Competitors and what they are willing to pay

Current practice Run test ad in real traffic for a few days Simultaneously with the baseline Compare with the baseline Obvious drawbacks: Use ad serving infrastructure Expensive Inefficient Very long turn-around time

Forecasting as Inverse of Ad Retrieval Ad retrieval: given a page and a set of ads find the best ads Forecasting: given an ad and a set of past impressions, find where the ad would have been shown if it were in the system This work: assumes ads selected based on similarity of features: Use the WAND (Broder et al, CIKM 2003) DAAT algorithm as page selection Similarity of ad and context feature vectors: requires monotonic scoring function – this work uses dot product Features can be based on either user of page context.

Conceptual Work Flow Keep all the data used in ad retrieval for a given period For an unseen/incoming ad: Examine each impression Score the ad using the ad retrieval algorithm Compare the ad score with the score of the lowest ranking ad shown in the page view Count the impressions where the ad would have been shown

Main challenge: scale In order to beat scalability problem: Index only unique pages Adaptation of the WAND algorithm for count aggregation needed in forecasting A Two-level Process Use a posting list order to allow early termination

Indexing Unique Pages The revenue estimate of an ad-page pair: score(p,a) = similarity(p,a)*bid Revenue estimate for the lowest ranking ad: minScore p For repeating pages the similarity is constant However, ads and bids vary: Could change the lowest ranking ad of a unique page Only one index entry per unique page: What revenue to store for the lowest ranking ads? Save a distribution of estimates {rev1 … revn} Assign median to the minScore p MinScore p is recomputed based on the current ad supply

Two-level process (Impression forecasting) First phase (approximate) evaluation: maxWeight f = max{w f,p : for all p} Full evaluation:

Framework Offline processing Analyzing the pages Building a page inverted index Creating a page statistics file Online processing We use the inverted page index and page statistics to forecast the # of impressions of a given ad. Output Given a ad and bid, output the # of imp Give a ad, output the curve describe the relation b/w bid and # of impressions

Experiment Results Day to day forecast Week to week forecast

Observations: Similar results between day-day and week- week forecasting. The errors seems big, however, Due to the traffic fluctuation. Even with large margin of error, our result is still significant (it ’ s the best of its kind, and it ’ s still acceptable in campaigning budgeting and advertising strategy)

Top row has a good prediction. Bottom row does not match well due to traffic fluctuation, but match the trend and sharp very well.

Tradeoff b/w efficiency and accuracy Changing the value of minScore p will have effect on the output of the first level

Ad Variation Example Subtle difference could lead to dramatic performance change

Conclusion Ad retrieval algorithm is the determining factor in the CA impression volume forecasting Introduced a search-based forecasting as inverse of ad retrieval Promising experimental results Further work: combine search with learning approaches to further improve forecasting.