Predictive Client-Side Profiles for Personalized Advertising Misha Bilenko and Matt Richardson.

Slides:



Advertisements
Similar presentations
Managing Hardware and Software Assets
Advertisements

Display Ads and Page Performance: A Brief Tour of the Ad Ecosystem Tony Ralph.
1 SANS Technology Institute - Candidate for Master of Science Degree 1 Assessing Privacy Risks of Flash Cookies Kevin Fuller and Stacy Jordan February.
Topics we will discuss tonight: 1.Introduction to Google Adwords platform 2.Understanding how to text ads are used. Display advertising will not be discussed.
Web Programming 1 Darby Chang Web Programming. Cookie 2 Web Programming.
17th February, 2000 by Maciej Korzeniowski (CERN-IT-IA-MI) 1 Oracle Discoverer Product Presentation  This is an ad hoc query and analysis tool for.
Psychological Advertising: Exploring User Psychology for Click Prediction in Sponsored Search Date: 2014/03/25 Author: Taifeng Wang, Jiang Bian, Shusen.
EFRONT V4 EXTENSIONS ARCHITECTURE. The goal  To offer more flexibility to 3 rd party users to modify eFront functionality  To further extend eFront.
Yes, that data is correct How our Google Shopping numbers exploded from Q to Q By Gaby Galiani, Account Manager.
Online Privacy and Codes of Conduct Peter Fleischer Global Privacy Counsel my personal blog:
■ Google’s Ad Distribution Network ■ Primary Benefits of AdWords ■ Online Advertising Stats and Trends ■ Appendix: Basic AdWords Features ■ Introduction.
Chapter 11 Privacy Policies and Behavioral Marketing.
Catching the Drift: Learning Broad Matches from Clickthrough Data Sonal Gupta, Mikhail Bilenko, Matthew Richardson University of Texas at Austin, Microsoft.
TARGETED, NOT TRACKED: CLIENT-SIDE SOLUTIONS FOR PRIVACY-FRIENDLY BEHAVIORAL ADVERTISING Janice Tsai Misha Bilenko Matt Richardson.
WELCOME TO THE MCCLOUD SERVICES CUSTOMER WEB PORTAL TUTORIAL.
CLICK FRAUD Alexander Tuzhilin By Vinny Rey. Why was the study done? Google was getting sued by advertisers because of click fraud. Google agreed to have.
© 2006 Pearson Education Canada Inc Canadian Advertising in Action Chapter 12 Internet Communications.
Google Online Marketing Challenge (GOMC)
3-1 Chapter Three. 3-2 Secondary Data vs. Primary Data Secondary Data: Data that have been gathered previously. Primary Data: New data gathered to help.
July 25, 2005 PEP Workshop, UM A Single Sign-On Identity Management System Without a Trusted Third Party Brian Richardson and Jim Greer ARIES Lab.
PRIVAD: PRACTICAL PRIVACY IN ONLINE ADVERTISING Offense: Arindam Paul.
Automated Tracking of Online Service Policies J. Trent Adams 1 Kevin Bauer 2 Asa Hardcastle 3 Dirk Grunwald 2 Douglas Sicker 2 1 The Internet Society 2.
WEB ANALYTICS Prof Sunil Wattal. Business questions How are people finding your website? What pages are the customers most interested in? Is your website.
HTTP: cookies and advertising Concepts to cover:  web page content (including ads) from multiple site: composition at client  cookies  third-party cookies:
AdWords Instructor: Dawn Rauscher. Quality Score in Action 0a2PVhPQhttp:// 0a2PVhPQ.
Fall 2006 Davison/LinCSE 197/BIS 197: Search Engine Strategies 6-1 Module II Overview PLANNING: Things to Know BEFORE You Start… Why SEM? Goal Analysis.
HAL R VARIAN FEBRUARY 16, 2009 PRESENTED BY : SANKET SABNIS Online Ad Auctions 1.
1 3 Web Proxies Web Protocols and Practice. 2 Topics Web Protocols and Practice WEB PROXIES  Web Proxy Definition  Three of the Most Common Intermediaries.
David Pardoe Doran Chakraborty Peter Stone The University of Texas at Austin Department of Computer Science TacTex-09: A Champion Bidding Agent for Ad.
Anindya Ghose Sha Yang Stern School of Business New York University An Empirical Analysis of Sponsored Search Performance in Search Engine Advertising.
Understanding and Predicting Graded Search Satisfaction Tang Yuk Yu 1.
Privacy, P3P and Internet Explorer 6 P3P Briefing – 11/16/01.
Azure Backup New Business Model March 16 th 2015.
1 Tradedoubler & Mobile Mobile web & app tracking technical overview.
Privacy-Aware Personalization for Mobile Advertising
Optimizing Marketing Spend Through Multi-Source Conversion Attribution David Jenkins.
Big Data Bijan Barikbin Denisa Teme Matthew Joseph.
Canadian Advertising in Action, 6th ed. Keith J. Tuckwell ©2003 Pearson Education Canada Inc Elements of the Internet World Wide Web World.
Privacy Debate: Urgent Issue or Industry Hype? Getting Better all the time Can’t get no worse Bridging the Alan Chapell.
Online Advertising Greg Lackey. Advertising Life Cycle The Past Mass media Current Media fragmentation The Future Target market Audio/visual enhancements.
Hiding in the Mobile Crowd: Location Privacy through Collaboration.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
Bids and Budgeting Quiz Review. How is Ad Rank determined?
Georgios Kontaxis‡, Michalis Polychronakis‡, Angelos D. Keromytis‡, and Evangelos P.Markatos* ‡Columbia University and *FORTH-ICS USENIX-SEC (August, 2012)
BEHAVIORAL TARGETING IN ON-LINE ADVERTISING: AN EMPIRICAL STUDY AUTHORS: JOANNA JAWORSKA MARCIN SYDOW IN DEFENSE: XILING SUN & ARINDAM PAUL.
U.S. Department of Commerce Web Advisory Group Minding Your Own Business The Platform for Privacy Preferences Project.
DaaS (Desktop as a Service) Last Update: July 15 th, 2015.
Windows Role-Based Access Control Longhorn Update
Ads Jim Jansen College of Information Sciences and Technology The Pennsylvania State University
1 Patron Data Management and Library Systems: A Vendor Perspective ALA Conference Summer, 2004.
Bloom Cookies: Web Search Personalization without User Tracking Authors: Nitesh Mor, Oriana Riva, Suman Nath, and John Kubiatowicz Presented by Ben Summers.
Restoring Privacy, Cleaning Your Computer's Cookies and Beacons.
Lecture III: Challenges for software engineering with the cloud CS 4593 Cloud-Oriented Big Data and Software Engineering.
Week 1 Introduction to Search Engine Optimization.
Module 5: Managing Content. Overview Publishing Content Executing Reports Creating Cached Instances Creating Snapshots and Report History Creating Subscriptions.
Online Data Storage Companies MY Docs Online. Comparison Name Personal Edition Enterprise Edition Transcription Edition Price $9.95 monthly rate $4.99.
Some from Chapter 11.9 – “Web” 4 th edition and SY306 Web and Databases for Cyber Operations Cookies and.
Cloud Computing: Legislative and Regulatory Frameworks Presentation to AREGNET Ria M. Thomas 29 April 2014 Occid-OrientStrategies.
Talal H. Noor, Quan Z. Sheng, Lina Yao,
Building Regression Tests With PeopleSoft Test Framework
Overview – SOE PatchTT November 2015.
Browser Settings *Failure to have the correct Browser cache setting may result in incorrect data being displayed. This is the procedure to allow Indistar.
Hybrid Cloud Architecture for Software-as-a-Service Provider to Achieve Higher Privacy and Decrease Securiity Concerns about Cloud Computing P. Reinhold.
Google AdWords Integration
Personalizing Search on Shared Devices
Training Deck – SEM & Facebook Ads
What is Cookie? Cookie is small information stored in text file on user’s hard drive by web server. This information is later used by web browser to retrieve.
Current Developments in Differential Privacy
Let’s browse the web User browses to a website
Cross Site Request Forgery (CSRF)
Presentation transcript:

Predictive Client-Side Profiles for Personalized Advertising Misha Bilenko and Matt Richardson

Cookie-cleared User Sees This Ad

User with Cookies Sees A Different Ad

All Advertising Should Be Personalized  Driven by economics  Publishers, platforms: average CPM rates 2.7x higher [Beales ‘10]  Advertisers: 6x gain in CTR [Yao et al. ‘08]  What about users?  “It’s a little creepy, especially if you don’t know what’s going on” [NYT ‘11]  Ad industry: users can opt out via  Privacy advocates: third-party tracking must be regulated  Browsers: Do Not Track (FF, IE, Safari), KeepMyOptOuts (Chrome)  Legislation: multiple bills/hearings in US; European e-Privacy directive

This Talk  Client-side profiles balance ad personalization and user control  Compact profile construction as an online optimization problem  Machine learning for profile construction  Experiments: revenue difference for client-side vs. server-side

Privacy Problem: Lack of Knowledge+Control  Users do not know what is stored, where and why  Use, retention, sharing  Users cannot edit or delete their behavioral data  Deleting cookies insufficient: re-identification, LBOs, local storage  Opting out ≠ having your data purged  Most users find tracking invasive when asked [McDonald-Cranor ’10]  But don’t do much about it: Do Not Track adoption in Firefox: 4-6%  Do Not Track regulation proposals misguided, impractical  Mandatory opt-in toxic to publishers;“3 rd party” is a false bogeyman  Alternative: “Do No Track Server-side”

Server-side User Profiles in Advertising (query or url)

Server-side User Profiles in Advertising (query or url) (ad)

Server-side User Profiles in Advertising (query or url) (ad)

Client-only Profiles

+ No plugins (AdNostic, RePRIV, Privad: users install plugins) + No major changes to serving infrastructure + Targeting server-side (advanced features/algorithms) + Profile update server-side (advanced features/algorithms) + Platform cost-saving: not paying for profile storage - Must trust ad platform to comply with policy and not retain  Debatable proposition for security researchers…  …but HTTP-header Do Not Track makes the same assumption  …because we generally trust companies to be law-abiding  …and it aligns with their long-term incentives

Profile Update: Problem Definition Query Ad Click Pageview

Personalization Modalities in Advertising  Profile uses for ad platforms:  Selection: profile keywords enhance pool of considered ads  Allocation: improving CTR prediction, pricing and ranking  Profiles uses for advertisers  Bid increments: trigger for keyword matching context *and* profile  Differentiation between casual vs. strong user interest  Supported by conversion rate trends

Profile Utility with CPC Bid Increments Probability that profile will match future context Probability of profile- matched ad clicked Bid increment Revenue with profiles Revenue without profile (non-personalized)

Core Problem: Profile Update Probability of being shown and clicked Bid increment Newly incremented ads due to this keyword

Keyword Utility: Learning to the Rescue Probability of being shown and clicked Bid increment Newly incremented ads due to this keyword

Putting it All Together: Profile Update  Key trick: keep a cache of recent contexts with the profile  Used only for expansion, not for charging increments!

Experimental Setup  Replay a large user sample (2.4M) from two months of Bing logs  Profiles constructed online and scored against actual ad clicks  Pessimistic: underestimates effects from improvements in pClick/ranking  Dataset construction on Cosmos (MapReduce)  Runs on compressed data on multicore (L-BFGS logistic regression)  Features: frequency/recency, historical counts, decay windows, etc.  $$$ question: how do client-side and server-side profiles compare?  Evaluate the effects of:  Profile size: used for matching  Cache size: used for expanding the candidate selection pool

Client-side vs. Server-side Utility  Cache size: number of query events stored client-side  Moderate cache size performs close to optimal

Client-side vs. Server-side vs. Oracle  What % of future user activity can we match at all?  Caveat: depends on matching function (graph)

Conclusions  Client-side profiles balance industry and privacy concerns  Require little change to current ad platform infrastructure  Retain 97+% of server-side personalization revenue gains  Principled utility-based framework for ad personalization  Quantifies gains from offering bid-increments

Probability of being shown and clicked Bid increment Newly incremented ads due to this keyword

Making Profiles Incentive-Compatible

More on Trusting the Platform  If I have to trust the server anyway, why not trust it to store my profile as well?  Trusting not to store is a lower bar than trusting to properly handle profile  Storing profile on server = Trusting any team with access to your profile to:  Know the policies  Correctly implement things like opt-out, retention, publication.  Either never copy your history, or ensure your edits/deletions are propagated through to all copies.  Not to share it with any other team that might not know these things  Storing profile on client = Trusting just the team that receives the profile to use it and throw it away.