Understanding the Network and User-Targeting Properties of Web Advertising Networks Yong Wang 1,2 Daniel Burgener 1 Aleksandar Kuzmanovic 1 Gabriel Maciá-Fernández.

Slides:



Advertisements
Similar presentations
The Internet.
Advertisements

David Martin for DAML-S Coalition 05/08/2003 OWL-S: Bringing Services to the Semantic Web David Martin SRI International
The Internet and the Web
Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
Detecting Malicious Flux Service Networks through Passive Analysis of Recursive DNS Traces Roberto Perdisci, Igino Corona, David Dagon, Wenke Lee ACSAC.
4.01 How Web Pages Work.
Analyzing Content-Level Properties of the Web Adversphere Yong Wang* **, Daniel Burgener**, Aleksandar Kuzmanovic**, Gabriel Maciá-Fernández*** * University.
IPv6 seen from DNS Dr. WANG Xin, CNNIC Labs 25 Aug 2009.
CLive Cloud-Assisted P2P Live Streaming
University of Illinois Visualizing Text Loretta Auvil UIUC February 25, 2011.
On the Effectiveness of Measurement Reuse for Performance-Based Detouring David Choffnes Fabian Bustamante Fabian Bustamante Northwestern University INFOCOM.
Ao-Jan Su and Aleksandar Kuzmanovic Department of EECS Northwestern University Thinning Akamai USENIX/ACM SIGCOMM IMC ’08.
One-Click Hosting Services: A File-Sharing Hideout Demetris Antoniades Evangelos P. Markatos ICS-FORTH Heraklion,
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
Flash Crowds And Denial of Service Attacks: Characterization and Implications for CDNs and Web Sites Aaron Beach Cs395 network security.
Drafting Behind Akamai (Travelocity-Based Detouring) Aleksandar Kuzmanovic Northwestern University Joint work with: A. Su, D. Choffnes, and F. Bustamante.
1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.
Topics in this presentation: The Web and how it works Difference between Web pages and web sites Web browsers and Web servers HTML purpose and structure.
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
Unconstrained Endpoint Profiling (Googling the Internet)‏ Ionut Trestian Supranamaya Ranjan Aleksandar Kuzmanovic Antonio Nucci Northwestern University.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
New Challenges in Cloud Datacenter Monitoring and Management
Web Analytics
Harini Sridharan Stephen Duraski.  Behavioral Targeting (BT) is a technique that uses a user’s web-browsing behavior to determine which ads to display.
1 Integrating a Network IDS into an Open Source Cloud Computing Environment 1st International Workshop on Security and Performance in Emerging Distributed.
MPlane – Building an Intelligent Measurement Plane for the Internet Maurizio Dusi – NEC Laboratories Europe NSF Workshop on perfSONAR.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
1. 1.Charting the CDNs(locating all their content and DNS servers). 2.Assessing their server availability. 3.Quantifying their world-wide delay performance.
1 One-Click Hosting Services: A File-Sharing Hideout Demetris Antoniades Evangelos P. Markatos ICS-FORTH Heraklion,
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
Towards Street-Level Client- Independent IP Geolocation Yong Wang, UESTC/Northwestern Daniel Burgener, Northwestern Marcel Flores, Northwestern Aleksandar.
Web Application Firewall (WAF) RSA ® Conference 2013.
Understanding Cross-site Linking in Online Social Networks Yang Chen 1, Chenfan Zhuang 2, Qiang Cao 1, Pan Hui 3 1 Duke University 2 Tsinghua University.
Grid-based Sensor Network Service on Future Internet By Mohammad Mehedi Hassan Student ID:
APNIC Update The state of IP address distribution and IPv6 deployment status Miwa Fujii Senior IPv6 Program Specialist APNIC.
Application of Content Computing in Honeyfarm Introduction Overview of CDN (content delivery network) Overview of honeypot and honeyfarm New redirection.
Understanding Crowds’ Migration on the Web Yong Wang Komal Pal Aleksandar Kuzmanovic Northwestern University
CDN: Content Distribution Networks  References:  CS613 textbook, “Computer Networking – A Top-Down Approach”, 6 th edition. Chapter  The text.
Grid-based Future Internet with Wireless sensor network By Mohammad Mehedi Hassan Student ID:
Studying Spamming Botnets Using Botlab 台灣科技大學資工所 楊馨豪 2009/10/201 Machine Learning And Bioinformatics Laboratory.
BEHAVIORAL TARGETING IN ON-LINE ADVERTISING: AN EMPIRICAL STUDY AUTHORS: JOANNA JAWORSKA MARCIN SYDOW IN DEFENSE: XILING SUN & ARINDAM PAUL.
Server Performance, Scaling, Reliability and Configuration Norman White.
Drafting Behind Akamai (Travelocity-Based Detouring) Dr. Yingwu Zhu.
Chapter Twelve Digital Interactive Media Arens|Schaefer|Weigold Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Globally Distributed Content Delivery Presenter: Baoning Wu 03/25/2003.
Content Distribution Network, Proxy CDN: Distributed Environment
Skynet: A Cloud-Based Data Transfer Architecture Aleksandar Kuzmanovic
The Internet. Important Terms Network Network Internet Internet WWW (World Wide Web) WWW (World Wide Web) Web page Web page Web site Web site Browser.
Introduction Web analysis includes the study of users’ behavior on the web Traffic analysis – Usage analysis Behavior at particular website or across.
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
1 DATA-DRIVEN SOLUTIONS. 2 KEYWORD-LEVEL SEARCH RETARGETING TARGET USERS BASED ON THEIR RECENT SEARCH HISTORY AND SEARCH QUERIES. A user performs a search.
Some from Chapter 11.9 – “Web” 4 th edition and SY306 Web and Databases for Cyber Operations Cookies and.
Drafting Behind Akamai (Travelocity-Based Detouring) Ao-Jan Su, David R. Choffnes, Aleksandar Kuzmanovic and Fabián E. Bustamante Department of EECS Northwestern.
4.01 How Web Pages Work.
4.01 How Web Pages Work.
4.01 How Web Pages Work.
Understanding The Cloud
PRESENTATION ON THE ONLINE GAMBLING - REGULATION 2 March 2012
DNS-sly: Avoiding Censorship through Network Complexity
Automated ad placement
Mr Veli Bhengu Product Manager: Broadband Services March
Geospatial Technology Evolution and Future Trends
Smart Portal To Protect Child Online
Content Distribution Networks
Web Mining Department of Computer Science and Engg.
Read this to find out how the internet works!
4.01 How Web Pages Work.
4.01 How Web Pages Work.
Unconstrained Endpoint Profiling (Googling the Internet)‏
Presentation transcript:

Understanding the Network and User-Targeting Properties of Web Advertising Networks Yong Wang 1,2 Daniel Burgener 1 Aleksandar Kuzmanovic 1 Gabriel Maciá-Fernández 3 1 UESTC (China) 2 Northwestern University (USA) 3 University of Granada (Spain)

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks “Online advertising is a $20 billion industry that is growing rapidly… It has become an integral and inseparable part of the World Wide Web” Motivation However, neither public auditing nor monitoring mechanisms still exist in this emerging area

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Contributions We present our initial efforts on building a network and content-level auditing service for Web-based ad networks. Such an ad auditing service can effectively monitor and regulate ad industry. –Firstly, it helps potential new advertisers/publishers in the decision of choosing commissioners which better meet their requirements. –Secondly, it allows commissioners to evaluate their own networks, with the aim of detecting potential design flaws and points of failure with reduced quality of service.

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Publisher Background 1) Uploads ads 2) Works with 4) Fetches web page 5) Sends web pages’s content and scripts for ad 7) Sends ad (and cookies) 3) Provides scripts Advertiser Commissioner End user … Commissioner’s Ad servers 6) Fetches ad (and send cookies)

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Outline Charting Online Advertising Network Infrastructure Network-Level Performance Content-Level Performance

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Outline Charting Online Advertising Network Infrastructure –Evaluation Platform –Candidates Selection –Finding Canonical Names –Mapping CNames to IP Addresses –Mapping IP Addresses to Locations Network-Level Performance Content-Level Performance

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Evaluation Platform Open Recursive DNSPlanetLab RegionCountriesServers % of total CountriesServers % of total N. America Europe Asia S. America Oceania Africa Total

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Candidates Selection √ √ √

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Finding Canonical Names AOL-Adtech : a627.g.akamai.net, a973.g.akamai.net, e1611.c.akamaiedge.net AOL-Tacoda : a1131.g.akamai.net, a1406.g.akamai.net, e922.p.akamaiedge.net AOL- Advertising:a949.g.akamai.net, a957.g.akamai, a1539.g.akamai.net, a1626.g.akamai.net, e1066.c.akamaiedge.net CommissionersSub-companiesRepresentative domains Google Google-Googlepagead.1.google.com Google-Doubleclickpagead.1.doubleclick.net AOL/ Akamai AOL-Adtech3 AOL- Adsonara950.g.akamai.net AOL- Tacoda3 AOL- Advertising5 Adbladeweb.adblade.com

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Mapping CNames to IP Addresses Commissioners # of IP Ad content serversAd DNS servers Google3066 AOL/Akamai Adblade12 The difference of the discovery capacity between two platforms 286 ÷306 = 93.5%

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Mapping IP Addresses to Locations Region # of IP GoogleAOL/AkamaiAdblade AdDNSAdDNSAdDNS N. America Europe Asia S. America Oceania Africa Unknowns Total

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Outline Charting Online Advertising Network Infrastructure Network-Level Performance –Delay Performance –Ad vs. Publisher Networks Content-Level Performance

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Delay Performance Delay for ad content servers AOL/Akamai > Google > Adblade Ad content servers > Ad DNS servers Delay for ad DNS servers

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Commissioner Ad vs. Publisher Networks Ad network is worse Ad network is better In CDN case, Google-Google ≈ Publisher network In No-CDN case, Google-Google > Publisher network

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Commissioner Ad vs. Publisher Networks In CDN case, AOL-Adsonar ≈ Publisher network In No-CDN case, AOL-Adsonar > Publisher network Ad network is worse Ad network is better There exists no internal mechanism within a CDN to recognize and correct such anomalies.

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Commissioner Ad vs. Publisher Networks Ad networks is worse Ad networks is better In CDN case, Adblade < Publisher network In No-CDN case, Adblade > Publisher network Ad network is worse Ad network is better

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Commissioner Ad vs. Publisher Networks The discrepancy between publishers’ and commissioners’ ad networks can be quite high. There exists no internal mechanism within a CDN to recognize and correct the huge discrepancy between commissioner and publisher network, even if both are served by the same CDN.

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Outline Charting Online Advertising Network Infrastructure Network-Level Performance Content-Level Performance –Distribution Mechanisms –Location-Based Advertising –Behavioral Targeting

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Distribution Mechanisms (Similarity) Google-Google has a large pool of ads and distributes different ads into different servers.

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Distribution Mechanisms (Similarity) Adblade has a smaller pool of ads and puts all of them in the same machine (or a cluster of machines).

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Distribution Mechanisms (Similarity) AOL uses different pools of ads depending on the location of the servers

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Distribution Mechanisms Regional similarities in AOL-Adsonar AOL uses finer-grained location-based advertising, e.g., city- level advertising, in the U.S.

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Location-Based Advertising Percentage of vantage points observing location-based ads Three commissioners deploy location-based advertising at various levels of granularity CommissionerCity (%)State (%)No info (%) Google AOL-Adsonar Adblade Google > Adblade > AOL CDN-based commissioners lag behind others in achieving finer-grained location-based advertising.

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Behavioral Targeting Percentage increase of observed ’sport’ related ads when behavioral targeting is enabled (’local/uniform cookie’) compared with disabled (’no cookie’) CommissionerLocal cookie (%)Uniform cookie (%) Google253 AOL-Adsonar135 Adblade00 Data-center oriented commissioners are capable of collecting user profiles and applying behavioral targeting more effectively Both Google and AOL-Adsonar associate a user profile only with an ad server close to this user Establish baseline (disable cookies, and access all websites, which may not be related to sports) Establish browsing pattern (enable cookies, and only visit websites fit in the category “sports”) Local cookie (enable cookies, and access all websites, which may not be related to sports in order to determine the difference when behavioral targeting is used) Uniform cookie (enable cookies, and copy the cookies from one computer to all PL nodes, and then retrieving ads again to check whether profile data is stored locally or globally)

Yong Wang Understanding the Network and User-Targeting Properties of Web Advertising Networks Conclusions We deployed an ad auditing service that can be universally applied to arbitrary commissioners’ networks. Using this service, we performed an extensive network- and content-level analysis. Our findings bring useful auditing information to all entities involved in the online advertising business.

Thank You