Exploring Cross-Application Cellular Traffic Optimization with Baidu TrafficGuard Zhenhua Li, Weiwei Wang, Tianyin Xu, Xin Zhong, Xiang-Yang Li, Yunhao.

Slides:



Advertisements
Similar presentations
The Internet and the Web
Advertisements

Justine Sherry*, Shaddi Hasan*, Colin Scott*, Arvind Krishnamurthy†,
Cost-Based Cache Replacement and Server Selection for Multimedia Proxy Across Wireless Internet Qian Zhang Zhe Xiang Wenwu Zhu Lixin Gao IEEE Transactions.
Fundamentals of Multimedia Part III: Multimedia Communications and Networking Chapter 15 : Network Services and Protocols for Multimedia Communications.
Pervasive Web Content Delivery with Efficient Data Reuse Chi-Hung Chi and Cao Yang School of Computing National University of Singapore
Cloud Download : Using Cloud Utilities to Achieve High-quality Content Distribution for Unpopular Videos Yan Huang, Tencent Research, Shanghai, China Zhenhua.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
Optimize tomorrow today. TM 1 Optimize tomorrow today. Arlene Minkiewicz, Chief Scientist PRICE Systems, LLC Software.
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
V1.00 © 2009 Research In Motion Limited Introduction to Mobile Device Web Development Trainer name Date.
2/24/2000 Network Performance Effects on HTTP/1.1, CSS1, and PNG Luis Rivera Henrik Frystyk Nielsen : W3C James Gettys : DEC Anselm Baird-Smith: W3C.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
 Proxy Servers are software that act as intermediaries between client and servers on the Internet.  They help users on private networks get information.
Timecard: Controlling User-Perceived Delays in Server-Based Mobile Applications Lenin Ravindranath, Jitu Padhye, Ratul Mahajan, Hari Balakrishnan.
COMPUTER TERMS PART 1. COOKIE A cookie is a small amount of data generated by a website and saved by your web browser. Its purpose is to remember information.
Towards Understanding Modern Web Traffic
1 Enabling Secure Internet Access with ISA Server.
22-Aug-15 | 1 |1 | Help! I need more servers! What do I do? Scaling a PHP application.
Justine Sherry*, Shaddi Hasan*, Colin Scott*, Arvind Krishnamurthy†,
Company Presentation September 15, CONFIDENTIAL.
Supporting Strong Cache Coherency for Active Caches in Multi-Tier Data-Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan, S. Krishnamoorthy,
File Formats COM 366 Web Design & Layout. Native file format –Format native to software program –.psd > PhotoShop default Preserves layers –Use “Save.
Designing for the Unknown. Challenges of Web Design As frustrating as it may be – there is no guarantee that people will see/experience your web pages.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
Databases and the Internet. Lecture Objectives Databases and the Internet Characteristics and Benefits of Internet Server-Side vs. Client-Side Special.
Adobe Dreamweaver CS5 Introduction Web Site Development and Adobe Dreamweaver CS5.
GIS technologies and Web Mapping Services
1 March 24, 2011 Smartphones Can Assist Efficient Use of Network Resources Ömer Mubarek Senior Member of Technical Staff Advanced Technology, Research.
A Transcoding Proxy for HTML Web Pages: Web Page Sampling and Conversion Evaluation. Andrew Stone CS525m.
Adapting web pages for mobile devices Fredrik Andersson Hannes Nevalainen.
03/09/2003Helsinki University of Technology1 Overview of Thesis Topic Presented By: Zhao Xuetao.
1 A Comparative Study of Handheld and Non-Handheld Traffic in Campus Wi-Fi Networks Aaron Gember, Ashok Anand, and Aditya Akella University of Wisconsin—Madison.
T-CloudDisk: A Tunable Cloud Storage Service for Flexible Batched Synchronization Zhenhua Li *, Tsinghua University He Xiao, Tsinghua University Linsong.
AjaxScope & Doloto: Towards Optimizing Client-side Web 2.0 App Performance Ben Livshits Microsoft Research (joint work with Emre.
Timecard: Controlling User-Perceived Delays in Server-Based Mobile Applications Lenin Ravindranath, Jitu Padhye, Ratul Mahajan, Hari Balakrishnan.
Kiew-Hong Chua a.k.a Francis Computer Network Presentation 12/5/00.
1 Installing and Maintaining ISA Server Planning an ISA Server Deployment Understand the current network infrastructure. Review company security.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
How the Web Works Building a Website – Lesson 1. How People Access the Web Browsers People access websites using software called a web browser. To view.
Empirical Quantification of Opportunities for Content Adaptation in Web Servers Michael Gopshtein and Dror Feitelson School of Engineering and Computer.
Intel Research & Development ETA: Experience with an IA processor as a Packet Processing Engine HP Labs Computer Systems Colloquium August 2003 Greg Regnier.
On The Cooperation of Web Clients and Proxy Caches Yiu Fai Sit, Francis C.M. Lau, Cho-Li Wang Department of Computer Science The University of Hong Kong.
Web Design and Development. World Wide Web  World Wide Web (WWW or W3), collection of globally distributed text and multimedia documents and files 
CellSDN: Software-Defined Cellular Core networks Xin Jin Princeton University Joint work with Li Erran Li, Laurent Vanbever, and Jennifer Rexford.
Random Logic l Forum.NET l ASP.NET performance Forum.NET 5 th Meeting ● March 2006 By Yaniv Sharon.
IF STARBUCKS USED INSTART LOGIC, THEY WOULD HAVE A FASTER RESPONSIVE WEBSITE BY PETER BLUM.
Enhance Your Page Load Speed And Improve Traffic.
Computer Basics Introduction CIS 109 Columbia College.
Technology for Mobile Network Operators Business Case: Improve Overall Service Quality and Customer Experience While Dramatically Lowering Carrier Costs.
GENERAL SCALABILITY CONSIDERATIONS
University of Maryland College Park
HTTP and Abstraction on the Internet
CIIT-Human Computer Interaction-CSC456-Fall-2015-Mr
What is WWW? The term WWW refers to the World Wide Web or simply the Web. The World Wide Web consists of all the public Web sites connected to the Internet.
HTTP and Abstraction on the Internet
Vocabulary Prototype: A preliminary sketch of an idea or model for something new. It’s the original drawing from which something real might be built or.
Securing the Network Perimeter with ISA 2004
Vocabulary Prototype: A preliminary sketch of an idea or model for something new. It’s the original drawing from which something real might be built or.
EE 122: Network Applications
Video Transcoding for Wireless Video
Computer Communication & Networks
Evaluating Proxy Caching Algorithms in Mobile Environments
Edge computing (1) Content Distribution Networks
Web Design Designing for the Unknown.
Towards Web-based Delta Synchronization for Cloud Storage Services
Creating Images for the Web
HTTP and Abstraction on the Internet / The Need for DNS
Presentation transcript:

Exploring Cross-Application Cellular Traffic Optimization with Baidu TrafficGuard Zhenhua Li, Weiwei Wang, Tianyin Xu, Xin Zhong, Xiang-Yang Li, Yunhao Liu, Christo Wilson, and Ben Y. Zhao

China has the world’s largest population of mobile cellular users. Source:

China’s mobile users have notably small cellular data plan. 93.6% of China’s users have no more than 500MB of traffic plan per month. Source: iMedia Research 69.1% have no more than 200MB.

Traffic overage is pervasive in China. Severe (25.8%) Slight (33.3%) Not at all (23.7%) Balanced (15.1%) Unknown(2.1%) 59.1% of China’s cellular users overuse their data plan. Source: iMedia Research

Efforts towards optimizing mobile traffic Application-specific data compression 50+% of mobile traffic reduction with moderate latency penalty. Flywheel: Google’s data compression proxy (NSDI’15) Chrome browser Flywheel HTTP sites → transcode images to the WebP format → minify and gzip text content

Mobile web browser traffic is only a (small) portion of today’s cellular traffic.

Baidu TrafficGuard Optimizing cross-app cellular traffic - data compression - content validation - cross-app caching Filtering out unwanted traffic - malicious/ads/overnight traffic Two years of deployment experience - 10 millions of users & 0.2 millions of active users daily - support all Android 4.0+ devices

Backend Proxy Software middlebox + TG mobile app TrafficGuard Overview HTTP In China, HTTP requests dominate the mobile web requests (80%). ① Handle all cellular traffic across all apps ② Filter out unwanted traffic (e.g., malicious) GET / RESP

TG mobile app (client-side support) Customized VPN - based on TUN - manipulate layer-3 packets at user space - no root privilege required Cooperation w/ the backend - Local filter for malicious links - Value-based cross-app caching Apps TCP/IP TUN NIC Cellular network User space Kernel

Backend: web proxy + software middleboxes Traffic filtering* Content validation Cross-app caching* Data compression Traffic reduction mechanisms *cooperate w/ the client-side TG mobile app

Backend: web proxy + software middleboxes Traffic filtering* Content validation Cross-app caching* Data compression Traffic reduction mechanisms *cooperate w/ the client-side TG mobile app

Data compression Characteristics of HTTP content types Content Type% HTTP Requests% HTTP Traffic Size (KB) Median Mean Text49.0%15.7% Image32.0%71.0% Octet-stream10.0%5.5% Zip8.1%5.1% Audio & video0.03%2.6% Other0.87%0.1% *Text: HTML, CSS, JSON, XML, JS, etc

Text compression Text compression is not worthwhile. → very small in size (mean: 0.2KB) → reduce HTTP traffic by 1.4% → considerable computation overhead at both client and cloud side. TrafficGuard does not compress text objects.

Image compression 71% of HTTP traffic are images, and 40% of these images are oversized. Challenges: → cannot transcode all the images to WebP (Flywheel’s approach) because not all existing apps support WebP → cannot use WebP as the transfer format as the client-side transcoding overhead is not acceptable → cannot simply rescale images which may distort UI layout and degrade UX

Image compression 71% of HTTP traffic are images, and 40% of these images are oversized. Challenges: → cannot transcode all the images to WebP (Flywheel’s approach) because not all existing apps support WebP → cannot use WebP as the transfer format as the client-side transcoding overhead is not acceptable → cannot simply rescale images which may distort UI layout and degrade UX

Image compression 71% of HTTP traffic are images, and 40% of these images are oversized. Challenges: → cannot transcode all the images to WebP (Flywheel’s approach) because not all existing apps support WebP → cannot use WebP as the transfer format as the client-side transcoding overhead is not acceptable → cannot simply rescale images which may distort UI layout and degrade UX

Image compression Adjusting the quality of the images by tuning QF (Quality Factors) of JPEG images QF3015 JPEG: 500× KB (88%) 84KB (75%) 70KB (61%) 48KB (42%) 114KB (100%) Median QF = 80

Image compression Adjusting the quality of the images by tuning QF (Quality Factors) of JPEG images → categorize images into Large, Medium, Small, & Tiny - based on the Ziproxy standard → tune QF to balance compression ratio and quality - quality is measured by SSIM (Structural Similarity) → transcode PNG & GIF to JPEG if possible times smaller overhead than WebP transcoding % of the images are in JPEG

Image compression Adjusting the quality of the images by tuning QF (Quality Factors) of JPEG images → categorize images into Large, Medium, Small, & Tiny - based on the Ziproxy standard → tune QF to balance compression ratio and quality - quality is measured by SSIM (Structural Similarity) → transcode PNG & GIF to JPEG if possible times smaller overhead than WebP transcoding % of the images are in JPEG

Image compression Adjusting the quality of the images by tuning QF (Quality Factors) of JPEG images → categorize images into Large, Medium, Small, & Tiny - based on the Ziproxy standard → tune QF to balance compression ratio and quality - quality is measured by SSIM (Structural Similarity) → transcode PNG & GIF to JPEG if possible % of the images are in JPEG times smaller overhead than WebP transcoding 25.4% of traffic is reduced by such image compression!

Traffic filtering Two-lever filtering cooperated by both the client- side mobile app and the backend 29M malicious links 102M ads links 1M links most frequently visited 72%~78% malicious & ad links are filtered out locally.

Total cellular traffic reduction 9.95% 0.93%0% 2.72% 25.35% 5.16% 0%

Per-user traffic saving Saving ratio of users’ cellular traffic CDF Reduce overage instances by 10.7 times!! 55% of our users save over a quarter 20% of our user save over a half

Latency penalty & battery overhead End-to-end latency penalty - worst case: 474ms - mean: 282ms & median: 53ms Battery consumption - negligible (93mW on average) - sometimes reduce battery consumption

Conclusion TrafficGuard achieves 36% of cellular traffic reduction across different apps without degrading user experiences. Baidu is working on integrating TrafficGuard with cellular carrier infrastructure, which can further reduce end-to-end latency.

Cross-app value-based caching HTTP GET Clients sync value tables with the backend on an hourly basis Content cache: 50MB + Content cache Value table VBWC User apps Value table …… VBWC Hit

From the requesting app’s perspective, the broken images appear to be missing due to a network error and thus are handled as such. Content validation High portion (10.6%) of broken images - cannot be decoded by any image decoders - wasting 3.2% of all image traffic - sending a “Broken Warning” response to the users

Comparison to other redundancy elimination (RE) approaches Take EndRE (end-to-end RE) as an example better (1.2+% more) traffic saving × higher implementation complexity × higher memory requirements × higher consistency requirement Value-based caching is a simpler & cheaper solution.