On the Use and Performance of Content Distribution Networks

Slides:



Advertisements
Similar presentations
On the Use and Performance of Content Distribution Networks Yin Zhang Joint work with Balachander Krishnamurthy and Craig Wills AT&T Labs Research, WPI.
Advertisements

CHAPTER 15 WEBPAGE OPTIMIZATION. LEARNING OBJECTIVES How to test your web-page performance How browser and server interactions impact performance What.
TCP/IP Protocol Suite 1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 22 World Wide Web and HTTP.
Amazon CloudFront An introductory discussion. What is Amazon CloudFront? 5/31/20122© e-Zest Solutions Ltd. Amazon CloudFront is a web service for content.
1 Server Selection & Content Distribution Networks (slides by Srini Seshan, CS CMU)
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
EEC-484/584 Computer Networks Lecture 6 Wenbing Zhao
Hypertext Transfer Protocol Kyle Roth Mark Hoover.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
Reliable Distributed Systems
Design, Implementation and Evaluation of a Client Characterization Driven Web Server Balachander Krishnamurthy – AT&T Research Labs Balachander Krishnamurthy.
Drafting Behind Akamai (Travelocity-Based Detouring) Aleksandar Kuzmanovic Northwestern University Joint work with: A. Su, D. Choffnes, and F. Bustamante.
1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
The Medusa Proxy A Tool For Exploring User- Perceived Web Performance Mimika Koletsou and Geoffrey M. Voelker University of California, San Diego Proceeding.
Content Delivery Networks. History Early 1990s sees 100% growth in internet traffic per year 1994 o Netscape forms and releases their first browser.
Web Content Delivery Networks Yogesh Bhumralkar. CDN: Motivations zCongestion in the Internet. zWeb Servers sometimes become overloaded due to too many.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Content Delivery Networks (CDN) Dr. Yingwu Zhu Reverse Proxy Reverse Proxy Reverse Proxy Intranet Web Cache Architecure Browser Local ISP cache L4 Switch.
Content Distribution Network (CDN) Performance Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, Jan 16th.
Information-Centric Networks05a-1 Week 5 / Paper 1 On the use and performance of content distribution networks –Balachander Krishnamurthy, Craig Wills,
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Caching and Content Distribution Networks. Some Interesting Observations r Top 1 % of all documents account for 20% - 35% of proxy requests r Top 10%
On the Use and Performance of Content Distribution Networks Balachander Krishnamurthy Craig Wills Yin Zhang Presenter: Wei Zhang CSE Department of Lehigh.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
SMUCSE 4344 application layer. SMUCSE 4344 application vs. application-layer protocols application-layer protocol is just one piece –how the end hosts.
Server tools. Site server tools can be utilised to build, host, track and monitor transactions on a business site. There are a wide range of possibilities.
CDN Brokering* Presented By Nick Arnold Authors Alexandros Biliris, et. Al.
CPSC 441: Multimedia Networking1 Outline r Scalable Streaming Techniques r Content Distribution Networks.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
Making the Best of the Best-Effort Service (2) Advanced Multimedia University of Palestine University of Palestine Eng. Wisam Zaqoot Eng. Wisam Zaqoot.
The Intranet.
Drafting Behind Akamai (Travelocity-Based Detouring) Dr. Yingwu Zhu.
Globally Distributed Content Delivery Presenter: Baoning Wu 03/25/2003.
Content Distribution Network, Proxy CDN: Distributed Environment
Ming-Chen Zhao, Paarijaat Aditya, Yin Lin Andreas Haeberlen, Peter Druschel, Bruce Maggs, and William Wishon A First Look at a Hybrid Content Delivery.
On the Effect of Server Adaptation for Web Content Delivery IMW ’ 02, Marseille, Nov Joint work with Balachander Krishnamurthy (AT&T) Craig Wills.
Information-Centric Networks Section # 5.1: Content Distribution Instructor: George Xylomenos Department: Informatics.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
An Analysis of Internet Content Delivery Systems 19 rd November, 2007 Youngsub CSE, SNU.
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
Whole Page Performance Leeann Bent and Geoffrey M. Voelker University of California, San Diego.
19 – Multimedia Networking
Tiny http client and server
The Intranet.
How HTTP Works Made by Manish Kushwaha.
DNS-sly: Avoiding Censorship through Network Complexity
Content Distribution Networks
Caching Temporary storage of frequently accessed data (duplicating original data stored somewhere else) Reduces access time/latency for clients Reduces.
Ad-blocker circumvention System
Internet and Intranet.
The Internet.
Practical Censorship Evasion Leveraging Content Delivery Networks
Utilization of Azure CDN for the large file distribution
Co* Projects : CoDNS, CoDeploy, CoMon
Internet and Intranet.
ECE 671 – Lecture 16 Content Distribution Networks
Content Distribution Networks
Multimedia and Networks
Content Distribution Networks
Internet and Intranet.
Web Server Technology Unit 10 Website Design and Development.
Content Delivery and Remote DNS services
EE 122: Lecture 22 (Overlay Networks)
Internet and Intranet.
AKAMAI Content Delivery Services
Presentation transcript:

On the Use and Performance of Content Distribution Networks ACM SIGCOMM Internet Measurement Workshop

Motivation What is a CDN? State of CDNs A network of servers delivering content on behalf of an origin site State of CDNs A number of CDN companies E.g. Akamai, Digital Island, Speedera Used by many popular origin sites E.g., CNN, CNBC, … Little has been published on the use and performance of existing CDNs 11/02/2001 IMW'2001

Research Questions to Answer What CDN techniques are being used? What is the extent to which CDNs are being used by popular origin sites? What is the nature of CDN-served content? What methodology can be used to measure the relative performance of CDNs? What are specific CDNs performing both relative to origin servers and among themselves? This talk tries to answer them based on a large-scale, client-centric study conducted in Sept. 2000 and Jan. 2001 11/02/2001 IMW'2001

What CDN redirection techniques are being used? Techniques examined DNS redirection (DR) Full-site delivery (DR-F) Partial-site delivery (DR-P) URL rewriting (UR) Hybrid scheme (URDR) URL rewriting + DNS redirection Techniques NOT examined Manual hyperlink selection HTTP redirection Layer 4 switching Layer 7 switching CDN Server CDN Name Server Request/ Response CDN server IP CDN server name Origin Server Client 11/02/2001 IMW'2001

How widely are CDNs being used? Sources of data CDN use by popular sites Type Datasets Date/Duration Sites Periodic crawl HotMM127 2 months: Nov. & Dec. 2000 127 URL588-MM500 1030 Proxy log LMC 1 week in Sept. 2000 3 NLANR 1 week in Jan. 2001 9 Nov. 1999 1-2% out of ~600 [KW00] Dec. 2000 HotMM127: 31% (Akamai: 98%) URL588-MM500: 17% (Akamai: 85%) 11/02/2001 IMW'2001

Nature of CDN-served Content Daily change characteristics of CDN-served objects Nature of HTTP-requested CDN content Images account for 96-98% CDN-served objects, or 40-60% CDN-served bytes Akamai serves 85-98% CDN-served objects (bytes) Cache hit rates of CDN-served images are generally 20-30% higher than non-CDN served images Dataset HotMM127 URL588-MM500 #Objects 24.9K 75.0K Prev. seen URL 89% 86% Prev. seen URL w/ changes 2.2% 3.2% Change: No-cache directives, lmodtime changed/missing, MD5 change 11/02/2001 IMW'2001

Performance Study: Methodology Client Origin Server CDN Name Server CDN Server 1 2 3 Get CDN server IP address URL rewriting – first get CDN server name Warm up CDN cache Retrieve pages using “httperf” Parallel-1.0 – 4 HTTP/1.0 Serial-1.1 -- 2 persistent HTTP/1.1 Pipeline-1.1 – 1 pipelined HTTP/1.1 2 3 1 General Methodology: From N client sites periodically download pages from different CDNs and origin sites. 11/02/2001 IMW'2001

Content for Performance Study Challenge: Different CDNs have different customers. How to compare “apples” to “apples”? Solution: Canonical Pages Create template page based on distributions of the number and size of embedded images at popular sites In our study, we download 54 images and record download time for the first 6, 12, 18, 54 images. For each CDN, construct a canonical page with a list of image URLs currently served by the CDN from a single origin site, that closely match the sizes in the template page. 11/02/2001 IMW'2001

Measurement Infrastructure CDNs *AT&T ICDS NOT tested due to conflict of interest. Origin sites US: Amazon, Bloomberg, CNN, ESPN, MTV, NASA, Playboy, Sony, Yahoo International: 2 Europe, 2 Asia, 1 South America, 1 Australia Client sites 24 NIMI client sites in 6 countries NIMI: National Internet Measurement Infrastructure Well-connected: mainly academic and laboratory sites Technique DR-F DR-P UR URDR CDNs Adero Akamai, Speedera, Digital Island Clearway Fasttide 11/02/2001 IMW'2001

Response Time Results (I) Excluding DNS Lookup Time Cumulative Probability CDNs generally provide much shorter download time. 11/02/2001 IMW'2001

Response Time Results (II) Including DNS Lookup Time Cumulative Probability DNS overhead is a serious performance bottleneck for some CDNs. 11/02/2001 IMW'2001

Impact of Protocol Options and the Number of Images Mean Download Performance Range for Different Numbers of Images and Protocol Options (Jan. 2001) Protocol Option Site Mean Download Time Range (sec.) 6 images 12 images 18 images 54 images Parallel-1.0 CDN 0.26-0.76 0.40-1.23 0.58-1.53 1.49-3.31 US Origin 1.63 2.45 3.40 8.42 Serial-1.1 0.27-0.53 0.42-0.81 0.61-1.13 1.46-2.52 1.06 1.46 1.96 4.87 Pipeline-1.1 0.26-0.50 0.37-0.67 0.47-0.88 1.09-2.04 Partial Support CDNs perform significantly better than origin sites, although reducing the number of images (e.g. due to caching) and using HTTP/1.1 options reduces the performance difference. 11/02/2001 IMW'2001

Effectiveness of DNS Load Balancing A common practice in DNS redirection is using very small DNS TTLs. To examine how effective this is, we modified our basic test to include a fixed IP address for each CDN, which is looked up once every eight hours. We then compare the download performance for fixed IP address and the new IP address resolved each time. We can group our results into four categories, shown in different colors in this plot. Black means the download time for the new IP address is actually worse than the fixed IP address; Blue means, the new IP address has better download time, but if we add in the DNS lookup time, the new IP address gives worse total time. Green means the DNS returns the same IP, which means the DNS lookup time is avoidable if you use a large TTL. These three categories are all bad categories. The only good category is the RED one. It says even if you add in the DNS lookup time, the new address does better than the fixed address. It’s somewhat striking to see that at most of the time, you are in one of the bad categories. We have also looked at the distribution of the response time for fixed and new IP addresses. The fixed IP address almost always perform the better than new IP address with a small DNS TTL. These findings suggest that small DNS TTLs generally can not improve client download times. Small DNS TTLs generally do not improve download times. 11/02/2001 IMW'2001

Effectiveness of DNS Load Balancing (cont’d) Parallel-1.0 Download Performance for CDN Server at New and Fixed IP Addresses (Jan. 01) CDN (technique) Mean completion time (sec.) 90% completion time (sec.) New IP Fixed IP Adero (DR-F) 5.40 1.09 9.60 1.60 Akamai (DR-P) 1.15 1.00 3.05 3.00 Digisle (DR-P) 1.31 1.21 2.30 1.70 Fasttide (URDR) 2.10 1.46 4.72 3.25 Speedera (DR-P) 0.72 0.53 1.53 1.01 Small DNS TTLs generally do not improve download times in either average or worst case situations. 11/02/2001 IMW'2001

Ongoing Research: CDN Performance for Streaming Media Emerging content – streaming media Streaming media account for less than 1% CDN-served objects, but 14-20% CDN-served bytes Methodology Similar to the one for static images Streaming content examined ASF (Advanced Streaming Format) streamed over HTTP Canonical streaming media object Encoding rates: 38/100/300 Kbps Duration: 10 sec. (specified via HTTP headers) 11/02/2001 IMW'2001

CDN Performance For Streaming Media: Preliminary Results CDN Performance on Streaming Media: Mean DNS, First Byte, and Last Byte (relative to Target Delay of 10 sec) Delays CDN DNS (sec) First Byte (sec) Last Byte (sec) 38Kbps 100Kbps 300Kbps Akamai 0.42 0.83 1.08 1.01 1.18 Digisle 0.22 3.35 3.55 1.09 1.35 Intel 0.00 0.33 0.30 0.49 0.51 Navisite 0.11 0.28 0.45 0.44 0.54 Yahoo 0.13 0.32 0.52 0.50 0.68 11/02/2001 IMW'2001

Summary There is a clear increase in the number and percentage of popular origin sites using CDNs may have decreased subsequently … CDNs performed significantly better than origin sites, although caching and HTTP/1.1 options both reduce the performance difference Small DNS TTLs generally do not improve client download times in either average or worst case situations Our methodology can be extended to test CDN performance for delivering streaming media More streaming media results available in the TM version: http://www.research.att.com/~bala/papers/abcd-tm.ps.gz 11/02/2001 IMW'2001