An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.

Slides:



Advertisements
Similar presentations
Clayton Sullivan PEER-TO-PEER NETWORKS. INTRODUCTION What is a Peer-To-Peer Network A Peer Application Overlay Network Network Architecture and System.
Advertisements

Web Cache Characterizing Roles of Front-end Servers in End-to-End Performance of Dynamic Content Distribution  Li ZHANG  Dakuo WANG.
Pervasive Web Content Delivery with Efficient Data Reuse Chi-Hung Chi and Cao Yang School of Computing National University of Singapore
On the Effectiveness of Measurement Reuse for Performance-Based Detouring David Choffnes Fabian Bustamante Fabian Bustamante Northwestern University INFOCOM.
Measurement, Modeling and Analysis of a Peer-to-Peer File-Sharing Workload Krishna Gummadi, Richard Dunn, Stefan Saroiu Steve Gribble, Hank Levy, John.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
1 School of Computing Science Simon Fraser University, Canada Modeling and Caching of P2P Traffic Mohamed Hefeeda Osama Saleh ICNP’06 15 November 2006.
Spring 2003CS 4611 Content Distribution Networks Outline Implementation Techniques Hashing Schemes Redirection Strategies.
Cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )
The Internet Useful Definitions and Concepts About the Internet.
Measurement, Modeling, and Analysis of a Peer-to-Peer File sharing Workload Krishna P. Gummadi, Richard J. Dunn, Stefan Saroiu, Steven D. Gribble, Henry.
1 The Content and Access Dynamics of a Busy Web Server: Findings and Implications Venkata N. Padmanabhan Microsoft Research Lili Qiu Cornell University.
Web Caching Robert Grimm New York University. Before We Get Started  Interoperability testing  Type theory 101.
Exploiting Content Localities for Efficient Search in P2P Systems Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang 1 1 College of William and Mary,
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna.
Web Caching Robert Grimm New York University. Before We Get Started  Illustrating Results  Type Theory 101.
Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
The Medusa Proxy A Tool For Exploring User- Perceived Web Performance Mimika Koletsou and Geoffrey M. Voelker University of California, San Diego Proceeding.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Content Distribution Networks (CDNs) Mike Freedman COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
Content Distribution Network (CDN) Performance Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, Jan 16th.
Information-Centric Networks05a-1 Week 5 / Paper 1 On the use and performance of content distribution networks –Balachander Krishnamurthy, Craig Wills,
Towards Understanding Modern Web Traffic
1 Enabling Secure Internet Access with ISA Server.
Web Caching and Content Delivery. Caching for a Better Web Performance is a major concern in the Web Proxy caching is the most widely used method to improve.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
On the Use and Performance of Content Distribution Networks Balachander Krishnamurthy Craig Wills Yin Zhang Presenter: Wei Zhang CSE Department of Lehigh.
1 Caching  Temporary storage of frequently accessed data (duplicating original data stored somewhere else)  Reduces access time/latency for clients 
Content Distribution March 8, : Application Layer1.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
Global NetWatch Copyright © 2003 Global NetWatch, Inc. Factors Affecting Web Performance Getting Maximum Performance Out Of Your Web Server.
On the Scale and Performance of Cooperative Web Proxy Caching University of Washington Alec Wolman, Geoff Voelker, Nitin Sharma, Neal Cardwell, Anna Karlin,
Skype P2P Kedar Kulkarni 04/02/09.
CPSC 441: Multimedia Networking1 Outline r Scalable Streaming Techniques r Content Distribution Networks.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.
Web Caching and Content Distribution: A View From the Interior Syam Gadde Jeff Chase Duke University Michael Rabinovich AT&T Labs - Research.
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
CONTENT DELIVERY NETWORKS
Efficient P2P Search by Exploiting Localities in Peer Community and Individual Peers A DISC’04 paper Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang.
Empirical Quantification of Opportunities for Content Adaptation in Web Servers Michael Gopshtein and Dror Feitelson School of Engineering and Computer.
A P2P-Based Architecture for Secure Software Delivery Using Volunteer Assistance Purvi Shah, Jehan-François Pâris, Jeffrey Morgan and John Schettino IEEE.
Performance of Web Proxy Caching in Heterogeneous Bandwidth Environments IEEE Infocom, 1999 Anja Feldmann et.al. AT&T Research Lab 발표자 : 임 민 열, DB lab,
Implementing ISA Server Caching
Streaming and Content Delivery SECTIONS 7.4 AND 7.5.
Hot Systems, Volkmar Uhlig
Content Distribution Network, Proxy CDN: Distributed Environment
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
An Analysis of Internet Content Delivery Systems 19 rd November, 2007 Youngsub CSE, SNU.
#16 Application Measurement Presentation by Bobin John.
Content Distribution Networks (CDNs)
/ Fast Web Content Delivery An Introduction to Related Techniques by Paper Survey B Li, Chien-chang R Sung, Chih-kuei.
On the scale and performance of cooperative Web proxy caching 2/3/06.
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
Whole Page Performance Leeann Bent and Geoffrey M. Voelker University of California, San Diego.
Does Internet media traffic really follow the Zipf-like distribution? Lei Guo 1, Enhua Tan 1, Songqing Chen 2, Zhen Xiao 3, and Xiaodong Zhang 1 1 Ohio.
Presented by Michael Rainey South Mississippi Linux Users Group
Accelerating Peer-to-Peer Networks for Video Streaming
Ad-blocker circumvention System
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
On the Scale and Performance of Cooperative Web Proxy Caching
A Measurement Study of Napster and Gnutella
Content Distribution Networks + P2P File Sharing
The Internet and Electronic mail
Content Distribution Networks + P2P File Sharing
Presentation transcript:

An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of the 5th Symposium on Operating Systems Design and Implementation December 2002

Outline Goals of Paper Overview of Content Delivery Systems Experimental Methodology Results Caching Conclusions

Goals Quantify the increasing importance of novel content delivery systems Characterize the behavior of these systems from the perspectives of clients, objects, and servers Derive implications for caching in these systems

Content Delivery Systems HTTP Web Traffic Content Delivery Networks Akamai Peer-to-peer file sharing networks Gnutella Kazaa

HTTP Traffic Clients request objects from web servers using HTTP Most web objects are small, 5-10KB. Web object requests follow a Zipf-like distribution Caching Cache hit rate increases logarithmically with client population Impossible for dynamic content

Zipf Distribution Compared with Sun Log Data

Content Delivery Networks (CDNs) Dedicated collections of servers that are geographically distributed Provide static content, e.g. images, streaming video Allows user to access replica of content that is “close” Replica location done via DNS interposition or URL rewriting at origin servers Redirection adds overhead Reduces average download response time

Peer-to-Peer Systems Peers form a distributed system to exchange content Batch-style downloads Most peers have low-availability and limited network capacity Files transferred via direct connection between peers

Experiment Methodology Use passive network monitoring to collect trace of TCP traffic between University of Washington (UW) to rest of Internet Collected 9 days of data, over 20 TB

Some Interesting Observations UW is an HTTP content provider Exported TB. Imported 3.44 TB Bandwidth consumption (in+out).2% Akamai 6.04% Gnutella 14.3% WWW 36.9% Kazaa Rest is other TCP protocols: mail, streaming video/audio, etc.

Some More Interesting Observations Compared to 1999 study HTML traffic has decreased 43% GIF/JPG traffic has decreased 59% AVI/MPG traffic increased nearly 400% MP3 traffic increased nearly 300%

Objects Median P2P object size is 4MB. Median Web object is 2KB 5% of Kazaa objects are over 100MB Top 1% of Kazaa objects account for 50% of bytes transferred For Web, top 1% account for 16% of bytes transferred

Clients For both Web and Kazaa, small number of clients account for large portion of traffic In Web, top 200 clients (0.5% of the population) account for 13% of the traffic In Kazaa, top 200 clients (4% of the population) account for 50% of the traffic

Servers Would expect server load for Kazaa to be much more distributed than for WWW This is not the case: Top 500 external Web servers provide 22% of the bytes Top 500 external Kazaa servers provide 10% of the bytes

Scalability With respect to bandwidth cost: adding another 450 Kazaa clients would be equivalent to doubling the web client population (from 40,000 to 80,000)

CDN Caching Do CDNs provide any performance benefits over local proxy cache? If Akamai traffic were directed to proxy cache instead: 88% ideal object hit rate (all objects cacheable) 50% practical hit rate Conclusion: Widely deployed proxy caches reduce need for separate CDNs

P2P Caching Inbound cache byte hit rate = 35% Outbound cache byte hit rate = 85% Hit rate increases with client population 1,000 clients = 40% hit rate 500,000 clients = 85% hit rate Conclusion: Reverse P2P cache saves the most bandwidth

Conclusions P2P traffic accounts for majority of HTTP bytes transferred P2P objects are significantly larger than Web objects Small number of large objects account for a large percentage of P2P traffic Small number of clients and servers responsible for majority of P2P traffic P2P traffic creates significant bandwidth load