Content Delivery Networks - Principles & Practice Northeastern& Akamai Technologies Ravi Sundaram.

Slides:



Advertisements
Similar presentations
CSE331: Introduction to Networks and Security Lecture 8 Fall 2002.
Advertisements

Akamai Content Delivery Network Slides from Bruce Maggs.
Amazon CloudFront An introductory discussion. What is Amazon CloudFront? 5/31/20122© e-Zest Solutions Ltd. Amazon CloudFront is a web service for content.
1 Server Selection & Content Distribution Networks (slides by Srini Seshan, CS CMU)
The Challenges of Delivering Content on the Internet Tom Leighton Chief Scientist Akamai Technologies.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
Engineering a Content Delivery Network COMPSCI 214 Computer Networks and Distributed Systems Bruce Maggs.
Engineering a Content Delivery Network Bruce Maggs.
Experience with some Principles for Building an Internet-Scale Reliable System Mike Afergan (Akamai and MIT) Joel Wein (Akamai and Polytechnic University,
Spring 2003CS 4611 Content Distribution Networks Outline Implementation Techniques Hashing Schemes Redirection Strategies.
EEC-484/584 Computer Networks Lecture 6 Wenbing Zhao
Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP.
Internet Content Providers End Users The Internet: Simple on the Outside…
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
The Internet Useful Definitions and Concepts About the Internet.
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #13 Web Caching Protocols ICP, CARP.
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
Anycast Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.
ECOMMERCE TECHNOLOGY SUMMER 2002 COPYRIGHT © 2002 MICHAEL I. SHAMOS eCommerce Technology Web Content Delivery.
Evaluation of the Proximity between Web Clients and their Local DNS Servers Z. Morley Mao UC Berkeley C. Cranor, M. Rabinovich,
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Web Content Delivery Networks Yogesh Bhumralkar. CDN: Motivations zCongestion in the Internet. zWeb Servers sometimes become overloaded due to too many.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Information-Centric Networks05a-1 Week 5 / Paper 1 On the use and performance of content distribution networks –Balachander Krishnamurthy, Craig Wills,
Content Distribution Networks CPE 401 / 601 Computer Network Systems Modified from Ravi Sundaram, Janardhan R. Iyengar, and others.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
On the Use and Performance of Content Distribution Networks Balachander Krishnamurthy Craig Wills Yin Zhang Presenter: Wei Zhang CSE Department of Lehigh.
CSE 534 – Fundamentals of Computer Networks Lecture 11: Content Delivery Networks (Over 1 billion served … each day) Based on slides by D. NEU.
 Zhichun Li  The Robust and Secure Systems group at NEC Research Labs  Northwestern University  Tsinghua University 2.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
1 Chapter 6: Proxy Server in Internet and Intranet Designs Designs That Include Proxy Server Essential Proxy Server Design Concepts Data Protection in.
Global Internet Content Delivery Akamai Technologies and Carnegie Mellon University Bruce Maggs.
Akamai vs. Flash Crowds and Distributed Denial of Service Akamai Technologies & Carnegie Mellon Bruce Maggs.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
How Akamai Handles Large Events Bruce Maggs Carnegie Mellon Duke Akamai Technologies.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Lesson 1 What Is the World Wide Web?. Objectives Upon completion of this lesson, you should be able to: Explain what the World Wide Web is and how it.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
CONTENT DELIVERY NETWORKS
TCP/IP (Transmission Control Protocol / Internet Protocol)
DYNAMIC LOAD BALANCING ON WEB-SERVER SYSTEMS by Valeria Cardellini Michele Colajanni Philip S. Yu.
Setup and Management for the CacheRaQ. Confidential, Page 2 Cache Installation Outline – Setup & Wizard – Cache Configurations –ICP.
Content Distribution Network, Proxy CDN: Distributed Environment
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Overlay Networks : An Akamai Perspective
Information-Centric Networks Section # 5.1: Content Distribution Instructor: George Xylomenos Department: Informatics.
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
Hiearchial Caching in Traffic Server. Hiearchial Caching  A set of techniques and mechanisms to increase the size and performance of network caches.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 6: Planning, Configuring, And Troubleshooting WINS.
Content Distribution Networks (CDNs)
Engineering a Content Delivery Network Bruce Maggs.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
CIS679: Anycast r Review of Last lecture r Network-layer Anycast m Single-path routing for anycast messages r Application-layer anycast.
WHAT'S THE DIFFERENCE BETWEEN A WEB APPLICATION STREAMING NETWORK AND A CDN? INSTART LOGIC.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
Content Distribution Networks
Engineering a Content Delivery Network
Practical Censorship Evasion Leveraging Content Delivery Networks
Distributed Content in the Network: A Backbone View
IS4680 Security Auditing for Compliance
Content Distribution Networks
AWS Cloud Computing Masaki.
Engineering a Content Delivery Network
EE 122: Lecture 22 (Overlay Networks)
AKAMAI Content Delivery Services
Engineering a Content Delivery Network
The Evolution of a Content Delivery Network: A 21-Year Perspective
Presentation transcript:

Content Delivery Networks - Principles & Practice Northeastern& Akamai Technologies Ravi Sundaram

Outline CDNs - Review of mechanicsCDNs - Review of mechanics FirstPoint - Traffic Management for mirrored websitesFirstPoint - Traffic Management for mirrored websites

Internet Content Providers End Users The Web: Simple on the Outside…

NAP UUNet Qwest AOL Network Providers Content Providers End Users Peering Points …But Problematic on the Inside

Why does my click not work Latency - Browser takes a long time to load the pageLatency - Browser takes a long time to load the page Packet Loss - Browser hangs, user needs to hit refreshPacket Loss - Browser hangs, user needs to hit refresh Jitter - Streams are jerkyJitter - Streams are jerky Server load - Browser connects but does not fully load the pageServer load - Browser connects but does not fully load the page Broken/missing contentBroken/missing content

The Akamai Solution Servers at Network Edge Content Providers End Users NAP

3 Content Provider’s Web Server DNS 1 Downloading - before CDNs User enters enters Browser requests IP address for requests IP address for Browser requests embedded objectsBrowser requests embedded objects Content provider’s web server returns HTMLContent provider’s web server returns HTML Browser requests HTMLBrowser requests HTML DNS returns IP addressDNS returns IP address Browser obtains IP addresses for hostnames listed in URLs of objects embedded on pageBrowser obtains IP addresses for hostnames listed in URLs of objects embedded on page Content provider’s web server returns embedded objectsContent provider’s web server returns embedded objects

DNS Resolution Browser’s Cache 1 OS 2 Local Name Server 3.com.net Root (InterNIC) 4 xyz.com DNS Servers TTL: 1 Day TTL: 30 Minutes

Origin - Content Provider’s Web Server Delivery of Whole Site 6 6. Browser obtains content from optimal Akamai server DNS 1. Browser requests DNS for IP of DNS returns IP of optimal Akamai server 5. Akamai server assembles page, contacting origin as needed 5 4. Browser requests Akamai server for content 4 2. DNS follows CNAME redirect to 2

Delivery of Whole Site - DNS Redirect DNS CNAME RECORD CNAME 2D

Delivery of Whole Site - Page Assembly Site owners create container pages that can be populated with varying content Container Page [TTL=5d] [XYZ news, content, promotions, etc. TTL=5d] [Breaking headlines TTL=2h] [TTL=15m] [TTL=8h]

Benefits of CDNs Improved end-user experienceImproved end-user experience -reduce latency -reduce loss -reduce jitter Reduced network congestionReduced network congestion Increased scalabilityIncreased scalability Improved fault-toleranceImproved fault-tolerance Reduced vulnerabilityReduced vulnerability Reduced costsReduced costs

Outline CDNs - Review of mechanicsCDNs - Review of mechanics FirstPoint - Traffic Management for mirrored websitesFirstPoint - Traffic Management for mirrored websites

What is FirstPoint Traffic management system for mirrored websitesTraffic management system for mirrored websites Directs browser to the optimal mirrorDirects browser to the optimal mirror DNS basedDNS based Application level anycastApplication level anycast

Why FirstPoint Content providers have mirrored websitesContent providers have mirrored websites Content providers only want to offload embedded contentContent providers only want to offload embedded content -Control -Security -Performance

Mapping Problem How to improve user experience?

What is the Mapping Problem Problem of directing requests to servers so as to optimize end-user experienceProblem of directing requests to servers so as to optimize end-user experience -reduce latency -reduce loss -reduce jitter Assumption - servers are fine Assumption - servers are fine Applicable to 2 mirrors or 1500 Akamai locationsApplicable to 2 mirrors or 1500 Akamai locations

Attempt Measure which is closerMeasure which is closer -Closeness changes over time Measure frequentlyMeasure frequently -Bothers people -Too many to do ~500,000 unique nameservers on any given day 10 sec per measurement cycle

Idea TopologyTopology -relatively static -changes in BGP time -order of hours if not days CongestionCongestion -dynamic -changes in round-trip time -order of milliseconds

Topology Discovery - Proxy points Cluster X Y

Aliasing Router fabrics using HSRP (hot stand-by routing protocol)Router fabrics using HSRP (hot stand-by routing protocol) -correlate over time Routers with multiple interfacesRouters with multiple interfaces -source address of UDP/ICMP packets

Set cover Let sets represent proxy pointsLet sets represent proxy points Let elements represent nameserversLet elements represent nameservers Find minimum collection of proxy points covering nameserversFind minimum collection of proxy points covering nameservers X covers 1, 2, 3 and 4 X 1 234

Topology Discovery At each mirror maintain list of partial paths to nameserversAt each mirror maintain list of partial paths to nameservers At each epoch extend paths by 1, in randomized fashion, and exchange with other mirrorAt each epoch extend paths by 1, in randomized fashion, and exchange with other mirror If the two (partial) paths to a namerver have intersected then declare that nameserver done.If the two (partial) paths to a namerver have intersected then declare that nameserver done. If path has reached forbidden IP then waitIf path has reached forbidden IP then wait Use pair of proxies in case of failureUse pair of proxies in case of failure

Topology Discovery - Proxy points Data exchange

Topology Discovery 500,000 nameservers 500,000 nameservers reduced to 90,000 proxy points (clusters)

Histogram of cluster sizes

Congestion Measurement Problem - Still too many measurements to do. 90,000 measurements every 10s with 32B packets requires a few Mbps per mirror. Problem - Still too many measurements to do. 90,000 measurements every 10s with 32B packets requires a few Mbps per mirror. Solution - Importance based sampling Solution - Importance based sampling

CDF of End-user Load

Load Estimation 500,000 nameservers reduced to 90,000 clusters 90,000 clusters 7,000 account for 95% end-user load!

Mapping Problem - Solved! Maps built every 10s

FirstPoint Customers - how to tell?Customers - how to tell? -look for CNAME to akadns.net Customers - who?Customers - who? -High traffic content providers -Yahoo!, Microsoft, TicketMaster etc Price - don’t ask :)Price - don’t ask :) Competitors - whoCompetitors - who -one-of-a-kind service -boxes: Cisco, F5, Foundry

FirstPoint - other aspects Load-balancingLoad-balancing -estimate-based -feedback-based : https, snmp -cost-based: 95/5 Fast cutout in case of failoverFast cutout in case of failover Highly fault-tolerantHighly fault-tolerant -hardware duplication, leader election -overlay routing, BGP-based anycast Integration with other servicesIntegration with other services -DOS/Load failover

Microsoft

Related Work TopologyTopology -Spring, Mahajan, Wetherall, Sigcomm ‘02 -Govindan, Tangmunarunkit, Infocom ‘00 ClusteringClustering -Krishnamurthy, Wang, Sigcomm ‘00 -Bezstavros, Mehrotra, WWC ‘01 -Barford, Gast, Globecom 02 ClusteringClustering -Shaikh, Tewari, Agrawal, Infocom ‘00 -Krishnamurthy, Wills, Zhang, Sigcomm IMW ‘01

Patents (pending) Global load balancing across mirrored data centers. Utility # Global load balancing across mirrored data centers. Utility # Method for predicting file download time from mirrored data centers in a global computer network. Utility # Method for predicting file download time from mirrored data centers in a global computer network. Utility # Method for generating a network map. Utility # Method for generating a network map. Utility # Method and system for protecting websites from public Internet threats. Filed 15 July 2002Method and system for protecting websites from public Internet threats. Filed 15 July 2002

Principles Open design principleOpen design principle -You need all the help you can get -Do not eliminate the obvious without trying first -Give serendipity a chance Scaling principleScaling principle -factor 10 difference means different domain -different domains need different techniques The common case principleThe common case principle -Zipf law is your friend -things cluster -optimize the common case

Conclusion The Internet will never be fast enough in all placesThe Internet will never be fast enough in all places People will want access to the Internet all the time and everywherePeople will want access to the Internet all the time and everywhere