How Akamai Handles Large Events Bruce Maggs Carnegie Mellon Duke Akamai Technologies.

Slides:



Advertisements
Similar presentations
Akamai Content Delivery Network Slides from Bruce Maggs.
Advertisements

CS 4700 / CS 5700 Network Fundamentals Lecture 15: Content Delivery Networks (Over 1 billion served … each day) Revised 10/22/2014.
4.01 How Web Pages Work.
Key Algorithms in a Content Delivery System Akamai Technologies and Carnegie Mellon University Bruce Maggs.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
Engineering a Content Delivery Network COMPSCI 214 Computer Networks and Distributed Systems Bruce Maggs.
Engineering a Content Delivery Network Bruce Maggs.
19 Historical overview Main challenge: How to distribute content in high quality over the Internet cost-effectively? • Traditional “Best-effort” model:
Internet Content Providers End Users The Internet: Simple on the Outside…
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
2/15/2011Harvard Bits1. 2/15/2011Harvard Bits2 US Telegraph “Network” in 1856.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Introduction 2: Internet, Intranet, and Extranet J394 – Perancangan Situs Web Program Sudi Manajemen Universitas Bina Nusantara.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
Content Delivery Networks. History Early 1990s sees 100% growth in internet traffic per year 1994 o Netscape forms and releases their first browser.
Tradeoffs in CDN Designs for Throughput Oriented Traffic Minlan Yu University of Southern California 1 Joint work with Wenjie Jiang, Haoyuan Li, and Ion.
AKAMAI Content Delivery Services AKAMAI Content Delivery Services CIS726 : PRESENTATION Avinash Ponugoti Avinash Ponugoti Nagarjuna Nagulapati Sathish.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Content Delivery Networks (CDN) Dr. Yingwu Zhu Reverse Proxy Reverse Proxy Reverse Proxy Intranet Web Cache Architecure Browser Local ISP cache L4 Switch.
Information-Centric Networks05a-1 Week 5 / Paper 1 On the use and performance of content distribution networks –Balachander Krishnamurthy, Craig Wills,
Asfandyar Qureshi (MIT) Rick Weber (Akamai) Hari Balakrishnan (MIT) John Guttag (MIT) Bruce Maggs (Duke/Akamai) cutting the electric bill for internet-
Efficient Content Distribution on Internet. Who pays for showing a Web page to a user? Receiving side –Users pay to small ISPs, who pay to big ISPs, who.
How the World Wide Web Works
The Role of Content Delivery Networks in Protecting Web Sites from Attacks Bruce Maggs VP for Research, Akamai Technologies.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Content Distribution March 8, : Application Layer1.
CSE 534 – Fundamentals of Computer Networks Lecture 11: Content Delivery Networks (Over 1 billion served … each day) Based on slides by D. NEU.
Consistent Hashing: Load Balancing in a Changing World
Content Overlays (Nick Feamster) February 25, 2008.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
Lecturer: Ghadah Aldehim
CPS 214 Computer Networks and Distributed Systems
Global Internet Content Delivery Akamai Technologies and Carnegie Mellon University Bruce Maggs.
Akamai vs. Flash Crowds and Distributed Denial of Service Akamai Technologies & Carnegie Mellon Bruce Maggs.
Lecture 10: 9/26/2002CS149D Fall CS149D Elements of Computer Science Ayman Abdel-Hamid Department of Computer Science Old Dominion University Lecture.
Protocol(TCP/IP, HTTP) 송준화 조경민 2001/03/13. Network Computing Lab.2 Layering of TCP/IP-based protocols.
Chongseok Park 1.  It is a global system of interconnected computer networks that use the standard Internet protocol suite and it serves billions.
Bellwether: Surrogate Services for Popular Content Duane Wessels & Ted Hardie NANOG 19 June 12, 2000.
UNIT 13 The World Wide Web.
CDN: Content Distribution Networks  References:  CS613 textbook, “Computer Networking – A Top-Down Approach”, 6 th edition. Chapter  The text.
YOU'RE GONNA BREAK THE INTERNET WITH THAT THING!.
1 UNIT 13 The World Wide Web Lecturer: Kholood Baselm.
Streaming and Content Delivery SECTIONS 7.4 AND 7.5.
Bruce Maggs Duke University Akamai Technologies Carnegie Mellon University delivering content to the next billion.
Globally Distributed Content Delivery Presenter: Baoning Wu 03/25/2003.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Content Delivery Networks in Flux Bruce Maggs Duke University Akamai Technologies.
The Internet What is the Internet? The Internet is a lot of computers over the whole world connected together so that they can share information. It.
Engineering a Content Delivery Network Bruce Maggs.
Md Tareq Adnan Centralized Approach : Server & Clients Slow content must traverse multiple backbones and long distances Unreliable.
Algorithms used by CDNs Stable Marriage Algorithm Consistent Hashing.
1 UNIT 13 The World Wide Web. Introduction 2 Agenda The World Wide Web Search Engines Video Streaming 3.
1 UNIT 13 The World Wide Web. Introduction 2 The World Wide Web: ▫ Commonly referred to as WWW or the Web. ▫ Is a service on the Internet. It consists.
HOW THE INTERNET WORKS. Introduction : The internet has brought revolutionary changes Has become a medium for interaction and information Can access to.
Consistent Hashing. Hashing E.g., h(x) = (((a x + b) mod P) mod |B|), where P is prime, P > |U| a,b chosen uniformly at random from Z P x is a serial.
Engineering a Content Delivery Network
Content Distribution Networks
Caching Temporary storage of frequently accessed data (duplicating original data stored somewhere else) Reduces access time/latency for clients Reduces.
Algorithmic Nuggets in Content Delivery
Direct Internet 3 Iridium Proprietary and Confidential 9/18/2018.
ECE 671 – Lecture 16 Content Distribution Networks
Content Distribution Networks
Edge computing (1) Content Distribution Networks
HTTP and Abstraction on the Internet / The Need for DNS
Internet and Web Simple client-server model
Engineering a Content Delivery Network
Unit-3.
AKAMAI Content Delivery Services
Engineering a Content Delivery Network
The Evolution of a Content Delivery Network: A 21-Year Perspective
Presentation transcript:

How Akamai Handles Large Events Bruce Maggs Carnegie Mellon Duke Akamai Technologies

Traffic Statistics 30,000+ domains 1.1 Tbps daily peak traffic 6,419 terabytes / day 274 billion hits / day 274 million unique client IP addresses / day in 2009 expect to deliver more bits than in combined

Network Deployment Servers 950+ Networks 67+ Countries POPs

Assigning Clients to Servers “High level”: Map client to a cluster based on client’s nameserver’s IP address. Algorithm: stable marriage with multi-dimensional hierarchical capacity constraints.“High level”: Map client to a cluster based on client’s nameserver’s IP address. Algorithm: stable marriage with multi-dimensional hierarchical capacity constraints. “Low level”: Assign client to specific server or servers within cluster based on content requested. Algorithm: consistent hashing.“Low level”: Assign client to specific server or servers within cluster based on content requested. Algorithm: consistent hashing.

Embedded Image Delivery (e.g., Amazon) <html><head> Welcome to xyz.com! Welcome to xyz.com! </head><body> <img src=“ Welcome to our Web site! Welcome to our Web site! Click here to enter Click here to enter </body></html> Embedded URLs are Converted to ARLs ak

End User Akamai DNS Resolution Akamai High-Level DNS Servers 4 ak.xyz.com 1 Browser’s Cache OS 2 Local Name Server Akamai Low-Level DNS Servers 6 ak.xyz.com select cluster select servers within cluster

Mapping Maps IP address of client’s name server and type of content being requested to an Akamai cluster.Maps IP address of client’s name server and type of content being requested to an Akamai cluster. Note: Doesn’t depend on content provider (although indicated by ak.xyz.com).Note: Doesn’t depend on content provider (although indicated by ak.xyz.com). Special cases: Akamai Accelerated Network Partners (AANPs)Special cases: Akamai Accelerated Network Partners (AANPs) General case: “Core Point” analysisGeneral case: “Core Point” analysis

Core Points Core point X is the first router at which all paths to nameservers 1, 2, 3, and 4 intersect.Core point X is the first router at which all paths to nameservers 1, 2, 3, and 4 intersect. Traceroute once per day from 300 clusters to 280,000 nameservers.Traceroute once per day from 300 clusters to 280,000 nameservers. X Akamai cluster 3 Akamai cluster 2 Akamai cluster 1

Core Points 280,000 nameservers (98.8% of requests) reduced to 30,000 core points 30,000 core points ping core points every 6 minutes

Akamai Cluster Servers pool resources RAM Disk Throughput

Hashing E.g., h(x) = (((a x + b) mod P) mod |B|), where P is prime, P > |U| a,b chosen uniformly at random from Z P x is a serial number (designates content provider) Universe U of all possible objects, set B of buckets. object: set of web objects with same serial number bucket: web server Hash function h: U  B Assigns objects to buckets

f(d) = d + 1 mod 5 Difficulty changing number of buckets bucket object f(d) = d + 1 mod 4

Consistent Hashing Idea: Map both objects and buckets to unit circle. object bucket Assign object to next bucket on circle in clockwise order. new bucket

How it’s really done a212: a213: a214: a215: random permutations of servers Why? To spread load for one serial number.

Strategies for Handling Large Events Flash crowds (no advance notice): Rely on rapid load balancing. Cross fingers.Flash crowds (no advance notice): Rely on rapid load balancing. Cross fingers. Planned events: “pre-warm” caches with objects to be served.Planned events: “pre-warm” caches with objects to be served.

Content Delivery on 9/11 (2001) Akamai’s network had capacity for all content providers requesting serviceAkamai’s network had capacity for all content providers requesting service Total bits served on September 11 was approximately 3.5 times normalTotal bits served on September 11 was approximately 3.5 times normal Traffic was higher on September 12Traffic was higher on September 12 (But not as high as January 7, 2002)(But not as high as January 7, 2002)

News Site A – Image Traffic

News Site A – Streaming

News Site B – HTML Traffic

News Site B – Image Traffic

News Site B – Streaming

Portal A – Image Traffic

Sports Site A – Image Traffic

Steve Jobs Keynote (2002)

Akamai live streaming infrastructure AAAAAA Reflectors Edge servers Source Trace format [Client IP, Player ID, Stream URL, Session start time, Session duration, Mean receiving bandwidth]

Extensive traces (2004) ~1,000,000 daily requests ~200,000 daily client IP addresses from over 200 countries ~1,000 daily streams ~1,000 edge servers ~Everyday, over a 3-month period

Audio vs. video Unknown 22% Audio 71% Video 7% Most streams are audio.

Stream types Non-stop (76%) vs. short duration (24%) –All video streams have short duration Smooth arrivals (50%) vs. sudden onset (50%) –Flash crowds are common

Obama Inauguration (2009) 7 million simultaneous data streams, most carrying live video total traffic peaking at 2 terabits per second requests per minute fifth place historically, behind election night in November and three sports events (a World Cup soccer playoff and two NCAA basketball playoff games)

© 2008 Akamai PROPRIETARY 2: Network and Link Characterization Before inauguration

© 2008 Akamai PROPRIETARY 2: Network and Link Characterization Just after – the Internet’s performance starts to degrade massively

© 2008 Akamai PROPRIETARY 2: Network and Link Characterization And later is mostly better but not great