Content Delivery Networks (CDN) Dr. Yingwu Zhu Reverse Proxy Reverse Proxy Reverse Proxy Intranet Web Cache Architecure Browser Local ISP cache L4 Switch.

Slides:



Advertisements
Similar presentations
Information-Centric Networks05c-1 Week 5 / Paper 3 Democratizing content publication with Coral –Michael J. Freedman, Eric Freudenthal, David Mazières.
Advertisements

CS 4700 / CS 5700 Network Fundamentals Lecture 15: Content Delivery Networks (Over 1 billion served … each day) Revised 10/22/2014.
Incentives Build Robustness in BitTorrent Bram Cohen.
1 Server Selection & Content Distribution Networks (slides by Srini Seshan, CS CMU)
Akamai networks,48000 servers and 70 countries in the world.
Jan 17, 2001CSCI {4,6}900: Ubiquitous Computing1 Announcements.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
19 Historical overview Main challenge: How to distribute content in high quality over the Internet cost-effectively? • Traditional “Best-effort” model:
A Taxonomy and Survey of Content Delivery Networks Meng-Huan Wu 2011/10/26 1.
1 From Internet Data Centers to Data Centers in the Cloud Data Centers Evolution − Internet Data Center − Enterprise Data Centers −Web 2.0 Mega Data Centers.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
What’s a Web Cache? Why do people use them? Web cache location Web cache purpose There are two main reasons that Web cache are used:  to reduce latency.
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
SERVER LOAD BALANCING Presented By : Priya Palanivelu.
Anycast Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
Content Delivery Networks. History Early 1990s sees 100% growth in internet traffic per year 1994 o Netscape forms and releases their first browser.
Web Content Delivery Networks Yogesh Bhumralkar. CDN: Motivations zCongestion in the Internet. zWeb Servers sometimes become overloaded due to too many.
Technologies for Building Content Delivery Networks Pei Cao Cisco Systems, Inc.
AKAMAI Content Delivery Services AKAMAI Content Delivery Services CIS726 : PRESENTATION Avinash Ponugoti Avinash Ponugoti Nagarjuna Nagulapati Sathish.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Web Cache. Introduction what is web cache?  Introducing proxy servers at certain points in the network that serve in caching Web documents for faster.
Content Distribution Network (CDN) Performance Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, Jan 16th.
Information-Centric Networks05a-1 Week 5 / Paper 1 On the use and performance of content distribution networks –Balachander Krishnamurthy, Craig Wills,
Efficient Content Distribution on Internet. Who pays for showing a Web page to a user? Receiving side –Users pay to small ISPs, who pay to big ISPs, who.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Caching and Content Distribution Networks. Some Interesting Observations r Top 1 % of all documents account for 20% - 35% of proxy requests r Top 10%
On the Use and Performance of Content Distribution Networks Balachander Krishnamurthy Craig Wills Yin Zhang Presenter: Wei Zhang CSE Department of Lehigh.
1 Caching  Temporary storage of frequently accessed data (duplicating original data stored somewhere else)  Reduces access time/latency for clients 
Content Distribution March 8, : Application Layer1.
1. 1.Charting the CDNs(locating all their content and DNS servers). 2.Assessing their server availability. 3.Quantifying their world-wide delay performance.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
CPSC 441: Multimedia Networking1 Outline r Scalable Streaming Techniques r Content Distribution Networks.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
12-Oct-15Ubiquitous Computing1 Outline Applications –Central Server –Hierarchical –Peer-to-peer.
Application of Content Computing in Honeyfarm Introduction Overview of CDN (content delivery network) Overview of honeypot and honeyfarm New redirection.
Kiew-Hong Chua a.k.a Francis Computer Network Presentation 12/5/00.
CONTENT DELIVERY NETWORKS
Globally Distributed Content Delivery Presenter: Baoning Wu 03/25/2003.
Content Distribution Network, Proxy CDN: Distributed Environment
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
An Analysis of Internet Content Delivery Systems 19 rd November, 2007 Youngsub CSE, SNU.
Content Distribution Networks (CDNs)
September 2008 Josilene Aires Moreira.  Overview  CDN Topology  CDNs nowadays  Contructing a CDN ◦ Basic model ◦ Modules ◦ Characteristics  References.
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
WHAT'S THE DIFFERENCE BETWEEN A WEB APPLICATION STREAMING NETWORK AND A CDN? INSTART LOGIC.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
MICROSOFT AJAX CDN (CONTENT DELIVERY NETWORK) Make Your ASP.NET site faster to retrieve.
Content Delivery Cloud A Better Alternative To Your Content Delivery Network (CDN) ©2013 Riverbed Technology Confidential and Proprietary.
Scalability and CDNs Lecture X NWEN 304. Notices Progress presentations Room: CO219 (your lab room) Please be on time and ready to go. We will *try* to.
Content Distribution Networks
Content Distribution Networks
Caching Temporary storage of frequently accessed data (duplicating original data stored somewhere else) Reduces access time/latency for clients Reduces.
Ad-blocker circumvention System
VIRTUAL SERVERS Presented By: Ravi Joshi IV Year (IT)
Web Caching? Web Caching:.
Utilization of Azure CDN for the large file distribution
Distributed Systems CS
Content Distribution Networks
On the Use and Performance of Content Distribution Networks
Content Distribution Networks
AWS Cloud Computing Masaki.
EE 122: Lecture 22 (Overlay Networks)
AKAMAI Content Delivery Services
Presentation transcript:

Content Delivery Networks (CDN) Dr. Yingwu Zhu

Reverse Proxy Reverse Proxy Reverse Proxy Intranet Web Cache Architecure Browser Local ISP cache L4 Switch Data Center ISP cdn cache Content Server Content Server Content Server Content Server Reverse Proxy Browser cache Browser cache cdn

History 1998 – 1 st CDNs appear. Save $ by putting more web sites on a CDN, reliability and scalability without expensive hardware and management 1999 – several companies (Akamai, Mirror Image) became the specialists in providing fast and reliable delivery of Web content, earning large profits 2000 – U.S. only, CDNs are a huge market generating $905 millions, reaching $12 billion by – the flash crowd event (numerous users access a web site simultaneously), e.g., Sept when users flooded popular news sites, making the sites unavailable. Flash events transfer more $ to CDN sale income 2002 – Large-scale ISPs (AT&T) tend to build their own CDN functionality, providing customized services 2004 – More than 3000 companies using CDNs, spending more than $20 million monthly. CDN providers doubled their revenue from streaming media operations in 2004 compared to – CDN revenue for both streaming video and Internet radio is estimated to grow at 40%, spending more than $450 million for delivery of news, film, sports, music and entertainment.

Content Delivery - a bit of History Individual Web servers Increase in Web content Web Server Farms Issue of Flash Crowds Replication of same Web content around the globe in a net of Web servers Not financially viable for individual content providers (say, bbc.com) to set up their own server networks

Content Delivery Networks (CDN) What : Geographically distributed network of Web servers around the globe (by an individual provider, E.g. Akamai). Why: Improve the performance and scalability of content retrieval. How: Allow several content providers to replicate their content in a network of servers.

Conventional CDN Architecture Classical Example: Akamai Figure Ref:

Conventional CDN Architectures Commercial CDN Centralized Client-Server Architecture Owned by corporate companies E.g: Akamai Academic CDN Peer-to-peer Architecture Designed to reduce the cost E.g: Globule

What is CDN ? The CDNs are means to offload some or all of the (mainly static content) content delivery burden from the origin server. A replica server, which delivers content on behalf of the origin server is called a CDN server. Aimed to address … –Client perceived latency (e.g. web browsers). –Capacity management of the server. –Caching as a side-effect.

What is CDN ? CDN is an architecture for efficient delivery of (web) content to a large number of clients CDNs are operated by companies which charge content providers for the delivery services CDNs are mostly transparent to the end-user – Meaning: You can see CDNs being used only if you look at actual DNS requests or read HTML-source of a page Commercial CDNs for actual content delivery: –Akamai, Panther Express, SAVVIS, VitalStream Academic CDNs for research on content delivery: –CoDeeN, CoralCDN, Globule

A Big Picture

Advantages of using CDN Reduce customers’ needs in investing web site infrastructures and decrease operational cost of managing such infrastructures Bypass traffic jams on the web – Requested data is close to the clients –Avoid traversing bottleneck links Improve content delivery quality, speed, and reliability Reduce load on the original server Load balancing?

CDN – why? One of the main goals of CDNs is to put content provider in control over how her content is cached Content provider signs a contract with CDN –Contract specifies how content can be cached Contract also means CDN will follow what content provider wants CDNs typically charge per-byte of traffic served CDNs can be used for any kind of content –Typically main use is for web content –Streaming media has also been delivered over CDNs

CDN--How? Original servers A set of surrogate servers or CDN servers –Geographically distributed worldwide –Cache original servers’ content Routers –deliver the client’s requests to a best fitted CDN server (latency, load balancing, etc) Network elements –Distribute content from the original servers to surrogate/CDN servers Accounting mechanism –Provide logs and accounting info. to the original servers

How does CDN work? Users send requests to origin server Requests somehow intercepted by redirection service Redirection service forwards user’s request to the “best” CDN content server Content served from the CDN content server

CDN- Design Issues CDN operates CDN content servers Content servers are placed close to users –In terms of network distance Some or all of the content from the content provider (original server) is replicated on the content servers –Different content servers might have different content Users access content from the “nearest” content server Challenges: –How to redirect clients (request redirection)? –How to replicate content? Usually happens over a private network Can optimize according to many criteria

Request Redirection Key to CDNs Select the most appropriate CDN content server for user requests –DNS redirection Complete/full Partial –URL rewrite

Request Redirection DNS redirection Authoritative DNS server is controlled by the CDN infrastructure. Distributes the load to the various CDN servers depending whatever policy (e.g. round- robin, least loaded CDN server, geographical distance etc.) using DNS trick. URL rewriting Main page still comes from the origin server, but URL for the embedded objects, e.g. images, clips are rewritten, which points to a any of the CDN server. Some vendors rewrite using hostname and some uses IP address directly.

Full Site DNS redirection example Vendors: Adero(Full), Akami and Digital Island (Partial) index.html IP for yahoo.com (not ) GET index.html … Origin Server CDN controlled DNS Server

DNS Redirection Client’s DNS request comes to CDN’s nameserver –Somehow, see below for two possibilities Typically the request has to go through some steps through the CDN’s DNS hierarchy Each step redirects the client to a nearby nameserver Finally, last nameserver returns the address of a nearby content server For the infrastructure, CDN needs to measure the state of the network –Needed to determine which servers are the closest –Network measurements to determine current state

Two DNS Redirection Types Full redirection –Any request for origin server is redirected to CDN –Basically, CDN takes control of content provider’s DNS zone –Benefit: All requests are automatically redirected –Disadvantage: May send lots of traffic to CDN, hence expensive for the content provider, $ per byte Partial redirection –Content provider marks which objects are to be served from CDN –Typically, larger objects like images are selected –Refer to images as: –When client wants to retrieve image, DNS request for cdn.com gets resolved by CDN and image is fetched from the selected content server –Pro: Fine-grained control over what gets delivered –Con: Have to (manually) mark content for CDN

Two DNS Redirection Types Full redirection –All requests redirected to content servers Partial redirection –Get HTML page from origin server, images from content server – Need to open new TCP connection for images Question: Which gives better performance to users?

DNS Redirection: other issues DNS redirection has one (big) problem –Because redirection is based on DNS queries, the content server is chosen based on who sent that query DNS queries do not come from clients, but from the DNS servers used by the clients Why is this a problem? In many cases it’s not a problem –For example, clients in a university use university’s nameserver In many cases, it’s a big problem –Larger ISPs might run only a few nameservers –Especially in US for dial-up users, DNS lookups are concentrated –This means the content server is optimized for the nameserver, not the actual client –The difference can sometimes be very large

URL rewrite Modify pages at the origin server on the fly Change embedded URL’s based on up-to- date knowledge of the network and CDN server loads Does not require additional DNS lookups Fasttide, Clearway

Partial DNS redirect/URL rewriting example Vendors: Clearway (URL RW) index.html About Us

DNS Redirection vs. URL Rewrite Discussion Comparison?

CDN: other issues Content server placement Content selection Content outsourcing

Content Server Placement Minimize user-perceived latency –Put content servers close to the users Minimize cost –Content outsourcing cost Algorithms to achieve both

Content selection How much content should be replicated to content server? Full site replication –Simple, but high storage cost, outsourcing cost Partial replication –Content grouping based on correlation or access frequency –Replicate content groups

Content Outsourcing Cooperating push-based –Content is prefetched to content servers from the original server –Content servers cooperate in order to reduce the replication and update cost –CDNs maintain the mapping between content and content servers

Some Facts... CDN mainly used for image files (static contents). Content server by the CDN is a static in the nature. Only 0.3% content changed for existing URLs and at the most 13% new URLs were introduced. Large increase in deployment in the CDN between Nov 99 (only 1-2% of top 670 sites) and Dec 2000 (25% of the popular sites). Akamai seems to be most popular CDN vendor. Images are 96-98% of the CDN served contents. But only 40-46% of the CDN-served bytes. Rest is dynamic content ? CDN images cache-hit rate is 30-80%. CDNs can not used for something that involves authentication etc.