Cis3.2 -- e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © 2002-2005)

Slides:



Advertisements
Similar presentations
1 Server Selection & Content Distribution Networks (slides by Srini Seshan, CS CMU)
Advertisements

19 – Multimedia Networking. Multimedia Networking7-2 Multimedia and Quality of Service: What is it? multimedia applications: network audio and video (“continuous.
Peer to Peer (P2P) Networks and File sharing. By: Ryan Farrell.
EEC-484/584 Computer Networks Lecture 6 Wenbing Zhao
Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
Spotlighting Decentralized P2P File Sharing Archie Kuo and Ethan Le Department of Computer Science San Jose State University.
The Internet Useful Definitions and Concepts About the Internet.
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #13 Web Caching Protocols ICP, CARP.
Application architectures
Anycast Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
Internet Networking Spring 2002 Tutorial 13 Web Caching Protocols ICP, CARP.
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Caching and Content Distribution Networks. Some Interesting Observations r Top 1 % of all documents account for 20% - 35% of proxy requests r Top 10%
Introduction to Peer-to-Peer Networks. What is a P2P network Uses the vast resource of the machines at the edge of the Internet to build a network that.
P2P File Sharing Systems
1 Networks, advantages & types of What is a network? Two or more computers that are interconnected so they can exchange data, information & resources.
1 Napster & Gnutella An Overview. 2 About Napster Distributed application allowing users to search and exchange MP3 files. Written by Shawn Fanning in.
Introduction Widespread unstructured P2P network
Data Communications and Computer Networks Chapter 2 CS 3830 Lecture 10 Omar Meqdadi Department of Computer Science and Software Engineering University.

Peer-to-Peer Overlay Networks. Outline Overview of P2P overlay networks Applications of overlay networks Classification of overlay networks – Structured.
1 Telematica di Base Applicazioni P2P. 2 The Peer-to-Peer System Architecture  peer-to-peer is a network architecture where computer resources and services.
2: Application Layer1 Chapter 2 Application Layer Computer Networking: A Top Down Approach Featuring the Internet, 2 nd edition. Jim Kurose, Keith Ross.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Introduction to Peer-to-Peer Networks. What is a P2P network A P2P network is a large distributed system. It uses the vast resource of PCs distributed.
1 Computer Communication & Networks Lecture 28 Application Layer: HTTP & WWW p Waleed Ejaz
Introduction of P2P systems
Vulnerabilities in peer to peer communications Web Security Sravan Kunnuri.
Chapter 2: Application layer
CPSC 441: Multimedia Networking1 Outline r Scalable Streaming Techniques r Content Distribution Networks.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
MySQL and PHP Internet and WWW. Computer Basics A Single Computer.
Making the Best of the Best-Effort Service (2) Advanced Multimedia University of Palestine University of Palestine Eng. Wisam Zaqoot Eng. Wisam Zaqoot.
Ch 2. Application Layer Myungchul Kim
1 Peer-to-Peer Systems r Application-layer architectures r Case study: BitTorrent r P2P Search and Distributed Hash Table (DHT)
2: Application Layer1 Chapter 2: Application layer r 2.1 Principles of network applications  app architectures  app requirements r 2.2 Web and HTTP r.
Enabling Peer-to-Peer SDP in an Agent Environment University of Maryland Baltimore County USA.
Data Communications and Computer Networks Chapter 2 CS 3830 Lecture 8 Omar Meqdadi Department of Computer Science and Software Engineering University of.
Content distribution networks (CDNs) r The content providers are the CDN customers. Content replication r CDN company installs hundreds of CDN servers.
CSx760 Computer Networks1 HTTP. CSx760 Computer Networks2 The Web: Some Jargon r Web page: m consists of “objects” m addressed by a URL r Most Web pages.
The Intranet.
1 Peer-to-Peer Technologies Seminar by: Kunal Goswami (05IT6006) School of Information Technology Guided by: Prof. C.R.Mandal, School of Information Technology.
Computer Networking P2P. Why P2P? Scaling: system scales with number of clients, by definition Eliminate centralization: Eliminate single point.
2: Application Layer 1 Chapter 2 Application Layer Computer Networking: A Top Down Approach, 5 th edition. Jim Kurose, Keith Ross Addison-Wesley, April.
ADVANCED COMPUTER NETWORKS Peer-Peer (P2P) Networks 1.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 Application Layer Functionality and Protocols.
Content Distribution Network, Proxy CDN: Distributed Environment
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
INTERNET TECHNOLOGIES Week 10 Peer to Peer Paradigm 1.
2: Application Layer 1 CMPT 371 Data Communications and Networking Chapter 2 Application Layer - 2.
P2P Search COP6731 Advanced Database Systems. P2P Computing  Powerful personal computer Share computing resources P2P Computing  Advantages: Shared.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
Chapter 2 Application Layer Computer Networking: A Top Down Approach, 4 th edition. Jim Kurose, Keith Ross Addison-Wesley, July 2007.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
CSEN 404 Application Layer II Amr El Mougy Lamia Al Badrawy.
CS Spring 2010 CS 414 – Multimedia Systems Design Lecture 24 – Introduction to Peer-to-Peer (P2P) Systems Klara Nahrstedt (presented by Long Vu)
19 – Multimedia Networking
Introduction to Networks
Administrative Things
Web Caching? Web Caching:.
Internet Networking recitation #12
ECE 671 – Lecture 16 Content Distribution Networks
Computer Communication & Networks
Presentation transcript:

cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )

2 Introduction In this lecture, we will discuss: Content Distribution Networks Peer-to-Peer (P2P) networks. These are alternatives to the Client-Server architecture discussed in the last lecture. Some slides: Copyright James Kurose & Keith Ross.

3 Client-Server architecture Server provides content to clients on an always-on basis. Content consists of objects with names (URLs) –Content transferred includes both static and dynamic objects. Scalability problems: –We can only scale up by centralizing content storage –While possible, there are costs to this.

4 Web caches (proxy server) User sets browser: Web accesses via cache Browser sends all HTTP requests to cache –object in cache: cache –returns object –else cache requests object from origin server, then returns object to client Goal: To satisfy client request without involving origin server client Proxy server client HTTP request HTTP response HTTP request HTTP response origin server origin server

5 Web Caching (2) Cache acts as both client and server Cache can do up-to-date check using If-modified-since HTTP header –Issue: should cache take risk and deliver cached object without checking? –Heuristics are used. Typically cache is installed by ISP (university, company, residential ISP) Why Web caching? Reduce response time for client requests. Reduce traffic on an institution’s access link. Internet dense with caches enables “poor” content providers to effectively deliver content

6 Content Distribution Networks (CDNs) Dedicated collections of servers located strategically across a wide area Hold content on behalf of Content Providers Content usually comprises only static objects Proprietary architectures (closed systems)

7 Content Distribution Networks (CDNs) The content providers are the CDN customers. Content replication CDN company installs hundreds of CDN servers throughout Internet –In lower-tier ISPs, close to users CDN replicates its customers’ content in CDN servers. When provider updates content, CDN updates servers. Origin server in UK CDN distribution node CDN server in South America CDN server in USA CDN server in Asia

8 CDN Example origin server Distributes HTML Replaces: with HTTP request for DNS query for HTTP request for Origin server CDNs authoritative DNS server Nearby CDN server CDN company cdn.com Distributes gif files Uses its authoritative DNS server to route redirect requests

9 More about CDNs Routing requests CDN creates a “map”, indicating distances from leaf ISPs and CDN nodes When query arrives at authoritative DNS server: – server determines ISP from which query originates –uses “map” to determine best CDN server Not only Web pages Streaming stored audio/video Streaming real-time audio/video –CDN nodes create application-layer overlay network

10 Peer-to-Peer (P2P) Networks In these networks, any node may perform any function A node may be a client (requesting a service) and a server (providing that same service) Recently arisen as models for distribution of music and video, and increasingly used for content distribution Usually voluntary participation Content usually static objects (eg, music files, video files).

11 P2P File Sharing: Example Alice runs P2P client application on her notebook computer Intermittently connects to Internet; gets new IP address for each connection Asks for “Hey Jude” Application displays other peers that have copy of Hey Jude. Alice chooses one of the peers, Bob File is copied from Bob’s PC to Alice’s notebook While Alice downloads, other users uploading from Alice Alice’s peer is both a Web client and a transient Web server. All peers are servers = highly scalable!

12 P2P: Locating content How does Alice know where to look in the P2P network for “Hey Jude”? Several approaches: Centralized Directory Decentralized Directory Query Flooding.

13 P2P: Centralized Directory Example: Napster (original) 1) When peer connects, it informs central server: –IP address –content 2) Alice queries for “Hey Jude” 3) Alice requests file from Bob centralized directory server peers Alice Bob

14 P2P: Problems with centralized directory Single point of failure Performance bottleneck Copyright infringement Legal vulnerability File transfer is decentralized, but locating content is highly centralized.

15 P2P: Decentralized Directory Example: Kazaa Each peer is either a group leader or assigned to a group leader. Group leader tracks the content in all its children. Peer queries group leader; group leader may query other group leaders.

16 Decentralized directories Overlay network Peers are nodes Edges between peers and their group leaders Edges between some pairs of group leaders Virtual neighbors Bootstrap node A new peer connecting to the P2P network is either assigned to a group leader or designated as a group leader.

17 Decentralized directories (2) Advantages No centralized directory server –Location service distributed over peers –More difficult to shut down Disadvantages Bootstrap node needed –Vulnerable to shut-down Group leaders can get overloaded

18 P2P: Query Flooding Example: Gnutella No hierarchy Use bootstrap node to learn about other peers Send query to neighbors Neighbors forward query If queried peer has object, it sends message back to querying peer join

19 P2P: Query flooding (2) Advantages Peers have similar responsibilities: no group leaders Highly decentralized No peer maintains directory information Disadvantages Excessive query traffic Query radius: may not have content when present Bootstrap node Maintenance of overlay network.

20 Intelligent Query Flooding Research at University of Liverpool has explored intelligent query flooding methods Queries are sent only to nodes believed to know the answer or believed to be close to a node which knows the answer –This requires some notion of semantic distance. This can result in faster and more efficient searches

21 Comparison One study at University of Washington, USA, found: P2P accounts for more than 75% of all internet traffic Median size of P2P object is 3 orders of magnitude larger than median size of client-server object The fact that P2P is used to transfer large objects is the main reason that P2P accounts for such a great proportion of traffic The study of P2P networks is still in its infancy – there are many issues unknown.