1 A Hybrid Architecture for Cost-Effective On-Demand Media Streaming Mohamed Hefeeda & Bharat Bhargava CS Dept, Purdue University Support: NSF, CERIAS.

Slides:



Advertisements
Similar presentations
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Advertisements

P2P data retrieval DHT (Distributed Hash Tables) Partially based on Hellerstein’s presentation at VLDB2004.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Peer to Peer and Distributed Hash Tables
Peer-to-Peer Systems Chapter 25. What is Peer-to-Peer (P2P)? Napster? Gnutella? Most people think of P2P as music sharing.
Clayton Sullivan PEER-TO-PEER NETWORKS. INTRODUCTION What is a Peer-To-Peer Network A Peer Application Overlay Network Network Architecture and System.
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao, Christine Lv., Edith Cohen, Kai Li and Scott Shenker ICS 2002.
Denial-of-Service Resilience in Peer-to-Peer Systems D. Dumitriu, E. Knightly, A. Kuzmanovic, I. Stoica and W. Zwaenepoel Presenter: Yan Gao.
Resilient Peer-to-Peer Streaming Paper by: Venkata N. Padmanabhan Helen J. Wang Philip A. Chou Discussion Leader: Manfred Georg Presented by: Christoph.
CompSci 356: Computer Network Architectures Lecture 21: Content Distribution Chapter 9.4 Xiaowei Yang
An Overview of Peer-to-Peer Networking CPSC 441 (with thanks to Sami Rollins, UCSB)
1 On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda Advisor: Prof. Bharat Bhargava October 16, 2002.
11/4/2003ACM Multimedia 2003, Berkeley, CA1 A Framework for Cost-Effective Peer- to-Peer Content Distribution Mohamed Hefeeda Ph.D. Candidate Advisor:
Peer-to-Peer Networks as a Distribution and Publishing Model Jorn De Boever (june 14, 2007)
Dr. Zahid Anwar. Simplified Architecture of Linux Cluster Simplified Architecture of a Single Computer Simplified architecture of an enterprise cluster.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada ISP-Friendly Peer Matching without ISP Collaboration Mohamed Hefeeda (Joint.
Network Coding for Large Scale Content Distribution Christos Gkantsidis Georgia Institute of Technology Pablo Rodriguez Microsoft Research IEEE INFOCOM.
1 Peer-to-Peer Media Streaming Mohamed M. Hefeeda Advisor: Prof. Bharat Bhargava March 12, 2003.
Scalable and Continuous Media Streaming on Peer-to-Peer Networks M. Sasabe, N. Wakamiya, M. Murata, H. Miyahara Osaka University, Japan Presented By Tsz.
FRIENDS: File Retrieval In a dEcentralized Network Distribution System Steven Huang, Kevin Li Computer Science and Engineering University of California,
Topics in Reliable Distributed Systems Lecture 2, Fall Dr. Idit Keidar.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
P2VoD: Providing Fault Tolerant Video-on-Demand Streaming in Peer-to-Peer Environment Tai T.Do, Kien A. Hua, Mounir A. Tantaoui Proc. of the IEEE Int.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
1March -05 Jiangchuan Liu with Xinyan Zhang, Bo Li, and T.S.P.Yum Infocom 2005 CoolStreaming/DONet: A Data-Driven Overlay Network for Peer-to-Peer Live.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
1 School of Computing Science Simon Fraser University CMPT 880: Peer-to-Peer Systems Mohamed Hefeeda 10 January 2005.
Wide-area cooperative storage with CFS
A Framework for Cost-Effective Peer-to- Peer Content Distribution Mohamed Hefeeda and Bharat Bhargava Department of Computer Sciences Purdue University.
Peer-to-peer Multimedia Streaming and Caching Service by Won J. Jeon and Klara Nahrstedt University of Illinois at Urbana-Champaign, Urbana, USA.
On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.
A Peer-to-Peer On-Demand Streaming Service and Its Performance Evaluation Yang Guo, Kyoungwon Suh, Jim Kurose, Don Towsley University of Massachusetts,
Middleware for P2P architecture Jikai Yin, Shuai Zhang, Ziwen Zhang.
Communication Part IV Multicast Communication* *Referred to slides by Manhyung Han at Kyung Hee University and Hitesh Ballani at Cornell University.
Storage management and caching in PAST PRESENTED BY BASKAR RETHINASABAPATHI 1.
P2P File Sharing Systems
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
Peer-to-Peer Overlay Networks. Outline Overview of P2P overlay networks Applications of overlay networks Classification of overlay networks – Structured.
Introduction to Peer-to-Peer Networks. What is a P2P network A P2P network is a large distributed system. It uses the vast resource of PCs distributed.
Peer to Peer Research survey TingYang Chang. Intro. Of P2P Computers of the system was known as peers which sharing data files with each other. Build.
1 School of Computing Science Simon Fraser University CMPT 880: Internet Architectures and Protocols Introduction to Peer-to-Peer Systems Instructor: Dr.
Overcast: Reliable Multicasting with an Overlay Network CS294 Paul Burstein 9/15/2003.
Jonathan Walpole CSE515 - Distributed Computing Systems 1 Teaching Assistant for CSE515 Rahul Dubey.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
Resilient Peer-to-Peer Streaming Presented by: Yun Teng.
Using the Small-World Model to Improve Freenet Performance Hui Zhang Ashish Goel Ramesh Govindan USC.
1 Distributed Hash Tables (DHTs) Lars Jørgen Lillehovde Jo Grimstad Bang Distributed Hash Tables (DHTs)
TOMA: A Viable Solution for Large- Scale Multicast Service Support Li Lao, Jun-Hong Cui, and Mario Gerla UCLA and University of Connecticut Networking.
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Peer-Assisted Content Distribution Pablo Rodriguez Christos Gkantsidis.
1 Peer-to-Peer Technologies Seminar by: Kunal Goswami (05IT6006) School of Information Technology Guided by: Prof. C.R.Mandal, School of Information Technology.
Peer to Peer A Survey and comparison of peer-to-peer overlay network schemes And so on… Chulhyun Park
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
Peer-to-Peer Media Streaming ZIGZAG - Ye Lin PROMISE – Chanjun Yang SASABE - Kung-En Lin.
Peer to Peer Network Design Discovery and Routing algorithms
CoopNet: Cooperative Networking
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
INTERNET TECHNOLOGIES Week 10 Peer to Peer Paradigm 1.
CS 347Notes081 CS 347: Parallel and Distributed Data Management Notes 08: P2P Systems.
P2P Search COP6731 Advanced Database Systems. P2P Computing  Powerful personal computer Share computing resources P2P Computing  Advantages: Shared.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
Malugo – a scalable peer-to-peer storage system..
Cost-Effective Video Streaming Techniques Kien A. Hua School of EE & Computer Science University of Central Florida Orlando, FL U.S.A.
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
Early Measurements of a Cluster-based Architecture for P2P Systems
Dynamic Replica Placement for Scalable Content Delivery
EE 122: Lecture 22 (Overlay Networks)
Presentation transcript:

1 A Hybrid Architecture for Cost-Effective On-Demand Media Streaming Mohamed Hefeeda & Bharat Bhargava CS Dept, Purdue University Support: NSF, CERIAS FTDCS 2003

2  Imagine a media server streaming movies to clients -Target environments Distance learning, corporate streaming, …  Current approaches -Unicast Centralized Proxy caches CDN (third-party) -Multicast Network layer Application layer  Proposed approach (Peer-to-Peer) Motivations

3 Unicast Streaming: Centralized  Pros -Easy to deploy, administer  Cons -Limited scalability -Reliability concerns -Load on the backbone -High deployment cost $$$…..$  Note: -A server with T3 link (~45 Mb/s) supports only up to 45 concurrent users at 1Mb/s CBR!

4 Unicast streaming: Proxy  Pros -Better performance -Less load on backbone and server -Prefix caching, selective caching, staging, …  Cons -Proxy placement and management -High deployment cost $$$…..$

5 Unicast Streaming: CDN  Pros -Good performance -Suitable for web pages with moderate-size objects  Cons -Co$t: CDN charges for every megabyte served!  -Not suitable for VoD service; movies are quite large (~Gbytes)  Note [Raczkowski’02] -Cost ranges from 0.25 to 2 cents/MByte -For a one-hour streamed to 1,000 clients, content provider pays $264 to CDN (at 0.5 cents/MByte)!

6 Multicast Streaming: Network Layer  Pros -Efficient! -Asynchronous client Patching, skyscraper, … [e.g., Mahanti, et al. ToN’03]  Cons -Scalability of the routers -Asynchronous  tune to multiple channels -Require high inbound bandwidth (2R) -Not widely deployed Patch stream

7 Multicast Streaming: Application Layer  Pros -Deployable  Cons -Assumes end systems can support (outbound bandwidth) multiple folds of the streaming rate

8 P2P Approach: Key Ideas  Make use of underutilized peers’ resources  Make use of heterogeneity  Multiple peers serve a requesting peer  Network-aware peers organization

9 P2P Approach (cont’d)  Pros -Cost effective -Deployable (no new hardware, nor network support) -No extra inbound bandwidth (R) -Small load on supplying peers (< R) -Less stress on backbone  Challenges -Achieve good quality Unreliable, limited capacity, peers Highly dynamic environment

10 P2P Approach: Entities  Peer -Level of cooperation (rate, bw, coonections)  Super peer -Help in searching and dispersion  Seed peer -Initiate the streaming  Stream -Time-ordered sequence of packets  Media file -Divided into equal-size segments  Super Peers play special roles  Hybrid system

11 Hybrid System: Issues  Peers Organization -Two-level peers clustering -Join, leave, failure, overhead, super peer selection  Operation -Client protocol: manage multiple suppliers Switch suppliers Effect of switching on quality  Cluster-Based Searching -Find nearby suppliers  Cluster-Based Dispersion -Disseminate new media files Details are in the extended version of the paper

12 Peers Organization: Two-Level Clustering  Based on BGP tables -[Oregon RouteViews]  Network-cluster [KW 00] -Peers sharing the same network prefix /16 (purdue), /16 (cmu) , , ,  AS-cluster -All network clusters within an Autonomous System Snapshot of a BGP routing table

13 Peers Organization: Join Bootstrap data structure Join

14 Client Protocol  Building blocks of the protocol to be run by a requesting peer  Three phases -Availability check Search for all segments with the full rate -Streaming Stream (and playback) segment by segment -Caching Store enough copies of the media file in each cluster to serve all expected requests from that cluster Which segments to cache?

15 Client Protocol (cont'd)  Phase II: Streaming t j = t j-1 + δ /* δ: time to stream a segment */ For j = 1 to N do At time t j, get segment s j as follows: P jConnect to every peer P x in P j (in parallel) and Download from byte b x-1 to b x -1 Note: b x = |s j | R x /R Example: P 1, P 2, and P 3 serving different pieces of the same segment to P 4 with different rates

16 Dispersion (cont'd)  Dispersion Algorithm (basic idea): -/* Upon getting a request from P y to cache N y segments */ -C  getCluster (P y ) -Compute available (A) and required (D) capacities in cluster C -If A < D P y caches N y segments in a cluster-wide round robin fashion (CWRR) –All values are smoothed averages –Average available capacity in C: –CWRR Example: (10-segment file) P 1 caches 4 segments: 1,2,3,4 P 2 then caches 7 segments: 5,6,7,8,9,10,1

17 Evaluation Through Simulation  Client parameters -Effect of switching on quality  System Parameters -Overall system capacity, -Average waiting time, -Load/Role on the seeding peer -Scenarios: different client arrival patterns (constant rate, Poisson, flash crowd) and different levels of peer cooperation  Performance of the dispersion algorithm -Compare against random dispersion algorithm

18 Simulation: Topology –Large, Hierarchical (more than 13,000 nodes) -Ex. 20 transit domains, 200 stub domains, 2,100 routers, and a total of 11,052 end hosts –Used GT-ITM and ns-2

19 Client: Effect of Switching  Initial buffering is needed to hide suppliers switching  Tradeoff: small segment size  small buffering but more overhead (encoding, decoding, processing)

20 System: Capacity and Load on Seed Peer Load on seeding peer ─ The role of the seeding peer is diminishing For 50%: After 5 hours, we have 100 concurrent clients (6.7 times original capacity) and none of them is served by the seeding peer System capacity

21 System: Under Flash Crowd Arrivals  Flash crowd  sudden increase in client arrivals

22 System: Under Flash Crowd (cont'd) ─ The role of the seeding peer is still just seeding During the peak, we have 400 concurrent clients (26.7 times original capacity) and none of them is served by the seeding server (50% caching) Load on seeding peer System capacity

23 Dispersion Algorithm ─ Avg. number of hops: 8.05 hops (random), 6.82 hops (ours)  15.3% savings ─ For a domain with a 6-hop diameter: Random: 23% of the traffic was kept inside the domain Cluster-based: 44% of the traffic was kept inside the domain 5% caching

24 Conclusions  A hybrid model for on-demand media streaming -P2P streaming; Powerful peers do more work -peers with special roles (super peers) -Cost-effective, deployable -Supports large number of clients; including flash crowds  Two-level peers clustering -Network-conscious peers clustering -Helps in keeping the traffic local  Cluster-based dispersion pushes contents closer to clients (within the same domain)  -Reduces number of hops traversed by the stream and the load on the backbone

25 Thank You!

26 P2P Systems: Basic Definitions  Peers cooperate to achieve desired functions -Cooperate: share resources (CPU, storage, bandwidth), participate in the protocols (routing, replication, …) -Functions: file-sharing, distributed computing, communications, …  Examples -Gnutella, Napster, Freenet, OceanStore, CFS, CoopNet, SpreadIt, …  Well, aren’t they just distributed systems? -P2P == distributed systems? Background

27 P2P vs. Distributed Systems  P2P = distributed systems++; -Ad-hoc nature -Peers are not servers [Saroui et al., MMCN’02 ] Limited capacity and reliability -Much more dynamism -Scalability is a more serious issue (millions of nodes) -Peers are self-interested (selfish!) entities 70% of Gnutella users share nothing [Adar and Huberman ’00] -All kind of Security concerns Privacy, anonymity, malicious peers, … you name it! Background

28 P2P Systems: Rough Classification [Lv et al., ICS’02], [Yang et al., ICDCS’02]  Structured (or tightly controlled, DHT) +Files are rigidly assigned to specific nodes +Efficient search & guarantee of finding –Lack of partial name and keyword queries Ex.: Chord [Stoica et al., SIGCOMM’01], CAN [Ratnasamy et al., SIGCOMM’01], Pastry [Rowstron and Druschel, Middleware’01]  Unstructured (or loosely controlled) +Files can be anywhere +Support of partial name and keyword queries –Inefficient search (some heuristics exist) & no guarantee of finding Ex.: Gnutella  Hybrid (P2P + centralized), super peers notion) -Napster, KazaA Background

29 File-sharing vs. Streaming  File-sharing -Download the entire file first, then use it -Small files (few Mbytes)  short download time -A file is stored by one peer  one connection -No timing constraints  Streaming -Consume (playback) as you download -Large files (few Gbytes)  long download time -A file is stored by multiple peers  several connections -Timing is crucial Background

30 Related Work: Streaming Approaches  Distributed caches [e.g., Chen and Tobagi, ToN’01 ] -Deploy caches all over the place -Yes, increases the scalability Shifts the bottleneck from the server to caches! -But, it also multiplies cost -What to cache? And where to put caches?  Multicast -Mainly for live media broadcast -Application level [Narada, NICE, Scattercast, Zigzag, … ] -IP level [e.g., Dutta and Schulzrine, ICC’01] Widely deployed? Background

31 Related Work: Streaming Approaches  P2P approaches -SpreadIt [Deshpande et al., Stanford TR’01] Live media  Build application-level multicast distribution tree over peers -CoopNet [Padmanabhan et al., NOSSDAV’02 and IPTPS’02] Live media  Builds application-level multicast distribution tree over peers On-demand  Server redirects clients to other peers  Assumes a peer can (or is willing to) support the full rate  CoopNet does not address the issue of quickly disseminating the media file Background