11/4/2003ACM Multimedia 2003, Berkeley, CA1 A Framework for Cost-Effective Peer- to-Peer Content Distribution Mohamed Hefeeda Ph.D. Candidate Advisor:

Slides:



Advertisements
Similar presentations
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Advertisements

Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Replication Strategies in Unstructured Peer-to-Peer Networks Edith Cohen Scott Shenker This is a modified version of the original presentation by the authors.
Peer-to-Peer Systems Chapter 25. What is Peer-to-Peer (P2P)? Napster? Gnutella? Most people think of P2P as music sharing.
Clayton Sullivan PEER-TO-PEER NETWORKS. INTRODUCTION What is a Peer-To-Peer Network A Peer Application Overlay Network Network Architecture and System.
On Large-Scale Peer-to-Peer Streaming Systems with Network Coding Chen Feng, Baochun Li Dept. of Electrical and Computer Engineering University of Toronto.
On the Economics of P2P Systems Speaker Coby Fernandess.
Peer-to-Peer Systems (cntd.) 15/11 – 2004 INF5070 – Media Servers and Distribution Systems:
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao, Christine Lv., Edith Cohen, Kai Li and Scott Shenker ICS 2002.
Resilient Peer-to-Peer Streaming Paper by: Venkata N. Padmanabhan Helen J. Wang Philip A. Chou Discussion Leader: Manfred Georg Presented by: Christoph.
PROMISE: Peer-to-Peer Media Streaming Using CollectCast Mohamed Hafeeda, Ahsan Habib et al. Presented By: Abhishek Gupta.
11/4/2003ACM Multimedia 2003, Berkeley, CA1 PROMISE: Peer-to-Peer Media Streaming Using CollectCast Mohamed Hefeeda 1 Joint work with Ahsan Habib 2, Boyan.
An Overview of Peer-to-Peer Networking CPSC 441 (with thanks to Sami Rollins, UCSB)
1 On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda Advisor: Prof. Bharat Bhargava October 16, 2002.
1 A Hybrid Architecture for Cost-Effective On-Demand Media Streaming Mohamed Hefeeda & Bharat Bhargava CS Dept, Purdue University Support: NSF, CERIAS.
Open Problems in Data- Sharing Peer-to-Peer Systems Neil Daswani, Hector Garcia-Molina, Beverly Yang.
Peer-to-Peer Networks as a Distribution and Publishing Model Jorn De Boever (june 14, 2007)
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada ISP-Friendly Peer Matching without ISP Collaboration Mohamed Hefeeda (Joint.
PROMISE A Peer-to-Peer Media Streaming System Using CollectCast CPSC Presentation by Patrick Wong.
1 Peer-to-Peer Media Streaming Mohamed M. Hefeeda Advisor: Prof. Bharat Bhargava March 12, 2003.
FRIENDS: File Retrieval In a dEcentralized Network Distribution System Steven Huang, Kevin Li Computer Science and Engineering University of California,
Service Differentiated Peer Selection An Incentive Mechanism for Peer-to-Peer Media Streaming Ahsan Habib, Member, IEEE, and John Chuang, Member, IEEE.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
Prefix Caching assisted Periodic Broadcast for Streaming Popular Videos Yang Guo, Subhabrata Sen, and Don Towsley.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
PROMISE: Peer-to-Peer Media Streaming Using CollectCast M. Hefeeda, A. Habib, B. Botev, D. Xu, and B. Bhargava ACM Multimedia 2003, November 2003.
1March -05 Jiangchuan Liu with Xinyan Zhang, Bo Li, and T.S.P.Yum Infocom 2005 CoolStreaming/DONet: A Data-Driven Overlay Network for Peer-to-Peer Live.
An Overlay Multicast Infrastructure for Live/Stored Video Streaming Visual Communication Laboratory Department of Computer Science National Tsing Hua University.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
Distributed DBMS© 2001 M. Tamer Özsu & Patrick Valduriez Page 0.1 Outline Introduction Background Distributed DBMS Architecture Distributed Database Design.
1 School of Computing Science Simon Fraser University CMPT 880: Peer-to-Peer Systems Mohamed Hefeeda 10 January 2005.
11/4/2003ACM Multimedia 2003, Berkeley, CA1 PROMISE: Peer-to-Peer Media Streaming Using CollectCast Mohamed Hefeeda 1 Joint work with Ahsan Habib 2, Boyan.
A Framework for Cost-Effective Peer-to- Peer Content Distribution Mohamed Hefeeda and Bharat Bhargava Department of Computer Sciences Purdue University.
On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.
PROMISE: Peer-to-Peer Media Streaming Using CollectCast Presented by: Randeep Singh Gakhal CMPT 886, July 2004.
Storage management and caching in PAST PRESENTED BY BASKAR RETHINASABAPATHI 1.
1 Proceeding the Second Exercises on Computer and Systems Engineering Professor OKAMURA Laboratory. Othman Othman M.M.
Peer-to-Peer Overlay Networks. Outline Overview of P2P overlay networks Applications of overlay networks Classification of overlay networks – Structured.
Introduction to Peer-to-Peer Networks. What is a P2P network A P2P network is a large distributed system. It uses the vast resource of PCs distributed.
Peer to Peer Research survey TingYang Chang. Intro. Of P2P Computers of the system was known as peers which sharing data files with each other. Build.
1 School of Computing Science Simon Fraser University CMPT 880: Internet Architectures and Protocols Introduction to Peer-to-Peer Systems Instructor: Dr.
Resilient Peer-to-Peer Streaming Presented by: Yun Teng.
Using the Small-World Model to Improve Freenet Performance Hui Zhang Ashish Goel Ramesh Govindan USC.
1 Distributed Hash Tables (DHTs) Lars Jørgen Lillehovde Jo Grimstad Bang Distributed Hash Tables (DHTs)
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
1 Peer-to-Peer Technologies Seminar by: Kunal Goswami (05IT6006) School of Information Technology Guided by: Prof. C.R.Mandal, School of Information Technology.
A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside
APPLICATION LAYER MULTICASTING
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
Peer-to-Peer Media Streaming ZIGZAG - Ye Lin PROMISE – Chanjun Yang SASABE - Kung-En Lin.
Aug 22, 2002Sigcomm 2002 Replication Strategies in Unstructured Peer-to-Peer Networks Edith Cohen AT&T Labs-research Scott Shenker ICIR.
CoopNet: Cooperative Networking
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
Peer-to-Peer Systems: An Overview Hongyu Li. Outline  Introduction  Characteristics of P2P  Algorithms  P2P Applications  Conclusion.
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
INTERNET TECHNOLOGIES Week 10 Peer to Peer Paradigm 1.
Peer-to-Peer Video Systems: Storage Management CS587x Lecture Department of Computer Science Iowa State University.
P2P Search COP6731 Advanced Database Systems. P2P Computing  Powerful personal computer Share computing resources P2P Computing  Advantages: Shared.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
The Biologically Inspired Distributed File System: An Emergent Thinker Instantiation Presented by Dr. Ying Lu.
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
Early Measurements of a Cluster-based Architecture for P2P Systems
Peer-to-Peer Video Services
Ying Qiao Carleton University Project Presentation at the class:
Peer-to-Peer Information Systems Week 6: Performance
Dynamic Replica Placement for Scalable Content Delivery
EE 122: Lecture 22 (Overlay Networks)
Presentation transcript:

11/4/2003ACM Multimedia 2003, Berkeley, CA1 A Framework for Cost-Effective Peer- to-Peer Content Distribution Mohamed Hefeeda Ph.D. Candidate Advisor: Bharat Bhargava Department of Computer Sciences Purdue University

2 Motivation  Lots of underutilized end systems (peers) connected to the Internet  Success of Peer-to-Peer (P2P) paradigm -Kazaa, Gnutella, …  Fairly high cost for distributing digital contents (large videos)  Why not share? Everybody benefits!

3 Motivation (cont’d)  Our contribution … -Collaborative P2P Framework for Content Distribution Peer contributes little, but we have too many of them!  Two settings for the framework -Infrastructure Content provider employs peers’ resources to disseminate contents -Cooperative resource sharing Peers cooperate/coordinate to serve requests

4 Motivation (cont’d)  What we gain … -Cost-effectiveness (for supplier & client) -Ease of deployment (on end systems) -Availability (large degree of redundancy) -Scalability (more peers  more resources) -…..  What we need to do … -Address research challenges

5 Motivation (cont’d)  Research problems -Select and match multiple supplying peers with a requesting peer -Aggregate and coordinate contributions from peers -Adapt to peer failures and network conditions -Disseminate contents into the system -Consider peer rationality (self-interest) into protocols -Assess and incorporate peer trustworthiness

6 Outline  Cooperative environment [ACM MM’03] -CollectCast -PROMISE -Evaluation: simulation and implementation (PlanetLab)  Infrastructure [J. Computer Networks, To appear 03] -Hybrid (super peer) architecture -Peer clustering and organization -Searching and dispersion algorithms -Evaluation: simulation  Current/Future work -Rationality (Economics) [Tech Reports 03] -Trustworthiness (Security)

7 CollectCast  CollectCast is a new P2P service -Middleware layer between a P2P lookup substrate and applications  Previous work either -Assume one sender, e.g., [Tran, et al. 03, Bawa, et al. 02] Ignores peer limited capacity -Or, multiple senders but no careful selection, e.g., [Padmanabhan, et al. 02] Ignores peer diversity and network conditions

8 CollectCast (cont’d)  Functions -Infer and label topology -Select best sending peers for each session -Coordinate contributions from peers -Adapt to peer failures and network conditions

9 CollectCast: Peer Selection  Considers -R p, A p (t) -e2e available bandwidth and loss rate -Shared path segments  Problem formulation: -Find P actv that

10 PROMISE and Experiments on PlanetLab  PROMISE is a P2P media streaming system built on top of CollectCast  Tested on PlanetLab test bed  Extended Pastry to support multiple peer look up  Used several MPGE-4 movie traces  Select peers using topology-aware (the one used in CollectCast) and end-to-end

11 CollectCast: Performance Packet-level performance  Smoother aggregated rate achieved by CollectCast Frame-level performance  Much fewer frames miss their deadlines with CollectCast  CollectCast requires smaller initial buffering time to ensure all frames meet their deadlines

12 Outline  Cooperative environment [ACM MM’03] -CollectCast -PROMISE -Evaluation: simulation and implementation (PlanetLab)  Infrastructure [J. Computer Networks, To appear 03] -Hybrid (super peer) architecture -Peer clustering and organization -Searching and dispersion algorithms -Evaluation: simulation  Current/Future work -Rationality (Economics) [Tech Reports 03] -Trustworthiness (Security)

13  Target environments -Streaming (on-demand) videos to many clients Distance learning, media center, corporate streaming, …  Current approaches -Unicast Centralized Proxy caches CDN (third-party) -Multicast Network layer Application layer  Proposed approach (Peer-to-Peer) P2P Infrastructure

14 Unicast Streaming Centralized  Easy to deploy, administer  Limited scalability  Reliability concerns  Load on the backbone  High deployment cost $$$….$ CDN  Good performance  Suitable for web pages with moderate-size objects  Co$t: CDN charges for every megabyte served! [Raczkowski 02]  For one-hour streamed to 1,000 clients, content provider pays $264 to CDN (at 0.5 cents/MByte)

15 Multicast Streaming Network layer  Efficient!  Asynchronous client: Patching, skyscraper, … [Mahanti, et al. 03]  Tune to multiple channels  Require inbound bandwidth 2R+  Scalability of routers  Not widely deployed!! Patch stream Application layer  Deployable  E.g., [Narada 02, NICE 02, Zigzag 03, …]  Assume end systems can support (outbound bandwidth) multiple folds of the streaming rate

16 P2P Approach: Key Ideas  Make use of underutilized peers’ resources  Make use of heterogeneity  Multiple peers serve a requesting peer  Network-aware peer organization Super Peers play special roles  Hybrid Architecture

17 Hybrid Architecture: Issues  Peer Organization -Two-level peer clustering -Join, leave, failure, overhead, super peer selection  Cluster-Based Searching -Find nearby suppliers  Cluster-Based Dispersion -Efficiently disseminate new media files

18 Peer Organization  Previous client clustering techniques -Too coarse [Barford, et al. 02] Few large clusters Good for cache placement -Too fine [Bestavros, et al. 01] [Krishnamurthy, et al. 00] Many small clusters  Our approach -Balanced, suitable for P2P environments -Use BGP tables [RouteViews, Univ. of Oregon] -Validated by Internet Statistics

19 Peer Organization: Two-Level Clustering Snapshot of a BGP routing table  Two-level clustering -Network cluster Peers sharing same network prefix -AS cluster All network clusters within an Autonomous System

20 Dispersion (cont'd)  Dispersion Algorithm (basic idea) -/* Upon getting a request from P y to cache N y segments */ -C  getCluster (P y ) -Compute available (A) and required (D) capacities in cluster C -If A < D P y caches N y segments in a cluster-wide round robin fashion (CWRR) –All values are smoothed averages –Average available capacity in C: –CWRR Example: (10-segment file) P 1 caches 4 segments: 1,2,3,4 P 2 then caches 7 segments: 5,6,7,8,9,10,1

21 Performance Under Flash Crowd Arrivals  Flash crowd  sudden increase in client arrivals Client Arrival Pattern

22 System Under Flash Crowd (cont'd) ─ The role of the seeding peer is just seeding During the peak, we have 400 concurrent clients (26.7 times original capacity) and none of them is served by the seeding server (50% caching) Load on seeding peer System capacity

23 Dispersion Algorithm ─ Avg. number of hops 8.05 hops (random), 6.82 hops (ours)  15.3% savings ─ For a domain with a 6-hop diameter Random: 23% of the traffic was kept inside the domain Cluster-based: 44% of the traffic was kept inside the domain 5% caching

24 Outline  Cooperative environment [ACM MM’03] -CollectCast -PROMISE -Evaluation: simulation and implementation (PlanetLab)  Infrastructure [J. Computer Networks, To appear 03] -Hybrid (super peer) architecture -Peer clustering and organization -Searching and dispersion algorithms -Evaluation: simulation  Current/Future work -Rationality (Economics) [Tech Reports 03] -Trustworthiness (Security)

25 Rationality in P2P Systems  Rationality ≡ self-interest ≡ maximize one’s own utility  Rationality is a growing concern in P2P systems [Shneidman & Parkes 03] [Adar & Huberman 00] [Golle, et al. 01] -Rational nodes consume more than they contribute  Jeopardizes growth of P2P systems  Infrastructure -Provider motivates peers to contribute resources -Revenue sharing mechanism [Hefeeda, et al., Tech Report 03]  Cooperative -Enforce fair contribution/consumption of resources -Develop distributed incentive mechanisms [in progress]

26 Dealing with Rationality  Ideas from economic theory ported to computer science  Mechanism Design (MD) [e.g., Fudenburg & Triole 91] -Inverse game theory: how to design a game to yield the desired outcomes in equilibrium -Define strategies, “rules of the game”, for selfish agents  Algorithmic Mechanism Design (AMD) [Nissan & Ronen 99] -MD + computational complexity considerations  Distributed Algorithm Mechanism Design (DAMD) [Feigenbaum & Shenker 02] [Feigenbaum et al. 02] -AMD + Distributed

27 Cooperative: Incentive Mechanism  Peers are agents with costs (storage and BW)  Want to replicate (provision) objects into the network to optimize a system-wide function (e.g. minimize the Expected Search Size (ESS))  Problem -Design an incentive mechanism that yields optimal replication strategy, given that nodes are rational  Related Work -Optimal replication with obedient nodes [Cohen & Shenker 02] Replicating objects in proportion to the square root of their rates results in minimum ESS

28 Incentives??  What may serve as Incentives in P2P??? -Pricing (monetary) E.g., micro payments - Non-pricing (non-monetary) Higher priority Peers ranking Peers membership level ….. -Success story “User of the Day”, listing of users

29 Trustworthy P2P Systems  As always, Security is a big issue!  More peers join if they can trust other peers  How to assess trust? And, how to use it?  Research Problems -Evidence identification What may serve as an evidence? E.g., fraction of time a peer fulfilled its commitment -Evidence collection How to collect sufficient instances of this evidence? Complexity: communications, processing -Trust models Using evidences, build web of trust among peers -Trust-based searching Find me a peer that, with high probability, will fulfill its duties!!!

30 Conclusions  A P2P framework for content distribution  CollectCast and PROMISE -Cooperative media streaming environment  Hybrid Architecture -P2P streaming with super peer assisting in searching and dispersion -Cost-effective infrastructure for content distribution managed by a provider  Rationality (Current) -Incentive-compatible network provisioning  Trustworthiness (Future) -Trust-based searching schemes

31 Thank You! Questions, Suggestions, Comments are appreciated !  More information and papers at …

32

33 P2P Systems: Basic Definitions  Peers cooperate to achieve desired functions -Cooperate: share resources (CPU, storage, bandwidth), participate in the protocols (routing, replication, …) -Functions: file-sharing, distributed computing, communications, …  Examples -Gnutella, Napster, Freenet, OceanStore, CFS, CoopNet, SpreadIt, …  Well, aren’t they just distributed systems? -P2P == distributed systems? Background

34 P2P vs. Distributed Systems  P2P = distributed systems++; -Ad-hoc nature -Peers are not servers [Saroui, et al. 02 ] Limited capacity and reliability -Much more dynamism -Scalability is a more serious issue (millions of nodes) -Peers are self-interested (selfish!) entities 70% of Gnutella users share nothing [Adar & Huberman 00] -All kind of Security concerns Privacy, anonymity, malicious peers, … you name it! Background

35 P2P Systems: Rough Classification [Lv et al., ICS’02], [Yang et al., ICDCS’02]  Structured (or tightly controlled, DHT) +Files are rigidly assigned to specific nodes +Efficient search & guarantee of finding –Lack of partial name and keyword queries Ex.: Chord [Stoica, et al. 01], CAN [Ratnasamy, et al. 01], Pastry [Rowstron & Druschel 01]  Unstructured (or loosely controlled) +Files can be anywhere +Support of partial name and keyword queries –Inefficient search (some heuristics exist) & no guarantee of finding Ex.: Gnutella  Hybrid (P2P + centralized), super peer notion) -Napster, KazaA Background

36 File-sharing vs. Streaming  File-sharing -Download the entire file first, then use it -Small files (few Mbytes)  short download time -A file is stored by one peer  one connection -No timing constraints  Streaming -Consume (playback) as you download -Large files (few Gbytes)  long download time -A file is stored by multiple peers  several connections -Timing is crucial Background

37 Streaming Approaches  Distributed caches [e.g., Chen and Tobagi, ToN’01 ] -Deploy caches all over the place -Yes, increases the scalability Shifts the bottleneck from the server to caches! -But, it also multiplies cost -What to cache? And where to put caches?  Multicast -Mainly for live media broadcast -Application level [Narada, NICE, Scattercast, Zigzag, … ] -IP level [e.g., Dutta and Schulzrine, ICC’01] Widely deployed? Background

38 Streaming Approaches (cont'd)  P2P approaches -SpreadIt [Deshpande, et al., Stanford TR 01] Live media  Build application-level multicast distribution tree over peers -CoopNet [Padmanabhan et al. 02] Live media  Builds application-level multicast distribution tree over peers On-demand  Server redirects clients to other peers  Assumes a peer can (or is willing to) support the full rate  CoopNet does not address the issue of quickly disseminating the media file Background