Peer-to-Peer Video Systems: Storage Management CS587x Lecture Department of Computer Science Iowa State University.

Slides:



Advertisements
Similar presentations
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Advertisements

Class-constrained Packing Problems with Application to Storage Management in Multimedia Systems Tami Tamir Department of Computer Science The Technion.
Replication Strategies in Unstructured Peer-to-Peer Networks Edith Cohen Scott Shenker This is a modified version of the original presentation by the authors.
Scalable Content-Addressable Network Lintao Liu
Maximum Battery Life Routing to Support Ubiquitous Mobile Computing in Wireless Ad Hoc Networks By C. K. Toh.
Novasky: Cinematic-Quality VoD in a P2P Storage Cloud Speaker : 童耀民 MA1G Authors: Fangming Liu†, Shijun Shen§,Bo Li†, Baochun Li‡, Hao Yin§,
Clayton Sullivan PEER-TO-PEER NETWORKS. INTRODUCTION What is a Peer-To-Peer Network A Peer Application Overlay Network Network Architecture and System.
Study of Hurricane and Tornado Operating Systems By Shubhanan Bakre.
Technion –Israel Institute of Technology Computer Networks Laboratory A Comparison of Peer-to-Peer systems by Gomon Dmitri and Kritsmer Ilya under Roi.
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao, Christine Lv., Edith Cohen, Kai Li and Scott Shenker ICS 2002.
Small-world Overlay P2P Network
Peer-to-Peer Networks as a Distribution and Publishing Model Jorn De Boever (june 14, 2007)
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
Scalable and Continuous Media Streaming on Peer-to-Peer Networks M. Sasabe, N. Wakamiya, M. Murata, H. Miyahara Osaka University, Japan Presented By Tsz.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
Decentralized resource management for a distributed continuous media server Cyrus Shahabi and Farnoush Banaei-Kashani IEEE Transactions on Parallel and.
P2P: Advanced Topics Filesystems over DHTs and P2P research Vyas Sekar.
Exploiting Content Localities for Efficient Search in P2P Systems Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang 1 1 College of William and Mary,
ICNP'061 Benefit-based Data Caching in Ad Hoc Networks Bin Tang, Himanshu Gupta and Samir Das Department of Computer Science Stony Brook University.
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao Cisco Systems, Inc. (Joint work with Christine Lv, Edith Cohen, Kai Li and Scott Shenker)
presented by Hasan SÖZER1 Scalable P2P Search Daniel A. Menascé George Mason University.
Performance Evaluation of Peer-to-Peer Video Streaming Systems Wilson, W.F. Poon The Chinese University of Hong Kong.
A Hybrid Caching Strategy for Streaming Media Files Jussara M. Almeida Derek L. Eager Mary K. Vernon University of Wisconsin-Madison University of Saskatchewan.
Caching And Prefetching For Web Content Distribution Presented By:- Harpreet Singh Sidong Zeng ECE Fall 2007.
Wide-area cooperative storage with CFS
Web Caching Schemes For The Internet – cont. By Jia Wang.
On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.
CS401 presentation1 Effective Replica Allocation in Ad Hoc Networks for Improving Data Accessibility Takahiro Hara Presented by Mingsheng Peng (Proc. IEEE.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 34 – Media Server (Part 3) Klara Nahrstedt Spring 2012.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
1 Algorithms for Bandwidth Efficient Multicast Routing in Multi-channel Multi-radio Wireless Mesh Networks Hoang Lan Nguyen and Uyen Trang Nguyen Presenter:
1 Proxy-Assisted Techniques for Delivering Continuous Multimedia Streams Lixin Gao, Zhi-Li Zhang, and Don Towsley.
Storage management and caching in PAST PRESENTED BY BASKAR RETHINASABAPATHI 1.
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
Peer-to-Peer Computing CS587x Lecture Department of Computer Science Iowa State University.
09/07/2004Peer-to-Peer Systems in Mobile Ad-hoc Networks 1 Lookup Service for Peer-to-Peer Systems in Mobile Ad-hoc Networks M. Tech Project Presentation.
Higashino Lab. Maximizing User Gain in Multi-flow Multicast Streaming on Overlay Networks Y.Nakamura, H.Yamaguchi and T.Higashino Graduate School of Information.
Using the Small-World Model to Improve Freenet Performance Hui Zhang Ashish Goel Ramesh Govindan USC.
Segment-Based Proxy Caching of Multimedia Streams Authors: Kun-Lung Wu, Philip S. Yu, and Joel L. Wolf IBM T.J. Watson Research Center Proceedings of The.
A Distributed Clustering Framework for MANETS Mohit Garg, IIT Bombay RK Shyamasundar School of Tech. & Computer Science Tata Institute of Fundamental Research.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Streaming over Subscription Overlay Networks Department of Computer Science Iowa State University.
Replication Strategies in Unstructured Peer-to-Peer Networks Edith CohenScott Shenker Some slides are taken from the authors’ original presentation.
Your university or experiment logo here Caitriana Nicholson University of Glasgow Dynamic Data Replication in LCG 2008.
Live Streaming over Subscription Overlay Networks CS587x Lecture Department of Computer Science Iowa State University.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Serverless Network File Systems Overview by Joseph Thompson.
Efficient P2P Search by Exploiting Localities in Peer Community and Individual Peers A DISC’04 paper Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang.
K-Anycast Routing Schemes for Mobile Ad Hoc Networks 指導老師 : 黃鈴玲 教授 學生 : 李京釜.
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
HADOOP DISTRIBUTED FILE SYSTEM HDFS Reliability Based on “The Hadoop Distributed File System” K. Shvachko et al., MSST 2010 Michael Tsitrin 26/05/13.
On Reducing Mesh Delay for Peer- to-Peer Live Streaming Dongni Ren, Y.-T. Hillman Li, S.-H. Gary Chan Department of Computer Science and Engineering The.
Aug 22, 2002Sigcomm 2002 Replication Strategies in Unstructured Peer-to-Peer Networks Edith Cohen AT&T Labs-research Scott Shenker ICIR.
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
INTERNET TECHNOLOGIES Week 10 Peer to Peer Paradigm 1.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
Antidio Viguria Ann Krueger A Nonblocking Quorum Consensus Protocol for Replicated Data Divyakant Agrawal and Arthur J. Bernstein Paper Presentation: Dependable.
On Improving the Performance Dependability of Unstructured P2P Systems via Replication ANIRBAN MONDAL YI LIFU MASARU KITSUREGAWA Institute of Industrial.
1 Traffic Engineering By Kavitha Ganapa. 2 Introduction Traffic engineering is concerned with the issue of performance evaluation and optimization of.
System Models Advanced Operating Systems Nael Abu-halaweh.
Geethanjali College Of Engineering and Technology Cheeryal( V), Keesara ( M), Ranga Reddy District. I I Internal Guide Mrs.CH.V.Anupama Assistant Professor.
Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications * CS587x Lecture Department of Computer Science Iowa State University *I. Stoica,
Cost-Effective Video Streaming Techniques Kien A. Hua School of EE & Computer Science University of Central Florida Orlando, FL U.S.A.
An example of peer-to-peer application
The Impact of Replacement Granularity on Video Caching
On Scheduling of Peer-to-Peer Video Services
Peer-to-Peer Video Services
Peer-to-Peer Information Systems Week 6: Performance
Presentation transcript:

Peer-to-Peer Video Systems: Storage Management CS587x Lecture Department of Computer Science Iowa State University

Outline Introduction on P2P video systems What and why Research issues Storage management Service scheduling

P2P Video Systems Definition A P2P video system is a logical system consisting a number of hosts collaborating for the purpose of video services Features Resource sharing  A host can be both a service requester and a service provider There is no central server and no host guarantees the existence of any video  a host can introduce a video to the system and then delete it quickly The participation is fully volunteer  A seeding host might quit the service at a later time

Why not ALM ALM relies on client forwarding to improve the scalability of the original server High demand on client bandwidth  In ALM, a client needs to have a bandwidth at least 2x of the video playback rate  In P2P video systems, a client can download a video for its own playback first and then provide it back to serve others Limited client participation  In ALM, a client can contribute its resource only when it is being served  In P2P video systems, a client can become a service provider of a video by keeping a copy of the video can participate in video services even if it does not request video itself s c1c3 c4c2 c5 c6

Building a P2P Video System A P2P video system can be built on top of a P2P file sharing system by extending its functionality to support video services Choosing a P2P file sharing system Napster-type?  Single point of failure, scalability issue, etc.. Gnutella-like?  Great for its simplicity, but searching cost could be high Freenet-like?  Great for its search efficiency, but such efficiency is achieved by proactive replication, which may be inappropriate Replicating a video over its query path may dramatically increase service latency Streaming a video through its query path is untenable

A hybrid of Gnutella and Freenet Adopt Freenet’s routing-table and depth-first search strategy, but replicate only the file pointer Why does Freenet replicate a file, not its pointer? What are the drawbacks of using routing-table? Adopt Gnutella’s download mechanism When a client find a video, it downloads the file from the source directly AB C D E (V, E)

Video Management Why this becomes a problem Videos are large  Statistics show that a majority of clients contribute less than 1 GB of disk space streaming out of a video takes a substantial communication bandwidth  This seems to be more concerned by users If we allow a client to cache a video in its all or not at all, we may end up only a few clients can contribute  This will again create the problem of server bottleneck

Challenges of Caching Partial Videos Caching collaboration Given a fixed amount of disk space, what part of a video should a client cache? How can we find a set of video segments that are cached by many different hosts?

Challenges of Caching Partial Videos Caching collaboration Given a fixed amount of disk space, what part of a video should a client cache? How can we find a set of video segments that are cached by many different hosts? Cache allocation Should a client cache a newly download video? If yes, how much cache space should be allocated for this video? If the cache is full, what existing data should be expunged to make room?

Caching Collaboration: Cell Cell concept A cell is a cluster of hosts that together can supply a complete video Each host in a cell caches a number of video segments A CacheTable is used to track who caches what

Cell Advantages Reducing cache redundancy Caching and deletion of file segments can be coordinated at cell level The cost of such caching collaboration can be minimal since each cell typically contain only a small number of member  A cell is split whenever possible Reducing search cost Looking for a set of video segments now becomes looking for any single segment  Search scope is dramatically reduced: determined by the distance to the nearest caching host instead of the farthest caching host

Cache Operation Cache procedure Retrieve CacheTable from a cell that provides service Cache the least redundant segments in CacheTable Modify CacheTable and update other cell members

Split Operation Why split Keep the cell size small and reduce maintenance overhead Exhaustive search is impractical To split k hosts into two separate groups, there are O(2 k ) different combinations Proposed split algorithm Can split a cell whenever possible Takes only O(k 2 ) computation

Split Operation Split procedure Put all members in OldSet and create an empty NewSet Find a host in OldSet and move it to NewSet which satisfies two conditions:  Without this host OldSet can still supply a complete file  This host can provide maximum number of segments that are currently not in NewSet Repeat this procedure until no such host can be found If NewSet can supply a complete file then the original cell can be split into two cells

Split Example

Delete Operation Deletion happens when a host needs to recycle its cache space Delete procedure Delete most redundant segments in CacheTable Modify CacheTable and update other cell members

Merge Operation Merge happens when a cell is broken Cell members delete non-redundant segments Cell members found off-line Merge procedure Find a list of cells For each member in the broken cell, select a cell from the list and add it into this cell Split the resulting cell if possible Select procedure A host should join a cell whose least redundant segments can be maximally reinforced

Implementation Issues CacheTable consistency Using mutual exclusion mechanism such as majority quorum algorithm to avoid concurrent updates on CacheTable Using optimistic concurrency control mechanism. When concurrent Delete operations happen and a cell is broken, Merge operation will be invoked to reorganize the cell memebers Cell data redundancy control Increasing the data redundancy level in a cell to avoid frequent Split and Merge operations

Cache Allocation Given a newly downloaded video, should it be cached? If yes, three questions remain: How much cache space should be allocated for the new file Which part of the file should cached  This is solved by Cell technique If necessary, what data in the cache should be expunged to make space for the new file

Traditional Schemes Least Recently Used (LRU) and Least Frequently Used (LFU) Keep popular data items in the cache to reduce the service cost of popular items Unpopular files might be quickly deleted Suitable for systems with a central server but might cause unpopular files to vanish in a decentralized P2P system

Uniform and Proportional Allocation Uniform allocation Maintains equal number of copies in the system for all files Protects unpopular files from being extinct but suffers from load balancing problem Proportional allocation Allocates cache space based on the file popularity Achieves optimal load balancing but unpopular files might become extinct quickly

Controlled Inverse Proportional (CIP) Cache Allocation 1. Determine if a newly downloaded file should be cached 2. Calculate the amount of cache space to be allocated for the new file Try to allocate equal cache space for all videos 3. If there is no empty space for a new file, some redundant segments of cached videos are deleted Look at the CacheTable of each video 4. If there are no redundant segments, a cached video will be deleted to make room for the new video segments Which segments should be cached is determined by Cell

CIP-H A heuristic solution Compare the popularity of the new video with the popularity of each cached video Among all cached videos that have higher popularity than that of the new video, choose the one with the lowest popularity to replace Advantages Files with lower popularity then the new video will be retained Popular videos are replaced dynamically and most popular videos are also retained

CIP-α A solution based on analysis model Define Caching Probability of video i as P i =(p min /p i ) a  p min : the popularity of the least popular video  p i : the popularity of video i  α : an adjustable parameter By adjusting α, the caching effect of CIP-α can vary between uniform and proportional allocation  When α= 0, P i =1, CIP-α becomes proportional allocation  When α= 1, P i =p min /p i, CIP-α becomes uniform allocation

Performance Metrics Accessibility Minimum number of hops that a query message needs to be delivered in order to find a complete file Measures the cost of searching a large file Availability Number of distinct copies of the file in the whole system Measures the cache space allocated for a file in the system

Schemes for Comparison Caching collaboration Cell vs. Random  Random: a host randomly chooses segments to cache after downloading a video Cache allocation CIP-h vs. CIP-a vs. Uniform vs. Proportional  Uniform: all videos are allocated equal cache space in the whole system  Proportional: the total cache space allocated to a video in the whole system is proportional to its popularity

Simulation Setup Simulated in a Gnutella-like P2P network of 5000 hosts with the average distance between two hosts of 7 hops The video popularity follows Zipf-like distribution with skew set to 1 The cache space among hosts also follows Zipf-like distribution with skew set to videos are shared. In the beginning each file has one seeding host. Then hosts begin to download files and cache files. When the network caching capacity has been sufficiently utilized we collect the performance data

Effect of Cell on Video Accessibility Results in Cell and Random show some similarities The file accessibility in Cell is much better than that in Random

Effect of CIP on Video Availability CIP provides balance between Uniform and Proportional CIP-αprovides more fine tuned control over the file availability

Effect of Cell/CIP on Video Accessibility In all cases the accessibility in Cell/CIP is better than that in Random/Uniform and Random/Proportional

Effect of Cell/CIP on Video Availability The curves of Cell/CIP fall between Random/Uniform and Random/Proportional Adjusting αcan change CIP- αbetween Uniform and Proportional

Conclusions Caching collaboration Cell caching can effectively balance the data redundancy and reduce the search cost for large files Cache allocation CIP cache allocation can achieve loading balancing and maintaining unpopular files in the system