A Measurement Study of a Peer-to-Peer Video-on-Demand System Bin Cheng 1, Xuezheng Liu 2, Zheng Zhang 2 and Hai Jin 1 1 Huazhong University of Science.

Slides:



Advertisements
Similar presentations
謝文婷 SocialTube: P2P-assisted Video Sharing in Online Social Networks Authors: Ze Li ; Haiying Shen ; Hailang Wang ; Guoxin Liu ; Jin Li.
Advertisements

Rarest First and Choke Algorithms are Enough Arnaud LEGOUT INRIA, Sophia Antipolis France G. Urvoy-Keller and P. Michiardi Institut Eurecom France.
Novasky: Cinematic-Quality VoD in a P2P Storage Cloud Speaker : 童耀民 MA1G Authors: Fangming Liu†, Shijun Shen§,Bo Li†, Baochun Li‡, Hao Yin§,
On Large-Scale Peer-to-Peer Streaming Systems with Network Coding Chen Feng, Baochun Li Dept. of Electrical and Computer Engineering University of Toronto.
Kangaroo: Video Seeking in P2P Systems Xiaoyuan Yang †, Minas Gjoka ¶, Parminder Chhabra †, Athina Markopoulou ¶, Pablo Rodriguez † † Telefonica Research.
Prediction-based Prefetching to Support VCR-like Operations in Gossip-based P2P VoD Systems Tianyin Xu, Weiwei Wang, Baoliu Ye Wenzhong Li, Sanglu Lu,
Esma Yildirim Department of Computer Engineering Fatih University Istanbul, Turkey DATACLOUD 2013.
Cloud Download : Using Cloud Utilities to Achieve High-quality Content Distribution for Unpopular Videos Yan Huang, Tencent Research, Shanghai, China Zhenhua.
Analyzing and Improving BitTorrent Ashwin R. Bharambe ( Carnegie Mellon University ) Cormac Herley ( Microsoft Research, Redmond ) Venkat Padmanabhan (
Memory System Characterization of Big Data Workloads
Fresh Analysis of Streaming Media Stored on the Web Rabin Karki M.S. Thesis Presentation Advisor: Mark Claypool Reader: Emmanuel Agu 10 Jan, 2011.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Peer-to-peer Multimedia Streaming and Caching Service Jie WEI, Zhen MA May. 29.
Multimedia Proxy Caching Mechanism for Quality Adaptive Streaming Applications in the Internet R. Rejaie, H. Yu, M. Handley, D. Estrin.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
1 A Framework for Lazy Replication in P2P VoD Bin Cheng 1, Lex Stein 2, Hai Jin 1, Zheng Zhang 2 1 Huazhong University of Science & Technology (HUST) 2.
Improving ISP Locality in BitTorrent Traffic via Biased Neighbor Selection Ruchir Bindal, Pei Cao, William Chan Stanford University Jan Medved, George.
Peer-Assisted Content Distribution Networks: Techniques and Challenges Pei Cao Stanford University.
APEX: A Personalization Framework to Improve Quality of Experience for DVD-like Functions in P2P VoD Applications Tianyin Xu, Baoliu Ye, Qinhui Wang, Wenzhong.
CoolStreaming/DONet: A Data- driven Overlay Network for Peer- to-Peer Live Media Streaming INFOCOM 2005 Xinyan Zhang, Jiangchuan Liu, Bo Li, and Tak- Shing.
Prefix Caching assisted Periodic Broadcast for Streaming Popular Videos Yang Guo, Subhabrata Sen, and Don Towsley.
HHMSM: A Hierarchical Hybrid Multicast Stream Merging Scheme For Large-Scale Video-On-Demand Systems Hai Jin and Dafu Deng Huazhong University of Science.
Understanding Mesh-based Peer-to-Peer Streaming Nazanin Magharei Reza Rejaie.
Performance Evaluation of Peer-to-Peer Video Streaming Systems Wilson, W.F. Poon The Chinese University of Hong Kong.
1 USC INFORMATION SCIENCES INSTITUTE Proxy Caching Mechanism for Multimedia Playback Streams in the Internet R. Rejaie, M. Handley, H. Yu, D. Estrin USC/ISI.
On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.
Some recent work on P2P content distribution Based on joint work with Yan Huang (PPLive), YP Zhou, Tom Fu, John Lui (CUHK) August 2008 Dah Ming Chiu Chinese.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 34 – Media Server (Part 3) Klara Nahrstedt Spring 2012.
# Idowu Samuel O. # Kashif Shahzad # Arif Kamal M7001E - Multimedia systems [ltu.se] ©2011.
Can Internet Video-on-Demand Be Profitable? SIGCOMM 2007 Cheng Huang (Microsoft Research), Jin Li (Microsoft Research), Keith W. Ross (Polytechnic University)
Challenges, Design and Analysis of a Large-scale P2P-VoD System Dr. Yingwu Zhu.
1 Enabling near-VoD via P2P Networks Siddhartha Annapureddy Saikat Guha, Dinan Gunawardena Christos Gkantsidis, Pablo Rodriguez World Wide Web, 2007.
Architectures for Peer-to-Peer Content Distribution Christos Gkantsidis Microsoft Research, Cambridge, UK Joint work with: Pablo Rodriguez, John Miller,
Page 18/25/2015 CSE 40373/60373: Multimedia Systems CSE 4/60373: Multimedia Systems  Outline for today  32: Y.-F. Chen, Y. Huang, R. Jana, H. Jiang,
1 Proceeding the Second Exercises on Computer and Systems Engineering Professor OKAMURA Laboratory. Othman Othman M.M.
Dynamic and Decentralized Approaches for Optimal Allocation of Multiple Resources in Virtualized Data Centers Wei Chen, Samuel Hargrove, Heh Miao, Liang.
Exploring VoD in P2P Swarming Systems By Siddhartha Annapureddy, Saikat Guha, Christos Gkantsidis, Dinan Gunawardena, Pablo Rodriguez Presented by Svetlana.
COCONET: Co-Operative Cache driven Overlay NETwork for p2p VoD streaming Abhishek Bhattacharya, Zhenyu Yang & Deng Pan.
Scalable On-Demand Media Streaming with Packet Loss Recovery A. Mahanti, D. L. Eager, (USask) M. K. Vernon, D S-Stukel (Wisc) Presented by Cheng Huang.
Cluster and Grid Computing Lab, Huazhong University of Science and Technology, Wuhan, China Supporting VCR Functions in P2P VoD Services Using Ring-Assisted.
SIGCOMM Outline  Introduction  Datasets and Metrics  Analysis Techniques  Engagement  View Level  Viewer Level  Lessons  Conclusion.
Gil EinzigerRoy Friedman Computer Science Department Technion.
Web Cache Replacement Policies: Properties, Limitations and Implications Fabrício Benevenuto, Fernando Duarte, Virgílio Almeida, Jussara Almeida Computer.
1 Towards Cinematic Internet Video-on-Demand Bin Cheng, Lex Stein, Hai Jin and Zheng Zhang HUST and MSRA Huazhong University of Science & Technology Microsoft.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Video Streaming over Cooperative Wireless Networks Mohamed Hefeeda (Joint.
1 CS 425 Distributed Systems Fall 2011 Slides by Indranil Gupta Measurement Studies All Slides © IG Acknowledgments: Jay Patel.
A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.
HUAWEI TECHNOLOGIES CO., LTD. Page 1 Survey of P2P Streaming HUAWEI TECHNOLOGIES CO., LTD. Ning Zong, Johnson Jiang.
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
Othman Othman M.M., Koji Okamura Kyushu University 1.
Multicast instant channel change in IPTV systems 1.
OMFS An Object-Oriented Multimedia File System for Cluster Streaming Server CHENG Bin, JIN Hai Cluster & Grid Computing Lab Huazhong University of Science.
Fast Crash Recovery in RAMCloud. Motivation The role of DRAM has been increasing – Facebook used 150TB of DRAM For 200TB of disk storage However, there.
Paper Survey of DHT Distributed Hash Table. Usages Directory service  Very little amount of information, such as URI, metadata, … Storage  Data, such.
A Simple Model for Analyzing P2P Streaming Protocols Zhou Yipeng Chiu DahMing John, C.S. Lui The Chinese University of Hong Kong.
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
1 Push-to-Peer Video-on-Demand System. 2 Abstract Content is proactively push to peers, and persistently stored before the actual peer-to-peer transfers.
A simple model for analyzing P2P streaming protocols. Seminar on advanced Internet applications and systems Amit Farkash. 1.
A Measurement Based Memory Performance Evaluation of Streaming Media Servers Garba Isa Yau and Abdul Waheed Department of Computer Engineering King Fahd.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
SHADOWSTREAM: PERFORMANCE EVALUATION AS A CAPABILITY IN PRODUCTION INTERNET LIVE STREAM NETWORK ACM SIGCOMM CING-YU CHU.
Peer-to-Peer Video Systems: Storage Management CS587x Lecture Department of Computer Science Iowa State University.
Improving QoS in BitTorrent-like VoD Systems Yan Yang Alix L.H. Chow Leana Golubchik Dannielle Bragg Univ. of Southern California Harvard University InfoCom.
Inside the New Coolstreaming: Principles, Measurements and Performance Implications Bo Li, Susu Xie, Yang Qu, Gabriel Y. Keung, Chuang Lin, Jiangchuan.
Performance Limitations of ADSL Users: A Case Study Matti Siekkinen, University of Oslo Denis Collange, France Télécom R&D Guillaume Urvoy-Keller, Ernst.
An Analysis of Internet Content Delivery Systems 19 rd November, 2007 Youngsub CSE, SNU.
Challenges, Design and Analysis of a Large-scale P2P-VoD System Yan Huang, Tom Z. J. Fu, Dah-Ming Chiu, John C. S. Lui and Cheng Huang Chinese University.
Presenter: Yue Zhu, Linghan Zhang A Novel Approach to Improving the Efficiency of Storing and Accessing Small Files on Hadoop: a Case Study by PowerPoint.
Video On Demand.
Small Is Not Always Beautiful
Presentation transcript:

A Measurement Study of a Peer-to-Peer Video-on-Demand System Bin Cheng 1, Xuezheng Liu 2, Zheng Zhang 2 and Hai Jin 1 1 Huazhong University of Science and Technology 2 Microsoft Research Asia IPTPS 2007, Feb

Motivation  VoD is every coach potato’s dream Select anything, start at any time, jump to anywhere  Centralized VoD is costly Servers, bandwidth, contents (  )  P2P VoD is attractive, but challenging: Harder than streaming: no single stream; unpredictable, multiple “swarms” Harder than file downloading: globally optimal (e.g. “rarest first”) policy inapplicable VoD is a superset of file downloading and streaming

Main Contribution  Detailed measurement of a real, deployed P2P VoD system What do we measure?  E.g. What does it mean that a system delivers good UX? How far off are we from an ideal system? How does users behave? Etc. Etc…  Problems spotted There is a great tension between scalability and UX Network heterogeneity is an issue  Is P2P VoD a luxury that poor peers cannot afford?

Outline Motivation  System background: GridCast  Measurement methodology  Evaluation Overall performance User behavior and UXexperience  Conclusions

GridCast Overview  Tracker server Index all joined peers  Source server Stores a copy for every video file  Web portal Provide channel list  Peer Feed data to player Cache all fetched data of the current file Exchange data with others channel list Initial neighbor list source web tracker

One Overlay per Channel  Finding the partners Get the initial content-closer set from the tracker when joining Periodically gossip with some near- & far-neighbors (30s) Look up new near-neighbors from the current neighbors when seeking Refresh the tracker every 5minutes t

Scheduling (every 10s) Current position Fetch the next 200 seconds from partners (if they have them) Feed to the player Fetch the next 10 seconds from the source server if no partners have them If bandwidth budget allows, fetch the rarest anchor from the source server or partners next 200 seconds next 10 seconds

Anchor Prefetching  Anchors are used to improve seek latency Each anchor is a segment of 10 seconds Anchors are 5 minutes apart Playhead adjusted to the nearest anchor (if present) 5 Minutes10s

DataSet Summary Log duration Sept. & Oct Number of visited users About 20,000 Percent of CERNET users 98% Percent of no-CERNET users Netcom: 1% Unicom: 0.6% Unicom: 0.4% Percent of NAT users 22.8% Maximal online users More than 360 Number of sessions About 250,000 Number of videos About 1,200 channels Average Code rate 500~600kbps Movie length Mostly about one hour Total bytes from the source server 11,420GB Total bytes played by peers 15,083GB

System Setup  GridCast was deployed since May 2006 The tracker server and the Web server share one machine One source server with 100Mb, 2GB Memory and 1 TB disk  Popularity keeps on climbing up; in Dec 2006 – Users : 91K; sessions: 290K; total bytes from server: 22TB  Peer logs collected at the tracker (30s) Latency, jitter, buffer map and anchor usage Sep-log and Oct-log w/o and w/ log, respectively  Just a matter of switch the codepath as the peer joins in  The source server keeps other statistics (e.g. total bytes served)

Experimental Results  Overall performance  User experience  Anchor prefetching scheme

Strong Diurnal Pattern  Hot time vs. cold time Hot time (10:00 ~24:00) Cold time (0:00 ~ 10:00)  Two peaks After lunch time & before midnight Higher at weekends or holidays

Scalability  Ideal model: only the lead peer fetches from the source server  cs model: all data from the source server Significantly decreases the source server load (against cs), especially in hot time. Follows quite closely the ideal curve. # of active channel increase 3x from cold to hot – the long tail effect!

Overall Performance  Scalability over hour of day With the same bandwidth consuming as the centralized server, GridCast can support more users. From cold time to hot time, we get more benefit from the p2p sharing, but the source server load also increases. In general, we get 28% improvement compared with cs. The improvement can be higher when the scale becomes larger or each peer can provide more data by caching previously-watched files. more channels are involved when the number of users increases

Understand the Ceiling  Utilization = data from peers / total fetched data Calculated from the snapshots  For the ideal model, utilization = (n-1)/n n is # of users in a session; or concurrency  GridCast achieves the ideal when n is large Why?

Why do we fall short (when n is small)  The peer cannot get the content if: It’s only available from the server (missing content); caused by random seeks It exists in disconnected peers; caused by NAT Its partners do not have enough bandwidth missing content dominates for those unpopular files

UX: latency  Startup Latency ( 70 % < 5s, 90 % < 10s )  Seek latency ( 70% < 3.5s, 90% < 8s )  Seek latency is smaller: There is a 2-second delay to create TCP connections with initial partners Short seeks hit cached data

UX: jitter  For sessions with 5 minutes, 72.3% has not any jitter  For sessions with 40 minutes, 40.6% has not any jitter  Avg. delayed data: 3~4%

Reasons for Bad UX  Network capacity CERNET to CERNET: >100KB/s Non-CERNET to Non-CERNET: 20~50KB/s CERNET to Non-CERNET: 4-5KB/s Bad UX in Non-CERNET region might have prevented swarm to form

Reasons for Bad UX (cont.)  Server stress and UX is inversely correlated Hot time -> lots of active channels -> long tail -> high server stress -> bad UX Most pronounced for movies at the tail (next slide)

UX Correlation with Concurrency  Higher concurrency: Reduces both startup and seek latencies Reduces amount of jitters  Getting close to that of cold time

User Seek Behavior  Seek behavior (Without anchor) FORWARD  BACKWARD  Short seeks dominate (80% within 500seconds) BACKWORAD:FORWARD ~= 3:7

Seek Behavior vs. Popularity  Fewer seeks in more popular channels  More popular channels usually have longer sessions  So: stop making bad movies

Benefit of Anchor Prefetching  Significant reduction of seek latency FORWARD seeks get more benefit (seeks < 1s jump from 33% to 63%)  “next-anchor first” is statistically optimal from any one peer’s point of view “rarest-first” is globally optimal in reducing the load of the source server (sees 30% prefetched but unused

Anchor prefetching  Cost Anchor utilization = played anchors / fetched anchors More seek, higher utilization Longer duration, higher utilization Avg. utilization = 70%

Conclusions  A few things are not new: Diurnal pattern; the looooooooong tail of content  A few things are new: Seeking behaviors (e.g. 7:3 split of forward/backward seeks; 80% seeks are short etc.) The correlation of UX to source server stress and concurrency  A few things are good to know: Even moderate concurrency improves system utilization and UX Simple prefetching helps to improve seeking performance  A few things remain to be problematic The looooooong tail Network heterogeneity  A lot remain to be done (and are being done) Multi-file caching and proactive replication

  Thank you! Q&A