Yale LANS ShadowStream: Performance Evaluation as a Capability in Production Internet Live Streaming Networks Chen Tian Richard Alimi Yang Richard Yang.

Slides:



Advertisements
Similar presentations
Network Resource Broker for IPTV in Cloud Computing Lei Liang, Dan He University of Surrey, UK OGF 27, G2C Workshop 15 Oct 2009 Banff,
Advertisements

On Scheduling Vehicle-Roadside Data Access Yang Zhang Jing Zhao and Guohong Cao The Pennsylvania State University.
1 ShadowStream: Performance Experimentation as a Capability in Production Internet Live Streaming Networks Present by: Chen Alexandre Tian (HUST) Richard.
Hadi Goudarzi and Massoud Pedram
X stream Project proposal. Project goals: Students Students: Academic Supervisor Academic Supervisor: Advisors: Developing and Implementing a large scale.
Trace Analysis Chunxu Tang. The Mystery Machine: End-to-end performance analysis of large-scale Internet services.
Kangaroo: Video Seeking in P2P Systems Xiaoyuan Yang †, Minas Gjoka ¶, Parminder Chhabra †, Athina Markopoulou ¶, Pablo Rodriguez † † Telefonica Research.
Doc.: IEEE /0604r1 Submission May 2014 Slide 1 Modeling and Evaluating Variable Bit rate Video Steaming for ax Date: Authors:
Distributed Multimedia Systems
Using DSVM to Implement a Distributed File System Ramon Lawrence Dept. of Computer Science
TRANSIMS Research and Deployment Project TRACC TSM Staff Dr. Vadim Sokolov Dr. Joshua Auld Dr. Kuilin Zhang Mr. Michael Hope.
CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems Zhiming Shen, Sethuraman Subbiah, Xiaohui Gu, John Wilkes.
Resilient Peer-to-Peer Streaming Paper by: Venkata N. Padmanabhan Helen J. Wang Philip A. Chou Discussion Leader: Manfred Georg Presented by: Christoph.
Technical Architectures
Rheeve: A Plug-n-Play Peer- to-Peer Computing Platform Wang-kee Poon and Jiannong Cao Department of Computing, The Hong Kong Polytechnic University ICDCSW.
Network Coding for Large Scale Content Distribution Christos Gkantsidis Georgia Institute of Technology Pablo Rodriguez Microsoft Research IEEE INFOCOM.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
Shadow Configurations: A Network Management Primitive Richard Alimi, Ye Wang, Y. Richard Yang Laboratory of Networked Systems Yale University.
Shadow Configurations: A Network Management Primitive Richard Alimi, Ye Wang, and Y. Richard Yang Laboratory of Networked Systems Yale University February.
1March -05 Jiangchuan Liu with Xinyan Zhang, Bo Li, and T.S.P.Yum Infocom 2005 CoolStreaming/DONet: A Data-Driven Overlay Network for Peer-to-Peer Live.
Asper School of Business University of Manitoba Systems Analysis & Design Instructor: Bob Travica System architectures Updated: November 2014.
Understanding Mesh-based Peer-to-Peer Streaming Nazanin Magharei Reza Rejaie.
CS218 – Final Project A “Small-Scale” Application- Level Multicast Tree Protocol Jason Lee, Lih Chen & Prabash Nanayakkara Tutor: Li Lao.
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Matei Ripeanu.
Inferring the Topology and Traffic Load of Parallel Programs in a VM environment Ashish Gupta Peter Dinda Department of Computer Science Northwestern University.
New Challenges in Cloud Datacenter Monitoring and Management
Replay Debugging for Distributed Systems Dennis Geels, Gautam Altekar, Ion Stoica, Scott Shenker.
Yale LANS Optimizing Cost and Performance for Content Multihoming Hongqiang Harry Liu Ye Wang Yang Richard Yang Hao Wang Chen Tian Aug. 16, 2012 Hongqiang.
TitleEfficient Timing Channel Protection for On-Chip Networks Yao Wang and G. Edward Suh Cornell University.
COnvergence of fixed and Mobile BrOadband access/aggregation networks Work programme topic: ICT Future Networks Type of project: Large scale integrating.
Chapter 13 Starting Design: Logical Architecture and UML Package Diagrams.
Exploring VoD in P2P Swarming Systems By Siddhartha Annapureddy, Saikat Guha, Christos Gkantsidis, Dinan Gunawardena, Pablo Rodriguez Presented by Svetlana.
Providing Controlled Quality Assurance in Video Streaming across the Internet Yingfei Dong, Zhi-Li Zhang and Rohit Rakesh Computer Networking and Multimedia.
Advanced Network Architecture Research Group 2001/11/149 th International Conference on Network Protocols Scalable Socket Buffer Tuning for High-Performance.
Institute of Computer and Communication Network Engineering OFC/NFOEC, 6-10 March 2011, Los Angeles, CA Lessons Learned From Implementing a Path Computation.
Monitoring Latency Sensitive Enterprise Applications on the Cloud Shankar Narayanan Ashiwan Sivakumar.
Michael Ernst, page 1 Collaborative Learning for Security and Repair in Application Communities Performers: MIT and Determina Michael Ernst MIT Computer.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Experience with Using a Performance Predictor During Development a Distributed Storage System Tale Lauro Beltrão Costa *, João Brunet +, Lile Hattori #,
The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1.
Software-defined Networking Capabilities, Needs in GENI for VMLab ( Prasad Calyam; Sudharsan Rajagopalan;
1 Towards Cinematic Internet Video-on-Demand Bin Cheng, Lex Stein, Hai Jin and Zheng Zhang HUST and MSRA Huazhong University of Science & Technology Microsoft.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Video Streaming over Cooperative Wireless Networks Mohamed Hefeeda (Joint.
An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.
A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.
HUAWEI TECHNOLOGIES CO., LTD. Page 1 Survey of P2P Streaming HUAWEI TECHNOLOGIES CO., LTD. Ning Zong, Johnson Jiang.
Advanced Network Architecture Research Group 2001/11/74 th Asia-Pacific Symposium on Information and Telecommunication Technologies Design and Implementation.
Doc.: IEEE /0648r0 Submission May 2014 Chinghwa Yu et. al., MediaTekSlide 1 Performance Observation of a Dense Campus Network Date:
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
1 Wide Area Network Emulation on the Millennium Bhaskaran Raman Yan Chen Weidong Cui Randy Katz {bhaskar, yanchen, wdc, Millennium.
Review of Parnas’ Criteria for Decomposing Systems into Modules Zheng Wang, Yuan Zhang Michigan State University 04/19/2002.
SHADOWSTREAM: PERFORMANCE EVALUATION AS A CAPABILITY IN PRODUCTION INTERNET LIVE STREAM NETWORK ACM SIGCOMM CING-YU CHU.
REST Style Large Measurement Platform Protocol draft-liu-lmap-rest-00.txt Dapeng Liu(Presenter) Lingli Deng China Mobile Shihui Duan CATR Cathy Li China.
Load Rebalancing for Distributed File Systems in Clouds.
UCI Large-Scale Collection of Application Usage Data to Inform Software Development David M. Hilbert David F. Redmiles Information and Computer Science.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Building PetaScale Applications and Tools on the TeraGrid Workshop December 11-12, 2007 Scott Lathrop and Sergiu Sanielevici.
MicroGrid Update & A Synthetic Grid Resource Generator Xin Liu, Yang-suk Kee, Andrew Chien Department of Computer Science and Engineering Center for Networked.
1 Scalability and Accuracy in a Large-Scale Network Emulator Nov. 12, 2003 Byung-Gon Chun.
Introduction to Operating Systems
Shadow Configurations: A Network Management Primitive
Self Healing and Dynamic Construction Framework:
The Impact of Replacement Granularity on Video Caching
Suman Bhunia and Shamik Sengupta
ShadowStream: Performance Evaluation as a Capability in Production Internet Live Stream Network CS598 Advanced Multimedia Systems Fall 2017 Prof. Klara.
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
Introduction to Operating Systems
Software models - Software Architecture Design Patterns
Towards Predictable Datacenter Networks
Presentation transcript:

Yale LANS ShadowStream: Performance Evaluation as a Capability in Production Internet Live Streaming Networks Chen Tian Richard Alimi Yang Richard Yang David Zhang Aug. 16, 2012 Chen Tian Richard Alimi Yang Richard Yang David Zhang Aug. 16, 2012

Yale LANS Live Streaming is a Major Internet App

Yale LANS Poor Performance After Updates Lacking sufficient evaluation before release

Yale LANS Don’t We Already Have … Emulab PlanetLab …. Testbeds Gradually rolling out Testing Channels They are not enough !

Yale LANS Live Streaming Background We focus on hybrid live streaming systems: CDN + P2P

Yale LANS Live Streaming Background We focus on hybrid live streaming systems: CDN + P2P

Yale LANS With Connection Limit Testbed: Misleading Results at Small Scale Production Default Small-ScaleLarge-Scale Piece Missing Ratio 3.7% 0.7% 64.8% 3.5% Live streaming performance can be highly non-linear.

Yale LANS Testbed: Misleading Results due to Missing Features Piece Missing Ratio # Timed-out Requests # Received Duplicate Packets # Received Outdated Packets LAN Style (Same BW) 1.5% ADSL Style (Same BW) 7.3% Realistic features can have large performance impacts.

Yale LANS Testing Channel: Lacking QoE Protection

Yale LANS Testing Channel: Lacking Orchestration What we want is … What we have is …

Yale LANS ShadowStream Design Goal Protection of real user QoE Transparent orchestration of testing conditions Use production network for testing with

Yale LANS Roadmap Motivation Protection Design Orchestration Design Evaluations Conclusions and Future Work

Yale LANS Protection: Basic Scheme Note: R denotes Repair, E denotes Experiment

Yale LANS Example Illustration: E Success

Yale LANS Example Illustration: E Success

Yale LANS Example Illustration: E Success

Yale LANS Example Illustration: E Fail

Yale LANS Example Illustration: E Fail

Yale LANS Example Illustration: E Fail

Yale LANS Example Illustration: E Fail

Yale LANS How to Repair? Choice 1: dedicated CDN resources (R=rCDN) –Benefit: simple –Limitations requires resource reservation, –e.g., 100,000 clients x 1 Mbps = 100 Gbps may not work well when there is network bottleneck

Yale LANS How to Repair? Choice 2: production machine (R=production) –Benefit 1: Larger resource pool –Benefit 2: Fine-tuned algorithms –Benefit 3: A unified approach to protection & orchestration (later)

Yale LANS R= Production: Resource Competition Competition leads to underestimation on Experiment performance Repair and Experiment compete on client upload bandwidth

Yale LANS R= Production: Misleading Result missing ratio x+y=θ 0 accurate result repair demand misleading result

Yale LANS Putting Together: PCE

Yale LANS Putting Together: PCE

Yale LANS Implementing PCE Streaming machine transparent of testing state Streaming machines are isolated from each other Requirements

Yale LANS Implementing PCE: base observation A simple partitioned sliding window to partition downloading tasks among PCE automatically unavailable piece missing responsibility transferred

Yale LANS Client Components

Yale LANS Roadmap Motivation Protection Design Orchestration Design Evaluations Conclusions and Future Work

Yale LANS Orchestration Challenges How to start an Experiment streaming machine –Transparent to real viewers How to control the arrival/departure of each Experiment machine in a scalable way

Yale LANS Transparent Orchestration Idea

Yale LANS Transparent Orchestration Idea

Yale LANS Transparent Orchestration Idea

Yale LANS Distributed Activation of Testing Orchestrator distributes parameters to clients Each client independently generates its arrival time according to the same distribution function F(t) Together they achieve global arrival pattern –Cox and Lewis Theorem

Yale LANS Orchestrator Components

Yale LANS Roadmap Motivation Protection Design Orchestration Design Evaluations Conclusions and Future Work

Yale LANS Software Implementation Compositional Runtime –Modular design, including scheduler, dynamic loading of blocks, etc. –3400 lines of code Pre-packaged blocks –HTTP integration, UDP sockets and debugging –500 lines of code Live streaming machine –4200 lines of code

Yale LANS Experimental Opportunities

Yale LANS Protection and Accuracy Virtual Playpoint Real Playpoint Buggy 8.73% N/A R=rCDN 8.72% 0% R=rCDN w/ bottleneck 8.81% 5.42% Piece Missing Ratio

Yale LANS Protection and Accuracy Virtual Playpoint Real Playpoint PCE bottleneck 9.13% 0.15% PCE w/ higher bottleneck 8.85% 0% Piece Missing Ratio

Yale LANS Orchestration: Distributed Activation

Yale LANS Utility on Top: Deterministic Replay Event Message Random seeds Control non-deterministic inputsPractical per-client log size Log Size 100 clients; 650 seconds223KB 300 clients; 1,800 seconds714KB

Yale LANS Roadmap Motivation Protection Design Orchestration Design Evaluations Conclusions and Future Work

Yale LANS Contributions Design and implementation of a novel live streaming network that introduces performance evaluation as an intrinsic capability in production networks –Scalable (PCE) protection of QoE despite large- scale Experiment failures –Transparent orchestration for flexible testing

Yale LANS Future Work Large-scale deployment and evaluation Apply the Shadow (Experiment->Validation- >Repair) scheme to other applications Extend the Shadow (Experiment->Validation- >Repair) scheme –E.g., repair does not mean do the same job as Experiment, as long as it masks visible failures

Yale LANS Adaptive Rate Streaming Repair Accuracy Protected QoE Protection Overhead Follow 1.26x 1.59x 1.49 Kbps Base 1.26x 1.42x 3.69 Kbps Adaptive 1.26x 1.58x 1.39 Kbps

Yale LANS Thank you!

Yale LANS Questions?

Yale LANS backup

Yale LANS Poor Performance After Updates Lacking sufficient evaluation before release

Yale LANS Related Work Debugging and evaluation of distributed systems –e.g., ODR, Friday, DieCast Based on a key observation Allows scenarios customization FlowVisor –Allocate a fixed portion of tasks and resources

Yale LANS Why Not Testing Channel: orchestration What we want is … What we have is …

Yale LANS Experiment Specification & Triggering A testing should define: –One or more classes of clients –Client-wide arrival rate functions –Client-wide life duration function Triggering Condition: prediction based

Yale LANS Experiment Transition Connectivity Transition Playbuffer State Transition More details in the paper: Replace Early Departed Clients, Independent Departure Control

Yale LANS ShadowStream Design Goal Production networks By adding protection and orchestration into production networks, we have …. Live Testing ! Testbeds

Yale LANS State of Art: Hybrid Systems

Yale LANS Putting Together : ShadowStream The first system, in the context of live streaming, that can perform live testing with both protection and orchestration Design the Repair system that can simultaneously provide protection and experiment accuracy Fully implemented and evaluated

Yale LANS Problem: Resource Competition Repair and Experiment compete on key resource (client upload bandwidth) Competition may lead to systematic underestimation on Experiment performance How to get around ?

Yale LANS Experiment Orchestration list Experiment Specification & Triggering Independent Arrivals Control Experiment Transition Replace Early Departed Clients Independent Departure Control

Yale LANS Example Illustration

Yale LANS From Idea to System

Yale LANS Extended Works Dynamic Streaming Deterministic Replay

Yale LANS Example Illustration XX