Vengatanathan Krishnamoorthi, Niklas Carlsson

Slides:



Advertisements
Similar presentations
Imbalanced data David Kauchak CS 451 – Fall 2013.
Advertisements

IEEE JSAC Special Issue Adaptive Media Streaming Submissions by April 1 Details at
Saamer Akhshabi, Constantine Dovrolis Georgia Institute of Technology An Experimental Evaluation of Rate Adaptation Algorithms in Adaptive Video Streaming.
Confused, Timid, and Unstable: Picking a Video Streaming Rate is Hard Published in 2012 ACM’s Internet Measurement Conference (IMC) Five students from.
Doc.: IEEE /0604r1 Submission May 2014 Slide 1 Modeling and Evaluating Variable Bit rate Video Steaming for ax Date: Authors:
Lava: A Reality Check of Network Coding in Peer-to-Peer Live Streaming Mea Wang, Baochun Li Department of Electrical and Computer Engineering University.
Suphakit Awiphan, Takeshi Muto, Yu Wang, Zhou Su, Jiro Katto
Dynamic Adaptive Streaming over HTTP2.0. What’s in store ▪ All about – MPEG DASH, pipelining, persistent connections and caching ▪ Google SPDY - Past,
Measurements of Congestion Responsiveness of Windows Streaming Media (WSM) Presented By:- Ashish Gupta.
Source-Adaptive Multilayered Multicast Algorithms for Real- Time Video Distribution Brett J. Vickers, Celio Albuquerque, and Tatsuya Suda IEEE/ACM Transactions.
Wide-scale Botnet Detection and Characterization Anestis Karasaridis, Brian Rexroad, David Hoeflin.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
1 A Framework for Lazy Replication in P2P VoD Bin Cheng 1, Lex Stein 2, Hai Jin 1, Zheng Zhang 2 1 Huazhong University of Science & Technology (HUST) 2.
Junxian Huang 1 Feng Qian 2 Yihua Guo 1 Yuanyuan Zhou 1 Qiang Xu 1 Z. Morley Mao 1 Subhabrata Sen 2 Oliver Spatscheck 2 1 University of Michigan 2 AT&T.
Receiver-driven Layered Multicast Paper by- Steven McCanne, Van Jacobson and Martin Vetterli – ACM SIGCOMM 1996 Presented By – Manoj Sivakumar.
1 An SLA-Oriented Capacity Planning Tool for Streaming Media Services Lucy Cherkasova, Wenting Tang, and Sharad Singhal HPLabs,USA.
Physical Layer Informed Adaptive Video Streaming Over LTE Xiufeng Xie, Xinyu Zhang Unviersity of Winscosin-Madison Swarun KumarLi Erran Li MIT Bell Labs.
Smita Vijayakumar Qian Zhu Gagan Agrawal 1.  Background  Data Streams  Virtualization  Dynamic Resource Allocation  Accuracy Adaptation  Research.
The NIProxy: a Flexible Proxy Server Supporting Client Bandwidth Management and Multimedia Service Provision Maarten Wijnants Wim Lamotte.
Congestion Control in CSMA-Based Networks with Inconsistent Channel State V. Gambiroza and E. Knightly Rice Networks Group
Wide-scale Botnet Detection and Characterization Anestis Karasaridis, Brian Rexroad, David Hoeflin In First Workshop on Hot Topics in Understanding Botnets,
OPERETTA: An Optimal Energy Efficient Bandwidth Aggregation System Karim Habak†, Khaled A. Harras‡, and Moustafa Youssef† †Egypt-Japan University of Sc.
A Simple Model for Analyzing P2P Streaming Protocols Zhou Yipeng Chiu DahMing John, C.S. Lui The Chinese University of Hong Kong.
Peer-to-Peer Streaming of Scalable Video in Future Internet Application Speaker : 吳靖緯 MA0G0101 Communications Magazine, IEEE, On page(s): 128.
A simple model for analyzing P2P streaming protocols. Seminar on advanced Internet applications and systems Amit Farkash. 1.
Understanding the Impact of Network Dynamics on Mobile Video User Engagement M. Zubair Shafiq (Michigan State University) Jeffrey Erman (AT&T Labs - Research)
Emir Halepovic, Jeffrey Pang, Oliver Spatscheck AT&T Labs - Research
SHADOWSTREAM: PERFORMANCE EVALUATION AS A CAPABILITY IN PRODUCTION INTERNET LIVE STREAM NETWORK ACM SIGCOMM CING-YU CHU.
Development of a QoE Model Himadeepa Karlapudi 03/07/03.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
© 2016 TM Forum | 1 Virtual CPE Platform in the Home.
Machine Learning: Ensemble Methods
Experience Report: System Log Analysis for Anomaly Detection
P.Demestichas (1), S. Vassaki(2,3), A.Georgakopoulos(2,3)
Accelerating Peer-to-Peer Networks for Video Streaming
Instructor Materials Chapter 6: Quality of Service
OPERATING SYSTEMS CS 3502 Fall 2017
Purdue University, Georgia Institute of Technology, AT&T Labs Research
DASH2M: Exploring HTTP/2 for Internet Streaming to Mobile Devices
University of Maryland College Park
Introduction to Load Balancing:
Action Breakout Session
Fast Pattern-Based Throughput Prediction for TCP Bulk Transfers
Outline Introduction Related Work
Injong Rhee ICMCS’98 Presented by Wenyu Ren
3 | Analyzing Server, Network, and Client Health
Aditya Mavlankar and David Varodayan
A First Look at Performance of TV Streaming Sticks
Scheduling Jobs Across Geo-distributed Datacenters
Server Allocation for Multiplayer Cloud Gaming
DASH Background Server provides multiple qualities of the same video
Video through a Crystal Ball:
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 6: Quality of Service Connecting Networks.
Amogh Dhamdhere, Hao Jiang and Constantinos Dovrolis
Pong: Diagnosing Spatio-Temporal Internet Congestion Properties
Evaluating Transaction System Performance
Advanced Computer Networks
Smita Vijayakumar Qian Zhu Gagan Agrawal
iSRD Spam Review Detection with Imbalanced Data Distributions
Performance Evaluation of Computer Networks
IP Control Gateway (IPCG)
Performance Evaluation of Computer Networks
Performance and Scalability Issues of Multimedia Digital Library
Outline System architecture Current work Experiments Next Steps
Algorithms for Selecting Mirror Sites for Parallel Download
Kostas Kolomvatsos, Christos Anagnostopoulos
Modeling and Evaluating Variable Bit rate Video Steaming for ax
Conviva & Sky A real-world OTT video Quality of Experience case study
Yu Guan, Chengyuan Zheng, Xinggong Zhang, Zongming Guo, Junchen Jiang
Presentation transcript:

Vengatanathan Krishnamoorthi, Niklas Carlsson BUFFEST: Predicting Buffer Conditions and Real-time Requirements of HTTP(S) Adaptive Streaming Clients Vengatanathan Krishnamoorthi, Niklas Carlsson MMSys’17 Presented by Wenyu Ren

Motivation Network operators want to properly provision their networks and provide clients with the best possible service. The QoE can vary significantly as networks go through different utilization phases and data flows compete for bandwidth. It is important to understand user experience and real- time requirements associated with different network flows.

Motivation Flow classification In the past: classify flows based on the underlying services they provide E.g. real-time streaming vs. peer-to-peer downloads Since video streaming is responsible for the majority of today’s network traffic, classifying all video flows into a single class is not helpful. Ideally: continually and individually (re)classify video flows based on their clients’ current buffer conditions Streaming clients with different buffer conditions have highly heterogeneous real-time requirements. The requirements change over the duration of a playback session. Stalls is the factor that has the largest impact on clients’ QoE. t

Problem Description Goal: classifying video streaming lows based on the clients’ current buffer conditions. Challenges The use of multiple encodings allows efficient quality adaptation but also decreases correlation between packet-level throughput and buffer conditions. Usage of HTTPS prevents operators from observing HTTP requests for video chunks and associated metadata, restricting classifiers to TCP/IP packet-level information.

Buffer Condition Estimation Capturing the buffer occupancy of clients is important when trying to understand the clients’ playback experience. Playback stalls are key indicators of user satisfaction and significantly impact video abandonment. Stalls typically occur due to empty playback buffers. Identification of clients with low buffer conditions can also be used to improve users’ QoE. Knowledge with different granularity can be used for capacity planning, offloading and adaptation of resource allocation and availability, etc.

Candidate Approaches Client-side instrumentation Most popular streaming services using their own proprietary players. Significant restriction on accessible parameters. Introduce additional load on the system. Cannot evaluate non-instrumented clients. Passive traffic monitoring (paper’s approach) More scalable and allows us to consider non-instrumented clients. Only provides approximations. Explicit inform Require collaboration between content providers and network operators. Can be manipulated by clients wanting preferential treatment.

Classification Framework Overview Tradeoff Accuracy & Complexity vs. Access to Information Event-based buffer emulation module Online classifiers Uses a trusted proxy design for extracting as much application-layer and metadata information as possible. Make decisions based on metrics easily calculated in real time from TCP/IP packet-level traces.

Classification Framework Overview This figure summarizes the BUFFEST framework and its components, when used for online classification. Here, a buffer emulation module is used for automated labeling of sample lows. To extract HTTP information from HTTPS connections, the emulator module relies on a trusted proxy. By simultaneously calculating summary metrics on the corresponding packet-level data we can create labeled training datasets, which we use to extract online classification rules. The online classifiers are then applied with these rules on the encrypted packet-level data.

Background YouTube streaming service During a playback session, the client typically downloads video from one CDN server. YouTube clients also communicate with a statistics server that collects client-side playback statistics. With YouTube, each encoding of the video is given a separate identifier and range requests are instead used to download chunk sequences.

Buffer Emulation Module Proxy measurements Application-level information for each HTTP request and response logged by a trusted proxy Packet-level information Manifest file of each video and the video’s metadata with chunk boundaries for each video quality encoding Information about all statistical reports Emulating the buffer at the NIC Using the above data, our emulator module reconstructs the buffer conditions of a player, assuming it gets access to each chunk as soon as the chunk is fully downloaded. The emulator keeps track of the current state (i.e., "buffering", "playing", or "stalled") and the next event that can change the player’s state, including chunk download completions and the buffer dropping to zero (causing stalls).

Buffer Emulation Module Comparison of the buffer estimated at the NIC (using emulator) and observed at the player (API). The emulator captures the general buffer dynamics. The figure shows an example comparison between the buffer occupancy reported by the API and the emulated buffer (observed at the NIC) during a streaming session of a 5.5 minute long video. In the example, the client has relatively good bandwidth conditions, allowing it to download chunks at a high quality for most of the session. As desired, the two buffer curves (for the emulated buffer and the actual buffer) nicely follow each other, showing that the emulator captures the general buffer dynamics. Optional: The reasons for the slightly larger estimates are that in this example the time instances when playback starts are almost the same for the emulated player (whose startup instance partially can be adjusted with the help of statistical reports) and the real player, and the NIC always receives the chunks before the player (since the players experience additional operating system (OS) related delays, for example).

Buffer Emulation Module CDF of the differences in buffer sizes observed at the NIC (emulator) and the player (API). There are a few larger differences. Mostly due to differences in when chunks are seen on the NIC (emulated player) and by the API (real player). OS-related delays Player internals We next take a closer look at the difference in buffer sizes observed at the NIC (emulated) and the player (using API). The figure shows the cumulative distribution function (CDF) of the difference between the two buffer sizes, measured at 1 second intervals during playback, over a large number of playback sessions. Although the differences typically are relatively small, we observe a few larger differences.   Most of the observed differences are due to differences in when chunks are seen on the NIC (emulated player) and by the API (real player). First, there is a delay between when a chunk is fully downloaded, as seen on the NIC, and when it is available at the player. This is in part due to OS-related delays. Second, a more subtle but highly noticeable difference occurs due to how and when the player receives consecutive chunks within a range request.

Buffer Emulation Module Coarse-grained classification The emulator can be used as a good estimator of the coarse- grained buffer conditions of the player itself, despite the OS-related delays. While the above OS and player internals make the exact buffer conditions impossible to capture using only network data, we have found that the framework can distinguish clients with low buffer conditions from other clients. To illustrate how this technique can achieve the goal, the figure shows the actual buffer conditions for clients that the emulator estimates will have low buffer conditions and intermediate buffer conditions. Note that there is a clear separation between these two categories and most of the cases that are misclassified are due to overestimations. These results show that the emulator can be used as a good estimator of the coarse-grained buffer conditions of the player itself, despite the OS-related delays.

Online Classification Metrics The exponential weighted moving average (EWMA) for different window weights for both the per-second throughput 𝑋 𝛼 and the inter- request times 𝐼 𝛼 . How long time ( 𝑇 𝛼 𝑋 ) that the weighted throughput metric 𝑋 𝛼 has been below a threshold 𝑋 𝛼 ∗ , and for how long time ( 𝑇 𝛼 𝐼 ) that the weighted inter-request time metric 𝐼 𝛼 has been above a threshold 𝐼 𝛼 ∗ . Classifiers Threshold-based classifiers The best throughput-based classifiers almost always outperform the inter-request- based classifiers. Machine learning classifiers Tested the techniques based on decision trees and Support Vector Machine (SVM). The two-class boosted decision tree classifier provided the best scores.

Online Classification Prediction evaluation Threshold-based classifiers The two figures show the CDF of the buffer conditions as seen when the threshold-based online classifiers predicted low buffer conditions, and put them in contrast to the conditions as observed over all sessions. The substantial differences in the CDFs are encouraging as it shows that relatively simple classifiers can be useful in predicting low buffer conditions even when the traffic is encrypted.

Online Classification Prediction evaluation Boosted decision tree classifiers Overall, these results show that the boosted decision tree classifier provides a good tool to predict instances with low buffer conditions. By careful selection of B* we can also achieve a good tradeoff between the number of lagged low buffer instances and the accuracy with which these are reported.

Conclusion BUFFEST: classification framework that includes both an event-based buffer emulator module and training modules for online classifiers. The emulator module leverages a trusted proxy method to extract required information about the video flows delivered to the clients, allowing us to identify chunk boundaries and track buffer conditions. Using the emulator for automated labeling of network traces provides an effective training framework for simpler online classifiers that only use TCP/IP packet-level information.

Pros and Cons Pros This paper is the first to provide automated re-classification of encrypted streaming flows based on the expected buffer conditions and urgency of different clients. The buffer emulation model is validated by an instrumented YouTube client, synchronized screen captures of videos sessions using a different commercial streaming service, as well as YouTube’s statistical reports. The flexible design makes the framework easily extendable for other services. Cons There is no implementation demonstrating the real benefit of the framework. There is no time overhead analysis of the online classifiers.