Bandwidth Allocation in a Self-Managing Multimedia File Server Vijay Sundaram and Prashant Shenoy Department of Computer Science University of Massachusetts.

Slides:



Advertisements
Similar presentations
Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms Chenyang Lu, John A. Stankovic, Gang Tao, Sang H. Son Presented by Josh Carl.
Advertisements

Energy Efficiency through Burstiness Athanasios E. Papathanasiou and Michael L. Scott University of Rochester, Computer Science Department Rochester, NY.
Continuous Media 1 Differs significantly from textual and numeric data because of two fundamental characteristics: –Real-time storage and retrieval –High.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 14 – Introduction to Multimedia Resource Management Klara Nahrstedt Spring 2012.
T.Sharon-A.Frank 1 Multimedia Quality of Service (QoS)
Multimedia Systems As Presented by: Craig Tomastik.
Distributed Multimedia Systems
Real-Time Scheduling CIS700 Insup Lee October 3, 2005 CIS 700.
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.
Queueing Models for P2P Systems.  Extend classical queuing theory for P2P systems.  Develop taxonomy for different variations of these queuing models.
Dynamic Guard Bandwidth Scheme for Wireless Broadband Networks IEEE INFOCOM 2001.
Adaptive Video Streaming in Vertical Handoff: A Case Study Ling-Jyh Chen, Guang Yang, Tony Sun, M. Y. Sanadidi, Mario Gerla Computer Science Department,
Computer Science Deadline Fair Scheduling: Bridging the Theory and Practice of Proportionate-Fair Scheduling in Multiprocessor Servers Abhishek Chandra.
Quality of Service in IN-home digital networks Alina Albu 7 November 2003.
A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter : S.Y.Chen.
1 Multimedia Streaming via TCP: An Analytic Performance Study Bing Wang, Jim Kurose, Prashant Shenoy, Don Towsley.
RAIDs Performance Prediction based on Fuzzy Queue Theory Carlos Campos Bracho ECE 510 Project Prof. Dr. Duncan Elliot.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 11: Monitoring Server Performance.
CS 104 Introduction to Computer Science and Graphics Problems Operating Systems (4) File Management & Input/Out Systems 10/14/2008 Yang Song (Prepared.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Resource Allocation for Shared Data Centers Using Online Measurements.
Web Server Software Architectures Author: Daniel A. Menascé Presenter: Noshaba Bakht.
Little’s Theorem Examples Courtesy of: Dr. Abdul Waheed (previous instructor at COE)
A Hybrid Caching Strategy for Streaming Media Files Jussara M. Almeida Derek L. Eager Mary K. Vernon University of Wisconsin-Madison University of Saskatchewan.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 34 – Media Server (Part 3) Klara Nahrstedt Spring 2012.
Computer Science Cataclysm: Policing Extreme Overloads in Internet Applications Bhuvan Urgaonkar and Prashant Shenoy University of Massachusetts.
Exploiting Virtualization for Delivering Cloud based IPTV Services Speaker : 吳靖緯 MA0G IEEE Conference on Computer Communications Workshops.
1 Proxy-Assisted Techniques for Delivering Continuous Multimedia Streams Lixin Gao, Zhi-Li Zhang, and Don Towsley.
The Design and Performance of A Real-Time CORBA Scheduling Service Christopher Gill, David Levine, Douglas Schmidt.
Smart Learning Services Based on Smart Cloud Computing
Computer System Lifecycle Chapter 1. Introduction Computer System users, administrators, and designers are all interested in performance evaluation. Whether.
Distributed Quality-of-Service Routing of Best Constrained Shortest Paths. Abdelhamid MELLOUK, Said HOCEINI, Farid BAGUENINE, Mustapha CHEURFA Computers.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Dynamic and Decentralized Approaches for Optimal Allocation of Multiple Resources in Virtualized Data Centers Wei Chen, Samuel Hargrove, Heh Miao, Liang.
Efficient Scheduling of Heterogeneous Continuous Queries Mohamed A. Sharaf Panos K. Chrysanthis Alexandros Labrinidis Kirk Pruhs Advanced Data Management.
(C) 2009 J. M. Garrido1 Object Oriented Simulation with Java.
1 An SLA-Oriented Capacity Planning Tool for Streaming Media Services Lucy Cherkasova, Wenting Tang, and Sharad Singhal HPLabs,USA.
© Pearson Education Limited, Chapter 16 Physical Database Design – Step 7 (Monitor and Tune the Operational System) Transparencies.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
Budget-based Control for Interactive Services with Partial Execution 1 Yuxiong He, Zihao Ye, Qiang Fu, Sameh Elnikety Microsoft Research.
Chapter 3 System Performance and Models. 2 Systems and Models The concept of modeling in the study of the dynamic behavior of simple system is be able.
Quality of Service Karrie Karahalios Spring 2007.
Smita Vijayakumar Qian Zhu Gagan Agrawal 1.  Background  Data Streams  Virtualization  Dynamic Resource Allocation  Accuracy Adaptation  Research.
1 Scheduling The part of the OS that makes the choice of which process to run next is called the scheduler and the algorithm it uses is called the scheduling.
Introduction to the new mainframe © Copyright IBM Corp., All rights reserved. Chapter 4: Working with data sets.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
Erasure Coding for Real-Time Streaming Derek Leong and Tracey Ho California Institute of Technology Pasadena, California, USA ISIT
Performance evaluation on grid Zsolt Németh MTA SZTAKI Computer and Automation Research Institute.
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 30 – Media Server (Part 5) Klara Nahrstedt Spring 2009.
Chapter 3 System Performance and Models Introduction A system is the part of the real world under study. Composed of a set of entities interacting.
1 Real-Time Scheduling. 2Today Operating System task scheduling –Traditional (non-real-time) scheduling –Real-time scheduling.
“A cost-based admission control algorithm for digital library multimedia systems storing heterogeneous objects” – I.R. Chen & N. Verma – The Computer Journal.
Data Scheduling for Multi-item and transactional Requests in On-demand Broadcast Nitin Pabhu Vijay Kumar MDM 2005.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Content caching and scheduling in wireless networks with elastic and inelastic traffic Group-VI 09CS CS CS30020 Performance Modelling in Computer.
Operating System concerns for Multimedia Multimedia File Systems -Jaydeep Punde.
1 Exploiting Nonstationarity for Performance Prediction Christopher Stewart (University of Rochester) Terence Kelly and Alex Zhang (HP Labs)
NUS.SOC.CS5248 Ooi Wei Tsang 1 Course Matters. NUS.SOC.CS5248 Ooi Wei Tsang 2 Make-Up Lecture This Saturday, 23 October TR7, 1-3pm Topic: “CPU scheduling”
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 17 – QoS Classes and Setup Operations Klara Nahrstedt Spring 2014.
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
OPERATING SYSTEMS CS 3502 Fall 2017
Multimedia Systems Operating System Presentation On
CS 414 – Multimedia Systems Design Lecture 31 – Media Server (Part 5)
Klara Nahrstedt Spring 2010
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
Data Orgnization Frequently accessed data on the same storage device?
Smita Vijayakumar Qian Zhu Gagan Agrawal
Introduction to Operating Systems
Dissemination of Dynamic Data on the Internet
PMD Auto Load Balancing
Presentation transcript:

Bandwidth Allocation in a Self-Managing Multimedia File Server Vijay Sundaram and Prashant Shenoy Department of Computer Science University of Massachusetts ACM Multimedia 2001

Outline Introduction Introduction Workload Monitoring Module Workload Monitoring Module Adaptive Bandwidth Manager Adaptive Bandwidth Manager Estimating Bandwidth Requirement based on Disk Utilizations Estimating Bandwidth Requirement based on Disk Utilizations Estimating Bandwidth Requirement based on the Arrival Rate Estimating Bandwidth Requirement based on the Arrival Rate Computing the Reservations of Each Class Computing the Reservations of Each Class Self-Managing Bandwidth Allocation in a Multi-disk Server Self-Managing Bandwidth Allocation in a Multi-disk Server Experimental Result Experimental Result Conclusion Conclusion

Introduction Best-effort service: simple interference  starvation occurs when bursty workload Mutually-exclusive storage: no interference  static partition of storage space automatic partition is very hard performance is very bad Reservations: no interference  each class reserves a certain fraction of the bandwidth (i.e., R be = 1 – R rt ) performance is good (i.e., meet the deadlines of real-time requests while providing low average response time for best-effort requests) Best-effort class (e.g. Textual data, image file) needs low average response time. Real-time class (e.g. Streaming Media, video) needs to meet the deadlines of requests.

Workload Monitoring Module W: the window size I: the measurement interval Workload Monitoring Module tracks various aspects of resource usage.

Workload Monitoring Module Request Arrival Rate: Request number: N be, N rt Request size: S be, S rt Request Waiting Times: Queue length: q be, q rt Disk Utilizations: View as bandwidth Time spent in servicing requests: Disk Utilization:

Adaptive Bandwidth Manager Estimating Bandwidth Requirement based on Disk Utilizations Estimating Bandwidth Requirement based on Disk Utilizations Estimating Bandwidth Requirement based on the Arrival Rate Estimating Bandwidth Requirement based on the Arrival Rate Computing the Reservations of Each Class Computing the Reservations of Each Class

Adaptive Bandwidth Manager U rt U be high percentile of U rt Perc( U rt ) median of U be s Median( U be )

Adaptive Bandwidth Manager Estimating Bandwidth Requirement based on Disk Utilizations Estimating Bandwidth Requirement based on Disk Utilizations Best-effort class uses the median of the utilization distribution, denoted by Best-effort class uses the median of the utilization distribution, denoted by Real-time class uses a high percentile of the utilization distribution, denoted by Real-time class uses a high percentile of the utilization distribution, denoted by Perc α ( U rt ) Median α ( U be ) Perc( U rt ) Median( U be )

Adaptive Bandwidth Manager Estimating Bandwidth Requirement based on Arrival Rate Estimating Bandwidth Requirement based on Arrival Rate The total disk utilization is always 100% during a overload and no longer reflects the relative needs of each class. The total disk utilization is always 100% during a overload and no longer reflects the relative needs of each class. No allocation can actually satisfy the total bandwidth needs of two classes, the goal of the bandwidth manager should be to ensure stable overload behavior and ensure that the allocations reflect the relative needs of the two classes. No allocation can actually satisfy the total bandwidth needs of two classes, the goal of the bandwidth manager should be to ensure stable overload behavior and ensure that the allocations reflect the relative needs of the two classes. The bandwidth needs of the best-effort class are computed as The bandwidth needs of the best-effort class are computed as The bandwidth needs of the real-time class are computed as The bandwidth needs of the real-time class are computed as

Adaptive Bandwidth Manager Computing the Reservations of Each Class Computing the Reservations of Each Class Case 1: Case 1: neither class utilizes its entire allocation neither class utilizes its entire allocation when and when and the allocations of the two classes remains unchanged the allocations of the two classes remains unchanged Case 2: Case 2: the best-effort class utilizes its entire allocation the best-effort class utilizes its entire allocation when and when and increase the allocation of best-effort class increase the allocation of best-effort class and correspondingly decrease the allocation of real-time class Case 3: Case 3: the real-time class utilizes its entire allocation the real-time class utilizes its entire allocation when and when and increase the allocation of real-time class increase the allocation of real-time class and correspondingly decrease the allocation of best-effort class Case 4: Case 4: overload overload when and ; when or when and ; when or based on arrival rate, allocations are computed as and based on arrival rate, allocations are computed as and User specified bounds and User specified bounds and

Self-Managing Bandwidth Allocation in a Multi-disk Server Compute the allocations on individual disks Compute the allocations on individual disks Compute the average and maximum allocation of each class Compute the average and maximum allocation of each class the average allocation of the best-effort class across all disks the average allocation of the best-effort class across all disks the maximum allocation of the best-effort class across all disks the maximum allocation of the best-effort class across all disks the average allocation of the real-time class across all disks the average allocation of the real-time class across all disks the maximum allocation of the real-time class across all disks the maximum allocation of the real-time class across all disks Compute the allocation as a linear combination of the average and the maximum load Compute the allocation as a linear combination of the average and the maximum load The final allocation is normalized as follows: The final allocation is normalized as follows:

Experimental Result

Conclusion Automating bandwidth allocation techniques Automating bandwidth allocation techniques consist of two key components: consist of two key components: the workload monitoring module that efficiently monitors the load in each application class the workload monitoring module that efficiently monitors the load in each application class the adaptive bandwidth manager that uses these workload statistics to dynamically determine the allocation of each class. the adaptive bandwidth manager that uses these workload statistics to dynamically determine the allocation of each class. provide control over the time-scale of allocation via tunable parameters. provide control over the time-scale of allocation via tunable parameters. have stable behavior during transient overload. have stable behavior during transient overload. exploit the semantics of each class while determining their allocations. exploit the semantics of each class while determining their allocations. provide significant advantages over static bandwidth allocation. provide significant advantages over static bandwidth allocation.