U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Resource Allocation for Shared Data Centers Using Online Measurements.

Slides:



Advertisements
Similar presentations
Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
Advertisements

Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms Chenyang Lu, John A. Stankovic, Gang Tao, Sang H. Son Presented by Josh Carl.
Hadi Goudarzi and Massoud Pedram
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Solving POMDPs Using Quadratically Constrained Linear Programs Christopher Amato.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 14 – Introduction to Multimedia Resource Management Klara Nahrstedt Spring 2012.
A Cyber-Physical Systems Approach to Energy Management in Data Centers Presented by Chen He Adopted form the paper authors.
Aleksandar Kuzmanovic and Edward W. Knightly Rice Networks Group Measuring Service in Multi-Class Networks.
Proactive Prediction Models for Web Application Resource Provisioning in the Cloud _______________________________ Samuel A. Ajila & Bankole A. Akindele.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Provisioning for Multi-tier Internet Applications Bhuvan Urgaonkar, Prashant.
Providing Performance Guarantees for Cloud Applications Anshul Gandhi IBM T. J. Watson Research Center Stony Brook University 1 Parijat Dube, Alexei Karve,
SIGMETRICS 2008: Introduction to Control Theory. Abdelzaher, Diao, Hellerstein, Lu, and Zhu. CPU Utilization Control in Distributed Real-Time Systems Chenyang.
Adaptive Data Collection Strategies for Lifetime-Constrained Wireless Sensor Networks Xueyan Tang Jianliang Xu Sch. of Comput. Eng., Nanyang Technol. Univ.,
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Server Consolidation in Virtualized Data Centers Prashant Shenoy University of Massachusetts.
Computer Science Deadline Fair Scheduling: Bridging the Theory and Practice of Proportionate-Fair Scheduling in Multiprocessor Servers Abhishek Chandra.
Adaptive Sampling for Sensor Networks Ankur Jain ٭ and Edward Y. Chang University of California, Santa Barbara DMSN 2004.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Quantifying the Benefits of Resource Multiplexing in On-Demand Data Centers Abhishek.
Looking at the Server-side of P2P Systems Yi Qiao, Dong Lu, Fabian E. Bustamante and Peter A. Dinda Department of Computer Science Northwestern University.
U NIVERSITY OF M ASSACHUSETTS Department of Computer Science Automatic Heap Sizing Ting Yang, Matthew Hertz Emery Berger, Eliot Moss University of Massachusetts.
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Virtualization in Data Centers Prashant Shenoy
An Adaptive Multi-Objective Scheduling Selection Framework For Continuous Query Processing Timothy M. Sutherland Bradford Pielech Yali Zhu Luping Ding.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science From Cloud Computing to Sensor Networks: Distributed Computing Research at LASS.
Bandwidth Allocation in a Self-Managing Multimedia File Server Vijay Sundaram and Prashant Shenoy Department of Computer Science University of Massachusetts.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts, Amherst Operating Systems CMPSCI 377 Lecture.
Jianliang XU, Dik L. Lee, and Bo Li Dept. of Computer Science Hong Kong Univ. of Science & Technology April 2002 On Bandwidth Allocation for Data Dissemination.
A Prediction-based Real-time Scheduling Advisor Peter A. Dinda Prescience Lab Department of Computer Science Northwestern University
Computer Science Cataclysm: Policing Extreme Overloads in Internet Applications Bhuvan Urgaonkar and Prashant Shenoy University of Massachusetts.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Black-box and Gray-box Strategies for Virtual Machine Migration Timothy Wood, Prashant.
Computer Science 1 Resource Overbooking and Application Profiling in Shared Hosting Platforms Bhuvan Urgaonkar Prashant Shenoy Timothy Roscoe † UMASS Amherst.
Self-Adaptive QoS Guarantees and Optimization in Clouds Jim (Zhanwen) Li (Carleton University) Murray Woodside (Carleton University) John Chinneck (Carleton.
Efficient Scheduling of Heterogeneous Continuous Queries Mohamed A. Sharaf Panos K. Chrysanthis Alexandros Labrinidis Kirk Pruhs Advanced Data Management.
Column Generation Approach for Operating Rooms Planning Mehdi LAMIRI, Xiaolan XIE and ZHANG Shuguang Industrial Engineering and Computer Sciences Division.
U NIVERSITY OF M ASSACHUSETTS A MHERST Department of Computer Science 2011 Predicting Solar Generation from Weather Forecasts Using Machine Learning Navin.
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science An Analytical Model for Multi-tier Internet Services and its Applications Bhuvan.
Rensselaer Polytechnic Institute Rajagopal Iyengar Combinatorial Approaches to QoS Scheduling in Multichannel Wireless Systems Rajagopal Iyengar Rensselaer.
Budget-based Control for Interactive Services with Partial Execution 1 Yuxiong He, Zihao Ye, Qiang Fu, Sameh Elnikety Microsoft Research.
Chapter 3 System Performance and Models. 2 Systems and Models The concept of modeling in the study of the dynamic behavior of simple system is be able.
Adaptive Virtual Machine Provisioning in Elastic Multi-tier Cloud Platforms Fan Zhang, Junwei Cao, Hong Cai James J. Mulcahy, Cheng Wu Tsinghua University,
U NIVERSITY OF M ASSACHUSETTS A MHERST Department of Computer Science 2011 Blink: Managing Server Clusters on Intermittent Power Navin Sharma, Sean Barker,
Euro-Par, A Resource Allocation Approach for Supporting Time-Critical Applications in Grid Environments Qian Zhu and Gagan Agrawal Department of.
Strong Cache Consistency Support for Domain Name System Xin Chen, Haining Wang, Sansi Ren and Xiaodong Zhang College of William and Mary, Williamsburg,
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science 1 Automatic Heap Sizing: Taking Real Memory into Account Ting Yang, Emery Berger,
Computer Science 1 Resource Overbooking and Application Profiling in Shared Hosting Platforms Bhuvan Urgaonkar Prashant Shenoy Timothy Roscoe † UMASS Amherst.
An Energy-efficient Task Scheduler for Multi-core Platforms with per-core DVFS Based on Task Characteristics Ching-Chi Lin Institute of Information Science,
Authors: Mianyu Wang, Nagarajan Kandasamy, Allon Guez, and Moshe Kam Proceedings of the 3 rd International Conference on Autonomic Computing, ICAC 2006,
NUS.SOC.CS5248 A Time Series-based Approach for Power Management in Mobile Processors and Disks X. Liu, P. Shenoy and W. Gong Presented by Dai Lu.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Resource Management in Internet Data Centers Bhuvan Urgaonkar Laboratory.
“A cost-based admission control algorithm for digital library multimedia systems storing heterogeneous objects” – I.R. Chen & N. Verma – The Computer Journal.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
1 Exploiting Nonstationarity for Performance Prediction Christopher Stewart (University of Rochester) Terence Kelly and Alex Zhang (HP Labs)
Scheduling MPI Workflow Applications on Computing Grids Juemin Zhang, Waleed Meleis, and David Kaeli Electrical and Computer Engineering Department, Northeastern.
Author Utility-Based Scheduling for Bulk Data Transfers between Distributed Computing Facilities Xin Wang, Wei Tang, Raj Kettimuthu,
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Resource Management in Internet Hosting Platforms Ph.D. Thesis Defense.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science John Cavazos J Eliot B Moss Architecture and Language Implementation Lab University.
Project Proposal “Profile-driven QoS-aware Online Services” by Amitayu Das Sri Hari Krishna Narayanan CSE 598B Fall 2005.
Zeta: Scheduling Interactive Services with Partial Execution Yuxiong He, Sameh Elnikety, James Larus, Chenyu Yan Microsoft Research and Microsoft Bing.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
OPERATING SYSTEMS CS 3502 Fall 2017
Exploiting Sharing for Data Center Consolidation
The Impact of Replacement Granularity on Video Caching
Regulating Data Flow in J2EE Application Server
Measuring Service in Multi-Class Networks
Vlad Nae, Radu Prodan, Thomas Fahringer Institute of Computer Science
Dynamic Provisioning for Multi-tier Internet Applications
Realizing Closed-loop, Online Tuning and Control for Configurable-Cache Embedded Systems: Progress and Challenges Islam S. Badreldin*, Ann Gordon-Ross*,
Presentation transcript:

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Resource Allocation for Shared Data Centers Using Online Measurements Abhishek Chandra Weibo Gong Prashant Shenoy UMASS Amherst

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 2 Motivation Data Centers Server farms Rent computing and storage resources to applications Revenue for meeting QoS guarantees Goals: Satisfy application QoS guarantees Maximize resource utilization of platform Robustness against “Slashdot” effects

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 3 Dynamic Resource Allocation Periodically re-allocate resources among applications Estimate resource requirements for near future Challenges: Reallocation at short time-scales No prior workload profiling/knowledge Low overhead Approach: Online Measurement-based Allocation

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 4 Talk Outline Motivation System Model Dynamic Allocation Techniques Experimental Results Conclusions

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 5 Resource Model Queuing System Generalized Processor Sharing (GPS) scheduler Request classes Different arrival processes, service time distributions QoS Goal: Mean Response Time GPS Resource

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 6 Dynamic Resource Allocation MONITOR System Metrics Resource Shares APPLICATION MODELS Expected Load PREDICTOR Measured Usage ALLOCATOR Rsrc Reqmts RESOURCE

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 7 Dynamic Resource Allocation ALLOCATOR PREDICTOR MONITOR System Metrics APPLICATION MODELS Expected Load Measured Usage RESOURCE

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 8 Monitoring Measure system and application metrics Queue lengths Request response times Monitoring windows Adaptation Window History Time Measurement Interval

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 9 History Adaptation Window Prediction Short-term prediction of workload characteristics Request arrival rate Average service time Use history of measured system metrics Mean Last value AR(1)

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 10 Prediction Accuracy Prediction Error Workload Prediction Time (min)

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 11 Dynamic Resource Allocation PREDICTOR MONITOR APPLICATION MODELS Expected Load Rsrc Reqmts ALLOCATOR Resource Shares RESOURCE

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 12 Measurement-based Model Goal: Relate QoS metric to resource requirement Idea: Model parameterized by online measurements Advantages: Parameters do not need to be computed Allow adaptation to dynamic workload Proposed: Transient Queuing System Description

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 13 Transient Queuing Model Transient queuing behavior over adaptation window Relation between mean response time T ¯ and application share w Little’s Law: Relation is parameterized by the measured workload Arrival rate λ and mean service time s ¯

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 14 Resource Allocation: Utility Model Discontent function: Measures the QoS violations of an application Constrained Optimization problem u1u1 u2u2 Optimization

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 15 Constrained Optimization Formulation Non-linear Optimization Problem: Response Time Discontent D i Goal subject to Solved using Lagrange multiplier method

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 16 Talk Outline Motivation System Model Dynamic Allocation Techniques Experimental Results Conclusions

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 17 Experimental Setup Simulation experiments Soccer World Cup’98 Traces Results based on a 24-hour portion of the trace 755,000 requests Mean req rate: 8.7 req/sec Mean req size: 8.47 KB

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 18 Share AllocationWorkloads Adaptation to Transient Overloads Shares adapt to changing workload characteristics

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 19 Adaptation: System Discontent GPS without reallocationGPS with reallocation System Discontent is lowered substantially

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 20 Conclusions Dynamic Resource Allocation needed for data centers Measurement-based allocation: Monitoring and Prediction gather online state Use this state for application modeling and allocation Future Work: Prediction policies Utility functions

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science 21 Related Work Prediction Statistical Prediction Models [Zhang00] Application Models Queuing-Theoretic Models [Carlstrom02,Liu01] Control-Theoretic Models [Abdelzaher02,Lu01] Data Centers MUSE [Chase01] COD [Moore02] Oceano [Appleby01]