Dynamic Placement of Virtual Machines for managing sla violations

Slides:



Advertisements
Similar presentations
VM Interference and Placement for Server Consolidation Umesh Bellur IIT Bombay.
Advertisements

Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms Chenyang Lu, John A. Stankovic, Gang Tao, Sang H. Son Presented by Josh Carl.
QoS-based Management of Multiple Shared Resources in Dynamic Real-Time Systems Klaus Ecker, Frank Drews School of EECS, Ohio University, Athens, OH {ecker,
Management and Control of Domestic Smart Grid Technology IEEE Transactions on Smart Grid, Sep Albert Molderink, Vincent Bakker Yong Zhou
Hadi Goudarzi and Massoud Pedram
Overcoming Limitations of Sampling for Agrregation Queries Surajit ChaudhuriMicrosoft Research Gautam DasMicrosoft Research Mayur DatarStanford University.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Power Aware Virtual Machine Placement Yefu Wang. 2 ECE Introduction Data centers are underutilized – Prepared for extreme workloads – Commonly.
Power Management in Cloud Computing using Green Algorithm -Kushal Mehta COP 6087 University of Central Florida.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Sandpiper : Black box and Gray-Box resource management for Virtual Machines Journal : Computer Networks: The International Journal of Computer and Telecommunications.
David Chu--UC Berkeley Amol Deshpande--University of Maryland Joseph M. Hellerstein--UC Berkeley Intel Research Berkeley Wei Hong--Arched Rock Corp. Approximate.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Virtualization and Cloud Computing Virtualization David Bednárek, Jakub Yaghob, Filip Zavoral.
System Analysis and Optimization 1 1 Efficient Resource Provisioning in Compute Clouds via VM Multiplexing Xiaoqiao Meng, Canturk Isci, Jeffrey Kephart,
Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.
SLA-aware Virtual Resource Management for Cloud Infrastructures
IPOEM: A GPS Tool for Integrated Management in Virtualized Data Centers Hui Zhang 1, Kenji Yoshihira 1, Ya-Yunn Su 2, Guofei Jiang 1, Ming Chen 3, Xiaorui.
An Optimization Problem in Adaptive Virtual Environments Ananth I. Sundararaj Manan Sanghi Jack R. Lange Peter A. Dinda Prescience Lab Department of Computer.
October 3, 2005CIS 7001 Compositional Real-Time Scheduling Framework Insik Shin.
Resource Management in Virtualization-based Data Centers Bhuvan Urgaonkar Computer Systems Laboratory Pennsylvania State University Bhuvan Urgaonkar Computer.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Black-box and Gray-box Strategies for Virtual Machine Migration Timothy Wood, Prashant.
Traffic modeling and Prediction ----Linear Models
Ekrem Kocaguneli 11/29/2010. Introduction CLISSPE and its background Application to be Modeled Steps of the Model Assessment of Performance Interpretation.
Department of Computer Science Engineering SRM University
Bargaining Towards Maximized Resource Utilization in Video Streaming Datacenters Yuan Feng 1, Baochun Li 1, and Bo Li 2 1 Department of Electrical and.
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Network Aware Resource Allocation in Distributed Clouds.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
© 2009 IBM Corporation 1 Improving Consolidation of Virtual Machines with Risk-aware Bandwidth Oversubscription in Compute Clouds Amir Epstein Joint work.
RECON: A TOOL TO RECOMMEND DYNAMIC SERVER CONSOLIDATION IN MULTI-CLUSTER DATACENTERS Anindya Neogi IEEE Network Operations and Management Symposium, 2008.
Autonomic SLA-driven Provisioning for Cloud Applications Nicolas Bonvin, Thanasis Papaioannou, Karl Aberer Presented by Ismail Alan.
Chapter 3 System Performance and Models. 2 Systems and Models The concept of modeling in the study of the dynamic behavior of simple system is be able.
A Brief Intro to Virtualiztion. What is Virtualization? An abstraction Usually performed via software Many different types –Hardware –Software –Data –Network.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Profiling and Modeling Resource Usage.
High Performance Computing on Virtualized Environments Ganesh Thiagarajan Fall 2014 Instructor: Yuzhe(Richard) Tang Syracuse University.
Dynamic Resource Monitoring and Allocation in a virtualized environment.
© 2010 AT&T Intellectual Property. All rights reserved. AT&T, the AT&T logo and all other AT&T marks contained herein are trademarks of AT&T Intellectual.
Power-Aware Scheduling of Virtual Machines in DVFS-enabled Clusters
Performance evaluation of component-based software systems Seminar of Component Engineering course Rofideh hadighi 7 Jan 2010.
Copyright © 2011, Performance Evaluation of a Green Scheduling Algorithm for Energy Savings in Cloud Computing Truong Vinh Truong Duy; Sato,
A dynamic optimization model for power and performance management of virtualized clusters Vinicius Petrucci, Orlando Loques Univ. Federal Fluminense Niteroi,
Managing Server Energy and Operational Costs Chen, Das, Qin, Sivasubramaniam, Wang, Gautam (Penn State) Sigmetrics 2005.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
Modeling Virtualized Environments in Simalytic ® Models by Computing Missing Service Demand Parameters CMG2009 Paper 9103, December 11, 2009 Dr. Tim R.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Copyright © 2010, Performance and Power Management for Cloud Infrastructures Hien Nguyen Van; Tran, F.D.; Menaud, J.-M. Cloud Computing (CLOUD),
Resource Allocation in Hospital Networks Based on Green Cognitive Radios 王冉茵
Efficient Resource Provisioning in Compute Clouds via VM Multiplexing
Cloud Computing Lecture 5-6 Muhammad Ahmad Jan.
Dynamic Placement of Virtual Machines for Managing SLA Violations NORMAN BOBROFF, ANDRZEJ KOCHUT, KIRK BEATY SOME SLIDE CONTENT ADAPTED FROM ALEXANDER.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
1 Automated Power Management Through Virtualization Anne Holler, VMware Anil Kapur, VMware.
OPERATING SYSTEMS CS 3502 Fall 2017
Reinforcement Learning Based Virtual Cluster Management
Hydra: Leveraging Functional Slicing for Efficient Distributed SDN Controllers Yiyang Chang, Ashkan Rezaei, Balajee Vamanan, Jahangir Hasan, Sanjay Rao.
Analyzing Security and Energy Tradeoffs in Autonomic Capacity Management Wei Wu.
Standards and Patterns for Dynamic Resource Management
A Brief Intro to Virtualiztion
A Brief Intro to Virtualiztion
Effective VM Sizing in Virtualized Data Centers
CARP: Compression-Aware Replacement Policies
Lecture 2 Part 3 CPU Scheduling
An Optimization Problem in Adaptive Virtual Environments
Towards Predictable Datacenter Networks
Presentation transcript:

Dynamic Placement of Virtual Machines for managing sla violations Norman Bobroff Andrzej Kochut Kurk Beaty IBM T.J. Watson Research Center IFIP/IEEE International Symposium on Integrated Network Management, 2007 Presented by: Yun Liaw

Outline Introduction Server Workload Signatures that Benefit from Dynamic Placement Forecasting Placement Algorithm Simulation Studies Related Works Conclusion and Comments 2019/10/30

Introduction Low average utilization of servers is a well-known cost concern in datacenter To guarantee good performance at periods of peak demand, processing capacity is over-provisioned for many business applications The demand’s strong daily variability leading to low average utilization Traditional deployment pattern: One application per OS, and one OS per one physical machine Consolidation at application and OS levels can mitigate inefficiencies in using physical resources Application level consolidation requires considerable skill to ensure isolation between co-hosted applications within an OS image OS-based consolidation: A hypervisor executes on a physical machine (PM) and presents an abstraction of the underlying hardware to multiple virtual machines (VM) 2019/10/30

Introduction Server consolidation can be static or dynamic Static consolidation: Use historical average resource utilization as input to an algorithm that maps VMs to PMs Recomputed for a long period of time Mapping algorithm is done off-line Dynamic consolidation: Implemented at shorter timescale Need the ability to do live migration of VMs Live Migration: To migrate a VM from one PM to another without service interruption 2019/10/30

Introduction In this paper, a management algorithm for dynamic resource allocation in virtualized server environments is proposed Three parts of the algorithm (MFR) Measuring historical data Forecasting the future demand Remapping VMs to PMs

Introduction This paper’s contribution Method for classification of workload signatures to servers which benefit most from dynamic migration Forecasting technique suited for handling time series of resource demands Algorithm that remaps VMs to PMs Goal: minimizing the number of PMs Constraints: To support the workload at a specific rate of SLA violations p 2019/10/30

The algorithm – Key insight Consider a single PM that hosts a single VM The PM’s CPU capacity can be dynamically adjusted at time intervals of length τ Goal: To adjust CPU demand to ensure the probability of VM’s CPU demand exceeding capacity of PM is no greater than p

The algorithm – Key insight Ui: The CPU demand Prob. Density function of VM’s historical CPU demand Time series of VM’s CPU demand

Server workload signatures The properties that make a VM workload suitable for dynamic management: The timescale over which the resource demand varies must exceed the rebalancing interval τ Large resource variability Resource demand must be predictable The resource demand has deterministic periodic component Strong autocorrelation – possible to obtain low-error predictor 2019/10/30

2 1 3 [A] 1: raw CPU demand time series 2: autocorrelation function 3: cumulative dist. of demand 4: periodogram 1 3 2 4 [A]: strong autocorrelation, large variability and now distinct periodic component [B]: low autocorrelation, low variability and no distinct periodic component [C]: strong autocorrelation, large variability, and distinct periodic component [C] [B] 1 3 2019/10/30

Analytical formula for gain from dynamic management Notations: Reallocation interval: τ Demand probability density function: u(x) p-percentile of distribution u: Lp Distribution of predicted time series: ūτ(x) p-percentile of distribution of predictor error: Ep(τ) Gain G(τ): 2019/10/30

Forecasting Goal: building a predictor that forecasts the probability distribution of a demand in a future observation interval based on the historical usage data Approach (based on the work [20]) Decomposing the demand time series Ui into a sum of periodic components Dj S.t. where n is an integer and pj is a period Residual component of the demand 2019/10/30 [20] G.Jenkins, et al., Time Series Analysis: Forecasting and Control, Prentice Hall, 1994

Forecasting Approach (cont’d) Decouple Dj from the time series Ui Use a low-pass filter to smooth the time series Subdivide the smoothed time series into contiguous intervals of length pj Average the intervals together to form Dj Subtract the Dj from the Ui Model ri using a class of autoregression process -- AR(2) εi : the error term represented by Gaussian variable α: parameters estimated from data

[A] “The key to achieving a gain from dynamic management is the width of the predictor ‘s error distribution being less than the width of the total demand distribution” [B] [C] 2019/10/30

Management algorithm Management Objective: The remapping algorithm To minimize: the time-averaged number of active PMs hosting VMs Constraint: the rate of demand overloading the resource capacity is bounded by a specified threshold p The remapping algorithm A version of bin-packing problem (NP-hard) based on the first-fit heuristic approximation 2019/10/30

Management algorithm Notations: For each VM, use the forecast approach described earlier to forecast the max usage Sort VMs in descending order 2019/10/30

Management algorithm Notations: cp(μ, σ2): the capacity needed to guarantee an error rate less than p Taken off the VMs from the ordered list and attempt to place it on the PMs For each target PM, compute the distribution of the sum of the resource demands of all VMs allocated to this PM 2019/10/30

Management algorithm Notations: Taken off the VMs from the ordered list and attempt to place it on the PMs If p-percentile of this distribution is no greater than the capacity of the PM, the VM is allocated to this PM If the list of PMs is exhausted without satisfying this condition, the VM is assigned to the PM that has the smallest difference between allocated demands and its capacity 2019/10/30

Simulation studies The methodology is validated using simulations driven by traces gathered from hundreds of production servers running multiple OSes, and broad variety of applications The simulation studies include: Verify that MFR meets SLA targets Quantify the reduction in number of SLA violations Quantify the number of PMs used to support a workload Explore the relationship between the remapping interval and the gain from dynamic management Perform measurements on the VMWare ESX testbed To determine properties of a practical VM migration 2019/10/30

Simulation studies Meeting SLA objects For a given p, a set of 10 simulations is executed 2019/10/30

Simulation studies Quantifying the reduction in rate of SLA violations Map a single set of 25 VMs onto a set of PMs 2019/10/30

Simulation studies Quantifying the reduction in number of used PMs Compare the time-averaged number of active PMs required by MFR and static allocation at a specific target rate p 2019/10/30

Simulation studies Quantifying the reduction in number of used PMs Compare the required PMs under different target rate p 2019/10/30

Simulation studies Relationship between the remapping interval and the gain The quality of predictor decreases with the length of the prediction horizon 2019/10/30

Related Works Virtualization Time series forecasting Storage-based virtualization OS-level virtualization Application-level virtualization Time series forecasting Resource Management bin-packing heuristics resource economy in the datacenter The alg. trying to minimize the number of migrations 2019/10/30

Conclusions A management algorithm for dynamic allocation of VMs to PMs is presented providing probabilistic SLA guarantees Time series analysis and bin-packing heuristic combined to minimize the number of PMs required to support a workload A method for characterizing the gain that a given VM can achieve from migration is presented 2019/10/30

Comments Only one measured attribute – CPU Some functions are mysterious… for me The perspective of recent paper in VM consolidation are similar 2019/10/30

Analytical formula for gain from dynamic management The G(τ) above can be simplified to We apply the approximation that Thus Note: The mean of the predictor is the same as the mean of underlying distribution since the predictor is unbiased 2019/10/30