Effective VM Sizing in Virtualized Data Centers

Slides:



Advertisements
Similar presentations
Capacity Planning in a Virtual Environment
Advertisements

Hadi Goudarzi and Massoud Pedram
Class-constrained Packing Problems with Application to Storage Management in Multimedia Systems Tami Tamir Department of Computer Science The Technion.
SLA-Oriented Resource Provisioning for Cloud Computing
Walter Binder University of Lugano, Switzerland Niranjan Suri IHMC, Florida, USA Green Computing: Energy Consumption Optimized Service Hosting.
Power Aware Virtual Machine Placement Yefu Wang. 2 ECE Introduction Data centers are underutilized – Prepared for extreme workloads – Commonly.
1 Advancing Supercomputer Performance Through Interconnection Topology Synthesis Yi Zhu, Michael Taylor, Scott B. Baden and Chung-Kuan Cheng Department.
Energy-efficient Virtual Machine Provision Algorithms for Cloud System Ching-Chi Lin Institute of Information Science, Academia Sinica Department of Computer.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Intelligent Workload Factoring for A Hybrid Cloud Computing Model Hui Zhang Guofei Jiang Haifeng Chen Kenji Yoshihira Akhilesh Saxena NEC Laboratories.
By Sean Danko.  What is Virtualization  How does Virtualization Work  History of Virtualization  Why Should I Virtualize  Infrastructure  Advantages.
SLA-aware Virtual Resource Management for Cloud Infrastructures
IPOEM: A GPS Tool for Integrated Management in Virtualized Data Centers Hui Zhang 1, Kenji Yoshihira 1, Ya-Yunn Su 2, Guofei Jiang 1, Ming Chen 3, Xiaorui.
Green IT and Data Centers Darshan R. Kapadia Gregor von Laszewski 1.
Dynamic and Decentralized Approaches for Optimal Allocation of Multiple Resources in Virtualized Data Centers Wei Chen, Samuel Hargrove, Heh Miao, Liang.
Department of Computer Science Engineering SRM University
How to Resolve Bottlenecks and Optimize your Virtual Environment Chris Chesley, Sr. Systems Engineer
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
1 An SLA-Oriented Capacity Planning Tool for Streaming Media Services Lucy Cherkasova, Wenting Tang, and Sharad Singhal HPLabs,USA.
Network Aware Resource Allocation in Distributed Clouds.
Cloud Computing Energy efficient cloud computing Keke Chen.
Energy-Efficient Soft Real-Time CPU Scheduling for Mobile Multimedia Systems Wanghong Yuan, Klara Nahrstedt Department of Computer Science University of.
Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach Wenjie Zhang, Xuemin Lin The University of New South Wales & NICTA Ming Hua,
© 2009 IBM Corporation 1 Improving Consolidation of Virtual Machines with Risk-aware Bandwidth Oversubscription in Compute Clouds Amir Epstein Joint work.
RECON: A TOOL TO RECOMMEND DYNAMIC SERVER CONSOLIDATION IN MULTI-CLUSTER DATACENTERS Anindya Neogi IEEE Network Operations and Management Symposium, 2008.
Challenges towards Elastic Power Management in Internet Data Center.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Power-Aware Scheduling of Virtual Machines in DVFS-enabled Clusters
Joint Power Optimization Through VM Placement and Flow Scheduling in Data Centers DAWEI LI, JIE WU (TEMPLE UNIVERISTY) ZHIYONG LIU, AND FA ZHANG (CHINESE.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
Dynamic Pricing of Internet Bandwidth via Chance Constrained Programming Xin Guo IBM Watson Research Center John Tomlin IBM Almaden Research.
The final exam solutions. Part I, #1, Central limit theorem Let X1,X2, …, Xn be a sequence of i.i.d. random variables each having mean μ and variance.
Project Name Program Name Project Scope Title Project Code and Name Insert Project Branding Image Here.
Copyright © 2010, Performance and Power Management for Cloud Infrastructures Hien Nguyen Van; Tran, F.D.; Menaud, J.-M. Cloud Computing (CLOUD),
Accounting for Load Variation in Energy-Efficient Data Centers
The bin packing problem. For n objects with sizes s 1, …, s n where 0 < s i ≤1, find the smallest number of bins with capacity one, such that n objects.
Efficient Resource Provisioning in Compute Clouds via VM Multiplexing
Dynamic Placement of Virtual Machines for Managing SLA Violations NORMAN BOBROFF, ANDRZEJ KOCHUT, KIRK BEATY SOME SLIDE CONTENT ADAPTED FROM ALEXANDER.
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
1 Automated Power Management Through Virtualization Anne Holler, VMware Anil Kapur, VMware.
A Hierarchical Edge Cloud Architecture for Mobile Computing IEEE INFOCOM 2016 Liang Tong, Yong Li and Wei Gao University of Tennessee – Knoxville 1.
SketchVisor: Robust Network Measurement for Software Packet Processing
OPERATING SYSTEMS CS 3502 Fall 2017
Memory Management.
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Measurement-based Design
Green cloud computing 2 Cs 595 Lecture 15.
Load Balancing and Data centers
Analyzing Security and Energy Tradeoffs in Autonomic Capacity Management Wei Wu.
Optimal CyberSecurity Analyst Staffing Plan
Sample Mean Distributions
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Descriptive Statistics: Numerical Methods
Zhen Xiao, Qi Chen, and Haipeng Luo May 2013
ElasticTree: Saving Energy in Data Center Networks
Discrete Event Simulation - 4
Lecture 2 – Monte Carlo method in finance
Qingbo Zhu, Asim Shankar and Yuanyuan Zhou
Partition Starter Find out what disk partitioning is, state key features, find a diagram and give an example.
Virtualization.
Sampling Distributions
Lecture 2 Part 3 CPU Scheduling
The Greening of IT November 1, 2007.
Simple Linear Regression
Lecture 18 Syed Mansoor Sarwar
Chapter-5 Traffic Engineering.
Virtualization Dr. S. R. Ahmed.
Dynamic Placement of Virtual Machines for managing sla violations
Towards Predictable Datacenter Networks
Presentation transcript:

Effective VM Sizing in Virtualized Data Centers Ming Chen1, Hui Zhang2, Ya-Yunn Su3, Xiaorui Wang1, Guofei Jiang2, Kenji Yoshihira2 1. University of Tennessee 2. NEC Laboratories America 3. National Taiwan University

Virtualized data centers: server consolidation and green IT Server consolidation - virtualization facilitates consolidation of several physical servers onto a single high end system — Reduces management costs/overheads — Increases overall utilization Resource Pool Green IT - computing more, consume less — Improving infrastructure efficiency —Increasing IT productivity Organizations are constantly faced with the challenge of accommodating increasing amounts of data, increasing numbers of devices and users, and increasingly powerful servers to support critical applications—all while controlling data center costs for power, cooling, and other operations. improving infrastructure efficiency - strategies that can have a significant effect on overall data center performance per watt, such as using energy-efficient equipment, optimizing data center temperature (77 degree), and utilizing best-practices data center design. increasing IT productivity - increasing the actual amount of computing work completed in the data center relative to the amount of power used. Although these two options are key tactics in controlling energy use, they generally represent only an incremental savings. Substantially reducing energy use typically requires adopting a more strategic approach to managing the IT infrastructure than these approaches can provide. Today Future Data center useful work IT load power DCiE = DCpW = Total data center Input power Total facility power 11/27/2018 DCiE: Data center infrastructure efficiency DCpW: Data center performance per Watt 2 2

In virtualized data centers… Server utilization based performance and power management mechanisms VMware DPM, NEC SSC, IBM Tivoli… Overload threshold CPUhigh CPU utilization CPUlow Power-saving mode 11/27/2018

VM sizing – a resource management primitive in virtualized data centers How much resource allocated to this VM? CPU utilization Sizing over the maximal load? Low resource utilization!!! Sizing over the average load? High performance violations!!! time Cumulative Distribution Function of Server Normalized -percentile Loads (5,415 servers of 10 IT systems) Maximal load is much larger than the average load 90% of the servers have the maximal load at least 2.2 times larger than their average load; 50% of the servers have the maximal load at least 7.2 times larger than their average load. 11/27/2018

Effective VM sizing Effective size, a new VM sizing concept under probabilistic SLAs A probabilistic SLA example [Bobroff2007] Prob[server x’s CPU utilization at any time > 90%] < 5% A VM’s effective size is decided by four factors its own workload performance constraint defined as probabilistic SLAs the resource capacity of the server the VMs co-hosted in the server Check the NOMS06 paper for a reference 11/27/2018

Stochastic bin packing problem Given a set of items, whose size is described by independent random variables S = {X1,X2, … ,Xn}, and an overflow probability p, Partition the set S into the smallest number of set (bins) S1 ,… , Sk such that for all 1 ≤ j ≤ k. VMs workload machines SLA Effective sizing is the basis of a family of O(1)-approximation algorithms for the stochastic bin packing problem. 11/27/2018

Effective Sizing – intrinsic demand Let a random variable Xi represent a VM i's resource demand, and Cj is the resource capacity of server j. The intrinsic demand of VM i on server j is defined as and Nij is the maximal value of N satisfying the following constraint Intuitively, Nij is the maximal number of VMs that can be packed into server j without breaking the probabilistic SLA when all the VMs have the same workload pattern as VM i. where Uk are independent and identically distributed (i.i.d.) random variables with the same distribution as Xi. 11/27/2018

Intrinsic demand – one example Statistical multiplexing rocks! Overflowing prob Effective sizing example: i.i.d random variables with normal distribution (server overload probability = 2.5%) 11/27/2018

Intrinsic demand – analysis Theorem 1. For items with independent Poisson distributions, the First Fit Decreasing (FFD) deterministic bin packing algorithm with effective sizing (intrinsic demand) finds a solution to the stochastic bin packing problem with at most (1.22B*+1) bins of size 1, where B* is the minimum possible number of bins. Theorem 2. For items with independent normal distributions, the First Fit Decreasing deterministic bin packing algorithm with effective sizing (intrinsic demand) finds a solution to the stochastic bin packing problem with at most (1.22B*+1) bins of size 1+rc, where B* is the minimum possible number of bins, and rc ≤ 0.496. 11/27/2018

Intrinsic demand may not be enough Workload independence assumption might not hold in practice 11/27/2018

Effective Sizing – correlation-aware demand Let a random variable Xi represent a VM i's resource demand, and another random variable Xj represent a server j's existing aggregate resource demand from the VMS already allocated to it. The correlation-aware demand of VM i on server j is defined as where σ2i and σ2j are the variances of the random variables Xi and Xj; ρ is the correlation coefficient between Xi and Xj; Zα denotes the α-percentile for the unit normal distribution (α= 1-p). For example, if we want the overflow probability p = 0.25%, then α= 99.75%, and Zα = 3. Intuitively, Nij is the maximal number of VMs that can be packed into server j without breaking the probabilistic SLA when all the VMs have the same workload pattern as VM i. 11/27/2018

Applying effective sizing in production systems Practical issues in many dimensions Product implementation VM migration cost History and correlation aware (HCA) VM placement algorithm in the paper. Workload distribution modeling Workload stationarity Application-layer SLAs Please see discussions in the paper. 11/27/2018

Data center workload traces Traces on 2525 servers from 10 IT systems Each is regarded as a VM in the simulations. Monitoring data: CPU utilization. 1 week length, 15 minute monitoring frequency 672 time points 11/27/2018 13

Simulation methodology All physical servers have homogenous hardware specs. CPU resource: 3GHZ X 4 (Quadra-core) (the most common CPU model in the traces) Memory constraint: the maximal number of VMs allowable if the server is memory bounded (4, 8, 16, …) At the beginning of each time window, provoke the server consolidation scheme Using the monitoring information in the previous window to make decision During each time window, measure the placement scheme by The number of active servers Server overflowing probability p=5% in the evaluation. Five server consolidation schemes B1: FFD + average load B2: FFD + maximal load B3: FFD + VMware DPM VM sizing (μ+2σ, μ - mean, σ – standard deviation) B4: FFD + 95-percentile load ES-CA: FFD + effective sizing 11/27/2018

Simulation results Effective sizing 11/27/2018 46% less servers than max-load sizing 23% less servers than VMware DPM 10% less servers than 95-percentile 11/27/2018

Simulation results Effective sizing 11/27/2018 34% less servers than max-load sizing 16% less servers than VMware DPM 11% less servers than 95-percentile ES-CA 11/27/2018

Conclusions & Future Work Effective sizing, a new VM sizing method in server consolidation. O(1)-approxmiation algorithms for stochastic bin packing problem. Migration-cost and workload-correlation aware VM placement algorithms. Future work Server consolidation in multiple dimensions. CPU, memory, disk, network. 11/27/2018