RECON: A TOOL TO RECOMMEND DYNAMIC SERVER CONSOLIDATION IN MULTI-CLUSTER DATACENTERS Anindya Neogi IEEE Network Operations and Management Symposium, 2008.

Slides:



Advertisements
Similar presentations
QoS-based Management of Multiple Shared Resources in Dynamic Real-Time Systems Klaus Ecker, Frank Drews School of EECS, Ohio University, Athens, OH {ecker,
Advertisements

Capacity Planning in a Virtual Environment
Active Appearance Models
Hadi Goudarzi and Massoud Pedram
Class-constrained Packing Problems with Application to Storage Management in Multimedia Systems Tami Tamir Department of Computer Science The Technion.
VCRIB: Virtual Cloud Rule Information Base Masoud Moshref, Minlan Yu, Abhishek Sharma, Ramesh Govindan HotCloud 2012.
SLA-Oriented Resource Provisioning for Cloud Computing
Fast Algorithms For Hierarchical Range Histogram Constructions
Power Aware Virtual Machine Placement Yefu Wang. 2 ECE Introduction Data centers are underutilized – Prepared for extreme workloads – Commonly.
The Who, What, Why and How of High Performance Computing Applications in the Cloud Soheila Abrishami 1.
SPECIFYING AND MONITORING GUARANTEES IN COMMERCIAL GRIDS THROUGH SLA Sven Graupner Vijay MachirajuAad van Moorsel IEEE/ACM International Symposium on Clustering.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Decision Making: An Introduction 1. 2 Decision Making Decision Making is a process of choosing among two or more alternative courses of action for the.
Memory Buddies: Exploiting Page Sharing for Smart Colocation in Virtualized Data Centers Timothy Wood, Gabriel Tarasuk-Levin, Prashant Shenoy, Peter Desnoyers*,
System Analysis and Optimization 1 1 Efficient Resource Provisioning in Compute Clouds via VM Multiplexing Xiaoqiao Meng, Canturk Isci, Jeffrey Kephart,
Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.
SLA-aware Virtual Resource Management for Cloud Infrastructures
1 Virtual Machine Resource Monitoring and Networking of Virtual Machines Ananth I. Sundararaj Department of Computer Science Northwestern University July.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
IPOEM: A GPS Tool for Integrated Management in Virtualized Data Centers Hui Zhang 1, Kenji Yoshihira 1, Ya-Yunn Su 2, Guofei Jiang 1, Ming Chen 3, Xiaorui.
By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and
T HE M ETHOD AND T OOL OF C OST A NALYSIS FOR C LOUD C OMPUTING X INHUI L I Y ING L I T IANCHENG L IU J IE Q IU F ENGCHUN W ANG Presented By Sneha Arvind.
Measuring zSeries System Performance Dr. Chu J. Jong School of Information Technology Illinois State University 06/11/2012 Sponsored in part by Deer &
Generating Adaptation Policies for Multi-Tier Applications in Consolidated Server Environments College of Computing Georgia Institute of Technology Gueyoung.
Energy Efficiency in Cloud Data Centers: Energy Efficient VM Placement for Cloud Data Centers Doctoral Student : Chaima Ghribi Advisor : Djamal Zeghlache.
Department of Computer Science Engineering SRM University
How to Resolve Bottlenecks and Optimize your Virtual Environment Chris Chesley, Sr. Systems Engineer
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
An Efficient Clustering Algorithm For Low Power Clock Tree Synthesis Rupesh S. Shelar Enterprise Microprocessor Group Intel Corporation, Hillsboro, OR.
Chapter 3 System Performance and Models. 2 Systems and Models The concept of modeling in the study of the dynamic behavior of simple system is be able.
Chapter 4 Storage Management (Memory Management).
A performance evaluation approach openModeller: A Framework for species distribution Modelling.
Dynamic Resource Monitoring and Allocation in a virtualized environment.
Euro-Par, A Resource Allocation Approach for Supporting Time-Critical Applications in Grid Environments Qian Zhu and Gagan Agrawal Department of.
Data Tagging Architecture for System Monitoring in Dynamic Environments Bharat Krishnamurthy, Anindya Neogi, Bikram Sengupta, Raghavendra Singh (IBM Research.
Energy Aware Consolidation for Cloud Computing Srikanaiah, Kansal, Zhao Usenix HotPower 2008.
Major objective of this course is: Design and analysis of modern algorithms Different variants Accuracy Efficiency Comparing efficiencies Motivation thinking.
Performance evaluation of component-based software systems Seminar of Component Engineering course Rofideh hadighi 7 Jan 2010.
Copyright © 2011, Performance Evaluation of a Green Scheduling Algorithm for Energy Savings in Cloud Computing Truong Vinh Truong Duy; Sato,
Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.
A dynamic optimization model for power and performance management of virtualized clusters Vinicius Petrucci, Orlando Loques Univ. Federal Fluminense Niteroi,
Embedded System Lab. 정범종 A_DRM: Architecture-aware Distributed Resource Management of Virtualized Clusters H. Wang et al. VEE, 2015.
1 Iterative Integer Programming Formulation for Robust Resource Allocation in Dynamic Real-Time Systems Sethavidh Gertphol and Viktor K. Prasanna University.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
VMware vSphere Configuration and Management v6
Modeling Virtualized Environments in Simalytic ® Models by Computing Missing Service Demand Parameters CMG2009 Paper 9103, December 11, 2009 Dr. Tim R.
Chapter 8 Physical Database Design. Outline Overview of Physical Database Design Inputs of Physical Database Design File Structures Query Optimization.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Copyright © 2010, Performance and Power Management for Cloud Infrastructures Hien Nguyen Van; Tran, F.D.; Menaud, J.-M. Cloud Computing (CLOUD),
Efficient Resource Provisioning in Compute Clouds via VM Multiplexing
Dynamic Placement of Virtual Machines for Managing SLA Violations NORMAN BOBROFF, ANDRZEJ KOCHUT, KIRK BEATY SOME SLIDE CONTENT ADAPTED FROM ALEXANDER.
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
- 세부 1 - 이종 클라우드 플랫폼 데이터 관리 브로커 연구 및 개발 Network and Computing Lab.
Introduction to Performance Tuning Chia-heng Tu PAS Lab Summer Workshop 2009 June 30,
The PLA Model: On the Combination of Product-Line Analyses 강태준.
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
1 Automated Power Management Through Virtualization Anne Holler, VMware Anil Kapur, VMware.
INTRODUCTION TO WIRELESS SENSOR NETWORKS
OPERATING SYSTEMS CS 3502 Fall 2017
Memory Management.
Reinforcement Learning Based Virtual Cluster Management
Adaptive Cloud Computing Based Services for Mobile Users
Feifei Li, Ching Chang, George Kollios, Azer Bestavros
Presented By: Darlene Banta
Realizing Closed-loop, Online Tuning and Control for Configurable-Cache Embedded Systems: Progress and Challenges Islam S. Badreldin*, Ann Gordon-Ross*,
Dynamic Placement of Virtual Machines for managing sla violations
Presentation transcript:

RECON: A TOOL TO RECOMMEND DYNAMIC SERVER CONSOLIDATION IN MULTI-CLUSTER DATACENTERS Anindya Neogi IEEE Network Operations and Management Symposium, 2008 Sameep Mehta Presented by: Yun Liaw IBM India Research Lab

Outline  Introduction  ReCon Overview  Mapping Logic  Experimental Validation  Related Works  Discussions  Conclusion and Comments 2015/10/11 2

Introduction  Server virtualization has regained popularity for various reasons  Virtual machines (VMs) support more flexible and finer grain resource allocation  Physical server’s cost of management and total cost of ownership (TCO) has gone up drastically  Virtualization enables consolidation of a number of smaller machines as VMs on a large server  Leads to more efficient utilization of hardware resources  Saving floor spaces, saving management cost 2015/10/11 3

Introduction (cont’d)  ReCon: a tool that uses historical resource usage monitoring data to recommend a dynamic or static consolidation plan on servers 2015/10/11 4 White: high utilizationBlack: low utilization

ReCon Overview  Trace data: a set of measurements taken from the system, typically in a timeseries format  E.g., CPU, memory, etc.  Cost  Static cost: the base cost of running a physical server with associated workload  Dynamic cost: the cost that varies with the utilization  VM migration cost 2015/10/11 5

ReCon Overview (cont’d)  Constraints: to restrict the space of possible mappings between VMs and physical servers  System constraints  Application level constraints  Legal constraints  “What-if” input configuration: For users be able to tweak the input parameters and review the impact of consolidation  Time window size of dynamic consolidation  The period that a server should have no workload to consider turning it off 2015/10/11 6

ReCon Overview (cont’d)  Optimal Mapping Algorithm: To take all parameters, costs, constraints, configurations and process the trace data to generate static or dynamic server consolidation  Consolidation window: the non-overlapping time window to divide the historical data for dynamic consolidation For each time window, a optimal mapping from VM to physical servers are created In static consolidation, the time window is assumed to be the entire trace 2015/10/11 7

Mapping Logic – Basic Notations  Let VM = {VM 1, VM 2,…, VM N }  Each VM i observes and stores K variables O = {O (1,i), O (2,i),…, O (K,i) }  Each VM i is monitored for T time steps, the time series generated by j th sensor of VM i is 2015/10/11 8 Informal Problem Statement Given N application VMs, find n physical machines where n < N such that each VM is assigned to one physical machine while satisfying domain specified constraints Informal Problem Statement Given N application VMs, find n physical machines where n < N such that each VM is assigned to one physical machine while satisfying domain specified constraints

Mapping Logic - Constraints  Virtual machine constraints: Each VM i is associated with a list of M i constraints  Physical server constraints: Each physical server P i is associated with a list of L i constraints  The j th constraint of VM i which should hold in the interval [t 1, t 2 ]  The constraint is said to be satisfied if 9 Where P is the properties of the environment/architecture in time [t 1, t 2 ]

Mapping Logic – Optimization Problem Formulation  Assume that in the initial, each VM (application) is hosted by one physical machine, and each physical machine hosts exactly one VM  |VM| = |P| = N  n is not known a priori, and N is the upper bound of n  A: a N×N matrix, such that A i,j =1 specifies that VM i is assigned to P j  A will be a diagonal matrix in the initial  Y: a |P| bit long vector, such that Y i =1 implies that P i is currently running some VMs  Y will be a vector with all 1 in the initial 2015/10/11 10

Mapping Logic – Optimization Problem Formulation  Cost i : the fixed cost incurred if P i is active  MCost i,j : the cost for migrating VM i to P j  F: a function that calculates the dynamic cost if one or more VMs are assigned to it  Currently this function uses the CPU utilization for computing the dynamic cost  The benefit function attained by the consolidation is as the following function 11 The cost of initial settingFixed cost of running physical servers Cost of VMs migrating to P j

Mapping Logic – Optimization Problem Formulation  the first term of B is fixed and does not change while maximizing the function, therefore the objective function can be transferred to minimize 2015/10/11 12

Mapping Logic – Dynamic/Static Consolidation  Dynamic Consolidation  Assume the consolidation window size is T s  Firstly minimize the optimization function in time interval [1, T s ], and generate the assignment matrix A [1,T s ]  While consolidating for time interval [T s +1, 2*T s ], using the new set of constraints and A [1,T s ] as the starting point for optimization  Static Consolidation  Set all migration costs to zero  Set the consolidation window to cover the whole time period 2015/10/11 13

Experimental Validation – Data Set  The trace data was collected using the Model Driven Monitoring System (MDMS) [8]  186 physical servers  35 clusters with each cluster supporting one application  Approximately 15 parameters are monitored for every server But in this paper, authors use CPU utilization data only  Parameter are sampled at 5 minutes interval  The optimization problem solver: AMPL and CPLEX 14 [8] B. Krishnamurthy, et al., “Data tagging architecture for system monitoring in dynamic environments,” in IEEE NOMS, 2008

Experimental Validation – Evaluation Metrics  Time efficiency  To measure how fast it works given the size of data  The efficiency ε is defined as T R /T S T S : the consolidation window size T R : the time taken by ReCon to generate consolidation plan ε ~ 0 for a highly efficient tool ε ≧ 1 renders the tool useless for all practical purpose  Effectiveness  The percentage of physical machines that can be turned off by packing N VMs onto n i physical machines while satisfying all constraints in the corresponding consolidation window I  The effectiveness S is given by (N - n i )/N S ~ 1 implies most of the physical machines can be turned off 2015/10/11 15

Experimental Validation – Change in recommendations VS migration cost  Recommendation: The merging of two VMs onto the same physical server  Migration cost:  Inter cluster migration cost is normalized to be 100  Intra-cluster migration cost is varied as percentage of inter migration cost 2015/10/11 16

Experimental Validation – Efficiency Results 2015/10/11 17 VMs 1~175 Consolidation Window 10 ~240 (min) Time Taken on

Experimental Validation – Change in recommendations over time period  To study how the recommendations vary with change in the consolidation window size  Results:  As the time window is increased, the number of recommendations decreases More samples makes it difficult to satisfy the constraints  Time Window Size The time window size should not be too big in order to capture the dynamic behavior The time window size should not be too small so that the optimization engine is not used repeatedly without any gain Recommend value: T=300 minutes 2015/10/11 18

2015/10/11 19 X axis: time window Y axis: cost saving

Experimental Validation – Change in Cluster Over Time  To study the effect of recommendations on individual clusters  Based on the mean and standard deviation of saving, the clusters can be categorized into four groups  Low Variation – Low Saving  Low Variation – High Saving  High Variation – Low Saving  High Variation – High Saving 2015/10/11 20

21 Low Variation Low Saving Low Variation High Saving High Variation Low Saving High Variation High Saving X axis: time window Y axis: cost saving

Related Works  There is significant work in capacity planning and runtime resource management domain without bringing in the aspect of virtualization  VMware’s Distributed Resource Scheduler (DRS)  Bobroff et al. describe algorithms for reconsolidation in a dynamic setting while managing SLA violations  In static consolidation several bin-packing heuristics have been used to map VMs to physical servers 2015/10/11 22

Discussion and Future Work  Handling multiple attributes  The implementation cannot exploit the relationships or correlation among attributes E.g., the time lag relation between high CPU utilization and high I/O utilization  Runtime reconfiguration tool  In order to convert the planning tool into a real time decision module, highly efficient implementation and forecasting logic is needed Machine learning and time series forecasting techniques are the candidates for the author’s next step  Extending what-if analysis 2015/10/11 23

Conclusion  A VM consolidation planning tool called ReCon is provided  To analyze the historical resource consumption data  The consolidation problem is formulated in an optimization framework  Time varying constraints are easily incorporate to temporal change in workload characteristics  Different migration cost function, virtualization models can be plugged into the tool 2015/10/11 24

Comment  The problem is well-formulated  But the mentioned cost functions are mysterious 2015/10/11 25