U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Black-box and Gray-box Strategies for Virtual Machine Migration Timothy Wood, Prashant.

Slides:



Advertisements
Similar presentations
Live migration of Virtual Machines Nour Stefan, SCPD.
Advertisements

Capacity Planning in a Virtual Environment
SLA-Oriented Resource Provisioning for Cloud Computing
The Who, What, Why and How of High Performance Computing Applications in the Cloud Soheila Abrishami 1.
1 Characterization of Software Aging Effects in Elastic Storage Mechanisms for Private Clouds Rubens Matos, Jean Araujo, Vandi Alves and Paulo Maciel Presenter:
VSphere vs. Hyper-V Metron Performance Showdown. Objectives Architecture Available metrics Challenges in virtual environments Test environment and methods.
Sandpiper : Black box and Gray-Box resource management for Virtual Machines Journal : Computer Networks: The International Journal of Computer and Telecommunications.
XENMON: QOS MONITORING AND PERFORMANCE PROFILING TOOL Diwaker Gupta, Rob Gardner, Ludmila Cherkasova 1.
Virtualization and Cloud Computing Virtualization David Bednárek, Jakub Yaghob, Filip Zavoral.
Memory Buddies: Exploiting Page Sharing for Smart Colocation in Virtualized Data Centers Timothy Wood, Gabriel Tarasuk-Levin, Prashant Shenoy, Peter Desnoyers*,
Dynamic Resource Allocation using Virtual Machine for Cloud Computing Environment Zhen Xiao, Weijia Song, and Qi Chen Dept. of Computer Science Peking.
CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems Zhiming Shen, Sethuraman Subbiah, Xiaohui Gu, John Wilkes.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Provisioning for Multi-tier Internet Applications Bhuvan Urgaonkar, Prashant.
Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.
COMMA: Coordinating the Migration of Multi-tier applications 1 Jie Zheng* T.S Eugene Ng* Kunwadee Sripanidkulchai† Zhaolei Liu* *Rice University, USA †NECTEC,
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Server Consolidation in Virtualized Data Centers Prashant Shenoy University of Massachusetts.
Virtualizing Mission-Critical Apps 1PM EST, 3/29/2011 Ilya Mirman Philip Thomas.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Quantifying the Benefits of Resource Multiplexing in On-Demand Data Centers Abhishek.
Energy Efficient Prefetching – from models to Implementation 6/19/ Adam Manzanares and Xiao Qin Department of Computer Science and Software Engineering.
Energy Efficient Prefetching with Buffer Disks for Cluster File Systems 6/19/ Adam Manzanares and Xiao Qin Department of Computer Science and Software.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Virtualization in Data Centers Prashant Shenoy
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and
Virtualization for Cloud Computing
Virtualization Performance H. Reza Taheri Senior Staff Eng. VMware.
Resource Management in Virtualization-based Data Centers Bhuvan Urgaonkar Computer Systems Laboratory Pennsylvania State University Bhuvan Urgaonkar Computer.
Computer Science Cataclysm: Policing Extreme Overloads in Internet Applications Bhuvan Urgaonkar and Prashant Shenoy University of Massachusetts.
AGILE, DYNAMIC PROVISIONING OF MULTITIER INTERNET APPLICATIONS Bhuvan Urgaonkar, Prashant Shenoy, Abhishek Chandray, and Pawan Goyal ACM Transactions on.
Measuring zSeries System Performance Dr. Chu J. Jong School of Information Technology Illinois State University 06/11/2012 Sponsored in part by Deer &
Black-box and Gray-box Strategies for Virtual Machine Migration Timothy Wood, Prashant Shenoy, Arun Venkataramani, and Mazin Yousif † Univ. of Massachusetts.
Computer Science 1 Resource Overbooking and Application Profiling in Shared Hosting Platforms Bhuvan Urgaonkar Prashant Shenoy Timothy Roscoe † UMASS Amherst.
Report : Zhen Ming Wu 2008 IEEE 9th Grid Computing Conference.
Adaptive Control of Virtualized Resources in Utility Computing Environments HP Labs: Xiaoyun Zhu, Mustafa Uysal, Zhikui Wang, Sharad Singhal University.
Department of Computer Science Engineering SRM University
How to Resolve Bottlenecks and Optimize your Virtual Environment Chris Chesley, Sr. Systems Engineer
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science An Analytical Model for Multi-tier Internet Services and its Applications Bhuvan.
Storage Management in Virtualized Cloud Environments Sankaran Sivathanu, Ling Liu, Mei Yiduo and Xing Pu Student Workshop on Frontiers of Cloud Computing,
Improving Network I/O Virtualization for Cloud Computing.
Profile Driven Component Placement for Cluster-based Online Services Christopher Stewart (University of Rochester) Kai Shen (University of Rochester) Sandhya.
IISWC 2007 Panel Benchmarking in the Web 2.0 Era Prashant Shenoy UMass Amherst.
Challenges towards Elastic Power Management in Internet Data Center.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Profiling and Modeling Resource Usage.
Our work on virtualization Chen Haogang, Wang Xiaolin {hchen, Institute of Network and Information Systems School of Electrical Engineering.
Dynamic Resource Monitoring and Allocation in a virtualized environment.
Swapping to Remote Memory over InfiniBand: An Approach using a High Performance Network Block Device Shuang LiangRanjit NoronhaDhabaleswar K. Panda IEEE.
Server Virtualization
Computer Science 1 Resource Overbooking and Application Profiling in Shared Hosting Platforms Bhuvan Urgaonkar Prashant Shenoy Timothy Roscoe † UMASS Amherst.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
1 Virtual Machine Memory Access Tracing With Hypervisor Exclusive Cache USENIX ‘07 Pin Lu & Kai Shen Department of Computer Science University of Rochester.
Embedded System Lab. 정범종 A_DRM: Architecture-aware Distributed Resource Management of Virtualized Clusters H. Wang et al. VEE, 2015.
Modeling Virtualized Environments in Simalytic ® Models by Computing Missing Service Demand Parameters CMG2009 Paper 9103, December 11, 2009 Dr. Tim R.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Resource Management in Internet Data Centers Bhuvan Urgaonkar Laboratory.
1 Agility in Virtualized Utility Computing Hangwei Qian, Elliot Miller, Wei Zhang Michael Rabinovich, Craig E. Wills {EECS Department, Case Western Reserve.
Cloud Computing Lecture 5-6 Muhammad Ahmad Jan.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
1 Best Practices for Monitoring Databases on VMware Dean Richards Senior DBA, Confio Software.
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
Workload Distribution Architecture
Introduction to Load Balancing:
Measurement-based Design
Distributed Network Traffic Feature Extraction for a Real-time IDS
Zhen Xiao, Qi Chen, and Haipeng Luo May 2013
DotSlash: An Automated Web Hotspot Rescue System
Cloud Computing Architecture
Specialized Cloud Architectures
Presentation transcript:

U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Black-box and Gray-box Strategies for Virtual Machine Migration Timothy Wood, Prashant Shenoy, Arun Venkataramani, and Mazin Yousif * University of Massachusetts Amherst * Intel, Portland

Enterprise Data Centers Data Centers are composed of: Large clusters of servers Network attached storage devices Multiple applications per server Shared hosting environment Multi-tier, may span multiple servers Allocates resources to meet Service Level Agreements (SLAs) Virtualization increasingly common

Benefits of Virtualization Run multiple applications on one server Each application runs in its own virtual machine Maintains isolation Provides security Rapidly adjust resource allocations CPU priority, memory allocation VM migration “Transparent” to application No downtime, but incurs overhead How can we use virtualization to more efficiently utilize data center resources?

Data Center Workloads Web applications see highly dynamic workloads Multi-time-scale variations Transient spikes and flash crowds Time (days) Arrivals per min Time (hrs) Request Rate (req/min) Arrivals per min How can we provision resources to meet these changing demands?

Provisioning Methods Hotspots form if resource demand exceeds provisioned capacity Static over-provisioning Allocate for peak load Wastes resources Not suitable for dynamic workloads Difficult to predict peak resource requirements Dynamic provisioning Adjust based on workload Often done manually Becoming easier with virtualization

Problem Statement How can we automatically detect and eliminate hotspots in data center environments? Use VM migration and dynamic resource allocation!

Outline Introduction & Motivation System Overview When? How much? And Where to? Implementation and Evaluation Conclusions

Research Challenges Sandpiper: automatically detect and mitigate hotspots through virtual machine migration When to migrate? Where to move to? How much of each resource to allocate? How much information needed to make decisions? A migratory bird

Sandpiper Architecture Nucleus Monitor resources Report to control plane One per server Control Plane Centralized server Hotspot Detector Detect when a hotspot occurs Profiling Engine Decide how much to allocate Migration Manager Determine where to migrate Nucleus VM 1 VM 2 HotspotDetector Control Plane MigrationManagerProfilingEngine … PM = Physical Machine VM = Virtual Machine PM 1PM N

Black-Box and Gray-Box Black-box: only data from outside the VM Completely OS and application agnostic Gray-Box: access to OS stats and application logs Request level data can improve detection and profiling Not always feasible – customer may control OS Gray Box Application logs OS statistics Black Box ??? Is black-box sufficient? What do we gain from gray-box data?

Outline Introduction & Motivation System Overview When? How much? And Where to? Implementation and Evaluation Conclusions

Black-box Monitoring Xen uses a “Driver Domain” Special VM with network and disk drivers Nucleus runs here CPU Scheduler statistics Network Linux device information Memory Detect swapping from disk I/O Only know when performance is poor Hypervisor Driver Domain NucleusVM

Hotspot Detection – When? Resource Thresholds Potential hotspot if utilization exceeds threshold Only trigger for sustained overload Must be overloaded for k out of n measurements Autoregressive Time Series Model Use historical data to predict future values Minimize impact of transient spikes Time Utilization Time Utilization Time Utilization Not overloaded Hotspot Detected!

How much of each resource to give a VM Create distribution from time series Provision to meet peaks of recent workload What to do if utilization is at 100%? Gray-box Request level knowledge can help Can use application models to determine requirements Resource Profiling – How much? Historical data % Utilization Probability Utilization Profile

Determining Placement – Where to? Migrate VMs from overloaded to underloaded servers Use Volume to find most loaded servers Captures load on multiple resource dimensions Highly loaded servers are targeted first Migrations incur overhead Migration cost determined by RAM Migrate the VM with highest Volume/RAM ratio Volume = 1 1-cpu 1 1-net 1 1-mem ** cpu mem net Maximize the amount of load transferred while minimizing the overhead of migrations

Placement Algorithm First try migrations Displace VMs from high Volume servers Use Volume/RAM to minimize overhead Don’t create new hotspots! What if high average load in system? Swap if necessary Swap a high Volume VM for a low Volume one Requires 3 migrations Can’t support both at once PM1PM2 VM3 VM2 VM1 VM4 PM1PM2 VM3 VM2 VM4 Spare VM1 VM5 Migration Swap Swaps increase the number of hotspots we can resolve

Outline Introduction & Motivation System Overview When? How much? And Where to? Implementation and Evaluation Conclusions

Implementation Use Xen virtualization software Testbed of twenty 2.4Ghz P4 servers Apache , PHP , MySQL Synthetic PHP applications RUBiS – multi-tier ebay-like web application

Migration Effectiveness 3 Physical servers, 5 virtual machines VMs serve CPU intensive PHP scripts Migration triggered when CPU usage exceeds 75% Sandpiper detects and responds to 3 hotspots PM 1 PM 2 PM 3 CPU Usage (stacked)

Memory Hotspots Virtual machine runs SpecJBB benchmark Memory utilization increases over time Black-box increases by 32MB if page-swapping observed Gray-box maintains 32 MB free Significantly reduces page-swapping Gray-box can improve application performance by proactively increasing allocation

Data Center Prototype 16 server cluster runs realistic data center applications on 35 virtual machines 6 servers (14 VMs) become simultaneously overloaded 4 CPU hotspots and 2 network hotspots Sandpiper eliminates all hotspots in four minutes Uses 7 migrations and 2 swaps Despite migration overhead, VMs see fewer periods of overload

Related Work Menasce and Bennani 2006 Single server resource management VIOLIN and Virtuoso Use virtualization for dynamic resource control in grid computing environments Shirako Migration used to meet resource policies determined by application owners VMware Distributed Resource Scheduler Automatically migrates VMs to ensure they receive their resource quota

Summary Virtual Machine migration is a viable tool for dynamic data center provisioning Sandpiper can rapidly detect and eliminate hotspots while treating each VM as a black-box Gray-Box information can improve performance in some scenarios Proactive memory allocations Future work Improved black-box memory monitoring Support for replicated services

Thank you

Stability During Overload Predict future usage Will not migrate if destination could become overloaded Each set of migrations must eliminate a hotspot Algorithm only performs bounded number of migrations Measured Predicted

Sandpiper Overhead CPU/mem same as monitoring tools (1%) Network bandwidth negligible Placement algorithm completes in less than 10 seconds for up to 750 VMs Can distribute computation if necessary

Gray v. Black - Apache Load spikes on 2 web servers cause CPU saturation Black-box underestimates each VM’s requirement Does not know how much more to allocate Requires 3 sequential migrations to resolve hotspot Gray-box correctly judges resource requirements by using application logs Initiates 2 migrations in parallel Eliminates hotspot 60% faster Web Server Response Time Migrations