JustRunIt: Experiment-Based Management of Virtualized Data Centers Wei Zheng Ricardo Bianchini Rutgers University Yoshio Turner Renato Santos John Janakiraman.

Slides:



Advertisements
Similar presentations
Wei Lu 1, Kate Keahey 2, Tim Freeman 2, Frank Siebenlist 2 1 Indiana University, 2 Argonne National Lab
Advertisements

Remus: High Availability via Asynchronous Virtual Machine Replication
Diagnosing Performance Overheads in the Xen Virtual Machine Environment Aravind Menon Willy Zwaenepoel EPFL, Lausanne Jose Renato Santos Yoshio Turner.
Automatic Configuration of Internet Services Wei Zheng, Ricardo Bianchini, and Thu Nguyen Department of Computer Science Rutgers University.
Difference Engine: Harnessing Memory Redundancy in Virtual Machines by Diwaker Gupta et al. presented by Jonathan Berkhahn.
Walter Binder University of Lugano, Switzerland Niranjan Suri IHMC, Florida, USA Green Computing: Energy Consumption Optimized Service Hosting.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
XENMON: QOS MONITORING AND PERFORMANCE PROFILING TOOL Diwaker Gupta, Rob Gardner, Ludmila Cherkasova 1.
CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems Zhiming Shen, Sethuraman Subbiah, Xiaohui Gu, John Wilkes.
Introduction to Virtualization
Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
SLA-aware Virtual Resource Management for Cloud Infrastructures
IPOEM: A GPS Tool for Integrated Management in Virtualized Data Centers Hui Zhang 1, Kenji Yoshihira 1, Ya-Yunn Su 2, Guofei Jiang 1, Ming Chen 3, Xiaorui.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
ENFORCING PERFORMANCE ISOLATION ACROSS VIRTUAL MACHINES IN XEN Diwaker Gupta, Ludmila Cherkasova, Rob Gardner, Amin Vahdat Middleware '06 Proceedings of.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Virtualization in Data Centers Prashant Shenoy
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Automated Workload Management in.
By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and
VIRTUALIZATION AND YOUR BUSINESS November 18, 2010 | Worksighted.
Virtualization for Cloud Computing
VIRTUALISATION OF HADOOP CLUSTERS Dr G Sudha Sadasivam Assistant Professor Department of CSE PSGCT.
Copyright Tim Antonowicz, This work is the intellectual property of the author. Permission is granted for this material to be shared for non- commercial,
Adaptive Server Farms for the Data Center Contact: Ron Sheen Fujitsu Siemens Computers, Inc Sever Blade Summit, Getting the.
1 MASTERING (VIRTUAL) NETWORKS A Case Study of Virtualizing Internet Lab Avin Chen Borokhovich Michael Goldfeld Arik.
Resource Management in Virtualization-based Data Centers Bhuvan Urgaonkar Computer Systems Laboratory Pennsylvania State University Bhuvan Urgaonkar Computer.
Presented by : Ran Koretzki. Basic Introduction What are VM’s ? What is migration ? What is Live migration ?
Using Virtualization in the Classroom. Using Virtualization in the Classroom Session Objectives Define virtualization Compare major virtualization programs.
Self-Adaptive QoS Guarantees and Optimization in Clouds Jim (Zhanwen) Li (Carleton University) Murray Woodside (Carleton University) John Chinneck (Carleton.
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
Adaptive Control of Virtualized Resources in Utility Computing Environments HP Labs: Xiaoyun Zhu, Mustafa Uysal, Zhikui Wang, Sharad Singhal University.
Department of Computer Science Engineering SRM University
Virtual Machine Hosting for Networked Clusters: Building the Foundations for “Autonomic” Orchestration Based on paper by Laura Grit, David Irwin, Aydan.
SAIGONTECH COPPERATIVE EDUCATION NETWORKING Spring 2009 Seminar #1 VIRTUALIZATION EVERYWHERE.
A Cloud is a type of parallel and distributed system consisting of a collection of inter- connected and virtualized computers that are dynamically provisioned.
Bottlenecks: Automated Design Configuration Evaluation and Tune.
November , 2009SERVICE COMPUTATION 2009 Analysis of Energy Efficiency in Clouds H. AbdelSalamK. Maly R. MukkamalaM. Zubair Department.
Virtualization Concepts Presented by: Mariano Diaz.
Improving Network I/O Virtualization for Cloud Computing.
Building Green Cloud Services at Low Cost Josep Ll. Berral, Íñigo Goiri, Thu D. Nguyen, Ricard Gavaldà, Jordi Torres, Ricardo Bianchini.
Zero-copy Migration for Lightweight Software Rejuvenation of Virtualized Systems Kenichi Kourai Hiroki Ooba Kyushu Institute of Technology.
RECON: A TOOL TO RECOMMEND DYNAMIC SERVER CONSOLIDATION IN MULTI-CLUSTER DATACENTERS Anindya Neogi IEEE Network Operations and Management Symposium, 2008.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Profiling and Modeling Resource Usage.
Using Virtual Servers for the CERN Windows infrastructure Emmanuel Ormancey, Alberto Pace CERN, Information Technology Department.
Usenix Annual Conference, Freenix track – June 2004 – 1 : Flexible Database Clustering Middleware Emmanuel Cecchet – INRIA Julie Marguerite.
Quantifying and Improving I/O Predictability in Virtualized Systems Cheng Li, Inigo Goiri, Abhishek Bhattacharjee, Ricardo Bianchini, Thu D. Nguyen 1.
Server Virtualization
A dynamic optimization model for power and performance management of virtualized clusters Vinicius Petrucci, Orlando Loques Univ. Federal Fluminense Niteroi,
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
Embedded System Lab. 정범종 A_DRM: Architecture-aware Distributed Resource Management of Virtualized Clusters H. Wang et al. VEE, 2015.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
A. Frank - P. Weisberg Operating Systems Structure of Operating Systems.
June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.
Full and Para Virtualization
Technical Reading Report Virtual Power: Coordinated Power Management in Virtualized Enterprise Environment Paper by: Ripal Nathuji & Karsten Schwan from.
Cloud Computing Lecture 5-6 Muhammad Ahmad Jan.
CSE598c - Virtual Machines - Spring Diagnosing Performance Overheads in the Xen Virtual Machine EnvironmentPage 1 CSE 598c Virtual Machines “Diagnosing.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
© 2004 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Understanding Virtualization Overhead.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Virtualization Neependra Khare
Xin Li, Chen Qian University of Kentucky
Is Virtualization ready for End-to-End Application Performance?
Clinton A Jones Eastern Kentucky University Department of Technology
Management of Virtual Execution Environments 3 June 2008
Comparison of the Three CPU Schedulers in Xen
Zhen Xiao, Qi Chen, and Haipeng Luo May 2013
Cloud computing mechanisms
Specialized Cloud Architectures
Virtualization Dr. S. R. Ahmed.
A workload-aware energy model for VM migration
Presentation transcript:

JustRunIt: Experiment-Based Management of Virtualized Data Centers Wei Zheng Ricardo Bianchini Rutgers University Yoshio Turner Renato Santos John Janakiraman HP Labs

Motivation Managing data center is a challenging task – Resource allocation, evaluation of software/hardware upgrades, capacity planning, etc. – Decisions affect performance, availability, energy consumption State-of-the-art uses modeling for these tasks – Models give insight into system behavior – Fast exploration of large parameter spaces Modeling has some important drawbacks – Consumes a very expensive resource: human labor – Needs to be re-calibrated and re-validated as the systems evolve

Our Approach Idea: experiments are a better approach – Consume a cheaper resource: machine time (and energy) – High fidelity JustRunIt: an infrastructure for experiment-based management of virtualized data centers Management system or administrator can use JustRunIt results to perform management tasks – Resource management and hardware/software upgrades – Select the best value for software tunables – Evaluate the correctness of administrator actions

Outline Motivation JustRunIt design and implementation Evaluation – Case study 1: resource management – Case study 2: hardware upgrades Related work Conclusion

Target Environment Virtualized data centers host multiple independent Internet services Each service comprises multiple tiers, e.g. a web tier, an application tier, and a database tier Each service has strict negotiated SLAs (Service Level Agreements), e.g. response time All services are hosted in VMs for isolation, easy migration, management flexibility

Data Center with JustRunIt Creates sandbox Clones VMs Applies configuration changes Duplicates live workload to sandbox Properties – No effect on on-line services – Does not replicate entire service – Almost service-independent Assess performance and energy of different configurations A1 A2 A1 A2 A3 A2 A3 A1 A2 A3 W1 W2 W1 W2 W3 W1 W2 W3 D1 D2 D1 D3 D2 D3 D1 D2 D3 S-A2 Sandbox S-W2S-D2 On-line system

JustRunIt Architecture XX XXX XIIX IIII XXIX TT T Driver Checker - Parameter Ranges - Heuristics - Time Limit - Parameter Ranges - Heuristics - Time Limit Interpolator Management Entity Param values Experiment results Param values Experiment results Experimenter JustRunIt param2 param1

Experimenter Step 1: Clone subset of production system to a sandbox – VM cloning: Modify Xen live migration to resume original VM instead of destroying it – Storage cloning: LVM copy-on- write snapshot for sandbox VM – L2/L3 network address translation: implemented in driver domain netback driver to prevent network address conflict Step 2: Apply configuration changes – Exs: CPU allocation, frequency VM

Proxies filter requests/replies from the sandbox VM Emulates the timing and functional behavior of preceding and following service tiers – Application protocol level requests/replies (e.g. HTTP) Experimenter Tier-N VMIn-ProxyOut-Proxy Sandbox VM Step 3: Duplicates live workload to sandbox using proxies

JustRunIt Architecture XX XXX XIIX IIII XXIX TT T Driver Checker - Parameter Ranges - Heuristics - Time Limit - Parameter Ranges - Heuristics - Time Limit Interpolator Management Entity Param values Experiment results Param values Experiment results Experimenter JustRunIt param2 param1

Driver Goal: Fill in results matrix within a time limit Corners Midpoints (recursive) Heuristics –Cancel experiments if gain for a resource addition falls below a threshold –Cancel experiments for tiers that do not produce the largest gains from a resource addition XX X X XX X X X CPU allocation CPU Freq (min,min)

JustRunIt Architecture XX XXX XIIX IIII XXIX TT T Driver Checker - Parameter Ranges - Heuristics - Time Limit - Parameter Ranges - Heuristics - Time Limit Interpolator Management Entity Param values Experiment results Param values Experiment results Experimenter JustRunIt param2 param1

Interpolator and Checker For simplicity, we use linear interpolation Checker will verify the interpolated result by invoking the experimenter to run corresponding experiments in the background

Cost of JustRunIt Building JustRunIt needs human effort also – The most time-consuming part is proxies implementation – Current proxies understand HTTP, mod_jk, MySQL protocols – Developed from an open source proxy daemon, each proxy need 800~1500 new lines of C code Cost of VM Cloning: 42 lines of Python code in xend and 244 lines of C in netback driver The engineering cost of JustRunIt can be amortized for any service based on the same protocols

Outline Motivation JustRunIt design and implementation Evaluation – Case study 1: resource management – Case study 2: hardware upgrades Related work Conclusion

Methodology 15 HP Proliant C-class blades (8G, 2 Xeon dual-core) interconnected with Gbit network 2 types of 3 tier Internet service – RUBiS: online auction service modeled after Ebay.com – TPC-W: online book store modeled after Amazon.com Xen 3.3 with Linux Dom0 pinned to separate core for performance isolation

3-tier service with one node per tier; two nodes for proxies Overhead exposed – slight RT degradation, no effect on TP Overhead on On-line Service?

Fidelity of The Sandbox Execution? Throughput (reqs/s) Application server at 400 requests/second (similar results for higher load) Response Time Throughput

Automated Management Management Entity Data CenterJustRunIt “What if” Answer Monitoring Action Change Result

Case Study 1: Resource Management Goal: consolidate the hosted services onto the smallest possible set of nodes, while satisfying all SLAs Management entity invokes JustRunIt when response time SLA is violated, or when SLA is met by a large margin Management entity uses performance-resource matrix to determine resource needs Management entity performs bin packing (via simulated annealing) to minimize number of physical machines and number of VM migrations

Case Study 1: Resource Management 9 blades: 2 for first tier; 2 for second tier; 2 for third tier; 3 for load balancing and storage service 4 services are populated Each VM allocated 50% CPU SLA: 50ms Service 0 workload is increased to 1500reqs/sec after 2 mins

Resource Management with JustRunIt 4 services on 11 nodes SLA = 50ms Increase load on S0 Run 3 exps for 3 mins JRI Modeling Violating SLA Running experiments Migrating Violating SLA Solving model Migrating

Case Study 2: Hardware Upgrades Goal: evaluate if hardware upgrade allow further consolidation and lower overall power consumption JustRunIt uses one instance of new hardware in sandbox to determine the consolidation savings Bin packing determines necessary number of new machines to accommodate production workload

Case Study 2: Hardware Upgrades Initial server uses 90% of one CPU core on old hardware (emulate using low frequency mode) New machine (emulate using high frequency mode) requires 72% This would allow further consolidation in a large system

Modeling, feedback control, and machine learning for managing data centers [Stewart’05, Stewart’08, Padala’07, Padala’09, Cohen’04] Scaling down data centers emulation [Gupta’06, Gupta’08] Sandboxing and duplication for managing data centers [Nagaraja’04, Tan’05, Oliveira’06] Run experiments quickly [Osogami’06, Osogami’07] Selecting experiments to run [Zheng’07, Shivam’08] Related Work

Conclusions JustRunIt infrastructure combines well with automated management systems Answers “what-if” questions realistically and transparently Can support a variety of management tasks Future investigation – Tier interactions – Different workload mix – Build proxies for a database server

THANK YOU! QUESTIONS?