Live Migration of Virtual Machines Christopher Clark, Keir Fraser, Steven Hand, Jacob Gorm Hansen, Eric Jul, Christian Limpach, Ian Pratt, Andrew Warfield.


Live Migration of Virtual Machines
Christopher Clark, Keir Fraser, Steven Hand, Jacob Gorm Hansen, Eric Jul, Christian Limpach, Ian Pratt, Andrew Warfield
University of Cambridge Computer Laboratory, UK
Department of Computer Science, University of Copenhagen

Introduction
Live OS migration
◦ Migrates an entire OS and all of its applications as a unit
◦ Memory state can be transferred in a consistent and efficient fashion
◦ Allows a separation of concerns between the users and the operator
◦ Goal: minimize both the downtime and the total migration time
◦ Uses a pre-copy approach

Related Work
VMotion
Process Migration
◦ Residual dependency
  ▪ An ongoing need for a host to maintain data structures or provide functionality for a process even after the process migrates away from the host

Design(1) - Migrating Memory
Minimize both downtime and total migration time
◦ Downtime: the period during which the service is unavailable
◦ Total migration time: the duration between when migration is initiated and when the original VM can be discarded

Design(2) - Migrating Memory
Three phases of memory transfer
◦ Push phase
  ▪ The source VM continues running
  ▪ Pages are pushed across the network to the destination
◦ Stop-and-copy phase
  ▪ The source VM is stopped and pages are copied across to the destination VM
◦ Pull phase
  ▪ The new VM executes and faults on pages that have not yet been copied
  ▪ Faulting pages are “pulled” from the source
Pre-copy
◦ A bounded iterative push phase followed by a very short stop-and-copy phase
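The trade-off behind pre-copy can be illustrated with a small simulation. This is only a sketch under simplifying assumptions that are not in the slides: the dirtying rate and link rate are constant, every dirtied page is distinct, and the names `precopy_rounds`, `max_rounds`, and `stop_threshold` are hypothetical.

```python
def precopy_rounds(total_pages, dirty_rate, link_rate,
                   max_rounds=30, stop_threshold=64):
    """Simulate a bounded iterative push phase (rates in pages per second).

    Returns (rounds, pages_sent_while_running, pages_left_for_stop_and_copy).
    """
    to_send = total_pages      # the first round sends every page
    sent_live = 0
    rounds = 0
    while rounds < max_rounds and to_send > stop_threshold:
        sent_live += to_send
        # Pages dirtied while this round was in flight must be resent.
        # Integer form of dirty_rate * (to_send / link_rate).
        to_send = min(total_pages, dirty_rate * to_send // link_rate)
        rounds += 1
    return rounds, sent_live, to_send


# A workload that dirties pages ten times slower than the link can send them.
rounds, live, remaining = precopy_rounds(100_000, 1_000, 10_000)
print(rounds, live, remaining)  # 4 111100 10
```

When the dirtying rate is at or above the link rate, the remaining set never shrinks, which is why the push phase has to be bounded: the loop gives up after `max_rounds` and the whole image is left for stop-and-copy.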

Design(3) – Network & Disk
Network
◦ The migrated host generates an unsolicited ARP reply, advertising that the IP address has moved to a new location
◦ A small number of in-flight packets may be lost
Disk
◦ Network-attached storage (NAS)
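To make the unsolicited ARP reply concrete, the sketch below builds the raw Ethernet frame by hand. It only constructs the bytes and does not send them. The function name `gratuitous_arp_reply`, the example MAC and IP, and the choice of a broadcast target hardware address are assumptions for illustration, not details taken from the slides.

```python
import socket
import struct


def gratuitous_arp_reply(mac: bytes, ip: str) -> bytes:
    """Build an Ethernet frame carrying an unsolicited ARP reply for `ip`."""
    broadcast = b"\xff" * 6
    ip_bytes = socket.inet_aton(ip)
    # Ethernet header: destination, source, EtherType 0x0806 (ARP).
    eth = struct.pack("!6s6sH", broadcast, mac, 0x0806)
    # ARP body: Ethernet (1), IPv4 (0x0800), address lengths 6 and 4,
    # opcode 2 (reply), then sender and target hardware/protocol addresses.
    # Sender and target IP are the same: the host announces its own binding.
    arp = struct.pack("!HHBBH6s4s6s4s",
                      1, 0x0800, 6, 4, 2,
                      mac, ip_bytes, broadcast, ip_bytes)
    return eth + arp


frame = gratuitous_arp_reply(bytes.fromhex("020000000001"), "192.0.2.10")
print(len(frame))  # 42
```

Peers that receive such a frame update their ARP caches, so subsequent packets for the IP address are delivered to the new physical host.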

Design(4) – Logical Steps
[Figure: migration timeline. Stage 0: Pre-Migration and Stage 1: Reservation (VM running normally on source host); Stage 2: Iterative Pre-copy (overhead due to copying); Stage 3: Stop and copy and Stage 4: Commitment (downtime, VM out of service); Stage 5: Activation (VM running normally on destination host).]

Design(5) – Logical Steps
Stage 0: Pre-Migration
◦ Preselect the target host
Stage 1: Reservation
◦ Confirm that the required resources are available on the destination host
Stage 2: Iterative Pre-copy
◦ In the first iteration, all pages are transferred from source to destination
◦ Each subsequent iteration copies only the pages dirtied during the previous transfer phase

Design(6) – Logical Steps
Stage 3: Stop and copy
◦ Stop the running OS at the source host
◦ Redirect network traffic to the destination host
◦ Transfer CPU state and the remaining memory pages
Stage 4: Commitment
◦ The destination host indicates to the source that it has successfully received a consistent OS image
◦ The source host acknowledges, and the original VM can now be discarded
Stage 5: Activation
◦ The VM starts on the destination host

Writable Working Set (WWS)
WWS
◦ A certain set of pages is written often
◦ These pages are poor candidates for pre-copy and should be transferred in the stop-and-copy phase
◦ Xen’s shadow page tables are used to track page dirtying


Implementation Issues(1)
Dynamic Rate-Limiting
◦ The administrator chooses a minimum (m) and a maximum (M) bandwidth limit
◦ Transfer speed (v)
◦ Each subsequent round calculates the dirtying rate (r)
  ▪ r = pages dirtied in the previous round / duration of the previous round

Implementation Issues(2)
Dynamic Rate-Limiting
◦ First round: v = m
◦ Each subsequent round: v = r + a constant increment (50 Mbit/sec in the paper)
◦ Pre-copy terminates when v > M or when less than 256 KB remains to be transferred
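The rate-limiting rule can be sketched as follows. This assumes the rule described in the original paper, where the next round's limit is the previous round's dirtying rate plus a constant increment of 50 Mbit/sec. The function name `rate_schedule`, the input format, and the clamp to the minimum rate are hypothetical, and the 256 KB termination condition is not modelled.

```python
def rate_schedule(min_rate, max_rate, observations, increment=50.0):
    """Return the bandwidth limit (Mbit/sec) used in each pre-copy round.

    observations: one (mbit_dirtied, round_seconds) pair per completed round.
    """
    limits = [min_rate]                    # the first round runs at the minimum, v = m
    for dirtied, seconds in observations:
        r = dirtied / seconds              # dirtying rate of the previous round
        v = r + increment                  # candidate limit for the next round
        if v > max_rate:
            break                          # v > M: stop pre-copying
        limits.append(max(min_rate, v))    # assumption: never drop below m
    return limits


print(rate_schedule(100.0, 500.0, [(500, 5), (600, 3), (1000, 2)]))
# [100.0, 150.0, 250.0]
```

In the example the third observation gives r = 500 Mbit/sec, so the candidate limit of 550 exceeds M = 500 and pre-copy ends; the remaining pages would be sent in the stop-and-copy phase.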

Implementation Issues(3)
Rapid Page Dirtying
◦ Page dirtying is often physically clustered
◦ “Peek” at the current round's dirty bitmap and transfer only those pages dirtied in the previous round that have not been dirtied again
Stunning Rogue Processes
◦ Some processes may dirty memory at a very high rate
◦ Example: a test program that writes one word in every page was able to dirty memory at a rate of 320 Gbit/sec
◦ A monitoring thread is forked within the OS kernel when migration begins
◦ It monitors the WWS of individual processes
◦ If a process dirties memory too fast, it is “stunned”
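Both ideas reduce to simple filters, sketched below with Python sets and dictionaries standing in for dirty bitmaps and per-process counters. The function names and the fault limit are hypothetical; the slides do not specify the threshold or the data structures.

```python
def pages_worth_sending(prev_round_dirty, current_dirty):
    """Pages dirtied last round that have not been dirtied again so far.

    Pages already re-dirtied in the current round are skipped, since
    sending them now would most likely be wasted work.
    """
    return prev_round_dirty - current_dirty


def processes_to_stun(write_faults, fault_limit):
    """Return the ids of processes whose write-fault count exceeds the limit."""
    return sorted(pid for pid, faults in write_faults.items()
                  if faults > fault_limit)


print(pages_worth_sending({1, 2, 3, 4}, {3, 4, 9}))           # {1, 2}
print(processes_to_stun({101: 5, 202: 900, 303: 41}, 40))     # [202, 303]
```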


Implementation Issues(4)
Freeing Page Cache Pages
◦ The OS can identify some or all of its free pages
◦ These pages are not transferred during the first iteration
◦ This reduces the transfer time

Implementation Issues(5)
Two methods for initiating and managing state transfer
◦ Managed migration
  ▪ A migration daemon runs in the management VM
◦ Self migration
  ▪ Implemented within the migratee OS
  ▪ A small stub is required on the destination machine

Evaluation(1)
◦ Dell PE-2650 server-class machine
◦ Dual Xeon 2 GHz CPUs
◦ 2 GB memory
◦ Broadcom TG3 network interface
◦ Gigabit Ethernet
◦ NetApp F840 NAS
◦ XenLinux

Evaluation(2) - Simple Web Server
Continuously serving a single 512 KB file to a set of 100 clients

Evaluation(3) - SPECweb99
SPECweb99: an application-level benchmark for evaluating web servers

Evaluation(4)
Quake 3 server: an online game server with 6 players
◦ Downtime: 50 ms
Diabolical workload
◦ A 512 MB host running a simple program that writes constantly to a 256 MB region of memory
◦ Downtime: 3.5 sec
◦ Rare in the real world

Conclusion
Minimal impact on running services
Small downtime with realistic server workloads

Virtual Machine Files

File format(1)
.XML file
◦ Saves the VM configuration details
◦ Named with the GUID

File format(2)
.BIN files
◦ Contain the memory of a virtual machine or snapshot that is in a saved state (running programs, data for those programs, word processing documents you are viewing, etc.)
.VSV files
◦ Contain the saved state of the devices associated with the virtual machine

File format(3)
.vhd files
◦ The virtual hard disk files for the virtual machine (they store things such as files, folders, the file system, and disk partitions)
.avhd files
◦ The differencing disk files used for virtual machine snapshots