An architecture for space sharing HPC and commodity workloads in the cloud Jack Lange Assistant Professor University of Pittsburgh.

Slides:



Advertisements
Similar presentations
Partition and Isolate: Approaches for Consolidating HPC and Commodity Workloads Jack Lange Assistant Professor University of Pittsburgh.
Advertisements

Palacios and Kitten: New High Performance Operating Systems For Scalable Virtualized and Native Supercomputing John R. Lange and Kevin Pedretti Trammell.
Beowulf Supercomputer System Lee, Jung won CS843.
Minimal-overhead Virtualization of a Large Scale Supercomputer John R. Lange and Kevin Pedretti, Peter Dinda, Chang Bae, Patrick Bridges, Philip Soltero,
Performance Analysis of Virtualization for High Performance Computing A Practical Evaluation of Hypervisor Overheads Matthew Cawood University of Cape.
Xen , Linux Vserver , Planet Lab
Heterogeneous Live Migration of Virtual Machines Pengcheng Liu, Ziye Yang, Xiang Song, Yixun Zhou, Haibo Chen, and Binyu Zang Parallel Processing Institute,
HPMMAP: Lightweight Memory Management for Commodity Operating Systems
Virtual Machines Measure Up John Staton Karsten Steinhaeuser University of Notre Dame December 15, 2005 Graduate Operating Systems, Fall 2005 Final Project.
Disco Running Commodity Operating Systems on Scalable Multiprocessors.
Hosted VMM Architecture Advantages: –Installs and runs like an application –Portable – host OS does I/O access –Coexists with applications running on.
A. Frank - P. Weisberg Operating Systems Structure of Operating Systems.
Virtualization for Cloud Computing
Virtual Machine Monitors CSE451 Andrew Whitaker. Hardware Virtualization Running multiple operating systems on a single physical machine Examples:  VMWare,
CSE598C Virtual Machines and Their Applications Operating System Support for Virtual Machines Coauthored by Samuel T. King, George W. Dunlap and Peter.
Measuring zSeries System Performance Dr. Chu J. Jong School of Information Technology Illinois State University 06/11/2012 Sponsored in part by Deer &
Symbiotic Virtualization John R. Lange Thesis Proposal Department of Electrical Engineering and Computer Science Northwestern University June 2009.
Virtualization Concept. Virtualization  Real: it exists, you can see it.  Transparent: it exists, you cannot see it  Virtual: it does not exist, you.
SymCall: Symbiotic Virtualization Through VMM-to-Guest Upcalls John R. Lange and Peter Dinda University of Pittsburgh (CS) Northwestern University (EECS)
Symbiotic Virtualization John R. Lange Ph.D. Final Defense Department of Electrical Engineering and Computer Science Northwestern University August 9,
Dual Stack Virtualization: Consolidating HPC and commodity workloads in the cloud Brian Kocoloski, Jiannan Ouyang, Jack Lange University of Pittsburgh.
Supporting GPU Sharing in Cloud Environments with a Transparent
Microkernels, virtualization, exokernels Tutorial 1 – CSC469.
UNIX System Administration OS Kernal Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept Kernel or MicroKernel Concept: An OS architecture-design.
A Cloud is a type of parallel and distributed system consisting of a collection of inter- connected and virtualized computers that are dynamically provisioned.
Achieving Isolation in Consolidated Environments Jack Lange Assistant Professor University of Pittsburgh.
LiNK: An Operating System Architecture for Network Processors Steve Muir, Jonathan Smith Princeton University, University of Pennsylvania
So, Jung-ki Distributed Computing System LAB School of Computer Science and Engineering Seoul National University Implementation of Package Management.
Operating System Support for Virtual Machines Samuel T. King, George W. Dunlap,Peter M.Chen Presented By, Rajesh 1 References [1] Virtual Machines: Supporting.
Principles of Scalable HPC System Design March 6, 2012 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
Operating System Support for Virtual Machines Sam King George Dunlap Peter Chen CoVirt Project, University of Michigan.
Eric Keller, Evan Green Princeton University PRESTO /22/08 Virtualizing the Data Plane Through Source Code Merging.
การติดตั้งและทดสอบการทำคลัสเต อร์เสมือนบน Xen, ROCKS, และไท ยกริด Roll Implementation of Virtualization Clusters based on Xen, ROCKS, and ThaiGrid Roll.
Virtualization: Not Just For Servers Hollis Blanchard PowerPC kernel hacker.
The Best of Both Worlds with On-Demand Virtualization Thawan Kooburat and Michael M. Swift On-Demand Virtualization allows systems to benefit from virtualization.
Virtual Machine Monitors: Technology and Trends Jonathan Kaldor CS614 / F07.
Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology, Fall 2010 Performance.
Tessellation: Space-Time Partitioning in a Manycore Client OS Rose Liu 1,2, Kevin Klues 1, Sarah Bird 1, Steven Hofmeyr 3, Krste Asanovic 1, John Kubiatowicz.
Our work on virtualization Chen Haogang, Wang Xiaolin {hchen, Institute of Network and Information Systems School of Electrical Engineering.
High Performance Computing on Virtualized Environments Ganesh Thiagarajan Fall 2014 Instructor: Yuzhe(Richard) Tang Syracuse University.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 2: Operating-System Structures.
Introduction to the Palacios VMM and the V3Vee Project John R. Lange Department of Computer Science University of Pittsburgh September 28th, 2010.
Multi-stack System Software Jack Lange Assistant Professor University of Pittsburgh.
© 2012 MELLANOX TECHNOLOGIES 1 Disruptive Technologies in HPC Interconnect HPC User Forum April 16, 2012.
Headline in Arial Bold 30pt HPC User Forum, April 2008 John Hesterberg HPC OS Directions and Requirements.
 Virtual machine systems: simulators for multiple copies of a machine on itself.  Virtual machine (VM): the simulated machine.  Virtual machine monitor.
Architecting a Symbiotic Virtual Machine Monitor for Scalable High Performance Computing John R. Lange Department of Electrical Engineering and Computer.
The Role of Virtualization in Exascale Production Systems Jack Lange Assistant Professor University of Pittsburgh.
Introduction to virtualization
A. Frank - P. Weisberg Operating Systems Structure of Operating Systems.
Partitioned Multistack Evironments for Exascale Systems Jack Lange Assistant Professor University of Pittsburgh.
Full and Para Virtualization
Enabling Technologies for Distributed Computing Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS School of Computing,
THAWAN KOOBURAT MICHAEL SWIFT UNIVERSITY OF WISCONSIN - MADISON 1 The Best of Both Worlds with On-Demand Virtualization.
Disco: Running Commodity Operating Systems on Scalable Multiprocessors Presented by: Pierre LaBorde, Jordan Deveroux, Imran Ali, Yazen Ghannam, Tzu-Wei.
CSE 451: Operating Systems Winter 2015 Module 25 Virtual Machine Monitors Mark Zbikowski Allen Center 476 © 2013 Gribble, Lazowska,
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 4: Threads.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
Background Computer System Architectures Computer System Software.
Virtualization Neependra Khare
Virtualization for Cloud Computing
Virtual Machine Monitors
Is Virtualization ready for End-to-End Application Performance?
Virtualization, Empathic Systems, and Sensors
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Operating System Support for Virtual Machines
Brian Kocoloski Jack Lange Department of Computer Science
CSE 451: Operating Systems Autumn Module 24 Virtual Machine Monitors
A Virtual Machine Monitor for Utilizing Non-dedicated Clusters
CSE 451: Operating Systems Autumn Module 24 Virtual Machine Monitors
Presentation transcript:

An architecture for space sharing HPC and commodity workloads in the cloud Jack Lange Assistant Professor University of Pittsburgh

HPC in the cloud Palacios VMM – HPC oriented VMM – Specifically targeting supercomputing environments Current state of supercomputing OSes – Linux appears to be the future Not necessarily a good thing Palacios and Linux – Lightweight environments on heavyweight OSes

3 Palacios VMM OS-independent embeddable virtual machine monitor Open source and freely available – 1000s of unique downloads Users: – Kitten: Lightweight supercomputing OS from Sandia National Labs – MINIX 3 – Linux Ideally suited to lightweight supercomputing OSes Successfully used on supercomputers, clusters (Infiniband and Ethernet), and servers

4 A brief Sandia OS history SUNMOS (Sandia/UNM Operating System) – In response to Paragon issues – Library OS – Single task, no demand paging PUMA/Cougar/Catamount – Multitasking – Portals: High performance message passing API Kitten – Lightweight kernel based on Linux ABI – Built from the ground up Ported selected features from Linux CNL (Compute Node Linux) – Cray – Modified version of Linux

5 Palacios as an HPC VMM Minimalist interface – Follows the “lightweight OS” philosophy Compile and runtime configurability – Create a VMM tailored to specific environments Low noise Contiguous memory allocations Passthrough resources and resource partitioning

6 HPC Virtualization Virtualization is very useful for HPC, but… Only if it doesn ’ t hurt performance Virtualized RedStorm with Palacios and Kitten – Evaluated with HPC system benchmarks Cray XT4 204,200 cores 284 TFlops 2.5 MegaWatts

7 Scalability at Large Scale (Weak Scaling) Catamount Guest OS CTH: multi-material, large deformation, strong shockwave simulation Within 3% Scalable IPDPS 2010 VEE 2011

Supercomputing and Linux Lightweight kernels are great for application performance – But their days seem to be numbered – Probably not operational OS of choice for new supercomputers Linux seems to be taking over – Compute Node Linux (CNL), ZeptoOS, etc For some of us, this is not necessarily a “good thing” – Linux introduces significantly more overhead – Subset of applications are hurt (e.g. SAGE ~17% slowdown on CNL) – Ironically Linux is the more closed environment Challenge – Can we provide a lightweight environment on a heavyweight OS? – If we can do this on a supercomputer why not in the cloud?

HPC in the cloud Are supercomputers converging with the cloud? No (at least not yet) – Noise issues – Consolidation – Non reserved resources – Uncontrolled topology Very bad for tightly coupled parallel apps – Can virtualization provide a solution?

Why not other VMMs? Virtualization is considered an “Enterprise” tool – Not for HPC Example: KVM design issues – Userspace handlers – Loose CPU affinities – Complicated memory management – HW resets on context switch

Palacios and Linux Palacios provides a lightweight environment – Internally manages node resources – Does not bother with “enterprise features” Claim: Palacios can provide a scalable HPC environment on commodity platforms – Turn a commodity cloud node into a supercomputer node

Palacios on Linux Bypass Linux management layers Palacios selectively takes over resource management – Memory, devices, CPUs – Repurpose existing mechanisms Allocate large chunks of resources and manage them internally Kernel module – Does not require kernel modifications – Implements lightweight interface – Compatible with Fedora 15 kernel Available in Palacios 1.3 (Nov. 2011)

Hardware Palacios VMM HPC VM KVM VM Hadoop Processes Resource Manager Linux/KVM Direct User Feedback Application Performance Measurements Palacios and Linux

Palacios and Linux memory Linux disables selected memory blocks – Internally (via memory hotplug) – Large contiguous blocks (128MB / block) – Memory is still accessible! Palacios assumes responsibility of disabled blocks – Uses internal memory allocator Palacios and Linux coexist without interference – User chooses physical memory regions to assign

Palacios memory hierarchy Palacios Memory Manager Linux Assigned Memory Palacios Assigned Memory Palacios VM MemoryLinux Application Linux SL*B allocator Physical RAM OS Memory manager Page Tables Guest Page Tables HPC VM Application LWK allocator

Results Evaluations are currently underway – Have collected preliminary results HPC Benchmark (hpccg), run time (secs) – 1 VM, 4 cores ConfigurationMeanStddev Host Kitten Kitten guest (Palacios) Kitten guest (KVM) Host Linux High standard deviation implies poor scalability

Open issues with I/O IOMMU’s and SRIOV are now prevalent – Direct access to network hardware – Palacios can provide high performance I/O Two issues – VM placement (possibly random topology) – Traffic contention These should be solvable – Intelligent placement and scheduling – Specialized network architectures Optical networks, reservable networks, others

18 Infiniband on Commodity Linux 2 node Infiniband Ping Pong bandwidth measurement (Linux guest on IB cluster) VEE 2011

Conclusion Clouds are not supercomputers – Despite their claims to the contrary – Virtualization can bring them closer to convergence Virtualization can bypass host OS overheads Palacios and Linux – Lightweight environments on heavyweight OS

Thank you Jack Lange – – V3Vee Project –