Palacios and Kitten: New High Performance Operating Systems For Scalable Virtualized and Native Supercomputing John R. Lange and Kevin Pedretti Trammell.

Slides:



Advertisements
Similar presentations
1 Towards Virtual Passthrough I/O on Commodity Devices Lei Xia, Jack Lange, Peter Dinda {lxia, jarusl, Department of Electrical.
Advertisements

Virtualization Dr. Michael L. Collard
Virtual Machine Technology Dr. Gregor von Laszewski Dr. Lizhe Wang.
Technology Drivers Traditional HPC application drivers – OS noise, resource monitoring and management, memory footprint – Complexity of resources to be.
Partition and Isolate: Approaches for Consolidating HPC and Commodity Workloads Jack Lange Assistant Professor University of Pittsburgh.
 You have exascale problems? ◦ Load Balancing? ◦ Failure? ◦ Power Management?  My system software will solve these problems.
Advanced Virtualization Techniques for High Performance Cloud Cyberinfrastructure Andrew J. Younge Ph.D. Candidate Indiana University Advisor: Geoffrey.
Minimal-overhead Virtualization of a Large Scale Supercomputer John R. Lange and Kevin Pedretti, Peter Dinda, Chang Bae, Patrick Bridges, Philip Soltero,
HPMMAP: Lightweight Memory Management for Commodity Operating Systems
Network Implementation for Xen and KVM Class project for E : Network System Design and Implantation 12 Apr 2010 Kangkook Jee (kj2181)
Why Linux is a Bad Idea as a Compute Node OS (for Balanced Systems) Ron Brightwell Sandia National Labs Scalable Computing Systems Department
A. Frank - P. Weisberg Operating Systems Structure of Operating Systems.
Operating System Support for Virtual Machines Sam King George Dunlap Peter Chen CoVirt Project, University of Michigan.
Virtual Machines. Virtualization Virtualization deals with “extending or replacing an existing interface so as to mimic the behavior of another system”
Virtualization for Cloud Computing
Virtual Machine Monitors CSE451 Andrew Whitaker. Hardware Virtualization Running multiple operating systems on a single physical machine Examples:  VMWare,
CSE598C Virtual Machines and Their Applications Operating System Support for Virtual Machines Coauthored by Samuel T. King, George W. Dunlap and Peter.
Tanenbaum 8.3 See references
Symbiotic Virtualization John R. Lange Thesis Proposal Department of Electrical Engineering and Computer Science Northwestern University June 2009.
1 MOLAR: MOdular Linux and Adaptive Runtime support Project Team David Bernholdt 1, Christian Engelmann 1, Stephen L. Scott 1, Jeffrey Vetter 1 Arthur.
Virtualization Concept. Virtualization  Real: it exists, you can see it.  Transparent: it exists, you cannot see it  Virtual: it does not exist, you.
SymCall: Symbiotic Virtualization Through VMM-to-Guest Upcalls John R. Lange and Peter Dinda University of Pittsburgh (CS) Northwestern University (EECS)
Symbiotic Virtualization John R. Lange Ph.D. Final Defense Department of Electrical Engineering and Computer Science Northwestern University August 9,
Dual Stack Virtualization: Consolidating HPC and commodity workloads in the cloud Brian Kocoloski, Jiannan Ouyang, Jack Lange University of Pittsburgh.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract.
Microkernels, virtualization, exokernels Tutorial 1 – CSC469.
SAIGONTECH COPPERATIVE EDUCATION NETWORKING Spring 2009 Seminar #1 VIRTUALIZATION EVERYWHERE.
Achieving Isolation in Consolidated Environments Jack Lange Assistant Professor University of Pittsburgh.
Principles of Scalable HPC System Design March 6, 2012 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
Virtualization Concepts Presented by: Mariano Diaz.
Operating System Support for Virtual Machines Sam King George Dunlap Peter Chen CoVirt Project, University of Michigan.
การติดตั้งและทดสอบการทำคลัสเต อร์เสมือนบน Xen, ROCKS, และไท ยกริด Roll Implementation of Virtualization Clusters based on Xen, ROCKS, and ThaiGrid Roll.
An architecture for space sharing HPC and commodity workloads in the cloud Jack Lange Assistant Professor University of Pittsburgh.
Virtualization: Not Just For Servers Hollis Blanchard PowerPC kernel hacker.
Virtual Machine and its Role in Distributed Systems.
Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology, Fall 2010 Performance.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
High Performance Computing on Virtualized Environments Ganesh Thiagarajan Fall 2014 Instructor: Yuzhe(Richard) Tang Syracuse University.
Introduction to the Palacios VMM and the V3Vee Project John R. Lange Department of Computer Science University of Pittsburgh September 28th, 2010.
MATRIX MULTIPLY WITH DRYAD B649 Course Project Introduction.
Multi-stack System Software Jack Lange Assistant Professor University of Pittsburgh.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Architecting a Symbiotic Virtual Machine Monitor for Scalable High Performance Computing John R. Lange Department of Electrical Engineering and Computer.
Cloud Operating System Unit 09 Cloud OS Core Technology M. C. Chiang Department of Computer Science and Engineering National Sun Yat-sen University Kaohsiung,
The Role of Virtualization in Exascale Production Systems Jack Lange Assistant Professor University of Pittsburgh.
Introduction to virtualization
Full and Para Virtualization
Lecture 12 Virtualization Overview 1 Dec. 1, 2015 Prof. Kyu Ho Park “Understanding Full Virtualization, Paravirtualization, and Hardware Assist”, White.
Virtualization One computer can do the job of multiple computers, by sharing the resources of a single computer across multiple environments. Turning hardware.
Operating-System Structures
Protection of Processes Security and privacy of data is challenging currently. Protecting information – Not limited to hardware. – Depends on innovation.
CSE 451: Operating Systems Winter 2015 Module 25 Virtual Machine Monitors Mark Zbikowski Allen Center 476 © 2013 Gribble, Lazowska,
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
IMPROVEMENT OF COMPUTATIONAL ABILITIES IN COMPUTING ENVIRONMENTS WITH VIRTUALIZATION TECHNOLOGIES Abstract We illustrates the ways to improve abilities.
Virtualization Neependra Khare
A move towards Greener Planet
Introduction to Virtualization
Virtualization.
Virtual Machine Monitors
Virtualization Dr. Michael L. Collard
Virtual Servers.
Running other code under LINUX
Virtualization, Empathic Systems, and Sensors
Operating System Support for Virtual Machines
Brian Kocoloski Jack Lange Department of Computer Science
CSE 451: Operating Systems Autumn Module 24 Virtual Machine Monitors
Virtualization Dr. S. R. Ahmed.
A Virtual Machine Monitor for Utilizing Non-dedicated Clusters
CSE 451: Operating Systems Autumn Module 24 Virtual Machine Monitors
Hypervisor A hypervisor or virtual machine monitor (VMM) is computer software, firmware or hardware that creates and runs virtual machines. A computer.
Presentation transcript:

Palacios and Kitten: New High Performance Operating Systems For Scalable Virtualized and Native Supercomputing John R. Lange and Kevin Pedretti Trammell Hudson, Peter Dinda, Zheng Cui, Lei Xia, Patrick Bridges, Andy Gocke, Steven Jaconette, Mike Levenhagen and Ron Brightwell Northwestern University Sandia National Labs University of New Mexico

2 Summary Palacios –First VMM for scalable HPC –Open Source and available Kitten –First open source Lightweight Kernel for High Performance Computing (HPC) –Open Source and available Proved HPC virtualization is effective at scale –Performance within 5% of native –Largest scale study of virtualization

3 What is a virtual machine? Run an OS as an application –Run multiple OS environments on a single machine –Start, stop, pause –Can easily move entire OS environments Hardware OS Application Hardware Guest OS Application Host OS/VMM Guest OS Application Guest OS Application Guest VMM Page Tables CPU state Hardware Emulate Page Tables CPU state Hardware

4 What are VMMs currently used for? Server Consolidation Fault tolerance Legacy application support Debugging Isolation Virtual appliances Failover and disaster recovery Market size –2007: $5.5 billion –2011: $11.7 billion $16.70 Billion $7.58 Billion

5 High Performance Computing (HPC) Large scale simulations to solve Big Problems

6 Virtualization in HPC Fault tolerance –RedStorm MTBI target: 50 hours –RedStorm Min TTR: 30 minutes – 1 hour Broader usage –Allow applications to select best OS Only if it doesn’t degrade performance… –Tightly coupled parallel applications –Very large scale A.B. Nagarajan, F. Mueller, C. Engelmann, and S.L. Scott Proactive Fault Tolerance for HPC with Xen Virtualization ICS 2007

7 Palacios VMM OS-independent embeddable virtual machine monitor Developed at Northwestern and University of New Mexico Open source and freely available –Downloaded over 1000 times as of July 2009 Users: –Kitten: Lightweight supercomputing OS from Sandia National Labs –MINIX 3 –Modified Linux versions Successfully used on supercomputers, clusters (Infiniband and Ethernet), and servers

8 Palacios as an HPC VMM Minimalist interface –Suitable for an LWK Compile and runtime configurability –Create a VMM tailored to specific environments Low noise Contiguous memory pre-allocation Passthrough resources and resource partitioning

1991 – Sandia/UNM OS (SUNMOS), nCube – Linux – SUNMOS ported to Intel Paragon (1800 nodes)‏ 1993 – SUNMOS experience used to design Puma First implementation of Portals communication architecture 1994 – Linux – Puma ported to ASCI Red (4700 nodes)‏ Renamed Cougar, productized by Intel 1997 – Stripped down Linux used on Cplant (2000 nodes)‏ Difficult to port Puma to COTS Alpha server Included Portals API 2002 – Cougar ported to ASC Red Storm (13000 nodes)‏ Renamed Catamount, productized by Cray Host and NIC-based Portals implementations 2004 – IBM develops LWK (CNK) for BG/L/P ( nodes)‏ 2005 – IBM & ETI develop LWK (C64) for Cyclops64 (160 cores/die)‏ Lightweight Kernel Timeline

10 Kitten: An Open Source LWK Better match for user expectations –Provides mostly Linux-compatible user environment Including threading –Supports unmodified compiler toolchains and ELF executables Better match vendor expectations –Modern code-base with familiar Linux-like organization Drop-in compatible with Linux –Infiniband support End-goal is deployment on future capability system

11 Complexity Scalable HPC performance requires minimal overhead ComponentLines of code Kitten~33,000 Palacios~28,000 Total~61,000 Xen: 580k lines (50k – 80k core) KVM: 50k-60k lines + Kernel dependencies (??) + User level devices (180k)

12 HPC Performance Evaluation Virtualization is very useful for HPC, but… Only if it doesn’t hurt performance Virtualized RedStorm with Palacios –Evaluated with Sandia’s system evaluation benchmarks 17 th fastest supercomputer Cray XT cores ~3500 sq ft 2.5 MegaWatts $90 million

13 Virtualized performance (Catamount) Within 5% Scalable HPCCG: conjugant gradient solver

14 CatamountCompute Node Linux Comparison of Operating Systems HPCCG: conjugant gradient solver Shadow Paging

15 Comparison of Operating Systems CatamountCompute Node Linux CTH: multi-material, large deformation, strong shockwave simulation

16 Large Scale Study Evaluation on full RedStorm system –12 hours of dedicated system time on full machine –Largest virtualization performance scaling study to date Measured performance at exponentially increasing scales –Up to 4096 nodes Publicity –New York Times –Slashdot –HPCWire –Communications of the ACM –PC World

17 Scalability at Large Scale (Catamount) CTH: multi-material, large deformation, strong shockwave simulation Within 3% Scalable

18 Commodity Systems Kitten and Palacios fully support commodity systems –Infiniband clusters –Ethernet servers –Generic PC hardware Palacios embeddable in many OSes –Kitten –MINIX 3 –Linux –GeekOS

19 Infiniband on Commodity Linux 2 node Infiniband Ping Pong bandwidth measurement (Linux guest on IB cluster)

20 Summary Virtualization can scale –Near native performance for optimized VMM/guest (within 5%) VMM needs to know about guest internals –Should modify behavior for each guest environment –Example: Paging method to use depends on guest Black Box inference is not desirable in HPC environment –Unacceptable performance overhead –Convergence time –Mistakes have large consequences Need guest cooperation –Guest and VMM relationship should be symbiotic –Paper forthcoming (4096 scaling results and techniques)

21 Future Work Continue exploring virtualization in HPC –NU, UNM and SNL collaboration –Granted 5 million hours on Jaguar Current fastest supercomputer in the world Oak Ridge National Labs Cray XT5 224,256 cores 4352 sq. ft 6.95 MegaWatts $104 million

22 Palacios and Kitten –Two open source tools for HPC –Proved virtualization of HPC systems can scale Contributions Welcome!! Conclusion