Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.

Slides:



Advertisements
Similar presentations
Threads, SMP, and Microkernels
Advertisements

Distributed Processing, Client/Server and Clusters
Distributed Systems CS
Class CS 775/875, Spring 2011 Amit H. Kumar, OCCS Old Dominion University.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Study of Hurricane and Tornado Operating Systems By Shubhanan Bakre.
Distributed Shared Memory
Cache Coherent Distributed Shared Memory. Motivations Small processor count –SMP machines –Single shared memory with multiple processors interconnected.
Computer Architecture Introduction to MIMD architectures Ola Flygt Växjö University
Threads, SMP, and Microkernels Chapter 4. Process Resource ownership - process is allocated a virtual address space to hold the process image Scheduling/execution-
Chapter 4 Threads, SMP, and Microkernels Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design.
Using DSVM to Implement a Distributed File System Ramon Lawrence Dept. of Computer Science
Disco Running Commodity Operating Systems on Scalable Multiprocessors Presented by Petar Bujosevic 05/17/2005 Paper by Edouard Bugnion, Scott Devine, and.
Introduction to MIMD architectures
Distributed Processing, Client/Server, and Clusters
Chapter 16 Client/Server Computing Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design Principles,
Disco: Running Commodity Operating Systems on Scalable Multiprocessors Bugnion et al. Presented by: Ahmed Wafa.
G Robert Grimm New York University Disco.
Distributed Computing Software based solutions to Parallel Computing.
Chapter 17 Parallel Processing.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Multiprocessors CSE 471 Aut 011 Multiprocessors - Flynn’s Taxonomy (1966) Single Instruction stream, Single Data stream (SISD) –Conventional uniprocessor.
PRASHANTHI NARAYAN NETTEM.
1 Distributed Systems: Distributed Process Management – Process Migration.
DISTRIBUTED COMPUTING
The Origin of the VM/370 Time-sharing system Presented by Niranjan Soundararajan.
Introduction to Symmetric Multiprocessors Süha TUNA Bilişim Enstitüsü UHeM Yaz Çalıştayı
Computer System Architectures Computer System Software
SSI-OSCAR A Single System Image for OSCAR Clusters Geoffroy Vallée INRIA – PARIS project team COSET-1 June 26th, 2004.
UNIX System Administration OS Kernal Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept Kernel or MicroKernel Concept: An OS architecture-design.
Disco : Running commodity operating system on scalable multiprocessor Edouard et al. Presented by Jonathan Walpole (based on a slide set from Vidhya Sivasankaran)
CS533 Concepts of Operating Systems Jonathan Walpole.
 What is an operating system? What is an operating system?  Where does the OS fit in? Where does the OS fit in?  Services provided by an OS Services.
WING: A Consistent Computing Platform Yiwei Ci 22/02/2012
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
AN EXTENDED OPENMP TARGETING ON THE HYBRID ARCHITECTURE OF SMP-CLUSTER Author : Y. Zhao 、 C. Hu 、 S. Wang 、 S. Zhang Source : Proceedings of the 2nd IASTED.
Introduction, background, jargon Jakub Yaghob. Literature T.G.Mattson, B.A.Sanders, B.L.Massingill: Patterns for Parallel Programming, Addison- Wesley,
Processes and Threads Processes have two characteristics: – Resource ownership - process includes a virtual address space to hold the process image – Scheduling/execution.
Comparison of Distributed Operating Systems. Systems Discussed ◦Plan 9 ◦AgentOS ◦Clouds ◦E1 ◦MOSIX.
Threads, SMP, and Microkernels Chapter 4. Process Resource ownership - process is allocated a virtual address space to hold the process image Scheduling/execution-
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Disco: Running Commodity Operating Systems on Scalable Multiprocessors Edouard et al. Madhura S Rama.
OPERATING SYSTEM SUPPORT DISTRIBUTED SYSTEMS CHAPTER 6 Lawrence Heyman July 8, 2002.
Disco : Running commodity operating system on scalable multiprocessor Edouard et al. Presented by Vidhya Sivasankaran.
Computer Science Lecture 7, page 1 CS677: Distributed OS Multiprocessor Scheduling Will consider only shared memory multiprocessor Salient features: –One.
PARALLEL PROCESSOR- TAXONOMY. CH18 Parallel Processing {Multi-processor, Multi-computer} Multiple Processor Organizations Symmetric Multiprocessors Cache.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
OpenMP for Networks of SMPs Y. Charlie Hu, Honghui Lu, Alan L. Cox, Willy Zwaenepoel ECE1747 – Parallel Programming Vicky Tsang.
COMP381 by M. Hamdi 1 Clusters: Networks of WS/PC.
Disco: Running Commodity Operating Systems on Scalable Multiprocessors Presented by: Pierre LaBorde, Jordan Deveroux, Imran Ali, Yazen Ghannam, Tzu-Wei.
Cluster computing. 1.What is cluster computing? 2.Need of cluster computing. 3.Architecture 4.Applications of cluster computing 5.Advantages of cluster.
Background Computer System Architectures Computer System Software.
Page 1 2P13 Week 1. Page 2 Page 3 Page 4 Page 5.
Running Commodity Operating Systems on Scalable Multiprocessors Edouard Bugnion, Scott Devine and Mendel Rosenblum Presentation by Mark Smith.
Chapter 16 Client/Server Computing Dave Bremer Otago Polytechnic, N.Z. ©2008, Prentice Hall Operating Systems: Internals and Design Principles, 6/E William.
INTRODUCTION TO HIGH PERFORMANCE COMPUTING AND TERMINOLOGY.
XtreemOS IP project is funded by the European Commission under contract IST-FP Scientific coordinator Christine Morin, INRIA Presented by Ana.
OpenMosix, Open SSI, and LinuxPMI
Distributed Shared Memory
Reactive NUMA A Design for Unifying S-COMA and CC-NUMA
Introduction to Operating Systems
Distributed System Structures 16: Distributed Structures
Threads, SMP, and Microkernels
Outline Midterm results summary Distributed file systems – continued
Hybrid Programming with OpenMP and MPI
High Performance Computing
Outline Review of Quiz #1 Distributed File Systems 4/20/2019 COP5611.
A Virtual Machine Monitor for Utilizing Non-dedicated Clusters
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA (Rennes, France)

2 Motivation Clusters as an alternative to multiprocessor machines for high performance computing Workloads of scientific applications Independent sequential processes Compute intensive, huge memory requirements Parallel applications Shared memory (multithreaded applications, OpenMP) Message passing (MPI) Hybrid applications

3 Some Issues … No obvious solution to support standard Posix multithreaded applications on clusters Memory distribution Need of efficient placement and load-balancing strategies to take advantage of all cluster resources Efficient process migration Scientific applications execution time may be greater than the cluster MTBF High availability and checkpointing

4 Single System Image Operating System Vision of a single machine (virtual SMP) Same interface as a traditional OS for an SMP machine Same vision for all applications Efficiency Properties of a SSI OS Resource distribution transparency Intra- and inter- application resource sharing High availability Scalability

5 Kerrighed SSI OS Combining high performance, high availability and ease of programming Global resource management Processor, memory, disk Integrated resource management Dynamic resource management To deal with configuration changes Extension of the standard OS running on each node Small clusters < 100 nodes

6 Outline Global process management Global memory management Conclusion and Perspectives

7 Global Process Management Global scheduling policy Load balancing Several policies Configurable modular global scheduler The policy can be changed without stopping the operating system or the applications The local scheduler on each node is not modified

8 Architecture of the Global Scheduler Standard OS Global scheduler Monitors Local Analyzers Node 1Node 2

9 Process Management Mechanisms Memory Disk Network Memory Disk Network Process state extraction Process creation Process checkpt Process migration Global scheduler (Application management) Process state extraction Process creation Process checkpt Process migration Global scheduler (Application management)

10 Checkpointing Common mechanisms for supporting checkpointing protocols for both shared memory and message-passing applications Efficient checkpoint creation Several memory checkpoints between two disk checkpoints Disk checkpoints stored on local disks Incremental checkpoints Combination of data replication for efficiency and for high availability for shared memory applications Data replication due to data sharing exploited to decrease the cost of checkpoint creation Recovery data can be used for the computation until the first modification

11 Process Migration Communicating processes can migrate Processes sharing memory Processes communicating with data streams (sockets, pipes, …) Efficiency of the process transfer Address space transfered on demand (containers) Efficiency of the process execution after migration Efficient access to open files (containers) Global management of data streams

12 Global Memory Management Different services Shared virtual memory Remote paging Cooperative file cache A unique concept: the container Software object to store and share data cluster wide (COMA like management) Global management of physical memory Segments of a process address space, files are associated to containers

13 Integration of Containers in a Standard OS Host Operating System Memory Manager File System Linker Memory VM Manager Linker Disk Manager Linker Host Operating System Memory Manager File System Linker Memory VM Manager Linker Disk Manager Linker Container Disk

14 Conclusion & Perspectives A SSI OS for clusters is still missing in 2003 Kerrighed represents a promising approach A first prototype based on Linux is available Current work directions High availability and checkpointing OpenMP on Kerrighed Experimentation with industrial applications EDF, DGA Grid-aware OS for a federation of clusters

15 Kerrighed has been filed as a community trademark.