OS 2020: Slide 1December 6 th, 2011Swarm Lab Opening Tessellation OS: an OS for the Swarm John Kubiatowicz

Slides:



Advertisements
Similar presentations
Wei Lu 1, Kate Keahey 2, Tim Freeman 2, Frank Siebenlist 2 1 Indiana University, 2 Argonne National Lab
Advertisements

Dr. Kalpakis CMSC 621, Advanced Operating Systems. Fall 2003 URL: Distributed System Architectures.
Chorus and other Microkernels Presented by: Jonathan Tanner and Brian Doyle Articles By: Jon Udell Peter D. Varhol Dick Pountain.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Introduction CSCI 444/544 Operating Systems Fall 2008.
Tessellation OS Interfaces and Mechanisms for Two-Level Scheduling John Kubiatowicz UC Berkeley
Using DSVM to Implement a Distributed File System Ramon Lawrence Dept. of Computer Science
Introduction to Operating Systems CS-2301 B-term Introduction to Operating Systems CS-2301, System Programming for Non-majors (Slides include materials.
Ameoba Designed by: Prof Andrew S. Tanenbaum at Vrija University since 1981.
1. Overview  Introduction  Motivations  Multikernel Model  Implementation – The Barrelfish  Performance Testing  Conclusion 2.
Contiki A Lightweight and Flexible Operating System for Tiny Networked Sensors Presented by: Jeremy Schiff.
INTRODUCTION OS/2 was initially designed to extend the capabilities of DOS by IBM and Microsoft Corporations. To create a single industry-standard operating.
Chapter 13 Embedded Systems
Tessellation OS Architecting Systems Software in a ManyCore World John Kubiatowicz UC Berkeley
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Figure 1.1 Interaction between applications and the operating system.
Comparative Operating Systems Understanding the Kernel Structure Prashant Thuppala.
University of Pennsylvania 9/7/00CSE 3801 CSE 380 Introduction to Operating Systems Insup Lee Fall 00.
Operating System Organization
Slide 3-1 Copyright © 2004 Pearson Education, Inc. Operating Systems: A Modern Perspective, Chapter 3 Operating System Organization.
Operating Systems Concepts 1. A Computer Model An operating system has to deal with the fact that a computer is made up of a CPU, random access memory.
DESIGN OF A PLATFORM OF VIRTUAL SERVICE CONTAINERS FOR SERVICE ORIENTED CLOUD COMPUTING Carlos de Alfonso Andrés García Vicente Hernández.
Chapter 3.1:Operating Systems Concepts 1. A Computer Model An operating system has to deal with the fact that a computer is made up of a CPU, random access.
Hands-On Microsoft Windows Server 2008
Computer System Architectures Computer System Software
9/14/2015B.Ramamurthy1 Operating Systems : Overview Bina Ramamurthy CSE421/521.
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
B.Ramamurthy9/19/20151 Operating Systems u Bina Ramamurthy CS421.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
1 Hardware Security Mechanisms Krste Asanovic U.C. Berkeley August 20, 2009.
Multicore Resource Management 謝政宏. 2 Outline Background Virtual Private Machines  Spatial Component  Temporal Component  Minimum and Maximum.
Tessellation: Refactoring the OS around Explicit Resource Containers with Continuous Adaptation John Kubiatowicz UC Berkeley June 5 th, 2013 Juan Colmenares,
Tessellation: Space-Time Partitioning in a Manycore Client OS Rose Liu 1,2, Kevin Klues 1, Sarah Bird 1, Steven Hofmeyr 3, Krste Asanovic 1, John Kubiatowicz.
OPERATING SYSTEMS Goals of the course Definitions of operating systems Operating system goals What is not an operating system Computer architecture O/S.
Copyright © George Coulouris, Jean Dollimore, Tim Kindberg This material is made available for private study and for direct.
CS Advanced Operating Systems Structures and Implementation Lecture 12 Dominant Resource Fairness Two-Level Scheduling March 11 th, 2013 Prof. John.
CE Operating Systems Lecture 3 Overview of OS functions and structure.
Slide 3-1 Copyright © 2004 Pearson Education, Inc. Operating Systems: A Modern Perspective, Chapter 3.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Copyright © cs-tutorial.com. Overview Introduction Architecture Implementation Evaluation.
(a) What is the output generated by this program? In fact the output is not uniquely defined, i.e., it is not always the same. So please give three examples.
Virtual Workspaces Kate Keahey Argonne National Laboratory.
Processes Introduction to Operating Systems: Module 3.
App. TypeApp. Name Distributed or Parallel A parallel version of the Gaussian elimination application SAGE (SAIC's Adaptive Grid Eulerian hydrocode) Adaptive.
Slide 1April 16 th, 2012Swarm Lab Seminar Reinventing Operating Systems for the Manycore Ubiquitous Swarm John Kubiatowicz
Real-Time Musical Applications On An Experimental OS For Multi-Core Processors 林鼎原 Department of Electrical Engineering National Cheng Kung University.
Energy-Aware Resource Adaptation in Tessellation OS 3. Space-time Partitioning and Two-level Scheduling David Chou, Gage Eads Par Lab, CS Division, UC.
6.894: Distributed Operating System Engineering Lecturers: Frans Kaashoek Robert Morris
Lecture 4 Page 1 CS 111 Online Modularity and Virtualization CS 111 On-Line MS Program Operating Systems Peter Reiher.
CS4315A. Berrached:CMS:UHD1 Introduction to Operating Systems Chapter 1.
Chapter 2 Introduction to OS Chien-Chung Shen CIS/UD
Tessellation: Space-Time Partitioning in a Manycore Client OS 林鼎原 Department of Electrical Engineering National Cheng Kung University Tainan, Taiwan, R.O.C.
Gaia An Infrastructure for Active Spaces Prof. Klara Nahrstedt Prof. David Kriegman Prof. Dennis Mickunas
Copyright © Curt Hill More on Operating Systems Continuation of Introduction.
Next Generation of Apache Hadoop MapReduce Owen
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Introduction to Operating Systems
Operating Systems : Overview
Introduction to Data Management in EGI
Operating System Structure
Introduction to Operating Systems
Operating Systems Bina Ramamurthy CSE421 11/27/2018 B.Ramamurthy.
Mobile Agents.
Operating Systems : Overview
Operating Systems : Overview
Operating Systems : Overview
Operating Systems: A Modern Perspective, Chapter 3
Content Distribution Network
Harrison Howell CSCE 824 Dr. Farkas
Presentation transcript:

OS 2020: Slide 1December 6 th, 2011Swarm Lab Opening Tessellation OS: an OS for the Swarm John Kubiatowicz

OS 2020: Slide 2December 6 th, 2011Swarm Lab Opening Requirements for Swarm OS? What system structure required to support Swarm? – Integrate sensors, portable devices, cloud components – Guarantee responsiveness, real-time behavior, throughput – Services with guaranteed behavior, self-adapting to adjust for failure and performance predictability – Uniformly secure, durable, available data Swarm-OS Distributed Sense-Control-Actuate Platforms Innovative Devices and Materials Middleware and services Swarm App Store Integration

OS 2020: Slide 3December 6 th, 2011Swarm Lab Opening Today’s Software Reality Resources not well managed: QoS hard to achieve – 20 th -century notions of utilization and resource virtualization – Despite a cornucopia of resources – we still cannot get the ones we need when we need them! Services not easily interconnected – Every service has a unique API – Highly-specialized “stovepipes” often do not provide exactly what users are looking for  they end up integrating “by hand” – Tradeoffs between client and cloud not easy to achieve Too many things explicitly depend on location: – Where: is my data stored? (oops – it was there!) – Where: can I execute this piece of functionality? – Where: can I display this information? – Where: did I start this job (because I have to finish it there) And others don’t properly depend on location: – Here I am: do something about it!

OS 2020: Slide 4December 6 th, 2011Swarm Lab Opening Changing the Structure of Operating Systems (and the Application that run on them)

OS 2020: Slide 5December 6 th, 2011Swarm Lab Opening Guaranteed Resources What might we want to guarantee? – Examples: Guarantees of BW (say data committed to Cloud Storage) Guarantees of Requests/Unit time (DB service) Guarantees of Latency to Response (Deadline scheduling) Guarantees of maximum time to Durability in cloud Guarantees of total energy/battery power available to Cell What level of guarantee? – Firm Guarantee (Better than existing systems) With high confidence (specified), Maximum deviation, etc. What does it mean to have guaranteed resources? – A Service Level Agreement (SLA)? – Something else? “Impedance-mismatch” problem – The SLA guarantees properties that programmer/user wants – The resources required to satisfy SLA are not things that programmer/user really understands

OS 2020: Slide 6December 6 th, 2011Swarm Lab Opening Space-Time Partitioning Spatial Partition: Performance isolation – Each partition receives a vector of basic resources A number HW threads A portion of physical memory A portion of shared cache A fraction of memory bandwidth Partitioning varies over time – Fine-grained multiplexing and guarantee of resources Resources are gang-scheduled Controlled multiplexing, not uncontrolled virtualization Partitioning adapted to the system’s needs Time Space Space

OS 2020: Slide 7December 6 th, 2011Swarm Lab Opening New OS Primitive: the Cell Cell Properties: – A user-level software component, with guaranteed resources – Explicit security context which allows access to protected data – Knowledge of how to adapt itself to new environments (SEJITS) – Checkpoint/restart to provide fault tolerance, mobility and adaptation Execution Environment: – Explicitly parallel computation – Resource Guarantees – Trusted computing base – Secure channels (intra/interchip) with ability to suspend and restart during migration User-level Mem Mgmt Address Space A Address Space B Task User-level Scheduling Adaptation Checkpoint / Restart

OS 2020: Slide 8December 6 th, 2011Swarm Lab Opening Applications Composed of Interconnected Cells Component-based model of computation – Applications consist of interacting components – Components may be local or remote Communication defines Security Model – Channels are points at which data may be compromised – Channels define points for QoS constraints – Question: Can we provide proofs of correctness on inter-cell protocols? Naming process for initiating endpoints – Need to find consistent version of library code (within cell) – Need to find compatible remote services – Solution of version constraint problem for running application SecureChannel Device Drivers File Service SecureChannel SecureChannel SecureChannel SecureChannel Real-Time Cells (Audio, Video) Core Application Parallel Library

OS 2020: Slide 9December 6 th, 2011Swarm Lab Opening Two Level Scheduling in Tessellation OS Split monolithic scheduling into two pieces: – Course-Grained Resource Allocation and Distribution to Cells Chunks of resources (CPUs, Memory Bandwidth, QoS to Services) Ultimately a hierarchical process negotiated with service providers – Fine-Grained (User-Level) Application-Specific Scheduling Applications allowed to utilize their resources in any way they see fit Performance Isolation: Other components of the system cannot interfere with Cells use of resources Monolithic CPU and Resource Scheduling Application Specific Scheduling Resource Allocation And Distribution Two-Level Scheduling

OS 2020: Slide 10December 6 th, 2011Swarm Lab Opening Tessellation Kernel (Trusted) Architecture of Tessellation OS Policy Service STRG Validator Resource Planner Partition Mapping and Multiplexing Layer Partition Multiplexing Partition Mechanism Layer QoS Enforcement Partition Implementation Channel Authenticator Partitionable (and Trusted) Hardware Cores Physical Memory Network Bandwidth Cache/ Local Store NICs Major Change Request ACK/NACK Cell Creation and Resizing Requests From Users Admission Control Minor Changes ACK/NACK Cell All system resources Cell group with fraction of resources Cell Space-Time Resource Graph (STRG) (Current Resources) Global Policies / User Policies and Preferences Resource Allocation And Adaptation Mechanism Offline Models and Behavioral Parameters Online Performance Monitoring, Model Building, and Prediction Performance Counters Partition #1Partition #2 Partition #3 Cell #1 Cell #2 Cell #3 User/System Performance Reports TPM/ Crypto

OS 2020: Slide 11December 6 th, 2011Swarm Lab Opening Modeling and Adaptation Modeling of Applications – Static Profiling: may be useful with Cell guarantees – Multi-variable model building: Get performance as function of resources Adaptation according to User and System Policies – Convex optimization Relative importance of different Cells expressed via scaling functions – Walk through Configuration space Meet minimum QoS properties first, enhancement with excess resources

OS 2020: Slide 12December 6 th, 2011Swarm Lab Opening Tessellation on Multicore Disk I/O Drivers Other Devices Network QoS Monitor And Adapt Persistent Storage & Parallel File System Large Compute-Bound Application Large I/O-Bound Application Disk I/O Drivers Other Devices Network QoS Monitor And Adapt Persistent Storage & Parallel File System Large Compute-Bound Application Large I/O-Bound Application Disk I/O Drivers Other Devices Network QoS Monitor And Adapt Persistent Storage & Parallel File System Large Compute-Bound Application Large I/O-Bound Application Disk I/O Drivers Other Devices Network QoS Monitor And Adapt Persistent Storage & Parallel File System Compute-Bound Application I/O-Bound Application QoS Guarantees Cloud Storage BW QoS QoS Guarantees QoS Guarantees QoS Guarantees

OS 2020: Slide 13December 6 th, 2011Swarm Lab Opening On Toward the Swarm

OS 2020: Slide 14December 6 th, 2011Swarm Lab Opening Swarm Data Information as a First Class Citizen: – Current Viewpoint: Data is byproduct of computation – Much Better: Data independent of computation, outlasts computation, transformed by computation – Computation should be the ephemeral thing! Fallacy: Data Resides in a Particular Location – A breach of the system results in loss of privacy – Incorrect security configuration results in loss of integrity – A crash results in loss of updates or new information – Transient routing failure results in inaccessbility  Integrated, Secure, Deep Archival Storage – Data available from anywhere, anytime – Data encrypted all the time (except in authorized cells) – Data durable by default (coding, widespread replication)

OS 2020: Slide 15December 6 th, 2011Swarm Lab Opening Signature, Policy Version, GUID Start with Truly Portable Secure Data Data divided into globally-addressable capsules – Addressable by unique GUID and/or metadata search – Conceptually stored in THE Storage Cloud (cyberspace?)  If you can name it, you can use it! Integrated protection – Data is signed, has attached policy, Optionally encrypted – Unwrappable only in correct trusted environment – Legacy support: data without secure metadata Signature, Policy Version, GUID DecryptEncrypt Distributed Public Key Infrastructure Challenge/ Response

OS 2020: Slide 16December 6 th, 2011Swarm Lab Opening Level of indirection in network – “Decentralized Object Location and Routing” (DOLR) – All data and services explicitly named by secure hash (Sha256?) Deep Archival Storage in Cloud – Integrated use of coding for maximum durability GUID1 DOLR GUID1 GUID2 Location Independent Data and Services Service

OS 2020: Slide 17December 6 th, 2011Swarm Lab Opening Service B (Cloud) Cloud Storage Mobile OS The Terrestrial Petri Dish Service A (Cloud) Locality-Aware Routing Network

OS 2020: Slide 18December 6 th, 2011Swarm Lab Opening Hierarchical Resource Management – Multi-level resource reservation and adaptation – Resource guarantees and QoS at all levels Synthesis of Tasks from Preexisting Services – Transparent and adaptive choice of service providers Permanent Secure Archival Storage Applications in 2020

OS 2020: Slide 19December 6 th, 2011Swarm Lab Opening Conclusion Essential ideas: – Resource guarantees negotiated hierarchically – Continual adaptation and optimization – Deep Archival Storage available from anywhere, anytime – Mobility of secure data, computation (is there a difference?) Important components of future OS environment – Cells as Basic Unit of Resource and Security User-Level Software Component with Guaranteed Resources Secure Channels to other Cells – Observation, Monitoring, and Adaptation layers Machine learning, Convex Optimization – Portable Secure Data infrastructure If you can name it, you can use it Tessellation OS: