Multi-programming in THE Landon Cox February 3, 2016

THE (First SOSP ’67) SIGOPS Hall of Fame summary “The first paper to suggest that an operating system be built in a structured way. That structure was a series of layers, each a virtual machine that introduced abstractions built using the functionality of lower layers. The paper stimulated a great deal of subsequent work in building operating systems as structured systems.”

Across systems categories Many common, important problems Fault tolerance Coordination of concurrent activities Geographically separated but linked data Large-scale data sets Protection from mistakes and attacks Interactions with many people All of these problems lead to complexity

Complexity How do we control complexity? Build abstractions that hide unimportant details Establish conventional interfaces Enable composition of simple, well-defined components None of these ideas are specific to computer systems These are just principles of good engineering Civil engineering, urban planning, mechanical engineering, aviation and space flight, electrical engineering, ecology and political science

Dealing with complexity How do you impose order on complex programs? Break functionality into modules Goal is to “decouple” unrelated functions Narrow the set of interactions between modules Hope to make whole system easier to reason about How do we specify interactions between code modules? Procedure calls int foo(char *buf) Why do procedure calls reduce complexity? They make interface between modules explicit They hide implementation details

Dealing with complexity P(a){…} C(){… y=P(x); …}
Via what data structure do C and P communicate? They communicate via a shared, in-memory stack.
What is on the stack? C's saved registers, the arguments, the return address (RA), and P's variables.
C: push regs, push args, push PC+1, jmp P, … pop args, pop regs
P: push vars, …, pop vars, mov result -> R0, pop PC
The call contract:
P returns
P returns to where C said
P restores stack pointer
P doesn't modify stack (or C's other memory)
P doesn't wedge machine (e.g., use up all heap memory)
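
To make the slide's calling sequence concrete, here is a minimal C sketch (P and C follow the slide; the comments map typical compiler-generated steps onto the source, as an illustration rather than any particular ABI):

#include <stdio.h>

/* P: the callee. A typical compiler's prologue pushes P's locals;
   the epilogue pops them, moves the result into a register (the
   slide's R0), and pops the return address into the PC. */
int P(int a) {
    int square = a * a;   /* P's vars live in P's stack frame */
    return square;        /* mov result -> R0, pop PC */
}

/* C: the caller. Around the call the compiler saves live registers,
   pushes the argument and the return address (PC+1), jumps to P,
   then pops everything -- trusting P to have restored the stack. */
int C(void) {
    int x = 7;
    int y = P(x);         /* push regs, push args, push PC+1, jmp P */
    return y;             /* correct only if P honored the contract */
}

int main(void) {
    printf("%d\n", C());
    return 0;
}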

Dealing with complexity P(a){…} C(){… y=P(x); …} Is the call contract enforced? At a low level none of it is enforced P can violate all terms of the contract Violations due to bugs and attacks What about P’s functionality (what it does)? Enforced? No. E.g., no guarantee that sqrt() returns the right answer Would like a specification (spec) that P is correct Can we enforce either the spec or contract? We cannot enforce a procedure’s spec (i.e., that it is correct) Can only apply testing to convince ourselves that code adheres to spec We can enforce the call contract (= enforced modularity)

Dealing with complexity P(a){…} C(){… y=P(x); …} Why can we enforce the call contract? Purely mechanical interaction Programmer’s intention is clear No semantic gap to cross Ways that call contracts are enforced? Put P and C in different processes (RPC) Trusted compiler/runtime transitions between C, P (Java) Hardware regulates transitions (kernel, user process)

Dealing with complexity P(a){…} C(){… y=P(x); …} Functions are programming language constructs Can you think of a PL construct that shatters modularity? goto Inspired famous rant by Dijkstra in CACM in 1968 “Go To Statement Considered Harmful” Several months earlier … “Structure of THE multi-programming system” "The unbridled use of the goto statement has as an immediate consequence that it becomes terribly hard to find a meaningful set of coordinates in which to describe the process progress.... The goto statement as it stands is just too primitive, it is too much an invitation to make a mess of one's program.”

THE-multiprogramming system An operating system is a very complex program Has to manage other processes Has to handle interrupts Has to handle I/O devices Has to handle synchronization primitives How THE handles all of this complexity Breaks functionality into stacked layers Higher layers depend on lower layers

THE-multiprogramming system What is a layer? A layer is a set of related procedure calls or functionality Represents a level in a hierarchy How do layers differ from procedure calls w/in a program? Code can only call within layer and into layers below Can build a layer without worrying about what happens above Why is this additional structure appealing? Further limits how modules interact Reduces dependencies between modules Hopefully reduces complexity and bugs Do not have to worry about code in Layer 1 calling into Layer 4

THE-multiprogramming system Batch system Users submit programs OS decides what to run 5 user processes at a time Multi-tasking Manage multiple programs concurrently Need to switch between programs

Multi-programming Want more than 1 process in physical memory Processes can have the same virtual addresses Cannot share physical addresses Addresses must be translated Programs cannot manipulate physical addresses anymore Translation can happen statically (before execution) or dynamically (during execution) Protection becomes harder

Static address translation Want two processes in memory With address independence Without translating on-the-fly Using static translation How to do this?

Static address translation Adjust loads/stores when the program is put in memory This is called a linker-loader Programming: assume memory starts at 0 Linker-loader: adds 0x20000 to offsets for P2 Similar to the linking phase of a compile
[Slide figure: memory layout with the OS kernel at high memory and two user processes stacked below it; each process's instructions, e.g., load $1, $base + offset, are adjusted to the process's load address.]
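
A minimal sketch of the relocation step a linker-loader performs, assuming a simple table of patch offsets (reloc_t, relocate, and the table format are illustrative, not from the slides):

#include <stdint.h>
#include <stddef.h>

/* Hypothetical relocation record: the index of a word in the program
   image that holds an absolute address (e.g., a load/store target). */
typedef struct {
    size_t word_index;
} reloc_t;

/* Patch every recorded address by the load base -- e.g., add 0x20000
   to P2's offsets so code written as if memory starts at 0 still works. */
void relocate(uint32_t *image, const reloc_t *relocs, size_t n,
              uint32_t load_base) {
    for (size_t i = 0; i < n; i++) {
        image[relocs[i].word_index] += load_base;
    }
}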

Layer 0: Scheduler
Layer 1: Pager
Layer 2: Console
Layer 3: I/O devices
Layer 4: User programs

Layer 0 Primary responsibilities of Layer 0 Ensure efficient, fair allocation of processor (government) Hide number of processors (abstraction layer) How did the scheduler ensure fair allocation? Handled all transitions between processes Some were voluntary transitions (synchronization calls) Some were involuntary transitions (timer interrupts) Could Layer 0 code be swapped out? No, timer-interrupt handler always has to be in memory

Layer 1 Primary responsibilities of Layer 1 Ensure efficient, fair allocation of memory (government) Hide drum addresses (abstraction layer) What were a segment, core page, and drum page? Segment = coarse-grained unit of data, visible to program Core page = in-memory container for a segment Drum page = on-disk container for a segment

Layer 1 How were processes isolated? No hardware support for virtual memory Relied on language and compiler Programs written in Algol used private segment identifiers Like the static address translation discussed last time Could Layer 1 code be swapped out? No, memory manager always has to be in memory Otherwise have a serious bootstrapping problem

Layer 1 How did paging work without hardware support? Similar to the way a compiler handles procedure calls The compiler needed to know which segments would be accessed The compiler could insert calls into layer 1 to page in segments Was any of this enforced? No, just like the procedure-call contract is not enforced At the lowest level, code could easily corrupt other processes Bugs and attacks could crash the whole system
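
A sketch of what such a compiler-inserted call could look like, assuming a hypothetical layer-1 entry point ensure_resident() (the name and signature are invented for illustration):

/* Hypothetical layer-1 interface: block until the segment occupies a
   core page, paging it in from the drum if necessary. */
extern void ensure_resident(int segment_id);

/* What the compiler conceptually emits around a segment access: a call
   into layer 1 first, then the access the programmer wrote. Nothing
   stops hand-written code from skipping the call -- it is unenforced. */
int read_word(int segment_id, int *segment_base, int offset) {
    ensure_resident(segment_id);   /* compiler-inserted page-in call */
    return segment_base[offset];   /* the actual load */
}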

Coordination of processes “A society of harmonious sequential processes” Each layer implemented as a process To move between layers, need way to transition safely What primitives provided safe switching? Semaphores: Up (V) and Down (P) Down (may) block the calling process Up (may) allow another process to be scheduled

Semaphores First defined by Dijkstra for THE Two operations: up and down

// aka “P” (“proberen”)
down () {
  do {
    // begin atomic
    if (value > 0) {
      value--
      break
    }
    // end atomic
  } while (1)
}

// aka “V” (“verhogen”)
up () {
  // begin atomic
  value++
  // end atomic
}

What is going on here? Can value ever be < 0?
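
A sketch of how the atomic sections could be realized today with a pthreads mutex and condition variable (blocking instead of the slide's busy-wait); this is one possible modern implementation, not how THE itself did it:

#include <pthread.h>

typedef struct {
    int value;
    pthread_mutex_t m;
    pthread_cond_t  c;
} semaphore;

void semaphore_init(semaphore *s, int initial) {
    s->value = initial;                   /* initial value sets behavior */
    pthread_mutex_init(&s->m, NULL);
    pthread_cond_init(&s->c, NULL);
}

void semaphore_down(semaphore *s) {       /* "P" */
    pthread_mutex_lock(&s->m);            /* begin atomic */
    while (s->value == 0)
        pthread_cond_wait(&s->c, &s->m);  /* sleep rather than spin */
    s->value--;                           /* so value never drops below 0 */
    pthread_mutex_unlock(&s->m);          /* end atomic */
}

void semaphore_up(semaphore *s) {         /* "V" */
    pthread_mutex_lock(&s->m);            /* begin atomic */
    s->value++;
    pthread_cond_signal(&s->c);           /* wake one blocked down() */
    pthread_mutex_unlock(&s->m);          /* end atomic */
}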

More semaphores Key state of a semaphore is its value Initial value determines semaphore’s behavior Value cannot be accessed outside semaphore (i.e., there is no semaphore.getValue() call) Semaphores can be both synchronization types Mutual exclusion (like locks) Ordering constraints (like monitors)

Semaphore mutual exclusion Ensure that 1 (or < N) thread is in critical section How do we make a semaphore act like a lock? Set initial value to 1 (or N) Like lock/unlock, but more general (could allow 2 threads in critical section if initial value = 2) Lock is equivalent to a binary semaphore

s.down ();
// critical section
s.up ();
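
The same pattern with POSIX semaphores (a sketch; with_lock is an illustrative name):

#include <semaphore.h>

sem_t s;                    /* initialize once: sem_init(&s, 0, 1) */

void with_lock(void) {
    sem_wait(&s);           /* down: at most one thread gets past */
    /* critical section */
    sem_post(&s);           /* up: admit the next waiter */
}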

Semaphore ordering constraints Thread A waits for B before proceeding How to make a semaphore behave like a monitor? Set initial value of semaphore to 0 A is guaranteed to wait for B to finish Doesn't matter if A or B runs first Like a CV in which the condition is “sem.value == 0” Can think of it as a “prefab” condition variable

// Thread A
s.down ();
// continue

// Thread B
// do task
s.up ();
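
And the ordering pattern in the same POSIX style (a sketch): because the semaphore starts at 0, A blocks until B posts, no matter which thread runs first.

#include <semaphore.h>

sem_t done;                 /* initialize once: sem_init(&done, 0, 0) */

void thread_B(void) {
    /* do task */
    sem_post(&done);        /* up: record that B's task finished */
}

void thread_A(void) {
    sem_wait(&done);        /* down: blocks until B's up() */
    /* continue: guaranteed to run after B's task */
}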

Prod.-cons. with semaphores Same before-after constraints If buffer empty, consumer waits for producer If buffer full, producer waits for consumer Semaphore assignments mutex (binary semaphore) fullBuffers (counts number of full slots) emptyBuffers (counts number of empty slots)

Prod.-cons. with semaphores Initial semaphore values? Mutual exclusion sem mutex (?) Machine is initially empty sem fullBuffers (?) sem emptyBuffers (?)

Prod.-cons. with semaphores Initial semaphore values? Mutual exclusion sem mutex (1) Machine is initially empty sem fullBuffers (0) sem emptyBuffers (MaxSodas)

Prod.-cons. with semaphores
Semaphore mutex(1), fullBuffers(0), emptyBuffers(MaxSodas)

producer () {
  down (emptyBuffers)
  down (mutex)
  put soda in
  up (mutex)
  up (fullBuffers)
}

consumer () {
  down (fullBuffers)
  down (mutex)
  take soda out
  up (mutex)
  up (emptyBuffers)
}

Use one semaphore for fullBuffers and emptyBuffers? Remember semaphores are like prefab CVs.
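
A runnable C translation of this slide using POSIX threads and semaphores (a sketch: MAX_SODAS, the iteration counts, and the integer "machine" are illustrative):

#include <pthread.h>
#include <semaphore.h>
#include <stdio.h>

#define MAX_SODAS 5

static sem_t mutex, fullBuffers, emptyBuffers;
static int sodas = 0;                /* stand-in for the soda machine */

static void *producer(void *arg) {
    (void)arg;
    for (int i = 0; i < 10; i++) {
        sem_wait(&emptyBuffers);     /* wait for a free slot */
        sem_wait(&mutex);            /* then grab the machine */
        sodas++;                     /* put soda in */
        sem_post(&mutex);
        sem_post(&fullBuffers);      /* announce a full slot */
    }
    return NULL;
}

static void *consumer(void *arg) {
    (void)arg;
    for (int i = 0; i < 10; i++) {
        sem_wait(&fullBuffers);      /* wait for a soda */
        sem_wait(&mutex);
        sodas--;                     /* take soda out */
        sem_post(&mutex);
        sem_post(&emptyBuffers);     /* announce a free slot */
    }
    return NULL;
}

int main(void) {
    pthread_t p, c;
    sem_init(&mutex, 0, 1);               /* mutual exclusion */
    sem_init(&fullBuffers, 0, 0);         /* machine starts empty */
    sem_init(&emptyBuffers, 0, MAX_SODAS);
    pthread_create(&p, NULL, producer, NULL);
    pthread_create(&c, NULL, consumer, NULL);
    pthread_join(p, NULL);
    pthread_join(c, NULL);
    printf("sodas left: %d\n", sodas);    /* 0 after 10 of each */
    return 0;
}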

Prod.-cons. with semaphores Does the order of the down calls matter? Yes, it can cause deadlock: if the consumer takes mutex and then blocks on down(fullBuffers), the producer can never take mutex to fill a buffer.
Semaphore mutex(1), fullBuffers(0), emptyBuffers(MaxSodas)

producer () {
  down (mutex)
  down (emptyBuffers)
  put soda in
  up (fullBuffers)
  up (mutex)
}

consumer () {
  down (mutex)
  down (fullBuffers)
  take soda out
  up (emptyBuffers)
  up (mutex)
}

Prod.-cons. with semaphores Does the order of the up calls matter? Not for correctness (possible efficiency issues).
Semaphore mutex(1), fullBuffers(0), emptyBuffers(MaxSodas)

producer () {
  down (emptyBuffers)
  down (mutex)
  put soda in
  up (fullBuffers)
  up (mutex)
}

consumer () {
  down (fullBuffers)
  down (mutex)
  take soda out
  up (emptyBuffers)
  up (mutex)
}

Prod.-cons. with semaphores What about multiple consumers and/or producers? Doesn't matter; the solution stands.
Semaphore mutex(1), fullBuffers(0), emptyBuffers(MaxSodas)

producer () {
  down (emptyBuffers)
  down (mutex)
  put soda in
  up (mutex)
  up (fullBuffers)
}

consumer () {
  down (fullBuffers)
  down (mutex)
  take soda out
  up (mutex)
  up (emptyBuffers)
}

Prod.-cons. with semaphores What if there is 1 full buffer and multiple consumers call down? Only one will see the semaphore at 1; the rest see it at 0.
Semaphore mutex(1), fullBuffers(1), emptyBuffers(MaxSodas-1)

producer () {
  down (emptyBuffers)
  down (mutex)
  put soda in
  up (mutex)
  up (fullBuffers)
}

consumer () {
  down (fullBuffers)
  down (mutex)
  take soda out
  up (mutex)
  up (emptyBuffers)
}

Monitors vs. semaphores Monitors Separate mutual exclusion and ordering Semaphores Provide both with same mechanism Semaphores are more “elegant” Single mechanism Can be harder to program

Monitors vs. semaphores Where are the conditions in both? Which is more flexible? Why do monitors need a lock, but not semaphores?

// Monitors
lock (mutex)
while (condition) {
  wait (CV, mutex)
}
unlock (mutex)

// Semaphores
down (semaphore)

Monitors vs. semaphores When are semaphores appropriate? When a shared integer maps naturally to the problem at hand (i.e., when the condition involves a count of one thing)

// Monitors
lock (mutex)
while (condition) {
  wait (CV, mutex)
}
unlock (mutex)

// Semaphores
down (semaphore)
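
When the condition is not a count, the monitor pattern above is the natural fit. A pthreads sketch with an arbitrary predicate (ready is an illustrative stand-in for any non-counting condition):

#include <pthread.h>
#include <stdbool.h>

static pthread_mutex_t mutex = PTHREAD_MUTEX_INITIALIZER;
static pthread_cond_t  cv    = PTHREAD_COND_INITIALIZER;
static bool ready = false;    /* any predicate over shared state */

void wait_until_ready(void) {
    pthread_mutex_lock(&mutex);
    while (!ready)                       /* re-check after every wakeup */
        pthread_cond_wait(&cv, &mutex);  /* atomically unlock and sleep */
    pthread_mutex_unlock(&mutex);
}

void set_ready(void) {
    pthread_mutex_lock(&mutex);
    ready = true;
    pthread_cond_broadcast(&cv);         /* wake all waiters */
    pthread_mutex_unlock(&mutex);
}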

Classic synchronization problem (the salon) Customer room capacity = 9 (only 9 customers in the room at a time) Sofa capacity = 4; the rest use standing room Each barber/stylist is a thread Each customer is a thread

Break up into groups of 2 or 3 Try to come up with a solution We'll talk about it in 10 minutes…

Customer () {
}

Stylist(int id) {
}

Salon with semaphores
Semaphore room = 9, sofa = 4, chair = 3, customer = 0, cut = 0;

Customer () {
  room.down ()      // enter room
  sofa.down ()      // sit on sofa
  chair.down ()     // sit on chair
  sofa.up ()
  customer.up ()
  cut.down ()
  chair.up ()       // leave chair
  room.up ()        // leave room
}

Stylist () {
  while (1) {
    customer.down ()
    // cut hair
    cut.up ()
  }
}

Is anything weird here?

Salon with monitors
State: lock; roomCV; numInRoom = 0; sofaCV; numOnSofa = 0; chairCV; freeChairs[3]; customerCVs[3]; cutCVs[3];

Customer () {
  lock.acquire ();
  // Enter room
  while (numInRoom == 9)
    roomCV.wait (lock);
  numInRoom++;
  // Sit on sofa
  while (numOnSofa == 4)
    sofaCV.wait (lock);
  numOnSofa++;
  // Sit in chair
  while (findFreeChair () == NONE)
    chairCV.wait (lock);
  stylist = findFreeChair ();
  freeChairs[stylist] = 1;
  numOnSofa--;
  sofaCV.signal (lock);
  // Wake up stylist
  customerCVs[stylist].signal (lock);
  // Wait for cut to finish
  cutCVs[stylist].wait (lock);
  // Leave the shop
  freeChairs[stylist] = 0;
  chairCV.signal (lock);
  numInRoom--;
  roomCV.signal (lock);
  lock.release ();
}

Stylist(int id) {
  lock.acquire ();
  while (1) {
    while (!freeChairs[id])
      customerCVs[id].wait (lock);
    cutHair ();
    cutCVs[id].signal (lock);
  }
  lock.release ();   // never reached; the stylist loops forever
}

Customer () {
  // identical to the Customer () on the previous slide
}

Next time More history Multics “Almost every important idea in OS can be found somewhere in Multics.” So complicated and hard to use, it inspired UNIX Questions?