Lecture 8 Page 1 CS 111 Winter 2014 Synchronization, Critical Sections and Concurrency CS 111 Operating Systems Peter Reiher.

Slides:



Advertisements
Similar presentations
Ch. 7 Process Synchronization (1/2) I Background F Producer - Consumer process :  Compiler, Assembler, Loader, · · · · · · F Bounded buffer.
Advertisements

Chapter 6: Process Synchronization
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 6: Process Synchronization.
EEE 435 Principles of Operating Systems Interprocess Communication Pt II (Modern Operating Systems 2.3)
Mutual Exclusion.
CH7 discussion-review Mahmoud Alhabbash. Q1 What is a Race Condition? How could we prevent that? – Race condition is the situation where several processes.
Parallel Processing (CS526) Spring 2012(Week 6).  A parallel algorithm is a group of partitioned tasks that work with each other to solve a large problem.
Review: Chapters 1 – Chapter 1: OS is a layer between user and hardware to make life easier for user and use hardware efficiently Control program.
CS444/CS544 Operating Systems Synchronization 2/16/2006 Prof. Searleman
I/O Hardware n Incredible variety of I/O devices n Common concepts: – Port – connection point to the computer – Bus (daisy chain or shared direct access)
Synchronization Principles. Race Conditions Race Conditions: An Example spooler directory out in 4 7 somefile.txt list.c scores.txt Process.
Concurrency: Mutual Exclusion, Synchronization, Deadlock, and Starvation in Representative Operating Systems.
1 Concurrency: Deadlock and Starvation Chapter 6.
Race Conditions CS550 Operating Systems. Review So far, we have discussed Processes and Threads and talked about multithreading and MPI processes by example.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts, Amherst Operating Systems CMPSCI 377 Lecture.
Instructor: Umar KalimNUST Institute of Information Technology Operating Systems Process Synchronization.
Synchronization CSCI 444/544 Operating Systems Fall 2008.
Operating Systems CSE 411 CPU Management Oct Lecture 13 Instructor: Bhuvan Urgaonkar.
Lecture 5 Page 1 CS 111 Summer 2014 Process Communications, Synchronization, and Concurrency CS 111 Operating Systems Peter Reiher.
Object Oriented Analysis & Design SDL Threads. Contents 2  Processes  Thread Concepts  Creating threads  Critical sections  Synchronizing threads.
Lecture 8 Page 1 CS 111 Online Implementing Locks Create a synchronization object – Associated it with a critical section – Of a size that an atomic instruction.
CS 153 Design of Operating Systems Spring 2015 Lecture 11: Scheduling & Deadlock.
Concurrency, Mutual Exclusion and Synchronization.
Chapter 101 Multiprocessor and Real- Time Scheduling Chapter 10.
Games Development 2 Concurrent Programming CO3301 Week 9.
1 Announcements The fixing the bug part of Lab 4’s assignment 2 is now considered extra credit. Comments for the code should be on the parts you wrote.
COMP 111 Threads and concurrency Sept 28, Tufts University Computer Science2 Who is this guy? I am not Prof. Couch Obvious? Sam Guyer New assistant.
Operating Systems ECE344 Ashvin Goel ECE University of Toronto Mutual Exclusion.
CSE 451: Operating Systems Section 5 Midterm review.
Kernel Locking Techniques by Robert Love presented by Scott Price.
Lecture 8 Page 1 CS 111 Online Other Important Synchronization Primitives Semaphores Mutexes Monitors.
Deadlock Detection and Recovery
Chapter 6 – Process Synchronisation (Pgs 225 – 267)
Lecture 8 Page 1 CS 111 Fall 2015 Synchronization, Critical Sections and Concurrency CS 111 Operating Systems Peter Reiher.
Department of Computer Science and Software Engineering
CS399 New Beginnings Jonathan Walpole. 2 Concurrent Programming & Synchronization Primitives.
Operating Systems CSE 411 CPU Management Dec Lecture Instructor: Bhuvan Urgaonkar.
CS533 – Spring Jeanie M. Schwenk Experiences and Processes and Monitors with Mesa What is Mesa? “Mesa is a strongly typed, block structured programming.
U NIVERSITY OF M ASSACHUSETTS A MHERST Department of Computer Science Computer Systems Principles Synchronization Emery Berger and Mark Corner University.
CSE 153 Design of Operating Systems Winter 2015 Midterm Review.
Lecture 7 Page 1 CS 111 Online Process Communications and Concurrency CS 111 On-Line MS Program Operating Systems Peter Reiher.
Lecture 12 Page 1 CS 111 Online Using Devices and Their Drivers Practical use issues Achieving good performance in driver use.
Lecture 6 Page 1 CS 111 Summer 2013 Concurrency Solutions and Deadlock CS 111 Operating Systems Peter Reiher.
CS 153 Design of Operating Systems Winter 2016 Lecture 7: Synchronization.
CSCI1600: Embedded and Real Time Software Lecture 17: Concurrent Programming Steven Reiss, Fall 2015.
1 Critical Section Problem CIS 450 Winter 2003 Professor Jinhua Guo.
Lecture 4 Page 1 CS 111 Summer 2013 Scheduling CS 111 Operating Systems Peter Reiher.
Mutual Exclusion -- Addendum. Mutual Exclusion in Critical Sections.
Lecture 8 Page 1 CS 111 Online Synchronization, Critical Sections and Concurrency CS 111 On-Line MS Program Operating Systems Peter Reiher.
Web Server Architecture Client Main Thread for(j=0;j
Lecture 5 Page 1 CS 111 Summer 2013 Bounded Buffers A higher level abstraction than shared domains or simple messages But not quite as high level as RPC.
Parallelism and Concurrency
CSE 120 Principles of Operating
CS703 – Advanced Operating Systems
Outline Other synchronization primitives
Other Important Synchronization Primitives
Designing Parallel Algorithms (Synchronization)
Outline Parallelism and synchronization
Concurrency: Mutual Exclusion and Process Synchronization
CSE 451: Operating Systems Autumn 2004 Module 6 Synchronization
CSE 451: Operating Systems Autumn 2003 Lecture 7 Synchronization
CSE 451: Operating Systems Autumn 2005 Lecture 7 Synchronization
CSE 451: Operating Systems Winter 2004 Module 6 Synchronization
CSE 451: Operating Systems Winter 2003 Lecture 7 Synchronization
CSE 153 Design of Operating Systems Winter 19
CS333 Intro to Operating Systems
CSE 451 Section 1/27/2000.
CSE 153 Design of Operating Systems Winter 2019
CSE 542: Operating Systems
CSE 542: Operating Systems
Presentation transcript:

Lecture 8 Page 1 CS 111 Winter 2014 Synchronization, Critical Sections and Concurrency CS 111 Operating Systems Peter Reiher

Lecture 8 Page 2 CS 111 Winter 2014 Outline Parallelism and synchronization Critical sections and atomic instructions Using atomic instructions to build higher level locks Asynchronous completion Monitors

Lecture 8 Page 3 CS 111 Winter 2014 Benefits of Parallelism Improved throughput – Blocking of one activity does not stop others Improved modularity – Separating compound activities into simpler pieces Improved robustness – The failure of one thread does not stop others A better fit to emerging paradigms – Client server computing, web based services – Our universe is cooperating parallel processes

Lecture 8 Page 4 CS 111 Winter 2014 The Problem With Parallelism Making use of parallelism implies concurrency – Multiple actions happening at the same time – Or perhaps appearing to do so True parallelism is incomprehensible – Or nearly so – Few designers and programmers can get it right – Without help... Pseudo-parallelism may be good enough – Identify and serialize key points of interaction

Lecture 8 Page 5 CS 111 Winter 2014 Why Are There Problems? Sequential program execution is easy – First instruction one, then instruction two,... – Execution order is obvious and deterministic Independent parallel programs are easy – If the parallel streams do not interact in any way – Who cares what gets done in what order? Cooperating parallel programs are hard – If the two execution streams are not synchronized Results depend on the order of instruction execution Parallelism makes execution order non-deterministic Understanding possible outcomes of the computation becomes combinatorially intractable

Lecture 8 Page 6 CS 111 Winter 2014 Solving the Parallelism Problem There are actually two interdependent problems – Critical section serialization – Notification of asynchronous completion They are often discussed as a single problem – Many mechanisms simultaneously solve both – Solution to either requires solution to the other But they can be understood and solved separately

Lecture 8 Page 7 CS 111 Winter 2014 The Critical Section Problem A critical section is a resource that is shared by multiple threads – By multiple concurrent threads, processes or CPUs – By interrupted code and interrupt handler Use of the resource changes its state – Contents, properties, relation to other resources Correctness depends on execution order – When scheduler runs/preempts which threads – Relative timing of asynchronous/independent events

Lecture 8 Page 8 CS 111 Winter 2014 The Asynchronous Completion Problem Parallel activities move at different speeds One activity may need to wait for another to complete The asynchronous completion problem is how to perform such waits without killing performance – Without wasteful spins/busy-waits Examples of asynchronous completions – Waiting for a held lock to be released – Waiting for an I/O operation to complete – Waiting for a response to a network request – Delaying execution for a fixed period of time

Lecture 8 Page 9 CS 111 Winter 2014 Critical Sections What is a critical section? Functionality whose proper use in parallel programs is critical to correct execution If you do things in different orders, you get different results A possible location for undesirable non- determinism

Lecture 8 Page 10 CS 111 Winter 2014 Critical Sections and Re-entrant Code Consider a simple recursive routine: int factorial(x) { tmp = factorial( x-1 ); return x*tmp} Consider a possibly multi-threaded routine: void debit(amt) {tmp = bal-amt; if (tmp >=0) bal = tmp)} Neither would work if tmp was shared/static – Must be dynamic, each invocation has its own copy – This is not a problem with read-only information What if a variable has to be writeable? – Writable variables should be dynamic or shared And proper sharing often involves critical sections

Lecture 8 Page 11 CS 111 Winter 2014 Basic Approach to Critical Sections Serialize access – Only allow one thread to use it at a time – Using some method like locking Won’t that limit parallelism? – Yes, but... If true interactions are rare, and critical sections well defined, most code still parallel If there are actual frequent interactions, there isn’t any real parallelism possible – Assuming you demand correct results

Lecture 8 Page 12 CS 111 Winter 2014 Recognizing Critical Sections Generally includes updates to object state – May be updates to a single object – May be related updates to multiple objects Generally involves multi-step operations – Object state inconsistent until operation finishes This period may be brief or extended – Preemption leaves object in compromised state Correct operation requires mutual exclusion – Only one thread at a time has access to object(s) – Client 1 completes its operation before client 2 starts

Lecture 8 Page 13 CS 111 Winter 2014 Critical Section Example 1: Updating a File Process 1 Process 2 remove(“database”); fd = create(“database”); write(fd,newdata,length); close(fd); fd = open(“database”,READ); count = read(fd,buffer,length); remove(“database”); fd = create(“database”); fd = open(“database”,READ); count = read(fd,buffer,length); write(fd,newdata,length); close(fd); − This result could not occur with any sequential execution Process 2 reads an empty database

Lecture 8 Page 14 CS 111 Winter 2014 Critical Section Example 2: Re-entrant Signals First signal Second signal load r1,numsigs // = 0 add r1,=1 // = 1 store r1,numsigs // =1 load r1,numsigs // = 0 add r1,=1 // = 1 store r1,numsigs // =1 load r1,numsigs // = 0 numsigs add r1,=1 // = 1 load r1,numsigs // = 0 r1 add r1,=1 // = 1 store r1,numsigs // =1 The signal handlers share numsigs and r1... So numsigs is 1, instead of 2

Lecture 8 Page 15 CS 111 Winter 2014 Critical Section Example 3: Multithreaded Banking Code load r1, balance // = 100 load r2, amount1 // = 50 add r1, r2 // = 150 store r1, balance // = 150 Thread 1 Thread 2 load r1, balance // = 100 load r2, amount2 // = 25 sub r1, r2 // = 75 store r1, balance // = 75 load r1, balance // = 100 load r2, amount1 // = 50 add r1, r2 // = balance r1 r2 50 amount1 25 amount load r1, balance // = load r2, amount2 // = sub r1, r2 // = 75 store r1, balance // = store r1, balance // = CONTEXT SWITCH!!! 150 The $25 debit was lost!!!

Lecture 8 Page 16 CS 111 Winter 2014 Are There Real Critical Sections in Operating Systems? Yes! Shared data for multiple concurrent threads – Process state variables – Resource pools – Device driver state Logical parallelism – Created by preemptive scheduling – Asynchronous interrupts Physical parallelism – Shared memory, symmetric multi-processors

Lecture 8 Page 17 CS 111 Winter 2014 These Kinds of Interleavings Seem Pretty Unlikely To cause problems, things have to happen exactly wrong Indeed, that’s true But when you are executing a billion instructions per second, even very low probability events can happen with frightening frequency Often, one problem blows up everything that follows

Lecture 8 Page 18 CS 111 Winter 2014 Can’t We Solve the Problem By Disabling Interrupts? Much of our difficulty is caused by a poorly timed interrupt – Our code gets part way through, then gets interrupted – Someone else does something that interferes – When we start again, things are messed up Why not temporarily disable interrupts to solve those problems?

Lecture 8 Page 19 CS 111 Winter 2014 Problems With Disabling Interrupts Not an option in user mode – Requires use of privileged instructions Dangerous if improperly used – Could disable preemptive scheduling, disk I/O, etc. Delays system response to important interrupts – Received data isn’t processed ‘til interrupt serviced – Device will sit idle until next operation is initiated Doesn't help with symmetric multi-processors – Other processors can access the same memory Generally harms performance – To deal with rare problems

Lecture 8 Page 20 CS 111 Winter 2014 So How Do We Solve This Problem? Avoid shared data whenever possible – No shared data, no critical section – Not always feasible Eliminate critical sections with atomic instructions – Atomic (uninteruptable) read/modify/write operations – Can be applied to 1-8 contiguous bytes – Simple: increment/decrement, and/or/xor – Complex: test-and-set, exchange, compare-and-swap – What if we need to do more in a critical section? Use atomic instructions to implement locks – Use the lock operations to protect critical sections

Lecture 8 Page 21 CS 111 Winter 2014 Atomic Instructions – Test and Set A C description of a machine language instruction bool TS( char *p) { bool rc; rc = *p;/* note the current value*/ *p = TRUE;/* set the value to be TRUE*/ return rc;/* return the value before we set it*/ } if !TS(flag) { /* We have control of the critical section! */ }

Lecture 8 Page 22 CS 111 Winter 2014 Atomic Instructions – Compare and Swap Again, a C description of machine instruction bool compare_and_swap( int *p, int old, int new ) { if (*p == old) {/* see if value has been changed*/ *p = new;/* if not, set it to new value*/ return( TRUE);/* tell caller he succeeded*/ } else/* value has been changed*/ return( FALSE);/* tell caller he failed*/ } if (compare_and_swap(flag,UNUSED,IN_USE) { /* I got the critical section! */ } else { /* I didn’t get it. */ }

Lecture 8 Page 23 CS 111 Winter 2014 Solving Problem #3 With Compare and Swap Again, a C implementation int current_balance; writecheck( int amount ) { int oldbal, newbal; do { oldbal = current_balance; newbal = oldbal - amount; if (newbal < 0) return (ERROR); } while (!compare_and_swap( &current_balance, oldbal, newbal))... }

Lecture 8 Page 24 CS 111 Winter 2014 Why Does This Work? Remember, compare_and_swap() is atomic First time through, if no concurrency, – oldbal == current_balance – current_balance was changed to newbal by compare_and_swap() If not, – current_balance changed after you read it – So compare_and_swap() didn’t change current_balance and returned FALSE – Loop, read the new value, and try again

Lecture 8 Page 25 CS 111 Winter 2014 Will This Really Solve the Problem? If the compare & swap fails, we loop back and try again – If there is a conflicting thread isn’t it likely to simply fail again? Only if preempted during a four instruction window – By someone executing the same critical section Extremely low probability event – We will very seldom go through the loop even twice

Lecture 8 Page 26 CS 111 Winter 2014 Limitation of Atomic Instructions They only update a small number of contiguous bytes – Cannot be used to atomically change multiple locations E.g., insertions in a doubly-linked list They operate on a single memory bus – Cannot be used to update records on disk – Cannot be used across a network They are not higher level locking operations – They cannot “wait” until a resource becomes available – You have to program that up yourself Giving you extra opportunities to screw up

Lecture 8 Page 27 CS 111 Winter 2014 Implementing Locks Create a synchronization object – Associated it with a critical section – Of a size that an atomic instruction can manage Lock the object to seize the critical section – If critical section is free, lock operation succeeds – If critical section is already in use, lock operation fails It may fail immediately It may block until the critical section is free again Unlock the object to release critical section – Subsequent lock attempts can now succeed – May unblock a sleeping waiter

Lecture 8 Page 28 CS 111 Winter 2014 Using Atomic Instructions to Implement a Lock Assuming C implementation of test and set bool getlock( lock *lockp) { if (TS(lockp) == 0 ) return( TRUE); else return( FALSE); } void freelock( lock *lockp ) { *lockp = 0; }

Lecture 8 Page 29 CS 111 Winter 2014 Associating the Lock With a Critical Section Assuming same lock as in last example while (!getlock(crit_section_lock)) { yield(); /*or spin on lock */ } critical_section(); /*Access critical section */ freelock(crit_section_lock); Remember, while you’re in the critical section, no one else will be able to get the lock − Better not stay there too long − And definitely don’t go into infinite loop

Lecture 8 Page 30 CS 111 Winter 2014 Where Do We Put the Locking? Objects A and B share a critical section A and B are called by C and D, respectively Who locks, A/B or C/D? A/B - Down low in the detailed implementation – Object oriented modularity recommends this – Locking is part of the implementation C/D - Up high in the more semantically meaningful calling code – Locking needs may depend on how object is used – One logical transaction may span many method calls – In such cases, only the caller knows start/end/scope

Lecture 8 Page 31 CS 111 Winter 2014 Advisory vs. Enforced Locking Enforced locking – Happens whether or not the caller wants it – Done within the implementation of object methods Advisory locking – A convention that “good guys” are expected to follow – Users expected to lock object before calling methods Enforced locking is guaranteed to happen – It may sometimes be excessively conservative Advisory locking allows users more flexibility – Including the flexibility to do it wrong (or not at all)

Lecture 8 Page 32 CS 111 Winter 2014 Criteria for Correct Locking How do we know if a locking mechanism is correct? Four desirable criteria: 1.Correct mutual exclusion  Only one thread at a time has access to critical section 2.Progress  If resource is available, and someone wants it, they get it 3.Bounded waiting time  No indefinite waits, guaranteed eventual service 4.And (ideally) fairness  E.g. FIFO

Lecture 8 Page 33 CS 111 Winter 2014 Asynchronous Completion The second big problem with parallelism – How to wait for an event that may take a while – Without wasteful spins/busy-waits Examples of asynchronous completions – Waiting for a held lock to be released – Waiting for an I/O operation to complete – Waiting for a response to a network request – Delaying execution for a fixed period of time

Lecture 8 Page 34 CS 111 Winter 2014 Using Spin Waits to Solve the Asynchronous Completion Problem Thread A needs something from thread B – Like the result of a computation Thread B isn’t done yet Thread A stays in a busy loop waiting Sooner or later thread B completes Thread A exits the loop and makes use of B’s result Definitely provides correct behavior, but...

Lecture 8 Page 35 CS 111 Winter 2014 Well, Why Not? Waiting serves no purpose for the waiting thread – “Waiting” is not a “useful computation” Spin waits reduce system throughput – Spinning consumes CPU cycles – These cycles can’t be used by other threads – It would be better for waiting thread to “yield” They are actually counter-productive – Delays the thread that will post the completion – Memory traffic slows I/O and other processors

Lecture 8 Page 36 CS 111 Winter 2014 Another Solution Completion blocks Create a synchronization object – Associate that object with a resource or request Requester blocks awaiting event on that object – Yield the CPU until awaited event happens Upon completion, the event is “posted” – Thread that notices/causes event posts the object Posting event to object unblocks the waiter – Requester is dispatched, and processes the event

Lecture 8 Page 37 CS 111 Winter 2014 Blocking and Unblocking Exactly as discussed in scheduling lecture Blocking – Remove specified process from the “ready” queue – Yield the CPU (let scheduler run someone else) Unblocking – Return specified process to the “ready” queue – Inform scheduler of wakeup (possible preemption) Only trick is arranging to be unblocked – Because it is so embarrassing to sleep forever

Lecture 8 Page 38 CS 111 Winter 2014 Unblocking and Synchronization Objects Easy if only one thread is blocked on the object If multiple blocked threads, who should we unblock? – Everyone who is blocked? – One waiter, chosen at random? – The next thread in line on a FIFO queue? Depends on the resource – Can multiple threads use it concurrently? – If not, awaking multiple threads is wasteful Depends on policy – Should scheduling priority be used? – Consider possibility of starvation

Lecture 8 Page 39 CS 111 Winter 2014 The Thundering Herd Problem What if a large number of threads are blocked on a single resource? When the thread holding that resource unblocks, you wake them all up They contend for the resource and one gets it The rest get put to sleep And eventually it happens again But waking and sleeping many threads is expensive itself If this happens a lot, poor performance

Lecture 8 Page 40 CS 111 Winter 2014 A Possible Problem The sleep/wakeup race condition void sleep( eventp *e ) { while(e->posted == FALSE) { add_to_queue( &e->queue, myproc ); myproc->runstate |= BLOCKED; yield(); } void wakeup( eventp *e) { struct proce *p; e->posted = TRUE; p = get_from_queue(&e-> queue); if (p) { p->runstate &= ~BLOCKED; resched(); } /* if !p, nobody’s waiting */ } Consider this sleep code: And this wakeup code: What’s the problem with this?

Lecture 8 Page 41 CS 111 Winter 2014 A Sleep/Wakeup Race Let’s say thread B is using a resource and thread A needs to get it So thread A will call sleep() Meanwhile, thread B finishes using the resource – So thread B will call wakeup() No other threads are waiting for the resource

Lecture 8 Page 42 CS 111 Winter 2014 The Race At Work void sleep( eventp *e ) { while(e->posted == FALSE) { void wakeup( eventp *e) { struct proce *p; e->posted = TRUE; p = get_from_queue(&e-> queue); if (p) { } /* if !p, nobody’s waiting */ } Nope, nobody’s in the queue! add_to_queue( &e->queue, myproc ); myproc->runsate |= BLOCKED; yield(); } Yep, somebody’s locked it! Thread A Thread B The effect? Thread A is sleeping But there’s no one to wake him up CONTEXT SWITCH!

Lecture 8 Page 43 CS 111 Winter 2014 Solving the Problem There is clearly a critical section in sleep() – Starting before we test the posted flag – Ending after we put ourselves on the notify list During this section, we need to prevent – Wakeups of the event – Other people waiting on the event This is a mutual-exclusion problem – Fortunately, we already know how to solve those

Lecture 8 Page 44 CS 111 Winter 2014 Synchronization Objects Combine mutual exclusion and (optional) waiting Operations implemented safely – With atomic instructions – With interrupt disables Exclusion policies (one-only, read-write) Waiting policies (FCFS, priority, all-at-once) Additional operations (queue length, revoke)

Lecture 8 Page 45 CS 111 Winter 2014 Lock Contention The riddle of parallel multi-tasking: – If one task is blocked, CPU runs another – But concurrent use of shared resources is difficult – Critical sections serialize tasks, eliminating parallelism What if everyone needs to share one resource? – One process gets the resource – Other processes get in line behind him – Parallelism is eliminated; B runs after A finishes – That resource becomes a bottle-neck

Lecture 8 Page 46 CS 111 Winter 2014 What If It Isn’t That Bad? Say each thread is only somewhat likely to need a resource Consider the following system – Ten processes, each runs once per second – One resource they all use 5% of time (5ms/sec) – Half of all time slices end with a preemption Chances of preemption while in critical section – Per slice: 2.5%, per sec: 22%, over 10 sec: 92% Chances a 2nd process will need resource – 5% in next time slice, 37% in next second But once this happens, a line forms

Lecture 8 Page 47 CS 111 Winter 2014 Resource Convoys All processes regularly need the resource – But now there is a waiting line – Nobody can “just use the resource”, must get in line The delay becomes much longer – We don’t just wait a few  sec until resource is free – We must wait until everyone in front of us finishes – And while we wait, more people get into the line Delays rise, throughput falls, parallelism ceases Not merely a theoretical transient response

Lecture 8 Page 48 CS 111 Winter 2014 Resource Convoy Performance throughput offered load ideal convoy

Lecture 8 Page 49 CS 111 Winter 2014 Avoiding Contention Problems Eliminate the critical section entirely – Eliminate shared resource, use atomic instructions Eliminate preemption during critical section – By disabling interrupts … not always an option Reduce lingering time in critical section – Minimize amount of code in critical section – Reduce likelihood of blocking in critical section Reduce frequency of critical section entry – Reduce use of the serialized resource – Spread requests out over more resources

Lecture 8 Page 50 CS 111 Winter 2014 An Approach Based on Smarter Locking Reads and writes are not equally common – File read/write: reads/writes > 50 – Directory search/create: reads/writes > 1000 Writers generally need exclusive access Multiple readers can generally share a resource Read/write locks – Allow many readers to share a resource – Only enforce exclusivity when a writer is active

Lecture 8 Page 51 CS 111 Winter 2014 Lock Granularity How much should one lock cover? – One object or many – Important performance and usability implications Coarse grained - one lock for many objects – Simpler, and more idiot-proof – Results in greater resource contention Fine grained - one lock per object – Spreading activity over many locks reduces contention – Time/space overhead, more locks, more gets/releases – Error-prone: harder to decide what to lock when – Some operations may require locking multiple objects (which creates a potential for deadlock)

Lecture 8 Page 52 CS 111 Winter 2014 Lock Granularity: Pools Vs. Elements Consider a pool of objects, each with its own lock Most operations lock only one buffer within the pool Some operations require locking the entire pool – Two threads both try to add block A to the cache – Thread 1 looks for block B while thread 2 is deleting it The pool lock could become a bottle-neck – Minimize its use, reader/writer locking, sub-pools... buffer Abuffer Bbuffer C buffer Dbuffer E... pool of file system cache buffers

Lecture 8 Page 53 CS 111 Winter 2014 Other Important Synchronization Primitives Semaphores Mutexes Monitors

Lecture 8 Page 54 CS 111 Winter 2014 Semaphores Counters for sequence coord. and mutual exclusion Can be binary counters or more general – E.g., if you have multiple copies of the resource Call wait() on the semaphore to obtain exclusive access to a critical section – For binary semaphores, you wait till whoever had it signals they are done Call signal() when you’re done For sequence coordination, signal on a shared semaphore when you finish first step – Wait before you do second step

Lecture 8 Page 55 CS 111 Winter 2014 Mutexes A synchronization construct to serialize access to a critical section Typically implemented using semaphores Mutexes are one per critical section – Unlike semaphores, which protect multiple copies of a resource

Lecture 8 Page 56 CS 111 Winter 2014 Monitors An object oriented synchronization primitive – Sort of very OO mutexes – Exclusion requirements depend on object/methods – Implementation should be encapsulated in object – Clients shouldn't need to know the exclusion rules A monitor is not merely a lock – It is an object class, with instances, state, and methods – All object methods protected by a semaphore Monitors have some very nice properties – Easy to use for clients, hides unnecessary details – High confidence of adequate protection

Lecture 8 Page 57 CS 111 Winter 2014 Synchronization in Real World Operating Systems How is this kind of synchronization handled in typical modern operating systems? In the kernel itself? In user-level OS features?

Lecture 8 Page 58 CS 111 Winter 2014 Kernel Mode Synchronization Performance is a major concern – Many different types of exclusion are available Shared/exclusive, interrupt-safe, SMP-safe Choose type best suited to the resource and situation – Implementations are in machine language Carefully coded for optimum performance Extensive use of atomic instructions Imposes a greater burden on the callers – Most locking is explicit and advisory – Caller expected to know and follow locking rules

Lecture 8 Page 59 CS 111 Winter 2014 User Mode Synchronization Simplicity and ease of use of great importance – Conservative, enforced, one-size-fits-all locking E.g., exclusive use, block until available – Implicitly associated with protected system objects E.g., files, processes, message queues, events, etc. System calls automatically serialize all operations Explicit serialization is only rarely used – To protect shared resources in multi-threaded apps – Simpler behavior than kernel-mode – Typically implemented via system calls into the OS

Lecture 8 Page 60 CS 111 Winter 2014 Why Require System Calls for User Level Sync Operations? Mutual exclusion operations likely involve the blocking and unblocking of threads – These are controlled by the operating system Critical sections in the implementations of those operations must be protected – From interrupts or SMP parallelism The OS already has powerful serialization mechanisms – It is easier to build on top of these

Lecture 8 Page 61 CS 111 Winter 2014 Why Is Performance More Important for Kernel Sync? Multi-threaded execution in user mode is rare – High resource contention even rarer – So performance problems with user-mode serialization are extremely rare The OS, on the other hand, is always running multiple concurrent threads The OS also includes many high use resources Avoiding resource contention is key to achieving good multi-processor scalability

Lecture 8 Page 62 CS 111 Winter 2014 So Why Provide Multiple Sync Primitives? If performance (and correctness) is so vital in OS sync, why not do it once? – Quick and right Multiple types of locking operation lead to better performance – Least restrictive locking discipline (e.g. reader/writer locks) can greatly reduce resource contention – Choosing exactly when and which locks are obtained and released can minimize the time spent in the critical section – Lessening the danger of deadlocks

Lecture 8 Page 63 CS 111 Winter 2014 Case Study: Unix Synchronization Internal use is very specific to particular Unix implementation – Linux makes extensive use of semaphores internally But all Unix systems provide some user-level synchronization primitives – Including Linux

Lecture 8 Page 64 CS 111 Winter 2014 Unix User Synchronization Mechanisms Semaphores – Mostly supporting a Posix standard interface – sem_open, sem_close, sem_post, sem_wait Mutual exclusion file creation (O_EXCL) Advisory file locking (flock) – Shared/exclusive, blocking/non-blocking Enforced record locking (lockf) – Locks a contiguous region of a file – Lock/unlock/test, blocking/non-blocking All blocks can be aborted by a timer

Lecture 8 Page 65 CS 111 Winter 2014 Unix Asynchronous Completions Most events are associated with open files – Normal files and devices – Network or inter-process communication ports Users can specify blocking or non-blocking use – Non-blocking returns if no data is yet available – Poll if a logical channel is ready or would block – Select the first of n channels to become ready Users can also yield and wait – E.g., for the termination of a child process Signal will awaken a process from any blockage – E.g., alarm clock signal after specified time interval

Lecture 8 Page 66 CS 111 Winter 2014 Completion Events Available in Linux and other Unix systems Used in multithreaded programs One thread creates and starts a completion Another thread calls a routine to wait on that completion event The thread that completes it makes another call – Which results in the waiting thread being woken

Lecture 8 Page 67 CS 111 Winter 2014 Case Study: Windows Synchronization Windows includes many synchronization methods – File locking – Other synchronization primitives

Lecture 8 Page 68 CS 111 Winter 2014 Windows File Locking By default, Windows applications have exclusive access to files they open – Can allow sharing For shared files, byte range locking provided – Applications can specify ranges of bytes in the file to lock Shared or exclusive locking – Windows file systems treat locks as mandatory – Other applications using file mapping treat them as advisory

Lecture 8 Page 69 CS 111 Winter 2014 Other Windows Synchronization Primitives A wide range – Mutexes (of several kinds) – Critical regions (and guarded regions, which are less protected) – Event-based synchronization E.g., waiting for an event to complete – Semaphores – Spin locks (several kinds) – Timers