Computer Science 5204 Operating Systems Fall, 2010

Slides:



Advertisements
Similar presentations
Computer Science 5204 Operating Systems Fall, 2010 Dr. Dennis Kafura Course Overview 1.
Advertisements

Concurrency Issues Motivation, Problems, Directions Dennis Kafura - CS Operating Systems1.
Dr. Kalpakis CMSC 621, Advanced Operating Systems. Fall 2003 URL: Distributed System Architectures.
Chapter 19 Database Recovery Techniques
Computer Science Lecture 18, page 1 CS677: Distributed OS Last Class: Fault Tolerance Basic concepts and failure models Failure masking using redundancy.
Computer Science Lecture 17, page 1 CS677: Distributed OS Last Class: Fault Tolerance Basic concepts and failure models Failure masking using redundancy.
EEC 688/788 Secure and Dependable Computing Lecture 12 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
CS-550 (M.Soneru): Recovery [SaS] 1 Recovery. CS-550 (M.Soneru): Recovery [SaS] 2 Recovery Computer system recovery: –Restore the system to a normal operational.
EEC 688/788 Secure and Dependable Computing Lecture 13 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Page 1 Copyright © Alexander Allister Shvartsman CSE 6510 (461) Fall 2010 Selected Notes on Fault-Tolerance (12) Alexander A. Shvartsman Computer.
Transaction. A transaction is an event which occurs on the database. Generally a transaction reads a value from the database or writes a value to the.
 Mechanism for restoring a database quickly and accurately after loss or damage  RESPONSIBILITY OF ?????  Recovery facilities: Backup Facilities Backup.
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
Academic Year 2014 Spring. MODULE CC3005NI: Advanced Database Systems “DATABASE RECOVERY” (PART – 1) Academic Year 2014 Spring.
TRANSACTION PROCESSING SYSTEM Liew Woei Song Muhammad Hofiz Achoson.
Commit Protocols. CS5204 – Operating Systems2 Fault Tolerance Causes of failure: process failure machine failure network failure Goals : transparent:
DBSQL 7-1 Copyright © Genetic Computer School 2009 Chapter 7 Transaction Management, Database Security and Recovery.
Chapter 13 Processing Controls. Operating System Integrity Operating system -- the set of programs implemented in software/hardware that permits sharing.
1 Database Systems CS204 Lecture 21 Transaction Processing I Asma Ahmad FAST-NU April 7, 2011.
Database Management System Module 5 DeSiaMorewww.desiamore.com/ifm1.
1 Welcome: To the second learning sequence “ Data Base (DB) and Data Base Management System (DBMS) “ Recap : In the previous learning sequence, we discussed.
File Access and Transfer. Issues 4 Access and transfer are different operations –with different requirements 4 Transfer –move the file from one place.
Chapter 19 Recovery and Fault Tolerance Copyright © 2008.
Computer Science 5204 Operating Systems Fall, 2011 Dr. Dennis Kafura Course Overview 1.
CMPT 454, Simon Fraser University, Fall 2009, Martin Ester 294 Database Systems II Coping With System Failures.
Chapter 1 Introduction to Databases. 1-2 Chapter Outline   Common uses of database systems   Meaning of basic terms   Database Applications  
CS5204 – Operating Systems 1 Checkpointing-Recovery.
Concurrency Control. Objectives Management of Databases Concurrency Control Database Recovery Database Security Database Administration.
 Distributed file systems having transaction facility need to support distributed transaction service.  A distributed transaction service is an extension.
Commit Algorithms Hamid Al-Hamadi CS 5204 November 17, 2009.
MBA 664 Database Management Dave Salisbury ( )
A Survey of Fault Tolerance in Distributed Systems By Szeying Tan Fall 2002 CS 633.
Computer Science 5204 Operating Systems Fall, 2012 Dr. Dennis Kafura Course Overview 1.
1 CS-550: Materials for the Final Exam - Tue., Dec. 11, 2001, 8 a.m. Distributed Systems 8.Client-server computing Architecture, Relational database applications.
Operating System Reliability Andy Wang COP 5611 Advanced Operating Systems.
Distributed Systems Architectures. Topics covered l Client-server architectures l Distributed object architectures l Inter-organisational computing.
CS4222 Principles of Database System
Processes and threads.
On-Line Transaction Processing
Managing Multi-User Databases
Database Transaction Abstraction I
Outline Types of Databases and Database Applications Basic Definitions
Last Class: Canonical Problems
Computer Science 5204 Operating Systems Fall, 2010
Operating System Reliability
Operating System Reliability
IS442 Information Systems Engineering
Introduction to Operating Systems
Batches, Transactions, & Errors
Database Processing: David M. Kroenke’s Chapter Nine: Part Two
Operating System Reliability
Operating System Reliability
What is Concurrent Programming?
Outline Introduction Background Distributed DBMS Architecture
Introduction to Database Management Systems
Outline Introduction Background Distributed DBMS Architecture
Database management concepts
Batches, Transactions, & Errors
What is Concurrent Programming?
Operating System Reliability
Database Processing: David M. Kroenke’s Chapter Nine: Part Two
Chapter 2 Operating System Overview
Operating System Overview
Distributed Systems (15-440)
Abstractions for Fault Tolerance
Concurrency Control.
Distributed Systems and Concurrency: Distributed Systems
Last Class: Fault Tolerance
Operating System Reliability
Operating System Reliability
Presentation transcript:

Computer Science 5204 Operating Systems Fall, 2010 Dr. Dennis Kafura Course Overview

Material intensive Balance Organization Syllabus on web site 35 +/- papers 25 audio/video files No required text Balance Theory vs. technology Contemporary vs. classic Survey vs. depth Centralized vs. distributed Syllabus on web site CS 5204 – Operating Systems

Course Web site CS 5204 – Operating Systems http://courses.cs.vt.edu/cs5204/fall10--kafura-NVC CS 5204 – Operating Systems

Major Topics and Themes Atomicity Order Consistency Theory Protocols Google 1. Concurrency x 2. Security 3. File systems 4. Fault tolerance CS 5204 – Operating Systems

1. Concurrency How can concurrent processing be structured on a single processor? What are the problems and issues in developing systems with concurrent activities? What are emerging models for concurrent programming? How can the problems with multi-threaded programming be alleviated? How can transaction-style semantics be supported locally in hardware or software? How can concurrency and communication be represented formally? CS 5204 – Operating Systems

Process vs. Thread Models . . . user kernel process-centered thread-centered CS 5204 – Operating Systems

Models of Concurrent Computation MapReduce Concurrent Collections (CnC) step item tag CS 5204 – Operating Systems

Thread-per-process models Sequential process Communication channel Communicating Sequential Processes Sammati Grace CS 5204 – Operating Systems

Supporting Transaction Semantics repeat { BeginTransaction(); /* initialize transaction */ <read input values> success = Validate(); /* test if inputs consistent */ if (success) { <generate updates> success = Commit(); /* attempt permanent update */ if (!success) Abort(); /* terminate if unable to commit */ } EndTransaction(); /* close transaction */ } until (success); TM Object Transaction New Object Old Object Locator object data CS 5204 – Operating Systems

π-Calculus An algebra that captures the notions of communication, interaction, and synchronization among concurrently executing entities. talk1 switch1 talk2 switch2 IdTrans Trans Idtrans Trans lose2 lose2 lose1 lose1 gain1 Control gain2 gain1 Control gain2 CS 5204 – Operating Systems

2. Security How can rights for access control be structured for effective use and management? How can a digital document be “signed” so as to identify authorship? How can communicating parties be confident of each other’s identities? How can a platform attest to the validity of its execution environment? How can access policies be expressed and enforced? CS 5204 – Operating Systems

Trusted Platform Module Security Overview Models of Protection Trusted Platform Module CS 5204 – Operating Systems

Security in distributed systems Describe explicit trust relationships Express security token issuance policies Provide security tokens that contain identities, capabilities, and/or delegation policies Express resource authorization and delegation policies CS 5204 – Operating Systems

3. File Systems What does a “typical” file structure look like? What problems can arise in the file sytems when systems fail? How can file systems be structured to withstand (or more easily recover from) system failures? How can file systems be structured to handle tera-bytes of information? CS 5204 – Operating Systems

Local File Systems Structures Log-structured file system Storage Stack Local File Systems Structures Log-structured file system Journaling and soft updates Google File System CS 5204 – Operating Systems

4. Fault Tolerance How can events be ordered in a distributed system lacking a shared clock? How can this ordering give rise to a form of virtual time? What are basic approaches to recovery from failure? What is the taxonomy of strategies of “backward” recovery? How can the state of system be captured so that it can be recovered in the event of failure? How can distributed elements agree on commit to accepting a change in the system state? CS 5204 – Operating Systems

P Q Event Ordering P1 Q1 P3 P2 Q3 Q2 How can the events on P be related to the events on Q? Which events of P “happened before” which events of Q? When does it matter how we answer these questions? CS 5204 – Operating Systems

Recovery erroneous state error valid state failure causes fault leads to recovery An error is a manifestation of a fault that can lead to a failure. Failure Recovery: backward recovery operation­based (do­undo­redo logs) state­based (checkpoints) forward recovery CS 5204 – Operating Systems