Outline Introduction Background Distributed DBMS Architecture

Slides:



Advertisements
Similar presentations
Outline Introduction Background Distributed Database Design
Advertisements

CS542: Topics in Distributed Systems Distributed Transactions and Two Phase Commit Protocol.
(c) Oded Shmueli Distributed Recovery, Lecture 7 (BHG, Chap.7)
CS 603 Handling Failure in Commit February 20, 2002.
1 ICS 214B: Transaction Processing and Distributed Data Management Lecture 12: Three-Phase Commits (3PC) Professor Chen Li.
CIS 720 Concurrency Control. Timestamp-based concurrency control Assign a timestamp ts(T) to each transaction T. Each data item x has two timestamps:
Computer Science Lecture 18, page 1 CS677: Distributed OS Last Class: Fault Tolerance Basic concepts and failure models Failure masking using redundancy.
ICS 421 Spring 2010 Distributed Transactions Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/16/20101Lipyeow.
Termination and Recovery MESSAGES RECEIVED SITE 1SITE 2SITE 3SITE 4 initial state committablenon Round 1(1)CNNN-NNNN Round 2FAILED(1)-CNNN--NNN Round 3FAILED.
Systems of Distributed Systems Module 2 -Distributed algorithms Teaching unit 3 – Advanced algorithms Ernesto Damiani University of Bozen Lesson 6 – Two.
Computer Science Lecture 17, page 1 CS677: Distributed OS Last Class: Fault Tolerance Basic concepts and failure models Failure masking using redundancy.
Non-blocking Atomic Commitment Aaron Kaminsky Presenting Chapter 6 of Distributed Systems, 2nd edition, 1993, ed. Mullender.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
CS 603 Three-Phase Commit February 22, Centralized vs. Decentralized Protocols What if we don’t want a coordinator? Decentralized: –Each site broadcasts.
©Silberschatz, Korth and Sudarshan19.1Database System Concepts Distributed Transactions Transaction may access data at several sites. Each site has a local.
1 More on Distributed Coordination. 2 Who’s in charge? Let’s have an Election. Many algorithms require a coordinator. What happens when the coordinator.
Failure and Availibility DS Lecture # 12. Optimistic Replication Let everyone make changes –Only 3 % transactions ever abort Make changes, send updates.
1 ICS 214B: Transaction Processing and Distributed Data Management Distributed Database Systems.
Distributed Commit. Example Consider a chain of stores and suppose a manager – wants to query all the stores, – find the inventory of toothbrushes at.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Commit Protocols. CS5204 – Operating Systems2 Fault Tolerance Causes of failure: process failure machine failure network failure Goals : transparent:
CS162 Section Lecture 10 Slides based from Lecture and
Distributed Transactions March 15, Transactions What is a Distributed Transaction?  A transaction that involves more than one server  Network.
Chapter 19 Recovery and Fault Tolerance Copyright © 2008.
Distributed Transactions Chapter 13
Operating Systems Distributed Coordination. Topics –Event Ordering –Mutual Exclusion –Atomicity –Concurrency Control Topics –Event Ordering –Mutual Exclusion.
Distributed Transaction Management, Fall 2002Lecture Distributed Commit Protocols Jyrki Nummenmaa
Fault Tolerance CSCI 4780/6780. Distributed Commit Commit – Making an operation permanent Transactions in databases One phase commit does not work !!!
University of Tampere, CS Department Distributed Commit.
Commit Algorithms Hamid Al-Hamadi CS 5204 November 17, 2009.
Distributed Transactions Chapter – Vidya Satyanarayanan.
Concurrency Control and Reliable Commit Protocol in Distributed Database Systems Jian Jia Chen 2002/05/09 Real-time and Embedded System Lab., CSIE, National.
Committed:Effects are installed to the database. Aborted:Does not execute to completion and any partial effects on database are erased. Consistent state:
Lampson and Lomet’s Paper: A New Presumed Commit Optimization for Two Phase Commit Doug Cha COEN 317 – SCU Spring 05.
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Topics in Distributed Databases Database System Implementation CSE 507 Some slides adapted from Navathe et. Al and Silberchatz et. Al.
Distributed Databases – Advanced Concepts Chapter 25 in Textbook.
More on Fault Tolerance
Recovery in Distributed Systems:
Outline Introduction Background Distributed DBMS Architecture
Two Phase Commit CSE593 Transaction Processing Philip A. Bernstein
Chapter 19: Distributed Databases
Outline Introduction Background Distributed DBMS Architecture
CSC 8320 Advanced Operating Systems Xueting Liao
Database System Implementation CSE 507
Two phase commit.
Commit Protocols CS60002: Distributed Systems
RELIABILITY.
Outline Announcements Fault Tolerance.
2PC Recap Eventual Consistency & Dynamo
2PC Recap Eventual Consistency & Dynamo
Outline Introduction Background Distributed DBMS Architecture
Concurrency Control and Reliable Commit Protocol in Distributed Database Systems Jian Jia Chen 2002/05/09 Real-time and Embedded System Lab., CSIE, National.
Outline Introduction Background Distributed DBMS Architecture
Outline Introduction Background Distributed DBMS Architecture
CSE 486/586 Distributed Systems Concurrency Control --- 3
Outline Introduction Background Distributed DBMS Architecture
Assignment 5 - Solution Problem 1
Distributed Transactions
Exercises for Chapter 14: Distributed Transactions
Distributed Databases Recovery
UNIVERSITAS GUNADARMA
CIS 720 Concurrency Control.
CSE 486/586 Distributed Systems Concurrency Control --- 3
Last Class: Fault Tolerance
Outline Introduction Background Distributed DBMS Architecture
Outline Introduction Background Distributed DBMS Architecture
Transaction Communication
Presentation transcript:

Outline Introduction Background Distributed DBMS Architecture Distributed Database Design Distributed Query Processing Transaction Management Commit/Termination protocols – 2PC Building Distributed Database Systems (RAID) Mobile Database Systems Privacy, Trust, and Authentication Peer to Peer Systems

Useful References Textbook Principles of Distributed Database Systems, Chapter 12.4, 12.5.1 D. Skeen and M Stonebraker, A Formal Model of Crash Recovery in a Distributed System, IEEE Trans. Software Eng. 9(3): 219-228, 1983. D. Skeen, A Decentralized Termination Protocol, IEEE Symposium on Reliability in Distributed Software and Database Systems, July 1981.

Byzantine General Problem Two generals are situated on adjacent hills and enemy is in the valley in between. Enemy can defeat either general, but not both. To succeed, both generals must agree to either attack or retreat. The generals can communicate via messengers who are subject to capture or getting lost. The general may themselves be traitors or send inconsistent information.

Byzantine Agreement Problem of a set of processors to agree on a common value for an object. Processors may fail arbitrarily, die and revive randomly, send messages when they are not supposed to etc.

Atomicity Control from Book Commit protocols How to execute commit command for distributed transactions. Issue: how to ensure atomicity and durability? Termination protocols If a failure occurs, how can the remaining operational sites deal with it. Non-blocking : the occurrence of failures should not force the sites to wait until the failure is repaired to terminate the transaction. Recovery protocols When a failure occurs, how do the sites where the failure occurred deal with it. Independent : a failed site can determine the outcome of a transaction without having to obtain remote information. Independent recovery  non-blocking termination

General Terminology for Commit/Termination/Recovery Protocols Committed: Effects are installed to the database. Aborted: Does not execute to completion and any partial effects on database are erased. Consistent state: Derived state from serial execution. Inconsistency caused by: Concurrently executing transaction. Failures causing partial or incorrect execution of a transaction.

General Terminology for Commit/Termination/Recovery Protocols Commit protocols Protocols for directing the successful execution of a simple transaction Termination protocols Protocols at operational site to commit/abort an unfinished transaction after a failure Recovery protocols Protocols at failed site to complete all transactions outstanding at the time of failure

General Terminology for Commit/Termination/Recovery Protocols Distributed Crash Recovery: Centralized Protocols Hierarchical Protocols Linear Protocols Decentralized Protocols Phase: Consists of a message round where all Sites exchange messages. Two Phase Commit Protocol: ARGUS, LOCUS, INGRES Four Phase Commit Protocol: SSD-1 Quorum: Minimum number of sites needed to proceed with an action

Commit/Termination Protocols Two Phase Commit Three Phase Commit Four Phase Commit Linear, Centralized, Hierarchical, Decentralized Protocols

Two Phase Commit Site 1 Site 2 1. Trans. arrives. Message to ask for vote is sent to other site(s) Message is recorded. Site votes Y or N (abort) Vote is sent to site 1 2. The vote is received. If vote = Y on both sites, then Commit else Abort Either Commit or Abort based on the decision of site 1

Two-Phase Commit (2PC) Phase 1 : The coordinator gets the participants ready to write the results into the database Phase 2 : Everybody writes the results into the database Coordinator :The process at the site where the transaction originates and which controls the execution Participant :The process at the other sites that participate in executing the transaction Global Commit Rule: The coordinator aborts a transaction if and only if at least one participant votes to abort it. The coordinator commits a transaction if and only if all of the participants vote to commit it.

Local Protocols for the Centralized Two-Phase Commit Protocol q1 w1 a1 c1 c2 xact request start xact yes abort commit a2 no w2 q2 Site 1 (co-ordinator) Site 2 (slave)

Decentralized Two-Phase Commit Protocol ci ai wi qi start xact noi1 noin yesi1 yesin … no1i| |noni yes1i yesni Site i (i = 1,2,…n) receive send

Centralized 2PC (see book) ready? yes/no commit/abort? commited/aborted Phase 1 Phase 2

SDD-1 Four-Phase Commit Protocol q1 a1’ c1’ w1’ w1 a1 c1 c2 a2 q2 w2 Site 1 (co-ordinator) Site 2 (back-up) request xact2 act2 xact3xact4 abort2 commit2 yes3yes4 no3|no4 ack2 commit3commit4 abort3abort4 ci ai wi qi xacti noi yesi aborti commiti Site i (i = 3,4) (slave)

2PC Protocol Actions (see book) Coordinator Participant INITIAL INITIAL PREPARE write begin_commit in log write abort in log No Ready to Commit? VOTE-ABORT Yes WAIT VOTE-COMMIT write ready in log Yes write abort in log GLOBAL-ABORT READY Any No? No VOTE-COMMIT write commit in log Abort Type of msg ACK COMMIT ABORT write abort in log Commit ACK write commit in log write end_of_transaction in log ABORT COMMIT

Linear 2PC Phase 1 Prepare VC/VA VC/VA VC/VA VC/VA 1 2 3 4 5 N GC/GA VC: Vote-Commit, VA: Vote-Abort, GC: Global-commit, GA: Global-abort

Distributed 2PC Coordinator Participants Participants global-commit/ global-abort vote-abort/ vote-commit decision made independently prepare Phase 1 Phase 2

State Transitions in 2PC (see book) INITIAL INITIAL Commit command Prepare Prepare Vote-commit Prepare Vote-abort READY WAIT Vote-abort Vote-commit (all) Global-abort Global-commit Global-abort Global-commit Ack Ack ABORT COMMIT ABORT COMMIT Coordinator Participants

Site Failures - 2PC Termination (see book) COORDINATOR Timeout in INITIAL Who cares Timeout in WAIT Cannot unilaterally commit Can unilaterally abort Timeout in ABORT or COMMIT Stay blocked and wait for the acks INITIAL Commit command Prepare WAIT Vote-abort Vote-commit Global-abort Global-commit ABORT COMMIT

Site Failures - 2PC Termination PARTICIPANTS INITIAL Timeout in INITIAL Coordinator must have failed in INITIAL state Unilaterally abort Timeout in READY Stay blocked Prepare Vote-commit Prepare Vote-abort READY Global-abort Global-commit Ack Ack ABORT COMMIT

Site Failures - 2PC Recovery COORDINATOR Failure in INITIAL Start the commit process upon recovery Failure in WAIT Restart the commit process upon recovery Failure in ABORT or COMMIT Nothing special if all the acks have been received Otherwise the termination protocol is involved INITIAL Commit command Prepare WAIT Vote-commit Vote-abort Global-commit Global-abort ABORT COMMIT

Site Failures - 2PC Recovery PARTICIPANTS Failure in INITIAL Unilaterally abort upon recovery Failure in READY The coordinator has been informed about the local decision Treat as timeout in READY state and invoke the termination protocol Failure in ABORT or COMMIT Nothing special needs to be done INITIAL Prepare Vote-commit Prepare Vote-abort READY Global-abort Global-commit Ack Ack ABORT COMMIT

2PC Recovery Protocols –Additional Cases (see book) Arise due to non-atomicity of log and message send actions Coordinator site fails after writing “begin_commit” log and before sending “prepare” command treat it as a failure in WAIT state; send “prepare” command Participant site fails after writing “ready” record in log but before “vote-commit” is sent treat it as failure in READY state alternatively, can send “vote-commit” upon recovery Participant site fails after writing “abort” record in log but before “vote-abort” is sent no need to do anything upon recovery

2PC Recovery Protocols –Additional Case (see book) Coordinator site fails after logging its final decision record but before sending its decision to the participants coordinator treats it as a failure in COMMIT or ABORT state participants treat it as timeout in the READY state Participant site fails after writing “abort” or “commit” record in log but before acknowledgement is sent participant treats it as failure in COMMIT or ABORT state coordinator will handle it by timeout in COMMIT or ABORT state

Problem With 2PC Blocking Independent recovery is not possible Ready implies that the participant waits for the coordinator If coordinator fails, site is blocked until recovery Blocking reduces availability Independent recovery is not possible However, it is known that: Independent recovery protocols exist only for single site failures; no independent recovery protocol exists which is resilient to multiple-site failures. So we search for these protocols – 3PC