Notes 09: Transaction Processing

Slides:

Advertisements

Similar presentations

1 Shivnath Babu Concurrency Control (II) CS216: Data-Intensive Computing Systems.

Advertisements

Cs4432concurrency control1 CS4432: Database Systems II Lecture #21 Concurrency Control : Theory Professor Elke A. Rundensteiner.

Database Systems (資料庫系統)

1 Lecture 11: Transactions: Concurrency. 2 Overview Transactions Concurrency Control Locking Transactions in SQL.

1 Concurrency Control Conflict serializability Two phase locking Optimistic concurrency control Source: slides by Hector Garcia-Molina.

Cs4432concurrency control1 CS4432: Database Systems II Lecture #22 Concurrency Control Professor Elke A. Rundensteiner.

Concurrency Control II

1 ICS 214B: Transaction Processing and Distributed Data Management Lecture 2: Enforcing Serializable Schedules Professor Chen Li.

Cs4432concurrency control1 CS4432: Database Systems II Lecture #23 Concurrency Control Professor Elke A. Rundensteiner.

Cs4432concurrency control1 CS4432: Database Systems II Lecture #22 Concurrency Control: Locking-based Protocols Professor Elke A. Rundensteiner.

1 ICS 214B: Transaction Processing and Distributed Data Management Lecture 4: More on Locks Professor Chen Li.

1 CS216 Advanced Database Systems Shivnath Babu Notes 12: Concurrency Control (II)

Concurrency Control Amol Deshpande CMSC424. Approach, Assumptions etc.. Approach  Guarantee conflict-serializability by allowing certain types of concurrency.

CS4432: Database Systems II Lecture #26 Concurrency Control and Recovery Professor Elke A. Rundensteiner.

Concurrent Transactions Even when there is no “failure,” several transactions can interact to turn a consistent state into an inconsistent state.

Concurrency Control R&G - Chapter 17 Smile, it is the key that fits the lock of everybody's heart. Anthony J. D'Angelo, The College Blue Book.

Transaction Processing: Concurrency and Serializability 10/4/05.

Transactions or Concurrency Control. Introduction A program which operates on a DB performs 2 kinds of operations: –Access to the Database (Read/Write)

Transactions. Definitions Transaction (program): A series of Read/Write operations on items in a Database. Example: Transaction 1 Read(C) Read(A) Write(A)

Concurrency. Correctness Principle A transaction is atomic -- all or none property. If it executes partly, an invalid state is likely to result. A transaction,

Concurrency Control 18.1 – 18.2 Chiu Luk CS257 Database Systems Principles Spring 2009.

CS 277 – Spring 2002Notes 091 CS 277: Database System Implementation Notes 09: Concurrency Control Arthur Keller.

CS4432transaction management1 CS4432: Database Systems II Lecture #23 Transaction Management Professor Elke A. Rundensteiner.

Cs4432concurrency control1 CS4432: Database Systems II Concurrency Control.

CS4432: Database Systems II Transaction Management Motivation 1.

Chapter 181 Chapter 18: Concurrency Control (Slides by Hector Garcia-Molina,

Cs4432concurrency control1 CS4432: Database Systems II Lecture #22 Concurrency Control: Locking-based Protocols Professor Elke A. Rundensteiner.

1 Notes 09: Transaction Processing Slides are modified from the CS 245 class slides of Hector Garcia- Molina.

DBMS 2001Notes 8: Concurrency1 Principles of Database Management Systems 8: Concurrency Control Pekka Kilpeläinen (after Stanford CS245 slide originals.

1 CSE232A: Database System Principle Concurrency Control.

TRANSACTION MANAGEMENT R.SARAVANAKUAMR. S.NAVEEN..

1 Concurrency Control II: Locking and Isolation Levels.

1 CS542 Concurrency Control: Theory and Protocol Professor Elke A. Rundensteiner.

1 Concurrency Control Lecture 22 Ramakrishnan - Chapter 19.

1 Database Systems ( 資料庫系統 ) December 27, 2004 Chapter 17 By Hao-hua Chu ( 朱浩華 )

Jinze Liu. Option 1: run system, recording P(S); at end of day, check for P(S) cycles and declare if execution was good.

Jinze Liu. ACID Atomicity: TX’s are either completely done or not done at all Consistency: TX’s should leave the database in a consistent state Isolation:

CS 440 Database Management Systems

Transaction Management

Concurrency Control.

Transactions and Concurrency Control

Concurrency Control Techniques

Database Systems II Concurrency Control

Concurrency Control.

CS 245: Database System Principles Notes 09: Concurrency Control

Part- A Transaction Management

Transaction Management

CS 245: Database System Principles Notes 09: Concurrency Control

Transaction Management Overview

Cse 344 March 25th – Isolation.

בקרת בו זמניות (concurrency control)

Concurrency Control 11/22/2018.

Transaction Management

March 9th – Transactions

Lecture 21: Concurrency & Locking

Yan Huang - CSCI5330 Database Implementation – Concurrency Control

CS162 Operating Systems and Systems Programming Review (II)

Chapter 15 : Concurrency Control

Transaction Management Overview

Transaction Management

Temple University – CIS Dept. CIS661 – Principles of Data Management

CPSC-608 Database Systems

Transaction Management Overview

CS 245: Database System Principles Notes 09: Concurrency Control

Introduction to Database Systems CSE 444 Lectures 17-18: Concurrency Control November 5-7, 2007.

CONCURRENCY Concurrency is the tendency for different tasks to happen at the same time in a system ( mostly interacting with each other ) . Parallel.

Data-Intensive Computing Systems

Database Systems (資料庫系統)

Lecture 18: Concurrency Control

Database Systems (資料庫系統)

Presentation transcript:

Notes 09: Transaction Processing Slides are modified from the CS 245 class slides of Hector Garcia-Molina

MySQL and Transactions 3.3.6 SET TRANSACTION Syntax, https://dev.mysql.com/doc/refman/5.6/en/set-transaction.html 14.5.2.1 Transaction Isolation Levels, https://dev.mysql.com/doc/refman/5.7/en/innodb-transaction-isolation-levels.html

Reading Chapter 10.1, 10.2 Chapter 11.1 – 11.3, 11.6

Transactions A sequence of operations on one or more data items. Fundamental assumption: a transaction when run on a consistent state and without interference with other transactions will leave the database in a consistent state.

Termination of Transactions Commit: guarantee that all of the changes of the transaction are permanent Abort: guarantee that none of the effects of the transaction are permanent Reason for abort: user, transaction, system failure

Chapter 9 Concurrency Control T1 T2 … Tn DB (consistency constraints)

Example: T1: Read(A) T2: Read(A) A  A+100 A  A2 Write(A) Write(A) Read(B) Read(B) B  B+100 B  B2 Write(B) Write(B) Constraint: A=B

Schedule A A B 25 25 T1 T2 Read(A); A  A+100 125 Write(A); 25 25 125 250 250 250 T1 T2 Read(A); A  A+100 Write(A); Read(B); B  B+100; Write(B); Read(A);A  A2; Read(B);B  B2;

Schedule B A B 25 25 T1 T2 Read(A);A  A2; 50 Write(A); 25 25 50 150 150 150 T1 T2 Read(A);A  A2; Write(A); Read(B);B  B2; Write(B); Read(A); A  A+100 Read(B); B  B+100;

Schedule C A B 25 25 T1 T2 Read(A); A  A+100 125 Write(A); 25 25 125 250 250 250 T1 T2 Read(A); A  A+100 Write(A); Read(A);A  A2; Read(B); B  B+100; Write(B); Read(B);B  B2;

Schedule D A B 25 25 T1 T2 Read(A); A  A+100 125 Write(A); 25 25 125 250 50 150 250 150 T1 T2 Read(A); A  A+100 Write(A); Read(A);A  A2; Read(B);B  B2; Write(B); Read(B); B  B+100;

Schedule E A B 25 25 T1 T2’ Read(A); A  A+100 125 Write(A); Same as Schedule D but with new T2’ A B 25 25 125 25 125 125 T1 T2’ Read(A); A  A+100 Write(A); Read(A);A  A1; Read(B);B  B1; Write(B); Read(B); B  B+100;

Want schedules that are “good”, regardless of initial state and transaction semantics Only look at order of read and writes Example: Sc=r1(A)w1(A)r2(A)w2(A)r1(B)w1(B)r2(B)w2(B)

Example: Sc=r1(A)w1(A)r2(A)w2(A)r1(B)w1(B)r2(B)w2(B) Sc’=r1(A)w1(A) r1(B)w1(B)r2(A)w2(A)r2(B)w2(B) T1 T2

However, for Sd: Sd=r1(A)w1(A)r2(A)w2(A) r2(B)w2(B)r1(B)w1(B) as a matter of fact, T2 must precede T1 in any equivalent schedule, i.e., T2  T1

T1 T2 Sd cannot be rearranged into a serial schedule Also, T1  T2 T1 T2 Sd cannot be rearranged into a serial schedule Sd is not “equivalent” to any serial schedule Sd is “bad”

Returning to Sc Sc=r1(A)w1(A)r2(A)w2(A)r1(B)w1(B)r2(B)w2(B) T1  T2 T1  T2  no cycles  Sc is “equivalent” to a serial schedule (in this case T1,T2)

Concepts Transaction: sequence of ri(x), wi(x) actions Conflicting actions: r1(A) w2(A) w1(A) w2(A) r1(A) w2(A) Schedule: represents chronological order in which actions are executed Serial schedule: no interleaving of actions or transactions

Transaction A transaction T is a partial order, denoted by <, where T  { r[x], w[x]:x is a data item}  {a, c} aT iff cT Suppose t is either a or c, than for any other operation pT , p<t

Complete History A complete history H over T={T1, …, Tn} is a partial order , <H, where H= (i=1, …, n) Ti <H  (i=1, …, n) <I For any pair of conflicting operations p, q, either p <H q or q<Hp

Conflict equivalent histories H1, H2 are conflict equivalent if Both are over the same transactions and have the same operations, and They order the conflicting operations the same way, i.e., if H1 can be transformed into H2 by a series of swaps on non-conflicting actions.

Conflict Serializable History A history is conflict serializable if it is conflict equivalent to some serial history.

View Equivalence Let T1: x=x+1, T2: x=x*3 Let initially x=5 Serial histories: T1, T2, final value x=18 T2, T1, final value x=16 No efficient algorithm to guarantee view serializability

Precedence graph P(S) (S is schedule/ history) Nodes: transactions in S Arcs: Ti  Tj whenever - pi(A), qj(A) are actions in S - pi(A) <S qj(A) - at least one of pi, qj is a write

Exercise: What is P(S) for S = w3(A) w2(C) r1(A) w1(B) r1(C) w2(A) r4(A) w4(D) Is S serializable?

Lemma S1, S2 conflict equivalent  P(S1)=P(S2) Proof: Assume P(S1)  P(S2)   Ti: Ti  Tj in S1 and not in S2  S1 = …pi(A)... qj(A)… pi, qj S2 = …qj(A)…pi(A)... conflict  S1, S2 not conflict equivalent

Note: P(S1)=P(S2)  S1, S2 conflict equivalent Counter example: S1=w1(A) r2(A) w2(B) r1(B) S2=r2(A) w1(A) r1(B) w2(B)

Theorem P(S1) acyclic  S1 conflict serializable () Assume S1 is conflict serializable   Ss: Ss, S1 conflict equivalent  P(Ss) = P(S1)  P(S1) acyclic since P(Ss) is acyclic

Theorem P(S1) acyclic  S1 conflict serializable T1 T2 T3 () Assume P(S1) is acyclic Transform S1 as follows: (1) Take T1 to be transaction with no incident arcs (2) Move all T1 actions to the front S1 = ……. qj(A)…….p1(A)….. (3) we now have S1 = < T1 actions ><... rest ...> (4) repeat above steps to serialize rest!

How to enforce serializable schedules? Option 1: run system, recording P(S); at end of day, check for P(S) cycles and declare if execution was good

How to enforce serializable schedules? Option 2: prevent P(S) cycles from occurring T1 T2 ….. Tn Scheduler DB

What to do with aborted transactions? If a transaction T aborts, the DBMS must: Undo all values written by T Abort all transactions that read any value written by T (avoid dirty read) Ensure recoverability Strict execution: a transaction can read or write only committed values.

A locking protocol Two new actions: lock (exclusive): li (A) unlock: ui (A) T1 T2 lock table scheduler

Rule #1: Well-formed transactions Ti: … li(A) … pi(A) … ui(A) ... That is, if the transaction 1) locks an item in an appropriate mode before accessing it and 2) unlocks each item that it locks.

Rule #2 Legal scheduler S = …….. li(A) ………... ui(A) ……... If the transaction does not lock any item if it is already locked in a conflicting mode. no lj(A)

Exercise: What schedules are legal? What transactions are well-formed? S1 = l1(A)l1(B)r1(A)w1(B)l2(B)u1(A)u1(B) r2(B)w2(B)u2(B)l3(B)r3(B)u3(B) S2 = l1(A)r1(A)w1(B)u1(A)u1(B) l2(B)r2(B)w2(B)l3(B)r3(B)u3(B) S3 = l1(A)r1(A)u1(A)l1(B)w1(B)u1(B) l2(B)r2(B)w2(B)u2(B)l3(B)r3(B)u3(B)

Exercise: What schedules are legal? What transactions are well-formed? S1 = l1(A)l1(B)r1(A)w1(B)l2(B)u1(A)u1(B) r2(B)w2(B)u2(B)l3(B)r3(B)u3(B) S2 = l1(A)r1(A)w1(B)u1(A)u1(B) l2(B)r2(B)w2(B)l3(B)r3(B)u3(B) S3 = l1(A)r1(A)u1(A)l1(B)w1(B)u1(B) l2(B)r2(B)w2(B)u2(B)l3(B)r3(B)u3(B)

Schedule F T1 T2 l1(A);Read(A) A A+100;Write(A);u1(A) l2(A);Read(A) A Ax2;Write(A);u2(A) l2(B);Read(B) B Bx2;Write(B);u2(B) l1(B);Read(B) B B+100;Write(B);u1(B)

Schedule F A B T1 T2 25 25 l1(A);Read(A) A A+100;Write(A);u1(A) 125 A Ax2;Write(A);u2(A) 250 l2(B);Read(B) B Bx2;Write(B);u2(B) 50 l1(B);Read(B) B B+100;Write(B);u1(B) 150 250 150

Rule #3 Two phase locking (2PL) for transactions Ti = ……. li(A) ………... ui(A) ……... no unlocks no locks

# locks held by Ti Time Growing Shrinking Phase Phase

Schedule G T1 T2 l1(A);Read(A) A A+100;Write(A) l1(B); u1(A) A Ax2;Write(A);l2(B) delayed

Schedule G T1 T2 l1(A);Read(A) A A+100;Write(A) l1(B); u1(A) A Ax2;Write(A);l2(B) Read(B);B B+100 Write(B); u1(B) delayed

Schedule G T1 T2 l1(A);Read(A) A A+100;Write(A) l1(B); u1(A) A Ax2;Write(A);l2(B) Read(B);B B+100 Write(B); u1(B) l2(B); u2(A);Read(B) B Bx2;Write(B);u2(B); delayed

Schedule H (T2 reversed) T1 T2 l1(A); Read(A) l2(B);Read(B) A A+100;Write(A) B Bx2;Write(B) l1(B) l2(A) delayed delayed

Assume deadlocked transactions are rolled back They have no effect They do not appear in schedule E.g., Schedule H = This space intentionally left blank!

Next step: Show that rules #1,2,3  conflict- serializable schedules Advantage: Easy to guarantee these properties!

Conflict rules for li(A), ui(A): li(A), lj(A) conflict li(A), uj(A) conflict Note: no conflict < ui(A), uj(A)>, < li(A), rj(A)>,...

Theorem Rules #1,2,3  conflict (2PL) serializable schedule To help in proof: Definition Shrink(Ti) = SH(Ti) = first unlock action of Ti

Lemma Ti  Tj in S  SH(Ti) <S SH(Tj) Proof of lemma: Ti  Tj means that S = … pi(A) … qj(A) …; p,q conflict By rules 1,2: S = … pi(A) … ui(A) … lj(A) ... qj(A) … By rule 3: SH(Ti) SH(Tj) So, SH(Ti) <S SH(Tj)

Theorem Rules #1,2,3  conflict (2PL) serializable schedule Proof: (1) Assume P(S) has cycle T1  T2 …. Tn  T1 (2) By lemma: SH(T1) < SH(T2) < ... < SH(T1) (3) Impossible, so P(S) acyclic (4)  S is conflict serializable

Problem with 2PL Deadlock Deadlock detection, prevention, avoidance

Variations of 2PL Conservative 2PL:each transaction must acquire all its locks before any of the operations are submitted Adv: no deadlocks Disadv: need read and write sets in advance Starvation

Variations of 2PL Strict 2PL: transactions release their locks after either commit or abort Adv: transaction has all the locks needed, recoverable histories, avoid cascading aborts Disadv: cannot avoid deadlocks, performance tradeoffs

Deadlock Avoidance 1. Order database object in linear order and transactions must acquire locks in this order Adv.: avoid deadlock Disadv.: need order in advance and need to know all the operations in advance

Deadlock Avoidance 2. Time-out: if a transaction waits for a lock longer than a predefined time period then the transaction aborts Adv.: no information about the transactions is needed in advance Disadv.: potentially cascading aborts

Deadlock Avoidance 3. Wait-for-graph: keep a record on which transactions waits for which transaction to release a lock. If cycle then there is a deadlock. Adv.: simplified administration Disadv.: which transaction to abort?

Beyond the variations of 2PL protocol, it is all a matter of improving performance and allowing more concurrency…. Shared locks Multiple granularity (database -> cell) Inserts, deletes and phantoms Other types of C.C. mechanisms

Shared locks So far: S = ...l1(A) r1(A) u1(A) … l2(A) r2(A) u2(A) … Do not conflict Instead: S=... ls1(A) r1(A) ls2(A) r2(A) …. us1(A) us2(A)

Lock actions l-ti(A): lock A in t mode (t is S or X) u-ti(A): unlock t mode (t is S or X) Shorthand: ui(A): unlock whatever modes Ti has locked A

Rule #1 Well formed transactions Ti =... l-S1(A) … r1(A) … u1 (A) … Ti =... l-X1(A) … w1(A) … u1 (A) …

What about transactions that read and write same object? Option 1: Request exclusive lock Ti = ...l-X1(A) … r1(A) ... w1(A) ... u(A) …

Option 2: Upgrade What about transactions that read and write same object? Option 2: Upgrade (E.g., need to read, but don’t know if will write…) Ti=... l-S1(A) … r1(A) ... l-X1(A) …w1(A) ...u(A)… Think of - Get 2nd lock on A, or - Drop S, get X lock

Rule #2 Legal scheduler S = ....l-Si(A) … … ui(A) … no l-Xj(A) (but can have l-Sj(A) ) S = ... l-Xi(A) … … ui(A) … no l-Xj(A) no l-Sj(A)

A way to summarize Rule #2 Compatibility matrix Comp S X S true false X false false

Rule # 3 2PL transactions No change except for upgrades: (I) If upgrade gets more locks (e.g., S  {S, X}) then no change! (II) If upgrade releases read (shared) lock (e.g., S  X) - can be allowed in growing phase

Proof: similar to X locks case Theorem Rules 1,2,3  Conf.serializable for S/X locks schedules Proof: similar to X locks case Detail: l-ti(A), l-rj(A) do not conflict if comp(t,r) l-ti(A), u-rj(A) do not conflict if comp(t,r)

Lock types beyond S/X Examples: (1) increment/decrement lock (2) update lock

Example (1): increment lock Atomic increment action: INi(A) {Read(A); A  A+k; Write(A)} INi(A), INj(A) do not conflict! A=7 A=5 A=17 A=15 INj(A) +10 INi(A) +2 +10 INj(A) +2 INi(A)

Comp S X I S X I

How about adding another column and row for decrement operations? Comp S X I S T F F X F F F I F F T How about adding another column and row for decrement operations?

Update locks A common deadlock problem with upgrades: T1 T2 l-S1(A) l-X1(A) l-X2(A) --- Deadlock ---

Solution If Ti wants to read A and knows it may later want to write A, it requests update lock (not shared)

New request Comp S X U S X U Lock already held in

New request Comp S X U S T F T X F F F U TorF F F -> symmetric table? Lock already held in

Note: object A may be locked in different modes at the same time... S1=...l-S1(A)…l-S2(A)…l-U3(A)… l-S4(A)…? l-U4(A)…? To grant a lock in mode t, mode t must be compatible with all currently held locks on object

Database hierarchy (tree) Locking Granularity Database Relation Attributes Tuple Database hierarchy (tree)

Locking Granularity Tree: Implicit lock: Each node has a unique parent Each node in the hierarchy can be locked A requested node is locked in an explicit mode, e.g., exclusive lock Implicit lock: When a parent node is locked in an exclusive lock, each descendent is locked in an implicit lock

Intentional Lock Must be assigned top-down Indicate intention (e.g., IS, IX) Must tag both the ancestors and the descendents of a node

Locks IS: intentional share the requested node Descendents: can be explicitly locked in S or IS mode IX: intentional exclusive access to the requested node Descendents: can be locked explicitly in X, S, SIX, IS, and IX

Locks S: shared access to the requested node Descendents: implicit shared locks on all X: exclusive access to the requested node Descendents: implicit x-locks on all SIX: shared and intentional exclusive access on the requested node Descendents: implicit shared lock on all, and explicit lock in X, SIX, or IX mode

Compatibility IS IX S SIX X

How does locking work in practice? Every system is different (E.g., may not even provide CONFLICT-SERIALIZABLE schedules) But here is one (simplified) way ...

Sample Locking System: (1) Don’t trust transactions to request/release locks (2) Hold all locks until transaction commits # locks time

l(A),Read(A),l(B),Write(B)… Scheduler, part I lock table Read(A),Write(B) l(A),Read(A),l(B),Write(B)… Scheduler, part I lock table Scheduler, part II DB

Next class Distributed Database Systems