Presentation is loading. Please wait.

Presentation is loading. Please wait.

Providing High Availability Using Lazy Replication Rivaka Ladin, Barbara Liskov, Liuba Shrira, Sanjay Ghemawat Presented by Huang-Ming Huang.

Similar presentations


Presentation on theme: "Providing High Availability Using Lazy Replication Rivaka Ladin, Barbara Liskov, Liuba Shrira, Sanjay Ghemawat Presented by Huang-Ming Huang."— Presentation transcript:

1 Providing High Availability Using Lazy Replication Rivaka Ladin, Barbara Liskov, Liuba Shrira, Sanjay Ghemawat Presented by Huang-Ming Huang

2 Outline Model Algorithm Performance Analysis Discussion

3 Replication Model client RM FE Service Replication Manager Front ends Excerpt from “Distributed Systems – Concept and Design” by Coulouris, Dollimore and Kindberg

4 System Guarantees Each client obtains a consistent service over time Relaxed consistency between replicas Updates are applied with ordering guarantees that make the replicas sufficiently similar.

5 Operation Classification RM FE Client queryval update Query, prevVal, new Update, prev Update id gossip Excerpt from “Distributed Systems – Concept and Design” by Coulouris, Dollimore and Kindberg

6 Update operation classification Causal update Forced update : performed in the same order (relative to one another) at all replicas. Immediate update : performed at all replicas in the same order relative to all other operations.

7 Vector timestamp Given two timestamps T = (t 1,t 2, ,t n ) S = (s 1,s 2, ,s n ) T ≤ S ≡ t i ≤ s i for all i merge(T,S)= (max(t 1,s 1 ),…,max(t n,s n )) Each part of the vector timestamp corresponds to each replica manager in the system.

8 RM components Replica timestamp Update log Value Timestamp Value Timestamp table Executed operation table FE Other replicas Gossip Messages Updates Operationprevid Replica TimestampReplica log stable updates Excerpt from “Distributed Systems – Concept and Design” by Coulouris, Dollimore and Kindberg

9 Query The replica manager blocks the query q operation until the condition holds: q.prev <= valueTS The replica manger returns valueTS back to FE. FE updates its own timestamp frontEndTS := merge(frontEndTS, new)

10 ( r 1,r 2,…,r i +1,…,r n ) Causal Update ( r 1,r 2,…,r i,…,r n ) Update log FE ValueTS Value Executed operation table (p 1,p 2,…p n,) operationid ts=(p 1,p 2,…,p i +1,…,p n ) logRecord =(i, ts, u.op, u.prev, u.id) ts r.u.prev ≤ valueTS merge(ValueTS, r.ts) apply(value.r.u.op) executed  r.u.id Replication Manager i

11 Gossip messages Goal : bring the states of replication managers up to date. Consists of : Replication timestamp Update log Upon receiving gossip Merge the arriving log with its own Apply any unexecuted stable updates Eliminate redundant log and executed operation table entries

12 Control the size of update log Timestamp table keeps recent timestamps from messages sent by all other replicas. A log record r can be removed from the log when r.ts r.i < timestamp_table[j] r.i, for all j

13 Control the size of executed operation table Each update carries an extra time field FE returns an ACK Contains FE ’ s clock time after receiving the response for an update from RM. RM inserts the received ACK to the log.

14 Control the size of executed operation table (con ’ t) A message m from FE is late if m.time + δ< replica ’ s clock time An update is discard if it is late An ACK is kept at least until it is late Remove an entry c in executed operation table when an ACK for c ’ s update is received all records for c ’ s update have been discarded.

15 Forced Update Use the primary to assign a global unique identifier. The primary carries out a two phase protocol for updates.

16 Two phase protocol Upon receiving an update, the primary sends it to all other replicas. Upon receiving responses from all most half of the backups, the primary commit the update by insert the record to its log. Backups know the commitment from gossip messages.

17 Fail Recovery New coordinator informs participants about the failure. Participants inform coordinator about most recent forced updates Coordinator assign UID with the largest it knows after the sub-majority of replicas has responded.

18 Immediate Update Primary use 3 phase protocol. Pre-prepare Prepare Commit

19 3 phase protocol FE Update log primary backup update Give me your log and timestamp logRecord Update id

20 Number of Messages for different operations Query : 2 Casual : 2 + (N-1)/K Forced : 2  N/2  + (N-1)/K Immediate : 2N +2(  N/2  -1)+(N-1)K N : the number of replicas K : the number of update/ack pairs in a gossip.

21 Capacity of a 3-replica system Excerpt from “ Providing high Availability Using Lazy Replication ” by Ladin, Liskov, Shrira and Ghemawat

22 Capacity of the Unreplicated System Excerpt from “ Providing high Availability Using Lazy Replication ” by Ladin, Liskov, Shrira and Ghemawat

23 Discussion No time guarantee for gossip messages Not generally suitable for real-time application such as realtime conference updating shared document. Scalability Timestamp space grows as number of replicas grow. can be increased by making most of the replicas read-only

24 Qustions?


Download ppt "Providing High Availability Using Lazy Replication Rivaka Ladin, Barbara Liskov, Liuba Shrira, Sanjay Ghemawat Presented by Huang-Ming Huang."

Similar presentations


Ads by Google