Chapter 7 Consistency And Replication (CONSISTENCY PROTOCOLS)


1 Chapter 7 Consistency And Replication (CONSISTENCY PROTOCOLS)
Anuja P Parameshwaran. Instructor: Dr. Yanqing Zhang. Advanced Operating System.

2 Overview:
- Continuous Consistency: Bounding Numerical Deviations, Bounding Staleness Deviations, Bounding Ordering Deviations
- Sequential Consistency
- Primary-Based Protocols: Remote-Write Protocols, Local-Write Protocols
- Replicated-Write Protocols: Active Replication, Voting (Quorum-Based Protocols)
- Cache-Coherence Protocols
- Implementing Client-Centric Consistency
- Summary

3 Recap
What is a Consistency Model?
- Used in distributed systems such as distributed shared-memory systems or distributed data stores.
- The system supports a given model if operations on memory follow specific rules.
- Usually defines rules for the apparent order and visibility of updates.
What is a Consistency Protocol?
- Describes an implementation of a specific consistency model.

4 Continuous Consistency:
Developed by Yu and Vahdat in order to tackle three forms of consistency.
Bounding Numerical Deviation: a solution for keeping the numerical deviation within bounds.
- Concentrate on writes to a single data item x.
- Each write W(x) has an associated weight, weight(W), that represents the numerical value by which x is updated [assume weight(W) > 0].

5 Continuous Consistency: Numerical Deviation
W is initially forwarded to one of the N replicas, denoted as origin(W).
TW[i,j] denotes the writes executed by server Si that originated from Sj:
TW[i,j] = ∑ { weight(W) | origin(W) = Sj and W ∈ Li }
The goal is, for any time t, to let the current value vi at server Si deviate within bounds from the actual value v(t) of x. The actual value is completely determined by all submitted writes:
Actual value: v(t) = v(0) + ∑_{k=1..N} TW[k,k]
Value vi of x at replica i: vi = v(0) + ∑_{k=1..N} TW[i,k]

6 Continuous Consistency: Numerical Deviation
For every server Si, associate an upper bound δi such that we need to enforce: v(t) - vi ≤ δi, for every server Si.
Sk maintains a view TWk[i,j] of what it believes Si will have as its value for TW[i,j].
Solution: Sk sends operations from its log to Si when it sees that TWk[i,k] is getting too far behind TW[k,k]; in particular, when
TW[k,k] - TWk[i,k] > δi / (N - 1)
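The push rule above can be made concrete in a few lines. A minimal sketch, with in-memory servers and a print in place of real message passing; the class and method names are illustrative, not from the slides:

```python
# Sketch of the push rule for bounding numerical deviation (names illustrative).
# Each server k tracks tw_own = TW[k,k], the total weight of writes it has
# originated, and view[i] = TW_k[i,k], what it believes server i has seen of them.

class Server:
    def __init__(self, server_id, n_servers, deltas):
        self.id = server_id
        self.n = n_servers
        self.deltas = deltas            # deltas[i] = delta_i, the bound for S_i
        self.tw_own = 0.0               # TW[k,k]
        self.view = [0.0] * n_servers   # view[i] = TW_k[i,k]
        self.log = []                   # local writes, kept for forwarding

    def submit_write(self, weight):
        assert weight > 0               # the slides assume weight(W) > 0
        self.tw_own += weight
        self.log.append(weight)
        self.maybe_push()

    def maybe_push(self):
        # Push rule: forward the log to S_i as soon as
        #   TW[k,k] - TW_k[i,k] > delta_i / (N - 1)
        for i in range(self.n):
            if i != self.id and self.tw_own - self.view[i] > self.deltas[i] / (self.n - 1):
                self.push_log_to(i)

    def push_log_to(self, i):
        # Stand-in for sending self.log over the network to S_i.
        print(f"S{self.id} pushes its log (total weight {self.tw_own}) to S{i}")
        self.view[i] = self.tw_own      # S_i is now assumed up to date

# Example: 3 servers, each tolerating a numerical deviation of at most 2.0.
s0 = Server(0, 3, deltas=[2.0, 2.0, 2.0])
s0.submit_write(0.8)    # gap 0.8 <= 2.0/2: no push yet
s0.submit_write(0.9)    # gap 1.7 > 1.0: pushes the log to S1 and S2
```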

7 Bounding Staleness Deviations:
There are many ways to keep the staleness of replicas within specified bounds.
One approach: timestamp each submitted write by its origin server.
Let server Sk keep a real-time vector clock RVCk where RVCk[i] = T(i) means that Sk has seen all writes that have been submitted to Si up to time T(i). T(i) denotes the time local to Si.

8 Bounding Staleness Deviations:
If the clocks of the replica servers are loosely synchronized, then whenever server Sk notes that T(k) - RVCk[i] is about to exceed a specified limit, it simply starts pulling in writes that originated from Si with a timestamp later than RVCk[i].
In other words, a replica server is responsible for keeping its copy of x up to date regarding writes that have been issued elsewhere.
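A minimal sketch of this pull rule, assuming loosely synchronized wall clocks; StalenessBoundedReplica and pull_writes_from are illustrative stand-ins for the real machinery:

```python
# Sketch of staleness bounding with a real-time vector clock (illustrative;
# assumes loosely synchronized clocks, with pull_writes_from standing in for
# the real transfer of writes).
import time

class StalenessBoundedReplica:
    def __init__(self, server_id, n_servers, max_staleness):
        self.id = server_id
        self.max_staleness = max_staleness    # the specified limit, in seconds
        # rvc[i] = T(i): this server has seen all writes submitted to S_i up to T(i)
        self.rvc = [time.time()] * n_servers

    def check_peer(self, i):
        # When T(k) - RVC_k[i] is about to exceed the limit, pull in writes
        # that originated from S_i with timestamps later than RVC_k[i].
        now = time.time()
        if now - self.rvc[i] > self.max_staleness:
            self.pull_writes_from(i, since=self.rvc[i])
            self.rvc[i] = now    # in reality: the newest pulled timestamp

    def pull_writes_from(self, i, since):
        print(f"S{self.id} pulls writes from S{i} newer than {since:.3f}")

replica = StalenessBoundedReplica(0, n_servers=3, max_staleness=5.0)
replica.rvc[1] -= 10.0    # pretend S1 has not been heard from for 10 seconds
replica.check_peer(1)     # exceeds the 5-second bound, so a pull is triggered
```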

9 Numerical vs. Staleness Bounds:
Numerical bounds follow a push approach: an origin server keeps replicas up to date by forwarding writes.
Staleness bounds follow a pull approach: replica servers pull in updates from origin servers.

10 Bounding Ordering Deviations
Each replica server tentatively applies the updates that are submitted to it, so each server has a local queue of tentative writes whose order is yet to be determined. The ordering deviation is bounded by specifying the maximal length of this queue. When the bound is reached, a globally consistent ordering of the tentative writes is enforced using primary-based or quorum-based protocols.
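A minimal sketch of the bounded queue of tentative writes; the commit step is a placeholder for whichever primary-based or quorum-based protocol supplies the global order (names are illustrative):

```python
# Sketch of bounding the ordering deviation (illustrative names): tentative
# writes queue up locally; once the queue is full, the server must first have
# the pending writes committed in a globally agreed order.
from collections import deque

class OrderingBoundedReplica:
    def __init__(self, max_tentative):
        self.max_tentative = max_tentative
        self.tentative = deque()    # tentative writes, order yet to be determined

    def submit(self, write):
        if len(self.tentative) >= self.max_tentative:
            self.commit_pending()   # bound reached: enforce a global order first
        self.tentative.append(write)

    def commit_pending(self):
        # Placeholder for a primary-based or quorum-based protocol that fixes
        # the global order; here the writes are simply applied definitively.
        while self.tentative:
            print("committed in agreed order:", self.tentative.popleft())

r = OrderingBoundedReplica(max_tentative=2)
for w in ["w1", "w2", "w3"]:    # the third submission forces a commit round
    r.submit(w)
```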

11 Primary-Based Protocols
Provide a straightforward implementation of sequential consistency: all processes see all write operations in the same order, no matter which backup server they use to perform read operations.
A distinction can be made as to whether:
- The primary is fixed at a remote server, or
- Write operations can be carried out locally after moving the primary to the process where the write operation is initiated.

12 Remote-Write Protocols
All write operations need to be forwarded to a fixed single server; read operations can be carried out locally. Also called the primary-backup protocol.
Disadvantage: it may take a relatively long time before the process that initiated the update is allowed to continue, since an update is implemented as a blocking operation.
Alternative: a non-blocking approach.

13 Remote-Write Protocols

14 Remote-Write Protocols
Non-blocking approach: as soon as the primary has updated its local copy of x, it returns an acknowledgment; only after that does it tell the backup servers to perform the update as well.
Advantage: write operations may speed up considerably.
Disadvantage: fault tolerance suffers, since an acknowledged update may not yet be backed up by the other servers.
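A minimal sketch contrasting the blocking and non-blocking variants; Primary, Backup, and flush are illustrative names, and the deferred list stands in for a real asynchronous send:

```python
# Sketch contrasting blocking and non-blocking primary-backup updates
# (illustrative names; the pending list stands in for asynchronous propagation).

class Backup:
    def __init__(self):
        self.value = None
    def apply(self, value):
        self.value = value

class Primary:
    def __init__(self, backups):
        self.value = None
        self.backups = backups
        self.pending = []    # updates acknowledged but not yet propagated

    def write(self, value, blocking=True):
        self.value = value                    # update the primary's local copy
        if blocking:
            self.propagate(value)             # client waits for the backups
        else:
            self.pending.append(value)        # acknowledge now, propagate later
        return "ack"

    def propagate(self, value):
        for b in self.backups:
            b.apply(value)

    def flush(self):
        # The non-blocking case: the primary tells the backups afterwards.
        # If it crashes before this point, the acknowledged update is lost.
        for value in self.pending:
            self.propagate(value)
        self.pending.clear()

primary = Primary([Backup(), Backup()])
primary.write(42)                    # blocking: backups hold 42 before the ack
primary.write(43, blocking=False)    # ack returned; backups still hold 42
primary.flush()                      # propagation happens afterwards
```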

15 Local-Write Protocols
When a process wants to update data item x, it locates the primary copy of x and subsequently moves it to its own location.
Advantage (in the non-blocking variant): multiple successive write operations can be carried out locally, while reading processes can still access their local copy. Updates are propagated to the replicas after the primary has finished performing them locally.
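A minimal sketch of the primary migration; Item and local_write are illustrative, and the migration is reduced to updating a location field:

```python
# Sketch of a local-write update (illustrative names): the primary copy of x
# migrates to the writing process, so successive writes become local.

class Item:
    def __init__(self, value):
        self.value = value
        self.primary_at = "server-A"    # current location of the primary copy

def local_write(item, here, new_value):
    if item.primary_at != here:
        # Locate the primary copy of x and move it to the writer's location.
        print(f"moving primary of x from {item.primary_at} to {here}")
        item.primary_at = here
    item.value = new_value              # from now on, writes at `here` are local

x = Item(0)
local_write(x, "server-B", 1)    # first write migrates the primary
local_write(x, "server-B", 2)    # second write is purely local
```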

16 Local-Write Protocols

17 Local-Write Protocols
Example: a primary-backup protocol with local writes, as used in distributed file systems that require a high degree of fault tolerance.
(Figure: a fixed central server and several replica servers. 1) Writes are propagated to a replica server to speed up performance; 2) updates are propagated back; 3) updates are distributed to the other replica servers.)

18 Replicated-Write Protocols
Write operations can be carried out at multiple replicas instead of only one.
A distinction can be made between:
- Active replication, in which an operation is forwarded to all replicas
- Majority voting (quorum-based protocols)

19 Active Replication
Each replica has an associated process that carries out update operations (the operation is sent to each replica).
Problem: operations need to be carried out in the same order everywhere. This calls for a totally-ordered multicast mechanism; such a multicast can be implemented using Lamport's logical clocks.
Disadvantage: this implementation of multicasting does not scale well in large distributed systems.
Alternative: total ordering can be achieved using a central coordinator, also called a sequencer.

20 Active Replication
Forward each operation to the sequencer, which assigns it a unique sequence number and subsequently forwards the operation to all replicas. Operations are carried out in the order of their sequence number.
(Figure: clients forward operations to the sequencer/central coordinator, which numbers them, e.g. 1.1, 1.2, and forwards them to the replicas.)
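A minimal sketch of sequencer-based total ordering; Sequencer and Replica are illustrative names, and delivery is a direct method call rather than a real multicast:

```python
# Sketch of sequencer-based total ordering (illustrative names; delivery is a
# direct call rather than a real multicast).
import heapq

class Sequencer:
    def __init__(self):
        self.next_seq = 0
    def assign(self, op):
        seq = self.next_seq          # unique, monotonically increasing number
        self.next_seq += 1
        return seq, op

class Replica:
    def __init__(self):
        self.next_expected = 0
        self.pending = []            # min-heap of operations that arrived early
        self.applied = []

    def deliver(self, seq, op):
        heapq.heappush(self.pending, (seq, op))
        # Apply strictly in sequence-number order, buffering any gaps.
        while self.pending and self.pending[0][0] == self.next_expected:
            self.applied.append(heapq.heappop(self.pending)[1])
            self.next_expected += 1

seqr = Sequencer()
replicas = [Replica(), Replica()]
for op in ["w(x)=1", "w(x)=2"]:
    seq, numbered_op = seqr.assign(op)
    for r in replicas:               # forward to all replicas
        r.deliver(seq, numbered_op)
assert all(r.applied == ["w(x)=1", "w(x)=2"] for r in replicas)
```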

21 Quorum-Based Protocols
A different approach to supporting replicated writes is to use voting: clients must request and acquire the permission of multiple servers before either reading or writing a replicated data item.
How the algorithm works: a file is replicated on N servers. To update a file, a client must first contact at least half the servers plus one (a majority) and get them to agree to do the update. Once they have agreed, the file is changed and a new version number is associated with the new file.

22 Quorum-Based Protocols
To read a replicated file, a client must also contact at least half the servers plus one and ask them to send the version numbers associated with the file. If all the version numbers are the same, this must be the most recent version: since every update is accepted by a majority, a newer version would have reached at least one of the servers in any majority the reader contacts.
(Figure: five servers, of which three hold version 8 and two hold version 9.)

23 Quorum-Based Protocols
To read a file of which N replicas exist, a client needs to assemble a read quorum: an arbitrary collection of NR servers, or more. To modify a file, a write quorum of at least NW servers is required.
The values of NR and NW are subject to the following two constraints:
- NR + NW > N (to prevent read-write conflicts)
- NW > N/2 (to prevent write-write conflicts)
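A minimal sketch that checks the two constraints and resolves a read over a quorum; valid_quorums and read_value are illustrative helpers, and the example values are chosen only to exercise each constraint (the last one being the ROWA choice mentioned on the next slide):

```python
# Sketch of the quorum constraints and a quorum read (illustrative helpers;
# the example numbers are chosen only to exercise each constraint).

def valid_quorums(n, n_r, n_w):
    no_rw_conflict = n_r + n_w > n    # every read quorum meets every write quorum
    no_ww_conflict = n_w > n / 2      # any two write quorums overlap
    return no_rw_conflict and no_ww_conflict

def read_value(quorum):
    """quorum: (version, value) pairs returned by a read quorum of N_R servers.
    The pair with the highest version number is the current one."""
    return max(quorum, key=lambda pair: pair[0])

assert valid_quorums(n=12, n_r=3, n_w=10)      # a correct choice
assert not valid_quorums(n=12, n_r=7, n_w=6)   # N_W <= N/2: write-write conflicts
assert valid_quorums(n=12, n_r=1, n_w=12)      # ROWA: read one, write all

print(read_value([(8, "old"), (9, "new"), (8, "old")]))   # -> (9, 'new')
```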

24 Quorum-Based Protocols
Figure: Three examples of the voting algorithm. (a) A correct choice of read and write set. (b) A choice that may lead to write-write conflicts. (c) A correct choice, known as ROWA (read one, write all).

25 Cache-Coherence Protocols
Caching solutions may differ in their coherence detection strategy (when inconsistencies are detected) and their coherence enforcement strategy (how they are resolved).
Dynamic solutions detect inconsistencies at runtime. For example, a check is made with the server to see whether the cached data have been modified since they were cached.
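A minimal sketch of such runtime detection, assuming the server keeps a modification counter per item; OriginServer, CachingClient, and modified_since are illustrative names, not a real API:

```python
# Sketch of dynamic (runtime) coherence detection, assuming the server keeps a
# per-item modification counter; names are illustrative, not a real API.

class OriginServer:
    def __init__(self):
        self.data = {"x": ("v1", 1)}    # name -> (value, modification counter)
    def modified_since(self, name, mod):
        return self.data[name][1] > mod
    def fetch(self, name):
        return self.data[name]

class CachingClient:
    def __init__(self, server):
        self.server = server
        self.cache = {}                 # name -> (value, modification counter)

    def read(self, name):
        if name in self.cache:
            value, mod = self.cache[name]
            # Detection at runtime: ask the server whether the cached data
            # have been modified since they were cached.
            if not self.server.modified_since(name, mod):
                return value            # still coherent; serve from the cache
        self.cache[name] = self.server.fetch(name)
        return self.cache[name][0]

client = CachingClient(OriginServer())
print(client.read("x"))    # miss: fetched from the server
print(client.read("x"))    # hit: validated, then served from the cache
```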

26 Implementing Client-Centric Consistency
Naive implementation: each write operation W is assigned a globally unique identifier. Such an identifier is assigned by the server to which the write was submitted (the origin of W). For each client, we keep track of two sets of writes:
- The read set consists of the writes relevant for the read operations performed by the client.
- The write set consists of the identifiers of the writes performed by the client.
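A minimal sketch of this bookkeeping; OriginServer and Client are illustrative, and the "server:counter" identifier format is an assumption made only to obtain globally unique ids:

```python
# Sketch of the naive bookkeeping (illustrative; the "server:counter" id format
# is an assumption made only to obtain globally unique identifiers).
import itertools

class OriginServer:
    _counter = itertools.count()        # shared, so ids never collide

    def __init__(self, name):
        self.name = name

    def assign_write_id(self):
        # Globally unique identifier for a write submitted to this server.
        return f"{self.name}:{next(OriginServer._counter)}"

class Client:
    def __init__(self):
        self.read_set = set()     # writes relevant to this client's reads
        self.write_set = set()    # ids of writes performed by this client

server_a = OriginServer("A")
client = Client()
client.write_set.add(server_a.assign_write_id())    # e.g. {"A:0"}
```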

27 Implementing Client-Centric Consistency
Monotonic-read consistency implementation: when a client performs a read operation at a server, that server is handed the client's read set to check whether all the identified writes have taken place locally (the size of the set may introduce a performance problem). If not, the server contacts other servers to ensure that it is brought up to date before carrying out the read operation; alternatively, the read operation is forwarded to a server where the write operations have already taken place. After the read operation is performed, the write operations that have taken place at the selected server and that are relevant for the read operation are added to the client's read set.
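A minimal sketch of the monotonic-read check; ReplicaServer and pull are illustrative names, with pull standing in for contacting the other servers:

```python
# Sketch of the monotonic-read check (illustrative names; pull stands in for
# contacting the other servers, and the value read is a placeholder).

class ReplicaServer:
    def __init__(self, seen=()):
        self.seen_writes = set(seen)    # ids of writes performed locally

    def read(self, client_read_set):
        missing = client_read_set - self.seen_writes
        if missing:
            # Bring this server up to date before serving the read
            # (alternatively, forward the read to a server that has them).
            self.pull(missing)
        # Perform the read and report which writes it depended on.
        return "value-of-x", set(self.seen_writes)

    def pull(self, ids):
        self.seen_writes |= set(ids)    # stand-in for fetching from peers

read_set = {"A:1"}                          # the client has seen write A:1
server = ReplicaServer(seen=["A:0"])        # this server has not
value, relevant = server.read(read_set)     # server pulls A:1 first
read_set |= relevant                        # the client's read set grows
```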

28 Implementing Client-Centric Consistency
Monotonic-write consistency implementation: when a client initiates a new write operation at a server, the server is handed the client's write set. It then ensures that the identified write operations are performed first, and in the correct order. After performing the new operation, that operation's write identifier is added to the write set.
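A minimal sketch of the monotonic-write rule; for simplicity it assumes the write identifiers sort in submission order, which a real system would guarantee differently:

```python
# Sketch of the monotonic-write rule (illustrative; for simplicity the write
# ids are assumed to sort in submission order).

def monotonic_write(server_log, client_write_set, new_write_id):
    # Ensure the client's earlier writes are performed first, in order.
    for wid in sorted(client_write_set):
        if wid not in server_log:
            server_log.append(wid)         # replay the earlier write
    server_log.append(new_write_id)        # now perform the new write
    client_write_set.add(new_write_id)     # and record its identifier

log, write_set = [], {"A:0", "A:1"}
monotonic_write(log, write_set, "B:2")
print(log)    # ['A:0', 'A:1', 'B:2']
```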

29 Implementing Client-Centric Consistency
Read-your-writes consistency: requires that the server where the read operation is performed has seen all the write operations in the client's write set. The writes can be fetched from other servers before the read operation is performed.
Problem: this could lead to poor response times.

30 Implementing Client-Centric Consistency
Writes-follow-reads consistency: implemented by first bringing the selected server up to date with the write operations in the client's read set, and then, later, adding the identifier of the new write operation to the client's write set, along with the identifiers in the read set.
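The last two models reuse the same sets with the roles swapped; a minimal illustrative sketch, with the actual read and write left as comments:

```python
# Sketch of the remaining two combinations (illustrative): read-your-writes
# syncs the server with the client's WRITE set before a read, while
# writes-follow-reads syncs it with the client's READ set before a write.

def read_your_writes(server_seen, client_write_set):
    server_seen |= client_write_set    # fetch the client's own writes first
    # ... then perform the read

def writes_follow_reads(server_seen, client_read_set, client_write_set, wid):
    server_seen |= client_read_set     # bring the server up to date first
    # ... then perform the write and record its identifier
    client_write_set.add(wid)

seen, reads, writes = set(), {"A:1"}, {"B:2"}
read_your_writes(seen, writes)                     # server now has B:2
writes_follow_reads(seen, reads, writes, "B:3")    # server has A:1 before B:3
```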

31 Improving Efficiency
The read set and write set associated with each client can become very large.
Solution: a client's read and write operations are grouped into sessions. A session is typically associated with an application: it is opened when the application starts and is closed when it exits. Whenever a client closes a session, the sets are cleared.

32 Summary
Consistency protocols describe specific implementations of consistency models. A distinction can be made between primary-based protocols and replicated-write protocols. In primary-based protocols, all update operations are forwarded to a primary copy that subsequently ensures each update is properly ordered and forwarded. In replicated-write protocols, an update is forwarded to several replicas at the same time.

33 Present…
Hardware-Based Protocols:
- Snoopy protocol
- Directory based protocol
- Hybrid cache coherence protocol
- Lock based protocol
Software-Based Protocols:
- MSI protocol
- MESI protocol
- MOSI protocol
- MOESI protocol
- Dragon protocol

34 Cache Coherence for GPU Architectures:
GPU cache coherence is difficult and challenging: conventional cache coherence algorithms are not well suited for GPUs. Managing transient state for thousands of in-flight memory accesses adds hardware and complexity overhead, and coherence adds unnecessary traffic overhead for existing GPU applications. Temporal Coherence is a timestamp-based coherence framework that reduces the overheads of GPU coherence.
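A very rough sketch of the timestamp idea, heavily simplified from the Temporal Coherence paper cited below; the lease field and the numbers are invented for illustration:

```python
# Heavily simplified sketch (not the paper's design): a cached block carries an
# expiry timestamp and self-invalidates once the (globally synchronized) time
# passes it, so no invalidation messages need to be sent.

class TimestampedCacheLine:
    def __init__(self, value, expires_at):
        self.value = value
        self.expires_at = expires_at    # lease granted by the shared cache

    def read(self, now):
        if now >= self.expires_at:
            return None                 # lease expired: self-invalidate, refetch
        return self.value

line = TimestampedCacheLine("data", expires_at=100)
print(line.read(now=50))     # -> 'data' (lease still valid)
print(line.read(now=120))    # -> None (self-invalidated)
```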

35 References:
- Stenström, Per. "A survey of cache coherence schemes for multiprocessors." Computer 23, no. 6 (1990).
- Singh, Inderjit, Arrvindh Shriraman, Wilson W. L. Fung, Mike O'Connor, and Tor M. Aamodt. "Cache coherence for GPU architectures." In 2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA 2013). IEEE, 2013.
- Tanenbaum, Andrew S., and Maarten Van Steen. Distributed Systems. Prentice-Hall, 2007.


