Presentation is loading. Please wait.

Presentation is loading. Please wait.

Consistency and CAP 1. Reasons for Replication Data replication is a common technique in distributed systems. There are two reasons for data replication:

Similar presentations


Presentation on theme: "Consistency and CAP 1. Reasons for Replication Data replication is a common technique in distributed systems. There are two reasons for data replication:"— Presentation transcript:

1 Consistency and CAP 1

2 Reasons for Replication Data replication is a common technique in distributed systems. There are two reasons for data replication: – It creases the reliability of a system. If one replica is unavailable or crashes, use another Protect against corrupted data – It improves the performance of a system. Scale with size of the distributed system (replicated Web servers) Scale in geographically distributed systems (Web proxies) The key issue is the need to maintain consistency of replicated data. – If one copy is modified, others become inconsistent. 2

3 Potential Issue 3

4 4

5 5

6 6

7 7

8 Consistency model A consistency model defines the semantics of read operation of a data store That is, the set of possible values that can be returned by the data store. 8

9 Strong Consistency: clients of a data storage always see the latest value that was written for a data object Eventual Consistency: – The data returned by a read operation is the value of the object at some past point in time but not necessarily the latest value. – The replicated objects converge towards identical copies in the absence of updates. 9

10 10 Enforcing Strong Consistency (Quorum algorithm) Two weights are given. – One is for the write operation ww – The other is for the read operation rw. – Each replication site has one token that carries some weight (normally 1). In order to carry out a read/write operation, a client needs to collect enough weight (i.e. tokens). – A read operation can be carried out if the weight collected by the client is greater or equal than rw; – A write operation can be carried out if the weight collected by the client is greater or equal to ww.

11 11 rw and ww have to satisfy the following: – ww > ns/2 ensure that the write operations are exclusive – (rw + ww) > ns ensure that read and writes excludes each other a read operation will never read a value which is out-of-date – ns is the number of tokens in the system, and each token carries weight 1

12 Enforcing Eventual Consistency Enforcing eventual consistency relies on two properties: – total propagation: updates can reach all the sites – consistent ordering: updates are processed by all the sites in consistent order 12

13 Issues with Strong and Eventual Consistency Strong consistency generally results in lower performance and reduced availability for reads or writes or both. Eventual consistency can confuse users and applications – A user may read some value for a data item and then later read an older value. – A user may update some data item based on reading some other data, while others read the updated item without seeing the data on which it is based. – These issues can appear even when only a single user or application is making data modifications. 13

14 Consistency models in industry Microsoft Azure: strong consistency Amazon S3: eventual consistency Many others: between strong and eventual 14

15 Weak consistency models with session guarantees A session is an abstraction for the sequence of read and write operations performed during the execution of an application. Let DB(S,t) be the ordered sequence of Writes that have been received by server S at or before time t. Four models – Read Your Writes – Monotonic Reads – Writes Follow Reads – Monotonic Writes 15

16 Read Your (my) Writes RYW-guarantee: If Read R follows Write W in a session and R is performed at server S at time t, then W is included in DB(S,t). After having updated a data item, a process always accesses the updated value and never sees an older value. 16

17 Monotonic Reads RelevantWrites(S,t,R) denotes the function that returns the set of Writes that define the values retrieved by read operation R. MR-guarantee: If Read R1 occurs before R2 in a session and R1 accesses server S1 at time t1 and R2 accesses server S2 at time t2, then RelevantWrites(S1,t1,R1) is a subset of DB(S2,t2). If a process has seen a particular value for the object, any subsequent accesses will never return any previous values. 17

18 Writes Follow Reads WFR-guarantee: If Read R1 precedes Write W2 in a session and R1 is performed at server S1 at time t1, then, for any server S2, if W2 is in DB(S2) then any W1 in RelevantWrites(S1,t1,R1) is also in DB(S2) and WriteOrder(W1,W2). The casual order of a sequence of write/read/write operations must be observed. 18

19 Monotonic Writes MW-guarantee: If Write W1 precedes Write W2 in a session, then, for any server S2, if W2 in DB(S2) then W1 is also in DB(S2) and WriteOrder(W1,W2). The system guarantees to serialize the writes by the same process. 19

20 CAP Theorem for distributed systems Cloud providers normally replicate data at different locations. – For availability and reliability To maintain the consistency of the replicated data, the operations on the replicated data need to be synchronised. The CAP theorem states that any networked shared-data system can have at most two of three desirable properties: – consistency (C) equivalent to having a single up-to- date copy of the data; – high availability (A) of that data (for updates); – tolerance to network partitions (P). 20

21 Consistency Consistency means the multiple copies need to be kept perfectly synchronised. – To require perfect consistency, communication between data centers hosting the copies is paramount. – The overall performance of such a system could drop as the number of replica required goes up. 21

22 Availability Although having a single copy of a data item ensures perfect consistency, such an arrangement does not provide high availability. – The copy becomes unavailable when the site hosting the copy goes down The only solution to the availability issue is through replicating the data. Increasing the number of sites with copies of the data directly increases the availability of the system. Replicas also help in load balancing concurrent read operations. 22

23 Partition Tolerance We get Consistency and Availability by replicating data on different data centers. If the network connectivity between two data centers is lost, both data centers are incapable of synchronizing state with each other. – If we allow read/write operations on these two data centers, it can be shown that the data in the two data centers won’t be consistent anymore. – If we decide consistency is important, and disable write operations during the network outage, we will loose “availability” as no update operations can be carried out. 23

24 Managing partition In the presence of partition, many systems choose availability at the expense of consistency (temporarily). – The consistency will be restored after the partition is repaired. – Eventual consistency 24

25 non-traditional data stores (NoSQL) Modern applications tend to process large amount of data. If an application relies upon persistence (i.e. storing data in databases), then data storage will probably become bottleneck There are two strategies for scaling any application. – vertical scaling – horizontal scaling 25

26 vertical scaling Vertical scaling means moving the application to larger computers. Advantage: easy Limitations – outgrowing the capacity of the largest system available – expensive 26

27 horizontal scaling Two approaches – Functional scaling: group data by function and spreading functional groups across databases. – Sharding: Split data within functional areas across multiple databases. 27

28 28

29 29

30 Sharding and functional scaling can be mixed. 30

31 ACID properties A task normally involves several operations on the DB. In a traditional DB, to maintain the consistency of the DB, the operations are grouped into a transaction. A transaction has the ACID properties. Atomicity. All of the operations in the transaction will complete, or none will. Consistency. The database will be in a consistent state when the transaction begins and ends. Isolation. The transaction will behave as if it is the only operation being performed upon the database. Durability. Upon completion of the transaction, the operation will not be reversed. 31

32 two-phase commit To ensure the consistency of a DB stored on several hosts, the 2PC is used to commit the operations of a transaction. First, the transaction coordinator asks each database involved to precommit the operation and indicate whether commit is possible. – If all databases agree the commit can proceed, then phase 2 begins. – If any database vetoes the commit, then all databases are asked to roll back their portions of the transaction. The transaction coordinator asks each database to commit the data. 32

33 A 2PC can only successfully complete if all its participants (i.e. the hosts of the DB) are available. The availability of the whole system is the product of the availability of each machine in the system. So, the availability of a system with many machines is lower than the availability of a single machine. ACID favours consistency over availability. 33

34 Many applications require high availability and good performance. According to the CAP theorem, there is a trade-off between consistency and availability. The NoSQL movement is about creating choices that focus on availability first and consistency second. Instead of conforming to ACID, NoSQL systems use BASE (basically available, soft state, eventually consistent). 34

35 Basically available The system supports data availability in a partial system failure. – there will be a response to any request. – some responses might fail to obtain the requested data or the data may be in an inconsistent state For example, if data are partitioned across five servers, BASE design encourages crafting operations in such a way that a server failure impacts only the operations accessing the data on the failed server. 35

36 Eventually Consistent The system will eventually become consistent once it stops receiving input. – Updates will be propagated to every site eventually 36

37 Begin transaction Insert into transaction(id, seller_id, buyer_id, amount); Queue message “update user(“seller”, seller_id, amount)”; Queue message “update user(“buyer”, buyer_id, amount)”; End transaction To implement BASE, many systems rely on some sort of message queue to persistently store and route data to various storage services that perform the actual database operations. 37

38 ACID conforming systems ensure strong consistency. In BASE conforming systems, concurrent and possible inconsistent updates might occur. – Concurrent updating the same calendar entry while the system is partitioned. Partially ordered timestamps and event logging should be used for the events in a BASE conforming system. – Identifying and resolving conflicting and inconsistent updates. 38

39 Soft State For many operations, it takes some time for the states of the system to settle. – Transfer money between two accounts – After deducting an amount from one user, the amount to be credited to the other user is placed in a message queue waiting to be executed – There is a lag between the start and the end of the transfer when the money is in the message queue and does not appear in neither users’ accounts For many applications, this time lag is acceptable. Accepting the state of the system is constantly changing means accepting the fact that sometimes the data returns from the system might not be accurate. 39

40 reviews Understand strong consistency and eventual consistency. How to ensure strong consistency? What are the issues with strong consistency and eventual consistency? Understand the four weak consistency models with session guarantees. For each of the four models above, describe an application that can benefit from the model. For each of the four models, describe a concrete scheme that can be used to enforce the model. 40

41 reviews Understand the CAP theorem for distributed systems. Assume you are developing a banking and a shopping cart applicant. Discuss how the CAP theorem and the operating environment might affect your development. 41

42 reviews Understand vertical scaling and horizontal scaling. Explain why horizontal scaling might cause difficulty in ensuring the ACID property. Why do many systems not conform to the ACID property? Understand the BASE properties. In BASE, how does a system ensure that an operation can be carried out in the presence of a system failure? [Note: a failure could be a crash failure of a site or a system partition] If you design a system conforming to the BASE properties, what issues do you need to address in the presence of system partition? 42

43 Further readings Douglas B. Terry, Alan J. Demers, Karin Petersen, Mike Spreitzer, Marvin Theimer, and Brent W. Welch. 1994. Session Guarantees for Weakly Consistent Replicated Data. In Proceedings of the Third International Conference on Parallel and Distributed Information Systems (PDIS '94). IEEE Computer Society, Washington, DC, USA, 140-149. Eric Brewer, "CAP Twelve Years Later: How the "Rules" Have Changed," Computer, vol. 45, no. 2, pp. 23-29, Feb. 2012 Dan Pritchett. 2008. BASE: An Acid Alternative. Queue 6, 3 (May 2008), 48-55. 43


Download ppt "Consistency and CAP 1. Reasons for Replication Data replication is a common technique in distributed systems. There are two reasons for data replication:"

Similar presentations


Ads by Google