Published by Belinda Richard. Modified over 9 years ago.
1
Replication and Consistency
2
Reference
Jim Gray, Pat Helland, Patrick O'Neil, and Dennis Shasha. The Dangers of Replication and a Solution. In Proceedings of the ACM SIGMOD International Conference on Management of Data, 1996.
3
Introduction
- With mobility, replication allows mobile nodes to read and update the database while disconnected from the network.
4
Eager Replication
- All replicas are synchronized to the same value immediately
[Figure: replicas R on a timeline]
5
Eager Replication
- All replicas are synchronized to the same value
- Reduces update performance and increases response time
[Figure: replicas R on a timeline]
6
Lazy Replication
- One replica is updated by the transaction
- Replicas synchronize asynchronously
- Multiple versions of the data exist
[Figure: replicas R on a timeline]
7
Example
- Consider a joint checking account. Suppose that it has $1,000 in it.
- The account is replicated in three places: the wife's checkbook, the husband's checkbook, and the bank's ledger.
- Eager replication assumes that all three books have the same account balance.
  - It prevents the husband and wife from writing checks totaling more than $1,000.
8
Example
- Lazy replication allows both the husband and the wife to write checks totaling $1,000 each, for a total of $2,000 in withdrawals.
- When these checks arrive at the bank, or when the husband and wife communicate, someone or something reconciles the transactions.
- The bank does the reconciliation by rejecting updates that cause an overdraft.
- Lots of time may be spent reconciling.
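The bank's reconciliation rule can be sketched in a few lines. This is only an illustration, assuming checks are processed in arrival order and that "reject on overdraft" is the whole policy; the names `Check` and `reconcile` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Check:
    writer: str
    amount: int  # whole dollars


def reconcile(balance, checks):
    """Apply checks in arrival order; reject any that would overdraw the account."""
    accepted, rejected = [], []
    for check in checks:
        if check.amount <= balance:
            balance -= check.amount
            accepted.append(check)
        else:
            rejected.append(check)
    return balance, accepted, rejected


# Both spouses write $1,000 in checks against a $1,000 balance.
checks = [Check("wife", 1000), Check("husband", 1000)]
final_balance, accepted, rejected = reconcile(1000, checks)
print(final_balance, [c.writer for c in accepted], [c.writer for c in rejected])
```

Whichever check arrives second is the one rejected, which is why the outcome of lazy replication depends on propagation order.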
9
Example
- The database for a checking account is a single number and a log of updates to that number.
- Databases are usually more complex.
- Disconnected operation and message delays mean lazy replication has more frequent reconciliation.
10
Concurrency Anomaly in Lazy Replication
- R`: which version of the data should it see?
- If a committed transaction is 'wrong', there is a conflict
- Conflicts have to be reconciled
[Figure: timeline of diverging replica versions R, R`, R``, R```]
11
Scaleup pitfall
- When the nodes diverge hopelessly we get system delusion: the database is inconsistent and there is no obvious way to repair it
[Figure: timeline of diverging replica versions R, R`, R``, R```]
12
Regulate Replica Updates
- Group: any node with a copy can update the item
  - Update anywhere
- Master: only a master can update the primary copy. All replicas are read-only. All update requests are sent to the master
13
Replication Strategies
Propagation vs. Ownership

            Lazy                              Eager
Group       N transactions, N object owners   1 transaction, N object owners
Master      N transactions, 1 object owner    1 transaction, 1 object owner
Two tier    N+1 transactions, 1 object owner; tentative local updates, eager base updates
14
Eager Replication and Mobile Nodes
- Reads on disconnected clients may give stale data
- Simple eager replication prohibits updates if any node is disconnected
[Figure: replicas R on a timeline]
15
Eager Replication and Mobile Nodes
- For high availability, eager replication systems allow updates among members of the cluster.
- When a node joins a cluster, the cluster sends the new node all replica updates since the node was disconnected.
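One way to picture the catch-up step is an ordered update log from which a rejoining node replays whatever it missed. The sketch below is an assumption about how this could be arranged, not a description of any particular system; `Cluster`, `Node`, and their methods are hypothetical names.

```python
class Cluster:
    """Keeps an ordered log of replica updates; the list index is the sequence number."""

    def __init__(self):
        self.log = []  # entries are (key, value)

    def update(self, key, value):
        self.log.append((key, value))

    def updates_since(self, seq):
        return self.log[seq:]


class Node:
    def __init__(self):
        self.data = {}
        self.applied = 0  # number of log entries already applied locally

    def join(self, cluster):
        """Replay every update made while this node was disconnected."""
        missed = cluster.updates_since(self.applied)
        for key, value in missed:
            self.data[key] = value
        self.applied += len(missed)
        return len(missed)


cluster = Cluster()
node = Node()
cluster.update("x", 1)
first = node.join(cluster)   # node catches up on one update
cluster.update("x", 2)       # node is disconnected during these updates
cluster.update("y", 5)
second = node.join(cluster)  # node rejoins and replays the two it missed
print(first, second, node.data)
```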
16
Eager Replication and Mobile Nodes
- Even if all the nodes are connected all the time, updates may fail due to deadlocks that prevent serialization errors.
  - The probability of deadlocks, and consequently of failed transactions, rises very quickly with transaction size and with the number of nodes. It is estimated that a 10-fold increase in nodes gives a 1000-fold increase in failed transactions.
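The 10-fold to 1000-fold figure corresponds to cubic growth in the number of nodes. A small sketch, assuming a pure power law with exponent 3 fitted to that one quoted data point (the function name and the default exponent are illustrative, not taken from the slides):

```python
def failed_transaction_growth(node_growth, exponent=3):
    """Factor by which failed transactions grow when the node count grows by node_growth.

    Assumes a power law; exponent=3 matches the 10x -> 1000x estimate.
    """
    return node_growth ** exponent


for factor in (2, 5, 10):
    print(f"{factor}x nodes -> {failed_transaction_growth(factor)}x failed transactions")
```

Under this assumption even doubling the number of nodes multiplies failed transactions by eight.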
17
Lazy Replication and Mobile Nodes
- With lazy group replication, we have to wait for all nodes to come online to commit
- Lazy master replication cannot work for mobile nodes, because a network connection is needed for a transaction to complete
18
Lazy Replication and Mobile Nodes
- Lazy group replication allows any node to update any local data.
- When the transaction commits, a transaction is sent to every other node to apply the root transaction's updates to the replicas at the destination node.
- Two nodes may race to update the same object. This must be detected and reconciled.
19
Lazy Replication and Mobile Nodes
- Timestamps are commonly used to detect and reconcile lazy-group transactional updates.
- Each object carries the timestamp of its most recent update.
- Each replica update carries the new value and is tagged with the old object timestamp.
- Each node detects incoming replica updates that would overwrite earlier committed updates.
- The node tests whether the local replica's timestamp and the update's old timestamp are equal.
- If so, the update is safe.
20
Lazy Replication and Mobile Nodes
- The local replica's timestamp advances to the new transaction's timestamp and the object value is updated.
- If the current timestamp of the local replica does not match the old timestamp seen by the root transaction, then the update may be “dangerous”.
  - The node rejects the incoming transaction and submits it for reconciliation.
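The timestamp test from the last two slides can be sketched as follows. This is a minimal illustration using integer timestamps and a plain list as the reconciliation queue; `Replica`, `ReplicaUpdate`, and `apply_update` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    value: int
    timestamp: int  # timestamp of the most recent update applied here


@dataclass
class ReplicaUpdate:
    old_timestamp: int  # object timestamp seen by the root transaction
    new_timestamp: int  # timestamp of the root transaction
    new_value: int


def apply_update(replica, update, reconcile_queue):
    """Apply the update if it is safe; otherwise hand it over for reconciliation."""
    if replica.timestamp == update.old_timestamp:
        # Safe: the root transaction saw the same version that is stored locally.
        replica.value = update.new_value
        replica.timestamp = update.new_timestamp
        return True
    # "Dangerous": it would overwrite a committed update the root never saw.
    reconcile_queue.append(update)
    return False


replica = Replica(value=1000, timestamp=1)
queue = []
u1 = ReplicaUpdate(old_timestamp=1, new_timestamp=2, new_value=900)
u2 = ReplicaUpdate(old_timestamp=1, new_timestamp=3, new_value=800)  # raced with u1
r1 = apply_update(replica, u1, queue)
r2 = apply_update(replica, u2, queue)
print(r1, r2, replica, len(queue))
```

Both updates were computed against version 1, so only the first to arrive passes the test; the second is queued for reconciliation rather than silently overwriting the first.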
21
Example Replication Scenario: #1
- Replicated DNS servers
  - One primary DNS server
  - Multiple replicas
      DNS1.UGA.EDU 128.192.1.9
      DNS2.UGA.EDU 128.192.1.193
      DNS3.UGA.EDU 168.24.242.249
  - Replicas use zone transfers to get an up-to-date database from the primary server
  - The database is transferred every so often
  - The state is inconsistent between transfers
- Lazy master replication
22
Example Replication #2
- Palm Pilot synchronization
- The database (your address book) lives in your PIM (Outlook, say), in Palm Desktop, and on your Palm device. Updates are allowed anywhere. You could authorize your secretary to add items to your Outlook
- Lazy group update
23
Example Replication #3
- Gnutella: when you add a new song on your computer, when do the other nodes see it? Eventually
- Lazy group update
24
Example Replication #4
- Newsgroups
- Everyone can post to a newsgroup. You post in comp.risks from UWO, and your friend posts at the same time from Toronto. My friend at Waterloo will see the two posts in some order (UWO first and then Toronto, or the other way around)
- Lazy group replication
25
Example Replication #5
- Distributed databases with ACID semantics
- Eager master
26
Convergence Property
- If no new transactions arrive and all the nodes are connected together, they will all converge to the same replicated state after exchanging replica updates
- Updates may be lost because they are overwritten by newer updates
- Commutative updates: incremental transformations that can be applied in any order
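The difference between the last two points can be seen in a small sketch. It assumes three transactions that each intended to change a $1,000 balance by a delta, and compares timestamp-based overwrite (newest value wins) with commutative increments; all names and numbers are illustrative.

```python
from itertools import permutations

START = 1000
DELTAS = [-100, 250, -30]  # the intended change of each transaction

# Overwrite style: each transaction ships (timestamp, new value computed from START).
overwrites = [(ts, START + d) for ts, d in enumerate(DELTAS, start=1)]


def newest_wins(updates):
    """Keep the value carrying the highest timestamp, whatever the arrival order."""
    state = (0, START)
    for update in updates:
        if update[0] > state[0]:
            state = update
    return state[1]


def apply_increments(deltas):
    """Apply incremental transformations; addition commutes, so order is irrelevant."""
    value = START
    for d in deltas:
        value += d
    return value


overwrite_results = {newest_wins(order) for order in permutations(overwrites)}
increment_results = {apply_increments(order) for order in permutations(DELTAS)}
print(overwrite_results, increment_results)
```

Both schemes converge to a single value in every arrival order, but the overwrite scheme ends at 970 and silently drops the first two transactions, while the increments end at 1120 with nothing lost.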
27
Two-Tier Replication
- Mobile nodes
  - Disconnected most of the time.
  - Mobile nodes store a master version and a tentative version
    - The master version on a disconnected or lazy replica may be outdated
    - The most recent value due to local updates is maintained as a tentative value
- Base nodes
  - Always connected. Store a replica of the database. Items are mastered at base nodes
28
Two-Tier Transaction
- Base transaction
  - Works only on master data
  - Produces new master data
- Tentative transaction
  - Works on local tentative data
  - Produces new tentative versions
  - Also produces a base transaction to be run at a later time on the base nodes
- Acceptance criteria for each transaction update
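The interplay of tentative and base transactions can be sketched with the checking-account example. The sketch assumes a single base node, withdrawals as the only transaction type, and "no overdraft" as the acceptance criterion; `MobileNode`, `BaseNode`, and `reconnect` are hypothetical names.

```python
class BaseNode:
    """Always connected; holds the master version of the balance."""

    def __init__(self, balance):
        self.balance = balance

    def run_base_withdraw(self, amount):
        # Acceptance criterion (assumed): the withdrawal must not overdraw the account.
        if amount > self.balance:
            return False
        self.balance -= amount
        return True


class MobileNode:
    """Usually disconnected; keeps a master copy and a tentative version."""

    def __init__(self, master_value):
        self.master = master_value     # may be stale while disconnected
        self.tentative = master_value  # reflects local tentative updates
        self.pending = []              # base transactions to run on reconnect

    def tentative_withdraw(self, amount):
        self.tentative -= amount
        self.pending.append(amount)


def reconnect(mobile, base):
    """Re-run the queued work as base transactions, then refresh the local copies."""
    results = [base.run_base_withdraw(amount) for amount in mobile.pending]
    mobile.pending.clear()
    mobile.master = mobile.tentative = base.balance
    return results


base = BaseNode(1000)
wife, husband = MobileNode(1000), MobileNode(1000)
wife.tentative_withdraw(700)
husband.tentative_withdraw(600)
wife_results = reconnect(wife, base)        # accepted: master balance becomes 300
husband_results = reconnect(husband, base)  # rejected: would overdraw
print(wife_results, husband_results, base.balance, husband.tentative)
```

The husband's tentative balance of 400 is discarded on reconnect: only the base transaction's outcome is durable, and the rejected withdrawal is reported back to the mobile node.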
29
Key Properties of Two-Tier Replication Schemes
- Mobile nodes may make tentative database updates
- Base transactions execute with single-copy serializability, so the master base system state is the result of a serializable execution
- A transaction becomes durable when the base transaction completes
- Replicas at all connected nodes converge to the base system state
- If all transactions commute, there are no reconciliations