Consistent Data Replication: Is it feasible in WANs?

Slides:



Advertisements
Similar presentations
Distributed Processing, Client/Server and Clusters
Advertisements

Database Replication in WAN Yi Lin McGill University Distributed Information Systems.
Exploiting Distributed Version Concurrency in a Transactional Memory Cluster Kaloian Manassiev, Madalin Mihailescu and Cristiana Amza University of Toronto,
Replication. Topics r Why Replication? r System Model r Consistency Models r One approach to consistency management and dealing with failures.
Consistency and Replication Chapter 7 Part II Replica Management & Consistency Protocols.
Reliability on Web Services Presented by Pat Chan 17/10/2005.
Middleware based Data Replication providing Snapshot Isolation Yi Lin Bettina Kemme Marta Patiño-Martínez Ricardo Jiménez-Peris June 15, 2005.
Distributed Processing, Client/Server, and Clusters
Distributed components
Distributed Systems Fall 2010 Replication Fall 20105DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Replicating Basic Components Bettina Kemme McGill University, Montreal, Canada.
Virtual Synchrony Jared Cantwell. Review Multicast Causal and total ordering Consistent Cuts Synchronized clocks Impossibility of consensus Distributed.
Database Replication techniques: a Three Parameter Classification Authors : Database Replication techniques: a Three Parameter Classification Authors :
Overview Distributed vs. decentralized Why distributed databases
Understanding Replication in Database & Distributed Systems SRDS’ Database Replication Techniques: A Three Parameter Classification M. Wiesmann F.
Transactional Services Ricardo Jiménez-Peris Marta Patiño-Martínez Technical University of Madrid 1 st Adapt Workshop 23 rd -24 th September 2002 Madrid,
Collaborative Web Caching Based on Proxy Affinities Jiong Yang, Wei Wang in T. J.Watson Research Center Richard Muntz in Computer Science Department of.
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Distributed Systems 2006 Virtual Synchrony* *With material adapted from Ken Birman.
Database Replication Policies for Dynamic Content Applications Gokul Soundararajan, Cristiana Amza, Ashvin Goel University of Toronto EuroSys 2006: Leuven,
Consistent and Efficient Database Replication based on Group Communication Bettina Kemme School of Computer Science McGill University, Montreal.
Replicated Databases. Reading Textbook: Ch.13 Textbook: Ch.13 FarkasCSCE Spring
Applying Database Replication to Multi-player Online Games Yi Lin Bettina Kemme Marta Patiño-Martínez Ricardo Jiménez-Peris Oct 30, 2006.
Database replication policies for dynamic content applications Gokul Soundararajan, Cristiana Amza, Ashvin Goel University of Toronto Presented by Ahmed.
Replication (1). Topics r Why Replication? r System Model r Consistency Models – How do we reason about the consistency of the “global state”? m Data-centric.
Database Replication in WAN Yi Lin Supervised by: Prof. Kemme April 8, 2005.
Chap 7: Consistency and Replication
Replication (1). Topics r Why Replication? r System Model r Consistency Models r One approach to consistency management and dealing with failures.
Chapter 4 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University Building Dependable Distributed Systems.
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT By Jyothsna Natarajan Instructor: Prof. Yanqing Zhang Course: Advanced Operating Systems.
CSE 486/586, Spring 2012 CSE 486/586 Distributed Systems Replication Steve Ko Computer Sciences and Engineering University at Buffalo.
A Comparative Evaluation of Transparent Scaling Techniques for Dynamic Content Servers Presented by Chen Zhang Written by C. Amza, A. L. Cox,
Don’t be lazy, be consistent: Postgres-R, A new way to implement Database Replication Paper by Bettina Kemme and Gustavo Alonso, VLDB 2000 Presentation.
CalvinFS: Consistent WAN Replication and Scalable Metdata Management for Distributed File Systems Thomas Kao.
Replication Chapter Katherine Dawicki. Motivations Performance enhancement Increased availability Fault Tolerance.
EEC 688/788 Secure and Dependable Computing Lecture 10 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Reliable multicast Tolerates process crashes. The additional requirements are: Only correct processes will receive multicasts from all correct processes.
Replication & Fault Tolerance CONARD JAMES B. FARAON
CS6320 – Performance L. Grewe.
Slide credits: Thomas Kao
Network Load Balancing
6.4 Data and File Replication
Ganymed: Scalable Replication for Transactional Web Applications
The SNOW Theorem and Latency-Optimal Read-Only Transactions
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT -Sumanth Kandagatla Instructor: Prof. Yanqing Zhang Advanced Operating Systems (CSC 8320)
Chapter 17: Database System Architectures
Outline Announcements Fault Tolerance.
7.1. CONSISTENCY AND REPLICATION INTRODUCTION
Nicolas Schiper Fernando Pedone University of Lugano Switzerland
Replication Improves reliability Improves availability
Advanced Operating System
Consistency and Replication
Active replication for fault tolerance
Distributed Systems CS
EEC 688/788 Secure and Dependable Computing
From Viewstamped Replication to BFT
EEC 688/788 Secure and Dependable Computing
EEC 688/788 Secure and Dependable Computing
Lecture 21: Replication Control
Distributed Systems CS
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
EEC 688/788 Secure and Dependable Computing
EEC 688/788 Secure and Dependable Computing
The SMART Way to Migrate Replicated Stateful Services
Database System Architectures
EEC 688/788 Secure and Dependable Computing
Distributed Systems (15-440)
Lecture 21: Replication Control
The Gamma Database Machine Project
Sisi Duan Assistant Professor Information Systems
Presentation transcript:

Consistent Data Replication: Is it feasible in WANs? Yi Lin Bettina Kemme Marta Patiño-Martínez Ricardo Jiménez-Peris Sep 2, 2005

Data Replication: What,Why,How? Without Replication With Replication Toronto Montreal Ottawa Toronto Montreal Ottawa … … WAN Montreal Toronto Montreal Ottawa Benefits: Fault Tolerance, Performance Challenge: keep data consistent

Data Replication: challenge Keep data consistent w(x) Replica control x x x

Motivations Most replication protocols have been proved to perform well in LANs. Little work has been done in WANs GlobData [DMBS02], Tech Report [JHU02] Are these protocols also feasible in WANs? Protocols working well in LANs may not work well in WANs. Why? What are the bottlenecks? Any solutions?

Intro to Group Communication Systems GCS provides multicast primitives to all members in the group Group maintenance (removal of failed members, etc.) Ordering Unordered Total order (messages delivered in all members in the same order) Reliability Different degrees of delivery guarantees in case of site failures Analyzed in paper;

Data Replication: Using Group Communication Systems Total Order Read-Only requests: Executed in the local site Update requests: Multicast in total order firstly. executed according to total order delivery. Num of msgs for an update 1 total order w(x) w(x) w(x) w(x) x x x Symmetric

Data Replication: Using Group Communication Systems Read-Only requests: Executed in the local site Update requests: Request totally ordered firstly. executed only in the primary site Multicast the changes in unordered msg. Apply change in other sites Num of msgs for an update 1 total order + 1 unordered Local write (w(x)) 1 total order within response time Remote write (w(x)) 1 total order + 1 unordered within response time Total Order w(x) w(x) primary w(x) w(x) unordered x x x x x Primary Copy

Data Replication: Using Group Communication Systems Read-Only requests: Executed in the local site Update requests: Request totally ordered firstly. executed locally Multicast the changes in unordered msg. Apply change in other sites Num of msgs for an update 1 total order + 1 unordered No concurrent conflicting req 1 total order within response time Has concurrent conflicting req 1 total order + 1 unordered within response time Total Order w(x) w(x) w(x) w(x) unordered x x x x x Local Copy

Num of messages summary Symmetric Primary Copy Local Copy Total num of msgs 1 total order 1 unordered Num of msgs within respone time Local write No concurrent conflicting request Remote write Has concurrent conflicting request

Experiment (I) LAN WAN (5 sites, 100% update)

Experiment (I): Response time analysis

Experiment (II): Scalability in WAN Read-only requests Update requests 50% update, Symmetric

Different Total Order Algorithms Seq # token A (seq) A m m B B C C SEQUENCER TOKEN m2 m <1,0,0> A A m1 m2m1 <1,0,0> B B <1,0,0> C C LAMPORT Round Robin (ATOP)

Experiment (III): Different Total Order Alg 5 sites in WAN, without replication 5 sites in WAN, with replication 100% update, Symmetric,

Conclusions Consistent database replication is feasible in WANs; For deterministic applications, Symmetric approach is preferable. For non-deterministic applications, Local Copy is preferable; In WAN, total order multicast is crucial to response time. Round Robin total order has better performance over others; We have some other interesting optimizations. Please refer to our paper.

References [C-JDBC] E. Ceccet, J.Marguerite, and W. Zwaenepoel. C-JDBC: Flexible database clustering middleware. In USENIX conference 2004 [Ganymed] C. Plattner and G. Alonso. Ganymed: Scalable replication for transactional web applications. In Middleware, 2004. [GlobData] L. Rodrigues, H. Miranda, R. Almeida, J. Martins, and P. Vicente. Strong Replication in the GlobData Middleware. In Workshop on Dependable Middleware-Based Systems, 2002. [Middle-R] R. Jimenez-Peris, M. Patiòno-Martnez, B. Kemme, and G. Alonso. Improving Scalability of Fault Tolerant Database Clusters. In ICDCS'02. [Conflict-Aware] C. Amza, A. L. Cox, and W. Zwaenepoel. Conict-Aware Scheduling for Dynamic Content Applications. In USENIX Symp. on Internet Tech. and Sys., 2003. [State Machine] F. Pedone, R. Guerraoui, and A. Schiper. The Database State Machine Approach. Distributed and Parallel Databases, 14:71-98, 2003. [Spread] http://www.spread.org [JGroups] http://www.jgroups.org