CS 600.419 Storage Systems Dangers of Replication Materials taken from “J. Gray, P. Helland, P. O’Neil, and D. Shasha. The Dangers of Replication and a.

Slides:



Advertisements
Similar presentations
1 Scaleable Replicated Databases Jim Gray (Microsoft) Pat Helland (Microsoft) Dennis Shasha (Columbia) Pat ONeil (U.Mass)
Advertisements

What is OceanStore? - 10^10 users with files each - Goals: Durability, Availability, Enc. & Auth, High performance - Worldwide infrastructure to.
Eventual Consistency Jinyang. Sequential consistency Sequential consistency properties: –Latest read must see latest write Handles caching –All writes.
Cloudifying Source Code Repositories: How much does it cost? LADIS 2009 Big Sky, Montana Michael Siegenthaler Hakim Weatherspoon Cornell University.
Replication and Consistency (2). Reference r Replication in the Harp File System, Barbara Liskov, Sanjay Ghemawat, Robert Gruber, Paul Johnson, Liuba.
Study of Hurricane and Tornado Operating Systems By Shubhanan Bakre.
CMPT Dr. Alexandra Fedorova Lecture X: Transactions.
Database Replication techniques: a Three Parameter Classification Authors : Database Replication techniques: a Three Parameter Classification Authors :
CMPT 431 Dr. Alexandra Fedorova Lecture XII: Replication.
Mobile Computing and Databases - A Survey A presentation by Dharmesh Thakkar based on the publication by Daniel Barbara.
Overview Distributed vs. decentralized Why distributed databases
Tentative Updates in MINO Steven Czerwinski Jeff Pang Anthony Joseph John Kubiatowicz ROC Winter Retreat January 13, 2002.
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Real-Time Distributed Databases By: Chris Scardino CSC536 Monday, May 2, 2005.
G Robert Grimm New York University Bayou: A Weakly Connected Replicated Storage System.
Ordering of events in Distributed Systems & Eventual Consistency Jinyang Li.
Overview  Strong consistency  Traditional approach  Proposed approach  Implementation  Experiments 2.
Searching in Unstructured Networks Joining Theory with P-P2P.
What Can Databases Do for Peer-to-Peer Steven Gribble, Alon Halevy, Zachary Ives, Maya Rodrig, Dan Suciu Presented by: Ryan Huebsch CS294-4 P2P Systems.
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
More on Replication and Consistency CS-4513 D-term More on Replication and Consistency CS-4513 Distributed Computing Systems (Slides include materials.
Managing Update Conflicts in Bayou, a Weakly Connected Replicated Storage System D. B. Terry, M. M. Theimer, K. Petersen, A. J. Demers, M. J. Spreitzer.
Mobility Presented by: Mohamed Elhawary. Mobility Distributed file systems increase availability Remote failures may cause serious troubles Server replication.
PNUTS: YAHOO!’S HOSTED DATA SERVING PLATFORM FENGLI ZHANG.
Distributed Deadlocks and Transaction Recovery.
Distributed Systems Tutorial 11 – Yahoo! PNUTS written by Alex Libov Based on OSCON 2011 presentation winter semester,
Practical Replication. Purposes of Replication Improve Availability Replicated databases can be accessed even if several replicas are unavailable Improve.
Orbe: Scalable Causal Consistency Using Dependency Matrices & Physical Clocks Jiaqing Du, EPFL Sameh Elnikety, Microsoft Research Amitabha Roy, EPFL Willy.
CS Storage Systems Lecture 14 Consistency and Availability Tradeoffs.
Jan 31, 2001CSCI {4,6}900: Ubiquitous Computing1 Recap. Ubiquitous Computing Vision –The Computer for the Twenty-First Century, Mark Weiser –The Coming.
Replication and Consistency. Reference The Dangers of Replication and a Solution, Jim Gray, Pat Helland, Patrick O'Neil, and Dennis Shasha. In Proceedings.
Presented by Dr. Greg Speegle April 12,  Two-phase commit slow relative to local transaction processing  CAP Theorem  Option 1: Reduce availability.
1 Update Strategies for First Class Futures Khan Muhammad, Ludovic Henrio INRIA, Univ. Nice Sophia Antipolis,CNRS.
Replication March 16, Replication What is Replication?  A technique for increasing availability, fault tolerance and sometimes, performance 
Consistent and Efficient Database Replication based on Group Communication Bettina Kemme School of Computer Science McGill University, Montreal.
Replicated Databases. Reading Textbook: Ch.13 Textbook: Ch.13 FarkasCSCE Spring
Byzantine fault-tolerance COMP 413 Fall Overview Models –Synchronous vs. asynchronous systems –Byzantine failure model Secure storage with self-certifying.
Preventive Replication in Database Cluster Esther Pacitti, Cedric Coulon, Patrick Valduriez, M. Tamer Özsu* LINA / INRIA – Atlas Group University of Nantes.
1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.
Ch 10 Shared memory via message passing Problems –Explicit user action needed –Address spaces are distinct –Small Granularity of Transfer Distributed Shared.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
1 Distributed Databases BUAD/American University Distributed Databases.
Replica Consistency in a Data Grid1 IX International Workshop on Advanced Computing and Analysis Techniques in Physics Research December 1-5, 2003 High.
Feb 1, 2001CSCI {4,6}900: Ubiquitous Computing1 Eager Replication and mobile nodes Read on disconnected clients may give stale data Eager replication prohibits.
1 Multiversion Reconciliation for Mobile Databases Shirish Hemanath Phatak & B.R.Badrinath Presented By Presented By Md. Abdur Rahman Md. Abdur Rahman.
Transactions and Concurrency Control. Concurrent Accesses to an Object Multiple threads Atomic operations Thread communication Fairness.
Ing. Erick López Ch. M.R.I. Replicación Oracle. What is Replication  Replication is the process of copying and maintaining schema objects in multiple.
Caching Consistency and Concurrency Control Contact: Dingshan He
Geo-distributed Messaging with RabbitMQ
Write Conflicts in Optimistic Replication Problem: replicas may accept conflicting writes. How to detect/resolve the conflicts? client B client A replica.
Distributed Computing Systems CSCI 4780/6780. Scalability ConceptExample Centralized servicesA single server for all users Centralized dataA single on-line.
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT By Jyothsna Natarajan Instructor: Prof. Yanqing Zhang Course: Advanced Operating Systems.
DATABASE REPLICATION DISTRIBUTED DATABASE. O VERVIEW Replication : process of copying and maintaining database object, in multiple database that make.
Eventual Consistency Jinyang. Review: Sequential consistency Sequential consistency properties: –All read/write ops follow some total ordering –Read must.
Highly Available Services and Transactions with Replicated Data Jason Lenthe.
Implementation of Simple Cloud-based Distributed File System Group ID: 4 Baolin Wu, Liushan Yang, Pengyu Ji.
IHP Im Technologiepark Frankfurt (Oder) Germany IHP Im Technologiepark Frankfurt (Oder) Germany ©
Log Shipping, Mirroring, Replication and Clustering Which should I use? That depends on a few questions we must ask the user. We will go over these questions.
Mobility Victoria Krafft CS /25/05. General Idea People and their machines move around Machines want to share data Networks and machines fail Network.
Chapter 25: Advanced Data Types and New Applications
CSCI5570 Large Scale Data Processing Systems
Example Replicated File Systems
Ganymed: Scalable Replication for Transactional Web Applications
Replication and Availability in Distributed systems
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT -Sumanth Kandagatla Instructor: Prof. Yanqing Zhang Advanced Operating Systems (CSC 8320)
EECS 498 Introduction to Distributed Systems Fall 2017
7.1. CONSISTENCY AND REPLICATION INTRODUCTION
EECS 498 Introduction to Distributed Systems Fall 2017
Distributed Database Management Systems
Presentation transcript:

CS Storage Systems Dangers of Replication Materials taken from “J. Gray, P. Helland, P. O’Neil, and D. Shasha. The Dangers of Replication and a Solution. SIGMOD, 2006.”

CS Storage Systems What’s the danger? Replication of transactional data results in unstable system performance For consistent replication –Waits and deadlocks For update-anywhere-anytime replication –Reconciliations Both grow polynomially (w/ meaningful exponents) in the number of clients –Based on simple, lower bounds derived from mean-value analysis

CS Storage Systems What’s the point? This theme is predicated on the knowledge that globally consistent replication does not scale

CS Storage Systems Replication Policies Eager replication: –Copies are updated as part of the original transaction. Lazy replication: –One replica is updated. Other copies are updated asynchronously Update policy: –Group: any node can update its replica. –Master: only master updates its replica. The rest replicas are read only.

CS Storage Systems Representing Writes

CS Storage Systems Mastered and Group Replication

CS Storage Systems The Scale-up Pitfall Replication works well on small, prototype systems –But, at deployment, replication is unstable At larger scales –Messages propagation delay increases –Higher transaction rates For eager replication –More transactions with each txn taking longer For lazy transactions –Delays in reconciliation leads to system delusion

CS Storage Systems Analysis of Eager Group Replication Scaling laws –Third power of the number of nodes –Fifth power of the # of operations per transaction Problems with eager replication –Cannot be used by disconnected nodes –Probability of deadlocks (failed transactions) increases with systems size

CS Storage Systems Analysis of Lazy Group Replication Scaling laws –Third power of the number of nodes –third power of the # of operations per transaction Better than eager, but not so good

CS Storage Systems Analysis of Lazy Master Replication Scaling laws –second power of the number of nodes –fifth power of the # of operations per transaction

CS Storage Systems Status of Replication Negative scaling results –Don’t account for message delays (so it’s worse) –Can’t escape these via lazy vs eager options No reason for group replication –Master is the same (eager) or better (lazy) So, what do we do –Avoid scale, keep systems small

CS Storage Systems Two-Tier Replication Two node types: –Base nodes: Always connected, store replica, master most objects –Mobile nodes: often disconnected, store a replica, issues tentative transactions Two version types: –Master version: Exists at the object owner, other may have older versions –Tentative version: Local version is updated by tentative transactions

CS Storage Systems Pictures to Entertain

CS Storage Systems System Principles Hierarchies to reduce scale –Nodes (Master & Mobile-disconnected) –Transactions (Tentative and Eager/Consistent) Techniques –Convergence (Bayou-like eventual consistency) –Idempotence: encode writes in non-conflicting ways Does it fix any of Bayou’s semantic problems?

CS Storage Systems Conclusions Eager: waits and deadlocks Lazy converts waits and deadlocks into reconciliations Both do not scale. Two tier replication: –Supports mobile nodes –Combine eager-master-replication with local updates