Ganymed: Scalable Replication for Transactional Web Applications

Slides:



Advertisements
Similar presentations
Multiple Processor Systems
Advertisements

Dynamo: Amazon’s Highly Available Key-value Store
Distributed DBMS©M. T. Özsu & P. Valduriez Ch.14/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control.
Manajemen Basis Data Pertemuan 12 Matakuliah: M0264/Manajemen Basis Data Tahun: 2008.
Database Replication in WAN Yi Lin McGill University Distributed Information Systems.
Exploiting Distributed Version Concurrency in a Transactional Memory Cluster Kaloian Manassiev, Madalin Mihailescu and Cristiana Amza University of Toronto,
Title – Process Migration and Mobile Agents By David Aihe.
Replication Management. Motivations for Replication Performance enhancement Increased availability Fault tolerance.
Middleware based Data Replication providing Snapshot Isolation Yi Lin Bettina Kemme Marta Patiño-Martínez Ricardo Jiménez-Peris June 15, 2005.
Chapter 13 (Web): Distributed Databases
Replicating Basic Components Bettina Kemme McGill University, Montreal, Canada.
Distributed Locking. Distributed Locking (No Replication) Assumptions Lock tables are managed by individual sites. The component of a transaction at a.
Predicting Replicated Database Scalability Sameh Elnikety, Microsoft Research Steven Dropsho, Google Inc. Emmanuel Cecchet, Univ. of Mass. Willy Zwaenepoel,
Database Replication techniques: a Three Parameter Classification Authors : Database Replication techniques: a Three Parameter Classification Authors :
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
Understanding Replication in Database & Distributed Systems SRDS’ Database Replication Techniques: A Three Parameter Classification M. Wiesmann F.
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Overview  Strong consistency  Traditional approach  Proposed approach  Implementation  Experiments 2.
What Can Databases Do for Peer-to-Peer Steven Gribble, Alon Halevy, Zachary Ives, Maya Rodrig, Dan Suciu Presented by: Ryan Huebsch CS294-4 P2P Systems.
Recovery Techniques in Distributed Databases Naveen Jones December 5, 2011.
6.4 Data and File Replication Gang Shen. Why replicate  Performance  Reliability  Resource sharing  Network resource saving.
PNUTS: YAHOO!’S HOSTED DATA SERVING PLATFORM FENGLI ZHANG.
Distributed Data Stores – Facebook Presented by Ben Gooding University of Arkansas – April 21, 2015.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Database Replication. Replication Replication is the process of sharing information so as to ensure consistency between redundant resources, such as software.
Copyright © 2013 NetEase 马进 app DDB introduce.
Database Replication Policies for Dynamic Content Applications Gokul Soundararajan, Cristiana Amza, Ashvin Goel University of Toronto EuroSys 2006: Leuven,
CS Storage Systems Dangers of Replication Materials taken from “J. Gray, P. Helland, P. O’Neil, and D. Shasha. The Dangers of Replication and a.
/11/2003 C-JDBC: a High Performance Database Clustering Middleware Nicolas Modrzyk
Replication and Consistency. Reference The Dangers of Replication and a Solution, Jim Gray, Pat Helland, Patrick O'Neil, and Dennis Shasha. In Proceedings.
A Dynamic Data Grid Replication Strategy to Minimize the Data Missed Ming Lei, Susan Vrbsky, Xiaoyan Hong University of Alabama.
Consistent and Efficient Database Replication based on Group Communication Bettina Kemme School of Computer Science McGill University, Montreal.
Replicated Databases. Reading Textbook: Ch.13 Textbook: Ch.13 FarkasCSCE Spring
THE LITTLE ENGINE(S) THAT COULD: SCALING ONLINE SOCIAL NETWORKS B 圖資三 謝宗昊.
Distributed Computing Systems CSCI 4780/6780. Distributed System A distributed system is: A collection of independent computers that appears to its users.
Preventive Replication in Database Cluster Esther Pacitti, Cedric Coulon, Patrick Valduriez, M. Tamer Özsu* LINA / INRIA – Atlas Group University of Nantes.
Distributed Computing Systems CSCI 4780/6780. Geographical Scalability Challenges Synchronous communication –Waiting for a reply does not scale well!!
Applying Database Replication to Multi-player Online Games Yi Lin Bettina Kemme Marta Patiño-Martínez Ricardo Jiménez-Peris Oct 30, 2006.
Database Replication in Tashkent CSEP 545 Transaction Processing Sameh Elnikety.
IM NTU Distributed Information Systems 2004 Replication Management -- 1 Replication Management Yih-Kuen Tsay Dept. of Information Management National Taiwan.
Database replication policies for dynamic content applications Gokul Soundararajan, Cristiana Amza, Ashvin Goel University of Toronto Presented by Ahmed.
An Overview of Issues in P2P database systems Presented by Ahmed Ataullah Wednesday, November 29 th 2006.
Adaptive Location Management Model for Manet (ALM) Navid NIKAEIN Christian BONNET
From Digital Objects to Content across eInfrastructures Content and Storage Management in gCube Pasquale Pagano CNR –ISTI on behalf of Heiko Schuldt Dept.
Feb 1, 2001CSCI {4,6}900: Ubiquitous Computing1 Eager Replication and mobile nodes Read on disconnected clients may give stale data Eager replication prohibits.
Database Replication in WAN Yi Lin Supervised by: Prof. Kemme April 8, 2005.
Ing. Erick López Ch. M.R.I. Replicación Oracle. What is Replication  Replication is the process of copying and maintaining schema objects in multiple.
Distributed Computing Systems CSCI 4780/6780. Scalability ConceptExample Centralized servicesA single server for all users Centralized dataA single on-line.
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT By Jyothsna Natarajan Instructor: Prof. Yanqing Zhang Course: Advanced Operating Systems.
DATABASE REPLICATION DISTRIBUTED DATABASE. O VERVIEW Replication : process of copying and maintaining database object, in multiple database that make.
Highly Available Services and Transactions with Replicated Data Jason Lenthe.
Tzu-Han Wu Yi-Chi Chiang Han-Yang Ou
IHP Im Technologiepark Frankfurt (Oder) Germany IHP Im Technologiepark Frankfurt (Oder) Germany ©
Don’t be lazy, be consistent: Postgres-R, A new way to implement Database Replication Paper by Bettina Kemme and Gustavo Alonso, VLDB 2000 Presentation.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Performance Driven Database Design for Scalable Web Applications Jozsef Patvarczki, Murali Mani, and Neil Heffernan Scaling up web applications requires.
Context-Aware Middleware for Resource Management in the Wireless Internet US Lab 신현정.
Dynamo: Amazon’s Highly Available Key-value Store
DNS.
Lecturer : Dr. Pavle Mogin
GlassFish in the Real World
6.4 Data and File Replication
A Replica Location Service
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT -Sumanth Kandagatla Instructor: Prof. Yanqing Zhang Advanced Operating Systems (CSC 8320)
Consistent Data Replication: Is it feasible in WANs?
Distributed Database Management Systems
OOTI Talks Apr-19 Johan J. Lukkien,
Network management system
Presentation transcript:

Ganymed: Scalable Replication for Transactional Web Applications Christian Plattner and Gustavo Alonso Presented by Ahmed Ataullah

Background and Motivation Is replication avoidable? Is our choice limited only to eager or lazy solutions? Or a mix of both..

What is Ganymed? One master copy, several slave copies Middleware and scheduling algorithm Key Idea: Separate the read-only and update requests Assumption: Incoming requests are a “reasonable” mix of update and read-only transactions

Scheduling Algorithm (RSI-PC) Read-only requests sent to any ‘available’ copy Update requests sent directly to the master copy Global database version number incremented after each update Writesets sent to all slaves after completion

Ganymed Architecture “…Ganymed resembles more a router than a scheduler.”

Test Results Change from GNY-1 to GNY-2 configuration is perhaps interesting…

Test Results (continued)

Reliability Concerns What if a replica fails? What if the master database fails? What if the scheduler(s) fail?

Other concerns… Is the assumption of balanced query load a fair one to make? What if most queries are update queries? At What point does the master copy become the bottleneck? Can the scheduler become the bottleneck?

Other concerns (continued) What about disconnected (or mobile) replicas? Serializablilty, freshness etc… Others Issues?

Improvement ideas… Have two master-copies working together Dynamically Increase/decrease number of master copies with respect to update load Khuzaima Daudjee and Kenneth Salem (Waterloo)

Improvement ideas (continued) Have two scheduling points What if the connection between the two points is unreliable? Improving the RSI-PC algorithm without degrading the performance

Conclusion Gynamed presents a neat idea Separate read-only and update transactions and exploit the properties of the expected query load Subsequent work has improved the concept substantially. Questions and Comments?