Real-Time Distributed Databases By: Chris Scardino CSC536 Monday, May 2, 2005.

Slides:



Advertisements
Similar presentations
Serializability in Multidatabases Ramon Lawrence Dept. of Computer Science
Advertisements

Optimistic Methods for Concurrency Control By : H.T. Kung & John T. Robinson Presenters: Munawer Saeed.
Relaxed Consistency Models. Outline Lazy Release Consistency TreadMarks DSM system.
Replication. Topics r Why Replication? r System Model r Consistency Models r One approach to consistency management and dealing with failures.
Distributed Databases John Ortiz. Lecture 24Distributed Databases2  Distributed Database (DDB) is a collection of interrelated databases interconnected.
Distributed Storage March 12, Distributed Storage What is Distributed Storage?  Simple answer: Storage that can be shared throughout a network.
CMPT 401 Summer 2007 Dr. Alexandra Fedorova Lecture X: Transactions.
Using DSVM to Implement a Distributed File System Ramon Lawrence Dept. of Computer Science
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Chapter 13 (Web): Distributed Databases
Distributed Systems Fall 2010 Replication Fall 20105DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
CMPT Dr. Alexandra Fedorova Lecture X: Transactions.
ABCSG - Distributed Database 1 Data Management Distributed Database Data Replication.
Transaction Management and Concurrency Control
Overview Distributed vs. decentralized Why distributed databases
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
DBMS Functions Data, Storage, Retrieval, and Update
Chapter 12 Distributed Database Management Systems
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Centralized Systems Run on a single computer system and do not interact with other computer.
CS 603 Data Replication February 25, Data Replication: Why? Fault Tolerance –Hot backup –Catastrophic failure Performance –Parallelism –Decreased.
Concurrency Control In Dynamic Database Systems Laurel Jones.
Distributed Databases
6.4 Data and File Replication Gang Shen. Why replicate  Performance  Reliability  Resource sharing  Network resource saving.
TRANSACTIONS AND CONCURRENCY CONTROL Sadhna Kumari.
Concurrency Control in Distributed Databases. By :- Rishikesh Mandvikar rmandvik[at]engr.smu.edu May 1, 2004.
Mobility in Distributed Computing With Special Emphasis on Data Mobility.
Database Design – Lecture 16
Replication March 16, Replication What is Replication?  A technique for increasing availability, fault tolerance and sometimes, performance 
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 12 Distributed Database Management Systems.
Database Systems: Design, Implementation, and Management Ninth Edition Chapter 12 Distributed Database Management Systems.
Consistent and Efficient Database Replication based on Group Communication Bettina Kemme School of Computer Science McGill University, Montreal.
Distributed Database Systems Overview
Unit 9 Transaction Processing. Key Concepts Distributed databases and DDBMS Distributed database advantages. Distributed database disadvantages Using.
1 IRU Concurrency, Reliability and Integrity issues Geoff Leese October 2007 updated August 2008, October 2009.
By Phani Gowthami Tammineni. Overview This presentation is about the issues in real-time database systems and presents an overview of the state of the.
Replicated Databases. Reading Textbook: Ch.13 Textbook: Ch.13 FarkasCSCE Spring
1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.
1 Distributed Databases BUAD/American University Distributed Databases.
Computer Science Lecture 13, page 1 CS677: Distributed OS Last Class: Canonical Problems Distributed synchronization and mutual exclusion Distributed Transactions.
XA Transactions.
A Survey on Optimistic Concurrency Control CAI Yibo ZHENG Xin
Ing. Erick López Ch. M.R.I. Replicación Oracle. What is Replication  Replication is the process of copying and maintaining schema objects in multiple.
Caching Consistency and Concurrency Control Contact: Dingshan He
Concurrency Control and Reliable Commit Protocol in Distributed Database Systems Jian Jia Chen 2002/05/09 Real-time and Embedded System Lab., CSIE, National.
Fault Tolerance and Replication
Topic Distributed DBMS Database Management Systems Fall 2012 Presented by: Osama Ben Omran.
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT By Jyothsna Natarajan Instructor: Prof. Yanqing Zhang Course: Advanced Operating Systems.
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
Distributed Database Management Systems. Reading Textbook: Ch. 1, Ch. 3 Textbook: Ch. 1, Ch. 3 For next class: Ch. 4 For next class: Ch. 4 FarkasCSCE.
CSE 486/586, Spring 2012 CSE 486/586 Distributed Systems Replication Steve Ko Computer Sciences and Engineering University at Buffalo.
9 1 Chapter 9_B Concurrency Control Database Systems: Design, Implementation, and Management, Rob and Coronel.
Chapter 1 Database Access from Client Applications.
NOEA/IT - FEN: Databases/Transactions1 Transactions ACID Concurrency Control.
10 1 Chapter 10_B Concurrency Control Database Systems: Design, Implementation, and Management, Rob and Coronel.
Dsitributed File Systems
Topics in Distributed Databases Database System Implementation CSE 507 Some slides adapted from Navathe et. Al and Silberchatz et. Al.
Chapter 19: Distributed Databases
Replication.
Outline Announcements Fault Tolerance.
Concurrency Control II (OCC, MVCC)
7.1. CONSISTENCY AND REPLICATION INTRODUCTION
Chapter 10 Transaction Management and Concurrency Control
Concurrency Control and Reliable Commit Protocol in Distributed Database Systems Jian Jia Chen 2002/05/09 Real-time and Embedded System Lab., CSIE, National.
CSE 4340/5349 Mobile Systems Engineering
Distributed Database Management Systems
A View over Distributed databases
Atomic Commit and Concurrency Control
Distributed Database Management Systems
Distributed Systems and Concurrency: Distributed Systems
Concurrency control (OCC and MVCC)
Presentation transcript:

Real-Time Distributed Databases By: Chris Scardino CSC536 Monday, May 2, 2005

Goals of this Paper Real-Time Distributed Database Systems a relatively new area of research. Real-Time Distributed Database Systems a relatively new area of research. Increase Understanding of Real-Time Distributed Database Systems. Increase Understanding of Real-Time Distributed Database Systems. Present information about key factors of Real-Time Distributed Databases. Present information about key factors of Real-Time Distributed Databases.

Topics Hard vs. Soft RTDDBS Hard vs. Soft RTDDBS Concurrency Control in RTDDBS Concurrency Control in RTDDBS QoS and QoD through Replication QoS and QoD through Replication

Databases Database: Database: –A collection of related data with meaning Distributed Databases: Distributed Databases: –A database that consists of two or more data files located at different sites on a computer network.

Real-Time Distributed Databases Distributed Databases with the added constraint of completing operations within a certain amount of time to accurately reflect the outside world.

Real-Time Distributed Databases Strategies must suit system attributes Strategies must suit system attributes –Hard or Soft –Concurrency Control –Replication

Hard vs. Soft Hard Strict timing constraints Strict timing constraints Data and Service guaranteed Data and Service guaranteed Deadlines met to avoid catastrophe Deadlines met to avoid catastrophe Example: Example: –Control tower notifying planes where to land in inclement weather –Burglary System dispatch Soft Less strict timing constraints Less strict timing constraints Failure to meet deadlines not dangerous Failure to meet deadlines not dangerous Value of data declines after deadline. Value of data declines after deadline. –Example: Checkout line growing in a grocery store.

Concurrency Control Ensures data is accurate even in a distributed system Ensures data is accurate even in a distributed system Two main approaches: Two main approaches: –Prevent Collisions (Pessimistic) –Detect Collisions and Respond (Optimistic)

Pessimistic Concurrency Control Two-Phase Locking: Two-Phase Locking: –Transactions acquire locks on data –After transaction completed, locks are removed. –Ensures that data integrity isn’t compromised.

Pessimistic Concurrency Control Disadvantages to 2PL: Disadvantages to 2PL: –Don’t scale well to distributed systems –Difficult to maintain locks at different locations –Problems inherent to locking multiplied by number of sites

Optimistic Concurrency Control OCC: OCC: –Assumed that collisions won’t occur –Few or no read restrictions –Initial writing takes place on copy of data –Course of action can be decided based on collision

Optimistic Concurrency Control Disadvantages to OCC: Disadvantages to OCC: –The less servers, the more likely collisions –Collisions always cause rollbacks –Time wasted while restarting transactions

Variations on CC Neither OCC nor PCC perfect for RTDDBS Neither OCC nor PCC perfect for RTDDBS Variations/Augmentations frequent: Variations/Augmentations frequent: –DHP-2PL –OCC Wait-50

Replication Strategies in Real-Time Distributed Databases Replication Deadlines must be met Deadlines must be met Fault tolerance Fault tolerance Failure Transparency Failure Transparency Replication helps maintain Quality of Service and Data Freshness Replication helps maintain Quality of Service and Data Freshness

Full Replication All data replicated at multiple sites All data replicated at multiple sites –High fault tolerance –Failure transparency –Improved response times through querying local data

Full Replication Two Categories: –When replication takes place Eager or Lazy Replication Eager or Lazy Replication –Where the replication takes place Primary or Update-Anywhere Primary or Update-Anywhere

Full Replication: Eager vs. Lazy Update Eager Update Replicas updated as transaction happens. Replicas updated as transaction happens. High response times from clients. High response times from clients. Locked longer. Locked longer. High Overhead High Overhead Lazy Update Replicas updated after transaction committed. Replicas updated after transaction committed. Chance for inconsistency. Chance for inconsistency.

Full Replication: Update Anywhere vs. Primary Copy Update Anywhere Any replica can update other replicas. Any replica can update other replicas. Good for fault tolerance. Good for fault tolerance. High synching and update times. High synching and update times. Primary Copy One replica designated as “server”. One replica designated as “server”. Good for read-only transactions. Good for read-only transactions. Restarts at server mean long waits and missed deadlines. Restarts at server mean long waits and missed deadlines.

Alternative: Partial Replication JITRTR: Replicate as needed. JITRTR: Replicate as needed. Only parts of the database replicated to cut down on overhead. Only parts of the database replicated to cut down on overhead. Works best in static systems Works best in static systems

Conclusion Replication strategies must adapt to Real- Time systems. Replication strategies must adapt to Real- Time systems. Variations or Augmenting traditional algorithms necessary. Variations or Augmenting traditional algorithms necessary. Every RTDDB system has different requirements. Every RTDDB system has different requirements. No single excellent implementation for all systems. No single excellent implementation for all systems.