Distributed databases

Slides:



Advertisements
Similar presentations
Database Systems: Design, Implementation, and Management
Advertisements

ISOM Distributed Databases Arijit Sengupta. ISOM Learning Objectives Understand the concept and necessity of distributed databases Understand the types.
Distributed Databases John Ortiz. Lecture 24Distributed Databases2  Distributed Database (DDB) is a collection of interrelated databases interconnected.
Transaction.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Chapter 13 (Web): Distributed Databases
Manajemen Basis Data Pertemuan 9 Matakuliah: M0264/Manajemen Basis Data Tahun: 2008.
1 Minggu 12, Pertemuan 23 Introduction to Distributed DBMS (Chapter , 22.6, 3rd ed.) Matakuliah: T0206-Sistem Basisdata Tahun: 2005 Versi: 1.0/0.0.
Distributed Databases Logical next step in geographically dispersed organisations goal is to provide location transparency starting point = a set of decentralised.
Chapter 25 Distributed Databases and Client-Server Architectures Copyright © 2004 Pearson Education, Inc.
ABCSG - Distributed Database 1 Data Management Distributed Database Data Replication.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Distributed Database Management Systems
Chapter 9 : Distributed Database.
Overview Distributed vs. decentralized Why distributed databases
1 © Prentice Hall, 2002 Chapter 13: Distributed Databases Modern Database Management 6 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
Distributed Database Management Systems
Chapter 12 Distributed Database Management Systems
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 17 Client-Server Processing, Parallel Database Processing,
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Centralized Systems Run on a single computer system and do not interact with other computer.
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
Outline Introduction Background Distributed Database Design
Distributed databases
Alexandria Dodd Janelle Toungett
DATABASE MANAGEMENT SYSTEMS 2 ANGELITO I. CUNANAN JR.
Distributed Databases
Distributed Databases and DBMSs: Concepts and Design
Distributed Database The University of California Berkeley Extension Copyright © 2011 Patrick McDermott.
1 Distributed and Parallel Databases. 2 Distributed Databases Distributed Systems goal: –to offer local DB autonomy at geographically distributed locations.
12 1 Chapter 12 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Database Design – Lecture 16
III. Current Trends: 1 - Distributed DBMSsSlide 1/32 III. Current Trends Part 1: Distributed DBMSs: Concepts and Design Lecture 12 (2 hours) Lecturer:
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
DISTRIBUTED DATABASES IN ADBMS Shilpa Seth
Session-9 Data Management for Decision Support
Lecture 5: Sun: 1/5/ Distributed Algorithms - Distributed Databases Lecturer/ Kawther Abas CS- 492 : Distributed system &
Session-8 Data Management for Decision Support
10 1 Chapter 10 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 12 Distributed Database Management Systems.
Database Systems: Design, Implementation, and Management Ninth Edition Chapter 12 Distributed Database Management Systems.
Week 5 Lecture Distributed Database Management Systems Samuel ConnSamuel Conn, Asst Professor Suggestions for using the Lecture Slides.
Distributed Database Systems Overview
DDBMS Distributed Database Management Systems Fragmentation
Distributed Databases DBMS Textbook, Chapter 22, Part II.
Kjell Orsborn UU - DIS - UDBL DATABASE SYSTEMS - 10p Course No. 2AD235 Spring 2002 A second course on development of database systems Kjell.
Distributed Databases
Chapter 12 Distributed Database Management Systems.
ASMA AHMAD 28 TH APRIL, 2011 Database Systems Distributed Databases I.
1 Distributed Databases BUAD/American University Distributed Databases.
Databases Illuminated
Distributed database system
CS742 – Distributed & Parallel DBMSM. Tamer Özsu Page 1.1 Outline Introduction & architectural issues What is a distributed DBMS Problems Current state-of-affairs.
Topic Distributed DBMS Database Management Systems Fall 2012 Presented by: Osama Ben Omran.
MBA 664 Database Management Systems Dave Salisbury ( )
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
Distributed Database Management Systems. Reading Textbook: Ch. 1, Ch. 3 Textbook: Ch. 1, Ch. 3 For next class: Ch. 4 For next class: Ch. 4 FarkasCSCE.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
Distributed DBMS, Query Processing and Optimization
1 Chapter 22 Distributed DBMS Concepts and Design CS 157B Edward Chen.
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
CMS Advanced Database and Client-Server Applications Distributed Databases slides by Martin Beer and Paul Crowther Connolly and Begg Chapter 22.
Distributed Databases
Distributed Databases
1 Chapter 22 Distributed DBMSs - Concepts and Design Simplified Transparencies © Pearson Education Limited 1995, 2005.
CHAPTER 25 - Distributed Databases and Client–Server Architectures
Distributed Database Concepts
Distributed Database Management Systems
Distributed Databases
Introduction of Week 14 Return assignment 12-1
Distributed Databases
Presentation transcript:

Distributed databases A brief introduction (Figure numbers may not be the same as in the book) Distributed databases

Distributed database concepts Distributed database (DDB) Collection of multiple logically interrelated databases distributed over a computer network Distributed database management systems (DDBMS) Software systems managing a distributed database, making distribution transparent to the users Distributed databases

Distributed databases Transparency Hiding implementation details from the users of the database Data organization transparency Location transparency Use does not depend on location Naming transparency Naming is independent from location Replication transparency Copies can be kept for availability, performance, and availability User are unaware of the existence of these copies Fragmentation transparency One table is divided into more locations Horizontal fragmentation Table divided by rows Vertical fragmentation Table divided by columns Distributed databases

Example: Replication and horizontal fragmentation Distributed databases

Reliability and Availability Two common advantages of distributed databases Reliability The probability that a system is running at a certain time point Availability The probability that a system is continuously available during a time interval Distributed databases

Advantages of distributed databases Improved ease and flexibility of application development Transparency: Developers do not have to know … Increased reliability and availability Faults are isolated to a single site Improved performance Data localization, means less network traffic Parallelism Easier expansion Easy to add more data, processors, etc. Distributed databases

Types of distributed database systems Degree of homogeneity Homogeneous: All local DBMSs run identical software Heterogeneous: Local DMBSs run different software Autonomy Local autonomy: Local site can function as a standalone DBMS No autonomy: Local site can not function as a standalone DBMS Distributed databases

Classification of distributed databases

Database system architectures Distributed databases

Distributed databases General architecture Distributed databases

Component architecture of distributed databases

Distributed databases Data fragmentation Which site should store which portion of the database? Simple fragmentation Each site has a whole relation Horizontal fragmentation Subset of rows in each site Sometimes based on location Vertical fragmentation Subset of columns in each site Primary key must be in all sites Mixed / hybrid fragmentation Horizontal + vertical fragmentation Described by fragmentation schema Distributed databases

Example fragmentation Distributed databases

Example fragmentation, continued Distributed databases

Distributed databases Data replication Replication to improve availability Fully replicated database All data is replicated to each site Non replication All data is stored at exactly one site Partial replication Some data is replicated to some sites Described by replication schema Distributed databases

Distributed query processing Query mapping Query mapped from SQL to relational algebra using the global conceptual schema Localization Map query on the global schema to separate queries on the local schemas Using fragmentation and replication information Global query optimization Cost = CPU time + I/O time + communication time Local query optimization Same as in centralized databases Distributed databases

Distributed transaction management, Two-phase commit protocol (2PC) Global transaction manager / coordinator Coordinates the results of local transaction managers. All local transaction managers must be able to ”commit”, before actually doing the ”commit” Two-Phase commit protocol (2PC) Phase 1 Individual databases tell the coordinator that they have finished transaction All individual databases have finished: Coordinator sends ”prepare for commit” to all databases Individual databases answer ”read to commit” or ”cannot commit” Phase 2 If all databases answered ”ready to commit”, coordinator sends ”commit” to all databases If one (or more) databases answered ”cannot commit”, coordinator sends ”abort” to all databases. Timeout: if one (or more) databases does not answer within a given amount of time, coordinator sends ”abort”. Distributed databases

Two-phase commit protocol (2PC) Problems with 2PC Coordinator crashes: All participating sites are waiting No way of knowing whether participating sites really got the ”commit” / ”abort” Distributed databases

Three-phase commit (3PC) Distributed databases