Kjell Orsborn 2015-11-13 1 UU - DIS - UDBL DATABASE SYSTEMS - 10p Course No. 2AD235 Spring 2002 A second course on development of database systems Kjell.

Presentation transcript:

Kjell Orsborn UU - DIS - UDBL
DATABASE SYSTEMS - 10p
Course No. 2AD235, Spring 2002
A second course on development of database systems
Kjell Orsborn, Uppsala Database Laboratory, Department of Information Technology, Uppsala University, Uppsala, Sweden

Introduction to Distributed DBMSs (Elmasri/Navathe ch. 24)
Distributed DBMS (ch and 24.5 are omitted)
Kjell Orsborn, Uppsala Database Laboratory, Department of Information Technology, Uppsala University, Uppsala, Sweden

Distributed DBMSs
A distributed database (DDB) is a collection of several logically interrelated databases distributed over a computer network comprising a number of computers (nodes).
A distributed database management system (DDBMS) is a software system that permits management of DDBs and makes the distribution transparent to the user.
A DDB is not:
–a collection of files (a DDB needs structure and a DB manager)
–a client-server interface to a database
  data on one node, clients on other nodes in the network
  (almost) every centralized DBMS has a client-server interface

Background
What is a Distributed System?
A Distributed System is a number of autonomous computers communicating over a network with software for integrated tasks.
Examples of Distributed Systems:
–SUN’s Network File System (NFS), a distributed file system

Distributed DBMSs...
[Figure: two panels, each showing Node 1 to Node 5 connected by a communication network. Panel 1: centralized database in a network. Panel 2: distributed database over several nodes in a network.]

Centralized Database Server
Stream (row-by-row) based client-server interfaces
–DBMS-specific interfaces
–Compiler-integrated interfaces (embedded SQL)
–ODBC: SQL-based standardized subroutine call library (Microsoft)
–JDBC: ODBC for Java (not Microsoft)
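
The stream-based (row-by-row) style of interface listed above can be illustrated with a cursor loop. The sketch below is only an analogy: it uses Python's built-in `sqlite3` module, which follows the Python DB-API rather than ODBC or JDBC and runs embedded instead of client-server. The `employee` table and its rows are hypothetical.

```python
import sqlite3

# An in-memory database stands in for the central database server.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany(
    "INSERT INTO employee VALUES (?, ?)",
    [(1, "Ann"), (2, "Bo")],  # hypothetical sample rows
)

# The client submits one SQL statement and then consumes the result
# as a stream, one row at a time, through a cursor.
cursor = conn.execute("SELECT id, name FROM employee ORDER BY id")
rows = []
for row in cursor:
    rows.append(row)

conn.close()
```

ODBC and JDBC expose the same pattern (execute a statement, then fetch rows from a result set), although their call names and connection handling differ.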

Distributed Databases
Database seen as one unit; queries and updates go to ONE database.
Data in the database is transparently distributed over many DB nodes.
Manual partitioning or fragmentation of data tables.
The DBMS automatically optimizes queries and updates to the distributed database.

Multi-Databases
Database seen as several heterogeneous units.
Multi-database query language needed to combine data from the databases.
Primitives needed to integrate (combine, fuse) data from the databases.
Special query optimization techniques to deal with heterogeneity and dynamism.

Example of Multi-Database
Automatic Teller Machines (ATMs)

Fragmentation of data
data fragmentation (= data partitioning)
division of data sets (e.g. a relation) into several pieces (fragments)
fragments are transparently stored on several different nodes
increased accessibility and performance
several types of fragmentation:
–horizontal fragmentation
–vertical fragmentation
–mixed fragmentation
good when nodes are far apart
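
A minimal sketch of horizontal and vertical fragmentation, assuming a hypothetical `EMPLOYEE` relation held as a list of Python dictionaries. The relation, its columns, and the helper names are illustrative and do not come from the slides. Horizontal fragments are reassembled by union, vertical fragments by a join on the key.

```python
# Hypothetical relation; column names and values are illustrative only.
EMPLOYEE = [
    {"id": 1, "name": "Ann", "dept": "R&D", "salary": 5200},
    {"id": 2, "name": "Bo", "dept": "Sales", "salary": 4100},
    {"id": 3, "name": "Cy", "dept": "R&D", "salary": 4800},
]

def horizontal_fragment(rows, predicate):
    """Keep whole rows that satisfy the fragment's defining predicate."""
    return [dict(r) for r in rows if predicate(r)]

def vertical_fragment(rows, key, columns):
    """Keep a subset of columns; the key is repeated in every fragment."""
    return [{c: r[c] for c in (key, *columns)} for r in rows]

def union(*fragments):
    """Reassemble a relation from its horizontal fragments."""
    return sorted((r for f in fragments for r in f), key=lambda r: r["id"])

def join_on_key(left, right, key):
    """Reassemble a relation from two vertical fragments."""
    right_by_key = {r[key]: r for r in right}
    return [{**l, **right_by_key[l[key]]} for l in left]

# Horizontal fragmentation: split by department.
emp_rd = horizontal_fragment(EMPLOYEE, lambda r: r["dept"] == "R&D")
emp_other = horizontal_fragment(EMPLOYEE, lambda r: r["dept"] != "R&D")

# Vertical fragmentation: split public columns from salary data.
emp_public = vertical_fragment(EMPLOYEE, "id", ["name", "dept"])
emp_pay = vertical_fragment(EMPLOYEE, "id", ["salary"])

rebuilt_horizontal = union(emp_rd, emp_other)
rebuilt_vertical = join_on_key(emp_public, emp_pay, "id")
```

Mixed fragmentation would apply one helper to the output of the other, for example a vertical split of `emp_rd`.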

Replication of data
copies of the same data on several nodes
increased reliability and access performance
more complex updating, transaction handling, and recovery
–updates must be propagated to each replica!
–special procedures after failures to restore consistency
–more problematic transaction synchronization!
types of replication:
–full replication (whole db at each node)
–no replication (each fragment at only one node)
–partial replication (certain fragments replicated)
–not necessary to replicate all tables
–full replication is often not realistic!
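
The update-propagation and recovery points above can be sketched with a toy write-all, read-one scheme. This is an illustrative assumption rather than the protocol of any particular DDBMS: the `Node` and `ReplicaManager` classes are hypothetical, and the sketch ignores concurrency control and atomic commitment.

```python
class Node:
    """One site holding a full copy of the data (hypothetical model)."""
    def __init__(self, name):
        self.name = name
        self.up = True
        self.data = {}
        self.missed = []  # updates that arrived while the node was down

class ReplicaManager:
    def __init__(self, nodes):
        self.nodes = list(nodes)

    def write(self, key, value):
        # Every update must be propagated to each replica.
        for node in self.nodes:
            if node.up:
                node.data[key] = value
            else:
                node.missed.append((key, value))

    def read(self, key):
        # Any available replica can serve a read.
        for node in self.nodes:
            if node.up:
                return node.data.get(key)
        raise RuntimeError("no replica available")

    def recover(self, node):
        # Special procedure after a failure: replay the missed updates
        # in order so the replica is consistent before it rejoins.
        for key, value in node.missed:
            node.data[key] = value
        node.missed.clear()
        node.up = True

nodes = [Node("n1"), Node("n2"), Node("n3")]
manager = ReplicaManager(nodes)
manager.write("balance", 100)
nodes[1].up = False                    # n2 crashes
manager.write("balance", 80)           # n2 misses this update
stale = nodes[1].data["balance"]       # n2 still holds the old value
manager.recover(nodes[1])              # n2 catches up and rejoins
```

Reads stay available while one node is down, which is the reliability gain; the price is the bookkeeping needed to restore consistency afterwards.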

Transparency in a DDBMS
By transparency we here mean the hiding of basic implementation details from one abstraction level to another.
Data independence
–logical data independence
–physical data independence
Network transparency
–protects the user from operational details of the network
–hides the existence of a network
–no machine names in database table references
  location transparency
  naming transparency
Replication transparency
–the user should not be aware of data replicas
–automatic handling of updates, such as propagation to replicas
–automatic handling of node crashes
Fragmentation transparency
–hides the existence of fragments, e.g. that a logical relation is horizontally fragmented into local physical tables
–handles the transformation of global queries into fragment queries
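
Fragmentation and location transparency can be sketched as a small catalog lookup: the caller names only the logical relation, and the system decides which fragments on which nodes to consult. The catalog layout, node names, and `select` function below are hypothetical and serve only as an illustration.

```python
# Hypothetical catalog: each logical relation maps to its horizontal
# fragments, with the node that stores each one and the department
# value that defines it.
FRAGMENTS = {
    "EMPLOYEE": [
        {"node": "node1", "dept": "R&D",
         "rows": [{"id": 1, "name": "Ann", "dept": "R&D"},
                  {"id": 3, "name": "Cy", "dept": "R&D"}]},
        {"node": "node2", "dept": "Sales",
         "rows": [{"id": 2, "name": "Bo", "dept": "Sales"}]},
    ]
}

def select(relation, dept=None):
    """Answer a global query without the caller naming a node or fragment.

    Returns the matching rows and, for illustration only, the list of
    nodes that had to be contacted.
    """
    result, visited = [], []
    for fragment in FRAGMENTS[relation]:
        if dept is not None and fragment["dept"] != dept:
            continue  # this fragment cannot contain matching rows
        visited.append(fragment["node"])
        result.extend(fragment["rows"])
    return result, visited

sales_rows, sales_nodes = select("EMPLOYEE", dept="Sales")
all_rows, all_nodes = select("EMPLOYEE")
```

The query on `dept="Sales"` touches only `node2`, while the unrestricted query becomes a union over both fragments; in neither case does the caller mention a machine name.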

Advantages of Distributed DBMSs...
Data sharing
–uniform interface and sharing of data through the DDBMS
–natural to distribute certain database applications
Increased reliability
–redundancy increases security and accessibility
–crashes are less severe (if the application does not depend on non-local data)
Local independence
–allows sharing of data but keeps local control of data
Improved performance
–avoids unnecessary data transfer
Expandability
–easy to add new nodes (scale-up is not always linear because of the central directory)
Local autonomy
–local control
–local policies

Problems with Distributed DBMSs...
Complexity
–database administration becomes more complex (such as recovery)
–increased complexity of system design, implementation and maintenance
Security
–harder to maintain security in a network; networking is a known problem
Distributed administration
–less control and more meetings
Cost
–hardware, software, development/maintenance

Problems with Distributed DBMSs...
Distributed schema management
–the schema is accessed whenever an SQL query is issued!
–global directory => central database becomes a hot spot
–local directories => data replication
–=> since the schema is not updated often but needs to be accessed very often, it is normally fully replicated by the DDBMS.
Distributed concurrency control
–consistency of replicas: mutual consistency
Distributed deadlock management
Reliability of DDBMS
–consistency of replicas
–bring up (fragmented) database at failed sites
OS support
–multiple layers of network software

Additional functionality required by DDBMS
Access to physically divided databases (schema management)
Handling of distribution and replication of data
–for example, which copy of the data should be used
Handling of consistency of replicated data
Handling of distributed queries
Handling of distributed transactions (over several network nodes)
Handling of recovery/restart from crashes (of nodes) and new types of errors such as communication errors/failures.

Distributed database design
Goal:
–to minimize the combined cost of maintaining data, while achieving efficient communication and good performance for transactions.
Problems:
–where (on which node or nodes) data and applications should be placed
–partitioning of data (splitting data into distributed partitions)
–replication of data (copies of data on several nodes)
–NP-complete optimization problem.
–distributed query processing
  automatically done by the distributed query processor of the DDBMS
  analyze query --> distributed execution plan
  factors:
  –data replication
  –data availability
  –communication costs
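
To make the communication-cost factor concrete, the sketch below compares three ways of executing a join whose result is needed at a third node, counting only bytes shipped over the network. All relation names, sizes, and node placements are assumptions chosen for illustration; a real distributed query processor would also weigh local processing cost, replication, and availability.

```python
# Assumed setup (illustrative only): EMPLOYEE is stored at node 1,
# DEPARTMENT at node 2, and the join result is needed at node 3.
SIZES = {
    "EMPLOYEE": 10_000 * 100,   # 10,000 rows of 100 bytes each
    "DEPARTMENT": 100 * 35,     # 100 rows of 35 bytes each
    "RESULT": 10_000 * 40,      # 10,000 result rows of 40 bytes each
}

# Bytes transferred by each candidate execution plan.
strategies = {
    "ship both relations to node 3 and join there":
        SIZES["EMPLOYEE"] + SIZES["DEPARTMENT"],
    "ship EMPLOYEE to node 2, join there, ship result to node 3":
        SIZES["EMPLOYEE"] + SIZES["RESULT"],
    "ship DEPARTMENT to node 1, join there, ship result to node 3":
        SIZES["DEPARTMENT"] + SIZES["RESULT"],
}

# Pick the plan that moves the fewest bytes.
best = min(strategies, key=strategies.get)

for plan, cost in sorted(strategies.items(), key=lambda item: item[1]):
    print(f"{cost:>9,} bytes  {plan}")
```

Under these assumed sizes, moving the small relation to the large one is cheapest, which is why data placement and query planning are treated as parts of the same design problem.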