Chapter 17: Additional Slides February 6, 2016. Outline Physical Data Management  Fragments  Distributed Query Processing  Transactions Logical Data.

Slides:



Advertisements
Similar presentations
Database Systems: Design, Implementation, and Management
Advertisements

Chapter 10: Designing Databases
V. Megalooikonomou Distributed Databases (based on notes by Silberchatz,Korth, and Sudarshan and notes by C. Faloutsos at CMU) Temple University – CIS.
ISOM Distributed Databases Arijit Sengupta. ISOM Learning Objectives Understand the concept and necessity of distributed databases Understand the types.
Enterprise Systems Distributed databases and systems - DT
Distributed Databases John Ortiz. Lecture 24Distributed Databases2  Distributed Database (DDB) is a collection of interrelated databases interconnected.
Distributed databases
Chapter 13 (Web): Distributed Databases
Advanced Database Systems September 2013 Dr. Fatemeh Ahmadi-Abkenari 1.
Chapter 25 Distributed Databases and Client-Server Architectures Copyright © 2004 Pearson Education, Inc.
1 Distributed Databases Chapter Two Types of Applications that Access Distributed Databases The application accesses data at the level of SQL statements.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Distributed Database Management Systems
Distributed DBMSPage 4. 1© 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background  Distributed DBMS Architecture  Datalogical Architecture.
Overview Distributed vs. decentralized Why distributed databases
1 Distributed Databases Chapter What is a Distributed Database? Database whose relations reside on different sites Database some of whose relations.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 17 Client-Server Processing, Parallel Database Processing,
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
Distributed Databases
Client-Server Processing and Distributed Databases
1 Distributed and Parallel Databases. 2 Distributed Databases Distributed Systems goal: –to offer local DB autonomy at geographically distributed locations.
Database Design – Lecture 16
DISTRIBUTED DATABASES IN ADBMS Shilpa Seth
04/18/2005Yan Huang - CSCI5330 Database Implementation – Distributed Database Systems Distributed Database Systems.
Database Systems: Design, Implementation, and Management
DISTRIBUTED DATABASE DESIGN
1 Database Systems: Design, Implementation, and Management CHAPTER 6 Database Design.
Session-9 Data Management for Decision Support
PMIT-6102 Advanced Database Systems By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
Client-Server Processing, Parallel Database Processing and Distributed Database Systems. KEVIN ROBERTS ANIKET MURLIDHARAN.
Massively Distributed Database Systems - Distributed DBS Spring 2014 Ki-Joune Li Pusan National University.
Why should a database transaction be atomic?. ABORT = Removal of the updates of a transaction An abort is implemented by a DBMS roll back recovery where.
Session-8 Data Management for Decision Support
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 12 Distributed Database Management Systems.
Database Systems: Design, Implementation, and Management Ninth Edition Chapter 12 Distributed Database Management Systems.
Week 5 Lecture Distributed Database Management Systems Samuel ConnSamuel Conn, Asst Professor Suggestions for using the Lecture Slides.
Distributed systems and Distributed databases design Enterprise systems DT
Distributed Database Systems Overview
PMIT-6102 Advanced Database Systems By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
Announcements. Data Management Chapter 12 Traditional File Approach  Structure Field  Record  File  Fixed All records have common fields, and a field.
1 Distributed Databases (DDBs) Chap Distributed Databases Distributed Systems goal: –to offer local DB autonomy at geographically distributed locations.
Distributed Databases DBMS Textbook, Chapter 22, Part II.
Databases Illuminated
Distributed Database. Introduction A major motivation behind the development of database systems is the desire to integrate the operational data of an.
PMIT-6101 Advanced Database Systems By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Software School of Hunan University Database Systems Design Part III : Mapping ER Diagram to Relational Schema.
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
Distributed Database Management Systems. Reading Textbook: Ch. 1, Ch. 3 Textbook: Ch. 1, Ch. 3 For next class: Ch. 4 For next class: Ch. 4 FarkasCSCE.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
Distributed DBMS, Query Processing and Optimization
1 Chapter 22 Distributed DBMS Concepts and Design CS 157B Edward Chen.
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
Topics in Distributed Databases Database System Implementation CSE 507 Some slides adapted from Navathe et. Al and Silberchatz et. Al.
Distributed DBMSs – Concepts and Design Chapter 24 in Textbook.
CMS Advanced Database and Client-Server Applications Distributed Databases slides by Martin Beer and Paul Crowther Connolly and Begg Chapter 22.
1 Chapter 22 Distributed DBMSs - Concepts and Design Simplified Transparencies © Pearson Education Limited 1995, 2005.
Distributed Databases and Client-Server Architectures
CHAPTER 25 - Distributed Databases and Client–Server Architectures
Distributed Database Concepts
Outline Background Introduction Distributed DBMS Architecture
Distributed Database Management Systems
DISTRIBUTED DATABASE ARCHITECTURE
Database System Implementation CSE 507
MANAGING DATA RESOURCES
Distributed Databases
Introduction of Week 14 Return assignment 12-1
Distributed Databases
Presentation transcript:

Chapter 17: Additional Slides February 6, 2016

Outline Physical Data Management  Fragments  Distributed Query Processing  Transactions Logical Data Management  Transparency Conceptual Data Management

Physical Data Management Fragments What is a Fragment?  Vertical subset (project operation)  Horizontal subset (restrict operation)  Mixed Fragment (combination of project and restrict) A fragment may be allocated to a single or multiple sites Fragments may be replicated where  there is a primary fragment as a single site  Copies of the fragment are placed at multiple sites (secondary)

Physical Data Management Distributed Query Processing Involves both local (intra site) and global (inter site) optimization. Multiple optimization objectives The weighting of communication costs versus local processing costs depends on network characteristics. There are many more possible access plans for a distributed query.

Physical Data Management Distributed Query Processing cont’d Local vs. Global query processing  In Local, Queries are performed at a central server (single site)  In Global, Must decide which sites to access for the fragments May need to move fragments from site to site Multiple optimization is needed for Global  Due to the multiple sites and access plans Many possible access plans for Global  Choosing the best one may be difficult

Physical Data Management Distributed Query Processing cont’d Communication Costs  Communication Time (CT)  Fixed Message Delay (MD)  Variable Transmission Time (TT) CT = MD + TT MD = Number of Messages * Delay per message TT = Number of bits/Data rate

Physical Data Management Distributed Query Processing cont’d Global Query Example (p. 632)  List the order number, order date, product number, product name, product price, and order quantity for eastern orders with a specified customer number, date range, and product color. Four possible access plans

Physical Data Management Distributed Query Processing cont’d Access Plan 1  Move the Product table to the Tulsa site where the query is processed

Physical Data Management Distributed Query Processing cont’d Access Plan 2  Restrict the Product table at the Denver Site  Then move result to the Tulsa site to execute the remainder of the query

Physical Data Management Distributed Query Processing cont’d Access Plan 3  Perform join and restrictions of Eastern- Orders and Eastern Order-lines fragments at the Tulsa site  Then move result to Denver site to join with Product Table

Physical Data Management Distributed Query Processing cont’d Access Plan 4  Restrict the Product table at the Denver site  Move product numbers to Tulsa and do restrict/join  Then move result back to Denver to combine with Product table to get product names

Physical Data Management Distributed Query Processing cont’d Obviously many different access plans can be used to answer the same query Need to investigate  actual network costs  the local processing costs at each site to determine which access plan is the best

Physical Data Management Transactions – 2 Phase Commit Protocol 2 Phase Commit (2PC)  Ensures that all transactions are Atomic One site is selected as a Coordinator while other sites are Participants  Each Participant site execute a different part of the transaction  Two phases: Voting Phase and Decision Phase Figure 17.18, page 634

Physical Data Management Transactions – 2 Phase Commit Protocol Several Complications IF  Failures during recovery and Timeouts  Log records are lost  Coordinator fails  Etc. Several methods to resolve these, but out of scope for this class

Logical Data Management Transparency Transparency is related to data independence. With transparency, users can write queries with no knowledge of the distribution, and distribution changes will not cause changes to existing queries and transactions. Without transparency, users must reference some distribution details in queries and distribution changes can lead to changes in existing queries.

Logical Data Management Fragmentation Transparency Fragmentation transparency provides the highest level of data independence. Users formulate queries and transactions without knowledge of fragments (locations, or local formats). If fragments change, queries and transactions are not affected. Table 17.6, p 626

Logical Data Management Location Transparency Location transparency provides a lesser level of data independence than fragmentation transparency. Users need to reference fragments in formulating queries and transactions. However, knowledge of locations and local formats is not necessary. Table 17.7, p 627

Conceptual Data Management Schema Integration Multiple types of schemas may exist to describe the same dataset Integrate multiple schemas into a single schema Best explained using an Exercise

Database III – E/R Model Entities Engineer(Engineer No, name, title, salary) Project(PNo, project name, budget, location) Client(Client Name, Address) Relationships Engineer Works_In Project : (Responsibility, Duration) Project Contract_By Client : (Contract Date) Conceptual Data Management Schema Integration - Exercise Engineering Database – Relational Schema E(eno, ename, title), p.k. = eno J(jno, jname, budget, loc, cname), p.k. = jno G(eno, jno, resp, dur), p.k. = eno, jno S(title, sal), p.k. = title Employee Database – CODASYL Schema Department(dept-name, budget, manager) Employee(e#, name, address, title, salary) Department Employs Employee (1:N relationship) First, find the common entities and relationships between schemas

Database III – E/R Model Entities Engineer(Engineer No, name, title, salary) Project(PNo, project name, budget, location) Client(Client Name, Address) Relationships Engineer Works_In Project : (Responsibility, Duration) Project Contract_By Client : (Contract Date) Conceptual Data Management Schema Integration - Exercise Engineering Database – Relational Schema E(eno, ename, title), p.k. = eno J(jno, jname, budget, loc, cname), p.k. = jno G(eno, jno, resp, dur), p.k. = eno, jno S(title, sal), p.k. = title Employee Database – CODASYL Schema Department(dept-name, budget, manager) Employee(e#, name, address, title, salary) Department Employs Employee (1:N relationship) Second, draw the conceptual diagram for the common entities and relationships

Conceptual Data Management Schema Integration No single correct solution to this exercise It may be seen that there are multiple solutions to this problem Considered a very HARD problem Often hard to find the best synonyms especially from a large set of schemas

Weekly Exercise Questions 2, 4, and 5