G063 - Distributed Databases

Slides:



Advertisements
Similar presentations
Database Systems: Design, Implementation, and Management
Advertisements

Distributed Databases John Ortiz. Lecture 24Distributed Databases2  Distributed Database (DDB) is a collection of interrelated databases interconnected.
Transaction.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Chapter 13 (Web): Distributed Databases
Advanced Database Systems September 2013 Dr. Fatemeh Ahmadi-Abkenari 1.
Toolbox Mirror -Overview Effective Distributed Learning.
Topic Denormalisation S McKeever Advanced Databases 1.
Distributed Databases Logical next step in geographically dispersed organisations goal is to provide location transparency starting point = a set of decentralised.
DISTRIBUTED DATABASE. Centralized & Distributed Database  Single site database – centralized database –A database is located at a single site or distributed.
Chapter 9 : Distributed Database.
Overview Distributed vs. decentralized Why distributed databases
Organizing Data & Information
 You will be able to: › Explain how databases may be stored in more than one physical location and how distribution may be carried out using different.
Distributed Databases and Query Processing. Distributed DB’s vs. Parallel DB’s Many autonomous processors that may participate in database operations.
PARTITIONING “ A de-normalization practice in which relations are split instead of merger ”
Chapter 3 : Distributed Data Processing
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
Distributed Databases
Distributed Databases
Distributed Database and Replication. Distributed Database A logically interrelated collection of shared data and a description of this data physically.
It refers to the software used to manage the database.
Introduction to Databases and Database Languages
IMS 4212: Distributed Databases 1 Dr. Lawrence West, Management Dept., University of Central Florida Distributed Databases Business needs.
12 1 Chapter 12 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
By: M.Nadeem Akhtar1 The Data Base Management System (DBMS) Ch # 09.
1 © Prentice Hall, 2002 Physical Database Design Dr. Bijoy Bordoloi.
Database Design – Lecture 16
CORE 2: Information systems and Databases CENTRALISED AND DISTRIBUTED DATABASES.
Definitions What is a network? A series of interconnected computers, linked together either via cabling or wirelessly. Often linked via a central server.
Communication & Information Systems. ICT5 Centralised processing systems Large central mainframe the norm into the seventies All processing carried out.
Lecture 5: Sun: 1/5/ Distributed Algorithms - Distributed Databases Lecturer/ Kawther Abas CS- 492 : Distributed system &
- Ahmad Al-Ghoul Data design. 2 learning Objectives Explain data design concepts and data structures Explain data design concepts and data structures.
G063 - Distributed Databases. Learning Objectives: By the end of this topic you should be able to: explain how databases may be stored in more than one.
INFORMATION MANAGEMENT Unit 2 SO 4 Explain the advantages of using a database approach compared to using traditional file processing; Advantages including.
Database Management COP4540, SCS, FIU Physical Database Design (ch. 16 & ch. 3)
FILES AND DATABASES. A FILE is a collection of records with similar characteristics, e.g: A Sales Ledger Stock Records A Price List Customer Records Files.
Oracle's Distributed Database Bora Yasa. Definition A Distributed Database is a set of databases stored on multiple computers at different locations and.
1 Distributed Databases BUAD/American University Distributed Databases.
Distributed Database. Introduction A major motivation behind the development of database systems is the desire to integrate the operational data of an.
DISTRIBUTED DATABASES JORGE POMBAR. Overview Most businesses need to support databases at multiple sites. Most businesses need to support databases at.
INFO1408 Database Design Concepts Week 16: Introduction to Database Management Systems Continued.
CS338Parallel and Distributed Databases11-1 Parallel and Distributed Databases Lecture Topics Multi-CPU and distributed systems Monolithic system Client–server.
Topic Distributed DBMS Database Management Systems Fall 2012 Presented by: Osama Ben Omran.
Use of ICT in Data Management AS Applied ICT. Back to Contents Back to Contents.
CIS 250 Advanced Computer Applications Database Management Systems.
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
Retele de senzori Curs 2 - 1st edition UNIVERSITATEA „ TRANSILVANIA ” DIN BRAŞOV FACULTATEA DE INGINERIE ELECTRICĂ ŞI ŞTIINŢA CALCULATOARELOR.
1 Lecture1 Introduction to Databases Systems Database 1.
Distributed Databases
( ) 1 Chapter # 8 How Data is stored DATABASE.
DATABASE’S Dave McDonald Student No /05/2016 Unit 10 task 1.
LM 9. Distributed Database Dr. Lei Li 1. Note: The content of the slides including figures are mainly based on a publicly available textbook chapter:
© 2017 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner.
UNIT 4 NETWORK ENVIRONMENTS.
Distributed Database Management Systems
Physical Database Design
Physical Changes That Don’t Change the Logical Design
Physical Database Design and Performance
Chapter 19: Distributed Databases
Replication.
CHAPTER 5: PHYSICAL DATABASE DESIGN AND PERFORMANCE
Teaching slides Chapter 8.
Physical Database Design
Distributed Databases
Objectives Explain the role of computers in client-server and peer-to-peer networks Explain the advantages and disadvantages of client- server and peer-to-peer.
Communication and the Internet
Database management systems
Presentation transcript:

G063 - Distributed Databases “Sharing is good!”

Learning Objectives: By the end of this topic you should be able to: explain how databases may be stored in more than one physical location explain the methods by which this distribution may be carried out explain reasons why distribution would be carried out explain the security issues of distributed databases

A Distributed Database is: a database not stored in its entirety at a single location database spread across a number of computers in multiple locations computers connected by a data communications link LAN and/or WAN e.g. Tesco - stock & sales databases Thomas Cook travel agent – holiday database

3 Types of Distributed Database Replicated Centralised Partitioned

Replicated database: = a copy of the complete database at each site

Replicated Database a copy of the complete database at each site exact copy of the database stored & accessed locally transaction files created of changes at each centre updates made on a master database a ‘new’, updated copy of database sent to each centre

Replicated Database Advantages: reliability fast response to searches data is always available locally not reliant on the network or central server fast response to searches local access will be faster than WAN access

Replicated Database Disadvantages: data integrity issues local copies of data may be different to each other need additional local storage space requirements additional time required for update operations

Centralised database:

Centralised Database single database held centrally (possibly at Head Office) an index to the central database is held locally speeds up queries each site accesses database through a WAN Example: Booking systems need distributed access to a central database sharing of up-to-date information important, avoids double bookings.

Centralised Database Advantages: better security of data one copy rather than several (replicated copies) good data integrity one copy rather than several always sharing the same data data always up-to-date data is updated in real time centralised backup can be automated

Centralised Database Drawbacks: a virus in the central system could spread throughout all sites possibility of update clashes two sites trying to modify the same record at the same time

Partitioned database:

Partitioned Database not every site needs to have all the data → give each site just the data that is relevant to that site database is split into sections (partitions) each site on the network stores its own data i.e. the part of the database that relates to that site, e.g. the part of the database that relates to a single supermarket’s stock is stored at that site, other (global) data is held centrally changes to central data can be dealt with overnight by a batch update from the sites,

Partitioned Database Advantages: speed: local control over local data faster access to local data local control over local data scalability can add new sites as required not reliant on network or server for day-to-day tasks

Partitioned Database Drawbacks: data inconsistency possibility of different data being held centrally to that on partition unsuitable for certain applications if data changes at one node must be instantly seen by all nodes e.g. holiday bookings high network usage during update process will slow down other network processes

Partitioned Database Two types: Horizontal partitioning Vertical partitioning

Horizontal partitioning Example: branch offices deal mostly with a set of local customers e.g. Northampton branch stores the fragment where contents of the Branch field = ‘Northampton' So: split the table into number of smaller tables on the basis of rows (records) i.e. specific field contents each site (branch) stores just the table relevant to them

Horizontal partitioning this table represents the database for an estate agency with 3 branches

Horizontal partitioning the database is horizontally partitioned so that the data for each branch is stored on the server in that branch: this will speed up local queries Boldmere staff searching for properties in Boldmere

Horizontal partitioning this means that the data is stored like this:

Horizontal partitioning this means that the data is stored like this:

Vertical partitioning data is separated across sites based upon fields dividing the table based on the different columns different columns of a table located at different sites e.g. stock descriptions: item descriptions & prices at sales outlet item’s country of origin, supplier name at Head Office e.g. employee wages: staff name & hours worked at individual shop personal data stored at Head Office

Vertical partitioning Advantages: faster searching reduced amount of data being sent between sites access rights ensures that only certain people see certain fields help to conform to the DPA personal information kept separate from sales records

Vertical partitioning Disadvantages: potential exists for inconsistency in the data stored complex & time consuming to set up and modify regular backups essential since there is no data replication