

G063 - Distributed Databases

Learning Objectives:
By the end of this topic you should be able to:
• explain how databases may be stored in more than one physical location
• explain the methods by which this distribution may be carried out
• explain reasons why distribution would be carried out
• explain the security issues of distributed databases

Database storage:

A Distributed Database is:
• a single logical database
  – consisting of many entities
  – possibly used by many users for different purposes
• a database that is not stored in its entirety at a single physical location
• a database spread physically across a number of computers
  – computers could be in multiple locations
    ▪ buildings or sites
  – computers connected by a data communications link
    ▪ LAN and/or WAN

Why distribute a database:
• allows faster local queries
  – faster searching speeds up other network operations
  – due to some data queries being handled locally
    ▪ reduces network traffic
• improved reliability
  – data may be replicated at multiple sites
• allows for modular growth of the database
  – can easily add new sites and/or users
• user does not need to know where data is stored physically
  – looks like a single-location, centralised system to the user

Types of Distributed Database
• Replicated
• Centralised
• Partitioned

Replicated Database
• complete database is duplicated at each centre
• exact copy of the database is stored and accessed locally
• duplicated versions are usually read-only
  – transaction files are created of changes at each centre
• updates are made only on a master database
  – a ‘new’, updated copy of the database is sent to each centre
    ▪ at regular intervals
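The update cycle on this slide can be sketched in Python. This is only an illustration under simple assumptions, not how any particular DBMS implements replication; the class names `Master` and `Centre`, the function `scheduled_update`, and the sample data are all hypothetical.

```python
import copy


class Master:
    """Holds the only writable copy of the database (hypothetical class)."""

    def __init__(self, data):
        self.data = dict(data)

    def apply(self, transaction_file):
        # Each transaction is a (key, new_value) pair recorded at a centre.
        for key, value in transaction_file:
            self.data[key] = value

    def snapshot(self):
        # A fresh, independent copy to send out to a centre.
        return copy.deepcopy(self.data)


class Centre:
    """A site holding a read-only replica plus a file of pending changes."""

    def __init__(self, name, master):
        self.name = name
        self.replica = master.snapshot()
        self.transaction_file = []

    def read(self, key):
        # Queries are answered locally, without using the network.
        return self.replica.get(key)

    def record_change(self, key, value):
        # The replica itself is not modified; the change is only logged.
        self.transaction_file.append((key, value))


def scheduled_update(master, centres):
    # 1. Apply every centre's transaction file to the master database.
    for centre in centres:
        master.apply(centre.transaction_file)
        centre.transaction_file = []
    # 2. Send the new, updated copy back out to every centre.
    for centre in centres:
        centre.replica = master.snapshot()


master = Master({"widget": 10})
north = Centre("North", master)
south = Centre("South", master)

north.record_change("widget", 7)
stale = south.read("widget")       # read before the scheduled update
scheduled_update(master, [north, south])
fresh = south.read("widget")       # read after the scheduled update
```

In this sketch the value read at South before the scheduled update differs from the value read afterwards, which shows why replicas that are only refreshed at intervals can temporarily disagree.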

Replicated Database Advantages:
• reliability
  – data is always available locally
  – not reliant on the network or central server
  – work carries on even if some nodes are down
• fast response to searches
  – local access will be faster than WAN access
    ▪ data does not have to be transmitted over the network
• reduced network traffic at prime time
  – faster access to network if required

Replicated Database Disadvantages:
• additional local requirements for storage space
• additional time for update operations
• complexity and cost of updating
• data integrity issues
  – if replicated data is not updated simultaneously
  – local copies of data may be different

Centralised Database
• single database held centrally (possibly at Head Office)
• each node accesses the database through a network (WAN)
  – access available to all branches or offices
• an index to the central database is held locally at each node
  – speeds up queries/transactions
• booking systems need distributed access to a central database if they are to work effectively
  – sharing of up-to-date information is important
  – avoids double bookings
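One way to picture the local index is sketched below in Python. The slide does not say what the index contains, so this assumes it maps a search key (here a customer name) to a record id. The names `CENTRAL_DB` and `Node` and the sample records are hypothetical.

```python
# Central database held at head office: record id -> full record.
CENTRAL_DB = {
    101: {"customer": "Ahmed", "flight": "XY100", "seat": "12A"},
    102: {"customer": "Baker", "flight": "XY100", "seat": "12B"},
}


class Node:
    """A branch holding only an index to the central database."""

    def __init__(self, central_db):
        self.central_db = central_db
        # Assumed index layout: customer name -> record id.
        self.index = {rec["customer"]: rid for rid, rec in central_db.items()}
        self.fetches = 0  # counts requests sent to the central site

    def find(self, customer):
        record_id = self.index.get(customer)   # resolved locally
        if record_id is None:
            return None                        # no network request needed
        self.fetches += 1                      # one record crosses the network
        return self.central_db[record_id]


node = Node(CENTRAL_DB)
booking = node.find("Ahmed")
missing = node.find("Nobody")
```

Only the lookup that matches an index entry causes a request to the central site; the failed search is answered entirely at the node.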

Centralised Database Advantages:
• better security of data
  – one copy rather than several (replicated copies)
  – security handled centrally
• good data integrity
  – one copy rather than several
    ▪ always sharing the same data
• data can be updated in real time
  – data always up-to-date
• centralised backup
  – can be automated

Centralised Database Advantages (from June 2011 Q13 mark scheme):
• storage is only required at the central location for the centralised database (1)
  – the local indexes stored at each site take up far less memory (1)
• queries are processed locally (1)
  – this speeds up searches as only the required data is retrieved from the central location (1)
• less data traffic than complete centralisation (1)
  – as only data is sent and not the additional information/forms/reports structure (1)
• increased security (1)
  – only the central database needs increased security as that is where the data is stored (1)
• integrity of data not compromised (1)
  – as it is stored in only one location and there is only one database to update (1)
• centralised back-up of data (1)
  – management of backup is easier as it is just one person’s responsibility (1)

Centralised Database Drawbacks:
• a virus in the central system could spread throughout all sites
• possibility of update clashes
  – two sites trying to modify the same record at the same time
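An update clash can be illustrated with a short Python sketch. The version-number check used here is one common way of detecting such clashes and is not taken from the slides; `CentralDatabase`, `UpdateClash` and the record `"room-5"` are hypothetical.

```python
class UpdateClash(Exception):
    """Raised when a record changed after the site read it."""


class CentralDatabase:
    def __init__(self):
        # record id -> (version number, value)
        self.records = {}

    def insert(self, record_id, value):
        self.records[record_id] = (1, value)

    def read(self, record_id):
        return self.records[record_id]

    def write(self, record_id, version_read, new_value):
        current_version, _ = self.records[record_id]
        if current_version != version_read:
            # Another site has written since this site read the record.
            raise UpdateClash(f"record {record_id} was modified by another site")
        self.records[record_id] = (current_version + 1, new_value)


db = CentralDatabase()
db.insert("room-5", "free")

# Two sites read the same record at about the same time.
version_a, _ = db.read("room-5")
version_b, _ = db.read("room-5")

db.write("room-5", version_a, "booked by site A")   # first write succeeds

try:
    db.write("room-5", version_b, "booked by site B")
    clash_detected = False
except UpdateClash:
    clash_detected = True
```

Without such a check, the second write would silently overwrite the first, which is how a double booking could arise.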

Partitioned Database
• database is split into sections
• each node or site on the network stores local data
  – i.e. the section of the database that relates to that site
    ▪ e.g. the section of the database that relates to a single supermarket’s stock is stored at that site
• other (global) data is held centrally
  – changes to central data can be dealt with overnight by a batch update from the sites
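The overnight batch update can be sketched as follows in Python. This assumes the central (global) data is a company-wide stock total per item, which the slides do not specify; the `Site` and `Central` classes and the store names are hypothetical.

```python
class Site:
    """A supermarket holding only its own section of the database."""

    def __init__(self, name, stock):
        self.name = name
        self.stock = dict(stock)   # local partition: this site's stock only
        self.pending = []          # changes not yet sent to the centre

    def sell(self, item, quantity):
        self.stock[item] -= quantity             # local update, no network use
        self.pending.append((item, -quantity))   # queued for the batch update


class Central:
    """Holds the global data: assumed here to be stock totals per item."""

    def __init__(self, sites):
        self.totals = {}
        for site in sites:
            for item, quantity in site.stock.items():
                self.totals[item] = self.totals.get(item, 0) + quantity

    def batch_update(self, sites):
        # Run overnight: apply each site's queued changes, then clear them.
        for site in sites:
            for item, change in site.pending:
                self.totals[item] += change
            site.pending = []


site_a = Site("Store A", {"beans": 50})
site_b = Site("Store B", {"beans": 30})
central = Central([site_a, site_b])

site_a.sell("beans", 5)
before_batch = central.totals["beans"]   # central copy before the batch runs
central.batch_update([site_a, site_b])
after_batch = central.totals["beans"]    # central copy after the batch runs
```

Until the batch runs, the central total does not reflect the sale recorded at Store A, so central and local data can differ during the day.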

Horizontal partitioning
• involves putting different rows into different tables
• splitting the table into a number of smaller tables
  – on the basis of rows (records)
    ▪ i.e. specific field contents
• Example: branch offices in an organisation deal mostly with a set of local customers
  – Euston Road branch stores the fragment where contents of the Branch field = 'Euston Road'
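As a minimal sketch, the row-based split can be written in Python. The customer rows, the second branch name ("Camden") and the function `partition_horizontally` are made up for illustration; only the Euston Road branch comes from the slide.

```python
from collections import defaultdict

# A single logical customer table (sample data is hypothetical).
customers = [
    {"id": 1, "name": "A. Khan", "branch": "Euston Road"},
    {"id": 2, "name": "B. Jones", "branch": "Camden"},
    {"id": 3, "name": "C. Patel", "branch": "Euston Road"},
]


def partition_horizontally(rows, field):
    """Group whole rows into fragments according to the value of one field."""
    fragments = defaultdict(list)
    for row in rows:
        fragments[row[field]].append(row)
    return dict(fragments)


fragments = partition_horizontally(customers, "branch")
euston = fragments["Euston Road"]   # the fragment stored at Euston Road

# The full table is the union of all fragments.
rebuilt = sorted(
    (row for fragment in fragments.values() for row in fragment),
    key=lambda row: row["id"],
)
```

Every fragment keeps all of the columns; only the set of rows differs from site to site.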

Horizontal partitioning
• this table represents the database for an estate agency with 3 branches

Horizontal partitioning
• the database is horizontally partitioned
  – so that the data for each branch is stored on the server in that branch
  – this will speed up local queries
    ▪ e.g. Boldmere staff searching for properties in Boldmere

Horizontal partitioning
• this means that the data is stored like this: [figures showing each branch's fragment are not reproduced in the transcript]

Vertical partitioning
• dividing the table based on the different columns
• involves creating tables with fewer columns
  – using additional tables to store the remaining columns
• different columns of a table are located at different sites
  – e.g. stock descriptions (country of origin, supplier name) at one site and prices at another site
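The column-based split can be sketched in Python using the stock example from the slide. The sample rows and the helper functions `partition_vertically` and `rejoin` are hypothetical; the sketch assumes each fragment keeps the key column (`item_id`) so that the original rows can be reassembled.

```python
# A single logical stock table (sample data is hypothetical).
stock = [
    {"item_id": 1, "description": "Olive oil", "country": "Italy",
     "supplier": "Rossi", "price": 4.50},
    {"item_id": 2, "description": "Green tea", "country": "Japan",
     "supplier": "Sato", "price": 3.20},
]


def partition_vertically(rows, key, columns):
    """Keep the key plus the chosen columns from every row."""
    return [{key: row[key], **{col: row[col] for col in columns}} for row in rows]


def rejoin(left, right, key):
    """Reassemble full rows by matching the two fragments on the key."""
    right_by_key = {row[key]: row for row in right}
    return [{**row, **right_by_key[row[key]]} for row in left]


# Fragment held at one site: descriptive columns only.
descriptions = partition_vertically(
    stock, "item_id", ["description", "country", "supplier"]
)
# Fragment held at another site: prices only.
prices = partition_vertically(stock, "item_id", ["price"])

rebuilt = rejoin(descriptions, prices, "item_id")
```

A site that holds only the `descriptions` fragment never sees the price column, which matches the idea that only certain people see certain fields.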

Vertical partitioning (from June 2011 Q13 mark scheme):
• only certain people see certain fields
  – e.g. financial matters not revealed to all (1)
• to conform to the law/DPA (1)
  – keeping personal information private (1)
• reduces amount of data being sent between locations (1)
  – in order to speed up data transfer (1)
  – allowing faster reaction time (1)
  – meaning rescue reaches emergency quicker (1)

Partitioned Database Advantages:
• speed
  – faster access to local data
    ▪ less network access required
• local control over local data
• scalability
  – can add new sites as required
• not reliant on network or server for day-to-day tasks
• each partition can have its own transaction log
  – local reporting (access/sales)

Partitioned Database Drawbacks:
• data inconsistency
  – possibility of data held centrally differing from that held on a partition
  – regular batch update required to maintain consistency
• unsuitable for certain applications
  – if data changes at one node must be seen instantly by all nodes
    ▪ e.g. holiday bookings
• high network usage during update process
  – will slow down other network processes