Information Systems: Modelling Complexity with Categories Four lectures given by Nick Rossiter at Universidad de Las Palmas de Gran Canaria, 15th-19th.

Slides:



Advertisements
Similar presentations
Concepts of Database Management Seventh Edition
Advertisements

1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Managing Data Resources
Distributed DBMSs A distributed database is a single logical database that is physically distributed to computers on a network. Homogeneous DDBMS has the.
Overview Distributed vs. decentralized Why distributed databases
1 Lecture 13: Database Heterogeneity Debriefing Project Phase 2.
Data Management I DBMS Relational Systems. Overview u Introduction u DBMS –components –types u Relational Model –characteristics –implementation u Physical.
1 Lecture 13: Database Heterogeneity. 2 Outline Database Integration Wrappers Mediators Integration Conflicts.
The University of Akron Dept of Business Technology Computer Information Systems Database Management Approaches 2440: 180 Database Concepts Instructor:
Chapter 4 Relational Databases Copyright © 2012 Pearson Education, Inc. publishing as Prentice Hall 4-1.
DATABASE MANAGEMENT SYSTEMS 2 ANGELITO I. CUNANAN JR.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education 4-1.
Distributed Databases and DBMSs: Concepts and Design
Database Systems: Design, Implementation, and Management Ninth Edition
Chapter 1 Database Systems. Good decisions require good information derived from raw facts Data is managed most efficiently when stored in a database.
Distributed Database The University of California Berkeley Extension Copyright © 2011 Patrick McDermott.
1 Distributed and Parallel Databases. 2 Distributed Databases Distributed Systems goal: –to offer local DB autonomy at geographically distributed locations.
6-1 DATABASE FUNDAMENTALS Information is everywhere in an organization Information is stored in databases –Database – maintains information about various.
Research Topics in Computing Data Modelling for Data Schema Integration 1 March 2005 David George.
III. Current Trends: 1 - Distributed DBMSsSlide 1/32 III. Current Trends Part 1: Distributed DBMSs: Concepts and Design Lecture 12 (2 hours) Lecturer:
CSC271 Database Systems Lecture # 4.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS (Cont’d) Instructor Ms. Arwa Binsaleh.
Distributed Systems 1 CS- 492 Distributed system & Parallel Processing Sunday: 2/4/1435 (8 – 11 ) Lecture (1) Introduction to distributed system and models.
Database Technical Session By: Prof. Adarsh Patel.
Database System Concepts and Architecture
Session-9 Data Management for Decision Support
Lecture On Introduction (DBMS) By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
Lecture 5: Sun: 1/5/ Distributed Algorithms - Distributed Databases Lecturer/ Kawther Abas CS- 492 : Distributed system &
Concepts of Database Management Sixth Edition Chapter 9 Database Management Approaches.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Database A database is a collection of data organized to meet users’ needs. In this section: Database Structure Database Tools Industrial Databases Concepts.
MIS 673: Database Analysis and Design u Objectives: u Know how to analyze an environment and draw its semantic data model u Understand data analysis and.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 5 Data Resource Management.
Database Systems DBMS Environment Data Abstraction.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
INFO1408 Database Design Concepts Week 15: Introduction to Database Management Systems.
Oracle's Distributed Database Bora Yasa. Definition A Distributed Database is a set of databases stored on multiple computers at different locations and.
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
Distributed DBMSs- Concept and Design Jing Luo CS 157B Dr. Lee Fall, 2003.
1 By Paul Murray Claire McQuade Kashif Rafiq David Miller.
Distributed Databases
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS Instructor Ms. Arwa Binsaleh.
DATABASE MANAGEMENT SYSTEM ARCHITECTURE
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 1 Database Systems.
1 Chapter 1 Introduction to Databases Transparencies.
Distributed database system
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
1 Lecture 9: Distributed Databases – Principles and Architectures Advanced Databases CG096 Nick Rossiter.
Topic Distributed DBMS Database Management Systems Fall 2012 Presented by: Osama Ben Omran.
1 Lecture 7 Distributed Data Bases: Principles and Architectures.
Lecture 4: Handling Heterogeneity with Information Resource Dictionary Systems Nick Rossiter, Computing Science, Newcastle University, England
Object storage and object interoperability
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
1 Chapter 22 Distributed DBMS Concepts and Design CS 157B Edward Chen.
Lecture On Introduction (DBMS) By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
Data Models. 2 The Importance of Data Models Data models –Relatively simple representations, usually graphical, of complex real-world data structures.
1 © 2013 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the.
Managing Data Resources File Organization and databases for business information systems.
Data and Applications Security Developments and Directions
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
MANAGING DATA RESOURCES
Chapter 5 Data Resource Management.
Distributed Databases
Data Model.
Data and Applications Security Developments and Directions
Distributed Database Management Systems
Database System Concepts and Architecture
Data and Applications Security Developments and Directions
Presentation transcript:

Information Systems: Modelling Complexity with Categories Four lectures given by Nick Rossiter at Universidad de Las Palmas de Gran Canaria, 15th-19th May 2000, under the Socrates-Erasmus Programme

Lectures 1. Interoperability in Information Systems 2. Introduction to Category Theory 3. Object Concepts as Categories 4. Handling Heterogeneity with Information Resource Dictionary System

Lecture 1: Interoperability in Information Systems Nick Rossiter, Computing Science, Newcastle University, England

Motivations Diversity of modelling techniques Distributed businesses may exercise local autonomy in platforms Data warehousing requires heterogeneous systems to be connected Data mining enables new rules to be derived from heterogeneous collections

Basic Definitions 1 Distribution: information bases are stored on multiple computer systems interconnected by a communication medium. Homogeneous system: one that adheres to the same software at all sites. Heterogeneous system: one that does not adhere to the same software at all sites.

Basic Definitions 2 Autonomy: the ability of a site to control its own activities with respect to one or more of: –design –communication –execution –association

Interoperability 1 Interoperability: the ability to request and receive services between various systems and use their functionality. More than data exchange. Implies a close integration.

Interoperability 2 Features: exchange of messages and requests use of each other’s functionality client-server abilities distribution operate multiple systems as single unit communication despite incompatibilities extensibility and evolution

Architectures for Interoperability 1 1. Global schema integration Produces single new schema (C) for the different information systems with schemas (A, B). A C B

Global Schema Integration Advantages –Transparent to end users -- appears as single information system Disadvantages –Difficult -- needs human understanding to perform integration –Local autonomy lost –Static - does not evolve automatically

Architectures for Interoperability 2 2. Federated Database Systems Less tightly coupled schema (than in 1) Each service through an export schema specifies sharable objects Common data model Internal command language Decentralised control (local autonomy) Five-level architecture for federated system

Federated Databases: Loosely- coupled Created by users A E,B E are export V is view schema A,B are base schemas A B V AEAE BEBE

Federated Databases: Tightly- Coupled Created by administrators Global schema integration on all export schemas More formal than loosely-coupled Much effort to resolve semantic inconsistencies

Federated Database Systems - General Advantages Preserves local autonomy Not all data needs to be integrated Provides metadata structures for views (external and export schema, data dictionary)

Federated Database Systems - Disadvantages by Approach Tightly-coupled –similar to global schema integration 1) complex, difficult to make changes dynamically 2) much effort in resolving semantic inconsistencies Loosely-coupled –duplication by different users in building views –updating data defined in views can be difficult

Multidatabase Language Approach No attempt at schema integration Various schema in services provided can be heterogeneous, inconsistent and duplicate information in different ways. Language (e.g. MSQL) is used to integrate databases at run time. Relational data model used as Common Data model

Multidatabase Language Approach - Diagram A,B are schema MSQL is runtime language A B MSQL

Multidatabase Language Approach - Advantages No preparatory work to understand semantics of schema Dynamic -- access latest versions Very skilled users can succeed in reaching their goals Interesting work on multidatabase dependencies

Example Multidatabase Language MSQL (Multidatabase SQL) –Biased towards relational model –Illustrates problems Consider 2 databases –Each on publications of a computing society –And query: –“What is the name, , title for each publication of an author appearing in both of the society’s databases?”

MSQL - Schema Schema 1 (for AIIA): –Contacts (PersonID, Name, , …) –Conference (Name, Type, …) –Attendees(ID, Conf_ID, Speaker, …) –Publ_Papers(P_ID, Title, Author_ID, …) Schema 2 (for IFIP): –Member_Socs(Soc_Name, …) –Conf (Conf_ID, …) –Publ_Papers(P_Ref, Title, Conf_Ref, …) –Authors(Name, , Paper_ID, …) Underlined attributes are primary key; attributes in italics are foreign key.

MSQL for Query USE AIIA, IFIP SELECT Name, , Title FROM Authors, IFIP.Publ_Papers IFIP_Paper, Contacts, AIIA.Publ_papers AIIA_Paper WHERE Authors.Name = Contacts.Name AND Contacts.Person_ID = AIIA_Paper. Author_ID AND Authors.Paper_ID = IFIP_Paper.P_Ref; The USE statement declares the multidatabases which are aliased in the FROM statement to distinguish tables with the same name. Retrieves Name, and Title from both databases.

Potential Problems with MSQL Are domains on name comparable? Can use LET command to create equivalencies of names but does not solve domain mismatch. What if one schema not relational? Entity- Relationship model often used as neutral schema for translation and comparison of heterogeneous features

Multidatabase Language - Disadvantages in General Distribution is not transparent Users must resolve inconsistencies themselves Common language may restrict scope of heterogeneity (relational bias) Local autonomous system may change schema freely (so that existing queries fail)

Comparison of Approaches By coupling: –how tightly is the interoperable system connected to its underlying systems By adaptability: –the ability for the interoperable system to evolve in line with underlying schema By transparency: –the need for the end-user to understand the underlying schema

Comparison of Approaches Coupling AdaptabilityTransparency Approach Global Schema TightLowHigh Integration FederatedMediumMediumMedium Data Bases Multidatbase LowHighLow Languages

Summary Trend: From Global Schema Integration Federated Database Multidatabase Language of lower coupling, higher adaptability, and lower transparency.

Further Reading Management of Heterogeneous and Autonomous Database Systems Elmagarmid, Ahmed Rusinkiewicz, Marek Sheth, Amit Morgan Kaufmann 1999.