The Challenge of Interoperability: the Grid, Semantic Web and Categories Nick Rossiter Seminar -- Northumbria University, 13th November 2002

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Four-level Architecture for Closure in Interoperability EFIS, 17th-18th July 2003 Nick Rossiter & Michael Heather Informatics, Northumbria University
Semantic Web Thanks to folks at LAIT lab Sources include :
Database Systems: Design, Implementation, and Management Tenth Edition
©Silberschatz, Korth and Sudarshan1.1Database System Concepts Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 8 Slide 1 System models.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
©Silberschatz, Korth and Sudarshan1.1Database System Concepts Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition.
Introduction to Databases Transparencies
Chapter 5 Normalization Transparencies © Pearson Education Limited 1995, 2005.
Methodology Conceptual Database Design
Systems Engineering Foundations of Software Systems Integration Peter Denno, Allison Barnard Feeney Manufacturing Engineering Laboratory National Institute.
Course Instructor: Aisha Azeem
1 Chapter 2 Database Environment. 2 Chapter 2 - Objectives u Purpose of three-level database architecture. u Contents of external, conceptual, and internal.
Introduction to Databases Transparencies 1. ©Pearson Education 2009 Objectives Common uses of database systems. Meaning of the term database. Meaning.
From Classical to Quantum Databases with Applied Pullbacks Nick Rossiter Seminar – PSSL, 15th February 2003
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Database System Concepts, 5th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Chapter 1: Introduction.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Chapter 4 System Models A description of the various models that can be used to specify software systems.
System models Abstract descriptions of systems whose requirements are being analysed Abstract descriptions of systems whose requirements are being analysed.
Information Systems: Modelling Complexity with Categories Four lectures given by Nick Rossiter at Universidad de Las Palmas de Gran Canaria, 15th-19th.
Database System Concepts and Architecture Lecture # 2 21 June 2012 National University of Computer and Emerging Sciences.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Database System Concepts and Architecture
Introduction to Database Systems
Introduction to MDA (Model Driven Architecture) CYT.
Normalization Transparencies
Chapter 1 : Introduction §Purpose of Database Systems §View of Data §Data Models §Data Definition Language §Data Manipulation Language §Transaction Management.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Chapter 7 System models.
©Silberschatz, Korth and Sudarshan1.1Database System Concepts Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition.
System models l Abstract descriptions of systems whose requirements are being analysed.
Modified by Juan M. Gomez Software Engineering, 6th edition. Chapter 7 Slide 1 Chapter 7 System Models.
Sommerville 2004,Mejia-Alvarez 2009Software Engineering, 7th edition. Chapter 8 Slide 1 System models.
1 The Contravariancy of Anticipatory Systems Dimitrios Sisiaridis, Michael Heather & Nick Rossiter Northumbria University, Newcastle NE2 1XE, UK Symposium.
21st May 2004Informatics Research Conference, Northumbria University 1 Some Fundamental Questions in Databases Nick Rossiter (with Michael Heather)
Lecture 5 Normalization. Objectives The purpose of normalization. How normalization can be used when designing a relational database. The potential problems.
Chapter 10 Normalization Pearson Education © 2009.
Lecture 2: Introduction to Category Theory Nick Rossiter, Computing Science, Newcastle University, England
1 Chapter 1 Introduction to Databases Transparencies.
1 Generic Image Structures in Integrated Media Nick Rossiter & Michael Heather, University of Northumbria at Newcastle
1 Typing in Information Systems Nick Rossiter, Michael Heather Computer Science and Digital Technologies Northumbria University, Newcastle NE1 8ST, UK.
Lecture 4: Handling Heterogeneity with Information Resource Dictionary Systems Nick Rossiter, Computing Science, Newcastle University, England
Object storage and object interoperability
Jemerson Pedernal IT 2.1 FUNDAMENTALS OF DATABASE APPLICATIONS by PEDERNAL, JEMERSON G. [BS-Computer Science] Palawan State University Computer Network.
1 Chapter 2 Database Environment Pearson Education © 2009.
ASET 1 Amity School of Engineering & Technology B. Tech. (CSE/IT), III Semester Database Management Systems Jitendra Rajpurohit.
The Natural Metaphysics of Computing Anticipatory Systems Michael Heather Nick Rossiter University of Northumbria
Databases Salihu Ibrahim Dasuki (PhD) CSC102 INTRODUCTION TO COMPUTER SCIENCE.
Welcome: To the fifth learning sequence “ Data Models “ Recap : In the previous learning sequence, we discussed The Database concepts. Present learning:
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Database System Concepts, 5th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Chapter 1: Introduction.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
WOSC 2008 Wroclaw1 Problems of Interoperability in Information Systems Nick Rossiter and Michael Heather CEIS, Northumbria University, UK
CS 325 Spring ‘09 Chapter 1 Goals:
Chapter 2 Database Environment.
Chapter 1: Introduction
A Universal Technique for Relating Heterogeneous Data Models
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Chapter 2 Database Environment.
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment.
A Natural Basis for Interoperability
Database Systems Instructor Name: Lecture-3.
Categories for Information
Conditions for Interoperability
The Process Semantics of Time and Space as Anticipation
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment Pearson Education © 2009.
Presentation transcript:

The Challenge of Interoperability: the Grid, Semantic Web and Categories Nick Rossiter Seminar -- Northumbria University, 13th November

Interoperability 1 Interoperability: the ability to request and receive services between various systems and use their functionality. More than data exchange. Implies a close integration

Interoperability 2 Features: –exchange of messages and requests –use of each other’s functionality –client-server abilities –distribution –operate multiple systems as single unit –communication despite incompatibilities –extensibility and evolution

Motivations 1 Diversity of modelling techniques Distributed businesses may exercise local autonomy in platforms Data warehousing requires heterogeneous systems to be connected Data mining enables new dependencies to be derived from heterogeneous collections

Motivations 2 Pervasive Computing : networks supporting many diverse nodes to be driven by users specifying policy and function. –Policy: statements governing how a solution will be achieved. Statements are derived from requirements –Function: mechanism for achieving objectives. Mobile Computing: wireless networks

Motivations 3 E-Science –Distribution of functionality transparently across many different platforms Grid –Layered components available in network: –Computational, information and knowledge layers and probably more.

Motivations 4 Semantic Web –Correlating web resources –Use of Resource Description Framework (RDF) and XML –Ontologies (giving ‘meanings’ of terms)

Definition: Model (as it varies!) Model: a representation of policies in a structured form according to some perceived view of reality e.g. –Relational model – world is tabular –Hierarchical model – world is tree-like –Security model – world is task-based –Object model – world is based on o-o paradigm

Semantic Problems in Interoperability 1 People call different properties by different names People classify properties differently: –Different contexts e.g. colour is property in describing a car but a table to the paint shop. –Different normalization priorities e.g. many tables optimised for updates versus few tables optimised for searching.

Semantic Problems in Interoperability 2 People make use of facilities in different ways. For instance: –In SQL-92 can achieve uniqueness in tables by: Defining keys or Modifying table storage method on various properties or Defining a unique index –and in SQL-1999 we also have: object identifiers (oids) - So many legacy problems

Constraints and Types May differ between systems: –e.g. student ids may be held as: integers (leading zeros removed) integers (padded out with so many leading zeros) strings (fixed length) ‘ ’ –Ids may have checksum function or not

Semantic Problems in Interoperability 3 Structural problems are bad enough. But also: Functionality can be applied in many different ways: –Procedures or functions; –different module layout. Rules can be in: –Model structures, model coding, procedures or application programs.

Simple Problem in Interoperability 1 Two schemas in SQL-1999 AB author char(50)author_surname char(50) author_initials char(10) title varchar(300)title varchar(200) keyword set(char(30))keywd array(8) (char(30)) Note: homogeneous model -- both SQL but difficulties.

Different Standards For example -- Names: –Person(surname, first_name,..) –or Person(first_name, surname, …) –or Person(name, …) First two may easily be made equivalent but convention in third needs to be understood. Note also possibilities of A.N.Other, AN Other, A N Other.

Simple Problem in Interoperability 2 Homogeneous Models –the same information may be held as attribute name, relation name or a value in different databases –e.g. fines in library; could be held in a dedicated relation Fine(amount, borrowed_id) or as an attribute Loan(id, isbn, date_out, fine) or as a value Charge(1.25, ‘fine’)

Complex Problems in Interoperability Heterogeneous models Need to relate model constructions to one another, for example: – relate classes in object-oriented to user-defined types in object-relational All problems are magnified at this level.

Handling Naming, Classifying, Meta and MetaMeta Levels A Foundation for Heterogeneous Interoperability

Foundation Work Rather neglected in database area recently Moves by Date/Darwen/Codd to introduce ‘pure’ relational model handling objects in an orthogonal manner Open Database Group at UNN (now 8 members) looking at object-relational foundations Common objectives with work described here

Meta Data Meta means ‘about’ The basis of schema integration Sometimes treated as an object (MOF - Meta Object Facility) Better viewed as a relationship: –Name (data files) –Classify (database classes) –Meta (data dictionary) –MetaMeta (classify data dictionary )

Bottom Level -- Data Values (tuples) Student information: – Module Information: – Which students take which modules: –

Naming Data Values Student Name {, } Module Name {, } Modules_taken : Name {, } Name associates each value with a name

Relating Named Values to Types -- Classification Named application data: Schema data (types including constraints, rules) Student Name {, } Student ( id integer, name char(50), primary key id, constraint ‘id format’) Classify Classify associates each name with a type in the schema

Relating Types to Constructs - Data Dictionary Classified schema data (types): Constructions (e.g. table, column, function, key): Student ( id integer, name char(50), primary key id, constraint ‘id format’) {Table, Column, Function, Primary Key,...} Meta Meta associates each schema type with a construction e.g. Student to Table, id to Column, id to Primary Key

Relating Constructs to Abstractions Constructions (e.g. table) Abstractions (of real-world, open-ended, concepts) {Table, Column, Function, Foreign Key,..} {Aggregation, inheritance, association, …} MetaMeta MetaMeta associates each construction with an abstraction e.g. Table to Aggregation, Foreign Key to Association

Mappings are two-way MetaMeta Policy Meta Organize Classify Instantiate Concepts Constructs Schema Types Named Data Values Downward arrows are intension-extension pairs

Formalising the Architecture Requirements: –mappings within levels and across levels –bidirectional mappings –closure at top level –open-ended logic –relationships (product and coproduct) Candidate: category theory as used in mathematics as a workspace for relating different constructions

Choice: category theory Requirements: –mappings within levels and across levels arrows: function, functor, natural transformation –bidirectional mappings adjunctions –closure at top level four levels of arrow, closed by natural transformation –open-ended logic Heyting intuitionism –relationships (product and coproduct) Cartesian-closed categories (like 2NF): pullback and pushout

Categories Each level is represented by a category: –Named data values by DATA (DT) valuename –Schema types by SCHEMA (SM) –Constructions by CONSTRUCTS (CS) –Concepts by CONCEPTS (CC) Red font -- categories

Functors Relationships between categories at adjacent levels are given by a functor –For example: –Meta: SCHEMACONSTRUCTS –Meta is a functor Blue font -- functors

Levels in Functorial Terms MetaMeta CONCEPTS CONSTRUCTS S ystem Policy Model Meta InstantiateOrganize DATASCHEMA Classify Green font - composed functors: System = MetaMeta o Meta o Classify

Composition of Adjoint Functors Classify -- CMeta-- M MetaMeta -- A Policy -- POrganise -- O Instantiate -- I CCCSSMDT P OI AMC Composed adjunction

Adjunctions The adjointness between two functors is given by a 4-tuple e.g. for CCCS –  unit of adjunction measures change from initial cc to cc obtained by following P and A (1 CC AP(cc) ) –  counit of adjunction measures PA(cs)1 cs –Unit and counit give measure of creativity of arrows and preservation of style in mapping by functors. – If complete preservation of style (  =1) and no creativity (  =0) -- isomorphism. P A

Composed Adjunction for Four Levels Represents complex mappings across the levels of the system

Benefits of Approach Can represent relationships between levels, either: –abstractly with one relationship from top to bottom levels –in much more detail with all combinations of adjoints expressed.

Comparing one System with Another CCCSSMDT CCCS´SM´ DT´ POI P´P´ O ´I ´  , ,  are natural transformations (comparing functors) 

Godement Calculus Rules showing: –composition of functors and natural transformations is associative –natural transformations can be composed with each other For example: (I´O´)  = I´(O´  );  (OP) = (  O)P   = (  O) o (I´  );   =  P o (O´  )

Four Levels are Sufficient In category theory: –objects are identity arrows –categories are arrows from object to object –functors are arrows from category to category –natural transformations are arrows from functor to functor An arrow between natural transformations is a composition of natural transformations, not a new level

Analogous Levels for Interoperability

The Grid Use of distributed computer power Analogy with national power generation Processing performed where appropriate Requires local definitions to be used interoperably Early stage of development

Grid in Levels LevelIntension  Extension 1Grid Policy  Program Purpose  2Grid Organize  Program Standard  3Grid Implement  Program Spec  4Grid Operation  Program Execution

Databases on the Grid Nearly all data sources are not formal database files yet (Paul Watson, 2002) Data is in operating system files perhaps readable only by particular programs which understand the format So level achieved is really only 3, that is program specification (no organization or policy).

Semantic Web Levels: –data values to be marked up with tags –XML document -- data with XML tags –DTD -- document structure defining tags –RDF -- triples components are typically URIs, could also be literals, collections of URIs or another RDF –Semantic Web -- semantic interoperability between data sources using agents to explore RDFs and ontologies

Semantic Web as Formal Levels

Semantic Web is rather Flat Only handles two bottom levels of architecture Aims to achieve comparison of mappings (  ) through: –an agent exploring RDFs –each RDF relates one URI to another URI in the context of a predicate –ontologies are used to explain and synchronise meanings of terms An AI approach rather than a data-driven one

Discussion 1 Category theory shows that: –four levels are ideal for interoperability –more than four yields no benefits –less than four gives only local interoperability Categorical approach provides: –an architecture for universal interoperability –a calculus (Godement) for composing mappings at any level –adjunctions for evaluating two-way mappings

Discussion 2 In industry: –Approach seems to be getting flatter –In MOF (Meta-Object Facility) which is a four-level system (top-level ‘hard-wired’), drive to bypass top levels –IRDS (Information Resource Dictionary System) – four levels but one-way mapping – not used as much as expected –ISO though are trying a four-level approach for mapping modelling languages

Discussion 3 Lack of adequate foundations may explain some of the reticence Also some suppliers perhaps think that market dominance is the answer, making interoperability irrelevant. But legacies remain! Categorical framework provides a coherent basis for taking this area forward. Even if the framework is translated into a more familiar notation.

References 1 MOF –Bezivin, J, From Object Composition to Model Transformation with the MDA, 3rd International Conference on Enterprise Information Systems (ICEIS), Setubal, invited paper (2001). Plus OMG web pages. IRDS –Gradwell, D J L, The Arrival of IRDS Standards, 8th BNCOD, York 1990, Pitman (1990). GRID –Watson, P, Databases and the Grid, Computing Science Technical Report no.755, University of Newcastle upon Tyne (2002), (16pp). SEMANTIC WEB: W3C web pages Our work (available from NR’s home page) –Heather, M A, & Rossiter, B N, The Anticipatory and Systemic Adjointness of E- Science Computation on the Grid, Computing Anticipatory Systems, Proceedings CASYS`01, Liège, Dubois, D M, (ed.), AIP Conference Proceedings (2002). –Rossiter, B N, Heather, M A, & Nelson, D A, A Universal Technique for Relating Heterogeneous Data Models, 3rd International Conference on Enterprise Information Systems (ICEIS), Setúbal, I (2001). –Heather, M A, & Rossiter, B N, Constructing Standards for Cross-Platform Operation, Software Quality Journal, 7(2) (1998).

References 2 ISO modelling: –Study Report on the Feasibility of Mapping Modelling Languages for Analysis and Design Models.ISO/IEC JTC1/SC7 N2112, 1999/04/19, (1999). Category Theory and Computing Science: –Barr, M, & Wells, C, Category Theory for Computing Science, Prentice-Hall (1990). –Mac Lane, S, Categories for the Working Mathematician, Springer, 2 nd ed (1998). Category Theory and Information Systems: some other workers –Zinovy Diskin (USA, formerly Latvia) –Boris Cadish (Latvia) –Robert Rosebrugh (Canada) –Michael Johnson (Australia) –Christopher Dampney (Australia) –Michael Heather (Northumbria) –David Nelson (Sunderland) –Arthur ter Hofstede (Australia, formerly Holland) Many other workers on category theory and program semantics