Mapping a Data Structure to the CIDOC Conceptual Reference Model
Martin Doerr (ICS-FORTH, Crete, Greece)
Heraklion, Crete, April 2, 2002


What Does Mapping One Schema to Another Mean?
• Defining an (automated) transformation of each instance of schema 1 into an instance of schema 2 with the same meaning.
• CRM approach:
  - interpret schema 1 as a semantic model (nodes and links),
  - map each element of that model to an equivalent CIDOC CRM path,
  - such that each instance of an element of semantic model 1 can be converted into a valid instance of the CIDOC CRM with the same meaning.
• This is the simplest theory. It works for well-designed structures.

Interpreting a Schema as Semantic Model
1. Interpreting tables and columns as entities
2. Interpreting records as entity instances
3. Interpreting field names as relationships and entities
4. Interpreting field contents as entity instances
• Each field is interpreted as entity-relationship-entity (e-r-e).
• The whole schema is decomposed into e-r-e's.
• Each e-r-e is mapped individually to the CRM.
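The decomposition step can be sketched in a few lines of Python. This is only an illustration of the idea, not the tooling used by the author: the `ERE` type, the `decompose` function and the "has <field>" naming of relationships are assumptions made for this example.

```python
from typing import NamedTuple


class ERE(NamedTuple):
    """One entity-relationship-entity element (hypothetical helper type)."""
    subject: str       # entity instance the record stands for
    relationship: str  # relationship derived from the field name
    value: str         # entity instance derived from the field contents


def decompose(record_label: str, record: dict) -> list:
    """Turn every non-empty field of a flat record into one e-r-e."""
    return [
        ERE(record_label, f"has {field}", content)
        for field, content in record.items()
        if content  # empty fields carry no statement
    ]


# Two fields taken from the Science Museum example record.
record = {
    "Category": "NRM - Railway furniture",
    "Item name(s)": "armchairs",
}
eres = decompose("Object", record)
for ere in eres:
    print(ere)
```

Each resulting e-r-e can then be mapped to the CRM on its own, independently of the other fields of the record.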

Interpreting a Schema as Semantic Model, Example

ID:
Category:      NRM - Railway furniture
Description:   Armchair, Upholstered in blue moquette with curved, buttoned back & scroll arms. Wooden legs
Item name(s):  armchairs (AAT Hierarchy: Furnishings)

Part     Aspect               Term           (AAT Hierarchy)
overall  physical descriptor  upholstering   Processes & techniques
overall  material             moquette       Materials
overall  colour               blue           Color
legs     material             wood           Materials
back     physical descriptor  buttoning      Processes & techniques
back     shape                curved         Physical attributes
arms     shape                scrolled arms  Components

• The whole record corresponds to one entity: it stands for one object, which is not referred to explicitly.
• The field name stands for a relationship and the kind of contents.
• The field contents stand for an entity instance.
• Resulting statement: Object has ID: (ID value)

(data example from the Science Museum of London)

Mapping the First Element: Creating an Equivalent Proposition

Source schema interpretation:        Whole Record  --"has ID"-->  ID
  maps to
CRM schema:                          Man-Made Object  --is identified by-->  Object Identifier
Instance, valid for both schemata:   Object  -->  (ID value)

Possible mapping annotation:
  Whole Record = E22 Man-Made Object
  ID = E42 Object Identifier
  Whole Record -> ID = P47 is identified by

Possible CRM instance annotation:
  Object (E22: Man-Made_Object) is_identified_by (E42 Object_Identifier)
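A minimal Python sketch of applying such a mapping annotation follows. The dictionary layout, the function name `map_identifier` and the identifier value are assumptions for illustration; the actual ID of the example record is not given in the transcript, so a clearly labelled placeholder is used. Only the class and property names (E22, E42, P47) come from the slide.

```python
# Mapping annotation from the slide, held as plain data:
# source element -> CRM class or property.
MAPPING = {
    "domain": ("Whole Record", "E22 Man-Made Object"),
    "range": ("ID", "E42 Object Identifier"),
    "link": ("Whole Record -> ID", "P47 is identified by"),
}


def map_identifier(object_label: str, record: dict) -> tuple:
    """Build one CRM proposition (typed node, property, typed node)."""
    domain_class = MAPPING["domain"][1]
    range_field, range_class = MAPPING["range"]
    crm_property = MAPPING["link"][1]
    return (
        (object_label, domain_class),
        crm_property,
        (record[range_field], range_class),
    )


# Placeholder value: the real ID of the record is not in the source.
PLACEHOLDER_ID = "example-id-001"
proposition = map_identifier("Object", {"ID": PLACEHOLDER_ID})
print(proposition)
```

Keeping the annotation as data rather than code mirrors the slide: the same small interpreter could apply different annotations to different source schemata.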

Mapping the Interpreted Schema to the CRM
• Each entity-link-entity can be instantiated as a self-explanatory, context-independent proposition.
• The mapping makes it possible to create sets of propositions equivalent to the meaning of each source document, but expressed in terms of the CIDOC CRM.
• As the CRM-compatible propositions are self-explanatory, they can be merged into huge knowledge pools, and the document boundaries can be ignored.
• Buzzwords: data warehouses, Semantic Web.
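The merging of propositions into one pool can be illustrated with ordinary Python sets. This sketch assumes that the same real-world thing carries the same label in both documents, which in practice requires shared identifiers; the two "documents" and the identifier value are invented for the example.

```python
# Propositions derived from two hypothetical source documents.
doc_a = {
    ("Object", "P47 is identified by", "example-id-001"),
    ("Object", "P46 is composed of", "legs of obj"),
}
doc_b = {
    ("legs of obj", "P45 consists of", "wood"),
    # The same statement may arrive from a second source.
    ("Object", "P47 is identified by", "example-id-001"),
}

# Because each proposition is self-explanatory, merging is a set union:
# duplicates collapse and the document boundaries disappear.
pool = doc_a | doc_b


def about(node: str) -> list:
    """All propositions in the pool that mention the given node."""
    return sorted(t for t in pool if node in (t[0], t[2]))


for statement in about("legs of obj"):
    print(statement)
```

The query for "legs of obj" returns statements that originated in different documents, which is the point of ignoring document boundaries.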

Interpreting a Schema: Advanced Stuff: Value Dependency

ID:
Category:      NRM - Railway furniture
Description:   Armchair, Upholstered in blue moquette with curved, buttoned back & scroll arms. Wooden legs
Item name(s):  armchairs (AAT Hierarchy: Furnishings)

Part     Aspect               Term           (AAT Hierarchy)
overall  physical descriptor  upholstering   Processes & techniques
overall  material             moquette       Materials
overall  colour               blue           Color
legs     material             wood           Materials
back     physical descriptor  buttoning      Processes & techniques
back     shape                curved         Physical attributes
arms     shape                scrolled arms  Components

• The whole row corresponds to one entity: it stands for one part.
• The first field name stands for a relationship and the kind of contents.
• The field contents stand for an entity instance.
• Resulting statement: Object has part: legs of obj
• Mapping condition: if Part = overall, the row stands for the whole object.

Mapping under Condition: Creating an Equivalent Statement

Source schema interpretation:        Whole Record  --"has Part"-->  Row "Part"
  maps to
CRM schema:                          Man-Made Object  --is composed of-->  Man-Made Object
Instance, valid for both schemata:   Object  -->  legs of obj

Possible mapping annotation:
  Whole Record = E22 Man-Made Object
  Row "Part" = E22 Man-Made Object
  If (in Row "Part", Part /= "overall") then
    Whole Record -> Row "Part" = P46 is composed of

Possible CRM instance annotation:
  Object (E22: Man-Made_Object) is_composed_of legs of obj (E22: Man-Made_Object)
  If Part /= "overall"
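The condition on the field value can be sketched as a guard in Python. The function name `map_part`, the row dictionaries and the "<part> of <object>" labelling are assumptions for illustration; the condition itself and the CRM names are taken from the slide.

```python
# Three rows of the example record, as dictionaries.
ROWS = [
    {"Part": "overall", "Aspect": "material", "Term": "moquette"},
    {"Part": "legs", "Aspect": "material", "Term": "wood"},
    {"Part": "back", "Aspect": "shape", "Term": "curved"},
]


def map_part(object_label: str, row: dict):
    """Return a 'P46 is composed of' proposition, or None for 'overall'."""
    if row["Part"] == "overall":
        # The row describes the whole object, so no part is created.
        return None
    part_label = f"{row['Part']} of {object_label}"
    return (
        (object_label, "E22 Man-Made Object"),
        "P46 is composed of",
        (part_label, "E22 Man-Made Object"),
    )


parts = [p for row in ROWS if (p := map_part("obj", row)) is not None]
for p in parts:
    print(p)
```

Only the "legs" and "back" rows produce a part; the "overall" row is skipped, as the mapping condition requires.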

Interpreting a Schema: Advanced Stuff: Values as Properties

ID:
Category:      NRM - Railway furniture
Description:   Armchair, Upholstered in blue moquette with curved, buttoned back & scroll arms. Wooden legs
Item name(s):  armchairs (AAT Hierarchy: Furnishings)

Part     Aspect               Term           (AAT Hierarchy)
overall  physical descriptor  upholstering   Processes & techniques
overall  material             moquette       Materials
overall  colour               blue           Color
legs     material             wood           Materials
back     physical descriptor  buttoning      Processes & techniques
back     shape                curved         Physical attributes
arms     shape                scrolled arms  Components

• The contents of the field "Aspect" state a relationship.
• The field contents stand for an entity instance.
• Resulting statement: Object has material: moquette
• Value-based mapping: if Part = overall AND Aspect = material.

Interpreting a Schema: Advanced Stuff: Mapping to Paths

ID:
Category:      NRM - Railway furniture
Description:   Armchair, Upholstered in blue moquette with curved, buttoned back & scroll arms. Wooden legs
Item name(s):  armchairs (AAT Hierarchy: Furnishings)

Part     Aspect               Term           (AAT Hierarchy)
overall  physical descriptor  upholstering   Processes & techniques
overall  material             moquette       Materials
overall  colour               blue           Color
legs     material             wood           Materials
back     physical descriptor  buttoning      Processes & techniques
back     shape                curved         Physical attributes
arms     shape                scrolled arms  Components

• The contents of the field "Aspect" state a relationship.
• The field contents stand for an entity instance.
• Resulting statement: Object has physical descriptor: upholstering
• Value-based mapping: if Part = overall AND Aspect = physical descriptor.

Mapping to Paths: Introducing an Intermediate Node

Source schema interpretation:  Whole Record  --"has physical descriptor"-->  Term
  maps to (if Part = "overall" & Aspect = physical descriptor)
CRM schema:                    Man-Made Object  --was produced by-->  Production  --used general technique-->  Type

Instance of source:  Object  -->  upholstering
Instance of target:  Object  -->  Obj Production  --used general technique-->  upholstering

Possible mapping annotation:
  Whole Record = E22 Man-Made Object
  Term = E55 Type
  If Part = "overall" & Aspect = physical descriptor
    Whole Record -> Term = P108 was produced by - E12 Production - P32 used general technique

Possible CRM instance annotation:
  Object (E22: Man-Made_Object) was_produced_by Obj Production (E12: Production) used general technique upholstering (E55 Type)
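Expanding one source link into a two-step CRM path can be sketched as follows. The function name `map_technique` and the way the intermediate Production node is labelled ("<object> Production") are assumptions; the path P108 - E12 - P32 and the condition are taken from the slide.

```python
def map_technique(object_label: str, row: dict) -> list:
    """Expand one source link into a two-step CRM path.

    The source says 'Object has physical descriptor: <Term>'. The CRM has
    no direct link for this, so an intermediate E12 Production node is
    introduced between the object and the technique.
    """
    if row["Part"] != "overall" or row["Aspect"] != "physical descriptor":
        return []  # the mapping condition is not met
    production = (f"{object_label} Production", "E12 Production")
    return [
        ((object_label, "E22 Man-Made Object"),
         "P108 was produced by", production),
        (production, "P32 used general technique",
         (row["Term"], "E55 Type")),
    ]


row = {"Part": "overall", "Aspect": "physical descriptor",
       "Term": "upholstering"}
path = map_technique("Obj", row)
for step in path:
    print(step)
```

The intermediate node has no counterpart in the source record. It has to be generated by the mapping, and its label must be unique per object so that paths of different objects do not collide.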

Interpreting a Schema: Advanced Stuff: Nested Structures

ID:
Category:      NRM - Railway furniture
Description:   Armchair, Upholstered in blue moquette with curved, buttoned back & scroll arms. Wooden legs
Item name(s):  armchairs (AAT Hierarchy: Furnishings)

Part     Aspect               Term           (AAT Hierarchy)
overall  physical descriptor  upholstering   Processes & techniques
overall  material             moquette       Materials
overall  colour               blue           Color
legs     material             wood           Materials
back     physical descriptor  buttoning      Processes & techniques
back     shape                curved         Physical attributes
arms     shape                scrolled arms  Components

• The whole row corresponds to one entity: if Part /= overall, it stands for one part.
• The contents of the field "Aspect" state a relationship.
• The field contents stand for an entity instance.
• Resulting statement: legs of obj has material: wood
• Value-based mapping: if Aspect = material.

Mapping Nested Structures: Continuing on a Range Entity

Source schema interpretation:        Row "Part"  --"has material"-->  Term
  maps to
CRM schema:                          Man-Made Object  --consists of-->  Material
Instance, valid for both schemata:   legs of obj  -->  wood

Possible mapping annotation:
  Row "Part" = E22 Man-Made Object
  If Aspect = "material"
    Term = E57 Material
    Row "Part" -> Term = P45 consists of

Possible CRM instance annotation:
  Object (E22: Man-Made_Object) is_composed_of legs of obj (E22: Man-Made_Object) consists_of wood (E57 Material)
  If Part /= "overall" & Aspect = "material"
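The nested case chains the part mapping with the material mapping. The sketch below is one possible reading: the function name `map_material` is hypothetical, and the handling of "overall" rows (attaching the material directly to the whole object) follows from the earlier condition that an "overall" row stands for the whole, not from an explicit annotation on this slide.

```python
def map_material(object_label: str, row: dict) -> list:
    """Map a row with Aspect = material to 'P45 consists of'."""
    if row["Aspect"] != "material":
        return []
    whole = (object_label, "E22 Man-Made Object")
    material = (row["Term"], "E57 Material")
    if row["Part"] == "overall":
        # Assumption: the material is stated about the whole object.
        return [(whole, "P45 consists of", material)]
    # Nested case: continue on the range entity of the part mapping.
    part = (f"{row['Part']} of {object_label}", "E22 Man-Made Object")
    return [
        (whole, "P46 is composed of", part),
        (part, "P45 consists of", material),
    ]


nested = map_material("obj", {"Part": "legs", "Aspect": "material",
                              "Term": "wood"})
flat = map_material("obj", {"Part": "overall", "Aspect": "material",
                            "Term": "moquette"})
for statement in nested + flat:
    print(statement)
```

In the nested result, the range of the first proposition ("legs of obj") is the domain of the second, which is what "continuing on a range entity" refers to.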

Other Forms of Maps: Cases of Heterogeneity

Parallel to nested:
[Diagram: source schema interpretation with nodes A, B, C and links "a", "b"; CRM schema with nodes D, E, F and links c, d]

Parallel to intermediate-parallel (frequent with events!):
[Diagram: source schema interpretation with nodes A, B, C and links "a", "b"; CRM schema with nodes D, E, F, G and links c, d, e]

Other Mapping Forms: Cases of Heterogeneity

Compound contraction (frequent with addresses, species names etc.!):
[Diagram: source schema interpretation with node A linked to B, C and D by "a", "b" and "c"; CRM schema with nodes D and E and link d]

B, C and D are parts of an identifier for one real-life thing.
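Compound contraction can be sketched as joining several source fields into one identifier before a single CRM statement is made. Everything concrete here is invented for illustration: the field names, the address values, the separator and the function name `contract`. The node and link labels A and d are the abstract ones from the diagram.

```python
def contract(record: dict, fields: list, separator: str = ", ") -> str:
    """Join several source fields into one compound identifier.

    Fields that are missing or empty are skipped, so a partial address
    still yields a usable (shorter) identifier.
    """
    return separator.join(record[f] for f in fields if record.get(f))


# Hypothetical address record: three fields, one real-life thing.
address_record = {
    "street": "1 Example Street",
    "city": "Exampletown",
    "country": "Exampleland",
}
identifier = contract(address_record, ["street", "city", "country"])

# One target statement replaces three parallel source links.
statement = ("A", "d", identifier)
print(statement)
```

The order of the fields and the separator are part of the mapping definition: changing either produces a different identifier for the same thing.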

Mapping to the CRM: Conclusions
• Mapping to the CRM can serve simply as a guide to good-practice data structures.
• It can be used to create a Semantic Web of cultural knowledge.
• It can be used to preserve data in a neutral form.
• Even though mappings can become complicated, good data structures transform easily, and there are commercial tools.
• No tool can guess all of the expert's intentions in a data structure: domain experts must assist the mapping.