21.09.06 Krzysztof Janowicz Towards a Similarity-Based Identity Assumption Service for Historical Places Establishing Meaningful Links Krzysztof Janowicz;

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
1 ICS-FORTH & Univ. of Crete SeLene November 15, 2002 A View Definition Language for the Semantic Web Maganaraki Aimilia.
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
CRMarchaeo CRMarchaeo v1.2.1
CRMarchaeo Modelling Context, Stratigraphic Unit, Excavated Matter
1 ICS –FORTH, Oct.30-Nov.4,2006, Cyprus Documenting Events in Metadata Martin Doerr, Athina Kritsotaki Center for Cultural Informatics Institute of Computer.
1 CIDOC CRM + FRBR ER = FRBR OO … an equation for a harmonised view of museum information and bibliographic information Martin Doerr First CASPAR Seminar.
1 A Description Logic with Concrete Domains CS848 presentation Presenter: Yongjuan Zou.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
ICS-FORTH Which Period Is It? A Methodology To Create Thesauri Of Historical Periods Martin Doerr, Athina Kritsotaki, Stephen Stead.
Krzysztof Janowicz SIM-DL -Towards a Semantic Similarity Measurement Theory for the Description Logic ALCNR in Geographic Information Retrieval.
Deriving Semantic Description Using Conceptual Schemas Embedded into a Geographic Context Centre for Computing Research, IPN Geoprocessing Laboratory Miguel.
Where are the Semantics in the Semantic Web? Michael Ushold The Boeing Company.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Description Logics. Outline Knowledge Representation Knowledge Representation Ontology Language Ontology Language Description Logics Description Logics.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
Martin Doerr, Gerald Hiebel, Institute of Computer Science
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
ICS-FORTH May 25, The Utility of XML Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Heraklion, May.
ICS – FORTH, August 31, 2000 Why do we need an “Object Oriented Model” ? Martin Doerr Atlanta, August 31, 2000 Foundation for Research and Technology -
ICS-FORTH October 14, The CIDOC CRM, factor for the integration and presentation of cultural information Martin Doerr Foundation for Research and.
1. Motivation Knowledge in the Semantic Web must be shared and modularly organised. The semantics of the modular ERDF framework has been defined model.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Georgios Christodoulou, Euripides G.M. Petrakis, and Sotirios Batsakis Department of Electronic and Computer Engineering, Technical University of Crete.
Harmonising without Harm: towards an object-oriented formulation of FRBR aligned on the CIDOC CRM ontology Maja Žumer (University of Ljubljana) & Patrick.
National Survey and Cadastre – Denmark Conceptual Modeling of Geographic Databases - Emphasis on Relationships among Geographic Databases Anders Friis-Christensen.
An approach to Intelligent Information Fusion in Sensor Saturated Urban Environments Charalampos Doulaverakis Centre for Research and Technology Hellas.
Standardization and Research Prof. Dr. Christine Giger Swiss Federal Institute of Technology Zurich © Atlas der Schweiz - interaktiv.
Applying Belief Change to Ontology Evolution PhD Student Computer Science Department University of Crete Giorgos Flouris Research Assistant.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
The Semantic Web William M Baker
Ontologies for the Integration of Geospatial Data Michael Lutz Workshop: Semantics and Ontologies for GI Services, 2006 Paper: Lutz et al., Overcoming.
MPEG-7 Interoperability Use Case. Motivation MPEG-7: set of standardized tools for describing multimedia content at different abstraction levels Implemented.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Dimitrios Skoutas Alkis Simitsis
Smithsonian, March 26, International Symposium “Sharing the Knowledge” Martin Doerr Smithsonian, Washington DC March 26, 2003 FORTH, Greece Chair,
The CIDOC Conceptual Reference Model A core-ontology for information integration Karl H. Lampe, Zoologisches Forschungsmuseum Alexander Koenig (ZFMK) Bonn/Germany.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
WP3: Provenance and Access Policies Giorgos Flouris (FORTH) - Irini Fundulaki (CWI & FORTH) -
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
ICS-FORTH Thesauri of Historical Periods A Proposal for Standardization Martin Doerr, Athina Kritsotaki Heraklion, Crete, June
Dictionary based interchanges for iSURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains David Webber.
Florian A. Twaroch Institute for Geoinformation and Cartography, TU Vienna Naive Semantic Interoperability Florian A. Twaroch.
Temporal Primitives Institute of Computer Science Foundation for Research and Technology - Hellas Manos Papadakis & Martin Doerr Workshop: Extending, Mapping.
The CEN Metalex Naming Convention Fabio Vitali University of Bologna.
Background-assumptions in knowledge representation systems Center for Cultural Informatics, Institute of Computer Science Foundation for Research and Technology.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Designing and Using an Audio-Visual Description Core Ontology Friday 8 th of October, 2004 Antoine Isaac & Raphaël Troncy.
WP3: Data Provenance and Access Control Irini Fundulaki, FORTH December 11-12, 2012, Luxembourg.
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN Provenance Interoperability and Reasoning Yannis Tzitzikas Assistant.
The Semantic Web By: Maulik Parikh.
From FRBR to FRBROO through CIDOC CRM…
Jie Bao, Doina Caragea and Vasant G Honavar
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Ontology-Based Approaches to Data Integration
Information Networks: State of the Art
Representations & Reasoning Systems (RRS) (2.2)
Presentation transcript:

Krzysztof Janowicz Towards a Similarity-Based Identity Assumption Service for Historical Places Establishing Meaningful Links Krzysztof Janowicz; Muenster Semantic Interoperability Lab (MUSIL)

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 2 Outline Motivation Scenario Annotation Theory Further Work Image from: (Bleiglass, 1998)

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 3 Motivation For the cultural heritage community Incomplete and vague knowledge Interchange between external sources is necessary to answer complex scientific questions & to clean up local knowledge Local versus global identifiers  Accessible service-based infrastructure!

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 4 Motivation For semantic similarity research Application of similarity in a real world domain Similarity as part of the identity assumption puzzle Combination of similarity and classical reasoning Using a stable upper-level ontology (CIDOC CRM)  Theory of similarity assumptions for historical places

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 5 Motivation For an identity assumption service To run queries against multiple sources it has to be made sure that they refer to the same real-world phenomena; just a common language is not enough! Non unique place names (even within the same area) Place names refer to cities, rivers, valleys, mountains,… Misinterpreted place names (e.g. 'Al Wahat‘  Oasis) Names also refer to varying geopolitical units (e.g. nomads) or prominent (artificial) landmarks (e.g. telegraph stations) Out-dated place or even country names (e.g. UDSSR)  Gazetteers can only partially solve these problems (From discussions with Dr. Karl-Heinz Lampe; ZFMK)

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 6 Battle of Trafalgar - Scenario Took place at Cape Trafalgar (Province Cadiz) in 1805 British victory under the command of Horatio Nelson HMS Victory was Nelsons flagship Nelson was shot during the battle and died afterwards  Should be easy to annotate!? Spatial relation between naval battleground and terrestrial cape, Province Cadiz,..? Place names: Cabo Trafalgar, Taraf al-Gharb, رأس الطرف الأغر Also in a historical source from French perspective? Image from: (painted by Nicholas Pocock) Vice-Admiral Horatio Nelson, 1st Viscount Nelson? HMS Victory: Which one?! Temporal relations?

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 7 From:

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 8 Annotation of Historical Knowledge CIDOC conceptual reference model (CRM) as upper-level ontology for the cultural heritage domain specifies abstract and interrelated vocabulary instead of concrete definitions such as for kinds of exhibits  heterogeneous domain! describes historical knowledge by relations between places, events, actor and objects RDF(S) based representation ISO Standard (ISO/PRF 21127)

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 9 Annotation Examples (RDF-Triples) P89F.falls_within(E53.Place(Cape Trafalgar), E53.Place(Province Cádiz)) Subject-Predicate-Object: The place Cape Trafalgar falls within a place called Province Cádiz P8F.took_place_at(E7.Activity(Battle of Trafalgar), E53.Place(Cape Trafalgar)) P117F.occurs_during (E7.Activity(Battle of Trafalgar), E5.Event(Trafalgar Campaign)) P14F.carried_out_by (E7.Activity(Battle of Trafalgar), E21.Person(Nelson)) P2F.has_type (E53.Place(Andalusia), E55.Type(regions))

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 10 Theory In practice semi-automatic disambiguation via gazetteers and other global authorities (such as for historical figures) is often difficult, expensive and error-prone (especially for subordinate geopolitical units, events, actors,…) Use the links established via the CIDOC CRM annotation between places, actors, objects and events as additional reference points!

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 11 Theory Geoinformation = Semantic Reference Systems interpretation Spatiotemporal Reference Systems Use thematic information as support for spatiotemporal reference Mike Goodchild: Geographic Rreality CIDOC CRM + Reasoning + Similarity

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 12 Theory: Framework Comparing Place Descriptions 1.Extract new triples out of existing ones  Spatiotemporal & Subsumption Reasoning 2.Compute overlap between source and target triples  Semantic Similarity Measurement 3.Compare remaining labels & identifiers  Syntactic Identifier Matching 4.How probably compared places correspond  Identity Assumption

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 13 Theory: Reasoning Entities are described by sets of RDF triples Inference rules to generate new triples  Make local knowledge explicit!  More comparable information about entities Example: Spatial & temporal Inference rules Be careful - names are ambiguous! HMS XYZ (1804) HMS XYZ (1805) ?

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 14 Theory: Similarity Napoleonic Wars Nelson performed falls within died in Cape Trafalgar Province Cádiz falls within Source: Cape Trafalgar Province Cádiz overlaps with Target: sim p * sim s = Province Cádiz

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 15 Theory: Network Approach to Similarity 1.For all tuples from the source entity: find equal or similar tuples within the target entity description 2.Define meaningful notions of similarity for given predicates (relations) Spatial Temporal Thematic 3.Define meaningful notion of similarity for all objects that are not subjects of other triples themselves (e.g. ADL Feature Types)

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 16 Theory: Neighborhoods & Hierarchies Egenhofer & Al-Taha 1992 Different similarity measures for neighborhoods & hierarchies temporal spatial thematic

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 17 Theory: Syntactic Matching After recursively applying (semantic) similarity measurements, only labels, vague appellations and identifier are left  Requires syntactic matching / measuring (Getty Thesaurus) ID: ID: Cape Trafalgar Wrexham (found at: )

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 18 Two place descriptions probably refer to the same (real world) place if they are linked via equal or similar relations to equal or similar events, actors, objects, … Similar position within a network of historical facts Stepwise applying new restrictions to the set of compared historical places  Number of compared tuples is a critical issue! Theory: Identity Assumptions

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 19 Further Work & Evidence Similarity is only one part of the puzzle! Other parts: trust, contradictions & consistence,... Which inference rules may lead to difficulties? How to handle complementary knowledge? Connections to Time Map and ECAI Evidence! Battle of Trafalgar Scenario?  Develop a identity assumption pilot  Combination of similarity measurement with itineraries  Based on real world data from ZFMK, Bonn (biodiversity museum)

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 20 Questions Thank You! Special thanks to Martin Doerr Foundation for Research and Technology - Hellas (FORTH) Institute of Computer Science. Heraklion, Crete, Greece Karl-Heinz Lampe Zoologisches Forschungsmuseum Alexander Koenig (ZFMK). Bonn, Germany Any Questions?

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 21 ‘Real World’-Place? From:

Krzysztof Janowicz Similarity-Based Identity Assumption Service for Historical Places 22 Gazetteer Feature Types Andalucía ADLG Getty Thesaurus