New Ways of Mapping Knowledge Organization Systems Using a Semi-Automatic Matching Procedure for Building Up Vocabulary Crosswalks. Andreas Oskar Kempf.


New Ways of Mapping Knowledge Organization Systems Using a Semi-Automatic Matching Procedure for Building Up Vocabulary Crosswalks

Andreas Oskar Kempf, GESIS – Leibniz Institute for the Social Sciences
Benjamin Zapilko, GESIS – Leibniz Institute for the Social Sciences
Dominique Ritze, Mannheim University Library
Kai Eckert, Mannheim University

Content
- Vocabulary Crosswalks
  - Why are they needed?
  - What do they look like?
- Automatic Matching Initiatives and Procedures
  - Ontology Matching Approaches
- OAEI Library Track 2012
  - What kind of outcome and limitations regarding an automatic creation of vocabulary crosswalks do we have to expect?
- Optimizing the Manual Evaluation Process
- Conclusion and Outlook

ISKO UK Conference, 8th - 9th July 2013, London | Kempf et al. | Mapping KOS

Mapping KOS - Motivation

[Figure: Thesaurus 1 contains the term "Ontology Mapping", Thesaurus 2 the term "Ontology Alignment". A search for "Ontology Mapping" returns 0 results for publication x, whose subject is indexed as "ontology alignment". Once the two terms are mapped as equivalent (Ontology Mapping = Ontology Alignment), the same search finds the publication.]

1. Vocabulary Crosswalks (1/2)

Why are they needed?
- allow for integrated and high-quality search scenarios in distributed information collections indexed on the basis of different controlled vocabularies
- allow for interoperability among different knowledge organization systems
- allow for vocabulary expansion and provide possible routes into various domain-specific languages
- allow for query expansion and reformulation
- allow for the use of familiar vocabularies to maneuver between different information resources

1. Vocabulary Crosswalks (2/2)

What do they look like?
- consist of equivalence (=), hierarchy (<, >) and association (^) relations
- can map a term to several terms of the target vocabulary, or to a combination of terms of the target vocabulary
- are established bilaterally (A > B and B > A)

How are they created?
- get an overview of the topical overlap and the structure of the different vocabularies
- build up an understanding of the meaning and semantics of the terms and of the internal relations of the vocabularies
- start the mapping process (take all the internal relations and the synonyms/non-descriptors within the concepts into account)
- modify mappings already built up during the mapping process
- perform retrieval tests

Projects
- MACS (National Libraries CH, F, GB, GER), OCLC Mappings
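The relation types and the query-expansion use case named above can be illustrated with a minimal sketch. It is not taken from the presentation: the `Mapping` class, the `expand_query` function and the sample terms are hypothetical, and reading "<" as "broader" and ">" as "narrower" is an assumption.

```python
from dataclasses import dataclass

# Relation symbols; reading "<" as broader and ">" as narrower is an assumption.
EQUIVALENT, BROADER, NARROWER, RELATED = "=", "<", ">", "^"


@dataclass(frozen=True)
class Mapping:
    source: str     # term in the start vocabulary
    relation: str   # one of "=", "<", ">", "^"
    targets: tuple  # one or more target terms (a combination if more than one)


def expand_query(term, crosswalk, relations=(EQUIVALENT,)):
    """Return the query term plus all target terms linked by the given relation types."""
    expanded = [term]
    for mapping in crosswalk:
        if mapping.source.lower() == term.lower() and mapping.relation in relations:
            for target in mapping.targets:
                if target not in expanded:
                    expanded.append(target)
    return expanded


# Hypothetical crosswalk entries for illustration only.
crosswalk = [
    Mapping("Ontology Mapping", EQUIVALENT, ("Ontology Alignment",)),
    Mapping("Ontology Mapping", RELATED, ("Schema Matching",)),
]

print(expand_query("Ontology Mapping", crosswalk))
print(expand_query("Ontology Mapping", crosswalk, relations=(EQUIVALENT, RELATED)))
```

Restricting the expansion to equivalence relations by default keeps the expanded query precise; adding association relations widens recall at the cost of less exact matches.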

Ontology Matching

[Figure: two example ontologies to be matched. Labels in the first: Person, Author, PCMember, Document, Paper, Review. Labels in the second: People, Author, Reviewer, CommitteeMember, Doc, Paper. Relation labels: writes, reviews.]

Ontology Matching Evaluation

2. Terminology Mapping (2/2)

Ontology Alignment Evaluation Initiative (OAEI)
- Annual international campaign, started in 2004
- Different tracks/datasets
- Objectives:
  - Improving the performance of mapping tools in the field of ontology matching
  - Comparing the different algorithms
  - Detecting new challenges for matching systems

OAEI Library Track 2012

Data Sets
- Thesaurus for the Social Sciences (TheSoz): about […] concepts with about […] additional keywords/entry terms (EN, DE, FR)
- Thesaurus for Economics (STW): about […] concepts with about […] additional keywords/entry terms (EN, DE)

Reference Alignment (2006)
- TheSoz > STW; STW > TheSoz (≈7,000 intellectually created relations in each direction)

Thesaurus = Ontology?

SKOS                          | OWL
skos:Concept                  | owl:Class
skos:prefLabel, skos:altLabel | rdfs:label
skos:scopeNote, skos:notation | rdfs:comment
A skos:narrower B             | B rdfs:subClassOf A
A skos:broader B              | A rdfs:subClassOf B
skos:related                  | rdfs:seeAlso

Examples: Ananas -> Tropical Fruit; Metal Product -> Metal

Thesauri: polydimensional ontologies (for they are characterized by only a limited number of conceptual relation types).
Ontologies: multidimensional systems with a potentially infinite number of relation types.
See: Gietz 2001: 24f.
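The SKOS-to-OWL correspondence on this slide can be sketched as a small triple rewriter. This is an illustration under assumptions, not the conversion actually used for the track: triples are plain Python tuples, the `skos_to_owl` function and the `ex:` identifiers are hypothetical, the grouping of label and note properties is one reading of the slide, and the hierarchy follows the SKOS reading in which `A skos:broader B` states that B is the broader concept.

```python
# Property-level correspondences as read from the slide (an assumption where
# the flattened layout leaves the grouping open).
SKOS_TO_OWL_PROPERTY = {
    "skos:prefLabel": "rdfs:label",
    "skos:altLabel": "rdfs:label",
    "skos:scopeNote": "rdfs:comment",
    "skos:notation": "rdfs:comment",
    "skos:related": "rdfs:seeAlso",
}


def skos_to_owl(triples):
    """Rewrite (subject, predicate, object) SKOS triples into OWL-style triples."""
    converted = []
    for subject, predicate, obj in triples:
        if predicate == "rdf:type" and obj == "skos:Concept":
            converted.append((subject, "rdf:type", "owl:Class"))
        elif predicate == "skos:broader":
            # The subject has the object as its broader concept.
            converted.append((subject, "rdfs:subClassOf", obj))
        elif predicate == "skos:narrower":
            # The object is the narrower concept, so it becomes the subclass.
            converted.append((obj, "rdfs:subClassOf", subject))
        elif predicate in SKOS_TO_OWL_PROPERTY:
            converted.append((subject, SKOS_TO_OWL_PROPERTY[predicate], obj))
        else:
            converted.append((subject, predicate, obj))  # leave unknown triples as-is
    return converted


# Hypothetical identifiers built from the slide's two examples.
thesaurus = [
    ("ex:Ananas", "rdf:type", "skos:Concept"),
    ("ex:Ananas", "skos:broader", "ex:TropicalFruit"),
    ("ex:MetalProduct", "skos:broader", "ex:Metal"),
]

for triple in skos_to_owl(thesaurus):
    print(triple)
```

The second example shows why the slide title ends in a question mark: the rewrite yields `ex:MetalProduct rdfs:subClassOf ex:Metal`, although a metal product is not a kind of metal. A thesaurus hierarchy does not always carry subclass semantics.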

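Results

How should the results be evaluated? Is an F-measure of 0.67 good?

[Table: System | Precision | Recall | F-Measure | Time (s) | Size | 1:1. The numeric values are not preserved in this transcript. Systems in the order listed: GOMMA, ServOMapLt, LogMap, ServOMap (1:1: yes), YAM, LogMapLt, GO2A, Hertuda, WeSeE (1:1: yes), HotMatch (1:1: yes), CODI (1:1: yes), MapSSS (1:1: yes), AROMA, Optima.]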
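For orientation, the three quality measures in the table header can be computed from a produced alignment and a reference alignment as follows. This is a minimal sketch using the standard definitions; the `evaluate` function and the toy correspondences are hypothetical and are not the track's data or evaluation code.

```python
def evaluate(alignment, reference):
    """Return (precision, recall, F-measure) of an alignment against a reference.

    Both arguments are collections of (source_concept, target_concept) pairs.
    """
    alignment, reference = set(alignment), set(reference)
    correct = alignment & reference
    precision = len(correct) / len(alignment) if alignment else 0.0
    recall = len(correct) / len(reference) if reference else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Toy data: four reference correspondences, three proposed, two of them correct.
reference = {("a1", "b1"), ("a2", "b2"), ("a3", "b3"), ("a4", "b4")}
alignment = {("a1", "b1"), ("a2", "b2"), ("a5", "b9")}

precision, recall, f_measure = evaluate(alignment, reference)
print(f"precision={precision:.2f} recall={recall:.2f} f-measure={f_measure:.2f}")
```

Because the F-measure is the harmonic mean of precision and recall, a single value such as 0.67 can hide very different trade-offs between the two, which is one reason the slide asks how to read it.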

Manual Evaluation

System     | Equivalence relations (in total) | Correct equivalence relations | Non-correct equivalence relations
AROMA      | […] | […] (6.1%)  | 3,285
CODI       | […] | […] (25.8%) | 466
GO2A       | […] | […] (33.8%) | 418
GOMMA      | […] | […] (36.1%) | 436
Hertuda    | […] | […] (32.5%) | 556
HotMatch   | […] | […] (43.3%) | 254
LogMapLt   | […] | […] (43.3%) | 306
LogMap     | […] | […] (50.4%) | 200
MapSSS     | 175 | 64 (36.6%)  | 111
Optima     | 165 | 38 (23.0%)  | 127
ServOMapLt | […] | […] (48.0%) | 273
ServOMap   | […] | […] (53.8%) | 201
WeSeE      | […] | […] (33.0%) | 457
YAM        | […] | […] (40.5%) | 365

Optimizing the Evaluation Process

[Table: columns "All correspondences (including duplicates)" and "Unique correspondences"; rows "Total number" and "...of which are correct (11%)". The counts are not preserved in this transcript.]

Leading question: How can the intellectual matching process be best supported by ontology matching tools?

Approach: Reorganizing the alignments according to the largest agreement between the different matching tools.
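The reorganization step can be sketched as follows: merge the alignments of all tools, count how many tools proposed each unique correspondence, and present the correspondences with the largest agreement first. This is an illustration under assumptions, not the authors' implementation; the `rank_by_agreement` function and the toy correspondences are hypothetical, and only the tool names are taken from the slides.

```python
from collections import defaultdict


def rank_by_agreement(alignments):
    """Order unique correspondences by the number of matchers proposing them.

    alignments maps a matcher name to an iterable of (source, target) pairs.
    Returns a list of (pair, supporting_matchers), highest agreement first.
    """
    supporters = defaultdict(set)
    for matcher, pairs in alignments.items():
        for pair in set(pairs):  # ignore duplicates within one matcher
            supporters[pair].add(matcher)
    # Sort by descending agreement; ties are broken by the pair itself.
    return sorted(supporters.items(), key=lambda item: (-len(item[1]), item[0]))


# Toy alignments; the correspondences are invented for illustration.
alignments = {
    "LogMap": [("a1", "b1"), ("a2", "b2")],
    "ServOMap": [("a1", "b1"), ("a2", "b2"), ("a3", "b9")],
    "GOMMA": [("a1", "b1"), ("a4", "b4")],
}

for pair, matchers in rank_by_agreement(alignments):
    print(len(matchers), pair, sorted(matchers))
```

A human evaluator would then work through this list from the top, so that the correspondences most likely to be correct are validated first.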

Number of Accordances between the different Matching Tools

Percentage of Correct Correspondences

[Table: Number of corresponding matchers | Number of all correspondences | Number of all correct correspondences | Percentage of correct correspondences. The values are not preserved in this transcript.]

Comparison between Regular and Optimized Evaluation Scenario

[Table: No. of corresponding matchers | No. of all correspondences | % of all correspondences (22,592 = 100%) | optimized scenario: No. of correct correspondences, % of all correct correspondences (2,484 = 100%) | normal evaluation: No. of correct correspondences (estimated), % of all correct correspondences (2,484 = 100%). Most cell values are not preserved in this transcript; the surviving fragments are 80.32 %, 1.24 %, 4.13 %, 6.37 %, 8.34 % and 100 %.]
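The idea behind such a comparison can be sketched with invented numbers. The sketch assumes that the optimized scenario evaluates correspondences in order of decreasing matcher agreement, and that the normal scenario evaluates them in arbitrary order, so that the expected number of correct correspondences found is proportional to the share evaluated. Both assumptions, the `compare_scenarios` function and the data are hypothetical and do not reproduce the figures of the slide.

```python
def compare_scenarios(levels):
    """Compare cumulative yield of agreement-ordered vs. arbitrary-order evaluation.

    levels: list of (n_matchers, n_correspondences, n_correct) per agreement level.
    Returns rows of (n_matchers, share_evaluated, share_correct_found,
    estimated_correct_in_arbitrary_order), cumulated from high to low agreement.
    """
    total = sum(n for _, n, _ in levels)
    total_correct = sum(c for _, _, c in levels)
    rows = []
    evaluated = found = 0
    for n_matchers, n, correct in sorted(levels, reverse=True):
        evaluated += n
        found += correct
        share_evaluated = evaluated / total
        rows.append((
            n_matchers,
            share_evaluated,
            found / total_correct,
            share_evaluated * total_correct,  # expectation under arbitrary order
        ))
    return rows


# Invented agreement levels: (matchers agreeing, correspondences, correct ones).
levels = [(3, 10, 9), (2, 30, 12), (1, 160, 4)]

for n_matchers, share, share_correct, estimated in compare_scenarios(levels):
    print(f">= {n_matchers} matchers: {share:.0%} evaluated, "
          f"{share_correct:.0%} of correct found, "
          f"{estimated:.2f} expected in arbitrary order")
```

In this toy case, evaluating the 20% of correspondences with the highest agreement already yields 84% of the correct ones, whereas an arbitrary order would be expected to yield only 5 of 25.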

Conclusion
- Significant differences between the different ontology matching tools
- Some tools show rather promising performance
- None of the evaluated matching tools alone could ensure high-quality standards for building up vocabulary crosswalks automatically
- Ontology matching tools can be used to optimize the intellectual evaluation process
- Reorganizing the validation process according to the number of agreements between the different matching tools could make the intellectual evaluation process more time-efficient
- Matching tools can be used as recommendation systems for manual evaluation

Thank you for your attention.

Contact
- Dr. Andreas Oskar Kempf, GESIS – Leibniz Institute for the Social Sciences
- Benjamin Zapilko, GESIS – Leibniz Institute for the Social Sciences
- Dominique Ritze, Mannheim University Library
- Kai Eckert, Mannheim University