First Insights into the Library Track of the OAEI Dominique Ritze Mannheim University Library.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Language Technologies Reality and Promise in AKT Yorick Wilks and Fabio Ciravegna Department of Computer Science, University of Sheffield.
OAEI 2007: Library Track Results Antoine Isaac, Lourens van der Meij, Shenghui Wang, Henk Matthezing Claus Zinn, Stefan Schlobach, Frank van Harmelen Ontology.
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Leveraging Data and Structure in Ontology Integration Octavian Udrea 1 Lise Getoor 1 Renée J. Miller 2 1 University of Maryland College Park 2 University.
Project 2 Ontology alignment. SIGNAL-ONTOLOGY (SigO) Immune Response i- Allergic Response i- Antigen Processing and Presentation i- B Cell Activation.
Terrier Workshop: 24 th October 2007 Alasdair J G Gray.
A New Suffix Tree Similarity Measure for Document Clustering Hung Chim, Xiaotie Deng City University of Hong Kong WWW 2007 Session: Similarity Search April.
Standards for networked knowledge organisation systems Ron Davies European Library Automation Group Bucharest, April 2006.
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Aligning Thesauri for an integrated Access to Cultural Heritage Collections Antoine ISAAC (including slides by Frank van Harmelen) STITCH Project UDC Conference.
The Value of Usage Scenarios for Thesaurus Alignment in Cultural Heritage Context Antoine Isaac, Claus Zinn, Henk Matthezing, Lourens van der Meij, Stefan.
Automated Changes of Problem Representation Eugene Fink LTI Retreat 2007.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
Multi-Concept Alignment and Evaluation Shenghui Wang, Antoine Isaac, Lourens van der Meij, Stefan Schlobach Ontology Matching Workshop Oct. 11 th, 2007.
Towards Semantic Web: An Attribute- Driven Algorithm to Identifying an Ontology Associated with a Given Web Page Dan Su Department of Computer Science.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Putting ontology alignment in context: Usage scenarios, deployment and evaluation in a library case Antoine Isaac Henk Matthezing Lourens van der Meij.
New Ways of Mapping Knowledge Organization Systems Using a Semi-Automatic Matching- Procedure for Building Up Vocabulary Crosswalks Andreas Oskar Kempf.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Speeding Up Batch Alignment of Large Ontologies Using MapReduce Uthayasanker Thayasivam and Prashant Doshi Dept. of Computer Science University of Georgia.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Interregional Network Summit. House of the Regions. Brussels, 11th October 2006 Juan D Olabarri Networks and Co-operation Manager SPRI / Basque Country.
Information Extraction with Linked Life Data 19/04/2011.
9/10/20151 SKOS. 9/10/20152 SKOS Describes thesauruses and taxonomies Properties: broader, narrower, subject, related Classes: Concept, Collection
Query Expansion.
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. Collaborative Building of Controlled Vocabularies Crosswalks Mateusz.
Consensus building workshop Conference track OAEI-2007 Ondřej Šváb The University of Economics The Department of Information and Knowledge Engineering.
1 Intra- and interdisciplinary cross- concordances for information retrieval Philipp Mayr GESIS – Leibniz Institute for the Social Sciences, Bonn, Germany.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.
12th of October, 2006KEG seminar1 Combining Ontology Mapping Methods Using Bayesian Networks Ontology Alignment Evaluation Initiative 'Conference'
DDI-RDF Discovery Vocabulary A Metadata Vocabulary for Documenting Research and Survey Data Linked Data on the Web (LDOW 2013) Thomas Bosch.
Classification Technology at LexisNexis SIGIR 2001 Workshop on Operational Text Classification Mark Wasson LexisNexis September.
A Snapshot of public Web Services Prof: Dr.Jainguo Lu Presenting Group: Aktar-uz-zaman Mohit Sud.
PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.
D4: SKOS and HIVE—Enhancing the Creation, Design and Flow of Information Speakers: Hollie White Jane Greenberg Coordinator: Alan Keely.
Which of the two appears simple to you? 1 2.
© Paul Buitelaar – November 2007, Busan, South-Korea Evaluating Ontology Search Towards Benchmarking in Ontology Search Paul Buitelaar, Thomas.
MD9.6 Release: Highlights Increased the character limit for all URL resources to 600 characters. Data_Center/Service_Provider Data_Set_Citation/Service_Citation.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
A Hierarchical Monothetic Document Clustering Algorithm for Summarization and Browsing Search Results Kummamuru et al. Presented by Bei Yu Sept. 22 nd,
IL Step 3: Using Bibliographic Databases Information Literacy 1.
Mark M Hall Information School / Computer Science Sheffield University Sheffield, UK EuropeanaTech 2011, Vienna, 4 th - 6 th October 2011 Aggregating Cultural.
The KOS interoperability in aquatic science field through mapping processes Carmen Reverté Reverté Aquatic Ecosystems Documentation Center. IRTA. (Sant.
Controlled Vocabulary & Thesaurus Design Resources & Future Directions.
Very Large Cross-lingual Resources at OAEI 2008 Laura Hollink Véronique Malaisé Vrije Universiteit Amsterdam.
Aligner automatiquement des ontologies avec Tuesday 23 rd of January, 2007 Rapha ë l Troncy.
Semantic Portal Business and Economics – Project Report NKOS Workshop September 19 th 2008 Aarhus, Denmark Project Report: Semantic Portal Business and.
AGROVOC Thesaurus. 1980s: developed as multilingual structured thesaurus for agricultural terminology (“rice”) : parallel effort to express thesaurus.
CLARIN Concept Registry: the new semantic registry Ineke Schuurman, Menzo Windhouwer, Oddrun Ohren, Daniel Zeman
1 Aligning the Parasite Experiment Ontology and the Ontology for Biomedical Investigations Using AgreementMaker Valerie Cross, Cosmin Stroe Xueheng Hu,
University of the Aegean AI – LAB ESWC 2008 From Conceptual to Instance Matching George A. Vouros AI Lab Department of Information and Communication Systems.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
“New Dimensions in KOS” CENDI/NKOS Workshop September 11, 2008 Washington, DC, USA An international conference to share and advance knowledge and experience.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
SKOS : A language to describe simple knowledge structures for the web
Benchmarking Matching Applications on the Semantic Web.
An Efficient Method for Computing Alignment Diagnoses
Automation of systematic reviews: the reviewer’s viewpoint
Cross-language Information Retrieval
Extracting Semantic Concept Relations
[jws13] Evaluation of instance matching tools: The experience of OAEI
Block Matching for Ontologies
Introduction to Information Retrieval
Actively Learning Ontology Matching via User Interaction
Presentation transcript:

First Insights into the Library Track of the OAEI Dominique Ritze Mannheim University Library

Motivation Publication x subject (thesaurus 2): ontology alignment Ontology Mapping Search 0 results Thesaurus 1Thesaurus 2 Ontology Mapping Ontology Alignment Ontology Mapping Search Publication x subject (thesaurus 1): ontology alignment =

Overview Ontology Matching OAEI Thesaurus vs. Ontology OAEI Library Track 2012 Lessons learned and Future Work

Ontology Matching Person Author PCMember Document Paper Review People Author Reviewer Doc Paper reviews writes reviews … CommitteeMember

Ontology Matching Evaluation Tool O1 R A O2 m Test Result

Ontology Alignment Evaluation Initiative (OAEI) Annual campaign started 2005 Different tracks/datasets Benchmark, Anatomy, Conference, Multifarm, Large BioMed, Library, Instance Matching 21 submitted systems (2012) Goal: Improving the performances of the ontology matching field Through comparison of algorithms New challenges for the systems

Thesaurus = Ontology? SKOSOWL skos:conceptowl:class skos:prefLabel skos:alternativeLabel rdfs:label skos:scopeNote skos:notation rdfs:comment A skos:narrower BA rdfs: subClassOf B A skos:broader BB rdfs:subClassOf A skos:relatedrdfs:seeAlso Commodities Germany Ananas Tropical Fruit Metal Product -> Metal

OAEI Library Track Are current state-of-the-art ontology matching tools able to match thesauri? Dominique Ritze, Kai Eckert, Benjamin Zapilko, Joachim Neubert

Data Set Thesaurus for economics (STW) concepts with additional keywords (EN, DE) Thesaurus for the Sociel Sciences (TheSoz) concepts with additional keywords (EN, DE, FR) Reference alignment manually created in 2006 Both actively used in libraries for keyword indexing

Execution 7GB Debian machine Timeframe 1 week 13 of the 21 submitted systems were able to generate an alignment No system had a heap space problem Evaluation: Precision, Recall, F-Measure, Runtime

Results How to evaluate the results? F-Measure of 0.67 good? SystemPrecisionRecallF-MeasureTime (s)Size1:1 GOMMA ServOMapLt LogMap ServOMap yes YAM LogMapLt G02A Hertuda WeSeE yes HotMatch yes CODI yes MapSSS yes AROMA Optima

Results SystemPrecisionRecallF-MeasureTime (s)Size1:1 MatcherPref MatcherDE MatcherAll GOMMA ServOMapLt LogMap … MatcherEN CODI yes MapSSS yes AROMA Optima

Manual Evaluation Between 38 and 269 new correct correspondences found per matcher Up to half of the correspondences correct Many new correspondences are quite simple Some more “complex” and interesting ones Automated production = CAM Several incorrect ones if the labels are quite similar Difficult to distinguish the names of countries, their inhabitants and the languages

Lessons Learned Transformation SKOS to OWL causes some problems, especially regarding the labels Ontology matching systems are nevertheless able to match the thesauri and even discover unknown correct correspondences Interest of the community in this topic

Future Work Update reference alignment adapted results SKOS import for matching systems Use instance data to match thesauri? Other thesauri?

Thank you for your attention!