Shiyan Ou 1, Viktor Pekar 1, Constantin Orasan 1, Christian Spurk 2, Matteo Negri 3 1 Research Group in Computational Linguistics, University of Wolverhampton,

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Requirements. UC&R: Phase Compliance model –RIF must define a compliance model that will identify required/optional features Default.
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
SPARQL Dimitar Kazakov, with references to material by Noureddin Sadawi ARIN, 2014.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
A Linguistic Approach for Semantic Web Service Discovery International Symposium on Management Intelligent Systems 2012 (IS-MiS 2012) July 13, 2012 Jordy.
Ontology Notes are from:
Aki Hecht Seminar in Databases (236826) January 2009
ModelicaXML A Modelica XML representation with Applications Adrian Pop, Peter Fritzson Programming Environments Laboratory Linköping University.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
The Semantic Web Week 12 Term 1 Recap Lee McCluskey, room 2/07 Department of Computing And Mathematical Sciences Module Website:
Semantic Web Presented by: Edward Cheng Wayne Choi Tony Deng Peter Kuc-Pittet Anita Yong.
1 DCS861A-2007 Emerging IT II Rinaldo Di Giorgio Andres Nieto Chris Nwosisi Richard Washington March 17, 2007.
MUSCLE WP9 E-Team Integration of structural and semantic models for multimedia metadata management Aims: (Semi-)automatic MM metadata specification process.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
WP5.4 - Introduction  Knowledge Extraction from Complementary Sources  This activity is concerned with augmenting the semantic multimedia metadata basis.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.
OWL Capturing Semantic Information using a Standard Web Ontology Language Aditya Kalyanpur Jennifer Jay Banerjee James Hendler Presented By Rami Al-Ghanmi.
Using Text Mining and Natural Language Processing for Health Care Claims Processing Cihan ÜNAL
SWETO: Large-Scale Semantic Web Test-bed Ontology In Action Workshop (Banff Alberta, Canada June 21 st 2004) Boanerges Aleman-MezaBoanerges Aleman-Meza,
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Outline Grammar-based speech recognition Statistical language model-based recognition Speech Synthesis Dialog Management Natural Language Processing ©
Populating A Knowledge Base From Text Clay Fink, Tim Finin, Christine Piatko and Jim Mayfield.
Ontologies and Lexical Semantic Networks, Their Editing and Browsing Pavel Smrž and Martin Povolný Faculty of Informatics,
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Dimitrios Skoutas Alkis Simitsis
RDF and triplestores CMSC 461 Michael Wilson. Reasoning  Relational databases allow us to reason about data that is organized in a specific way  Data.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
Page 1 Alliver™ Page 2 Scenario Users Contents Properties Contexts Tags Users Context Listener Set of contents Service Reasoner GPS Navigator.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
Technology – Broad View Aspects that play a role when integrating archives leave the details of some core topics to the 2. day Bernhard Neumair:Base Technologies.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
SemantEco Annotator for Linked Data Generation and Generalized Semantic Mapping Session: Technologies, Reasoning, and Annotation Methods of the Semantics.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
© Copyright 2013 STI INNSBRUCK “How to put an annotation in HTML?” Ioannis Stavrakantonakis.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
GEMET GEneral Multilingual Environmental Thesaurus leading the way to federated terminologies Stefan Jensen, Head of information services group with input.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Shridhar Bhalerao CMSC 601 Finding Implicit Relations in the Semantic Web.
Application Ontology Manager for Hydra IST Ján Hreňo Martin Sarnovský Peter Kostelník TU Košice.
Toward an Open Source Textual Entailment Platform (Excitement Project) Bernardo Magnini (on behalf of the Excitement consortium) 1 STS workshop, NYC March.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Mining knowledge from natural language texts using fuzzy associated concept mapping Presenter : Wu,
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
CMPE58H Project Progress Presentation QAPoint H.Tuğçe Özkaptan Gözde Kaymaz Serkan Kırbaş
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
 XML derives its strength from a variety of supporting technologies.  Structure and data types: When using XML to exchange data among clients, partners,
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Of 24 lecture 11: ontology – mediation, merging & aligning.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Module: Software Engineering of Web Applications
Semantic Database Builder
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
The Re3gistry software and the INSPIRE Registry
Zachary Cleaver Semantic Web.
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
DBpedia 2014 Liang Zheng 9.22.
CSE 635 Multimedia Information Retrieval
How to publish in a format that enhances literature-based discovery?
Module: Software Engineering of Web Applications
Presentation transcript:

Shiyan Ou 1, Viktor Pekar 1, Constantin Orasan 1, Christian Spurk 2, Matteo Negri 3 1 Research Group in Computational Linguistics, University of Wolverhampton, UK 2 German Research Centre for Artificial Intelligence GmbH (DFKI), Germany 3 Fondazione Bruno Kessler – FBK, Italy Development and Alignment of a Domain- Specific Ontology for Question Answering

2 Structure 1.Introduction to QALL-ME 2.The QALL-ME ontology 3.Alignment to WordNet and SUMO 4.How the ontology is used for data encoding 5.Conclusions

3 Introduction to QALL-ME  QALL-ME (Question Answering Learning technologies Multilingual Multimodal Environment) is an EU-funded project which aims to establish a shared infrastructure for multilingual and multimodal question answering in the domain tourism.  Project’s website:  In the QALL-ME system  users pose natural language questions in several languages (both in textual and speech modality) using a variety of input devices (e.g. mobile phones), and  returns a list of specific answers formatted in the most appropriate modality, ranging from small texts, maps, videos, and pictures.  A domain-specific ontology for the tourism domain was developed and shared among all the partners.

4 The ontology in the project QALL-ME ontology WP4 Annotation of entities Indexing of data Retrieval of data WP3: Multilingual question interpretation WP5: Multilingual answer extraction WP9: Evaluation See more in O39: Multilingual Resources (Ambasadeurs) at 13:05

5 Design of the ontology  Analysis of data from content providers  Analysis of users’ requirements  Inspired by similar ontologies such as Harmonise, eTourism, Hi-Touch, TAGA, GETESS:  Harmonise and eTourism: focus on static information (e.g. accommodation and events/activities), rather than dynamic information related to travel business (e.g. customers and itineraries) as the TAGA and Hi-Touch ontologies do.  Similar to eTourism as is written in OWL rather RDFs  but wider coverage than each individual existing ontology  Introspection

6 Technical details of the ontology  Encoded using OWL DL, since it has more expressive power than OWL Lite and has more efficient reasoning support than OWL Full  Used Protégé-OWL as the editor and RacerPro7 as the reasoner  The ontology contains  122 classes (concepts),  55 datatype properties and  52 object properties which indicate the relationships among the 122 classes.  15 top-level classes.  The class hierarchy has a maximum depth of 4.

7 Part of the ontology (cinema/movies)

8 Ontology alignment  The QALL-ME ontology was designed as a model of the narrow knowledge domain of tourism.  The QALL-ME ontology was complemented with information from WordNet (and implicitly MultiWordNet) and SUMO via alignment  The QALL-ME ontology is being changed so fully manual alignment was not a solution  Fully automatic alignment is not precise enough, but maybe semi-automatic alignment is a solution

9 Ontology alignment (II)  The alignment relied on: 1.String similarity of element identifiers (e.g. chalet  chalet_1, SiteFacilityForChildren  facility_*) 2.Structural similarity for disambiguation (e.g. uses the semantic distance to aligned concepts) 3.Definition similarity for disambiguation (similarity between comments in the ontology and WordNet glosses is used) 4.Structural similarity for unmatched concepts is calculated to all the nouns in WordNet

10 Ontology alignment (III)  The overall accuracy of the fully automatic alignment is clearly suboptimal (precision of 49% and recall of 31%),  Error analysis:  We noticed that for concept names with unambiguous matches in WordNet the algorithm performs without any errors  The poor disambiguation performance is due to the very different depths of the two ontologies  Only a few concepts have comments which are useful for definition similarity  Semi-automatic alignment requires under 30 minutes to obtain “perfect” alignment

11 Example of alignment QALL-ME SUMO WN2.1 WN2.1 gloss = living quarters provided for public convenience; "overnight accommodations are available" = a Swiss house with a sloping roof and wide eaves or a house built in this style = an independent agency of the federal government responsible for mail delivery

12 Semantic annotation and database organization  The ontology was used to encode the data  Annotated data from the content providers was converted to RDF triplets  The RDF documents can be stored in databases or plain text files  The Jena RDF API was used for the operations

13 Semantic annotation and database organization XML Schema XML Documents RDF Documents Define Determine Transform QALL-ME Ontology HTML Parser Download World Wide Web Convert Database Convert

14 Content retrieval  For retrieval SPARQL is used  SPARQL is a query language for accessing RDF graphs by the W3C RDF Data Access Working Group  SPARQL provides interoperability between languages

15 What movie starring Halle Berry is on in Birmingham? Class: MovieShow  Property: isInSite, Range: Cinema  Property: hasPostalAddress, Range: PostalAddress  Property: isInDestination, Range: Destination Property: name, Range: string Property: hasEventContent, Range: Movie  Property: name, Range: string Property: hasStar, Range: Star  Property:name, Range: string

16 PREFIX qme: PREFIX xsd: SELECT ?movieName WHERE { ?MovieShow qme:isInSite ?Cinema. ?Cinema qme:hasPostalAddress ?PostalAddress. ?PostalAddress qme:isInDestination ?Destination. ?Destination qme:name “Birmingham”^^ ?MovieShow qme:hasEventContent ?Movie. ?Movie qme:name ?movieName. ?Movie qme:hasStar ?Star. ?Star qme:name “Halle Berry”^^ }

17 Conclusions  The QALL-ME ontology was specifically designed for the domain of tourism  The ontology is playing an important role in several parts of the project  The current ontology went through several revisions before reaching the current stage (and it may change again!!!)  Both the ontology and its alignment to WordNet and SUMO will be made freely available on the project’s website

18 Thank you ! Project’s website: