Ontology Lexicalisation In collaboration with John McCrae, Philipp Cimiano (CITEC, Univ. of Bielefeld) Elena Montiel-Ponsado (Universidad Politecnica Madrid)

Slides:



Advertisements
Similar presentations
Oyster, Edinburgh, May 2006 AIFB OYSTER - Sharing and Re-using Ontologies in a Peer-to-Peer Community Raul Palma 2, Peter Haase 1 1) Institute AIFB, University.
Advertisements

ISWC Doctoral Symposium Monday, 7 November 2005
CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
Semantic Web Thanks to folks at LAIT lab Sources include :
ICT Monica Monachini – 1° KYOTO Workshop – Amsterdam 2/ KYOTO (ICT ) Yielding Ontologies for Transition-Based Organization Intelligent.
1 Publishing Linked Sensor Data Semantic Sensor Networks Workshop 2010 In conjunction with the 9th International Semantic Web Conference (ISWC 2010), 7-11.
UNCERTML - DESCRIBING AND COMMUNICATING UNCERTAINTY Matthew Williams
Toward Linguistically Grounded Ontologies by Paul Buitelaar, Philipp Cimiano, Peter Haase, and Michael Sintek (Ireland, Netherlands, Germany) presented.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Language Technology for the Semantic Web OntoWeb/AgentLink, Barcelona: February 4 th,2003 OntoWeb SIG5 Language Technology in.
Thesauri, Terminologies and the Semantic Web
Linked Data as an enabler of cross-media and multilingual content analytics for enterprises across Europe A.Gómez-Pérez (UPM) Project Coordinator.
Information and Business Work
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
OWL-AA: Enriching OWL with Instance Recognition Semantics for Automated Semantic Annotation 2006 Spring Research Conference Yihong Ding.
Low-cost semantics-enhanced web browsing with Magpie Enrico Motta Knowledge Media Institute The Open University, UK.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Enterprise Linked Data Seán O’Riain Domain of eBusiness Digital Enterprise Research Institute - National University of Ireland, Galway  Copyright 2010.
PSY 369: Psycholinguistics Some basic linguistic theory part3.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Conceptual modelling. Overview - what is the aim of the article? ”We build conceptual models in our heads to solve problems in our everyday life”… ”By.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved Digital Enterprise Research Institute Ontologies & Natural Language.
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
Thesaurusmanagement Quickstart Introduction. What are controlled vocabularies? organized arrangement of words and phrases used to index content and/or.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Ontology Development in the Sciences Some Fundamental Considerations Ontolytics LLC Topics:  Possible uses of ontologies  Ontologies vs. terminologies.
Knowledge representation
Clément Troprès - Damien Coppéré1 Semantic Web Based on: -The semantic web -Ontologies Come of Age.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
AthenaPlus: WP4 Eva Coudyzer Koninklijke Musea voor Kunst en Geschiedenis Europeana Overlegplatform, 7 juni 2013.
 Copyright 2007 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Report on DERI,
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
© Copyright 2008 STI INNSBRUCK NLP Interchange Format José M. García.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Semantic Web services Interoperability for Geospatial decision.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
Finding, Linking and Organizing Resources with Linked Data & Natural Language Processing Paul Buitelaar Unit for Natural Language Processing Digital Enterprise.
Ontology Summit2007 Survey Response Analysis Ken Baclawski Northeastern University.
Nicoletta Calzolari Berlin, October PWI ISO SC 4/WG 4 Lexicon-Ontology relations PWI Nicoletta Calzolari Exploratory meeting.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Evaluating Semantic Metadata without the Presence of a Gold Standard Yuangui Lei, Andriy Nikolov, Victoria Uren, Enrico Motta Knowledge Media Institute,
Proposed NWI KIF/CG --> Common Logic Standard A working group was recently formed from the KIF working group. John Sowa is the only CG representative so.
FDT Foil no 1 On Methodology from Domain to System Descriptions by Rolv Bræk NTNU Workshop on Philosophy and Applicablitiy of Formal Languages Geneve 15.
ISO-PWI Lexical ontology some loose remarks Thierry Declerck, DFKI GmbH.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Chapter 6 – Architectural Design Lecture 1 1Chapter 6 Architectural design.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
TMF - Terminological Markup Framework Laurent Romary Laboratoire LORIA (CNRS, INRIA, Universités de Nancy) ISO meeting London, 14 August 2000.
Towards Linguistically Grounded Ontologies Paul Buitelaar, Philipp Cimiano, Peter Haase, and Michael Sintek Proceedings of the 6 th European Semantic Web.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Linked Data Profiling Andrejs Abele National University of Ireland, Galway Supervisor: Paul Buitelaar.
WSMO in Knowledge Web 2nd SDK cluster f2f meeting Rubén Lara Digital Enterprise.
UNEP Terminology Workshop - Geneva, April 15, Environmental Terminology & Thesaurus Workshop UN Environment Programme Regional Office of Europe.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Linked Open Data Dataset from Related Documents Petya Osenova and Kiril Simov IICT-BAS LDL-2016, LREC, Portoroz.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Mapping the NCI Thesaurus and the Collaborative Inter-Lingual Index Amanda Hicks University of Florida HealthInsight Workshop, Oslo, Norway.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
ece 627 intelligent web: ontology and beyond
Web Service Modeling Ontology (WSMO)
Linguistic Linked Open Data
Unit for Natural Language Processing
Presentation transcript:

Ontology Lexicalisation In collaboration with John McCrae, Philipp Cimiano (CITEC, Univ. of Bielefeld) Elena Montiel-Ponsado (Universidad Politecnica Madrid) and other Monnet partners  Copyright 2010 Digital Enterprise Research Institute. All rights reserved, Paul Buitelaar Paul Buitelaar Unit for Natural Language Processing Digital Enterprise Research Institute - National University of Ireland, Galway

What is this talk about? Ontology Lexicalisation  Integrating ontologies (knowledge representation about objects) and lexicons (knowledge representation about words that refer to objects)  Enriching ontologies with a lexical layer Defining an Ontology for Lexicons  Defining a formal model (ontology) for representing lexical information relative to independently defined ontological semantics of concepts denoted by this lexicon  Formal model for web-based, modular, distributed lexicons

Use Cases of Ontology Lexicalisation Ontology-based Information Extraction from text Ontology Learning from text Lexical methods in Ontology Alignment Ontology Verbalisation Ontology Localisation …

Ontology-based Information Extraction >> ontology-text mismatch – is this a good match? (no) Ontology: Recurso-comercial Text: recurso por las licencias comerciales

Cross-lingual Ontology-based IE >> cross-lingual meaning mismatch Ontology (es): Recurso-comercial Text (en): Commercial Appeal (of Communism …)

SKOS - Multilingual Information

Not much uptake yet? from

Ontology-Text Mismatch ‘Edificio-historico’ vs. ‘…edificio, declarado Monumento Histórico…’ >> goes beyond SKOS (monolingual & multilingual term variants) >> requires representation of lexical information to compute linguistic variants, e.g. ‘edificio historico[apposVP[NP[Adj]]]’

A Lexicon Model for Ontologies Requirements for ‘ontology-lexicon’ model  Represent linguistic information relative to ontology – Avoid unnecessary ambiguities by representing only lexical features relevant to semantics of underlying application  Keep semantics separate from linguistic info – Separate clearly ‘world’ (properties of objects referred to by words) from ‘word’ (properties of words) knowledge  Modular, minimal design – Provide simple core model that can be easily extended upon need

Was there a solution already? - SKOS Simple Knowledge Organization System – SKOS  General model for formalizing thesauri, terminologies and related semantic and knowledge resources  Formalization of terminology in focus - terminology, classification, Semantic Web communities  Does not address linguistic aspects of terminology, or therefore, the lexicon-ontology interface 

Was there a solution already? - GOLD General Ontology for Linguistic Description – GOLD  Community-based ontology of linguistics  Linguistic study in focus - linguistics community  Formal model of linguistics as an ontology, but not about connecting lexical features to ontological semantics  Other issues: very big, modularity? 

Was there a solution already? - OWN OntoWordNet – OWN  Formal specification of WordNet through extension and axiomatization of its conceptual relations  Formal knowledge representation in focus - logic, knowledge representation, Semantic Web communities  Turns WordNet into an ontology but not about connecting lexical features to ontological semantics 

Was there a solution already? - LMF Lexical Markup Framework – LMF  General model for formalizing and sharing of machine- readable dictionaries  Lexical knowledge representation in focus - lexicography, NLP communities  Very close to ontology-lexicon requirements, but no view on how lexical features link to ontological semantics – semantics is limited to a notion of sense based on synsets  Other issues: incomplete formal model, focus on classes, less on properties/relations 

lemon lexicon model for ontologies: ‘lemon’  General model for formalizing lexical features relative to independently defined ontological semantics Two-level modelling  Abstract level (meta-model): lemon  Instantiation level (lexicon model): e.g. ‘LexInfo2’ 

lemon: Overview

LexicalEntry can be a Word, Phrase, or Part - such as an Affix lemon: Lexicon

lemon: Form LexicalForm can be, e.g., lemma (canonicalForm), plural form (otherForm), stem (abstractForm)

lemon: Structure LexicalEntry can be decomposed into one or more Components and compositional structure can be represented

lemon: Structure - Example

lemon: Meaning & Reference LexicalSense is an underspecified sense that points to a language- external reference, a unique ontological semantic object, depending on conditions and context LexicalSense can have a subsense and senseRelation with other LexicalSense sememe relation between LexicalSense and ontological semantic object can be either of pref/alt/hiddenSem

lemon: Meaning & Reference - Examples

lemon: Lexical Projection LexicalEntry can introduce a syntactic frame with arguments that are mapped to LexicalSense and indirectly to ontological semantic objects/properties

lemon: Lexical Projection - Example

lemon in Use Ontology-Lexicon Generator  Generate a lexicon for a given ontology in RDF/OWL format  rdfs:. … lemon:phraseRoot [ lemon:edge [ lemon:edge [ lemon:edge [ lemon:leaf _:n6 ] ; lemon:constituent pennbank:NNP ] ; lemon:constituent pennbank:NP ], [ lemon:edge [ lemon:edge [ lemon:leaf _:n88 ] ; lemon:constituent pennbank:VBD ], [ lemon:edge [ lemon:edge [ lemon:leaf _:n69 ] ; lemon:constituent pennbank:NN ] ; lemon:constituent pennbank:NP ] ; lemon:constituent pennbank:VP ] ; lemon:constituent pennbank:S ] ; lemon:decomposition ( _:n6 _:n88 _:n69 ) ; lemon:sense [ lemon:reference financeV4:AssetBackedDebt ] ; lemon:canonicalForm [ lemon:writtenRep "Asset backed ]. … lexinfo:partOfSpeech lexinfo:verb ; lemon:canonicalForm [ lexinfo:tense lexinfo:past ; lexinfo:verbFormMood lexinfo:indicative ; lemon:writtenRep ; lexinfo:aspect lexinfo:perfective ]. _:n88 rdf:type lemon:Component ; lexinfo:tense lexinfo:past ; lemon:element ; lexinfo:verbFormMood lexinfo:indicative ; lexinfo:aspect lexinfo:perfective.

Lexical Linked Data lemon is a web-based ontology, i.e., based on Uniform Resource Identifiers (URI)  Therefore all objects described by it are uniquely identifiable on the web  And can therefore be interlinked in a flexible, modular and distributed way  Making lemon-based lexicons part of the Web of Data, as currently defined by the ‘Linked Open Data cloud’

Lexical Linked Data – LOD cloud

Lexical Linked Data - Implications lemon objects (lexicons, lexical entries, words, phrases, forms, variants, senses, references, etc.) can be maintained uniquely (only one URI for each lemon object) but in a distributed fashion (maintenance by various parties) lemon objects can be interlinked upon need, creating layers of lexical structure defined formally by selected links with growing legacy of collaborative, formal definition of lexical structure (through use in applications), meta-level analysis of lemon objects will become object of study for lexicography and linguistics ontology development can build on and plug-in formal lexical structures in specific application domains collaborative web-based ontological knowledge development and lexicon development will go hand-in-hand …

What happens next? lemon  W3C Incubator Group planned  Experimentation, Dissemination  YOUR input/feedback Lexical Linked Data  Develop infrastructures to support/exploit this  Envision drastically novel applications in linguistic study and product development

Acknowledgements & Further Info Monnet colleagues  In particular John McCrae of CITEC, University of Bielefeld, Germany who leads the lemon effort in Monnet Grant support  EU FP7 Grant No for the Monnet project on Multilingual Ontologies for Networked Knowledge  Science Foundation Ireland Grant No. SFI/08/CE/I1380 for Lion Further info  lemon:  &  Monnet Community – contact me: