Download presentation
Presentation is loading. Please wait.
1
Introduction to the OntoLex-Lemon Model
John P. McCrae1, Thierry Declerck2 1Insight Centre for Data Analytics, National University of Ireland Galway 2Austrian Centre for Digital Humanities
2
RDF Turtle (Terse) Syntax
3
Simple (!) RDF Document < < “ ”^^< gativeInteger> . < < < . < < < .
4
Prefixes @prefix dbo: < . @prefix dbp: < . @prefix rdf: < . @prefix xsd: < . < < “ ”^^< . < < ns#type> < . < < ns#type> < . dbp:Paris dbo:populationTotal “ ”^^xsd:nonNegativeInteger . dbp:Paris rdf:type dbo:Location . dbp:Paris rdf:type < . @prefix pre: <long> . pre:name => <long+name>
5
Continuations dbp:Paris dbo:populationTotal “ ”^^xsd:nonNegativeInteger . dbp:Paris rdf:type dbo:Location . dbp:Paris rdf:type < . Replace . with ; to repeat subject and , to repeat subject and object dbp:Paris dbo:populationTotal “ ”^^xsd:nonNegativeInteger ; rdf:type dbo:Location , < . Or more nicely formatted: dbp:Paris dbo:populationTotal “ ”^^xsd:nonNegativeInteger ; rdf:type dbo:Location , < .
6
RDF Lists ex:node ?? rdf:nil “one” “two” rdf:rest rdf:rest rdf:first
7
Blank nodes Some nodes do not have a known URI, we call these blank nodes, they are denoted with [ ] or _:id. A typical use is for lists: ex:node rdf:first “one” ; rdf:rest _:n1 . _:n1 rdf:first “two” ; rdf:rest rdf:nil . ex:node rdf:first “one” ; rdf:rest [ rdf:first “two” ; rdf:rest rdf:nil ] . Actually Turtle supports an even more compact syntax for lists ( “one” “two” )
8
URLs http://www.example.com/path/to/file#identifer Domain Fragment
Protocol Path
9
Relative URLs URLs may be resolved relative to the Base URL (e.g., the URL used to find the document) < <// </path/to/file#identifer> <file#identifer> <#identifer>
10
Design of the Model
11
History LingInfo (Buitelaar, 2006) Monnet Lemon (2011)
Linguistic Information Repository (Montiel-Ponsoda, 2008) OntoLex Use Cases (2014) Lexicography Module (2019) LexInfo (2010) OntoLex Lemon Final Specification (2016) LexOnto (Cimiano, 2007) OntoLex CG Founded (2012)
12
General Requirements R1. OWL and RDF R2. Multilinguality
R3. Semantics by Reference R4. Openness R5. Reuse relevant standards
13
RDF and OWL RDF models are labelled directed graphs Representation
Each entry has a URI Reuse of lexicon data Reasoning
14
Multilinguality Support any language
Do not make language-specific assumptions Part-of-speech values Gender Translation and variation
15
Semantics by Reference
Meaning of a word given by reference Reference captures semantic information Disambiguation is performed relative to the ontology No (traditional) word senses
16
Openness Extensible with new models
No unnecessary choices of linguistic categories No payment or restrictions in using the model
17
Reuse standards Reuse as many standards as possible OWL RDF SKOS
Dublin Core LMF TMF
18
The OntoLex-Lemon Model
19
...Cuairt Liteartha do Theangacha Mionlaigh san Eoraip...
Ontologies ...Cuairt Liteartha do Theangacha Mionlaigh san Eoraip...
20
Linked Data on the Web “Edema” http://dbpedia.org/resource/Edema
umls:C mesh:D00487 icd10:R60.9
21
Linked Data with Language
“Edemata” “Edema” mesh:D00487 icd10:R60.9 umls:C “Dropsy”
22
Lexical Entries EDEMA DROPSY “Edemata” “Edema” “Dropsy” dbpedia:Edema
mesh:D00487 “Edema” icd10:R60.9 DROPSY “Dropsy” umls:C
23
What is a lexical entry? A lexical entry represents a unit of analysis of the lexicon that consists of a set of forms that are grammatically related and a set of base meanings that are associated with these forms. Thus, a lexical entry is a word, multiword expression or affix with a single part-of-speech, morphological pattern, etymology and set of senses.
24
Forms “Edemata” “Edema” number=plural number=singular EDEMA
25
Senses EDEMA DROPSY dating=old dbpedia:Fish_Dropsy dbpedia:Edema
27
The Model
29
Simple Entry OntoLex Namespace
@prefix ontolex: < . @prefix skos: < . <#cat> a ontolex:Word ; ontolex:canonicalForm [ ontolex:writtenRep ] ; ontolex:denotes [ skos:definition “A four-legged, furry ] . Lemma Sense
30
Simple Entry with Grammatical Information
@prefix ontolex: < . @prefix skos: < . @prefix lexinfo: < . <#cat> a ontolex:Word ; lexinfo:partOfSpeech lexinfo:noun ontolex:canonicalForm [ ontolex:writtenRep ; lexinfo:number lexinfo:singular ] ; ontolex:otherForm [ ontolex:writtenRep ; lexinfo:number lexinfo:plural ] ; ontolex:denotes [ skos:definition “A four-legged, furry ] . LexInfo Ontology Part of Speech Inflected Form
31
Restriction on Lexical Sense
@prefix ontolex: < . @prefix dbpedia: < . @prefix dbo: < . <#bulrush> a ontolex:Word ; ontolex:sense [ ontolex:reference dbpedia:Typha ; ontolex:usage [ rdf:value “British English” ] ] ; ontolex:denotes dbpedia:Typha . <#cattail> a ontolex:Word ; ontolex:usage [ rdf:value “American English” ] ] ; sense ⚬ reference = denotes Restriction on Lexical Sense
32
<http://john.mccr.ae> foaf:knows agsc:cimiano
Syntax and Semantics John knows Philipp < foaf:knows agsc:cimiano
33
Syntax and Semantics
34
Syntactic Frames Synsem Module
@prefix ontolex: < . @prefix synsem: < . @prefix lexinfo: < . <#know> a ontolex:Word ; synsem:synBehavior <#know/transitive> . <#know/transitive> a synsem:SyntacticFrame ; lexinfo:subject <#know/subject> ; lexinfo:directObject <#know/directObject> . Frame
35
Semantic Frames @prefix ontolex: < . @prefix synsem: < . @prefix lexinfo: < . @prefix foaf: < @prefix rdfs: < . <#know> a ontolex:Word ; ontolex:sense <#know/sense> ; synsem:synBehavior <#know/transitive> . <#know/sense> a ontolex:LexicalSense , synsem:OntoMap ; synsem:ontoMap <#know/sense> ; ontolex:reference foaf:knows ; synsem:subjOfProp <#know/subject> ; synsem:objOfProp <#know/directObject> . foaf:knows a rdf:Property ; rdfs:domain foaf:Person ; rdfs:range foaf:Person . Lexical sense is an ontology mapping Identifiers from syntactic frame Ontological definition of semantic frame
36
Syntactic-Semantic Mapping
Lexical Entry Argument (subject) Lexical Sense/ Onto Map Syntactic Frame Argument (object) Class (domain) Property Class (range)
37
Decomposition Qualitätsmanagement-System Qualität Management System
38
Decomposition
39
constituent ⚬ correspondsTo
Decomposition constituent ⚬ correspondsTo = subterm @prefix ontolex: < . @prefix decomp: < . <#summer_school> a ontolex:MultiWordExpression ; decomp:subterm <#summer>, <#school> . <#école_d’été> a ontolex:MultiWordExpression ; decomp:constituent <#école_d’été/école> , <#école_d’été/de> , <#école_d’été/été> ; rdf:_1 <#école_d’été/école> ; rdf:_2 <#école_d’été/de> ; rdf:_3 <#école_d’été/été> ; <#école_d’été/de> a decomp:Component ; decomp:correspondsTo <#de> ; lexinfo:lexTermType lexinfo:contraction . Order Component Properties
40
Variation and Translation
Cultural Translation “Japanese Rice
41
Variation and Translation
42
How to represent translation
Lexical Level (4) Translatable As Lexicosemantic Level vartrans:Translation (3) Stand-off Sense Sense (2) Translation Semantic Level dbpedia:Rice (1) Shared Reference
43
Linguistic Metadata Magic Ontology Jace the Wizard Erhnam the Djinn
44
LiMe - Linguistic Metadata
See Manuel’s Talk
45
Future Directions
46
New Modules Morphology
Lexicography (for traditional lexicographic resources) Frequency, Attribution and Corpus Information (FRAC) Etymology and Diachronicity Lexico-Syntactic Categories
47
Lexicography Module
48
Community Group Please join!
49
Thanks. This work has emanated from research supported in part by a research grant from Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289, co-funded by the European Regional Development Fund, and the European Union’s Horizon 2020 research and innovation programme under grant agreement No , ELEXIS - European Lexical Infrastructure.
50
Coffee
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.