Ontology Engineering: from Cognitive Science to the Semantic Web Maria Teresa Pazienza University of Roma Tor Vergata, Italy 1.

Presentation transcript:

Ontology Engineering: from Cognitive Science to the Semantic Web Maria Teresa Pazienza University of Roma Tor Vergata, Italy 1

2 Role of natural language

3 Computational lexicons and natural language technologies: Computational lexicons provide word knowledge in a form that machines can process. The representation is explicit. Word meaning is related to both morphology and syntax. Multilingual lexical links can be created.

4 Computational lexicons and natural language technologies: Computational lexicons are collections of lexical entries in a specific language. A lexical entry may correspond to a lemma (dog, fine, house) or to an inflected form (eats, ate, dogs, houses). In lemma-based lexicons, each lexical entry may collect a variable amount of information.

5 Computational lexicons and natural language technologies: Information in a lexical entry: orthographic form; categorial information (part of speech): N, V, P, …; some morphological information: gender, number, person, etc.; information on selectional properties (subcategorization); information on lemma meaning (lexical semantics).
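The kinds of information listed above can be sketched as a small record type. This is only an illustrative sketch: the class name `LexicalEntry`, its field names, the `INFLECTED` table and the `lookup` helper are assumptions introduced here, not part of any lexicon named in the slides.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple


@dataclass
class LexicalEntry:
    """One entry of a toy lemma-based lexicon (hypothetical field names)."""
    form: str                                   # orthographic form
    category: str                               # part of speech: "N", "V", "P", ...
    morphology: Dict[str, str] = field(default_factory=dict)  # gender, number, ...
    subcategorization: Optional[str] = None     # selectional properties, if any
    meaning: Optional[str] = None               # lexical semantics, if any


# Lemma entries, using the lemmas mentioned in the slides.
LEXICON: Dict[str, LexicalEntry] = {
    "dog": LexicalEntry("dog", "N", {"number": "singular"}),
    "house": LexicalEntry("house", "N", {"number": "singular"}),
    "eat": LexicalEntry("eat", "V"),
}

# Hypothetical table mapping an inflected form to its lemma and features.
INFLECTED: Dict[str, Tuple[str, Dict[str, str]]] = {
    "dogs": ("dog", {"number": "plural"}),
    "houses": ("house", {"number": "plural"}),
    "eats": ("eat", {"person": "3", "number": "singular"}),
}


def lookup(form: str) -> Optional[LexicalEntry]:
    """Return the entry for a lemma or an inflected form, or None if unknown."""
    if form in LEXICON:
        return LEXICON[form]
    if form in INFLECTED:
        lemma, features = INFLECTED[form]
        base = LEXICON[lemma]
        # The inflected form inherits category and meaning from its lemma.
        return LexicalEntry(form, base.category, dict(features),
                            base.subcategorization, base.meaning)
    return None
```

A call such as `lookup("dogs")` resolves the inflected form through its lemma, which is one possible reading of the lemma-based organization described in the slides.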

6 Computational lexicons and natural language technologies: A language processing system is composed of at least the following components: a morphological analyzer, a syntactic analyzer/parser, and a computational lexicon (diagram labels: phrase/text as input, results as output).

7 Ontologies and computational lexicons Semantic Web Ontologies Computational Lexicons HLT Access to Content ?

8 Ontologies: “An ontology is an explicit specification of a conceptualization” (Gruber, 1993). “It includes vocabulary, semantic links, a few simple inference rules and logics” (Hendler, 2001).

9 “Linguistic” ontologies Systems of symbols representing concepts as they are coded by linguistic expressions (lexical units, terms,...) They specify semantic classes by grouping terms with similar meaning A language for semantic representation is used OBJECT EVENT LOCATION ARTIFACT ANIMAL ENTITY VEHICLE MAMMAL BEACH CONCERT dog, cat, horse car, van, truck beach piano concert, rock concert spiaggia

10 “Linguistic” ontologies: Monolingual vs multilingual. General purpose vs domain specific. Types of content: (morpho)syntactic, semantic, mixed, terminological.

11 Syntactic computational lexicons: Lexical information is represented in subcategorization frames (ComLex, PAROLE, etc.). Syntactic frames express: the number of arguments; their syntactic categories (PP, NP, etc.); lexical constraints on arguments (e.g. a PP must have a specific preposition as its first element); a functional role for each argument (Subj, Obj, etc.). Examples: hit [V: (Subj: NP) (Objd: NP)]; answer [N: (Obji: PP_to)]
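A subcategorization frame of the kind shown above can be sketched as a small data structure. The class names `Argument` and `Frame` and the `render` helper are assumptions made for illustration; they do not reproduce the actual ComLex or PAROLE formats.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Argument:
    role: str                          # functional role: "Subj", "Objd", "Obji"
    category: str                      # syntactic category: "NP", "PP", ...
    preposition: Optional[str] = None  # lexical constraint on a PP, if any


@dataclass(frozen=True)
class Frame:
    lemma: str
    pos: str                           # category of the head: "V", "N", ...
    arguments: Tuple[Argument, ...]

    def arity(self) -> int:
        """Number of arguments the frame requires."""
        return len(self.arguments)


# The two frames given on the slide.
HIT = Frame("hit", "V", (Argument("Subj", "NP"), Argument("Objd", "NP")))
ANSWER = Frame("answer", "N", (Argument("Obji", "PP", preposition="to"),))


def render(frame: Frame) -> str:
    """Print a frame back in the bracketed notation used on the slide."""
    parts = []
    for arg in frame.arguments:
        cat = arg.category
        if arg.preposition is not None:
            cat = f"{cat}_{arg.preposition}"
        parts.append(f"({arg.role}: {cat})")
    return f"{frame.lemma} [{frame.pos}: {' '.join(parts)}]"
```

Under these assumptions, `render(HIT)` gives back the slide notation, and `ANSWER.arity()` reports a single argument constrained to the preposition "to".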

12 Semantic computational lexicons: They represent the meaning of a word by distinguishing different word senses; by expressing inferences (being human => being animate); by representing similarity, relatedness, etc. (e.g. bank, current account, money are concepts that are related in a financial context).

13 Semantic computational lexicons, based on: Conceptual nets: WordNet (Miller, Fellbaum et al.), EuroWordNet (Vossen et al.), … Frames: Mikrokosmos (Nirenburg, Mahesh et al.), FrameNet (Fillmore et al.), … Hybrid: SIMPLE (Calzolari, Lenci et al.), …

14 Semantic lexicons: Lexicons are generally organized alphabetically. Most of them reproduce the structure of dictionaries, presenting information starting from the word (the lemma, etc.). A lexicon can also be organized on a different basis, for example a conceptual one.

15 Words and concepts: Words, e.g. “dog”, “eat”, etc., express concepts. Example: “Dogs are mammals”. The sentence has among its constituents the words “dog”, “mammal”, …; the proposition has among its constituents the concepts dog, mammal. Concepts may be considered constituents of meaning (that is, of what we wish to communicate). To understand a proposition we must understand all the concepts expressed by its constituents.

16 Polysemy and synonymy: A given word (e.g. “bank”) may have different senses, that is, it may express more than one concept in different contexts; such a word is called polysemous. bank = institution where people can keep their money, etc.; bank = raised ground along the edge of a river or lake, etc.

17 Polysemy and synonymy: Conversely, the same concept may be expressed by different words (synonyms): house, residence, flat, … Neither synonymy nor polysemy is an absolute property; both are context dependent. These properties may be helpful for inference.

18 Hyperonym and hyponym: A robin is (is-a) a bird, a bird is (is-a) an animal, an animal is (is-a) a living being… robin is-a bird is-a animal is-a living being… The concept robin is subordinate to the concept bird. The concept bird is superordinate to the concept robin. The word “robin” is a hyponym of the word “bird”. The word “bird” is a hyperonym of the word “robin”. These properties may be helpful for inference.
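The kind of inference that an is-a chain supports can be sketched in a few lines. The dictionary `IS_A` holds only the chain from the slide; the function names `hypernyms` and `is_kind_of` are assumptions chosen for this illustration, and the sketch assumes each concept has a single superordinate.

```python
from typing import Dict, List

# Direct is-a links from the slide: each concept points to its superordinate.
IS_A: Dict[str, str] = {
    "robin": "bird",
    "bird": "animal",
    "animal": "living being",
}


def hypernyms(concept: str, is_a: Dict[str, str] = IS_A) -> List[str]:
    """Follow is-a links upward and return all superordinate concepts in order."""
    chain: List[str] = []
    seen = {concept}
    while concept in is_a:
        concept = is_a[concept]
        if concept in seen:      # guard against accidental cycles in the data
            break
        seen.add(concept)
        chain.append(concept)
    return chain


def is_kind_of(specific: str, general: str, is_a: Dict[str, str] = IS_A) -> bool:
    """True if `general` is reachable from `specific` through is-a links."""
    return general in hypernyms(specific, is_a)
```

Because is-a is transitive, `is_kind_of("robin", "animal")` holds even though only the direct link robin is-a bird is stored.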

19 Lexical concepts: A lexical concept is a concept that, in a specific language, can be expressed in a simple way (a word, a compound word, etc.). house is a lexical concept; house made of glass is not a lexical concept.

20 Lexical concepts representation: A lexical concept may be represented as the set of synonymous words (synset) that express it, e.g. {automobile, car}. Synsets (representations of lexical concepts) can be related by means of hyponymy and hyperonymy. Criterion for placing two words in the same synset: a native speaker can substitute one word for the other in the largest number of contexts.

21 {automobile, car} is-a {vehicle} is-a {transportation means} …

22 WordNet (WN): WordNet (WN) was developed at Princeton University by George Miller's research group as a model of the mental lexicon. Definition by C. Fellbaum: … a semantic dictionary designed as a network, to represent words and concepts as an interrelated system; it seems consistent with the evidence about the way speakers organize their own mental lexicons… It is a semantic network in which concepts are defined in terms of their relations with other concepts. In WordNet, words are structured in 15 different hierarchies; the root of each corresponds to a sort of semantic primitive: {activity}, {animal}, {artifact}, {attribute}, {body}, {cognition, knowledge}, {communication}, {event}, …

23 Hierarchies (figure; visible labels: activity, communication)

24 WordNet (WN): WordNet (WN) is a lexical database for the English language, with high coverage of English lexical entries (N, V, Adj, Adv) and information on lexical and semantic relations among entries:
1. synonymy (automobile, car)
2. hyponymy, a kind of (ambulance, automobile)
3. meronymy, has part (hand, fingers)
4. antonymy (day, night)
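The four relation types above can be sketched as typed pairs. This is a toy store holding only the slide's examples; the names `RELATIONS`, `SYMMETRIC` and `related` are assumptions for illustration, and treating synonymy and antonymy as symmetric is a modelling choice made here, not something the slide states.

```python
from typing import List, Tuple

# (relation, first term, second term), using the examples from the slide.
RELATIONS: List[Tuple[str, str, str]] = [
    ("synonymy", "automobile", "car"),
    ("hyponymy", "ambulance", "automobile"),   # ambulance is a kind of automobile
    ("meronymy", "hand", "fingers"),           # hand has part fingers
    ("antonymy", "day", "night"),
]

# Assumption: these relations hold in both directions.
SYMMETRIC = {"synonymy", "antonymy"}


def related(relation: str, a: str, b: str) -> bool:
    """True if the pair (a, b) is recorded under the given relation."""
    for rel, x, y in RELATIONS:
        if rel != relation:
            continue
        if (x, y) == (a, b):
            return True
        if rel in SYMMETRIC and (x, y) == (b, a):
            return True
    return False
```

With this store, `related("antonymy", "night", "day")` is true because antonymy is treated as symmetric, while `related("hyponymy", "automobile", "ambulance")` is false because hyponymy is directional.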

25 WordNet WN: Each word can have different senses (identified by numbers), each identifying a specific synset, which is composed of synonymous terms. With such a structure it is possible to make explicit the gloss corresponding to a specific word sense (as in a conventional dictionary), as well as the semantic relations in which that sense is involved.

26 WordNet (WN) structure: The fundamental structural element of WN is the synset (synonym set). A synset corresponds to a concept; a concept is expressed by a synset. Example: senses of “car” (synsets to which “car” belongs): {car, auto, automobile, machine, motorcar}; {car, railcar, railway car, railroad car}; {cable car, car}; {car, gondola}; {car, elevator car}
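The idea that the senses of a word are the synsets it belongs to can be sketched with plain sets. The data below is just the five “car” synsets from the slide; the names `build_index`, `senses` and `are_synonyms` are assumptions for this illustration and are not the WordNet API.

```python
from collections import defaultdict
from typing import Dict, FrozenSet, Iterable, List

# The five synsets containing "car", as listed on the slide.
SYNSETS: List[FrozenSet[str]] = [
    frozenset({"car", "auto", "automobile", "machine", "motorcar"}),
    frozenset({"car", "railcar", "railway car", "railroad car"}),
    frozenset({"cable car", "car"}),
    frozenset({"car", "gondola"}),
    frozenset({"car", "elevator car"}),
]


def build_index(synsets: Iterable[FrozenSet[str]]) -> Dict[str, List[FrozenSet[str]]]:
    """Map every word to the list of synsets it belongs to."""
    index: Dict[str, List[FrozenSet[str]]] = defaultdict(list)
    for synset in synsets:
        for word in synset:
            index[word].append(synset)
    return index


INDEX = build_index(SYNSETS)


def senses(word: str) -> List[FrozenSet[str]]:
    """The senses of a word are the synsets that contain it."""
    return INDEX.get(word, [])


def are_synonyms(a: str, b: str) -> bool:
    """Two words are synonyms in some sense if they share at least one synset."""
    return any(b in synset for synset in senses(a))
```

Here `len(senses("car"))` is 5, matching the five senses on the slide, and `are_synonyms("gondola", "railcar")` is false because the two words only meet through different senses of “car”.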

27 WordNet (WN) structure: Separate tables (files) for the different syntactic categories (N, V, Adj, Adv). Links between words and synsets, as well as among synsets (the latter represent semantic relations). Example: {persons, individuals, humans} a kind of {organism, being} a kind of {living thing, animate thing} a kind of {object, physical object} a kind of {entity, physical thing}

28 WordNet structure

29 WordNet WN (not updated values)

30 WordNet WN: The word “bass” has 8 senses in WordNet:
1. bass - (the lowest part of the musical range)
2. bass, bass part - (the lowest part in polyphonic music)
3. bass, basso - (an adult male singer with the lowest voice)
4. sea bass, bass - (flesh of lean-fleshed saltwater fish of the family Serranidae)
5. freshwater bass, bass - (any of various North American lean-fleshed freshwater fishes especially of the genus Micropterus)
6. bass, bass voice, basso - (the lowest adult male singing voice)
7. bass - (the member with the lowest range of a family of musical instruments)
8. bass - (nontechnical name for any of numerous edible marine and freshwater spiny-finned fishes)

31 Hierarchies in WordNet

32 WordNet WN: Synsets are organized hierarchically by means of hyperonymy and hyponymy relations. Further semantic relations exist between synsets (role, part-of, cause); thanks to them, a very rich and complex semantic network has been built. By using the semantic structure of WordNet, anyone can build a personalized cognitive view starting from a word.

33 WordNet WN: WN can be seen under two different aspects: as a lexicon describing the different word senses, and as an ontology describing the semantic relations between concepts. WN was initially created for English; versions for further languages have since been developed: Dutch, Spanish, Italian, Basque, … EuroWordNet multilingual database (Vossen)

34 WordNet WN: The most relevant aspect of WordNet is the notion of synset; a synset makes it possible to define a sense (as well as a concept). For example, table as a verb meaning defer > {postpone, hold over, table, shelve, set back, defer, remit, put off}. For WordNet, the meaning of this sense of table is just this list.

35 WordNet WN: domain-independent lexical relations (among entries, senses, sets of synonyms)

36 WordNet WN: A few problems. Confusion between concepts and individuals (lack of expressivity: with the relation INSTANCE-OF it is not possible to distinguish between concept-concept subsumption and individual-concept instantiation). Confusion between object level and meta level (e.g. the concept Abstraction includes both abstract entities such as Set, Time, Space and abstractions and meta-level concepts such as Attribute, Relation, Quantity). Confusion between different levels of generality (e.g. entities are both types and roles).
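The first problem above can be made concrete by keeping subsumption and instantiation in two separate relations. The example data (“Rome”, “city”, “location”, “entity”) is hypothetical and is not taken from the slides or from WordNet; the names `SUBCLASS_OF`, `INSTANCE_OF`, `classes_of` and `is_individual` are likewise assumptions for this sketch.

```python
from typing import Dict, List

# Concept-concept subsumption (hypothetical example data).
SUBCLASS_OF: Dict[str, str] = {
    "city": "location",
    "location": "entity",
}

# Individual-concept instantiation (hypothetical example data).
INSTANCE_OF: Dict[str, str] = {
    "Rome": "city",
}


def is_individual(name: str) -> bool:
    """An individual is something that instantiates a concept."""
    return name in INSTANCE_OF


def classes_of(individual: str) -> List[str]:
    """All concepts an individual falls under: its direct class, then superclasses."""
    result: List[str] = []
    concept = INSTANCE_OF.get(individual)
    while concept is not None:
        result.append(concept)
        concept = SUBCLASS_OF.get(concept)
    return result
```

With two relations, a query can tell that “Rome” is an individual while “city” is a concept; collapsing both into a single link, as the slide criticizes, would lose that distinction.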