Ontology Engineering: from Cognitive Science to the Semantic Web


Ontology Engineering: from Cognitive Science to the Semantic Web Maria Teresa Pazienza University of Roma Tor Vergata, Italy

Role of natural language

Computational lexicons and natural language technologies
- Computational lexicons provide word knowledge that is comprehensible to machines
- The representation is explicit
- Word meaning is related to both morphology and syntax
- Multilingual lexical links can be created

Computational lexicons and natural language technologies
- Computational lexicons are collections of lexical entries in a specific language
- A lexical entry may correspond to:
  - a lemma: dog, fine, house
  - an inflected form: eats, ate, dogs, houses
- For lemma-based lexicons, each lexical entry may collect a variable amount of information

Computational lexicons and natural language technologies
- Orthographic form
- Categorial information (parts of speech): N, V, P, ...
- Some morphological information: gender, number, person, etc.
- Information on selectional properties (subcategorization)
- Information on lemma meaning (lexical semantics)
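The five kinds of information listed above can be pictured as the fields of a single record. The following is a minimal sketch, not the format of any particular lexicon: the class name `LexicalEntry`, its field names, and the sample values (including the gloss for "dog") are all assumptions made for illustration.

```python
from dataclasses import dataclass, field

@dataclass
class LexicalEntry:
    """One lemma-based entry. All field names are hypothetical."""
    lemma: str                                        # orthographic form
    pos: str                                          # categorial information: "N", "V", "P", ...
    morphology: dict = field(default_factory=dict)    # e.g. gender, number, person
    subcat: list = field(default_factory=list)        # selectional properties
    senses: list = field(default_factory=list)        # lexical semantics

# Illustrative entries; the values are invented for the example.
dog = LexicalEntry(
    lemma="dog",
    pos="N",
    morphology={"number": "singular"},
    senses=["domestic canine"],
)
eat = LexicalEntry(lemma="eat", pos="V", subcat=["Subj: NP", "Obj: NP"])
```

Because every field except the lemma and the category has an empty default, an entry can carry a variable amount of information, as the previous slide notes.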

Computational lexicons and natural language technologies
A language processing system is composed of at least the following components:
- Morphological analyzer
- Syntactic analyzer/parser
- Computational lexicon
(Diagram labels: phrase/text, results)

Ontologies and computational lexicons
(Diagram labels: Access to Content; HLT; Semantic Web; Ontologies; Computational Lexicons; ?)

Ontologies
- “An ontology is an explicit specification of a conceptualization” (Gruber, 1993)
- “it includes vocabulary, semantic links, a few simple inference rules and logics” (Hendler, 2001)

“Linguistic” ontologies
- Systems of symbols representing concepts as they are coded by linguistic expressions (lexical units, terms, ...)
- They specify semantic classes by grouping terms with similar meaning
- A language for semantic representation is used
(Diagram: a hierarchy with the class labels ENTITY, OBJECT, EVENT, LOCATION, ARTIFACT, ANIMAL, VEHICLE, MAMMAL, BEACH, CONCERT and the terms dog, cat, horse; car, van, truck; beach, spiaggia; piano concert, rock concert)

“Linguistic” ontologies
- Monolingual vs multilingual
- General purpose vs domain specific
- Types of content: (morpho)syntactic, semantic, mixed, terminological

Syntactic computational lexicons
- Lexical information is represented in subcategorization frames (ComLex, PAROLE, etc.)
- Syntactic frames express:
  - the number of arguments
  - their syntactic categories (PP, NP, etc.)
  - lexical constraints on arguments (e.g. a PP must have a preposition as its first element)
  - a functional role for each argument (Subj, Obj, etc.)
- Examples:
  hit [V: (Subj: NP) (Objd: NP)]
  answer [N: (Obji: PP_to)]
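A subcategorization frame of this kind can be held as a small data structure. The sketch below is an assumption-laden illustration, not the ComLex or PAROLE format: the classes `Argument` and `Frame` and the method `render` are hypothetical names, and `render` simply reproduces the bracket notation used in the two examples above.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Argument:
    role: str          # functional role, e.g. "Subj", "Objd", "Obji"
    category: str      # syntactic category, e.g. "NP", "PP"
    lexical: str = ""  # lexical constraint, e.g. the preposition heading a PP

@dataclass(frozen=True)
class Frame:
    lemma: str
    pos: str
    args: tuple        # tuple of Argument; its length is the number of arguments

    def render(self) -> str:
        """Return the frame in the slide's bracket notation."""
        parts = []
        for a in self.args:
            cat = f"{a.category}_{a.lexical}" if a.lexical else a.category
            parts.append(f"({a.role}: {cat})")
        return f"{self.lemma} [{self.pos}: {' '.join(parts)}]"

hit = Frame("hit", "V", (Argument("Subj", "NP"), Argument("Objd", "NP")))
answer = Frame("answer", "N", (Argument("Obji", "PP", "to"),))

print(hit.render())
print(answer.render())
```

Each of the four kinds of information a frame expresses maps onto one part of the structure: the length of `args`, the `category`, the `lexical` constraint, and the `role`.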

Semantic computational lexicons
They represent the meaning of a word:
- by distinguishing different word senses
- by expressing inferences (being a human => being animate)
- by representing similarity, relatedness, etc. (e.g. bank, current account and money are concepts that are related in a financial context)

Semantic computational lexicons
Based on:
- Conceptual nets: WordNet (Miller, Fellbaum et al.), EuroWordNet (Vossen et al.), ...
- Frames: Mikrokosmos (Nirenburg, Mahesh et al.), FrameNet (Fillmore et al.)
- Hybrid: SIMPLE (Calzolari, Lenci et al.)

Semantic lexicons
- Lexicons are generally organized alphabetically. They mostly reproduce the structure of dictionaries, presenting information starting from the word (from the lemma, etc.)
- A lexicon can also be organized on a different basis, for example a conceptual one.

Words and concepts
- Words, e.g. “dog”, “eat”, etc., express concepts.
- Example: Dogs are mammals
  - The sentence has among its constituents the words “dog”, “mammal”, ...
  - The proposition has among its constituents the concepts dog, mammal
- Concepts may be considered constituents of meaning (that is, of what we wish to communicate).
- To understand a proposition we must understand all the concepts expressed by its constituents

Polysemy and synonymy
- A given word (e.g. “bank”) may have different senses, that is, it may express more than one concept in different contexts; such a word is called polysemous
  - bank = institution where people can keep their money, etc.
  - bank = raised ground along the edge of a river or lake, etc.

Polysemy and synonymy
- Conversely, the same concept may be expressed by different words (synonyms): house, residence, flat, ...
- Neither synonymy nor polysemy is an absolute property; both are context dependent
- These properties may be helpful for doing inference

Hyperonym and hyponym
- A robin is (is-a) a bird, a bird is (is-a) an animal, an animal is (is-a) a living being...
  robin is-a bird is-a animal is-a living being...
- The concept robin is subordinate to the concept bird.
- The concept bird is superordinate to the concept robin.
- The word “robin” is a hyponym of the word “bird”
- The word “bird” is a hyperonym of the word “robin”
- These properties may be helpful for doing inference
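The remark that these properties help with inference can be made concrete: because is-a is transitive, a chain of direct links is enough to answer questions about indirect ones. The sketch below is only an illustration under simple assumptions (each concept has a single direct hyperonym); the names `IS_A`, `hypernyms`, and `is_a` are hypothetical.

```python
# Direct is-a links taken from the robin example above.
IS_A = {"robin": "bird", "bird": "animal", "animal": "living being"}

def hypernyms(concept):
    """Follow is-a links upward and return all superordinate concepts in order."""
    chain = []
    while concept in IS_A:
        concept = IS_A[concept]
        chain.append(concept)
    return chain

def is_a(concept, ancestor):
    """True if `ancestor` is reachable from `concept` through is-a links."""
    return ancestor in hypernyms(concept)

print(hypernyms("robin"))
print(is_a("robin", "animal"))   # inferred, not stated directly
print(is_a("bird", "robin"))     # the relation is not symmetric
```

Only three links are stored, yet the fact that a robin is an animal, or a living being, follows from walking the chain.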

Lexical concepts
- A lexical concept is a concept that, in a specific language, may be expressed in a simple way (a word, a complex word, etc.).
  - house is a lexical concept
  - house made of glass is not a lexical concept

Lexical concepts representation
- A lexical concept may be represented as the set of synonymous words (synset) that express it: {automobile, car}
- Synsets (representations of lexical concepts) can be related by means of hyponymy and hyperonymy.
- Criterion for placing two words in the same synset: a native speaker can substitute one word for the other in the largest number of contexts

{automobile, car} is-a {vehicle} is-a {transportation means} ...
(Diagram: the same chain drawn top-down, from {transportation means} through {vehicle} to {automobile, car})

WordNet (WN)
- WordNet (WN) was developed at Princeton University by George Miller's research group as a model of the mental lexicon.
- Definition by C. Fellbaum: ... a semantic dictionary designed as a network, to represent words and concepts as an interrelated system; it seems consistent with the evidence on how speakers organize their own mental lexicons ...
- It is a semantic network in which concepts are defined in terms of their relations with other concepts
- In WordNet, words are structured in 15 different hierarchies. The root of each corresponds to a sort of semantic primitive: {activity}, {animal}, {artifact}, {attribute}, {body}, {cognition, knowledge}, {communication}, {event}, ...

Hierarchies
(Diagram: hierarchies rooted at activity, communication, ...)

WordNet (WN)
WordNet (WN) is a lexical database for the English language:
- high coverage of English lexical entries (N, V, Adj, Adv)
- information on lexical and semantic relations among entries:
  - synonymy (automobile, car)
  - hyponymy, "a kind of" (ambulance, automobile)
  - meronymy, "has part" (hand, fingers)
  - antonymy (day, night)
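The four relation types can be stored as labelled pairs. The sketch below is a toy store built only from the example pairs on this slide; it does not reflect WordNet's actual file format or API, and the names `RELATIONS`, `SYMMETRIC`, and `related` are hypothetical. It assumes synonymy and antonymy hold in both directions, while hyponymy and meronymy are directed.

```python
# (left, relation, right) triples from the examples above.
RELATIONS = [
    ("automobile", "synonymy", "car"),
    ("ambulance", "hyponymy", "automobile"),   # ambulance is a kind of automobile
    ("hand", "meronymy", "fingers"),           # hand has part fingers
    ("day", "antonymy", "night"),
]

# Relations assumed to hold in both directions.
SYMMETRIC = {"synonymy", "antonymy"}

def related(word, relation):
    """Return the words linked to `word` by `relation`."""
    found = []
    for left, rel, right in RELATIONS:
        if rel != relation:
            continue
        if left == word:
            found.append(right)
        elif right == word and rel in SYMMETRIC:
            found.append(left)
    return found

print(related("car", "synonymy"))
print(related("ambulance", "hyponymy"))
print(related("automobile", "hyponymy"))   # empty: the link is directed
```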

WordNet WN
- Each word can have different senses (identified by numbers), each identifying a specific synset, which is composed of synonymous terms (e.g. <living form, organism, being, living object>).
- With such a structure it is possible to make explicit the gloss corresponding to a specific word sense (as in a conventional dictionary), as well as the semantic relations in which that sense is involved.

WordNet (WN) structure
- The fundamental structural element of WN is the synset (synonym set)
- A synset is equivalent to a concept; a concept is expressed by a synset
- Example: senses of “car” (synsets to which “car” belongs)
  {car, auto, automobile, machine, motorcar}
  {car, railcar, railway car, railroad car}
  {cable car, car}
  {car, gondola}
  {car, elevator car}
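The relation between words and synsets in the “car” example can be sketched as an index from each word to the synsets that contain it. This is a self-contained illustration using only the five synsets listed above, not WordNet's own storage or lookup API; `SYNSETS`, `index`, `senses`, and `synonyms` are hypothetical names.

```python
from collections import defaultdict

# The five synsets containing "car", copied from the slide.
SYNSETS = [
    ("car", "auto", "automobile", "machine", "motorcar"),
    ("car", "railcar", "railway car", "railroad car"),
    ("cable car", "car"),
    ("car", "gondola"),
    ("car", "elevator car"),
]

# Map every word to the positions of the synsets it belongs to.
index = defaultdict(list)
for sid, words in enumerate(SYNSETS):
    for w in words:
        index[w].append(sid)

def senses(word):
    """Synsets to which `word` belongs: one per sense."""
    return [SYNSETS[s] for s in index.get(word, [])]

def synonyms(word):
    """All words sharing at least one synset with `word`."""
    return {w for s in senses(word) for w in s} - {word}

print(len(senses("car")))      # polysemy: one word, several synsets
print(senses("automobile"))    # a word with a single sense in this sample
print(synonyms("gondola"))
```

Polysemy shows up as a word indexed under several synsets; synonymy shows up as several words sharing one synset.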

WordNet (WN) structure
- Separate tables (files) for different syntactic categories (N, V, Adj, Adv)
- Links between words and synsets, as well as among synsets (the latter representing semantic relations)
- Example:
  {persons, individuals, humans}
  a kind of {organism, being}
  a kind of {living thing, animate thing}
  a kind of {object, physical object}
  a kind of {entity, physical thing}

WordNet structure

WordNet WN (not updated values)

WordNet WN
The word “bass” has 8 senses in WordNet:
1. bass - (the lowest part of the musical range)
2. bass, bass part - (the lowest part in polyphonic music)
3. bass, basso - (an adult male singer with the lowest voice)
4. sea bass, bass - (flesh of lean-fleshed saltwater fish of the family Serranidae)
5. freshwater bass, bass - (any of various North American lean-fleshed freshwater fishes especially of the genus Micropterus)
6. bass, bass voice, basso - (the lowest adult male singing voice)
7. bass - (the member with the lowest range of a family of musical instruments)
8. bass - (nontechnical name for any of numerous edible marine and freshwater spiny-finned fishes)

Hierarchies in WordNet

WordNet WN
- Synsets are organized hierarchically by means of hyperonymy and hyponymy relations
- Further semantic relations exist between synsets (role, part-of, cause); thanks to them, a very rich and complex semantic network has been built.
- Using the semantic structure of WordNet, anyone can build a personalized cognitive view starting from a word.

WordNet WN
- WN has two different aspects:
  - a lexicon describing different word senses
  - an ontology describing semantic relations between concepts
- WN was initially created for English; versions for further languages have since been developed: Dutch, Spanish, Italian, Basque, ...
- EuroWordNet multilingual database (Vossen)

WordNet WN
- The most relevant aspect of WordNet is the notion of synset; through a synset it is possible to define a sense (as well as a concept)
- For example, table as a verb meaning defer: {postpone, hold over, table, shelve, set back, defer, remit, put off}
- For WordNet, the meaning of this sense of table is just this list.

WordNet WN
Domain-independent lexical relations (among entries, senses, sets of synonyms)

WordNet WN
A few problems:
- Confusion between concepts and individuals (lack of expressivity: with the relation INSTANCE-OF it is not possible to distinguish between concept-concept subsumption and individual-concept instantiation)
- Confusion between object level and meta level (e.g. the concept Abstraction includes both abstract entities such as Set, Time and Space, and abstractions and meta-level concepts such as Attribute, Relation and Quantity)
- Confusion between different levels of generality (e.g. entities are both types and roles)