Wordnet - A lexical database for the English Language.

Slides:



Advertisements
Similar presentations
The WordNet Lexical Database Bernardo Magnini ITC-irst, Istituto per la Ricerca Scientifica e Tecnologica Trento - Italy.
Advertisements

C SC 620 Advanced Topics in Natural Language Processing Lecture 4 1/27/04.
The Meaning of Language
Semantics Chapter 5.
Introduction to Ontologies ECE457 Applied Artificial Intelligence Spring 2007 Lecture #13.
Lexical Nets Miriam Butt December WordNet Main Researchers: George Miller (Princeton), Christiane Fellbaum (Princeton) WordNet is free and runs.
English Lexicography.
Introduction to Computational Linguisitics The Lexicon.
Ewa Rudnicka, Wojciech Witkowski, Maciej Piasecki G4.19 Research Group Institute of Informatics, Wrocław University of Technology nlp.pwr.wroc.pl plwordnet.pwr.wroc.pl.
Section 4: Language and Intelligence Overview Instructor: Sandiway Fong Department of Linguistics Department of Computer Science.
Chapter 17. Lexical Semantics From: Chapter 17 of An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, by.
1 Words and the Lexicon September 10th 2009 Lecture #3.
PSY 369: Psycholinguistics Representing language.
1/27 Semantics Going beyond syntax. 2/27 Semantics Relationship between surface form and meaning What is meaning? Lexical semantics Syntax and semantics.
Introduction to Lexical Semantics Vasileios Hatzivassiloglou University of Texas at Dallas.
A STUDY ON THE KNOWLEDGE SOURCES OF TURKISH EFL LEARNERS IN LEXICAL INFERENCING İlknur İSTİFÇİ Anadolu University Eskişehir, TURKEY Eskişehir, TURKEY.
1 CS 502: Computing Methods for Digital Libraries Lecture 12 Information Retrieval II.
Structured lexicons and Lexical semantics Especially WordNet ® See D Jurafsky & JH Martin: Speech and Language Processing, Upper Saddle River NJ (2000):
Using resources WordNet and the BNC. WordNet: History 1985: a group of psychologists and linguists start to develop a “lexical database” –Princeton University.
C SC 620 Advanced Topics in Natural Language Processing Lecture Notes 2 1/20/04.
Chapter 8 Structuring System Data Requirements
Symbols and Language Lexical Relations SIMS 202 Profs. Hearst & Larson UC Berkeley SIMS Fall 2000.
1 Analysing and teaching meaning SSIS Lazio - Lesson 1 prof. Hugo Bowles January 2007.
The Study of Meaning in Language
1 Indo WordNet A WordNet for Hindi Centre for Technology Development for Indian Languages Computer Science and Engineering Department, IIT Bombay.
Course G Web Search Engines 3/9/2011 Wei Xu
Introduction to linguistics II
Session 8 Lexical Semantic
WORDNET Approach on word sense techniques - AKILAN VELMURUGAN.
Adam Pease and Christiane Fellbaum Presenter: 吳怡安
BT Exact Technologies - Adastral Park, Ipswich July - October 2003 Linguistic Web Services for Semantic Web Dr. Vassil T. Vassilev London Metropolitan.
1 Natural Language Processing (2a) Zhao Hai 赵海 Department of Computer Science and Engineering Shanghai Jiao Tong University
COMP423.  Query expansion  Two approaches ◦ Relevance feedback ◦ Thesaurus-based  Most Slides copied from ◦
WordNet ® and its Java API ♦ Introduction to WordNet ♦ WordNet API for Java Name: Hao Li Uni: hl2489.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
1 Query Operations Relevance Feedback & Query Expansion.
WORD SENSE DISAMBIGUATION STUDY ON WORD NET ONTOLOGY Akilan Velmurugan Computer Networks – CS 790G.
WORDNET. THE WORDNET SYSTEM  Lexicographer files  Code: Lexico files  database  Search Routines and Interfaces.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Application of INTEX in refinement and validation of Serbian WordNet Ivan Obradović, Ranka Stanković Cvetana Krstev, Gordana Pavlović-Lažetić University.
WordNet: Connecting words and concepts Christiane Fellbaum Cognitive Science Laboratory Princeton University.
WordNet: Connecting words and concepts Peng.Huang.
Terminology and documentation*  Object of the study of terminology:  analysis and description of the units representing specialized knowledge in specialized.
Semantics The study of meaning in language. Semantics is…  The study of meaning in language.  It deals with the meaning of words (Lexical semantics)
Dr. Francisco Perlas Dumanig
WordNet Enhancements: Toward Version 2.0 WordNet Connectivity Derivational Connections Disambiguated Definitions Topical Connections.
Ontology Engineering: from Cognitive Science to the Semantic Web Maria Teresa Pazienza University of Roma Tor Vergata, Italy 1.
The meaning of Language Chapter 5 Semantics and Pragmatics Week10 Nov.19 th -23 rd.
1 Masters Thesis Presentation By Debotosh Dey AUTOMATIC CONSTRUCTION OF HASHTAGS HIERARCHIES UNIVERSITAT ROVIRA I VIRGILI Tarragona, June 2015 Supervised.
Determining Meaning You can use a dictionary for many things. A dictionary can tell you what words mean. It can tell you how to pronounce, or say, words.
Annotation Framework & ImageCLEF 2014 JAN BOTOREK, PETRA BUDÍKOVÁ
Knowledge Structure Vijay Meena ( ) Gaurav Meena ( )
NLP. Text Similarity Example: post-close market announcements The S&P 500 climbed 6.93, or 0.56 percent, to 1,243.72, its best close since June 12, 2001.
Chapter 3 The Relational Database Model. Database Systems, 10th Edition 2 * Relational model * View data logically rather than physically * Table * Structural.
Charlyn P. Salcedo Instructor Types of Indexing Languages.
Semantics Lecture 5. Semantics Language uses a system of linguistic signs, each of which is a combination of meaning and phonological and/or orthographic.
Query expansion COMP423. Menu Query expansion Two approaches Relevance feedback Thesaurus-based Most Slides copied from
Introduction to Computational Linguisitics The Lexicon.
Lesson 11 Lexical semantics 1
Ontology Engineering: from Cognitive Science to the Semantic Web
About WordNet 瞿裕忠
ArtsSemNet: From Bilingual Dictionary To Bilingual Semantic Network
An Introduction to Linguistics
WordNet: A Lexical Database for English
Introduction to Ontologies
Bulgarian WordNet Svetla Koeva Institute for Bulgarian Language
WordNet WordNet, WSD.
Lesson 11 Lexical semantics 1
The Study of Meaning in Language
Lecture 19 Word Meanings II
Presentation transcript:

Wordnet - A lexical database for the English Language. Project at Cognitive Science Laboratory, Princeton University - began in late 80s. Team consisted of linguists and psychologists. Design - inspired by psycho-linguistic theories of human lexical memory. Wordnet continues to grow – Novel applications to research.

Wordnet - A lexical database for the English Language – Goal. Alphabetical organization – clusters words that are spelt alike. scatters words with similar or related meanings. Wordnet resembles a thesaurus more than a dictionary. Goal - search dictionaries conceptually.

Wordnet - A lexical database for the English Language – Forms and Meanings. Some Definitions Word form - Physical utterance or inscription. Word meaning - a possible lexical concept that a form can be used to express. Word is commonly used to refer both. Lexical Matrix – captures the mapping between forms and meanings.

Wordnet - A lexical database for the English Language – Lexical Matrix. A Lexical Matrix

Wordnet - A lexical database for the English Language – Polysemy and Synonymy. Two entries in the same column - word form is polysemous. For example the word form “case”. Two entries in the same row - word forms are synonymous. For example the word forms “cruel” and “unjust”. Mappings between forms and meanings are many -many.

Wordnet - A lexical database for the English Language – Synonymy and Synsets. Synonymy – Two words are synonymous if substitution of one for the other does not alter the truth value. (inverse is Antonymy.) Possible Representations: List the word forms (synsets) that can be used to express a meaning - Thesaurus. Draw semantic relations between meanings i.e. synsets or list of synonyms – Wordnet.

Wordnet - A lexical database for the English Language – Human Lexical Memory. In lexical memory Nouns organized as topical hierarchies. Verbs are organized by a variety of entailment. Adjectives and adverbs are organized as hyperspaces.

Wordnet - A lexical database for the English Language – Lexical Inherence of Nouns. Dictionary – words used to describe words, causes circularity. Lexicographers impose tree structure on the semantic memory of nouns. Consider the following: oak->tree->plant->organism. Asymmetric, transitive semantic relation – Hypernymic relation. (inverse is hyponymic relation).

Wordnet - A lexical database for the English Language – Lexical Inherence of Nouns. Design creates a sequence of levels – hierarchies. Specific terms at lower levels to a few generic terms at the top. Hierarchies provide conceptual skeletons for nouns.

Wordnet - A lexical database for the English Language – Lexical Inherence of Nouns. Issue - How to choose top level generic classes. One way - Assume all nouns are in a single hierarchy. Alternative - Few generic top level concepts. Multiple hierarchies - relatively distinct semantic fields.

Wordnet - A lexical database for the English Language – Multiple Hierarchies.

Wordnet - A lexical database for the English Language – Capturing Meronymy. Canary -> Bird. (-> is Hypernymic relationship) Canary has a small size, beak and wings. (Is this relation captured?) Associate nouns with 3 characteristic features: Attributes : small, yellow. (adjectives) Parts : beak, wings. (nouns) Functions : sing, fly. (verbs)

Wordnet - A lexical database for the English Language – Network Representation.

Wordnet - A lexical database for the English Language – Adjectives. Linguists divide adjectives into two distinct classes. Descriptive - which describe a head noun. Relational - stylistic variants of nouns. Descriptive - good, bad, big, small, interesting. Relational - presidential, nuclear - derived from a noun.

Wordnet - A lexical database for the English Language – Descriptive Adjectives. Descriptive Adjectives ascribe attribute to nouns. Pointers between adjectives and noun synsets . There is no hierarchy – semantic organization thought as abstract hyperspace. Basic Semantic Relation here is antonymy.

Wordnet - A lexical database for the English Language – Bipolar Adjective Structure. Adjective synsets organized as adjective clusters. Association – Semantic similarity to a focal adjective. Focal adjective relates the cluster to contrasting cluster at opposite pole.

Wordnet - A lexical database for the English Language – Bipolar Adjective Structure.

Wordnet - A lexical database for the English Language – Relational Adjectives. Often derived from Greek and Latin nouns. Some examples: “Fraternal” relates to brother. “Atomic bomb” and “Atom bomb” both admissible. Relation with nouns most important. Cross Referenced to parent nouns.

Wordnet - A lexical database for the English Language – Verbs as Semantic Net. Verbs – Central Organizers of English sentences. Verbs highly polysemous. Polysemy count: nouns - 1.74 , verbs – 2.11. Mutability of verbs – meanings depend on kind of noun arguments. “run in the street” versus “run a company”.

Wordnet - A lexical database for the English Language – Lexical Entailment of Verbs. Entailment means Strict Implication. (P -> Q). Not possible for that “P is true” and “Q is false”. “He is snoring” entails “He is sleeping”. Entailment - Primary Relation among verbs. Troponymy - To V1 is to V2 in some particular fashion – “amble” is troponomous to “walk”.

Wordnet - A lexical database for the English Language – Familiarity Index. Familiarity influences performance variables like reading, speed of comprehension. Indicators of Familiarity: Frequency of Use – from literature. Polysemy count – more meanings implies more usage – Psycholinguistic evidence. Wordnet uses Polysemy count as written literature is a small sample compared to spoken language.

Wordnet - A lexical database for the English Language – Wordnet Team. Website Main Team – Prof. George Miller. Dr. Christiane Fellbaum. Randee Tengi. "WordNet: An Electronic Lexical Database" is available from MIT Press. http://www.cogsci.princeton.edu/~wn/