CS 4705 Relations Between Words. Today Word Clustering Words and Meaning Lexical Relations WordNet Clustering for word sense discovery.

Slides:



Advertisements
Similar presentations
CMSC 723: Intro to Computational Linguistics November 24, 2004 Lecture 12: Lexical Semantics Bonnie Dorr Christof Monz.
Advertisements

So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Lexical Semantics and Word Senses Hongning Wang
Word sense disambiguation and information retrieval Chapter 17 Jurafsky, D. & Martin J. H. SPEECH and LANGUAGE PROCESSING Jarmo Ritola -
Modeling Semantic Relations Expressed by Prepositions Vivek Srikumar and Dan Roth University of Illinois, Urbana-Champaign.
Automatic Metaphor Interpretation as a Paraphrasing Task Ekaterina Shutova Computer Lab, University of Cambridge NAACL 2010.
Chapter 17. Lexical Semantics From: Chapter 17 of An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, by.
CS 4705 Relationships among Words, Semantic Roles, and Word- Sense Disambiguation.
CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
1/27 Semantics Going beyond syntax. 2/27 Semantics Relationship between surface form and meaning What is meaning? Lexical semantics Syntax and semantics.
CS 4705 Semantic Roles and Disambiguation. Today Semantic Networks: Wordnet Thematic Roles Selectional Restrictions Selectional Association Conceptual.
The Informative Role of WordNet in Open-Domain Question Answering Marius Paşca and Sanda M. Harabagiu (NAACL 2001) Presented by Shauna Eggers CS 620 February.
CS 4705 Lecture Lexical Semantics. What is lexical semantics? Meaning of Words Lexical Relations WordNet Thematic Roles Selectional Restrictions Conceptual.
Introduction to Lexical Semantics Vasileios Hatzivassiloglou University of Texas at Dallas.
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
Creating a Bilingual Ontology: A Corpus-Based Approach for Aligning WordNet and HowNet Marine Carpuat Grace Ngai Pascale Fung Kenneth W.Church.
November 20, 2003 Chapter 16 Lexical Semantics. Words have structured meanings Lexeme – a pairing of a form with a sense Orthographic form – the way the.
CS 4705 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised –Dictionary-based.
Lecture Lexical Semantics CS 4705.
CS 4705 Lexical Semantics. Today Words and Meaning Lexical Relations WordNet Thematic Roles Selectional Restrictions Conceptual Dependency.
1 MODULE 2 Meaning and discourse in English LEXICAL RELATIONS Lesson 2.
Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.
Session 8 Lexical Semantic
SI485i : NLP Set 10 Lexical Relations slides adapted from Dan Jurafsky and Bill MacCartney.
Ontology Learning from Text: A Survey of Methods Source: LDV Forum,Volume 20, Number 2, 2005 Authors: Chris Biemann Reporter:Yong-Xiang Chen.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.
COMP423.  Query expansion  Two approaches ◦ Relevance feedback ◦ Thesaurus-based  Most Slides copied from ◦
“How much context do you need?” An experiment about context size in Interactive Cross-language Question Answering B. Navarro, L. Moreno-Monteagudo, E.
Semantics: Semantic Grammar & Information Extraction CMSC Natural Language Processing May 13, 2003.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
1 Query Operations Relevance Feedback & Query Expansion.
Paper Review by Utsav Sinha August, 2015 Part of assignment in CS 671: Natural Language Processing, IIT Kanpur.
WORD SENSE DISAMBIGUATION STUDY ON WORD NET ONTOLOGY Akilan Velmurugan Computer Networks – CS 790G.
W ORD S ENSE D ISAMBIGUATION By Mahmood Soltani Tehran University 2009/12/24 1.
Lexical Semantics Chapter 16
10/22/2015ACM WIDM'20051 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis Voutsakis.
WordNet: Connecting words and concepts Christiane Fellbaum Cognitive Science Laboratory Princeton University.
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
WordNet: Connecting words and concepts Peng.Huang.
Unsupervised Word Sense Disambiguation REU, Summer, 2009.
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
LEXICAL RELATIONS Presented by ‘the big family’ group 3 Rauwan Harahap (Opung) Riza Nirmala Putri Salmah Silih Warni Siti Anifah Siti Juariyah.
Disambiguation Read J & M Chapter 17.1 – The Problem Washington Loses Appeal on Steel Duties Sue caught the bass with the new rod. Sue played the.
Wordnet - A lexical database for the English Language.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
WordNet Enhancements: Toward Version 2.0 WordNet Connectivity Derivational Connections Disambiguated Definitions Topical Connections.
Word Relations Slides adapted from Dan Jurafsky, Jim Martin and Chris Manning.
1 Latent Concepts and the Number Orthogonal Factors in Latent Semantic Analysis Georges Dupret
Word Meaning and Similarity
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Word Meaning and Similarity Word Senses and Word Relations.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Feature Assignment LBSC 878 February 22, 1999 Douglas W. Oard and Dagobert Soergel.
Lexical Semantics and Word Senses Hongning Wang
SEMANTICS Chapter 10 Ms. Abrar Mujaddidi. What is semantics?  Semantics is the study of the conventional meaning conveyed by the use of words, phrases.
Chapter 3 Word Meaning.
Query expansion COMP423. Menu Query expansion Two approaches Relevance feedback Thesaurus-based Most Slides copied from
LEXICAL RELATIONS IN DISCOURSE
Word Relations Slides adapted from Dan Jurafsky, Jim Martin and Chris Manning.
ArtsSemNet: From Bilingual Dictionary To Bilingual Semantic Network
CSCI 5832 Natural Language Processing
WordNet: A Lexical Database for English
WordNet WordNet, WSD.
Word Relations Slides adapted from Dan Jurafsky, Jim Martin and Chris Manning.
Giannis Varelas Epimenidis Voutsakis Paraskevi Raftopoulou
Semantics Going beyond syntax.
Relations Between Words
Presentation transcript:

CS 4705 Relations Between Words

Today Word Clustering Words and Meaning Lexical Relations WordNet Clustering for word sense discovery

Related Words: Clustering Clustering feature vectors to ‘discover’ word senses using some similarity metric (e.g. cosine distance) –Represent each cluster as average of feature vectors it contains –Label clusters by hand with known senses –Classify unseen instances by proximity to these known and labeled clusters Evaluation problem –What are the ‘right’ senses?

–Cluster impurity –How do you know how many clusters to create? –Some clusters may not map to ‘known’ senses

Related Words: Dictionary Entries Lexeme: an entry in the lexicon that includes –an orthographic representation –a phonological form –a symbolic meaning representation or sense Some typical dictionary entries: –Red (‘red) n: the color of blood or a ruby –Blood (‘bluhd) n: the red liquid that circulates in the heart, arteries and veins of animals

–Right (‘rIt) adj: located nearer the right hand esp. being on the right when facing the same direction as the observer –Left (‘left) adj: located nearer to this side of the body than the right Can we get semantics directly from online dictionary entries? –Some are circular –All are defined in terms of other lexemes –You have to know something to learn something What can we learn from dictionaries? –Relations between words: Oppositions, similarities, hierarchies

Homonymy Homonyms: Words with same form – orthography and pronunciation -- but different, unrelated meanings, or senses (multiple lexemes) –A bank holds investments in a custodial account in the client’s name. –As agriculture is burgeoning on the east bank, the river will shrink even more Word sense disambiguation: what clues? Related phenomena –homophones - read and red (same pron/different orth) –homographs - bass and bass (same orth/different pron)

Ambiguity: Which applications will these cause problems for? A bass, the bank, red/read General semantic interpretation Machine translation Spelling correction Speech recognition Text to speech Information retrieval

Polysemy Word with multiple but related meanings (same lexeme) –They rarely serve red meat. –He served as U.S. ambassador. –He might have served his time in prison. What’s the difference between polysemy and homonymy? Homonymy: –Distinct, unrelated meanings –Different etymology? Coincidental similarity?

Polysemy: –Distinct but related meanings –idea bank, sperm bank, blood bank, bank bank –How different? Different subcategorization frames? Domain specificity? Can the two candidate senses be conjoined? ?He served his time and as ambassador to Norway. For either, practical task: –What are its senses? (related or not) –How are they related? (polysemy ‘easier’ here) –How can we distinguish them?

Tropes, or Figures of Speech Metaphor: one entity is given the attributes of another (tenor/vehicle/ground)Metaphor –Life is a bowl of cherries. Don’t take it serious…. –We are the eyelids of defeated caves. ?? Metonymy: one entity used to stand for another (replacive) –GM killed the Fiero. –The ham sandwich wants his check. (deferred reference) Both extend existing sense to new meaning –Metaphor: completely different concept –Metonymy: related concepts

Synonymy Substitutability: different lexemes, same meaning –How big is that plane? –How large is that plane? –How big are you? Big brother is watching. What influences substitutability? –Polysemy (large vs. old sense) –register: He’s really cheap/?parsimonious. –collocational constraints: roast beef, ?baked beef economy fare ?economy price

Finding Synonyms and Collations Automatically from a Corpus Synonyms: Identify words appearing frequently in similar contexts Blast victims were helped by civic-minded passersby. Few passersby came to the aid of this crime victim. Collocations: Identify synonyms that don’t appear in some specific similar contexts Flu victims, flu suffers,… Crime victims, ?crime sufferers, …

Hyponomy General: hypernym (super…ordinate) –dog is a hypernym of poodle Specific: hyponym (under..neath) –poodle is a hyponym of dog Test: That is a poodle implies that is a dog Ontology: set of domain objects Taxonomy? Specification of relations between those objects Object hierarchy? Structured hierarchy that supports feature inheritance (e.g. poodle inherits some properties of dog)

Semantic Networks Used to represent lexical relationships –e.g. WordNet (George Miller et al) –Most widely used hierarchically organized lexical database for English –Synset: set of synonyms, a dictionary-style definition (or gloss), and some examples of uses --> a concept –Databases for nouns, verbs, and modifiers Applications can traverse network to find synonyms, antonyms, hierarchies,... –Available for download or online use –

Using WN, e.g. in Question-Answering Pasca & Harabagiu ’01 results on TREC corpus –Parses questions to determine question type, key words (Who invented the light bulb?) –Person question; invent, light, bulb –The modern world is an electrified world. It might be argued that any of a number of electrical appliances deserves a place on a list of the millennium's most significant inventions. The light bulb, in particular, profoundly changed human existence by illuminating the night and making it hospitable to a wide range of human activity. The electric light, one of the everyday conveniences that most affects our lives, was invented in 1879 simultaneously by Thomas Alva Edison in the United States and Sir Joseph Wilson Swan in England. Finding named entities is not enough

Compare expected answer ‘type’ to potential answers –For questions of type person, expect answer is person –Identify potential person names in passages retrieved by IR –Check in WN to find which of these are hyponyms of person Or, Consider reformulations of question: Who invented the light bulb –For key words in query, look for WN synonyms –E.g. Who fabricated the light bulb? –Use this query for initial IR Results: improve system accuracy by 147% (on some question types)

Next time Chapter 18.10