Word Sense Disambiguation Reading: Chap 16-17, Jurafsky & Martin Instructor: Rada Mihalcea.

Slides:



Advertisements
Similar presentations
Language Technologies Reality and Promise in AKT Yorick Wilks and Fabio Ciravegna Department of Computer Science, University of Sheffield.
Advertisements

Albert Gatt Corpora and Statistical Methods Lecture 13.
Word Sense Disambiguation MAS.S60 Catherine Havasi Rob Speer.
Evaluating the Waspbench A Lexicography Tool Incorporating Word Sense Disambiguation Rob Koeling, Adam Kilgarriff, David Tugwell, Roger Evans ITRI, University.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
Word sense disambiguation and information retrieval Chapter 17 Jurafsky, D. & Martin J. H. SPEECH and LANGUAGE PROCESSING Jarmo Ritola -
Introduction to Computational Linguistics Lecture 2.
Semi-supervised learning and self-training LING 572 Fei Xia 02/14/06.
CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
Introduction to CL Session 1: 7/08/2011. What is computational linguistics? Processing natural language text by computers  for practical applications.
Taking the Kitchen Sink Seriously: An Ensemble Approach to Word Sense Disambiguation from Christopher Manning et al.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
WSD using Optimized Combination of Knowledge Sources Authors: Yorick Wilks and Mark Stevenson Presenter: Marian Olteanu.
Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Word Sense Disambiguation. Word Sense Disambiguation (WSD) Given A word in context A fixed inventory of potential word senses Decide which sense of the.
Coarse to Fine Grained Sense Disambiguation in Wikipedia Hui Shen [Ohio University] Razvan Bunescu [Ohio University] Rada Mihalcea [University of North.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Lexical Semantics CSCI-GA.2590 – Lecture 7A
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
Printing: This poster is 48” wide by 36” high. It’s designed to be printed on a large-format printer. Customizing the Content: The placeholders in this.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.
Francisco Viveros-Jiménez Alexander Gelbukh Grigori Sidorov.
CSCI 4410 Introduction to Artificial Intelligence.
A Fully Unsupervised Word Sense Disambiguation Method Using Dependency Knowledge Ping Chen University of Houston-Downtown Wei Ding University of Massachusetts-Boston.
Probabilistic Parsing Reading: Chap 14, Jurafsky & Martin This slide set was adapted from J. Martin, U. Colorado Instructor: Paul Tarau, based on Rada.
Short Introduction to Machine Learning Instructor: Rada Mihalcea.
CS 4705 Natural Language Processing Fall 2010 What is Natural Language Processing? Designing software to recognize, analyze and generate text and speech.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Paper Review by Utsav Sinha August, 2015 Part of assignment in CS 671: Natural Language Processing, IIT Kanpur.
W ORD S ENSE D ISAMBIGUATION By Mahmood Soltani Tehran University 2009/12/24 1.
SYMPOSIUM ON SEMANTICS IN SYSTEMS FOR TEXT PROCESSING September 22-24, Venice, Italy Combining Knowledge-based Methods and Supervised Learning for.
An Effective Word Sense Disambiguation Model Using Automatic Sense Tagging Based on Dictionary Information Yong-Gu Lee
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
1 Scaling Up Word Sense Disambiguation via Parallel Texts Yee Seng Chan Hwee Tou Ng Department of Computer Science National University of Singapore.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
Lecture 21 Computational Lexical Semantics Topics Features in NLTK III Computational Lexical Semantics Semantic Web USCReadings: NLTK book Chapter 10 Text.
Intelligent Database Systems Lab Presenter : Kung, Chien-Hao Authors : Yoong Keok Lee and Hwee Tou Ng 2002,EMNLP An Empirical Evaluation of Knowledge Sources.
Disambiguation Read J & M Chapter 17.1 – The Problem Washington Loses Appeal on Steel Duties Sue caught the bass with the new rod. Sue played the.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 24 (14/04/06) Prof. Pushpak Bhattacharyya IIT Bombay Word Sense Disambiguation.
NUMBER OR NUANCE: Factors Affecting Reliable Word Sense Annotation Susan Windisch Brown, Travis Rood, and Martha Palmer University of Colorado at Boulder.
Information Retrieval using Word Senses: Root Sense Tagging Approach Sang-Bum Kim, Hee-Cheol Seo and Hae-Chang Rim Natural Language Processing Lab., Department.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 1 (03/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Introduction to Natural.
L&I SCI 110: Information science and information theory Instructor: Xiangming(Simon) Mu Sept. 9, 2004.
Measuring the Influence of Errors Induced by the Presence of Dialogs in Reference Clustering of Narrative Text Alaukik Aggarwal, Department of Computer.
Dr Gordon Russell, Napier University Unit 5.1a - Database Administration - V2.0 1 CO22001 Database Administrator Section 5.1a.
Comparing Word Relatedness Measures Based on Google n-grams Aminul ISLAM, Evangelos MILIOS, Vlado KEŠELJ Faculty of Computer Science Dalhousie University,
1 Gloss-based Semantic Similarity Metrics for Predominant Sense Acquisition Ryu Iida Nara Institute of Science and Technology Diana McCarthy and Rob Koeling.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Second Language Learning From News Websites Word Sense Disambiguation using Word Embeddings.
Word Sense and Subjectivity (Coling/ACL 2006) Janyce Wiebe Rada Mihalcea University of Pittsburgh University of North Texas Acknowledgements: This slide.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics Semantic distance between two words.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
Intro to NLP - J. Eisner1 Splitting Words a.k.a. “Word Sense Disambiguation”
Natural Language Processing Vasile Rus
Coarse-grained Word Sense Disambiguation
SENSEVAL: Evaluating WSD Systems
Course Summary (Lecture for CS410 Intro Text Info Systems)
Lecture 21 Computational Lexical Semantics
Statistical NLP: Lecture 9
A method for WSD on Unrestricted Text
A programming language
CS224N Section 3: Corpora, etc.
Unsupervised Word Sense Disambiguation Using Lesk algorithm
Statistical NLP : Lecture 9 Word Sense Disambiguation
Austin Karingada, Jacob Handy, Adviser : Dr
Presentation transcript:

Word Sense Disambiguation Reading: Chap 16-17, Jurafsky & Martin Instructor: Rada Mihalcea

Slide 1 The problem of Word Sense Disambiguation Two examples: 1. There is a table and 4 chairs in the dining room. 2. The chair of the Computer Science and Engineering Department is Dr. T. For humans is not a real problem Ex. 1: chair = piece of furniture Ex. 2: chair = person For machines - one of the hardest problems in NLP

Slide 2 WSD applicability Why should one want to know the sense of a word?? Machine Translation. English - Romanian Ex. 1: Exista o masa si 4 scaune in sufragerie. Ex. 2: Decanul Facultatii de Calculatoare este Dr.Helgason See for instance AltaVista BabelfishAltaVista Babelfish Information Retrieval. Query: chair AND department AND math Retrieve documents: Referring to the Chair of the Department of Math Referring to some chairs in the Department of Math Knowledge acquisition Coreference

Slide 3 Word Sense... Disambiguation: –Distinguish word senses in texts with respect to a dictionary –WordNet, LDOCE, Roget Discrimination –Cluster word senses in a text –Pros: no need for a-priori dictionary definitions agglomerative clustering is a well studied field –Cons: sense inventory varies from one text to another hard to evaluate hard to standardize

Slide 4 Word Sense Disambiguation With respect to a dictionary (WordNet) 1. (37) sense -- (a general conscious awareness; "a sense of security"; "a sense of happiness"; "a sense of danger"; "a sense of self") 2. (23) sense, signified -- (the meaning of a word or expression; the way in which a word or expression or situation can be interpreted; "the dictionary gave several senses for the word"; "in the best sense charity is really a duty"; "the signifier is linked to the signified") 3. (19) sense, sensation, sentience, sentiency, sensory faculty -- (the faculty through which the external world is apprehended) 4. (8) common sense, good sense, gumption, horse sense, sense, mother wit -- (sound practical judgment; "I can't see the sense in doing it now"; "he hasn't got the sense God gave little green apples"; "fortunately she had the sense to run away") 5. (1) sense -- (a natural appreciation or ability; "a keen musical sense"; "a good sense of timing")

Slide 5 Main directions Knowledge-based approaches Lesk 86 Corpus based approaches Supervised algorithms: Instance-Based Learning (Ng & Lee 96) Naïve Bayes Semi-supervised algorithms: Yarowsky 95 Hybrid algorithms (supervised + dictionary)

Slide 6 WSD evaluation: Senseval Senseval 1: 1999 – about 10 teams Senseval 2: 2001 – about 30 teams Senseval 3: 2004 – coming up, check for detailswww.senseval.org How to compare WSD systems? methodology sense inventory (what dictionary) Senseval 1: Hector dictionary State-of-the-art for fine grained WSD is 75-80% Senseval 2: WordNet dictionary Fine grained statistical systems: 64% Fine grained all words systems: 69% Senseval 3: WordNet Many other tasks in addition to English WSD