Word Sense Disambiguation UIUC - 06/10/2004 Word Sense Disambiguation Another NLP working problem for learning with constraints… Lluís Màrquez TALP, LSI,

Presentation transcript:

Word Sense Disambiguation
Another NLP working problem for learning with constraints…
Lluís Màrquez
TALP, LSI, Technical University of Catalonia
UIUC, June

Word Sense Disambiguation
The problem
–WSD is the problem of assigning the correct meaning to the words occurring in a text or discourse (sense tagging)
–Example:
  “He was mad about stars at the age#1 of nine”
  “About 20,000 years ago the last ice age#2 ended”
  age#1: the length of time something (or someone) has existed
  age#2: a historic period
–Origin in the beginnings of AI (1960s), around the first MT models
–Renewed interest with the explosion of statistical and ML-based approaches to NLP (1990s)

Word Sense Disambiguation
Usual approaches
–Supervised learning (ML): multiclass classification problem; “word-experts”. Results: about 75% accuracy on subsets of selected polysemous words, sometimes better (over 90%) on some specific words
–“Unsupervised”, “knowledge-based” = heuristic rules based on preexisting knowledge sources (WordNet, MRDs, multilingual aligned corpora, etc.). Accuracy: around 60% (allwords WSD)
–Combined approaches: 65% (allwords WSD)
–Supervised methods are better but difficult to apply to “allwords” WSD
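The “word-expert” setting above (one multiclass classifier per polysemous word) can be sketched as follows. This is only a minimal illustration: the class name `WordExpert`, the Naive Bayes learner and the two toy training examples (built from the age#1 / age#2 sentences) are assumptions made for the sketch, not the classifiers used in the talk.

```python
from collections import Counter, defaultdict
import math


class WordExpert:
    """Bag-of-words Naive Bayes for ONE target word (hypothetical stand-in learner)."""

    def __init__(self):
        self.sense_counts = Counter()               # sense -> number of training examples
        self.feature_counts = defaultdict(Counter)  # sense -> context word -> count
        self.vocabulary = set()

    def train(self, examples):
        # examples: iterable of (context_words, sense_label)
        for context_words, sense in examples:
            self.sense_counts[sense] += 1
            for word in context_words:
                self.feature_counts[sense][word] += 1
                self.vocabulary.add(word)

    def predict(self, context_words):
        total = sum(self.sense_counts.values())
        best_sense, best_score = None, float("-inf")
        for sense, count in self.sense_counts.items():
            score = math.log(count / total)  # log prior of the sense
            # Add-one smoothing over the vocabulary seen in training.
            denom = sum(self.feature_counts[sense].values()) + len(self.vocabulary)
            for word in context_words:
                score += math.log((self.feature_counts[sense][word] + 1) / denom)
            if score > best_score:
                best_sense, best_score = sense, score
        return best_sense


# One expert per polysemous word: here only "age", with toy training data.
experts = {"age": WordExpert()}
experts["age"].train([
    (["mad", "stars", "nine"], "age#1"),
    (["years", "ago", "last", "ice", "ended"], "age#2"),
])
```

Because every word needs its own labeled examples, this design makes the data requirements of “allwords” WSD visible: the dictionary of experts needs one trained entry per polysemous word.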

WSD: ML Approach
Usual features:
–Local context patterns (POS, words, lemmas): the of, CD limit, mean
–Broad context features: bag of (relevant) words, e.g. “atomic occurs in the sentence”, “dark occurs in the sentence”
–Also syntactic features capturing predicate-argument relations
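As a rough illustration of the local-context and bag-of-words features listed above, the sketch below builds both kinds for one target occurrence. The function name `extract_features`, the feature string format, the window size and the Penn-style POS tags are all assumptions; the sketch also keeps every context word rather than selecting only the “relevant” ones.

```python
def extract_features(tokens, pos_tags, target_index, window=2):
    """Return a set of string features for the token at target_index."""
    features = set()
    # Local context: words and POS tags at fixed offsets around the target.
    for offset in range(-window, window + 1):
        if offset == 0:
            continue
        i = target_index + offset
        if 0 <= i < len(tokens):
            features.add(f"word[{offset:+d}]={tokens[i].lower()}")
            features.add(f"pos[{offset:+d}]={pos_tags[i]}")
    # Broad context: "word X occurs in the sentence", position ignored.
    for i, token in enumerate(tokens):
        if i != target_index:
            features.add(f"bow={token.lower()}")
    return features


tokens = ["the", "last", "ice", "age", "ended"]
pos_tags = ["DT", "JJ", "NN", "NN", "VBD"]  # assumed tags for this toy sentence
features = extract_features(tokens, pos_tags, target_index=3)
```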

WSD: ML Approach
Main difficulties:
–Each word is a separate classification problem => data scarcity
–High granularity of the sense repositories used => many classes
–Difficulty in capturing the semantic information present in the context: context words are sparse and are themselves ambiguous (no interactions between word-classifiers have been exploited)

WSD: Difficulties
Example (from WSJ)
The jury further said in term end presentments that the City Executive Committee, which had over-all charge of the election, “deserves the praise and thanks of the City of Atlanta” for the manner in which the election was conducted.

WSD: Difficulties
Example (from WSJ, WordNet senses)
The jury#NN#1 further#RB#2 said#VB#1 in term#NN#2 end#NN#2 presentments#NN#1 that the City_Executive_Committee#1, which had#VB#4 over-all#JJ#2 charge#NN#6 of the election#NN#1, “deserves#VB#1 the praise#NN#1 and thanks#NN#1 of the City_of_Atlanta#1” for the manner#NN#1 in which the election#NN#1 was conducted#VB#1.

WSD: Difficulties
Example (from WSJ, WordNet senses)
jury#NN#1 further#RB#2 said#VB#1 term#NN#2 end#NN#2 presentments#NN#1 had#VB#4 over-all#JJ#2 charge#NN#6 election#NN#1 deserves#VB#1 praise#NN#1 thanks#NN#1 manner#NN#1 election#NN#1 conducted#VB#1.

WSD: Difficulties
Example (from WSJ, WordNet senses)
The jury(2) further(5) said(11) in term(6) end(15) presentments(3) that the City_Executive_Committee, which had(21) over-all(2) charge(15) of the election(2), “deserves the praise(2) and thanks(2) of the City_of_Atlanta” for the manner(3) in which the election(2) was conducted(5).
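Assuming the parenthesized numbers are the number of candidate senses of each word, a short calculation shows why disambiguating all the words of a sentence jointly is hard: the number of possible joint sense assignments is the product of the per-word counts. The variable names below are illustrative only.

```python
import math

# (word, number of candidate senses), copied from the slide in sentence order.
polysemy = [
    ("jury", 2), ("further", 5), ("said", 11), ("term", 6), ("end", 15),
    ("presentments", 3), ("had", 21), ("over-all", 2), ("charge", 15),
    ("election", 2), ("praise", 2), ("thanks", 2), ("manner", 3),
    ("election", 2), ("conducted", 5),
]

counts = [n for _, n in polysemy]
mean_polysemy = sum(counts) / len(counts)  # average candidate senses per word
joint_assignments = math.prod(counts)      # size of the joint search space

print(f"{mean_polysemy:.1f} senses per word on average")   # 6.4
print(f"{joint_assignments:,} possible joint assignments")  # 4,490,640,000
```

So a single sentence with 15 counted words already has about 4.5 billion joint labelings, which is why exhaustive search over full sentences is not practical.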

WSD: ML Approach
Utility?
–Useful for IR / IE / Semantic parsing / Knowledge acquisition?
–Accurately resolving WSD is more difficult than most of the NLP tasks for which it is potentially helpful
Evaluation exercises for WSD: Senseval-1/2/3
–Senseval-3 co-located with ACL-2004
–2 major types of task: “lexical sample”, “allwords”
–10 different languages + 1 multilingual lexical sample task
–Several new tasks: automatic subcategorization acquisition, WSD of WordNet glosses, Semantic Roles (English and Swedish), Logic Forms, etc.

Word Sense Disambiguation
Our involvement in Senseval-3 (TALP research group)
–As organizers: lexical sample tasks for Catalan and Spanish:
  –Coarse sense dictionary developed for the tasks, with additional information (collocations, examples, etc.)
  –Manual annotation of about 300 examples for 50 different words in each language. Context of 3 sentences. Also POS and lemma annotation
  –Large corpus of about 1,500 unannotated examples for each word
  –Best results: 85% accuracy
  –But nothing new was presented!!!

Word Sense Disambiguation
–As participants:
  –English lexical sample task: SVMs, constraint classification, thorough feature optimization and parameter tuning, (semantically) rich feature set. Accuracy: 71.6% %, state-of-the-art.
  –English allwords task: combination (cascade + weighted voting scheme) of several supervised and knowledge-based modules. Supervised modules trained on frequent words of the SemCor corpus. Knowledge-based modules rely on WordNet and WordNet Domains. Accuracy: 62.40% (67.4%)
  –Disambiguation of WordNet glosses (best results)
–Five papers already available. Resources (datasets and dictionaries) will also be available after the workshop in July.

Allwords WSD in context
New direction
... The jury#NN#1 further#RB#2 said#VB#1 in term#NN#2 end#NN#2 presentments#NN#1 that the City_Executive_Committee#1, which had#VB#4 over-all#JJ#2 charge#NN#6 of the election#NN#1, “deserves#VB#1 the praise#NN#1 and thanks#NN#1 of the City_of_Atlanta#1” for the manner#NN#1 in which the election#NN#1 was conducted#VB#1. ...

Allwords WSD in context
Example (WSJ, only nouns)
[Figure: the nouns of the example sentence: jury, term, end, presentments, charge, election, praise, thanks, manner, election]

Allwords WSD in context
Example (WSJ, only nouns): “One sense per discourse” constraint
[Figure: the nouns of the example sentence: jury, term, end, presentments, charge, election, praise, thanks, manner, election]
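One simple way to apply the “one sense per discourse” constraint is to pool the classifier scores of all occurrences of a lemma and give every occurrence the sense with the highest pooled score. The sketch below assumes that approach; the function name and the scores in the example are hypothetical and are not taken from the talk.

```python
from collections import defaultdict


def enforce_one_sense_per_discourse(occurrences):
    """occurrences: list of (lemma, {sense: score}) in text order.

    Returns one sense per occurrence, identical for all occurrences of a lemma.
    """
    totals = defaultdict(lambda: defaultdict(float))
    for lemma, scores in occurrences:
        for sense, score in scores.items():
            totals[lemma][sense] += score
    # Pick, for each lemma, the sense with the highest summed score.
    chosen = {lemma: max(senses, key=senses.get) for lemma, senses in totals.items()}
    return [chosen[lemma] for lemma, _ in occurrences]


# Two occurrences of "election" whose local decisions disagree (made-up scores).
occurrences = [
    ("election", {"election#1": 0.60, "election#2": 0.40}),
    ("charge", {"charge#6": 0.70, "charge#2": 0.30}),
    ("election", {"election#1": 0.45, "election#2": 0.55}),
]
senses = enforce_one_sense_per_discourse(occurrences)
```

Here the second occurrence of “election” would be labeled election#2 on its own, but the pooled evidence (1.05 against 0.95) makes both occurrences election#1.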

Allwords WSD in context
Example (WSJ, only nouns): sense pairs likely to occur together
[Figure: the nouns jury, term, end, presentments, charge, election, praise, thanks, manner, shown with candidate sense glosses: “body of citizens...”, “committee, panel”, “word or expression”, “limited period of time”, “point in time in which something ends”, “surface of a three dimensional object”, “an accusation of crime...”, “electrical charge”, “the act of presenting something”, “an impetuous rush toward someone...”, “a pleading”, “a command to do something”, “acknowledgement of appreciation”, “with the help or owing to”]

Allwords WSD in context
Example (WSJ, only nouns): incompatible sense pairs
[Figure: same nouns and candidate sense glosses as on the preceding slide]

Allwords WSD in context
Example (WSJ, only nouns): lots of irrelevant/unknown sense pairs
[Figure: same nouns and candidate sense glosses as on the preceding slide]

Allwords WSD in context
Selectional preferences
–To produce compatibility constraints between verbs and subject/object head nouns
–For instance: “when money#1 appears as object the preferred verbs are: raise#4 (1.44), {take_in#5, collect#2} (0.45), {earn#2, garner#2} (0.23), …”
–Need of syntactic information
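The selectional-preference example above can be read as a lookup table that scores verb senses given the sense of their object. The sketch below assumes that reading: the table layout, the function name and the decision to give every member of a braced set the same weight are assumptions; only the three weights for money#1 come from the slide.

```python
# (noun sense, syntactic role) -> {verb sense: preference weight}
SELECTIONAL_PREFERENCES = {
    ("money#1", "object"): {
        "raise#4": 1.44,
        "take_in#5": 0.45, "collect#2": 0.45,  # same set on the slide, same weight assumed
        "earn#2": 0.23, "garner#2": 0.23,
    },
}


def preferred_verb_sense(noun_sense, role, candidate_verb_senses, default=0.0):
    """Pick the candidate verb sense with the highest preference weight."""
    table = SELECTIONAL_PREFERENCES.get((noun_sense, role), {})
    return max(candidate_verb_senses, key=lambda sense: table.get(sense, default))


# "raise money": raise#1 is a hypothetical competing sense with no listed weight.
best = preferred_verb_sense("money#1", "object", ["raise#1", "raise#4"])
```

Using such a table requires knowing which noun is the object of which verb, which is the reason the slide lists syntactic information as a prerequisite.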

Allwords WSD in context
A very good starting point
–Funding: MEANING, European research project
–Resources: MCR, including WordNets from different languages, “ontologies” (Domains, SUMO, TopOntology, SemFile) linked to WordNet synsets, selectional preferences, etc.
–Tools: the Senseval-3 allwords WSD system and all its components
–People: Lluís Villarejo (PhD student at TALP)
–ML approach: Inference & Learning with Linear Constraints
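The talk does not spell out what “inference with linear constraints” would look like for allwords WSD. One possible formulation, given here only as an assumed sketch, is an integer linear program in which $x_{i,s} = 1$ means that word $i$ receives sense $s$. All symbols are introduced for illustration: $n$ is the number of words, $S_i$ the candidate senses of word $i$, $c_{i,s}$ the score the word classifier gives to sense $s$, and $\mathcal{I}$ a set of incompatible sense pairs.

```latex
\begin{align}
\max_{x} \quad & \sum_{i=1}^{n} \sum_{s \in S_i} c_{i,s}\, x_{i,s} \\
\text{s.t.} \quad & \sum_{s \in S_i} x_{i,s} = 1 && \forall i \in \{1,\dots,n\} \\
& x_{i,s} + x_{j,t} \le 1 && \forall\, (i,s,j,t) \in \mathcal{I} \\
& x_{i,s} \in \{0,1\} && \forall i,\ \forall s \in S_i
\end{align}
```

The first constraint gives each word exactly one sense, and the second forbids choosing both members of an incompatible pair. A “one sense per discourse” requirement for two occurrences $i$ and $j$ of the same lemma could be written in the same style as $x_{i,s} = x_{j,s}$ for every shared sense $s$.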

Allwords WSD in context
Potential problems
–Computational requirements
–Soft constraints
–Lots of irrelevant sense pairs
–Can compatibility constraints be reliably estimated from existing labeled corpora?
–…
–We have to codify only the most relevant constraints between pairs of “related” words at a coarse level of granularity (very general semantic class labels)

Allwords WSD in context
Current status
–Semantic-class attributes of the context words have already been incorporated as features for capturing “interactions”: gain of 1-2 points (but context words are very ambiguous…)
–Training/testing the system assuming that we know the actual senses of context words (upper bounds)
(Near) future
–Inference on top of classifiers’ output
–Learning with global feedback (coming from inference)
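“Inference on top of classifiers’ output” can be illustrated with a brute-force search over joint sense assignments that adds pairwise compatibility bonuses to the local classifier scores and skips assignments containing a forbidden pair. This is a sketch under assumptions: the function name, the scores and the bonus value are made up, and exhaustive enumeration only works for a handful of words; it is not the inference procedure planned in the talk.

```python
from itertools import product


def joint_inference(candidates, pair_scores, incompatible=frozenset()):
    """candidates: {word: {sense: classifier score}}.
    pair_scores: {frozenset({sense_a, sense_b}): bonus} for compatible pairs.
    incompatible: set of frozenset({sense_a, sense_b}) that must not co-occur.
    """
    words = list(candidates)
    best_assignment, best_score = None, float("-inf")
    for combo in product(*(candidates[w].items() for w in words)):
        senses = [sense for sense, _ in combo]
        pairs = [frozenset((a, b)) for i, a in enumerate(senses) for b in senses[i + 1:]]
        if any(pair in incompatible for pair in pairs):
            continue  # hard constraint violated
        score = sum(local for _, local in combo)
        score += sum(pair_scores.get(pair, 0.0) for pair in pairs)  # soft constraints
        if score > best_score:
            best_assignment, best_score = dict(zip(words, senses)), score
    return best_assignment


candidates = {
    "jury": {"jury#1": 0.6, "jury#2": 0.4},
    "charge": {"charge#2": 0.5, "charge#6": 0.4},
}
pair_scores = {frozenset(("jury#1", "charge#6")): 0.3}
assignment = joint_inference(candidates, pair_scores)
```

Taken independently, the classifiers would choose charge#2; the compatibility bonus with jury#1 flips the joint decision to charge#6. The loop visits every combination of candidate senses, so its cost grows with the product of the per-word sense counts.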

Thanks again for your attention!!!