1 Representing Meaning in Unsupervised Word Sense Disambiguation Bridget T. McInnes 5 September 2008 University of Minnesota Twin Cities.

Slides:



Advertisements
Similar presentations
Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme Presented by Smitashree Choudhury.
Advertisements

Determining the Syntactic Structure of Medical Terms in Clinical Notes Bridget T. McInnes¹ Ted Pedersen² and Serguei V. Pakhomov¹ University of Minnesota¹.
A UTOMATICALLY A CQUIRING A S EMANTIC N ETWORK OF R ELATED C ONCEPTS Date: 2011/11/14 Source: Sean Szumlanski et. al (CIKM’10) Advisor: Jia-ling, Koh Speaker:
Indexing the Biomedical Literature in a Time of Increased Demand and Limited Resources BioASQ Workshop September 27, 2013 Alan R. Aronson Lister Hill Center,
The Impact of Task and Corpus on Event Extraction Systems Ralph Grishman New York University Malta, May 2010 NYU.
U. S. National Library of Medicine NLM Indexing Initiative Tools for NLP: MetaMap and the Medical Text Indexer Natural Language Processing: State of the.
Word sense disambiguation and information retrieval Chapter 17 Jurafsky, D. & Martin J. H. SPEECH and LANGUAGE PROCESSING Jarmo Ritola -
Scott Wen-tau Yih (Microsoft Research) Joint work with Vahed Qazvinian (University of Michigan)
LEDIR : An Unsupervised Algorithm for Learning Directionality of Inference Rules Advisor: Hsin-His Chen Reporter: Chi-Hsin Yu Date: From EMNLP.
K NOWLEDGE - BASED M ETHOD FOR D ETERMINING THE M EANING OF A MBIGUOUS B IOMEDICAL T ERMS U SING I NFORMATION C ONTENT M EASURES OF S IMILARITY Bridget.
An open source, ontology-driven concept analysis engine, with applications to capturing knowledge regarding protein transport, protein interactions and.
CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.
An Unsupervised Approach to Biomedical Term Disambiguation: Integrating UMLS and Medline Bridget T McInnes University of Minnesota Twin Cities Background.
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
CS 4705 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised –Dictionary-based.
1 Complementarity of Lexical and Simple Syntactic Features: The SyntaLex Approach to S ENSEVAL -3 Saif Mohammad Ted Pedersen University of Toronto, Toronto.
Semantic Video Classification Based on Subtitles and Domain Terminologies Polyxeni Katsiouli, Vassileios Tsetsos, Stathes Hadjiefthymiades P ervasive C.
1 Discriminating Word Senses Using McQuitty’s Similarity Analysis Amruta Purandare University of Minnesota, Duluth Advisor : Dr Ted Pedersen Research supported.
Word sense induction using continuous vector space models
1 UCB Digital Library Project An Experiment in Using Lexical Disambiguation to Enhance Information Access Robert Wilensky, Isaac Cheng, Timotius Tjahjadi,
Evaluation of Utility of LSA for Word Sense Discrimination Esther Levin, Mehrbod Sharifi, Jerry Ball
Word Sense Disambiguation for Automatic Taxonomy Construction from Text-Based Web Corpora 12th International Conference on Web Information System Engineering.
Automated Classification of Medical Questions Using Semantic Parsing Techniques Paul E. Pancoast, MD Arthur B. Smith, MS Chi-Ren Shyu, PhD University of.
Unsupervised Learning. CS583, Bing Liu, UIC 2 Supervised learning vs. unsupervised learning Supervised learning: discover patterns in the data that relate.
“How much context do you need?” An experiment about context size in Interactive Cross-language Question Answering B. Navarro, L. Moreno-Monteagudo, E.
Jiuling Zhang  Why perform query expansion?  WordNet based Word Sense Disambiguation WordNet Word Sense Disambiguation  Conceptual Query.
Annual reports and feedback from UMLS licensees Kin Wah Fung MD, MSc, MA The UMLS Team National Library of Medicine Workshop on the Future of the UMLS.
Stuart Card PARC (since ’74) Area Manager of the User Interface Research Center Ph.D. in Psychology from Carnegie Mellon Co-authored “The Psychology of.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Paper Review by Utsav Sinha August, 2015 Part of assignment in CS 671: Natural Language Processing, IIT Kanpur.
W ORD S ENSE D ISAMBIGUATION By Mahmood Soltani Tehran University 2009/12/24 1.
Combining terminology resources and statistical methods for entity recognition: an evaluation Angus Roberts, Robert Gaizauskas, Mark Hepple, Yikun Guo.
A Word at a Time: Computing Word Relatedness using Temporal Semantic Analysis Kira Radinsky (Technion) Eugene Agichtein (Emory) Evgeniy Gabrilovich (Yahoo!
SYMPOSIUM ON SEMANTICS IN SYSTEMS FOR TEXT PROCESSING September 22-24, Venice, Italy Combining Knowledge-based Methods and Supervised Learning for.
An Effective Word Sense Disambiguation Model Using Automatic Sense Tagging Based on Dictionary Information Yong-Gu Lee
Katrin Erk Vector space models of word meaning. Geometric interpretation of lists of feature/value pairs In cognitive science: representation of a concept.
2014 EMNLP Xinxiong Chen, Zhiyuan Liu, Maosong Sun State Key Laboratory of Intelligent Technology and Systems Tsinghua National Laboratory for Information.
Relevance Detection Approach to Gene Annotation Aid to automatic annotation of databases Annotation flow –Extraction of molecular function of a gene from.
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources Author: Carmen Banea, Rada Mihalcea, Janyce Wiebe Source:
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
HyperLex: lexical cartography for information retrieval Jean Veronis Presented by: Siddhanth Jain( ) Samiulla Shaikh( )
Disambiguation of Biomedical Text Mark Stevenson Natural Language Processing Group University of Sheffield, UK Joint work.
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
Exploiting Wikipedia Categorization for Predicting Age and Gender of Blog Authors K Santosh Aditya Joshi Manish Gupta Vasudeva Varma
Extracting Keyphrases to Represent Relations in Social Networks from Web Junichiro Mori and Mitsuru Ishizuka Universiry of Tokyo Yutaka Matsuo National.
Evgeniy Gabrilovich and Shaul Markovitch
1 A Web Search Engine-Based Approach to Measure Semantic Similarity between Words Presenter: Guan-Yu Chen IEEE Trans. on Knowledge & Data Engineering,
Using Semantic Relatedness for Word Sense Disambiguation
Improving Named Entity Translation Combining Phonetic and Semantic Similarities Fei Huang, Stephan Vogel, Alex Waibel Language Technologies Institute School.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Unsupervised word sense disambiguation for Korean through the acyclic weighted digraph using corpus and.
Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
Comparing Word Relatedness Measures Based on Google n-grams Aminul ISLAM, Evangelos MILIOS, Vlado KEŠELJ Faculty of Computer Science Dalhousie University,
1 Gloss-based Semantic Similarity Metrics for Predominant Sense Acquisition Ryu Iida Nara Institute of Science and Technology Diana McCarthy and Rob Koeling.
Second Language Learning From News Websites Word Sense Disambiguation using Word Embeddings.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
Medical Semantic Similarity with a Neural Language Model Dongfang Xu School of Information Using Skip-gram Model for word embedding.
Korean version of GloVe Applying GloVe & word2vec model to Korean corpus speaker : 양희정 date :
Bridget McInnes Ted Pedersen Serguei Pakhomov
Using UMLS CUIs for WSD in the Biomedical Domain
Category-Based Pseudowords
Word embeddings (continued)
Presentation transcript:

1 Representing Meaning in Unsupervised Word Sense Disambiguation Bridget T. McInnes 5 September 2008 University of Minnesota Twin Cities

2 What is WSD? The culture count doubled. Culture Laboratory Culture Anthropological Culture Sense Inventory

3 Approaches to WSD Supervised Advantages: obtains a high accuracy Disadvantages: manually annotated training data is required for each word that needs to be disambiguated therefore it can not scale Unsupervised Advantages: does not require manually annotated training data Disadvantages: generally does not obtain as high of an accuracy as supervised approaches

4 Unsupervised Approaches Similarity and Relatedness Based

5 Unsupervised Approaches Similarity and Relatedness Based Patwardhan, Banerjee and Pedersen 2005 Pedersen, et al 2006 Budanitsky and Hirst 2006

6 Unsupervised Approaches Similarity and Relatedness based Vector Based

7 Unsupervised Approaches Similarity and Relatedness Based Vector-based Mohammad and Hirst, 2006 Patwardhan, 2003 Pedersen, et al 2006 Humphrey, et al 2006

8 Unsupervised Approaches Similarity and Relatedness-based Vector-based Clustering

9 Unsupervised Approaches Similarity and Relatedness based Vector-based Clustering Pedersen and Bruce, 1997 Shütze, 1998 Pedersen and Bruce, 1998 Purandare and Pedersen, 2004 Kulkarni and Pedersen, 2005

10 Road Map Previous Approaches Our vector approach Future Work

11 Previous Approaches Similarity and Relatedness Based SenseRelate (Banerjee and Pedersen, 2003) Vector-based Semantic Type Indexing (Humphrey et al 2006) Clustering SenseClusters (Kulkarni and Pedersen, 2005)

12 Banerjee and Pedersen 2003 Sense Relate

13 SenseRelate Target Word: Transport Concept 1: Biological Transport (C ) Concept 2: Patient Transport (C ) Transport of glutathione S-linked conjugates. glutathione S-linked conjugates. C C C C = SS + SS = Total SS for Concept 1

14 SenseRelate Target Word: Transport Concept 1: Biological Transport (C ) Concept 2: Patient Transport (C ) Transport of glutathione S-linked conjugates. glutathione S-linked conjugates. C C C C = SS + SS = Total SS for concept 2 C = SS + SS = Total SS for concept 1

15 Humphrey et al, 2006 Semantic Type Indexing for WSD

16 Semantic Type Indexing (STI) Target Word: Transport Concept 2 Vector Concept 1 Vector Target Word Vector Cosine 2 Cosine 1 Concept 1: Biological Transport Semantic type: Cell Function Concept 2: Patient Transport Semantic type: Health Care Activity JDI CV1 – JDI vector CV2 – JDI vector TW – JDI vector Transport of glutathione S-linked conjugates.

17 Target Word Vector Transport of glutathione S-linked conjugates. Contains the words surrounding the ambiguous word

18 STI - Target Word Vectors Transport of glutathione S-linked conjugates. Contains the words surrounding the ambiguous word

19 STI -Concept Vectors The concept vectors are created based on their semantic type(s) Transport:C : Biological Transport C : Patient Transport C C Cell Function One word terms in the Metathesaurus associated with Cell Function Health Care Activity One word terms in the Metathesaurus associated with Health Care Activity

20 Kulkarni and Pedersen, 2005 SenseClusters

21 Sense Clusters (SC) Target Word: Transport Concept 1: Biological TransportConcept 2: Patient Transport Instance 1 Instance 2 Instance 3 Instance 4 Instance 5 Instance 6 Instance 7 Instance 8 Instance 9 Instance 10 Instance 11 Instance 12 Instance 13 … Concept 1 Concept 2 Transport of glutathione S-linked conjugates.

22 Sense Clusters (SC) Instance 1 Instance 2 Instance 3 Instance 4 Instance 5 Instance 6 Instance 7 Instance 8 Instance 9 Instance 10 Instance 11 Instance 12 Instance 13 … Concept 1 Concept 2 Target Word: Transport Concept 1: Biological TransportConcept 2: Patient Transport Transport of glutathione S-linked conjugates.

23 Sense Clusters Concept 2 Vector Concept 1 Vector Target Word Vector Cosine 2 Cosine 1 Target Word: Transport Concept 1: Biological TransportConcept 2: Patient Transport Transport of glutathione S-linked conjugates.

24 SC -Vectors Contain the words surrounding the ambiguous word Created using: First order co-occurrences Second order co-occurrences

25 First Order Co-occurrence Vectors glutathione S-linked conjugates Word 1 Word 2 Word N Target Vector

26 Second Order Co-occurrence Vectors Word 1 Word 2 Word N st order glutathione … … … … … …… Word1 Word 2 … Word N 022 … 2 nd order glutathione

27 Second Order Co-occurrence Vectors S-linked conjugates Word 1 Word 2 Word N Target Vector glutathione

28 Our unsupervised approach

29 CuiTools Approach Our approach uses a general vector approach with SenseCluster vectors

30 CuiTools Concept 2 Vector Concept 1 Vector Target Word Vector Cosine 2 Cosine 1 Target Word: Transport Concept 1: Biological Transport (C ) Concept 2: Patient Transport (C ) Transport of glutathione S-linked conjugates.

31 CuiTools Approach We explore using First-order co-occurrence vectors Second-order co-occurrence vectors Our approach uses a general vector approach with SenseCluster vectors

32 Target Word Vector Contains the words surrounding the ambiguous word Transport of glutathione S-linked conjugates.

33 CuiTools - Concept Vectors How to create a vector that can represent the meaning of a concept for word sense disambiguation?

34 To answer this question We explore information in the UMLS that can be used to represent the meaning of a concept.

35 CuiTools - Concept Vectors Adjustment Individual Adjustment Conceptually broad term referring to a state of harmony between internal needs and external … Adjustment Action The act of making necessary corrections or modifications … Psychological Adjustment A state of harmony between internal needs and external demands and the processes used … CUI definition

36 CuiTools - Concept Vectors Blood Pressure Force exerted by the blood on the walls of the arteries and other vessels. Blood Pressure Determination Actions performed to measure the diastolic and systolic pressure of the blood. Arterial Pressure NO DEFINTION CUI definition

37 CuiTools - Concept Vectors CUI definition Use CUI definition but if it doesn’t exist PARent definition Semantic Type definition SYNonymous terms For example: C : Laboratory Culture laboratory culture microbial culture sample culture

38 CuiTools - Concept Vectors CUI definition PARent definition Semantic Type definition SIBlings For example: C : Anthropological Culture archeology family social groups If CUI definition doesn’t exist SYNonymous terms

39 CuiTools - Concept Vectors CUI definition If CUI definition doesn’t exist PARent definition Semantic Type definition SIBlings SYNonymous terms TOP 50 most frequent words surrounding the terms associated with the CUI

40 Dataset National Library of Medicine's Word Sense Disambiguation (NLM-WSD) Dataset 50 words from the 1998 MEDLINE abstracts 100 instances for each of the 50 words The target word was manually assigned a UMLS concept or None All instances of None were removed Average number of concepts per ambiguous word is 2.26

41 Data subsets Humphrey subset Humphrey, et al out of the 50 words in NLM-WSD 5 words were excluded because at least two of the possible concepts associated with these words have the same semantic type Instances that were assigned “None” were removed

42 Training Data The training data used to create the 1 st and 2 nd order co-occurrence vectors is 2005 Medline baseline

43 Results

45 Results of Co-occurrence Vectors

46 Results of the Representations of Meaning

47 Results of the Representations of Meaning - CUI Adding the parent and semantic type definitions decreased the accuracy by 6 and 7 percentage points Parent and semantic type definitions are too broad to define the meaning of a concept

48 Results of the Representations of Meaning - SYN Using the synonymous terms associated with a concept is too narrow to represent the meaning. Adjustment Action Adjustment – action Adjustments Adjustment, NOS Adjustment – action qualifier value Adjustment – action procedure

49 Results of the Representations of Meaning - SIB Using the terms associated the siblings of a concept is too broad to represent the meaning. Adjustment Action Biopsy Cauterisation Cautery Cold Therapy Desiccation Drainage procedure Electrolysis

50 Results of the Representations of Meaning

51 Supervised versus Unsupervised Joshi McInnes Stevenson SenseClusters Humphrey CuiTools et al 04 et al 07 et al 08 et al 06

52 To recap How to create a vector that can represent the meaning of a concept for word sense disambiguation?

53 Conclusions To answer this we explored information in the UMLS that could be used to represent the meaning of a concept Finding a context to represent the meaning of a concept is difficult We found using the top 50 most frequent words surrounding the terms associated with the concept best represented the concept for the task of word sense disambiguation

54 Take away message Unsupervised approaches are showing promise Their disadvantage due to supervised approaches obtaining a higher disambiguation accuracy is slowly disappearing But we are not there yet … so there is more work to do

55 Future Work UMLS-Similarity package Using the Semantic Similarity scores rather than frequency in the 1 st order co-occurrence vectors

56 First Order Co-occurrence Vectors glutathione S-linked conjugates Word 1 Word 2 Word N Target Vector FREQ (glutathione, word N)Average

57 First Order Co-occurrence Vectors glutathione S-linked conjugates Word 1 Word 2 Word N Target Vector Similarity (glutathione, word N)Average

58 First Order Co-occurrence Vectors glutathione S-linked conjugates Word 1 Word 2 Word N Target Vector Similarity (glutathione, word N)Sum (like SenseRelate)

59 First Order Co-occurrences glutathione Word 1 Word 2 Word N Word N (C ).3+.2 C C Similarity ==.5 C

60 Future Work UMLS-Similarity package Creating 2 nd order co-occurrence matrices based on highly similar concepts rather than words in text Using the Semantic Similarity scores rather than frequency in the 1 st order co-occurrence vectors

61 Second Order Co-occurrence Vectors Word 1 Word 2 Word N … … … … … …… Word1 Word 2 … Word N Words come from training corpus Frequency counts

62 Second Order Co-occurrence Vectors CUI 1 CUI 2 CUI N … … … … … …… CUI1 CUI2 … CUI N Use concepts from the UMLS Similarity scores

63 Future Work UMLS-Similarity package Creating 2 nd order co-occurrence matrices based on highly similar concepts rather than co- occurrences in text Use terms associated with CUIs that have a high similarity score with the possible concept to represent the meaning of the concept Using the Semantic Similarity scores rather than frequency in the 1 st order co-occurrence vectors

64 Similarity Scores What is potentially gained by using the similarity or relatedness measures May catch words/concepts that are similar but do not frequently occur together in the training data culture and ethnology Ethnology is the study of anthropology ethnology appears with culture only five times in the training data The concepts Anthropological Culture and Ethnology would have a high similarity score where as Laboratory culture and Ethnology would not

65 Software CuiTools version

66 Thank you Lan Aronson François Lang Jim Mork Aurélie Névéol Will Rogers Olivier Bodenreider Allen Browne May Chey Dina Demner- Fushman Guy Divita Kin Wah Fung Susanne Humphrey Dwayne McCully Tom Rindflesch Suresh Srinivasan