Using Semantic Relatedness for Word Sense Disambiguation

Slides:



Advertisements
Similar presentations
Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme Presented by Smitashree Choudhury.
Advertisements

Improved TF-IDF Ranker
Leveraging Sentiment to Compute Word Similarity GWC 2012, Matsue, Japan Balamurali A R *,+ Subhabrata Mukherjee + Akshat Malu + Pushpak Bhattacharyya +
Word sense disambiguation (1) Instructor: Paul Tarau, based on Rada Mihalcea’s original slides Note: Some of the material in this slide set was adapted.
Semantic News Recommendation Using WordNet and Bing Similarities 28th Symposium On Applied Computing 2013 (SAC 2013) March 21, 2013 Michel Capelle
Using Semantic Similarity Measures in the Biomedical Domain for Computing Similarity between Genes based on Gene Ontology By : Elham Khabiri Adviser :
An Evaluation Procedure for Word Net Based Lexical Chaining: Methods and Issues Irene Cramer & Marc Finthammer Faculty of Cultural.
1 Extended Gloss Overlaps as a Measure of Semantic Relatedness Satanjeev Banerjee Ted Pedersen Carnegie Mellon University University of Minnesota Duluth.
Incorporating Dictionary and Corpus Information into a Context Vector Measure of Semantic Relatedness Siddharth Patwardhan Advisor: Ted Pedersen 07/18/2003.
Lexical Semantics and Word Senses Hongning Wang
Measures of Text Similarity
 How to Manage Your Cash › Daily Cash Needs  Lunch, movies, gas, or paying for other activities  Carry cash  Go to an ATM  Credit Card  Know pros.
Lexical chains for summarization a summary of Silber & McCoy’s work by Keith Trnka.
K NOWLEDGE - BASED M ETHOD FOR D ETERMINING THE M EANING OF A MBIGUOUS B IOMEDICAL T ERMS U SING I NFORMATION C ONTENT M EASURES OF S IMILARITY Bridget.
Word Sense Disambiguation Ling571 Deep Processing Techniques for NLP February 23, 2011.
CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?
Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures Presenter: Cosmin Adrian Bejan Alexander Budanitsky and.
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
June 19-21, 2006WMS'06, Chania, Crete1 Design and Evaluation of Semantic Similarity Measures for Concepts Stemming from the Same or Different Ontologies.
Generating topic chains and topic views: Experiments using GermaNet Irene Cramer, Marc Finthammer, and Angelika Storrer Faculty.
Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures Written by Alexander Budanitsky Graeme Hirst Retold by.
Word Sense Disambiguation: A Survey Roberto Navigli Department of Computer Science University of Rome "La Sapienza“ ACM Computing Surveys, Vol. 41, No.
1 Complementarity of Lexical and Simple Syntactic Features: The SyntaLex Approach to S ENSEVAL -3 Saif Mohammad Ted Pedersen University of Toronto, Toronto.
Semantic Video Classification Based on Subtitles and Domain Terminologies Polyxeni Katsiouli, Vassileios Tsetsos, Stathes Hadjiefthymiades P ervasive C.
Using Information Content to Evaluate Semantic Similarity in a Taxonomy Presenter: Cosmin Adrian Bejan Philip Resnik Sun Microsystems Laboratories.
C SC 620 Advanced Topics in Natural Language Processing Lecture 10 2/19.
Banking. What does the banker do? Banking Financial Institution permitted to – hold savings deposits – offer loans and mortgages – issue credit rating.
Feature Selection for Automatic Taxonomy Induction The Features Input: Two terms Output: A numeric score, or. Lexical-Syntactic Patterns Co-occurrence.
Word Sense Disambiguation for Automatic Taxonomy Construction from Text-Based Web Corpora 12th International Conference on Web Information System Engineering.
SI485i : NLP Set 10 Lexical Relations slides adapted from Dan Jurafsky and Bill MacCartney.
WORDNET Approach on word sense techniques - AKILAN VELMURUGAN.
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
Personalisation Seminar on Unlocking the Secrets of the Past: Text Mining for Historical Documents Sven Steudter.
Francisco Viveros-Jiménez Alexander Gelbukh Grigori Sidorov.
Jiuling Zhang  Why perform query expansion?  WordNet based Word Sense Disambiguation WordNet Word Sense Disambiguation  Conceptual Query.
Natural Language Processing Group Department of Computer Science University of Sheffield, UK Improving Semi-Supervised Acquisition of Relation Extraction.
Part 3. Knowledge-based Methods for Word Sense Disambiguation.
Word Sense Disambiguation Many words have multiple meanings –E.g, river bank, financial bank Problem: Assign proper sense to each ambiguous word in text.
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Paper Review by Utsav Sinha August, 2015 Part of assignment in CS 671: Natural Language Processing, IIT Kanpur.
WORD SENSE DISAMBIGUATION STUDY ON WORD NET ONTOLOGY Akilan Velmurugan Computer Networks – CS 790G.
SYMPOSIUM ON SEMANTICS IN SYSTEMS FOR TEXT PROCESSING September 22-24, Venice, Italy Combining Knowledge-based Methods and Supervised Learning for.
10/22/2015ACM WIDM'20051 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis Voutsakis.
An Effective Word Sense Disambiguation Model Using Automatic Sense Tagging Based on Dictionary Information Yong-Gu Lee
Efficiently Computed Lexical Chains As an Intermediate Representation for Automatic Text Summarization H.G. Silber and K.F. McCoy University of Delaware.
The More the Better? Assessing the Influence of Wikipedia’s Growth on Semantic Relatedness Measures Torsten Zesch and Iryna Gurevych Ubiquitous Knowledge.
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
Unsupervised Word Sense Disambiguation REU, Summer, 2009.
Semantics-Based News Recommendation with SF-IDF+ International Conference on Web Intelligence, Mining, and Semantics (WIMS 2013) June 13, 2013 Marnix Moerland.
Lecture 21 Computational Lexical Semantics Topics Features in NLTK III Computational Lexical Semantics Semantic Web USCReadings: NLTK book Chapter 10 Text.
Compact Encodings for All Local Path Information in Web Taxonomies with Application to WordNet Svetlana Strunjaš-Yoshikawa Joint with Fred Annexstein and.
1 Measuring the Semantic Similarity of Texts Author : Courtney Corley and Rada Mihalcea Source : ACL-2005 Reporter : Yong-Xiang Chen.
1 Gloss-based Semantic Similarity Metrics for Predominant Sense Acquisition Ryu Iida Nara Institute of Science and Technology Diana McCarthy and Rob Koeling.
Semantics-Based News Recommendation International Conference on Web Intelligence, Mining, and Semantics (WIMS 2012) June 14, 2012 Michel Capelle
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Word Sense and Subjectivity (Coling/ACL 2006) Janyce Wiebe Rada Mihalcea University of Pittsburgh University of North Texas Acknowledgements: This slide.
Semantic Evaluation of Machine Translation Billy Wong, City University of Hong Kong 21 st May 2010.
Lexical Semantics and Word Senses Hongning Wang
Chapter 5. Financial Services Borrowing Short Term Regular Savings Money Market Accounts Long Term Certificates of Deposit U.S. Savings Bonds Investment.
Automatic Writing Evaluation
Exploring and Navigating: Tools for GermaNet
Bridget McInnes Ted Pedersen Serguei Pakhomov
OMIOTIS: A Thesaurus-based Measure of Semantic Relatedness
Statistical NLP: Lecture 9
WordNet WordNet, WSD.
A method for WSD on Unrestricted Text
C SC 620 Advanced Topics in Natural Language Processing
Giannis Varelas Epimenidis Voutsakis Paraskevi Raftopoulou
Unsupervised Word Sense Disambiguation Using Lesk algorithm
Statistical NLP : Lecture 9 Word Sense Disambiguation
Presentation transcript:

Using Semantic Relatedness for Word Sense Disambiguation Siddharth Patwardhan patw0006@d.umn.edu 10/24/2002

The Lesk1 Algorithm Two Hypotheses: The intended sense of the target word in a given context is semantically related to other word senses in the context. Semantically related words have greater number of overlaps of their dictionary definitions. 1[Lesk 1986]

The rate of interest at this bank is high. An Example The rate of interest at this bank is high. interest (involvement) interest (interestingness) interest (sake) interest (charge for loan) interest (pastime) interest (stake) bank (financial institution) bank (river) bank (stock) bank (building) bank (arrangement) bank (container) rate (charge per unit) rate (change with time) rate (pace) rate: amount of a charge or payment relative to some basis; "a 10-minute phone call at that rate would cost $5“ interest: a fixed charge for borrowing money; usually a percentage of the amount borrowed; "how much interest do you pay on your mortgage?“ bank: a financial institution that accepts deposits and channels the money into lending activities; "he cashed a check at the bank"; "that bank holds the mortgage on my home"

Adapting Lesk to WordNet2 Banerjee and Pedersen [2001] adapt the Lesk algorithm to use the rich source of knowledge in WordNet. hypernym: gloss hypernym: gloss hypernym: gloss bank: gloss interest: gloss rate: gloss hyponym: gloss hyponym: gloss hyponym: gloss 2 [Fellbaum 1998]

Semantic Relatedness by counting edges. Rada, et al [1989] introduce a notion of relatedness between words by counting the number of edges between the them in a “broader-than” hierarchy (MeSH: a hierarchy of medical terms). Leacock and Chodorow [1998] use a similar approach to measure semantic relatedness between concepts by finding the length of the shortest path between the two concepts in the is-a hierarchy of WordNet. They scale this value by the maximum depth of the taxonomy and get a formula for relatedness: relatedness = -log(pathLength/(2·maxDepth))

Information Content Introduced by Resnik [1995]. Indicates the specificity or generality of a concept. More specific concepts have higher information content, while more general concepts have less information content. For example concepts like dime, clinker and hayfork are rather specific or topical, would be localized in a discourse and would greatly restrict the choice of concepts that can be used around them (in the context). Computed from large (ideally sense-tagged) corpora. IC(concept) = -log(Probability of occurrence of the concept in a large corpus)

Information Content *Root* *Root* +1 +1 Motor vehicle Motor vehicle +1 car car +1 +1 cab cab +1 minicab minicab +1

Measures of Semantic Relatedness based on a concept hierarchy LOWEST COMMON SUBSUMER CONCEPT c2 CONCEPT c1

Measures of Semantic Relatedness Resnik[1995] relatedness = IC(lcs) Jiang Conrath[1997] distance = 2 x IC(lcs) – (IC(c1) + IC(c2)) Lin[1998] relatedness = 2 x IC(lcs) IC(c1) + IC(c2)

Measures of Semantic Relatedness The Hirst-St.Onge[1998] measure c2 Extra Strong Relation – between two occurrences of the same word. (2) Strong Relation – three rules. Synonyms Horizontal Relation Compound – Word Relation (3) Medium Strong Relation – if there exists an allowable path between the two concepts. Is a kind of (UPWARD) Opposite (HORIZONTAL) c4 c1 Has part (DOWNWARD) c3

Word Sense Disambiguation using Measures of Semantic Relatedness CONTEXT Word1 Target Word2 W11 W12 T1 T2 W21 W22 SENSES LOCAL APPROACH W11 W12 W21 W22 W11 W12 W21 W22 S11 S21 T1 S12 T2 S22 S13 S23 S24 S14 Score(T1) = S11 + S12 + S13 + S14 Score(T2) = S21 + S22 + S23 + S24

Word Sense Disambiguation using Measures of Semantic Relatedness GLOBAL APPROACH a b W11 T1 W21 S1 = a + b + c S5 W11 T2 W21 c T2 W12 T1 W22 S6 W12 W22 S2 W11 T1 W22 W11 T2 W22 S3 S7 W12 T1 W21 W12 T2 S8 W21 S4 Combination with the highest score is selected.

Some Results The experiments were performed on the noun instances of the SENSEVAL-2 data (1723 instances). A context window size of 3 with the local scoring approach was considered for the experiments. Information content was calculated using the SemCor semantically tagged corpus and from the Brown corpus. SemCor Brown Resnik 0.295 0.290 Jiang-Conrath 0.330 0.331 Lin 0.328 0.363 Leacock-Chodorow 0.305 Hirst-St.Onge 0.316 Adapted Lesk 0.391

References [Lesk 1986] M. Lesk. Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone In Proceedings of International Conference on Machine Learning, Madison, Wisconsin, August 1998. [Budanitsky Hirst 2001] A. Budanitsky and G. Hirst. Semantic Distance in WordNet: An experimental application-oriented evaluation of five measures. In Workshop on WordNet and Other Lexical Resources, Second meeting of the North American Chapter of the Association for Computational Linguistics, Pittsburgh, June 2001. [Banerjee Pedersen 2002] S. Banerjee and T. Pedersen. An adapted Lesk algorithm for word sense disambiguation using WordNet. In Proceedings of the Third International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Feb 2002. [Resnik 1995] P. Resnik. Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the 14th International Joint Conference on Artificial Intelligence, Montreal, August 1995.

References [Jiang Conrath 1997] J. Jiang and D. Conrath. Semantic similarity based on corpus statistics and lexical taxonomy. In Proceedings on International Conference on Research in Computational Linguistics, Taiwan, 1997. [Lin 1998] D. Lin. An information-theoretic definition of similarity. In Proceedings of International Conference on Machine Learning, Madison, Wisconsin, August 1998. [Leacock Chodorow 1998] C. Leacock and M. Chodorow. Combining local context and WordNet similarity for word sense identification. In Fellbaum, pp. 265 – 283, 1998. [Hirst St-Onge 1998] G. Hirst and D. St-Onge. Lexical chain as representations of context for the detection and correction of malapropisms. In Fellbaum, pp. 305 – 332, 1998. [Fellbaum 1998] C. Fellbaum, editor. WordNet: An electronic lexical database. MIT Press, 1998. [Rada et al, 1989] R. Rada, H. Mili, E. Bicknell and M. Blettner. Development and application of a metric on semantic nets. IEEE Transactions on Systems, Man and Cybernetics, 19(1):17-30, February 1989.