SI485i : NLP Set 14 Reference Resolution. 2 Kraken, also called the Crab-fish, which is not that huge, for heads and tails counted, he is no larger than.

Slides:



Advertisements
Similar presentations
Referring Expressions: Definition Referring expressions are words or phrases, the semantic interpretation of which is a discourse entity (also called referent)
Advertisements

Pasco-Hernando Community College Tutorial Series.
Specialized models and ranking for coreference resolution Pascal Denis ALPAGE Project Team INRIA Rocquencourt F Le Chesnay, France Jason Baldridge.
A Machine Learning Approach to Coreference Resolution of Noun Phrases By W.M.Soon, H.T.Ng, D.C.Y.Lim Presented by Iman Sen.
Pronouns Brenham Writing Room Created by D. Herring.
Long Distance Dependencies (Filler-Gap Constructions) and Relative Clauses October 10, : Grammars and Lexicons Lori Levin (Examples from Kroeger.
1 Discourse, coherence and anaphora resolution Lecture 16.
Chapter 18: Discourse Tianjun Fu Ling538 Presentation Nov 30th, 2006.
Easy-First Coreference Resolution Veselin Stoyanov and Jason Eisner Johns Hopkins University.
PRONOUNS English 3 CP.
Reference Resolution #1 CSCI-GA.2590 Ralph Grishman NYU.
CS 4705 Algorithms for Reference Resolution. Anaphora resolution Finding in a text all the referring expressions that have one and the same denotation.
CS 4705 Lecture 21 Algorithms for Reference Resolution.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 2.
Supervised models for coreference resolution Altaf Rahman and Vincent Ng Human Language Technology Research Institute University of Texas at Dallas 1.
Improving Machine Learning Approaches to Coreference Resolution Vincent Ng and Claire Cardie Cornell Univ. ACL 2002 slides prepared by Ralph Grishman.
Reference and inference By: Esra’a Rawah
Anaphora Resolution Sanghoon Kwak Takahiro Aoyama.
Designed by Elisa Paramore
{ Pronouns My father shot the elephant wearing his pajamas.
Pronouns – Part One Grade Eight.
SI485i : NLP Set 12 Features and Prediction. What is NLP, really? Many of our tasks boil down to finding intelligent features of language. We do lots.
SI485i : NLP Set 9 Advanced PCFGs Some slides from Chris Manning.
Pronouns!. What’s a Pronoun? Have you ever wondered where pronouns came from in the first place? Probably not! We seem to take these little words for.
Reference Resolution CSCI-GA.2590 Ralph Grishman NYU.
A Light-weight Approach to Coreference Resolution for Named Entities in Text Marin Dimitrov Ontotext Lab, Sirma AI Kalina Bontcheva, Hamish Cunningham,
Pronoun Usage 4. Pronouns in comparisons Sometimes a comparison is made by using a clause that begins with than or as. Examples Fred is better at chess.
PRONOUNS!! A pronoun is a word used in place of a noun or another pronoun.
Andreea Bodnari, 1 Peter Szolovits, 1 Ozlem Uzuner 2 1 MIT, CSAIL, Cambridge, MA, USA 2 Department of Information Studies, University at Albany SUNY, Albany,
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Tree Kernels for Parsing: (Collins & Duffy, 2001) Advanced Statistical Methods in NLP Ling 572 February 28, 2012.
ASPECTS OF LINGUISTIC COMPETENCE 4 SEPT 09, 2013 – DAY 6 Brain & Language LING NSCI Harry Howard Tulane University.
Syntax I: Constituents and Structure Gareth Price – Duke University.
PRONOUNS English 3 CP. Pronouns  Pronouns are words that take the place of nouns, groups of words acting as nouns, or other pronouns. Pronouns are necessary.
Illinois-Coref: The UI System in the CoNLL-2012 Shared Task Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Mark Sammons, and Dan Roth Supported by ARL,
A multiple knowledge source algorithm for anaphora resolution Allaoua Refoufi Computer Science Department University of Setif, Setif 19000, Algeria .
SYNTAX Lecture -1 SMRITI SINGH.
Pronouns Pronoun/Antecedents Who vs. Whom Pronouns as Compound Elements Shifts in Person.
Coreference Resolution
NLP. Introduction to NLP Is language more than just a “bag of words”? Grammatical rules apply to categories and groups of words, not individual words.
PERSONAL PRONOUNS, POSSESSIVE PRONOUNS AND POSSESSIVE ADJECTIVES.
A Cross-Lingual ILP Solution to Zero Anaphora Resolution Ryu Iida & Massimo Poesio (ACL-HLT 2011)
Adjective: identify pronouns in sentences and distinguish between singular and plural pronouns. What is a Pronoun? Pronoun.
Reference Resolution. Sue bought a cup of coffee and a donut from Jane. She met John as she left. He looked at her enviously as she drank the coffee.
Pronouns. What is a Pronoun? A pronoun replaces a noun so you don’t have to keep repeating it. –Definite pronouns: I, me, she, he, they, we, us, etc.
Inference Protocols for Coreference Resolution Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Nick Rizzolo, Mark Sammons, and Dan Roth This research.
Evaluation issues in anaphora resolution and beyond Ruslan Mitkov University of Wolverhampton Faro, 27 June 2002.
Sentences Ed McCorduck English 402--Grammar SUNY Cortland
Measuring the Influence of Errors Induced by the Presence of Dialogs in Reference Clustering of Narrative Text Alaukik Aggarwal, Department of Computer.
Sight Words.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Spring 2006-Lecture 2.
Support Vector Machines and Kernel Methods for Co-Reference Resolution 2007 Summer Workshop on Human Language Technology Center for Language and Speech.
11 Project, Part 3. Outline Basics of supervised learning using Naïve Bayes (using a simpler example) Features for the project 2.
Pronouns and Articles Unit 5 Grammar Forms & Functions 3.
Dependency Parsing Niranjan Balasubramanian March 24 th 2016 Credits: Many slides from: Michael Collins, Mausam, Chris Manning, COLNG 2014 Dependency Parsing.
PRONOUNS Pronouns are words which stand in place of nouns. There are many different kinds of pronouns, used in different ways and for different purposes.
Pronouns.
Simone Paolo Ponzetto University of Heidelberg Massimo Poesio
NYU Coreference CSCI-GA.2591 Ralph Grishman.
Pronoun Notes.
Improving Written Communication: “To Do” Verb Phrase Problems
Referring Expressions: Definition
Pronouns – Part One Grade Eight.
Clustering Algorithms for Noun Phrase Coreference Resolution
Algorithms for Reference Resolution
Brenham Writing Room Created by D. Herring
A Machine Learning Approach to Coreference Resolution of Noun Phrases
Pronouns.
A Machine Learning Approach to Coreference Resolution of Noun Phrases
Structure of a Lexicon Debasri Chakrabarti 13-May-19.
Presentation transcript:

SI485i : NLP Set 14 Reference Resolution

2 Kraken, also called the Crab-fish, which is not that huge, for heads and tails counted, he is no larger than our Öland is wide [i.e., less than 16 km]... He stays at the sea floor, constantly surrounded by innumerable small fishes, who serve as his food and are fed by him in return: for his meal, (if I remember correctly what E. Pontoppidan writes,) lasts no longer than three months, and another three are then needed to digest it.

Why Reference Resolution? Q: What is the second-oldest service academy in the US? 3 The United States Naval Academy (also known as USNA, Annapolis, or Navy) is a four-year coeducational federal service academy located in Annapolis, Maryland, United States. Established in 1845 under Secretary of the Navy George Bancroft, it is the second-oldest of the United States' five service academies. “it” United States Naval Academy Coeducational federal service academy Annapolis, Maryland 1845 George Bancroft ??

Reference Resolution, Defined Identify all noun phrases that refer to the same real- world entity. Alternate: ground each noun phrase (entity mention) to its real-world referent Input: noun phrases Output: sets of noun phrases 4

Terminology Coreference: two noun phrases both refer to the same real-world entity Anaphora: a noun phrase (the anaphor) refers to a previously mention noun phrase (the antecedent) Key: interpretation of the anaphor is dependent on interpreting the antecedent John won the race. He is happy. Not all anaphora is coreference, but in general, most are. 5 We went to see a movie yesterday. The tickets were expensive!

Anaphora vs Coreference Anaphora Coreference 6 Slide from Chris Manning

Terminology Entity: the real-world concept Entity mention: the noun phrase Antecedent: the previous NP to which our current NP refers 7 Text contains entity mentions, each of which belongs to an entity. Some entity mentions have antecedents in the text.

Hobbs Algorithm (for pronouns) 1.Start at the pronoun NP 2.Climb parse tree until NP or S. Call this node X, and path p that you climbed up. 3.Traverse all branches below X to the left of path p, breadth first. Propose any seen NP/S as the antecedent. 4.If node X is the highest S node, traverse previous sentences left-to- right breadth-first…most recent sentences first. When an NP is seen, propose as antecedent. 5.If node X not highest S node, go up tree to first NP or S. Call this the new X node. 6.If X is an NP node and if path p did not pass through a Nominal node that X immediately dominates, propose X. 7.Traverse breadth-first all branches below X to the left of path p. Propose any NP as the antecedent. 8.If X is an S node, traverse all branches of X to the right of path p. But don’t go below any NP or S encountered. Propose any seen NP. 9.Go to step 4. 8

Supervised Coreference Use labeled data, and learn to perform coreference! For each entity mention, find the best previous antecedent. 9 The man jumped the fence. It scraped his leg as he went over. P(coreferent | “the man”, “it”, sentence)

Learning in Coreference Create classifiers over pairs of NPs 1.Step through each NP in the text, in order 2.Choose a preceding NP as antecedent, or “new” 3.Start with the closest NP to the left Compute P(coreferent | NP, NPleft, document) Determine if probability over a threshold OR compute probability for all NPs to the left, choose best Many, many variants of this However, the vast majority make these pairwise decisions 10

Features for Pronominal Coreference Number Singular (he/she/it…) vs Plural (we/us/they…) Gender He and she are obvious. John is male. Pat is ??? Person First/Second/Third … he/she/they refer to a third person name Syntax English contains certain hard (and soft!) constraints “Sally gave her a book” – (her cannot be Sally) “Sally gave herself a treat” – (herself must be Sally) 11

Features for Pronominal Coreference Sentence distance between mentions Hobbs distance! Grammatical role of potential antecedent Is the antecedent the subject, object, or PP ? Linguistic form Is the antecedent a definite noun (the car), an indefinite noun (a car), a pronoun (it), or a proper name (John)? 12

Features for Reference Resolution Features can work on any mentions, not pronouns! Word edit distance How similar are the two mentions? Word subphrase Does one mention contain a subset of the other’s words? “Tom Hanks” and “Hanks” Named entity type match Are both mentions labeled as PERSON? Etc. 13 I saw a 2013 Audi A3 yesterday. The red car zoomed past me.

It’s sometimes very difficult Common nouns can differ in number “A patrol moved down our street. The soldiers saw us.” Common nouns can refer to proper nouns “Barack Obama … the president” Split antecedence! “Bilbo found Gollum. They traveled to Mordor.” 14

Exercise! “What a pity that Bilbo did not stab that vile creature, when he had a chance!”, Frodo said. “Pity? It was Pity that stayed his hand on Gollum. Pity, and Mercy: not to strike without need. And he has been well rewarded, Frodo. Be sure that he took so little hurt from the evil, and escaped in the end, because he began his ownership of the Ring so. With Pity.” 15 Text very slightly altered by Dr. Chambers 1.Identify the entities in this passage, and cluster the entity mentions. Write down sets of entity mentions. Some sets will only have one mention! 2.How do you know what other mention Gollum goes with?

How to Evaluate? We have labeled data (pairs of NPs linked to each other) B-CUBED algorithm (Bagga and Baldwin, 1998) Precision: % of NPs in your guessed entity that are in the same gold entity Recall: % of NPs in the gold entity that are in your guessed entity You sum over all your guessed entities (sets of NPs) to get the overall precision and recall. Greedy matching…choose best matching gold entities 16

B-CUBED 17 From Amigo et al. 2009

Alternative Approaches Multi-pass systems. Run through all entity mentions, and only resolve the ones you are most certain about. E.g., “John” and “John” and “John” Run through again, resolving only the ones you are pretty certain about. E.g., “John Smith” and “Smith” Run through again, resolving only those you are reasonably sure of. Etc. 18

Example Passes 19 Lee et al. 2012

Multi-pass system This is not a learning system! It is a series of rules! It is contrary to much of NLP these days in that it outperforms learning approaches Why? It is very flexible, and the rules are gradually applied. Also…coreference is a somewhat unique task compared to other things we’ve covered in this course. 20