CSCI 5832 Natural Language Processing

Slides:



Advertisements
Similar presentations
Discourse Structure and Discourse Coherence
Advertisements

Referring Expressions: Definition Referring expressions are words or phrases, the semantic interpretation of which is a discourse entity (also called referent)
Specialized models and ranking for coreference resolution Pascal Denis ALPAGE Project Team INRIA Rocquencourt F Le Chesnay, France Jason Baldridge.
Processing of large document collections Part 6 (Text summarization: discourse- based approaches) Helena Ahonen-Myka Spring 2006.
1 Discourse, coherence and anaphora resolution Lecture 16.
Discourse Martin Hassel KTH NADA Royal Institute of Technology Stockholm
Chapter 18: Discourse Tianjun Fu Ling538 Presentation Nov 30th, 2006.
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
For Friday No reading Homework –Chapter 23, exercises 1, 13, 14, 19 –Not as bad as it sounds –Do them IN ORDER – do not read ahead here.
Generation of Referring Expressions: Modeling Partner Effects Surabhi Gupta Advisor: Amanda Stent Department of Computer Science.
1 Words and the Lexicon September 10th 2009 Lecture #3.
CS 4705 Algorithms for Reference Resolution. Anaphora resolution Finding in a text all the referring expressions that have one and the same denotation.
CS 4705 Lecture 21 Algorithms for Reference Resolution.
Natural Language Generation Martin Hassel KTH CSC Royal Institute of Technology Stockholm
Introduction to CL Session 1: 7/08/2011. What is computational linguistics? Processing natural language text by computers  for practical applications.
Reference and inference By: Esra’a Rawah
The classification problem (Recap from LING570) LING 572 Fei Xia, Dan Jinguji Week 1: 1/10/08 1.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Differential effects of constraints in the processing of Russian cataphora Kazanina and Phillips 2010.
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
On the Issue of Combining Anaphoricity Determination and Antecedent Identification in Anaphora Resolution Ryu Iida, Kentaro Inui, Yuji Matsumoto Nara Institute.
Opinion Holders in Opinion Text from Online Newspapers Youngho Kim, Yuchul Jung and Sung-Hyon Myaeng Reporter: Chia-Ying Lee Advisor: Prof. Hsin-Hsi Chen.
Bootstrapping for Text Learning Tasks Ramya Nagarajan AIML Seminar March 6, 2001.
CSA2050 Introduction to Computational Linguistics Lecture 1 Overview.
Artificial Intelligence: Natural Language
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Friday Finish chapter 24 No written homework.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Recognizing Stances in Online Debates Unsupervised opinion analysis method for debate-side classification. Mine the web to learn associations that are.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
4. Relationship Extraction Part 4 of Information Extraction Sunita Sarawagi 9/7/2012CS 652, Peter Lindes1.
SYNTAX.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Discourse: Structure and Coherence Kathy McKeown Thanks to Dan Jurafsky, Diane Litman, Andy Kehler, Jim Martin.
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Reference. “John went to the candy store to shop for chocolate.” “He bought some.”
Natural Language Processing Vasile Rus
gesture features for coreference
اسم المقرر استاذ المقرر
Discourse Analysis Natural Language Understanding basis
MIS2502: Data Analytics Advanced Analytics - Introduction
Lecture 3: Functional Phrases
Simone Paolo Ponzetto University of Heidelberg Massimo Poesio
Niu Kun Discrete Mathematics Chapter 1 The Foundations: Logic and Proof, Sets, and Functions Niu Kun 离散数学.
SAT Notes: Please get out your notebook and turn to the writing section. We are taking notes today.
张昊.
Improving a Pipeline Architecture for Shallow Discourse Parsing
CSCI 5832 Natural Language Processing
CSCE 590 Web Scraping – Information Retrieval
Referring Expressions: Definition
CSCI 5832 Natural Language Processing
Probabilistic and Lexicalized Parsing
Social Knowledge Mining
Clustering Algorithms for Noun Phrase Coreference Resolution
Lecture 17: Discourse: Anaphora Resolution and Coherence
Algorithms for Reference Resolution
CSCI 5832 Natural Language Processing
CSCI 5832 Natural Language Processing
CSCI 5832 Natural Language Processing
A Machine Learning Approach to Coreference Resolution of Noun Phrases
CSCI 5832 Natural Language Processing
CSCI 5832 Natural Language Processing
CSCI 5832 Natural Language Processing
CSCI 5832 Natural Language Processing
CS4705 Natural Language Processing
Pragmatics: Reference and inference
Information Retrieval
Stance Classification of Ideological Debates
Presentation transcript:

CSCI 5832 Natural Language Processing Jim Martin Lecture 23 11/28/2018

Today: 4/15 Discourse structure Referring expressions Co-reference resolution 11/28/2018

What Makes a Discourse Coherent? The reason is that these utterances, when juxtaposed, will not exhibit coherence. Almost certainly not. Do you have a discourse? Assume that you have collected an arbitrary set of well-formed and independently interpretable utterances, for instance, by randomly selecting one sentence from each of the previous chapters of this book. 11/28/2018

Better? Assume that you have collected an arbitrary set of well-formed and independently interpretable utterances, for instance, by randomly selecting one sentence from each of the previous chapters of this book. Do you have a discourse? Almost certainly not. The reason is that these utterances, when juxtaposed, will not exhibit coherence. 11/28/2018

What makes a text coherent? Appropriate use of coherence relations between subparts of the discourse -- rhetorical structure Appropriate sequencing of subparts of the discourse -- discourse/topic structure Appropriate use of referring expressions 11/28/2018

Referring Expressions Referring expressions provide a kind of glue that makes texts cohere. 11/28/2018

Referring Expressions: Definition Referring expressions are words or phrases, the semantic interpretation of which is a discourse entity (also called referent) Discourse entities are semantic objects and they can have multiple syntactic realizations within a text 11/28/2018

NY Times Example A college student accused of faking her own kidnapping last month was charged Wednesday with lying to police in what they suggested was a desperate attempt to get her boyfriend's attention. Audrey Seiler, a 20-year-old sophomore at the University of Wisconsin, was charged with two misdemeanor counts of obstructing officers. Each charge carries up to nine months in jail and a $10,000 fine. Seiler disappeared from her off-campus apartment March 27 without her coat or purse. She was discovered curled in a fetal position in a marsh four days later, and told police that a man had abducted her at knifepoint. But police concluded Seiler made up the story after obtaining a store videotape that showed her buying the knife, duct tape, rope and cold medicine she claimed her abductor used to restrain her. Seiler confessed after she was confronted with the tape, according to authorities. 11/28/2018

Referring Expressions: Example A pretty woman entered the restaurant. She sat at the table next to mine and only then I recognized her. This was Amy Garcia, my next door neighbor from 10 years ago. The woman has totally changed! Amy was at the time shy… 11/28/2018

Pronouns vs. Full NPs A pretty woman entered the restaurant. She sat at the table next to mine and only then I recognized her. This was Amy Garcia, my next door neighbor from 10 years ago. The woman has totally changed! Amy was at the time shy… 11/28/2018

Definite vs. Indefinite NPs A pretty woman entered the restaurant. She sat at the table next to mine and only then I recognized her. This was Amy Garcia, my next door neighbor from 10 years ago. The woman has totally changed! Amy was at the time shy… 11/28/2018

Common Noun vs. Proper Noun A pretty woman entered the restaurant. She sat at the table next to mine and only then I recognized her. This was Amy Garcia, my next door neighbor from 10 years ago. The woman has totally changed! Amy was at the time shy… 11/28/2018

Modified vs. Bare head NP A pretty woman entered the restaurant. She sat at the table next to mine and only then I recognized her. This was Amy Garcia, my next door neighbor from 10 years ago. The woman has totally changed! Amy was at the time shy… 11/28/2018

Premodified vs. postmodified A pretty woman entered the restaurant. She sat at the table next to mine and only then I recognized her. This was Amy Garcia, my next door neighbor from 10 years ago. The woman has totally changed! Amy was at the time shy… 11/28/2018

More NP types Inferrables Discontinuous sets Sally bought a used car. The tires need to be replaced. Discontinuous sets John has known Bill for many years now. They often go hiking together. 11/28/2018

Break Quiz Thursday Then Q/A and Summarization MT Read Chapter 23 for Tuesday MT Read Chapter 25 for the following Thursday 11/28/2018

Quiz Areas From 17 From 18 From 20 Basics of FOL Rule to rule approach Syntax and semantics From 18 Rule to rule approach Semantic attachments Lambdas Problems with quantifiers From 20 Different approaches to WSD (20.1 to 20.5) 11/28/2018

Quiz Areas From 21 From 22 21.3 to 21.8 All Focus on Hobbs and ML approaches in 21.6.1 and 21.6.3 for pronominal resolution From 22 All 11/28/2018

Quiz Don’t forget all the earlier stuff. In particular, syntax/parsing plays a big role in Semantic analysis Co-reference And it plays a supporting role in IE and WSD Provides features 11/28/2018

Anaphora resolution Finding in a text all the referring expressions that have one and the same denotation Pronominal anaphora resolution Anaphora resolution between named entities Full noun phrase anaphora resolution Zero anaphora detection/resolution 11/28/2018

Automatic Anaphora Resolution Resolving the referents of pronouns Hand built rule systems vs. trained systems Statistical systems vs rules Systems that require a full syntactic parse vs. systems that assume a weaker chunking segmentation Systems that deal with text vs. systems that deal with spoken discourse 11/28/2018

Basic ML Approach for Pronominal Reference Acquire an annotated corpus Pronoun/antecedent pairs constitute + examples Pronoun/non-referent pairs are - examples Lots more - data than + data Extract features from the pairs Train a binary classifier SVMs or MaxEnt 11/28/2018

Basic Features Strict grammatical number match (1/0) Compatible match Not the number of the elements, just whether they match Compatible match Same 2 for gender Sentence distance Distance from pronoun sentence to the referent Hobbs distance Grammatical role information Form Form of the antecedent (full NP, etc) 11/28/2018

Classification Proceed through a text linearly. When a pronoun is encountered look backward and apply the classifier to each potential antecedent In some order Need a way to break ties Result is a set of referent chains 11/28/2018

Full Co-reference Analysis Locating all the entity mentions in a text and then clustering them into groups that represent the actual entities. 11/28/2018

NY Times Example A college student accused of faking her own kidnapping last month was charged Wednesday with lying to police in what they suggested was a desperate attempt to get her boyfriend's attention. Audrey Seiler, a 20-year-old sophomore at the University of Wisconsin, was charged with two misdemeanor counts of obstructing officers. Each charge carries up to nine months in jail and a $10,000 fine. Seiler disappeared from her off-campus apartment March 27 without her coat or purse. She was discovered curled in a fetal position in a marsh four days later, and told police that a man had abducted her at knifepoint. But police concluded Seiler made up the story after obtaining a store videotape that showed her buying the knife, duct tape, rope and cold medicine she claimed her abductor used to restrain her. Seiler confessed after she was confronted with the tape, according to authorities. 11/28/2018

NY Times Example A college student accused of faking her own kidnapping last month was charged Wednesday with lying to police in what they suggested was a desperate attempt to get her boyfriend's attention. Audrey Seiler, a 20-year-old sophomore at the University of Wisconsin, was charged with two misdemeanor counts of obstructing officers. Each charge carries up to nine months in jail and a $10,000 fine. Seiler disappeared from her off-campus apartment March 27 without her coat or purse. She was discovered curled in a fetal position in a marsh four days later, and told police that a man had abducted her at knifepoint. But police concluded Seiler made up the story after obtaining a store videotape that showed her buying the knife, duct tape, rope and cold medicine she claimed her abductor used to restrain her. Seiler confessed after she was confronted with the tape, according to authorities. 11/28/2018

Example There are 33 entity mentions 16 actual entities Seiler accounts for 12 of the 33 mentions, with the police in second place at 6 There are 11 entities with 1 mention each 11/28/2018

2 Basic ML Approaches Scan the text from the top. When a new entity mention is encountered apply a binary classifier to each previous mention. Result is a set of chains of references Extract all entity mentions and represent them as feature vectors. Apply an unsupervised partition-based clustering technique to all the mentions. Result is a set of (soft) clusters that partition the mentions. 11/28/2018

Cross-Document Co-Reference Take a set of documents (possibly from the Web), extract all the named referents and resolve them into unique entities. Ie. Google “Michael Jordan” and resolve the results into classes based on the real entities. 11/28/2018

Easy Money 11/28/2018

Easy Money 100,000 files provided for development Set of target names of interest Ground truth provided with a F-measure based scoring metric Precision/Recall computed on a pairwise document basis. Assumes that target names are unique within each document (ie. All the Michael Jordan mentions in a single document are the same person). 11/28/2018