1 CS3730/ISP3120 Discourse Processing and Pragmatics Lecture Notes Jan 10, 12.

Slides:



Advertisements
Similar presentations
Point of View Dr. Karen Petit.
Advertisements

Present Perfect Dragana Filipovic.
Referring Expressions: Definition Referring expressions are words or phrases, the semantic interpretation of which is a discourse entity (also called referent)
Discourse Analysis David M. Cassel Natural Language Processing Villanova University April 21st, 2005 David M. Cassel Natural Language Processing Villanova.
Please check, just in case…. Announcements Next week is our last “regular” class session – guest speaker. Make an appointment to see me about the final.
Welcome back! Term 2 Week 1.
Introduction to phrases & clauses
1 Discourse, coherence and anaphora resolution Lecture 16.
Discourse Martin Hassel KTH NADA Royal Institute of Technology Stockholm
1/18 Dialogue systems R Mitkov (ed.) The Oxford Handbook of Computational Linguistics, Oxford (2004): OUP – Chapters 6 ( “ Discourse ” Allan Ramsay), 7.
Chapter 18: Discourse Tianjun Fu Ling538 Presentation Nov 30th, 2006.
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
Parts of Speech ITSW 1410 Presentation Media Software Instructor: Glenda H. Easter.
Reference Resolution #1 CSCI-GA.2590 Ralph Grishman NYU.
 Christel Kemke 2007/08 COMP 4060 Natural Language Processing Discourse and Dialogue.
Two e-Learning elective seminars in Novi Sad Putnik Z., Komlenov Ž., Budimac Z. DMI, Faculty of Science University of Novi Sad.
CS 4705 Algorithms for Reference Resolution. Anaphora resolution Finding in a text all the referring expressions that have one and the same denotation.
Welcome to Turnitin.com’s Peer Review! This tour will take you through the basics of Turnitin.com’s Peer Review. The goal of this tour is to give you.
Corpus 06 Discourse Characteristics. Reasons why discourse studies are not corpus-based: 1. Many discourse features cannot be identified automatically.
CS 4705 Lecture 21 Algorithms for Reference Resolution.
Natural Language Generation Martin Hassel KTH CSC Royal Institute of Technology Stockholm
Basic Scientific Writing in English Lecture 3 Professor Ralph Kirby Faculty of Life Sciences Extension 7323 Room B322.
CS 330 Programming Languages 09 / 16 / 2008 Instructor: Michael Eckmann.
1 Pragmatics: Discourse Analysis J&M’s Chapter 21.
Reference and inference By: Esra’a Rawah
Pragmatics I: Reference resolution Ling 571 Fei Xia Week 7: 11/8/05.
Pronouns – Part One Grade Eight.
Research Methods for Computer Science CSCI 6620 Spring 2014 Dr. Pettey CSCI 6620 Spring 2014 Dr. Pettey.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Strategies for Success
9/8/20151 Natural Language Processing Lecture Notes 1.
1 Computational Discourse Chapter 21 November 2012 Lecture #15.
1 Natural Language Processing Computational Discourse.
Fall 2005 Lecture Notes #7 EECS 595 / LING 541 / SI 661 Natural Language Processing.
Essay Pointers. Essay Grading Rubric  Composition (25%)  Subject Knowledge (25%)  Contribution (25%)  Reference and Citation (25%)  Composition (25%)
Pronouns Cano. A pronoun replaces a noun. We call the word being replaced by the pronoun the antecedent. In the following sentence, keys is the antecedent.
Incident Threading for News Passages (CIKM 09) Speaker: Yi-lin,Hsu Advisor: Dr. Koh, Jia-ling. Date:2010/06/14.
The Expository Essay An Overview
Discourse Read J & M Chapter More than One Sentence at a Time The alphas have a long-standing hatred of the betas. Their leaders have decided that.
Task Writing an advertisement listening for statistics and descriptions When listening for statistics, you may hear: fractions e.g. ½( a/one half),
Parts of Speech Notes. Part of Speech: Nouns  A naming word  Names a person, place, thing, idea, living creature, quality, or idea Examples: cowboy,
1 Special Electives of Comp.Linguistics: Processing Anaphoric Expressions Eleni Miltsakaki AUTH Fall 2005-Lecture 2.
Planning an Applied Research Project Chapter 3 – Conducting a Literature Review © 2014 by John Wiley & Sons, Inc. All rights reserved.
Levels of Language 6 Levels of Language. Levels of Language Aspect of language are often referred to as 'language levels'. To look carefully at language.
인공지능 연구실 황명진 FSNLP Introduction. 2 The beginning Linguistic science 의 4 부분 –Cognitive side of how human acquire, produce, and understand.
Reference Resolution. Sue bought a cup of coffee and a donut from Jane. She met John as she left. He looked at her enviously as she drank the coffee.
Coherence and Coreference Introduction to Discourse and Dialogue CS 359 October 2, 2001.
1 Natural Language Processing Chapter Outline Reference –Kinds of reference phenomena –Constraints on co-reference –Preferences for co-reference.
Grade Book Database Presentation Jeanne Winstead CINS 137.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
For Friday Finish chapter 24 No written homework.
ACE TESOL Diploma Program – London Language Institute OBJECTIVES You will understand: 1. The terminology and concepts of semantics, pragmatics and discourse.
UWMS Data Mining Workshop Content Analysis: Automated Summarizing Prof. Marti Hearst SIMS 202, Lecture 16.
1 Running Experiments for Your Term Projects Dana S. Nau CMSC 722, AI Planning University of Maryland Lecture slides for Automated Planning: Theory and.
The Research Process: Finding, Annotating, and Organizing the Literature Created by Dr. Mary Clai Jones and Amy Miller November 2015 Created by Dr. Mary.
COP4020 INTRODUCTION FALL COURSE DESCRIPTION Programming Languages introduces the fundamentals of the design and implementation of programming languages.
GRAMMAR AND PUNCTUATION REVISE AND REVIEW WORD CLASSES.
Fall 2004 Lecture Notes #8 EECS 595 / LING 541 / SI 661 Natural Language Processing.
PRAGMATICS. SCHEDULE May 14: Yule ch. 1, 2 and 3 May 16: Yule ch. 4, 5 and 6 May 21: Yule ch. 7, 8 and 9 May 22: Seminar EXAM Thursday; May 31,
Welcome to Introduction to Psychology! Let’s share a bit about where we are all from…
Discourse Analysis Natural Language Understanding basis
Simone Paolo Ponzetto University of Heidelberg Massimo Poesio
Introduction to In-Text Citations
Referring Expressions: Definition
Lecture 17: Discourse: Anaphora Resolution and Coherence
Algorithms for Reference Resolution
Parts of Speech Nouns Prepositions Pronouns Conjunctions
CS4705 Natural Language Processing
Presentation transcript:

1 CS3730/ISP3120 Discourse Processing and Pragmatics Lecture Notes Jan 10, 12

2 Outline Finish going over the syllabus and list of assigned papers. Signup for presentation dates. Questions about Bonnie Webber’s Chapter in the Handbook of Discourse processing? Questions about the Intro to the Handbook of Discourse Processing? Information about Reference Information about Annotation

3 Syllabus and List of Assigned Papers Note that the weighting for calculating final grades has changed, to give you more credit for presentations and reaction essays. Instructions for the course yahoo! group were added to the syllabus. The data cannot be posted to the web, so information about where it is has been posted to the group. Note that a journal article was removed from the list of assigned papers, and the schedule changed accordingly Auditors: the grade you will receive is NC, for “no credit” This is a change effective this term.

4 List of Assigned Papers Schedule

5 Signup for Presentation Dates Non-auditors (NAs) should sign up for two lectures. NAs should randomly select a number NAs will select a presentation date in increasing order, and then select their second presentation date in decreasing order

6 Signup for presentation dates Let doubles = (# NAs * 2) – 24 At most doubles/2 (rounded up) paired presentations may be chosen during the first round. An individual NA may be involved in at most one paired presentation A day may have at most 2 presenters The following papers much have sole presenters: Lappin & Leass 94, Mann & Thompson 88, Hobbs 79

7 Questions on Bonnie Webber’s Chapter? Good for pointers into the literature. P. 800: ellipsis, eventualities, information structure (general idea of what they are?) P : understand the examples?

8 Questions about the Introduction to The Handbook of DP? Many fields study discourse. Different definitions, theoretical paradigms, and methodologies Interesting links are described on page 6 Most of the topics in Part 1, Discourse Analysis and Linguistics, have influenced work in NLP (except historical linguistics; diachronic) Throughout, many possibilities for NLP, some in interdisciplinary work

9 Reference –Kinds of reference phenomena –Constraints on co-reference –Preferences for co-reference From Dan Jurafsky’s Lecture notes. Here are his acknowledgements: Thanks to Diane Litman, Andy Kehler, Jim Martin!!! This material is from J+M Chapter 18, written by Andy Kehler, slides inspired by Diane Litman + Jim Martin

10 Reference Resolution John went to Bill’s car dealership to check out an Acura Integra. He looked at it for half an hour I’d like to get from Boston to San Francisco, on either December 5th or December 6th. It’s ok if it stops in another city along they way

11 Why reference resolution? Conversational Agents: Airline reservation system needs to know what “it” refers to in order to book correct flight Information Extraction: First Union Corp. is continuing to wrestle with severe problems unleashed by a botched merger and a troubled business strategy. According to industry insiders at Paine Webber, their president, John R. Georgius, is planning to retire by the end of the year.

12 Some terminology John went to Bill’s car dealership to check out an Acura Integra. He looked at it for half an hour Reference: process by which speakers use words John and he to denote a particular person –Referring expression: John, he –Referent: the actual entity (but as a shorthand we might call “John” the referent). –John and he “corefer” –Antecedent: John –Anaphor: he

13 Many types of reference (after Webber 91) According to John, Bob bought Sue an Integra, and Sue bought Fred a Legend –But that turned out to be a lie (a speech act) –But that was false (proposition) –That struck me as a funny way to describe the situation (manner of description) –That caused Sue to become rather poor (event) –That caused them both to become rather poor (combination of several events)

14 Reference Phenomena Indefinite noun phrases: new to hearer –I saw an Acura Integra today –Some Acura Integras were being unloaded… –I am going to the dealership to buy an Acura Integra today. (specific/non-specific) I hope they still have it I hope they have a car I like Definite noun phrases: identifiable to hearer because –Mentioned: I saw an Acura Integra today. The Integra was white –Identifiable from beliefs: The Indianapolis 500 –Inherently unique: The fastest car in …

15 Indefinites (an aside) Lots of complexities; an e.g. of one type: The king and his men don’t know Merry and Pippin, and they can’t even see what they are (superordinate term “figures” versus basic level term “hobbits“) There they [the King and his men] saw close beside them a great rubbleheap; and suddenly they were aware of two small figures lying on it at their ease, grey-clad, hardly to be seen among the stones. [The Two Towers, Tolkein]

16 Reference Phenomena: Pronouns I saw an Acura Integra today. It was white Compared to definite noun phrases, pronouns require more referent salience. –John went to Bob’s party, and parked next to a beautiful Acura Integra –He went inside and talked to Bob for more than an hour. –Bob told him that he recently got engaged. –??He also said that he bought it yesterday. –OK He also said that he bought the Acura yesterday

17 More on Pronouns Cataphora: pronoun appears before referent: –Before he bought it, John checked over the Integra very carefully.

18 Inferrables I almost bought an Acura Integra today, but the engine seemed noisy. Mix the flour, butter, and water. –Kneed the dough until smooth and shiny –Spread the paste over the blueberries –Stir the batter until all lumps are gone.

19 Generics I saw no less than 6 Acura Integras today. They are the coolest cars.

20 Pronominal Reference Resolution Given a pronoun, find the reference (either in text or as a entity in the world) We will look at constraints. The first student presentations will look at resolution algorithms. –Hard constraints on reference –Soft constraints on reference

21 Hard constraints on coreference Number agreement –John has an Acura. It is red. Person and case agreement –*John and Mary have Acuras. We love them (where We=John and Mary) Gender agreement –John has an Acura. He/it/she is attractive. Syntactic constraints –John bought himself a new Acura (himself=John) –John bought him a new Acura (him = not John)

22 Pronoun Interpretation Preferences Selectional Restrictions –John parked his Acura in the garage. He had driven it around for hours. Recency –John has an Integra. Bill has a Legend. Mary likes to drive it.

23 Pronoun Interpretation Preferences Grammatical Role: Subject preference –John went to the Acura dealership with Bill. He bought an Integra. –Bill went to the Acura dealership with John. He bought an Integra –(?) John and Bill went to the Acura dealership. He bought an Integra

24 Repeated Mention preference John needed a car to get to his new job. He decided that he wanted something sporty. Bill went to the Acura dealership with him. He bought an Integra.

25 Parallelism Preference Mary went with Sue to the Acura dealership. Sally went with her to the Mazda dealership. Mary went with Sue to the Acura dealership. Sally told her not to buy anything.

26 Verb Semantics Preferences John telephoned Bill. He lost the pamphlet on Acuras. John criticized Bill. He lost the pamphlet on Acuras. Implicit causality –Implicit cause of criticizing is object. –Implicit cause of telephoning is subject.

27 Manual Annotation AKA coding, labeling

28 From Webber’s Chapter The aims of computational work in [discourse and dialog]: –Modeling particular phenomena in discourse and dialog in terms of underlying computational processes –Providing useful natural language services, whose success depends in part on handling aspects of discourse and dialog What computation contributes is a coherent framework for modeling these phenomena in terms of … search through a space of possible candidate interpretations (in language analysis) or candidate realizations (in language generation)

29 Desiderata Interesting and rich enough Not so rich that automation is too far ahead of the current state of the art –Too complex logical structure –Knowledge bottleneck (without viable source) –Too fine-grained or subtle Annotation instructions (aka coding manual) feasible Time required for training is reasonable Annotators can reliably perform the annotations in a reasonable amount of time

30 Minimal Process for NLP 1.Develop initial coding manual 2.At least two people perform sample annotations, and discuss their disagreements and experiences 3.Revise coding manual 4.Repeat 2-3 until agreement on training data is sufficient 5.Independently annotate a fresh test set 6.Evaluate agreement

31 Additional Steps 1.Develop initial coding manual 2.At least two people perform sample annotations, and discuss their disagreements and experiences. Analysis of patterns of agreement and disagreement using probability models (Wiebe et al. ACL-99; Bruce & Wiebe NLE-99; from work in applied statistics) 3.Revise coding manual 4.Repeat 2-3 until agreement on training data is sufficient 5.Independently annotate a fresh test set 6.Evaluate agreement 7.Train more annotators, assess average time for training and annotation 8.Evaluate other types of reliability (psychology, content analysis, applied statistics literatures)

32 Measures of Agreement Percentage Agreement: OK, but not sufficient If the distribution of classes is highly skewed, then the baseline algorithm of always assigning the most frequent class would have high agreement Kappa: measures agreement over and above agreement expected by chance Details available in section 3 of this paper by our groupthis paper