A Database of Narrative Schemas A 2010 paper by Nathaniel Chambers and Dan Jurafsky Presentation by Julia Kelly.

Slides:



Advertisements
Similar presentations
SWG Strategy (C) Copyright IBM Corp. 2006, All Rights Reserved. P4 Task 2 Fact Extraction using a CNL Current Status David Mott, Dave Braines, ETS,
Advertisements

NRRC Summer ‘02 Workshop Proposal Temporal December 5, 2001 MITRE James Pustejovsky, Brandeis with input from: James Allan, UMASS Inderjeet Mani, MITRE.
1 Temporal Parsing Annotation A temporal parse (TP) is an annotation that explicitly marks the temporal relationship between the different events, time.
Exploring the Effectiveness of Lexical Ontologies for Modeling Temporal Relations with Markov Logic Eun Y. Ha, Alok Baikadi, Carlyle Licata, Bradford Mott,
COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
Coreference Based Event-Argument Relation Extraction on Biomedical Text Katsumasa Yoshikawa 1), Sebastian Riedel 2), Tsutomu Hirao 3), Masayuki Asahara.
Jointly Identifying Temporal Relations with Markov Logic Katsumasa Yoshikawa †, Sebastian Riedel ‡, Masayuki Asahara †, Yuji Matsumoto † † Nara Institute.
NYU ANLP-00 1 Automatic Discovery of Scenario-Level Patterns for Information Extraction Roman Yangarber Ralph Grishman Pasi Tapanainen Silja Huttunen.
Statistical NLP: Lecture 3
Processing of large document collections Part 6 (Text summarization: discourse- based approaches) Helena Ahonen-Myka Spring 2006.
1 Discourse, coherence and anaphora resolution Lecture 16.
A Database of Nate Chambers and Dan Jurafsky Stanford University Narrative Schemas.
Semantic Frames: FrameNet. What is FrameNet? FrameNet is an ongoing project at the International Computer Science Institute located in Berkeley California.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Towards An Algebraic Formulation of Domain Definitions using Parameterised Machines T. L. McCluskey and R. M.Simpson School of Computing and Engineering.
Event Extraction: Learning from Corpora Prepared by Ralph Grishman Based on research and slides by Roman Yangarber NYU.
Basi di dati distribuite Prof. M.T. PAZIENZA a.a
TimeBank Status Status of TimeML annotation for the ULA project James Pustejovsky and Marc Verhagen Brandeis University.
Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text Soo-Min Kim and Eduard Hovy USC Information Sciences Institute 4676.
Computational Lexical Semantics Lecture 9: Learning Narrative Frames Linguistic Institute 2005 University of Chicago.
Page 1 Relation Alignment for Textual Entailment Recognition Department of Computer Science University of Illinois at Urbana-Champaign Mark Sammons, V.G.Vinod.
ELN – Natural Language Processing Giuseppe Attardi
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Learning Narrative Schemas Nate Chambers, Dan Jurafsky Stanford University IBM Watson Research Center Visit.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Syntax Lecture 8: Verb Types 1. Introduction We have seen: – The subject starts off close to the verb, but moves to specifier of IP – The verb starts.
1 Statistical NLP: Lecture 10 Lexical Acquisition.
Computational Lexical Semantics Lecture 8: Selectional Restrictions Linguistic Institute 2005 University of Chicago.
Based on “Semi-Supervised Semantic Role Labeling via Structural Alignment” by Furstenau and Lapata, 2011 Advisors: Prof. Michael Elhadad and Mr. Avi Hayoun.
A Survey for Interspeech Xavier Anguera Information Retrieval-based Dynamic TimeWarping.
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
IV. SYNTAX. 1.1 What is syntax? Syntax is the study of how sentences are structured, or in other words, it tries to state what words can be combined with.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Incorporating Extra-linguistic Information into Reference Resolution in Collaborative Task Dialogue Ryu Iida Shumpei Kobayashi Takenobu Tokunaga Tokyo.
Open Information Extraction using Wikipedia
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Combining terminology resources and statistical methods for entity recognition: an evaluation Angus Roberts, Robert Gaizauskas, Mark Hepple, Yikun Guo.
UB LIS 571 Soergel Lecture 6.2b Document analysis for retrieval and information extraction Dagobert Soergel Department of Library and Information Studies.
Processing of large document collections Part 6 (Text summarization: discourse- based approaches) Helena Ahonen-Myka Spring 2005.
Comparing Information Extraction Pattern Models Mark Stevenson and Mark A. Greenwood Natural Language Processing Group University of Sheffield, UK.
TimeML compliant text analysis for Temporal Reasoning Branimir Boguraev and Rie Kubota Ando.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
Using a Named Entity Tagger to Generalise Surface Matching Text Patterns for Question Answering Mark A. Greenwood and Robert Gaizauskas Natural Language.
TWC Illuminate Knowledge Elements in Geoscience Literature Xiaogang (Marshall) Ma, Jin Guang Zheng, Han Wang, Peter Fox Tetherless World Constellation.
FILTERED RANKING FOR BOOTSTRAPPING IN EVENT EXTRACTION Shasha Liao Ralph York University.
NLP. Introduction to NLP Last week, Min broke the window with a hammer. The window was broken with a hammer by Min last week With a hammer, Min broke.
SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland.
SemAF – Basics: Semantic annotation framework Harry Bunt Tilburg University isa -6 Joint ISO - ACL/SIGSEM workshop Oxford, January 2011 TC 37/SC.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
1 Question Answering and Logistics. 2 Class Logistics  Comments on proposals will be returned next week and may be available as early as Monday  Look.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Department of Computer Science The University of Texas at Austin USA Joint Entity and Relation Extraction using Card-Pyramid Parsing Rohit J. Kate Raymond.
1 The T4SQL Temporal Query Language Presented by 黃泰豐 2007/12/26.
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Annotating and measuring Temporal relations in texts Philippe Muller and Xavier Tannier IRIT,Université Paul Sabatier COLING 2004.
NTNU Speech Lab 1 Topic Themes for Multi-Document Summarization Sanda Harabagiu and Finley Lacatusu Language Computer Corporation Presented by Yi-Ting.
Artificial Intelligence Knowledge Representation.
King Faisal University جامعة الملك فيصل Deanship of E-Learning and Distance Education عمادة التعلم الإلكتروني والتعليم عن بعد [ ] 1 جامعة الملك فيصل عمادة.
COP Introduction to Database Structures
The Semantic Web By: Maulik Parikh.
Syntax Lecture 9: Verb Types 1.
Two Discourse Driven Language Models for Semantics
(Entity and) Event Extraction CSCI-GA.2591
Improving a Pipeline Architecture for Shallow Discourse Parsing
Lecture 9: Semantic Parsing
Linguistic Essentials
Unsupervised Learning of Narrative Schemas and their Participants
Sanjna Kashyap 18th February 2019
Presentation transcript:

A Database of Narrative Schemas A 2010 paper by Nathaniel Chambers and Dan Jurafsky Presentation by Julia Kelly

Natural Language Understanding (NLU) A more specific definition of a sub-topic for Natural Language Processing (NLP) The Parsing rather than the generating part of NLP

Chambers and Jurafsky: What are they trying to solve? Using unsupervised learning (machine learning technique) in conjunction with coreference chains to extract rich event structure in order to produce better narrative schema

What came before? FrameNet: Baker (also a Upenn affiliate) et al FrameNet: Baker (also a Upenn affiliate) et al – – Frame definition Frame definition – Annotation Annotation TimeBank and TempEval : Pustejovsky et al TimeBank and TempEval : Pustejovsky et al – TimeBank is a corpus (often referred to as TimeML now) TimeBank is a corpus (often referred to as TimeML now) – TempEval seems to organize by events TempEval seems to organize by events

TimeBank TimeML is a robust specification language for events and temporal expressions in natural language. It is designed to address four problems in event and temporal expression markup: Time stamping of events (identifying an event and anchoring it in time); Ordering events with respect to one another (lexical versus discourse properties of ordering); Reasoning with contextually underspecified temporal expressions (temporal functions such as 'last week' and 'two weeks before'); Reasoning about the persistence of events (how long does an event or the outcome of an event last).

TASK A: For a restricted set of event terms, identify temporal relations between events and all time expressions appearing in the same sentence. (NOTE: The restricted set of event terms is to be specified by providing a list of root forms. Time expressions will be annotated in the source, in accordance with TIMEX3.) TASK B: For a restricted set of event terms, identify temporal relations between events and the Document Creation Time (DCT). (NOTE: The restricted set of events will be the same as for Task A. DCTs will be explicitly annotated in the source.) TASK C: Identify the temporal relations betweeen contiguous pairs of matrix verbs. (NOTE: matrix verbs, i.e. the main verb of the matrix clause in each sentence, will be explicitly annotated in the source.) We specify three separate tasks that involve identifying event-time and event-event temporal relations. A restricted set of temporal relations will be used, which includes only the relations: BEFORE, AFTER, and OVERLAP (defined to encompass all cases where event intervals have non-empty overlap).

Chambers and Jurafsky: Narrative Schema Narrative Schema contain: Narrative Schema – Sets of related events – A temporal ordering of events – The semantic roles of the participants Based off of scripts.

Narrative Schema A brief example Hand-selected examples from the Database

Chambers and Jurafsky: A better approach A schema is not just a chain of synonyms Counting times verbs appear before or after on another Combines both the schema and temporal data within the database

Narrative Coherence Assumption Predicates sharing coreferring arguments are related by virtue of narrative discourse structure. In order to find events, 1. Parse all sentences into dependency graphs. 2. Run coreference over the document. 3. Count all pairs of verbs and dependencies (e.g. subject, object) that are filled by coreferring entities. 4. Record with each pair the head word of the shared argument.

Why did other approaches not make the grade? FrameNet approaches the problem from the aspect of frames TimeBank did not also include a method of schema in addition to temporal placement

Performance The Chambers and Jurafsky database has very similar results to the hand-labeled FrameNet data. The temporal aspect of the database did not seem to perform as well as the TimeBank Somewhat specialized