A Database of Nate Chambers and Dan Jurafsky Stanford University Narrative Schemas.

Slides:



Advertisements
Similar presentations
ELA Common Core – What We Know So Far….
Advertisements

Overview of the TAC2013 Knowledge Base Population Evaluation: English Slot Filling Mihai Surdeanu with a lot help from: Hoa Dang, Joe Ellis, Heng Ji, and.
1 CS 388: Natural Language Processing Story Understanding Raymond J. Mooney University of Texas at Austin.
1 Relational Learning of Pattern-Match Rules for Information Extraction Presentation by Tim Chartrand of A paper bypaper Mary Elaine Califf and Raymond.
A Joint Model For Semantic Role Labeling Aria Haghighi, Kristina Toutanova, Christopher D. Manning Computer Science Department Stanford University.
Recognizing Implicit Discourse Relations in the Penn Discourse Treebank Ziheng Lin, Min-Yen Kan, and Hwee Tou Ng Department of Computer Science National.
LEDIR : An Unsupervised Algorithm for Learning Directionality of Inference Rules Advisor: Hsin-His Chen Reporter: Chi-Hsin Yu Date: From EMNLP.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
July 9, 2003ACL An Improved Pattern Model for Automatic IE Pattern Acquisition Kiyoshi Sudo Satoshi Sekine Ralph Grishman New York University.
Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.
Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.
Flash talk by: Aditi Garg, Xiaoran Wang Authors: Sarah Rastkar, Gail C. Murphy and Gabriel Murray.
MCTest-160 [1] contains pairs of narratives and multiple choice questions. An example of narrative QA: Story: “Sally had a very exciting summer vacation.
Company LOGO Software Piracy Software Piracy Software Pirate Gets 7 Years in Prison Cannelle Cuvelier.
Jan 4 th 2013 Event Extraction Using Distant Supervision Kevin Reschke.
Computational Lexical Semantics Lecture 9: Learning Narrative Frames Linguistic Institute 2005 University of Chicago.
ELN – Natural Language Processing Giuseppe Attardi
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Web Document Analysis: How can Natural Language Processing Help in Determining Correct Content Flow? Hassan Alam, Fuad Rahman and Yuliya Tarnikova Human.
Rui Yan, Yan Zhang Peking University
Knowledge and Tree-Edits in Learnable Entailment Proofs Asher Stern and Ido Dagan (earlier partial version by Roy Bar-Haim) Download at:
Automatic Extraction of Opinion Propositions and their Holders Steven Bethard, Hong Yu, Ashley Thornton, Vasileios Hatzivassiloglou and Dan Jurafsky Department.
Learning Narrative Schemas Nate Chambers, Dan Jurafsky Stanford University IBM Watson Research Center Visit.
Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Institute for System Programming of RAS.
Identifying the Elements of a Plot Diagram Student Notes.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Illinois-Coref: The UI System in the CoNLL-2012 Shared Task Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Mark Sammons, and Dan Roth Supported by ARL,
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
Open Information Extraction using Wikipedia
David L. Chen Fast Online Lexicon Learning for Grounded Language Acquisition The 50th Annual Meeting of the Association for Computational Linguistics (ACL)
Incident Threading for News Passages (CIKM 09) Speaker: Yi-lin,Hsu Advisor: Dr. Koh, Jia-ling. Date:2010/06/14.
Unsupervised Learning of Narrative Event Chains Original paper by: Nate Chambers and Dan Jurafsky in ACL 2008 This presentation for discussion created.
Coreference Resolution with Knowledge Haoruo Peng March 20,
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources Author: Carmen Banea, Rada Mihalcea, Janyce Wiebe Source:
Modelling Human Thematic Fit Judgments IGK Colloquium 3/2/2005 Ulrike Padó.
Outline Knowledge discovery in databases. Data warehousing. Data mining. Different types of data mining. The Apriori algorithm for generating association.
Minimally Supervised Event Causality Identification Quang Do, Yee Seng, and Dan Roth University of Illinois at Urbana-Champaign 1 EMNLP-2011.
1 KINDS OF PARAGRAPH. There are at least seven types of paragraphs. Knowledge of the differences between them can facilitate composing well-structured.
Comparing Information Extraction Pattern Models Mark Stevenson and Mark A. Greenwood Natural Language Processing Group University of Sheffield, UK.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
Multilingual Opinion Holder Identification Using Author and Authority Viewpoints Yohei Seki, Noriko Kando,Masaki Aono Toyohashi University of Technology.
Inference Protocols for Coreference Resolution Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Nick Rizzolo, Mark Sammons, and Dan Roth This research.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
Comparing Word Relatedness Measures Based on Google n-grams Aminul ISLAM, Evangelos MILIOS, Vlado KEŠELJ Faculty of Computer Science Dalhousie University,
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
FILTERED RANKING FOR BOOTSTRAPPING IN EVENT EXTRACTION Shasha Liao Ralph York University.
Using Linguistic Analysis and Classification Techniques to Identify Ingroup and Outgroup Messages in the Enron Corpus.
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Annotating and measuring Temporal relations in texts Philippe Muller and Xavier Tannier IRIT,Université Paul Sabatier COLING 2004.
Cold-Start KBP Something from Nothing Sean Monahan, Dean Carpenter Language Computer.
Solving Hard Coreference Problems Haoruo Peng, Daniel Khashabi and Dan Roth Problem Description  Problems with Existing Coref Systems Rely heavily on.
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
A Database of Narrative Schemas A 2010 paper by Nathaniel Chambers and Dan Jurafsky Presentation by Julia Kelly.
Reading literacy. Definition of reading literacy: “Reading literacy is understanding, using and reflecting on written texts, in order to achieve one’s.
Automatically Labeled Data Generation for Large Scale Event Extraction
Part 2 Applications of ILP Formulations in Natural Language Processing
A Brief Introduction to Distant Supervision
Reading & Writing assignments in Chemistry
Relation Extraction CSCI-GA.2591
Temporal Constraint Acquisition
Two Discourse Driven Language Models for Semantics
Improving a Pipeline Architecture for Shallow Discourse Parsing
Learning Scripts for Text Understanding with Recurrent Neural Networks
CS224N Section 3: Corpora, etc.
Word embeddings (continued)
Unsupervised Learning of Narrative Schemas and their Participants
Presentation transcript:

A Database of Nate Chambers and Dan Jurafsky Stanford University Narrative Schemas

Two Joint Tasks Events in a NarrativeSemantic Roles suspect, criminal, client, immigrant, journalist, government, … police, agent, officer, authorities, troops, official, investigator, …

Scripts Background knowledge for language understanding Restaurant Script Schank and Abelson Scripts Plans Goals and Understanding. Lawrence Erlbaum. Mooney and DeJong Learning Schemata for NLP. IJCAI-85. Hand-coded Domain dependent

Applications Coreference Argument prediction Summarization Inform sentence selection with event confidence scores Textual Inference Does a document infer other events Selectional Preferences Use chains to inform argument types Aberration Detection Detect surprise/unexpected events in text Story Generation

The Protagonist protagonist: (noun) 1.the principal character in a drama or other literary work 2.a leading actor, character, or participant in a literary work or real event

Narrative Event Chains ACL-2008 Narrative Event Chains (1)Narrative relations (2)Single arguments (3)Temporal ordering

Inducing Narrative Relations 1.Dependency parse a document. 2.Run coreference to cluster entity mentions. 3.Count pairs of verbs with coreferring arguments. 4.Use pointwise mutual information to measure relatedness. Chambers and Jurafsky. Unsupervised Learning of Narrative Event Chains. ACL-08 Narrative Coherence Assumption Verbs sharing coreferring arguments are semantically connected by virtue of narrative discourse structure.

Chain Example (ACL-08)

Schema Example Police, Agent, Authorities Judge, Official Prosecutor, Attorney Plea, Guilty, Innocent Suspect, Criminal, Terrorist, …

Narrative Schemas E = {arrest, charge, plead, convict, sentence}

Learning Schemas

Training Data NYT portion of the Gigaword Corpus David Graff English Gigaword. Linguistic Data Consortium. 1.2 million documents Stanford Parser OpenNLP coreference Lemmatize verbs and noun arguments.

Viral Example mosquito, aids, virus, tick, catastrophe, disease virus, disease, bacteria, cancer, toxoplasma, strain

Authorship Example company, author, group, year, microsoft, magazine book, report, novel, article, story, letter, magazine

Temporal Ordering Supervised classifier for before/after relations Chambers and Jurafsky, EMNLP Chambers et al., ACL Classify all pairs of verbs in Gigaword Record counts of before and after relations 16

The Database 1.Narrative Schemas (unordered) 1.Various sizes of schemas (6, 8, 10, 12) base verbs 2.Temporal Orderings 1.Pairs of verbs 2.Counts of before and after relations 17

Evaluations The Cloze Test Chambers, Jurafsky. ACL Chambers, Jurafsky. ACL Comparison to FrameNet Chambers, Jurafsky. ACL Corpus Coverage LREC

Comparison to FrameNet Narrative Schemas Focuses on events that occur together in a narrative. FrameNet (Baker et al., 1998) Focuses on events that share core roles.

Comparison to FrameNet Narrative Schemas Focuses on events that occur together in a narrative. Schemas represent larger situations. FrameNet (Baker et al., 1998) Focuses on events that share core roles. Frames typically represent single events.

Comparison to FrameNet 1.How similar are schemas to frames? Find “best” FrameNet frame by event overlap 2.How similar are schema roles to frame elements? Evaluate argument types as FrameNet frame elements.

FrameNet Schema Similarity 1.How many schemas map to frames? 13 of 20 schemas mapped to a frame 26 of 78 (33%) verbs are not in FrameNet 2.Verbs present in FrameNet 35 of 52 (67%) matched frame 17 of 52 (33%) did not match

FrameNet Schema Similarity trade rise fall slip quote Exchange Change Position on a Scale Multiple FrameNet FramesOne Schema Why are 33% unaligned? FrameNet represents subevents as separate frames Schemas model sequences of events. Undressing n/a Narrative Relation

Evaluations The Cloze Test Chambers, Jurafsky. ACL Chambers, Jurafsky. ACL Comparison to FrameNet Chambers, Jurafsky. ACL Corpus Coverage LREC

Corpus Coverage Evaluation Narrative Schemas are generalized knowledge structures. Newspaper articles discuss specific scenarios. How many events in an article’s description are stereotypical events in narrative schemas? 25

Coverage Example 26 Investing Schema invest sell take_out buy pay withdraw pull pull_out put put_in Article Text aware sell think pay walk put He is painfully aware that if he sold his four-bedroom brick suburban home for the $220,000 that he thinks he can get for it and then paid off his mortgage, he would walk away with, as he puts it, ….

Coverage Score Largest Connected Component Largest subset of vertices such that a path exists between all vertices Events are connected if there exists some schema such that both events are members. 27 Article Text aware sell think pay walk put 3 of the 6 are connected 50% coverage

Coverage Results 69 documents 740 events Macro-average document coverage Final coverage: 34% “One third of a document’s events are part of a self- contained narrative schema.” 28

Evaluation Results The Cloze Test Schemas improve 36% on event prediction over verb-based similarity Comparison to FrameNet 65% of schemas match FrameNet frames 33% of schema events are novel to FrameNet Corpus Coverage 96% of events are connected in the space of events 34% of events are connected by self-contained schemas 29

Schemas Online Narrative Schemas Coverage Evaluation (Cloze Test) 30 Nate Chambers and Dan Jurafsky

31

FrameNet Argument Similarity 2.Argument role mapping to frame elements. 72% of arguments appropriate as frame elements law, ban, rule, constitutionality, conviction, ruling, lawmaker, tax INCORRECT FrameNet frame: Enforcing Frame element: Rule

Coverage Evaluation 1.Choose a news article at random. 2.Identify the protagonist. 3.Extract the narrative event chain. Match the chain to the best narrative schema with the largest event overlap.

Coverage Dataset NYT portion of the Gigaword Corpus Randomly selected the year random newspaper articles within initially chosen, 31 removed that are not news Identified the protagonist and events by hand 34

The Resource 3.5% of events in new documents are disconnected from the space of narrative relations Chambers and Jurafsky. ACL % of events in new documents are not clustered into generalized narrative schemas. The extent to which a document discusses new variants of known schemas remains for future work. 35