Syntactic Contributions in the Entailment Task Lucy Vanderwende, Arul Menezes, Rion Snow (Stanford)

Slides:

Advertisements

Similar presentations

Engineering in business and the environment Lesson 5: Environmental legislation relating to noise and waste.

Advertisements

COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.

COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.

Recognizing Textual Entailment Challenge PASCAL Suleiman BaniHani.

Deciding entailment and contradiction with stochastic and edit distance-based alignment Marie-Catherine de Marneffe, Sebastian Pado, Bill MacCartney, Anna.

Overview of the KBP 2013 Slot Filler Validation Track Hoa Trang Dang National Institute of Standards and Technology.

Lesson 02: Experimental Design

A Joint Model For Semantic Role Labeling Aria Haghighi, Kristina Toutanova, Christopher D. Manning Computer Science Department Stanford University.

Robust Textual Inference via Graph Matching Aria Haghighi Andrew Ng Christopher Manning.

Running Records.

Normalized alignment of dependency trees for detecting textual entailment Erwin Marsi & Emiel Krahmer Tilburg University Wauter Bosma & Mariët Theune University.

Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.

Socrates and the Socratic Turn

1 Natural Language Processing for the Web Prof. Kathleen McKeown 722 CEPSR, Office Hours: Wed, 1-2; Tues 4-5 TA: Yves Petinot 719 CEPSR,

Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.

Automatic Classification of Semantic Relations between Facts and Opinions Koji Murakami, Eric Nichols, Junta Mizuno, Yotaro Watanabe, Hayato Goto, Megumi.

Third Recognizing Textual Entailment Challenge Potential SNeRG Submission.

A Confidence Model for Syntactically-Motivated Entailment Proofs Asher Stern & Ido Dagan ISCOL June 2011, Israel 1.

Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text Soo-Min Kim and Eduard Hovy USC Information Sciences Institute 4676.

Anomaly detection Problem motivation Machine Learning.

Outline P1EDA’s simple features currently implemented –And their ablation test Features we have reviewed from Literature –(Let’s briefly visit them) –Iftene’s.

Overview of the Fourth Recognising Textual Entailment Challenge NIST-Nov. 17, 2008TAC Danilo Giampiccolo (coordinator, CELCT) Hoa Trang Dan (NIST)

The Queen Elizabeth II is a constitutional monarch: that is, she is Britain’s head of state, but her executive powers are limited by constitutional rules.

Learning Information Extraction Patterns Using WordNet Mark Stevenson and Mark A. Greenwood Natural Language Processing Group University of Sheffield,

Fabio Massimo Zanzotto

How Newspapers Differ: Devolution in Northern Ireland How far down the path to devolution is Northern Ireland?

Opinion Mining Using Econometrics: A Case Study on Reputation Systems Anindya Ghose, Panagiotis G. Ipeirotis, and Arun Sundararajan Leonard N. Stern School.

Governments of Europe.

The UK Constitutional Arrangement Starter Task 1.Who is the head of state of the United Kingdom? 2.According to British law, one group of people are never.

The British Government

Assessing the Impact of Frame Semantics on Textual Entailment Authors: Aljoscha Burchardt, Marco Pennacchiotti, Stefan Thater, Manfred Pinkal Saarland.

Knowledge and Tree-Edits in Learnable Entailment Proofs Asher Stern, Amnon Lotan, Shachar Mirkin, Eyal Shnarch, Lili Kotlerman, Jonathan Berant and Ido.

2007. Software Engineering Laboratory, School of Computer Science S E Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying.

Interpreting Dictionary Definitions Dan Tecuci May 2002.

The consumption of e-democracy in Britain Wainer Lusoli University of Chester

Scalable Inference and Training of Context- Rich Syntactic Translation Models Michel Galley, Jonathan Graehl, Keven Knight, Daniel Marcu, Steve DeNeefe.

 Great Britain England Scotland Wales  Northern Ireland Independent Irish Republic.

United Kingdom Review Jeopardy Mr. Oakes UK Review.

A Language Independent Method for Question Classification COLING 2004.

The British Isles Reading The tasks of this period:  1.Get a better understanding of the British Isles.  2.Describe something about the British Isles.

Scotland within the UK and EU: the work and welfare issue.

How British are you?. Which countries are part of Britain?

Event-Centric Summary Generation Lucy Vanderwende, Michele Banko and Arul Menezes One Microsoft Way, WA, USA DUC 2004.

Presenter: Jinhua Du ( 杜金华 ) Xi’an University of Technology 西安理工大学 NLP&CC, Chongqing, Nov , 2013 Discriminative Latent Variable Based Classifier.

1 UNIT ONE Topic: Accuracy and Precision. 2 Accuracy How close a measurement is to the actual or true value good accuracy true value poor accuracy true.

Inference Protocols for Coreference Resolution Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Nick Rizzolo, Mark Sammons, and Dan Roth This research.

Welcome to the UK!.

1 Generating Comparative Summaries of Contradictory Opinions in Text (CIKM09’)Hyun Duk Kim, ChengXiang Zhai 2010/05/24 Yu-wen,Hsu.

Relational Duality: Unsupervised Extraction of Semantic Relations between Entities on the Web Danushka Bollegala Yutaka Matsuo Mitsuru Ishizuka International.

Spatial Smoothing and Multiple Comparisons Correction for Dummies Alexa Morcom, Matthew Brett Acknowledgements.

SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland.

 2003 CSLI Publications Ling 566 Oct 17, 2011 How the Grammar Works.

Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks EMNLP 2008 Rion Snow CS Stanford Brendan O’Connor Dolores.

Feature Assignment LBSC 878 February 22, 1999 Douglas W. Oard and Dagobert Soergel.

My daughter likes I am fond of My husband enjoys My son is interested in and.

Europes Governments SS6CG4a/b/c SS6CG5 a/b.

Relation Extraction (RE) via Supervised Classification See: Jurafsky & Martin SLP book, Chapter 22 Exploring Various Knowledge in Relation Extraction.

Predicting and Adapting to Poor Speech Recognition in a Spoken Dialogue System Diane J. Litman AT&T Labs -- Research

Where is Europe? Where is the British Isles? What do you know about the British Isles?

The UK Constitutional Arrangement

Political representation and democracy

Diana McCarthy Erasmus Mundus Visiting Scholar Saarland University

How were countries run in Europe during the 1920s and 1930s?

Объединенное Королевство

Democracy in Scotland and the United Kingdom

Объединенное Королевство

Constitution and Parliament

An Overview of Concepts and Selected Techniques

Automatic Detection of Causal Relations for Question Answering

Johns Hopkins 2003 Summer Workshop on Syntax and Statistical Machine Translation Chapters 5-8 Ethan Phelps-Goodman.

Presentation transcript:

Syntactic Contributions in the Entailment Task Lucy Vanderwende, Arul Menezes, Rion Snow (Stanford)

RTE-1 analysis Recap of MSR’s manual analysis of RTE-1 test data; in principle, 74% is achievable using syntax and thesaurus Without thesaurus Using thesaurus True69 (9%)147 (18%) False197 (25%)243 (30%) Not syntax534 (67%)410 (51%)

RTE-1 analysis Recap of MSR’s manual analysis of RTE-1 test data; in principle, 74% is achievable using syntax and thesaurus Without thesaurus Using thesaurus True69 (9%)147 (18%) False197 (25%)243 (30%) Not syntax534 (67%)410 (51%)

MENT algorithm Predicting negative entailment using syntactic features: Obtain syntactic dependency graphs for T and H sentences Attempt to align each H node to a node in T Check syntactic heuristics on aligned nodes if match, then predict false If no match, use lexical similarity model (with threshold)

MENT: heuristic alignment

MENT: superlative heuristic Superlative heuristic (100% accurate, 5 test items): –If the superlatives align, and their heads are aligned, and the head in Text has any additional modifiers, and those modifiers are aligned to some modifier in H, say yes, else say no. (RTE2-test- #477) Crater Lake is the deepest lake in the United States, the second deepest in the Western Hemisphere, and the seventh deepest in the world, dropping downward to 1,932 feet just southeast of Merriam Cone. Crater Lake is the deepest lake in the world.

MENT: superlative heuristic Superlative heuristic (100% accurate, 5 test items): –If the superlatives align, and their heads are aligned, and the head in Text has any additional modifiers, and those modifiers are aligned to some modifier in H, say yes, else say no. (RTE2-test- #477) Crater Lake is the deepest lake in the United States, the second deepest in the Western Hemisphere, and the seventh deepest in the world, dropping downward to 1,932 feet just southeast of Merriam Cone. Crater Lake is the deepest lake in the world.

MENT: superlative heuristic Superlative heuristic (100% accurate, 5 test items): –If the superlatives align, and their heads are aligned, and the head in Text has any additional modifiers, and those modifiers are aligned to some modifier in H, say yes, else say no. (RTE2-test- #477) Crater Lake is the deepest lake in the United States, the second deepest in the Western Hemisphere, and the seventh deepest in the world, dropping downward to 1,932 feet just southeast of Merriam Cone. Crater Lake is the deepest lake in the world.

MENT: superlative heuristic Superlative heuristic (100% accurate, 5 test items): –If the superlatives align, and their heads are aligned, and the head in Text has any additional modifiers, and those modifiers are aligned to some modifier in H, say yes, else say no. (RTE2-test- #477) Crater Lake is the deepest lake in the United States, the second deepest in the Western Hemisphere, and the seventh deepest lake in the world, dropping downward to 1,932 feet just southeast of Merriam Cone. Crater Lake is the deepest lake in the world.

Counterfactual heuristic (80% accurate, 15 test items): –If there is a pair of aligned nodes, and a second pair of aligned nodes, and the PATH in the dependency contains a conditional or counterfactual, say no. (RTE2-test- #473) Blondlot was trying to polarize X-rays when he claimed to have discovered this new form of radiation. Blondlot discovered x-rays. MENT: Counterfactual heuristic

Counterfactual heuristic (80% accurate, 15 test items): –If there is a pair of aligned nodes, and a second pair of aligned nodes, and the PATH in the dependency contains a conditional or counterfactual, say no. (RTE2-test- #473) Blondlot was trying to polarize X-rays when he claimed to have discovered this new form of radiation. Blondlot discovered x-rays. MENT: Counterfactual heuristic

MENT: example of alignment heuristics Unaligned entity (64.49% accuracy, 13.38% of the test items): If a node in H is an entity, but is not aligned to any node in T, say no (RTE2-test #781) Former European Ryder Cup winning captain Sam Torrance says Welshman Ian Woosnam is the right man to lead Europe at the 2006 match in Ireland. Torrance told BBC Sport: "I think Ian Woosnam should get it" (the 2006 captaincy).

MENT: training feature weights “run2”: treating a syntactic heuristic match as a yes/no vote, alignment threshold set using training data “run1”: learning weights (using MaxEnt) for each syntactic and alignment heuristic, as well as for sub-components of these heuristics

MENT: results Run1 (with feature weights) Run2 Training (1717 sents) Dev (450 sents) RTE2 test (800 sents) RUN1 TRUTHYesNo Yes No MENT Run1 says no 43.25% of the time

MENT variations – no thresholds If heuristics apply, say no Else say yes 56% accurate system says no 35% Say no, unless everything is aligned and no heuristics apply 59.25% accurate system says no 74.5% SYSTEM TRUTHYesNo Yes No SYSTEM TRUTHYesNo Yes No65335 ** Note: Run2 = if no heuristics apply, and alignment score is above a threshold trained on the training set, then say yes, else no. Accuracy: 58.50

MENT variations – with threshold With learned alignment and syntactic heuristic weights, with alignment threshold from training, say no Else say yes 60.25% accurate System says no 43% of the time Say no, unless alignment score is above an Oracle threshold and no heuristics apply 61.25% accurate System says no 70% of the time SYSTEM TRUTHYesNo Yes No75325 RUN1 TRUTHYesNo Yes No186214

Lessons? Use syntactic heuristics and sub-components as features and apply discriminative training Thresholding for lexical similarity isn’t stable across data sets Error Analysis …

bad parses (e.g., rte2 test #550)

How far do you take syntactic heuristics? Location : for a pair of aligned verb nodes, if there is an argument in H, and that argument is aligned to a node in T, say no if that node is not also the same argument of the aligned verb (applied 7 times, 5 incorrect) Brandenburg Gate is one of Berlin's best known landmarks and is now regarded as one of the greatest symbols of German unity. Brandenburg Gate is in Berlin.

A great heuristic …but Unaligned Verb: if there is an aligned subject and an aligned object, then if their verb is not aligned, say no This heuristic was not used because of its poor performance, for example: –Rodriguez told detectives he never touched the burning backpack, which was loaded with plastic pipes packed with gunpowder and BBs. –The burning backpack contained plastic pipes packed with gunpowder and BBs. Need to learn paraphrase similarity for verbs – see NAACL-HLT paper forthcoming.

Directions and Plans MSR submission available at Might it be possible to have access to all sites’ submissions? Need to learn paraphrase similarity for verbs More feature engineering Different graph-matching strategies to avoid brittleness of syntactic heuristics Find more data for training to build more stable systems

A plug for Pyramids Conservatives oppose any form of devolution. The conservatives are opposed to devolution. The UK’s Tory Prime Minister adamantly resisted calls for devolution of British rule. Scotts want self-rule … as buoyed as most Scotts by North Ireland’s prospective self-rule Wales is following Scotland, and moving towards a call for an elected assembly with devolved powers … A self-governing Wales would be part of the EU … an independent Wales within the European community … Wales could participate directly in forthcoming EC meetings … … a fully self-governing Wales within the European Community.

A plug for Pyramids Conservatives oppose any form of devolution. The conservatives are opposed to devolution. The UK’s Tory Prime Minister adamantly resisted calls for devolution of British rule. Scotts want self-rule … as buoyed as most Scotts by North Ireland’s prospective self-rule Wales is following Scotland, and moving towards a call for an elected assembly with devolved powers … A self-governing Wales would be part of the EU … an independent Wales within the European community … Wales could participate directly in forthcoming EC meetings … … a fully self-governing Wales within the European Community. SCU name, given by annotator Candidate hypothesis? Candidate Text?