Textual Entailment 1 Textual Entailment Al Akhwayn Summer School 2015 Horacio Rodríguez TALP Research Center Dept. Computer Science Universitat Politècnica.

Slides:



Advertisements
Similar presentations
COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
Advertisements

S-Match: an Algorithm and an Implementation of Semantic Matching Pavel Shvaiko 1 st European Semantic Web Symposium, 11 May 2004, Crete, Greece paper with.
COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Recognizing Textual Entailment Challenge PASCAL Suleiman BaniHani.
LEDIR : An Unsupervised Algorithm for Learning Directionality of Inference Rules Advisor: Hsin-His Chen Reporter: Chi-Hsin Yu Date: From EMNLP.
Robust Textual Inference via Graph Matching Aria Haghighi Andrew Ng Christopher Manning.
Recognizing Textual Entailment Progress towards RTE 4 Scott Settembre University at Buffalo, SNePS Research Group
47 th Annual Meeting of the Association for Computational Linguistics and 4 th International Joint Conference on Natural Language Processing Of the AFNLP.
Shallow Processing: Summary Shallow Processing Techniques for NLP Ling570 December 7, 2011.
Region labelling Giving a region a name. Image Processing and Computer Vision: 62 Introduction Region detection isolated regions Region description properties.
Information Extraction and Ontology Learning Guided by Web Directory Authors:Martin Kavalec Vojtěch Svátek Presenter: Mark Vickers.
TEXTUAL ENTAILMENT Lliçons sobre un desastre anunciat Horacio Rodríguez.
Automatic Classification of Semantic Relations between Facts and Opinions Koji Murakami, Eric Nichols, Junta Mizuno, Yotaro Watanabe, Hayato Goto, Megumi.
Introduction to CL Session 1: 7/08/2011. What is computational linguistics? Processing natural language text by computers  for practical applications.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
تمرين شماره 1 درس NLP سيلابس درس NLP در دانشگاه هاي ديگر ___________________________ راحله مکي استاد درس: دکتر عبدالله زاده پاييز 85.
Article by: Feiyu Xu, Daniela Kurz, Jakub Piskorski, Sven Schmeier Article Summary by Mark Vickers.
1 Natural Language Processing for the Web Prof. Kathleen McKeown 722 CEPSR, Office Hours: Wed, 1-2; Tues 4-5 TA: Yves Petinot 719 CEPSR,
Employing Two Question Answering Systems in TREC 2005 Harabagiu, Moldovan, et al 2005 Language Computer Corporation.
A Confidence Model for Syntactically-Motivated Entailment Proofs Asher Stern & Ido Dagan ISCOL June 2011, Israel 1.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.
Feature Selection for Automatic Taxonomy Induction The Features Input: Two terms Output: A numeric score, or. Lexical-Syntactic Patterns Co-occurrence.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Overview of the Fourth Recognising Textual Entailment Challenge NIST-Nov. 17, 2008TAC Danilo Giampiccolo (coordinator, CELCT) Hoa Trang Dan (NIST)
Answer Validation Exercise - AVE QA subtrack at Cross-Language Evaluation Forum 2007 UNED (coord.) Anselmo Peñas Álvaro Rodrigo Valentín Sama Felisa Verdejo.
Knowledge and Tree-Edits in Learnable Entailment Proofs Asher Stern and Ido Dagan (earlier partial version by Roy Bar-Haim) Download at:
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Tree Kernels for Parsing: (Collins & Duffy, 2001) Advanced Statistical Methods in NLP Ling 572 February 28, 2012.
The Second PASCAL Recognising Textual Entailment Challenge Roy Bar-Haim, Ido Dagan, Bill Dolan, Lisa Ferro, Danilo Giampicollo, Bernardo Magnini, Idan.
“How much context do you need?” An experiment about context size in Interactive Cross-language Question Answering B. Navarro, L. Moreno-Monteagudo, E.
Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
Combining Lexical Semantic Resources with Question & Answer Archives for Translation-Based Answer Finding Delphine Bernhard and Iryna Gurevvch Ubiquitous.
Knowledge and Tree-Edits in Learnable Entailment Proofs Asher Stern, Amnon Lotan, Shachar Mirkin, Eyal Shnarch, Lili Kotlerman, Jonathan Berant and Ido.
METEOR-Ranking & M-BLEU: Flexible Matching & Parameter Tuning for MT Evaluation Alon Lavie and Abhaya Agarwal Language Technologies Institute Carnegie.
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
2010/2/4Yi-Ting Huang Pennacchiotti, M., & Zanzotto, F. M. Learning Shallow Semantic Rules for Textual Entailment. Recent Advances in Natural Language.
A Language Independent Method for Question Classification COLING 2004.
Recognizing textual entailment: Rational, evaluation and approaches Source:Natural Language Engineering 15 (4) Author:Ido Dagan, Bill Dolan, Bernardo Magnini.
1 Learning Sub-structures of Document Semantic Graphs for Document Summarization 1 Jure Leskovec, 1 Marko Grobelnik, 2 Natasa Milic-Frayling 1 Jozef Stefan.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
A Systematic Exploration of the Feature Space for Relation Extraction Jing Jiang & ChengXiang Zhai Department of Computer Science University of Illinois,
Relation Alignment for Textual Entailment Recognition Cognitive Computation Group, University of Illinois Experimental ResultsTitle Mark Sammons, V.G.Vinod.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
1 A Web Search Engine-Based Approach to Measure Semantic Similarity between Words Presenter: Guan-Yu Chen IEEE Trans. on Knowledge & Data Engineering,
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Toward an Open Source Textual Entailment Platform (Excitement Project) Bernardo Magnini (on behalf of the Excitement consortium) 1 STS workshop, NYC March.
Towards Entailment Based Question Answering: ITC-irst at Clef 2006 Milen Kouylekov, Matteo Negri, Bernardo Magnini & Bonaventura Coppola ITC-irst, Centro.
Exploiting Named Entity Taggers in a Second Language Thamar Solorio Computer Science Department National Institute of Astrophysics, Optics and Electronics.
SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
NTNU Speech Lab 1 Topic Themes for Multi-Document Summarization Sanda Harabagiu and Finley Lacatusu Language Computer Corporation Presented by Yi-Ting.
Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan,
Recognising Textual Entailment Johan Bos School of Informatics University of Edinburgh Scotland,UK.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
Learning Textual Entailment from Examples
Reading Report: Open QA Systems
Knowledge Representation for Natural Language Understanding
CS224N Section 3: Corpora, etc.
Presentation transcript:

Textual Entailment 1 Textual Entailment Al Akhwayn Summer School 2015 Horacio Rodríguez TALP Research Center Dept. Computer Science Universitat Politècnica de Catalunya

Textual Entailment 2 Outline Introduction RTE in PASCAL and TAC Techniques Systems Paraphrasing

Textual Entailment 3 Readings Textual Entailment Community: –The RTE Resource Pool can now be accessed from: –The Textual Entailment Subzone can now be accessed from: Textual Entailment Resource Pool –Textual Entailment Resource PoolTextual Entailment Resource Pool PASCAL Challenges TAC has been proposed as a generic task that captures major semantic inference needs across many natural language processing applications. TAC challenges

Textual Entailment 4 Readings Workshops –ACL 2005 Workshop on Empirical Modeling of Semantic Equivalence and Entailment, 2005ACL 2005 Workshop on Empirical Modeling of Semantic Equivalence and Entailment, 2005 –First PASCAL Recognising Textual Entailment Challenge (RTE-1), 2005First PASCAL Recognising Textual Entailment Challenge (RTE-1), 2005 –Second PASCAL Recognising Textual Entailment Challenge (RTE-2), 2006Second PASCAL Recognising Textual Entailment Challenge (RTE-2), 2006 –Third PASCAL Recognising Textual Entailment Challenge (RTE-3), 2007Third PASCAL Recognising Textual Entailment Challenge (RTE-3), 2007 –Answer Validation Exercise at CLEF 2006 (AVE 2006)Answer Validation Exercise at CLEF 2006 (AVE 2006) –Answer Validation Exercise at CLEF 2007 (AVE 2007)Answer Validation Exercise at CLEF 2007 (AVE 2007) –TAC workshops since 2008 Surveys – [Ghuge, Bhattacharya, 2013]

Textual Entailment 5 Readings Thesis –Oren Glickman (PHD, 2006) –Idan Szpecktor (MSC, 2005, PHD, 2009) –Milen Kouylekov (PHD, 2006) –Regina Barzilay (PHD, 2004) –Elena Cabrio (PHD, 2011) –Ó scar Ferr á ndez (PHD, 2009) –Prodromos Malakasiotis (PHD, 2011) –Annisa Ihsani (MSC, 2012) –Roy Bar Haim (PHD, 2010) –Shachar Mirkin (PHD, 2011)

Textual Entailment 6 Textual Entailment Textual entailment recognition is the task of deciding, given two text fragments, whether the meaning of one text is entailed (can be inferred) from another text. This task captures generically a broad range of inferences that are relevant for multiple applications. For example, a QA system has to identify texts that entail the expected answer. Given the question "Who killed Kennedy?", the text "the assassination of Kennedy by Oswald" entails the expected answer form "Oswald killed Kennedy"..

Textual Entailment 7 Textual Entailment Applications –Question-Answering –Information Extraction –Information Retrieval –Multi-Document Summarization –Named Entity Recognition –Temporal and Spatial Normalization –Semantic Parsing –Natural Language Generation

Textual Entailment 8 Textual Entailment Equivalence (Paraphrase): expr1  expr2 Entailment: expr1  expr2 – more general Directional relation between two text fragments: Text (t) and Hypothesis (h): t entails h (t  h) if, typically, a human reading t would infer that h is most likely true”

Textual Entailment 9 Textual Entailment examples TEXT Eyeing the huge market potential, currently led by Google, Yahoo took over search company Overture Services Inc last year. Microsoft's rival Sun Microsystems Inc. bought Star Office last month and plans to boost its development as a Web-based device running over the Net on personal computers and Internet appliances. The National Institute for Psychobiology in Israel was established in May 1971 as the Israel Center for Psychobiology by Prof. Joel. HYPOTHESIS Yahoo bought Overture. Microsoft bought Star Office. Israel was established in May ENTAILMENT TRUE FALSE

Textual Entailment 10 Methods and Approaches Word overlap –lexical, syntactic, and semantic Logical approaches –Raina et al, 2005 –Bos et al, 2005, 2006 –Moldovan et al, 2003 Graph matching approaches –Haghighi et al, 2005 –de Salvo et al, 2005 –de Marneffe et al, 2005, 2006 Paraphrases and Entailment Rules –Moldovan and Rus, 2001 –Lin and Pantel, 2001 QA –Shinyama et al, 2002 IE

Textual Entailment 11 Methods and Approaches Measure similarity between t and h (coverage of h by t): –Lexical overlap (unigram, N-gram, subsequence) –Average Matched Word Displacement –Lexical substitution (WordNet, statistical) –Syntactic matching/transformations –Lexical-syntactic variations (“paraphrases”) –Semantic role labeling and matching –Global similarity parameters (e.g. negation, modality) Sentence Alignment –Exhaustive Sentence Alignment parallel corpora comparable corpora –Web-based Sentence Alignment –Bigrams Syncronous grammars Inversion Transduction grammars Cross-pair similarity Detect mismatch (for non-entailment) Logical inference

Textual Entailment 12 Methods and Approaches Thesaurus-based Term Expansion –WN Distributional Similarity BLEU (BiLingual Evaluation Understudy) ROUGE (Recall-Oriented Understudy for Gisting Evaluation) classical statistical machine translation model –giza++ software (Och and Ney, 2003)

Textual Entailment 13 Dominant approach: Supervised Learning Features model both similarity and mismatch Train on development set and auxiliary t-h corpora t,h Features: Lexical, n-gram,syntactic semantic, global Feature vector Classifier YES NO

Textual Entailment 14 PASCAL RTE-3 Frequency

Textual Entailment 15 PASCAL RTE-3 Resources –WordNet –Extended WordNet –WordNet3.0 –TimeML –IKRIS –DIRT paraphrase database –FrameNet –VerbNet –VerbOcean –Component Library (U. Texas) –OpenCyC –SUMO –Tuple Database (Boeing) –Stanford ’ s additions to Wn

Textual Entailment 16 Notable Systems Idan Szpektor (2005) Scaling Web-based Acquisition of Entailment Relations (Ms. thesis) Idan Szpektor et al (2004) Idan Szpektor and Ido Dagan (2007) Lorenza Romano et al (2007) Ido Dagan et al (2008) TEASE and improvements

Textual Entailment 17 Glickman Acquiring Lexical Entailment Relations –identify lexical paraphrases of verbs

Textual Entailment 18 DFKI Rui Wang, G ü nter Neumann Based on: –H is usually textually shorter than T –not all information in T is relevant to make decisions for the entailment –the dissimilarity of relations among the same topics between T and H are of great importance. Process –starting from H to T –exclude irrelevante information from T –represent the structural differences between T and H by means of a set of Closed-Class Symbols (Entailment Patterns – EPs) –classification using subsequence kernels

Textual Entailment 19 COGEX Combination of LEX, COGEXd, COGEXc EXtended WordNet Knowledge Base (XWN-KB) –XWN Lexical Chains –coarse-grained sense inventory for WordNet 2.1 released for Task #7 in SemEval This clustering was created automatically with the aid of a methodology described in (Navigli, 2006). NLP Axioms –links a NE to its set of aliases Named Entity Check –deducts points for each pair whose H contains at least one named entity not- derivable from T COGEX, Tatu 2006, 2007

Textual Entailment 20 Lexical Alignment –Maximum Entropy classifier to compute the probability that an element selected from a text corresponds to – or can be aligned with – an element selected from a hypothesis. –Three-step Process: First, sentences were decomposed into a set of “alignable chunks” that were derived from the output of a chunk parser and a collocation detection system. Next, chunks from the text (C t ) and hypothesis (C h ) were assembled into an alignment matrix (C t  C h ). Finally, each pair of chunks were then submitted to a classifier which output the probability that the pair represented a positive example of alignment. LCC Groundhog, Hickl, 2006 Groundhog

Textual Entailment 21 Discourse Commitment-based Framework LCC, Hickl, 2007 LCC Hickl

Textual Entailment 22 De Marneffe et al, 2005, 2006 Chambers et al, 2007 Graph matching –Represent sentences as typed dependency trees –Find low-cost alignment (using lexical & structural match costs) T:Thirteen soldiers lost their lives in today’s ambush H:Several troops were killed in the ambush Stanford

Textual Entailment 23 Zanzotto et al, 2006, 2007 cross-pair similarity –similarity measure aiming at capturing rewrite rules from training examples, computing a cross-pair similarity K S ((T',H'), (T'',H'')). –if two pairs are similar, it is extremely likely that they have the same entailment value. The key point is the use of placeholders to mark the relations between the sentence words. A placeholder co-indexes two substructures in the parse trees of text and hypothesis –a tree similarity measure K T (  1,  2 ) (Collins and Duffy, 2002) that counts the subtrees that  1 and  2 have in common –a substitution function t(·, c) that changes names of the placeholders in a tree according to a set of correspondences between placeholders c Tor Vergata

Textual Entailment 24 Process –Linguistic Processing –Semantic-based distance measures –Classifier Adaboost SVM UPC

Textual Entailment 25 Linguistic Processing –tokenization –morphologic tagging –lemmatization –fine grained Named Entities Recognition and Classification –syntactic parsing and robust detection of verbal predicate arguments Spear parser (Surdeanu, 2005) –semantic labeling, with WordNet synsets –Magnini´s domain markers –EuroWordNet Top Concept Ontology labels UPC

Textual Entailment 26 "Romano_Prodi 1 is 2 the 3 prime 4 minister 5 of 6 Italy 7 " i_en_proper_person(1), entity_has_quality(2), entity(5), i_en_country(7), quality(4), which_entity(2,1), which_quality(2,5), mod(5,7), mod(5,4). UPC

Textual Entailment 27 Semantic-based distance measures between T and H - Strict overlapping of unary predicates. - Strict overlapping of binary predicates. - Loose overlapping of unary predicates. - Loose overlapping of binary predicates. UPC

Textual Entailment 28 UPC

Textual Entailment 29 Approaching RTE from Logic Inference hypothesis Logical Representation NLP including Semantic Interpretation Logical Inference text

Textual Entailment 30 Johan Bos, 2007 Components of Nutcracker: –The C&C parser for CCG –Boxer –Vampire, a FOL theorem prover –Paradox and Mace, FOL model builders Background knowledge –WordNet [hyponyms, synonyms] –NomLex [nominalisations] Nutcracker, Roma (La Sapienza) Nutcracker