EVALITA 2009 Recognizing Textual Entailment (RTE) Italian Chapter Johan Bos 1, Fabio Massimo Zanzotto 2, Marco Pennacchiotti 3 1 University of Rome La.

Slides:



Advertisements
Similar presentations
Dealing with Italian Temporal Expressions: the ITA-Chronos System Matteo Negri Fondazione Bruno Kessler - IRST, Trento - Italy EVALITA 2007.
Advertisements

Rationale for a multilingual corpus for machine translation evaluation Debbie Elliott Anthony Hartley Eric Atwell Corpus Linguistics 2003, Lancaster, England.
Specialized models and ranking for coreference resolution Pascal Denis ALPAGE Project Team INRIA Rocquencourt F Le Chesnay, France Jason Baldridge.
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.
Markpong Jongtaveesataporn † Chai Wutiwiwatchai ‡ Koji Iwano † Sadaoki Furui † † Tokyo Institute of Technology, Japan ‡ NECTEC, Thailand.
Question Answering for Machine Reading Evaluation Evaluation Campaign at CLEF 2011 Anselmo Peñas (UNED, Spain) Eduard Hovy (USC-ISI, USA) Pamela Forner.
Recognizing Textual Entailment Challenge PASCAL Suleiman BaniHani.
Baselines for Recognizing Textual Entailment Ling 541 Final Project Terrence Szymanski.
Fabio Massimo Zanzotto and Danilo Croce University of Rome “Tor Vergata” Roma, Italy Reading what Machines ‘Think’
HOO 2012: A Report on the Preposition and Determiner Error Correction Shared Task Robert Dale, Ilya Anisimoff and George Narroway Centre for Language Technology.
FATE: a FrameNet Annotated corpus for Textual Entailment Marco Pennacchiotti, Aljoscha Burchardt Computerlinguistik Saarland University, Germany LREC 2008,
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
The Unreasonable Effectiveness of Data Alon Halevy, Peter Norvig, and Fernando Pereira Kristine Monteith May 1, 2009 CS 652.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Human Language Technologies. Issue Corporate data stores contain mostly natural language materials. Knowledge Management systems utilize rich semantic.
Fabio Massimo Zanzotto and Lorenzo Dell’Arciprete University of Rome “Tor Vergata” Roma, Italy Efficient kernels for sentence pair classification.
Automatic Classification of Semantic Relations between Facts and Opinions Koji Murakami, Eric Nichols, Junta Mizuno, Yotaro Watanabe, Hayato Goto, Megumi.
Integrating ontological and linguistic knowledge for Conceptual Information Extraction Roberto Basili, Michele Vindigni, Fabio Massimo Zanzotto Università.
Longbiao Kang, Baotian Hu, Xiangping Wu, Qingcai Chen, and Yan He Intelligent Computing Research Center, School of Computer Science and Technology, Harbin.
ELN – Natural Language Processing Giuseppe Attardi
CAREERS IN LINGUISTICS OUTSIDE OF ACADEMIA CAREERS IN INDUSTRY.
Overview of the Fourth Recognising Textual Entailment Challenge NIST-Nov. 17, 2008TAC Danilo Giampiccolo (coordinator, CELCT) Hoa Trang Dan (NIST)
Answer Validation Exercise - AVE QA subtrack at Cross-Language Evaluation Forum 2007 UNED (coord.) Anselmo Peñas Álvaro Rodrigo Valentín Sama Felisa Verdejo.
1 7-Speech Recognition (Cont’d) HMM Calculating Approaches Neural Components Three Basic HMM Problems Viterbi Algorithm State Duration Modeling Training.
INTRODUCTION TO ARTIFICIAL INTELLIGENCE Truc-Vien T. Nguyen Lab: Named Entity Recognition.
Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Institute for System Programming of RAS.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Fabio Massimo Zanzotto
University of Sheffield, NLP Entity Linking Kalina Bontcheva © The University of Sheffield, This work is licensed under the Creative Commons.
Combining Lexical Semantic Resources with Question & Answer Archives for Translation-Based Answer Finding Delphine Bernhard and Iryna Gurevvch Ubiquitous.
Assessing the Impact of Frame Semantics on Textual Entailment Authors: Aljoscha Burchardt, Marco Pennacchiotti, Stefan Thater, Manfred Pinkal Saarland.
Knowledge and Tree-Edits in Learnable Entailment Proofs Asher Stern, Amnon Lotan, Shachar Mirkin, Eyal Shnarch, Lili Kotlerman, Jonathan Berant and Ido.
Multi-Prototype Vector Space Models of Word Meaning __________________________________________________________________________________________________.
Answer Validation Exercise - AVE QA subtrack at Cross-Language Evaluation Forum UNED (coord.) Anselmo Peñas Álvaro Rodrigo Valentín Sama Felisa Verdejo.
Text classification Day 35 LING Computational Linguistics Harry Howard Tulane University.
2010/2/4Yi-Ting Huang Pennacchiotti, M., & Zanzotto, F. M. Learning Shallow Semantic Rules for Textual Entailment. Recent Advances in Natural Language.
The TERN Task EVALITA 2007 Valentina Bartalesi Lenzi & Rachele Sprugnoli
Combining terminology resources and statistical methods for entity recognition: an evaluation Angus Roberts, Robert Gaizauskas, Mark Hepple, Yikun Guo.
Recognizing textual entailment: Rational, evaluation and approaches Source:Natural Language Engineering 15 (4) Author:Ido Dagan, Bill Dolan, Bernardo Magnini.
Coşkun Mermer, Hamza Kaya, Mehmet Uğur Doğan National Research Institute of Electronics and Cryptology (UEKAE) The Scientific and Technological Research.
Entity Mention Detection using a Combination of Redundancy-Driven Classifiers Silvana Marianela Bernaola Biggio, Manuela Speranza, Roberto Zanoli bernaola,
Relation Alignment for Textual Entailment Recognition Cognitive Computation Group, University of Illinois Experimental ResultsTitle Mark Sammons, V.G.Vinod.
Evgeniy Gabrilovich and Shaul Markovitch
Seminar Topics and Projects Giuseppe Attardi Dipartimento di Informatica Università di Pisa.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Emotion Detection in Customer Care Narendra Gupta, Mazin Gilbert, and Giuseppe Di Fabbrizio AT&T Labs - Research, Inc ACL.
Toward an Open Source Textual Entailment Platform (Excitement Project) Bernardo Magnini (on behalf of the Excitement consortium) 1 STS workshop, NYC March.
1 Measuring the Semantic Similarity of Texts Author : Courtney Corley and Rada Mihalcea Source : ACL-2005 Reporter : Yong-Xiang Chen.
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks EMNLP 2008 Rion Snow CS Stanford Brendan O’Connor Dolores.
1 CPA: Where do we go from here? Research Institute for Information and Language Processing, University of Wolverhampton; UPF Barcelona; University of.
Fabio Massimo Zanzotto Alessandro Moschitti Experimenting a “general purpose” textual entailment learner in AVE University of Rome “Tor Vergata” Italy.
Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,
1 7-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches Recognition Theories Bayse Rule Simple Language Model P(A|W) Network Types.
Recognising Textual Entailment Johan Bos School of Informatics University of Edinburgh Scotland,UK.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
Korean version of GloVe Applying GloVe & word2vec model to Korean corpus speaker : 양희정 date :
Concept Grounding to Multiple Knowledge Bases via Indirect Supervision
Learning Textual Entailment from Examples
张昊.
Text Analytics Giuseppe Attardi Università di Pisa
Social Knowledge Mining
Recognizing Partial Textual Entailment
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Introduction Task: extracting relational facts from text
What is the Entrance Exams Task
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Hierarchical, Perceptron-like Learning for OBIE
By Hossein Hematialam and Wlodek Zadrozny Presented by
Machine Reading.
Presentation transcript:

EVALITA 2009 Recognizing Textual Entailment (RTE) Italian Chapter Johan Bos 1, Fabio Massimo Zanzotto 2, Marco Pennacchiotti 3 1 University of Rome La Sapienza, Italy 2 University of Rome Tor Vergata, Italy 3 Yahoo! Labs, Sunnyvale, CA, United States

F.M.Zanzotto Outline Introduction to the Recognizing Textual Entailment in Italian (iRTE) Building the corpus: Using Wikipedia Evaluation and results: –dataset and metrics –systems results Conclusion and future works

F.M.Zanzotto What is Textual Entailment Recognition? T1T1 H1H1 At the end of the year, all solid companies pay dividends. At the end of the year, all solid insurance companies pay dividends. T 1 H 1 The textual entailment recognition task: determine whether or not a text T implies an hypothesis H T1T1 H2H2 At the end of the year, all solid companies pay dividends. At the end of the year, all solid companies pay cash dividends. T 1 H 2 Why it is extremely difficult?

F.M.Zanzotto Using Wikipedia Revisions for Building a Corpus It is well known that Wikipedia is an open encyclopedia: –where every person can behave as an author –inserting new entries or modifying existing ones. If S1 is a piece of text in Wikipedia before it is modified by an author, and S2 is its revision Our hypothesis is that (S1, S2) pairs extracted from the Wikipedia database, represent good candidate of both true and false entailment pairs (T,H)

F.M.Zanzotto Using Wikipedia Revisions for Building a Corpus S1S1 S2S2 Tuttora, nel 2008, e a 40 anni detà, delizia seppur con qualche pausa a causa di qualche infortunio in più, i suoi tifosi. Tuttora, nel 2008, e a 40 anni detà, delizia seppur con qualche infortunio in più, i suoi tifosi S1S1 S2S2 In carcere si convertì al cattolicesimo, si sposò e visse fino al 1981, senza che di lei si sapesse mediaticamente più nulla. In carcere si convertì al cattolicesimo, si sposò e visse fino al 1981, senza che di lei si sapesse più nulla

F.M.Zanzotto Using Wikipedia Revisions for Building a Corpus Annotation Guidelines The annotators classified each pair into one of the following classes: –bidirectional: S1 entails S2 and viceversa –left: S1 entails S2, but not viceversa –right: S2 entails S1, but not viceversa –no: neither S1 entails S2, nor viceversa –reject: rejected pairs

F.M.Zanzotto iRTE Corpus Description Selecting Relevant Wikipedia Entries Final Corpus –Training: 400 balanced pairs –Testing: 400 balanced pairs

F.M.Zanzotto Participants We had two participants: The Alicante/Pisa Team –machine learning classifier –features derived from lexical distances, part-of-speech information Semantic knowledge taken from SIMPLE-CLIPS, an Italian language resource The FBK Irsts team –EDITS (Edit Distance Textual Entailment Suite), a freely available open source tool for Recognizing Textual Entailment (RTE)

F.M.Zanzotto Participants Results

F.M.Zanzotto Conclusion Compared to the English edition, the Italian RTE shared task attracted relatively few participants. Most of the research on Natural Language Processing focusses on English. Is it too early in the development of Italian NLP for organising a shared task that requires complete systems comprising several layers of linguistic analysis and having access to lexical resources?