On the Issue of Combining Anaphoricity Determination and Antecedent Identification in Anaphora Resolution Ryu Iida, Kentaro Inui, Yuji Matsumoto Nara Institute.

Slides:

Advertisements

Similar presentations

Ryu Iida Tokyo Institute of Technology Kentaro Inui Yuji Matsumoto Nara Institute of Science and Technology

Advertisements

Specialized models and ranking for coreference resolution Pascal Denis ALPAGE Project Team INRIA Rocquencourt F Le Chesnay, France Jason Baldridge.

A Machine Learning Approach to Coreference Resolution of Noun Phrases By W.M.Soon, H.T.Ng, D.C.Y.Lim Presented by Iman Sen.

Coreference Based Event-Argument Relation Extraction on Biomedical Text Katsumasa Yoshikawa 1), Sebastian Riedel 2), Tsutomu Hirao 3), Masayuki Asahara.

Playing the Telephone Game: Determining the Hierarchical Structure of Perspective and Speech Expressions Eric Breck and Claire Cardie Department of Computer.

A Self Learning Universal Concept Spotter By Tomek Strzalkowski and Jin Wang Original slides by Iman Sen Edited by Ralph Grishman.

Made with OpenOffice.org 1 Sentiment Classification using Word Sub-Sequences and Dependency Sub-Trees Pacific-Asia Knowledge Discovery and Data Mining.

Sunita Sarawagi.  Enables richer forms of queries  Facilitates source integration and queries spanning sources “Information Extraction refers to the.

Deep Belief Networks for Spam Filtering

CS 4705 Lecture 21 Algorithms for Reference Resolution.

Automatic Classification of Semantic Relations between Facts and Opinions Koji Murakami, Eric Nichols, Junta Mizuno, Yotaro Watanabe, Hayato Goto, Megumi.

Reference Collections: Task Characteristics. TREC Collection Text REtrieval Conference (TREC) –sponsored by NIST and DARPA (1992-?) Comparing approaches.

Empirical Methods in Information Extraction - Claire Cardie 자연어처리연구실 한 경 수

Supervised models for coreference resolution Altaf Rahman and Vincent Ng Human Language Technology Research Institute University of Texas at Dallas 1.

Improving Machine Learning Approaches to Coreference Resolution Vincent Ng and Claire Cardie Cornell Univ. ACL 2002 slides prepared by Ralph Grishman.

Information Extraction with Unlabeled Data Rayid Ghani Joint work with: Rosie Jones (CMU) Tom Mitchell (CMU & WhizBang! Labs) Ellen Riloff (University.

Longbiao Kang, Baotian Hu, Xiangping Wu, Qingcai Chen, and Yan He Intelligent Computing Research Center, School of Computer Science and Technology, Harbin.

Na-Rae Han (University of Pittsburgh), Joel Tetreault (ETS), Soo-Hwa Lee (Chungdahm Learning, Inc.), Jin-Young Ha (Kangwon University) May , LREC.

Mining and Summarizing Customer Reviews

ISMB 2003 presentation Extracting Synonymous Gene and Protein Terms from Biological Literature Hong Yu and Eugene Agichtein Dept. Computer Science, Columbia.

A Global Relaxation Labeling Approach to Coreference Resolution Coling 2010 Emili Sapena, Llu´ıs Padr´o and Jordi Turmo TALP Research Center Universitat.

A Light-weight Approach to Coreference Resolution for Named Entities in Text Marin Dimitrov Ontotext Lab, Sirma AI Kalina Bontcheva, Hamish Cunningham,

Machine translation Context-based approach Lucia Otoyo.

Processing of large document collections Part 3 (Evaluation of text classifiers, applications of text categorization) Helena Ahonen-Myka Spring 2005.

Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.

Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.

1 Named Entity Recognition based on three different machine learning techniques Zornitsa Kozareva JRC Workshop September 27, 2005.

Illinois-Coref: The UI System in the CoNLL-2012 Shared Task Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Mark Sammons, and Dan Roth Supported by ARL,

A multiple knowledge source algorithm for anaphora resolution Allaoua Refoufi Computer Science Department University of Setif, Setif 19000, Algeria .

Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali Vasileios Hatzivassiloglou The University.

Employing Active Learning to Cross-Lingual Sentiment Classification with Data Quality Controlling Shoushan Li †‡ Rong Wang † Huanhuan Liu † Chu-Ren Huang.

Incorporating Extra-linguistic Information into Reference Resolution in Collaborative Task Dialogue Ryu Iida Shumpei Kobayashi Takenobu Tokunaga Tokyo.

This work is supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number.

Multi-modal Reference Resolution in Situated Dialogue by Integrating Linguistic and Extra-Linguistic Clues Ryu Iida Masaaki Yasuhara Takenobu Tokunaga.

1 Exploiting Syntactic Patterns as Clues in Zero- Anaphora Resolution Ryu Iida, Kentaro Inui and Yuji Matsumoto Nara Institute of Science and Technology.

A Language Independent Method for Question Classification COLING 2004.

Japanese Dependency Analysis using Cascaded Chunking Taku Kudo 工藤拓 Yuji Matsumoto 松本裕治 Nara Institute Science and Technology, JAPAN.

Coreference Resolution

A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources Author: Carmen Banea, Rada Mihalcea, Janyce Wiebe Source:

1 Learning Sub-structures of Document Semantic Graphs for Document Summarization 1 Jure Leskovec, 1 Marko Grobelnik, 2 Natasa Milic-Frayling 1 Jozef Stefan.

A Cross-Lingual ILP Solution to Zero Anaphora Resolution Ryu Iida & Massimo Poesio (ACL-HLT 2011)

Noun-Phrase Analysis in Unrestricted Text for Information Retrieval David A. Evans, Chengxiang Zhai Laboratory for Computational Linguistics, CMU 34 th.

Opinion Holders in Opinion Text from Online Newspapers Youngho Kim, Yuchul Jung and Sung-Hyon Myaeng Reporter: Chia-Ying Lee Advisor: Prof. Hsin-Hsi Chen.

Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.

Incorporating Contextual Cues in Trainable Models for Coreference Resolution 14 April 2003 Ryu Iida Computational Linguistic Laboratory Graduate School.

Using Semantic Relations to Improve Passage Retrieval for Question Answering Tom Morton.

Processing of large document collections Part 5 (Text summarization) Helena Ahonen-Myka Spring 2005.

Improving Named Entity Translation Combining Phonetic and Semantic Similarities Fei Huang, Stephan Vogel, Alex Waibel Language Technologies Institute School.

Multilingual Opinion Holder Identification Using Author and Authority Viewpoints Yohei Seki, Noriko Kando,Masaki Aono Toyohashi University of Technology.

Information Transfer through Online Summarizing and Translation Technology Sanja Seljan*, Ksenija Klasnić**, Mara Stojanac*, Barbara Pešorda*, Nives Mikelić.

Machine Learning Tutorial-2. Recall, Precision, F-measure, Accuracy Ch. 5.

Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,

Evaluation issues in anaphora resolution and beyond Ruslan Mitkov University of Wolverhampton Faro, 27 June 2002.

A Maximum Entropy Based Honorificity Identification for Bengali Pronominal Anaphora Resolution Apurbalal Senapati and Utpal Garain Presented by Samik Some.

Learning Subjective Nouns using Extraction Pattern Bootstrapping Ellen Riloff School of Computing University of Utah Janyce Wiebe, Theresa Wilson Computing.

Answer Mining by Combining Extraction Techniques with Abductive Reasoning Sanda Harabagiu, Dan Moldovan, Christine Clark, Mitchell Bowden, Jown Williams.

1 Gloss-based Semantic Similarity Metrics for Predominant Sense Acquisition Ryu Iida Nara Institute of Science and Technology Diana McCarthy and Rob Koeling.

Exploiting Named Entity Taggers in a Second Language Thamar Solorio Computer Science Department National Institute of Astrophysics, Optics and Electronics.

From Words to Senses: A Case Study of Subjectivity Recognition Author: Fangzhong Su & Katja Markert (University of Leeds, UK) Source: COLING 2008 Reporter:

Virtual Examples for Text Classification with Support Vector Machines Manabu Sassano Proceedings of the 2003 Conference on Emprical Methods in Natural.

Department of Computer Science The University of Texas at Austin USA Joint Entity and Relation Extraction using Card-Pyramid Parsing Rohit J. Kate Raymond.

Twitter as a Corpus for Sentiment Analysis and Opinion Mining

Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.

The University of Illinois System in the CoNLL-2013 Shared Task Alla RozovskayaKai-Wei ChangMark SammonsDan Roth Cognitive Computation Group University.

Language Identification and Part-of-Speech Tagging

A Deep Memory Network for Chinese Zero Pronoun Resolution

Simone Paolo Ponzetto University of Heidelberg Massimo Poesio

Clustering Algorithms for Noun Phrase Coreference Resolution

Automatic Detection of Causal Relations for Question Answering

Ping LUO*, Fen LIN^, Yuhong XIONG*, Yong ZHAO*, Zhongzhi SHI^

Presentation transcript:

On the Issue of Combining Anaphoricity Determination and Antecedent Identification in Anaphora Resolution Ryu Iida, Kentaro Inui, Yuji Matsumoto Nara Institute of Science and Technology NLP-KE’05, October 30, 2005

2 Noun phrase anaphora resolution Anaphora resolution is the process of determining whether two expressions in natural language refer to the same real world entity Important process for various NLP applications : machine translation, information extraction, question answering A federal judge in Pittsburgh issued a temporary restraining order preventing Trans World Airlines from buying additional shares of USAir Group Inc. The order, requested in a suit filed by USAir, dealt another blow to TWA's bid to buy the company for $52 a share. A federal judge in Pittsburgh issued a temporary restraining order preventing Trans World Airlines from buying additional shares of USAir Group Inc. The order, requested in a suit filed by USAir, dealt another blow to TWA's bid to buy the company for $52 a share. antecedentanaphor

3 Anaphora resolution can be decomposed into two sub processes 1. Anaphoricity determination is the task of classifying whether a given noun phrase (NP) is anaphoric or non- anaphoric 2. Antecedent identification is the identification of the antecedent of a given anaphoric NP Noun phrase anaphora resolution A federal judge in Pittsburgh issued a temporary restraining order preventing Trans World Airlines from buying additional shares of USAir Group Inc. The order, requested in a suit filed by USAir, dealt another blow to TWA's bid to buy the company for $52 a share. A federal judge in Pittsburgh issued a temporary restraining order preventing Trans World Airlines from buying additional shares of USAir Group Inc. The order, requested in a suit filed by USAir, dealt another blow to TWA's bid to buy the company for $52 a share. antecedentanaphor non-anaphor

4 Previous work Early corpus-based work on anaphora resolution does not address anaphoricity determination (Hobbs `78, Lappin and Leass `94) Assuming that the anaphora resolution system knows a priori all the anaphoric noun phrases This problem has been paid attention by an increasing number of researchers (Bean and Riloff `99, Ng and Cardie `02, Uryupina `03, Ng `04) Determining anaphoricity is not a trivial problem Overall performance of anaphora resolution crucially depends on the accuracy of anaphoricity determination

5 Previous work (Cont’d) Previous efforts to tackle anaphoricity determination problem have provided the two findings 1.One useful cue for determining anaphoricity of a given NP can be obtained by searching for an antecedent (Soon et al. 01, Ng and Cardie 02a) 2.Anaphoricity determination can be effectively carried out by a binary classifier that learns instances of non- anaphoric NPs (Ng and Cardie 02b, Ng 04) None of the previous models effectively combines the strengths of these findings

6 Aim Improving anaphora resolution performance : Using better anaphoricity determination Combining sources of evidence from previous models

7 Proposal Introducing a 2-step process for combining antecedent information and non-anaphoric information We call this model the selection-and-classification model 1.Select the most likely candidate antecedent (CA) of a target NP (TNP) using the tournament model (Iida et al. `03) 2.Classify a TNP paired with CA is classified as anaphoric if CA is identified as the antecedent of TNP; otherwise TNP is judged non-anaphoric

8 2-step process for anaphora resolution A federal judge in Pittsburgh issued a temporary restraining order preventing Trans World Airlines from buying additional shares of USAir Group Inc. The order, requested in a suit filed by USAir, … candidate anaphor tournament model USAir suit USAir Group Inc order federal judge candidate anaphor candidate antecedents …

9 2-step process for anaphora resolution A federal judge in Pittsburgh issued a temporary restraining order preventing Trans World Airlines from buying additional shares of USAir Group Inc. The order, requested in a suit filed by USAir, … candidate anaphor tournament model USAir suit USAir Group Inc order federal judge candidate anaphor candidate antecedents … USAir Group Inc USAir suit USAir Group Inc Federal judge candidate anaphor candidate antecedents … order

10 2-step process for anaphora resolution USAir Group Inc candidate antecedent A federal judge in Pittsburgh issued a temporary restraining order preventing Trans World Airlines from buying additional shares of USAir Group Inc. The order, requested in a suit filed by USAir, … candidate anaphor tournament model USAir suit USAir Group Inc order federal judge candidate anaphor candidate antecedents … Anaphoricity determination model is non-anaphoric USAir score θ ana score θ ana is anaphoric and is the USAir USAir Group Inc antecedent of USAir Group Inc USAir

11 Training phase Anaphoric Non-anaphoric NANP NP5 NP4 NP3 NP2 NP1 Non-anaphoric NP set of candidate antecedents NP3 tournament model candidate antecedent Non-anaphoric instances NP3NANP ANP NP5 NP4 NP3 NP2 NP1 Anaphoric NP set of candidate antecedents Antecedent Anaphoric instances NP4ANP NPi: candidate antecedent

12 Comparison with previous approaches 1. Search-based approach (SM) (Soon et al. `01, Ng and Cardie `02) Recasting anaphora resolution as binary classification problems Comparable to the state-of-the-art rule-based system disadvantage: not use non-anaphoric instances in training 2. Classification-and-search approach (CSM) (Ng and Cardie `02, Ng `04) Introducing anaphoricity determination as a classification task The performance of the CSM is better than the SM if the threshold parameters are appropriately tuned disadvantage: not use the contextual information (i.e. whether an appropriate antecedent appears on the context)

13 Experiments Noun phrase anaphora resolution in Japanese Japanese newspaper article corpus tagged NP- anaphoric relations 90 text, 1,104 sentences Noun phrases : 876 anaphors and 6,292 non-anaphors Recall = Precision = # of correctly detected anaphoric relations # of anaphoric NPs # of correctly detected anaphoric relations # of NPs classified as anaphoric

14 Experimental setting Conduct 10-fold cross-validation with support vector machines Comparison among three models 1. Search-based model (Ng and Cardie `02) 2. Classification-and-Search model (Ng and Cardie `04) 3. Selection-and-Classification model (Proposed model) using the tournament model (Iida et al. `03)

15 Results of noun phrase anaphora resolution Proposed model Search-based model Classification-and- search model Search-based model (SM) vs. Classification-and-search model (CSM) the performance of CSM is significantly better than the SM

16 Results of noun phrase anaphora resolution Proposed model Search-based model Classification-and- search model Classification-and-search model (CSM) vs. Proposed model the proposed model outperforms the CSM in the higher-recall portion

17 Conclusion Our selection-and-classification approach to anaphora resolution improves on the performance of previous learning-based models by combining their advantages 1.Our model uses non-anaphoric instances together with anaphoric instances to induce anaphoricity classifier 2.Our model determines the anaphoricity of a given NP by taking antecedent information into account

18 Future work The majority of errors are caused by the difficulty of judging the semantic compatibility e.g.) the system outputs that “ ani (elder brother)” is anaphoric with “ kanojo (she)” The lexical resource we employed in the experiments did not contain gender information  D eveloping a lexical resource which includes a broad range of semantic compatible relations