Lance Ramshaw (with Ralph Weischedel) BBN. 2 Ontobank Coreference Part of the multi-site Ontobank effort –Intended to combine with word-sense and propositional.

Slides:



Advertisements
Similar presentations
A Machine Learning Approach to Coreference Resolution of Noun Phrases By W.M.Soon, H.T.Ng, D.C.Y.Lim Presented by Iman Sen.
Advertisements

Antalis H results presentation to Analysts September 8, 2004.
A Corpus for Cross- Document Co-Reference D. Day 1, J. Hitzeman 1, M. Wick 2, K. Crouch 1 and M. Poesio 3 1 The MITRE Corporation 2 University of Massachusetts,
Layering Semantics (Putting meaning into trees) Treebank Workshop Martha Palmer April 26, 2007.
Multilinugual PennTools that capture parses and predicate-argument structures, and their use in Applications Martha Palmer, Aravind Joshi, Mitch Marcus,
June 6, 20073rd PIRE Meeting1 Tectogrammatical Representation of English in Prague Czech-English Dependency Treebank Lucie Mladová Silvie Cinková, Kristýna.
1 An Integrated Annotation DB in OntoNotes Sameer Pradhan, Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw, and Ralph Weischedel
1 Discourse, coherence and anaphora resolution Lecture 16.
MAQUILA SOLIDARITY NETWORK Prepared for the MFAF Americas Working Group Mexico Subcommittee August 19-20, 2010 Update on the Mexican garment sector.
The JDPA Sentiment Corpus for the Automotive Domain Miriam Eckert, Lyndsie Clark, Nicolas Nicolov J.D. Power and Associates Jason S. Kessler Indiana University.
Annotating Topics of Opinions Veselin Stoyanov Claire Cardie.
April 26th, 2007 Workshop on Treebanking, HLT/NAACL, Rochester 1 Layering of Annotations in the Penn Discourse TreeBank (PDTB) Rashmi Prasad Institute.
Recognizing Implicit Discourse Relations in the Penn Discourse Treebank Ziheng Lin, Min-Yen Kan, and Hwee Tou Ng Department of Computer Science National.
Annotating Expressions of Opinions and Emotions in Language Wiebe, Wilson, Cardie.
Partial Prebracketing to Improve Parser Performance John Judge NCLT Seminar Series 7 th December 2005.
14 April 2005 RST Discourse Corpus Lynn Carlson Daniel Marcu Mary Ellen Okurowski.
Are Linguists Dinosaurs? 1.Statistical language processors seem to be doing away with the need for linguists. –Why do we need linguists when a machine.
1/13 Parsing III Probabilistic Parsing and Conclusions.
Using Information Extraction for Question Answering Done by Rani Qumsiyeh.
4/14/20051 ACE Annotation Ralph Grishman New York University.
Survey of Annotation Work Joint session Thursday afternoon, April 14 Chair: Eduard Hovy, ISI.
1/17 Probabilistic Parsing … and some other approaches.
OntoNotes project Treebank Syntax Training Data Decoders Propositions Verb Senses and verbal ontology links Noun Senses and targeted nominalizations Coreference.
Improving Machine Learning Approaches to Coreference Resolution Vincent Ng and Claire Cardie Cornell Univ. ACL 2002 slides prepared by Ralph Grishman.
EMPOWER 2 Empirical Methods for Multilingual Processing, ‘Onoring Words, Enabling Rapid Ramp-up Martha Palmer, Aravind Joshi, Mitch Marcus, Mark Liberman,
STATE FISCAL CRISIS: UPDATED IMPACTS ON K-12 AND RUSD August 16, 2012.
1 Answering Questions through Understanding and Analysis (AQUA) Ralph Weischedel and Scott Miller BBN Technologies 3 December 2001.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Machine Translation, Digital Libraries, and the Computing Research Laboratory Indo-US Workshop on Digital Libraries June 23, 2003.
Illinois-Coref: The UI System in the CoNLL-2012 Shared Task Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Mark Sammons, and Dan Roth Supported by ARL,
GROWTH TESTS. 2 The compounding effect…  Assume that you are looking at Twitter and that the revenues currently are $ 600 million. Assume also that you.
Introduction  Information Extraction (IE)  A limited form of “complete text comprehension”  Document 로부터 entity, relationship 을 추출 
The Prague (Czech-)English Dependency Treebank Jan Hajič Charles University in Prague Computer Science School Institute of Formal and Applied Linguistics.
27/03/01CROSSMARC kick-off meeting LTG Background XML-based Processing –Several years of experience in developing XML-based software –LT XML Tools –Pipeline.
Using Semantic Relations to Improve Information Retrieval Tom Morton.
I2B2 Shared Task 2011 Coreference Resolution in Clinical Text David Hinote Carlos Ramirez.
Efficiently Computed Lexical Chains As an Intermediate Representation for Automatic Text Summarization H.G. Silber and K.F. McCoy University of Delaware.
Semi-supervised Training of Statistical Parsers CMSC Natural Language Processing January 26, 2006.
NYU: Description of the Proteus/PET System as Used for MUC-7 ST Roman Yangarber & Ralph Grishman Presented by Jinying Chen 10/04/2002.
Ideas for 100K Word Data Set for Human and Machine Learning Lori Levin Alon Lavie Jaime Carbonell Language Technologies Institute Carnegie Mellon University.
GET READY Questions will run automatically. Set 2 Question  0.35.
Summarisation Work at Sheffield Robert Gaizauskas Natural Language Processing Group Department of Computer Science University of Sheffield.
Using Semantic Relations to Improve Passage Retrieval for Question Answering Tom Morton.
TimeML compliant text analysis for Temporal Reasoning Branimir Boguraev and Rie Kubota Ando.
Error Analysis for Learning-based Coreference Resolution Olga Uryupina
1 DUTIE Speech: Determining Utility Thresholds for Information Extraction from Speech John Makhoul, Rich Schwartz, Alex Baron, Ivan Bulyko, Long Nguyen,
nd PIRE project workshop1 Tectogrammatical Representation of English Silvie Cinková Lucie Mladová, Anja Nedoluzhko, Jiří Semecký, Jana Šindlerová,
Automatic recognition of discourse relations Lecture 3.
Annotation Procedure in Building the Prague Czech-English Dependency Treebank Marie Mikulová and Jan Štěpánek Institute of Formal and Applied Linguistics.
FY 2014 Budget / Actual Performance through November 2013 John W. Lawson Chief Financial Officer January 9, 2014.
CPSC 422, Lecture 27Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 27 Nov, 16, 2015.
Natural Language Processing Lecture 14—10/13/2015 Jim Martin.
Income Statement Issues. Questions What can be done to improve profits?
Human-Assisted Machine Annotation Sergei Nirenburg, Marjorie McShane, Stephen Beale Institute for Language and Information Technologies University of Maryland.
Combining Deep Linguistics Analysis and Surface Pattern Learning: A Hybrid Approach to Chinese Definitional Question Answering Fuchun Peng, Ralph Weischedel,
Using Semantic Relations to Improve Information Retrieval
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
State Budget Update Legislative Analyst’s Office C.A.S.H. Fall Conference October 15,
A Database of Narrative Schemas A 2010 paper by Nathaniel Chambers and Dan Jurafsky Presentation by Julia Kelly.
Lecture 15 Market Based Valuation Investment Analysis.
AmTrust Announces Continued Growth of Operated Earnings For The First Quarter 2016
Identifying Expressions of Opinion in Context Eric Breck and Yejin Choi and Claire Cardie IJCAI 2007.
AmTrust Announces Continued Growth of Operated Earnings For The Fourth Quarter
Preliminaries CSCI-GA.2591
NYU Coreference CSCI-GA.2591 Ralph Grishman.
A Machine Learning Approach to Coreference Resolution of Noun Phrases
LAUNCH OF THE AFRICAN ECONOMIC OUTLOOK REPORT 2008/09
A Machine Learning Approach to Coreference Resolution of Noun Phrases
Ling 566 Oct 14, 2008 How the Grammar Works.
Distributor Sales Rep Name
Presentation transcript:

Lance Ramshaw (with Ralph Weischedel) BBN

2 Ontobank Coreference Part of the multi-site Ontobank effort –Intended to combine with word-sense and propositional structure to capture lexical semantics Target coreference types: –Names, nominals, pronouns Attributes not corefed (“Bush is the president.”) Generic or underspecified nominals are not corefed, though pronouns may still refer to them –Appositives These can then be treated like copulas –Temporal expressions –Definite references to events Intended for broad coverage –Initial testing being done on Penn Treebank data

3Examples Names/Nominals/Pronouns –Elco Industries Inc. said it expects net income in the year ending June 30, 1990, to fall below a recent analyst 's estimate of $ 1.65 a share. The Rockford, Ill. maker of fasteners also said it expects to post sales in the current fiscal year that are `` slightly above '' fiscal 1989 sales of $ 155 million. Appositive constructions Heads and attributes in –the PhacoFlex intraocular lens, the first foldable silicone lens available for cataract surgery Events/Verbs –Sales of passenger cars grew 22%. The strong growth followed year-to-year increases. Temporal expressions –John spent three years in jail. In that time …

4Results Corpus: WSJ –Mention extents extracted automatically from the Treebank trees Annotated so far: ~300K words (4 annotators) Annotation speed: ~4500 words / hour Double-annotation and adjudication of 100K words: ~60 hours Interannotator agreement (using MUC coref measure) –Coreference: ~84% measured between annotators ~90% between annotator and the adjudicated version –Apposition: ~90% measured between annotators ~94% between annotator and the adjudicated version