Linguistic Resources for the 2013 TAC KBP Cold Start Evaluation Joe Ellis (presenter), Jeremy Getman, Jonathan Wright, Stephanie Strassel Linguistic Data.

Presentation transcript:

Linguistic Resources for the 2013 TAC KBP Cold Start Evaluation Joe Ellis (presenter), Jeremy Getman, Jonathan Wright, Stephanie Strassel Linguistic Data Consortium University of Pennsylvania, USA

Query Selection
- Annotators search and annotate chains of entities connected by KBP slots
- Cold Start queries comprised of Entity – Slot 0 – Slot 1
- Inverse slots to increase connectivity, e.g.
  - per:cities_of_residence – gpe:residents_of_city
  - org:founded_by – {per,org,gpe}:organizations_founded
  - org:top_members_employees – per:top_member_employee_of
- Cold Start corpus
  - KBA output
  - Comprised of web documents from Ocala, FL; Kentucky; Guyana
- Example chain: Appleton Museum of Art – org:top_members_employees – John Lofgren – per:title – director
TAC KBP Evaluation Workshop – NIST, November 18-19, 2013
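The query shape and the inverse-slot pairs described on this slide can be sketched in Python as follows. This is only an illustration: the `ColdStartQuery` class, the `INVERSE_SLOTS` table layout, and the `inverse_slots` helper are hypothetical names, not part of any official TAC KBP tooling, and the `{per,org,gpe}` shorthand is expanded into three slot names as an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional

# Inverse slot pairs quoted on the slide. The dict layout is an assumption;
# the {per,org,gpe} shorthand is expanded into three separate slot names.
INVERSE_SLOTS: Dict[str, List[str]] = {
    "per:cities_of_residence": ["gpe:residents_of_city"],
    "org:founded_by": [
        "per:organizations_founded",
        "org:organizations_founded",
        "gpe:organizations_founded",
    ],
    "org:top_members_employees": ["per:top_member_employee_of"],
}


@dataclass(frozen=True)
class ColdStartQuery:
    """Hypothetical container for an Entity - Slot 0 - Slot 1 query."""
    entity: str
    slot0: str
    slot1: Optional[str] = None

    def hops(self) -> int:
        # One hop when only Slot 0 is given, two hops when Slot 1 is present.
        return 1 if self.slot1 is None else 2


def inverse_slots(slot: str) -> List[str]:
    """Return the inverse slot names listed for `slot`, or an empty list."""
    return INVERSE_SLOTS.get(slot, [])


# The example chain from the slide, expressed as a two-hop query.
query = ColdStartQuery("Appleton Museum of Art", "org:top_members_employees", "per:title")
```

Here `query.hops()` is 2, and `inverse_slots("org:top_members_employees")` returns the single inverse named on the slide.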

Annotation
- Unlike other SF tasks, Cold Start annotation is performed concurrently with query development
- Multiple fillers at each “hop” level, all of which must be annotated and correctly connected to one another
- Example: London – gpe:residents_of_city – per:charges
  - Lance Barrett: first-degree attempted burglary; theft of a firearm; carrying a concealed weapon
  - Lesa Bailey: criminal conspiracy to make meth; unlawful possession of meth precursors; possession of a controlled substance
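One way to picture "multiple fillers at each hop, correctly connected" is a mapping from each hop-0 filler to its own hop-1 fillers, flattened into complete chains. The sketch below uses the slide's London example; the data layout, the `expand_chains` helper, and the way the charge strings are split are assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hop-0 fillers (residents of London) mapped to their hop-1 fillers (charges).
# The split of the charge strings is inferred from the slide text.
HOP_FILLERS: Dict[str, List[str]] = {
    "Lance Barrett": [
        "first-degree attempted burglary",
        "theft of a firearm",
        "carrying a concealed weapon",
    ],
    "Lesa Bailey": [
        "criminal conspiracy to make meth",
        "unlawful possession of meth precursors",
        "possession of a controlled substance",
    ],
}


def expand_chains(
    entity: str, slot0: str, slot1: str, fillers: Dict[str, List[str]]
) -> List[Tuple[str, str, str, str, str]]:
    """Flatten the nested fillers into one tuple per complete two-hop chain."""
    return [
        (entity, slot0, hop0, slot1, hop1)
        for hop0, hop1_fillers in fillers.items()
        for hop1 in hop1_fillers
    ]


chains = expand_chains("London", "gpe:residents_of_city", "per:charges", HOP_FILLERS)
```

Each hop-1 filler stays attached to the hop-0 filler it belongs to, so a charge is never linked to the wrong person.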

Assessment
- Assess validity of fillers & justification from humans & systems
- Filler
  - Correct – meets the slot requirements and is supported in the document
  - Wrong – doesn’t meet the slot requirements and/or is not supported in the document
  - Inexact – otherwise correct, but is incomplete, includes extraneous text, or is not the most informative string in the document
- Predicate: Correct, Wrong, Inexact-Short, Inexact-Long
- Subject/Object: Correct, Wrong, Inexact
- Ignore
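The label sets above lend themselves to a simple validity check. The following is a minimal sketch, assuming each assessed part carries exactly one label; the `ALLOWED_LABELS` table and `validate_assessment` function are hypothetical, and "Ignore" is left out because the slide does not make clear which parts it applies to.

```python
from typing import Dict, FrozenSet

# Label sets as listed on the slide. "Ignore" is omitted because its scope
# is not clear from the slide text (assumption).
ALLOWED_LABELS: Dict[str, FrozenSet[str]] = {
    "filler": frozenset({"Correct", "Wrong", "Inexact"}),
    "predicate": frozenset({"Correct", "Wrong", "Inexact-Short", "Inexact-Long"}),
    "subject": frozenset({"Correct", "Wrong", "Inexact"}),
    "object": frozenset({"Correct", "Wrong", "Inexact"}),
}


def validate_assessment(assessment: Dict[str, str]) -> None:
    """Raise if a part is unknown or its label is not allowed for that part."""
    for part, label in assessment.items():
        if part not in ALLOWED_LABELS:
            raise KeyError(f"unknown assessment part: {part!r}")
        if label not in ALLOWED_LABELS[part]:
            raise ValueError(f"label {label!r} is not valid for {part!r}")


# A well-formed assessment passes silently.
validate_assessment(
    {"filler": "Correct", "predicate": "Inexact-Long", "subject": "Correct", "object": "Wrong"}
)
```

The check makes the asymmetry visible: only the predicate distinguishes Inexact-Short from Inexact-Long, so those labels are rejected for the filler, subject, and object.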

Justification
- Justification is the string(s) of text that show a relation is true
  - Predicate: includes all three pieces of information necessary to justify the entity/slot/filler relation
  - Subject: proves the entity’s involvement in the relation
  - Object: proves the filler’s involvement in the relation
- Each part can be comprised of up to two discontiguous strings
  - Predicate 1: the Islamabad headquarters of Harkat-ul-Mujahideen
  - Predicate 2: Islamabad, the capital city of Pakistan
New in 2013: Ronnie James Dio - per:date_of_death: Sunday [ ]
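The "up to two discontiguous strings per part" rule can be expressed as a small data structure. This is a sketch under stated assumptions: the `Justification` class, its field names, and the choice to let subject and object default to empty lists are all hypothetical; only the two-string limit and the predicate strings come from the slide.

```python
from dataclasses import dataclass, field
from typing import List

# Limit stated on the slide: each part may use up to two strings.
MAX_STRINGS_PER_PART = 2


@dataclass
class Justification:
    """Hypothetical record of the three justification parts for one relation."""
    predicate: List[str] = field(default_factory=list)
    subject: List[str] = field(default_factory=list)
    obj: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Enforce the per-part limit on every part.
        for name in ("predicate", "subject", "obj"):
            strings = getattr(self, name)
            if len(strings) > MAX_STRINGS_PER_PART:
                raise ValueError(
                    f"{name} has {len(strings)} strings; at most "
                    f"{MAX_STRINGS_PER_PART} are allowed"
                )


# The two predicate strings shown on the slide.
justification = Justification(
    predicate=[
        "the Islamabad headquarters of Harkat-ul-Mujahideen",
        "Islamabad, the capital city of Pakistan",
    ]
)
```

Passing a third predicate string raises a `ValueError`, which mirrors the limit stated on the slide.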

2013 Discoveries
- New justification scheme used in unexpected, creative ways
- Additional predicate strings used to disambiguate entities
  - Predicate 1: Ginzburg died late Sunday of cardiac arrest.
  - Predicate 2: Vitaly Ginzburg, a Nobel Prize-winning Russian physicist and one of the fathers of the Soviet hydrogen bomb

Delivered 2013 Resources

Corpus Title | Type | LDC Catalog | Language | Size
TAC 2013 KBP English Cold Start Evaluation Queries and Annotations V1.1 | Evaluation | LDC2013E87 | English | 326 Queries
TAC 2013 KBP English Cold Start Evaluation Assessment Results | Evaluation | LDC2013E101 | English | 6,755 Assessments