
Overview of the TAC2013 Knowledge Base Population Evaluation: Temporal Slot Filling
Mihai Surdeanu
with a lot of help from: Hoa Dang, Joe Ellis, Heng Ji, Ralph Grishman, and Taylor Cassidy

Introduction
Temporal slot filling (TSF) grounds fillers extracted by SF by finding the start and end dates when they were valid.
This was the 2nd year for a KBP TSF evaluation
– There was a pilot evaluation in 2011
A few new things this year

New: Seven Slots Considered
per:spouse
per:title
per:employee_or_member_of
per:cities_of_residence
per:statesorprovinces_of_residence
per:countries_of_residence
org:top_members_employees

New: Input Queries

Both entity and filler given!

New: Input Queries
Provenance and justification given!

New: Provenance of Dates

Provenance of date mentions used for normalization must be reported!

Scoring Metric
Same four-tuple used to represent dates: [T1 T2 T3 T4]
– Relation is true for a period beginning between T1 and T2
– Relation is true for a period ending between T3 and T4
Has limitations
– Recurring events
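The four-tuple can be read as a pair of constraints on when a relation starts and ends. The sketch below is illustrative only: the `TemporalTuple` class, the `is_consistent` helper, and the use of `None` for an unknown bound are assumptions made for this example, not part of the evaluation's tooling.

```python
from datetime import date
from typing import NamedTuple, Optional


class TemporalTuple(NamedTuple):
    """[T1 T2 T3 T4]; None stands for an unknown (unbounded) value."""
    t1: Optional[date]  # earliest possible start
    t2: Optional[date]  # latest possible start
    t3: Optional[date]  # earliest possible end
    t4: Optional[date]  # latest possible end


def is_consistent(tup: TemporalTuple, start: date, end: date) -> bool:
    """True if a relation holding from `start` to `end` satisfies the tuple."""
    def at_least(bound: Optional[date], d: date) -> bool:
        return bound is None or bound <= d

    def at_most(bound: Optional[date], d: date) -> bool:
        return bound is None or d <= bound

    return (at_least(tup.t1, start) and at_most(tup.t2, start)
            and at_least(tup.t3, end) and at_most(tup.t4, end))


# Relation started some time in 2001 and ended in 2005 or later.
example = TemporalTuple(date(2001, 1, 1), date(2001, 12, 31), date(2005, 1, 1), None)
print(is_consistent(example, date(2001, 6, 1), date(2007, 3, 1)))
print(is_consistent(example, date(2002, 1, 1), date(2007, 3, 1)))
```

Because one tuple describes a single start window and a single end window, a relation that holds during two separate periods cannot be captured, which is the recurring-events limitation noted above.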

Scoring Metric
For each query:
– System output: S = [t1 t2 t3 t4]
– Gold tuple: S_g = [g1 g2 g3 g4]
– Individual query score:
Overall:
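The formulas for this slide are not reproduced in the text, so the following is only a sketch under a stated assumption: that the per-query score has the form given in the published KBP temporal slot filling overviews, Q(S) = (1/4) Σ 1 / (1 + |t_i − g_i|), with differences measured in years. The function name `query_score` and the representation of dates as fractional years are hypothetical choices for illustration.

```python
import math


def query_score(system, gold):
    """Assumed per-query score: mean closeness of the four tuple components.

    Components are years as floats; math.inf / -math.inf mark unbounded values.
    A score of 1.0 means all four components match the gold tuple exactly.
    """
    total = 0.0
    for t, g in zip(system, gold):
        if t == g:
            # Exact match, including inf == inf (avoids inf - inf = nan).
            total += 1.0
        else:
            # Contribution decays with distance; an infinite gap contributes 0.
            total += 1.0 / (1.0 + abs(t - g))
    return total / 4.0


gold = (2000.0, 2001.0, 2005.0, 2006.0)
print(query_score(gold, gold))
print(query_score((2000.0, 2002.0, 2005.0, 2006.0), gold))
```

Under this assumed form, overly vague tuples are penalized: an infinite bound where the gold tuple has a finite date contributes nothing to the sum.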

PARTICIPANTS

Participants

Participation Summary
[Chart: teams and submissions]

RESULTS

Data
273 queries
Only 201 were actually scored
– 5 dropped because neither LDC nor systems found correct fillers
– 67 dropped because gold annotations had an invalid temporal interval
Valid interval: T1 ≤ T2, T3 ≤ T4, and T1 ≤ T4
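The validity condition is simple enough to state as a check. This is a minimal sketch, assuming that a missing component (`None`) places no constraint; that treatment of missing values and the helper name `is_valid_interval` are assumptions, not something the slide specifies.

```python
from datetime import date


def is_valid_interval(t1, t2, t3, t4):
    """Check T1 <= T2, T3 <= T4, and T1 <= T4; None components are skipped."""
    def le(a, b):
        return a is None or b is None or a <= b

    return le(t1, t2) and le(t3, t4) and le(t1, t4)


# Query counts from the slide: 273 total, 5 and 67 dropped.
scored = 273 - 5 - 67
print(scored)

print(is_valid_interval(date(2001, 1, 1), date(2002, 1, 1), date(2005, 1, 1), date(2006, 1, 1)))
print(is_valid_interval(date(2007, 1, 1), date(2008, 1, 1), date(2005, 1, 1), date(2006, 1, 1)))
```

The second call fails the T1 ≤ T4 condition: the earliest possible start falls after the latest possible end.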

Scoring and Baseline
Justification ignored (for now) in scoring
DCT-WITHIN baseline of Ji et al. (2011)
– Assumption: the relation is valid at the doc date
– Tuple: [−∞, doc date, doc date, +∞]
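If the relation is assumed to hold at the document date, the four-tuple semantics imply that it started no later than that date and ended no earlier. The sketch below builds such a tuple; the function name `dct_within`, the fractional-year representation, and the use of infinities for the open bounds are assumptions made for illustration.

```python
import math


def dct_within(doc_year: float):
    """Baseline tuple for a relation assumed to hold at the document date.

    Start lies in (-inf, doc date]; end lies in [doc date, +inf).
    """
    return (-math.inf, doc_year, doc_year, math.inf)


t1, t2, t3, t4 = dct_within(2012.5)
print((t1, t2, t3, t4))
# The baseline tuple always satisfies T1 <= T2, T3 <= T4, and T1 <= T4.
print(t1 <= t2 and t3 <= t4 and t1 <= t4)
```

Such a tuple never contradicts a relation that truly holds at the document date, but it says nothing specific about when the relation started or ended.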

Results
[Figure: results per slot: org:top_members_employees, per:cities_of_residence, per:countries_of_residence, per:employee_or_member_of, per:spouse, per:statesorprovinces_of_residence, per:title]
– 2/5 systems outperformed the baseline; 3/4 did in 2011
– Perspective: top system is at 48% of human performance
– Locations of residence tend to perform worse than average
– Employment relations tend to perform better than average

Technology
Most groups used distant supervision (DS) to assign labels to tuples
– Training data:
   Freebase (structured): RPI, UNED
   Wikipedia infoboxes (semi-structured): Microsoft
– Labels: Start, End, In, Start-And-End
Ensemble models for DS (RPI)
– Explicit features + tree kernels
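To make the labeling step concrete, here is a hypothetical heuristic for assigning one of the four labels to a date mention by comparing it with the start and end dates recorded in a knowledge base. It is not any participant's actual system: the function `ds_label`, the interval representation of a date mention, and the matching rules are all assumptions for illustration.

```python
from datetime import date
from typing import Optional


def ds_label(m_lo: date, m_hi: date, kb_start: date, kb_end: date) -> Optional[str]:
    """Label a date mention (normalized to the interval [m_lo, m_hi])
    against the start and end dates stored in the knowledge base."""
    has_start = m_lo <= kb_start <= m_hi
    has_end = m_lo <= kb_end <= m_hi
    if has_start and has_end:
        return "Start-And-End"
    if has_start:
        return "Start"
    if has_end:
        return "End"
    if kb_start < m_lo and m_hi < kb_end:
        return "In"
    return None  # mention lies outside the relation; no label assigned


def year(y: int):
    """Normalize a bare year mention to a full-year interval."""
    return date(y, 1, 1), date(y, 12, 31)


kb_start, kb_end = date(2001, 3, 1), date(2005, 9, 15)
for y in (2001, 2003, 2005, 2010):
    print(y, ds_label(*year(y), kb_start, kb_end))
```

Labels produced this way are noisy, since a sentence can mention a matching date without expressing the relation at all, which is why noise reduction matters for this approach.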

Technology
Language model to clean up DS noise (Microsoft)
– Learns that n-grams such as “FILLER and ENTITY were married” are indicative of per:spouse
– These n-grams are then used in a boosted decision tree classifier, which identifies noisy tuples

Conclusions
Slight increase in participation
On average, performance worse than in 2011
– 2/5 systems outperformed the baseline vs. 3/4 in 2011
– New and complex task!
Notable contributions
– Noise reduction for TSF
– Ensemble models for TSF