Temporal Ordering of Events in the News Domain Preethi Raghavan.

Slides:



Advertisements
Similar presentations
A Support Vector Method for Optimizing Average Precision
Advertisements

A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.
(SubLoc) Support vector machine approach for protein subcelluar localization prediction (SubLoc) Kim Hye Jin Intelligent Multimedia Lab
Active Learning with Feedback on Both Features and Instances H. Raghavan, O. Madani and R. Jones Journal of Machine Learning Research 7 (2006) Presented.
Silent Launch Expectations This activity should be… Silent Independent Work until I say stop Be ready to share your answersExpectations This activity should.
Atomatic summarization of voic messages using lexical and prosodic features Koumpis and Renals Presented by Daniel Vassilev.
NRRC Summer ‘02 Workshop Proposal Temporal December 5, 2001 MITRE James Pustejovsky, Brandeis with input from: James Allan, UMASS Inderjeet Mani, MITRE.
Exploring the Effectiveness of Lexical Ontologies for Modeling Temporal Relations with Markov Logic Eun Y. Ha, Alok Baikadi, Carlyle Licata, Bradford Mott,
Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features.
Classification Classification Examples
Multi-Document Person Name Resolution Michael Ben Fleischman (MIT), Eduard Hovy (USC) From Proceedings of ACL-42 Reference Resolution workshop 2004.
CrowdER - Crowdsourcing Entity Resolution
Jointly Identifying Temporal Relations with Markov Logic Katsumasa Yoshikawa †, Sebastian Riedel ‡, Masayuki Asahara †, Yuji Matsumoto † † Nara Institute.
Distant Supervision for Emotion Classification in Twitter posts 1/17.
Greedy Algorithms Basic idea Connection to dynamic programming Proof Techniques.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Ang Sun Ralph Grishman Wei Xu Bonan Min November 15, 2011 TAC 2011 Workshop Gaithersburg, Maryland USA.
Drawing TimeML Relations
Xyleme A Dynamic Warehouse for XML Data of the Web.
CS347 Review Slides (IR Part II) June 6, 2001 ©Prabhakar Raghavan.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
Page-level Template Detection via Isotonic Smoothing Deepayan ChakrabartiYahoo! Research Ravi KumarYahoo! Research Kunal PuneraUniv. of Texas at Austin.
INFERRING NETWORKS OF DIFFUSION AND INFLUENCE Presented by Alicia Frame Paper by Manuel Gomez-Rodriguez, Jure Leskovec, and Andreas Kraus.
Statistical Relational Learning for Link Prediction Alexandrin Popescul and Lyle H. Unger Presented by Ron Bjarnason 11 November 2003.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
A Joint Model of Feature Mining and Sentiment Analysis for Product Review Rating Jorge Carrillo de Albornoz Laura Plaza Pablo Gervás Alberto Díaz Universidad.
Intrusion Detection Jie Lin. Outline Introduction A Frame for Intrusion Detection System Intrusion Detection Techniques Ideas for Improving Intrusion.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
1 Bins and Text Categorization Carl Sable (Columbia University) Kenneth W. Church (AT&T)
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
A Weakly-Supervised Approach to Argumentative Zoning of Scientific Documents Yufan Guo Anna Korhonen Thierry Poibeau 1 Review By: Pranjal Singh Paper.
Partially Supervised Classification of Text Documents by Bing Liu, Philip Yu, and Xiaoli Li Presented by: Rick Knowles 7 April 2005.
Predicting Positive and Negative Links in Online Social Networks
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Instance Filtering for Entity Recognition Advisor : Dr.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Artificial Intelligence Chapter 18. Representing Commonsense Knowledge.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
LOGO Summarizing Conversations with Clue Words Giuseppe Carenini, Raymond T. Ng, Xiaodong Zhou (WWW ’07) Advisor : Dr. Koh Jia-Ling Speaker : Tu.
A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.
Minimally Supervised Event Causality Identification Quang Do, Yee Seng, and Dan Roth University of Illinois at Urbana-Champaign 1 EMNLP-2011.
TimeML compliant text analysis for Temporal Reasoning Branimir Boguraev and Rie Kubota Ando.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
Bing LiuCS Department, UIC1 Chapter 8: Semi-supervised learning.
CHAPTER 6 Naive Bayes Models for Classification. QUESTION????
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Inference Protocols for Coreference Resolution Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Nick Rizzolo, Mark Sammons, and Dan Roth This research.
Multiple Instance Learning for Sparse Positive Bags Razvan C. Bunescu Machine Learning Group Department of Computer Sciences University of Texas at Austin.
1 Generating Comparative Summaries of Contradictory Opinions in Text (CIKM09’)Hyun Duk Kim, ChengXiang Zhai 2010/05/24 Yu-wen,Hsu.
Implicit Hitting Set Problems Richard M. Karp Erick Moreno Centeno DIMACS 20 th Anniversary.
DeepDive Model Dongfang Xu Ph.D student, School of Information, University of Arizona Dec 13, 2015.
Probabilistic Text Structuring: Experiments with Sentence Ordering Mirella Lapata Department of Computer Science University of Sheffield, UK (ACL 2003)
Learning Subjective Nouns using Extraction Pattern Bootstrapping Ellen Riloff School of Computing University of Utah Janyce Wiebe, Theresa Wilson Computing.
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
An evolutionary approach for improving the quality of automatic summaries Constantin Orasan Research Group in Computational Linguistics School of Humanities,
Bayesian Networks in Document Clustering Slawomir Wierzchon, Mieczyslaw Klopotek Michal Draminski Krzysztof Ciesielski Mariusz Kujawiak Institute of Computer.
Scalable Learning of Collective Behavior Based on Sparse Social Dimensions Lei Tang, Huan Liu CIKM ’ 09 Speaker: Hsin-Lan, Wang Date: 2010/02/01.
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Sentiment Analysis Using Common- Sense and Context Information Basant Agarwal 1,2, Namita Mittal 2, Pooja Bansal 2, and Sonal Garg 2 1 Department of Computer.
Part 2 Applications of ILP Formulations in Natural Language Processing
Constrained Clustering -Semi Supervised Clustering-
Semi-Structured Reasoning for Answering Science Questions
A Structured Learning Approach to Temporal Relation Extraction
Web Mining Research: A Survey
Learning to Detect Human-Object Interactions with Knowledge
Presentation transcript:

Temporal Ordering of Events in the News Domain Preethi Raghavan

Motivation Users have temporal information needs Query: “Prime Minister United Kingdom 2000” Query : “Prime Minister United Kingdom immediately before 2000” Problem Traditional information retrieval systems do not exploit the temporal content in documents Possibilities Integrate temporal dimension into an information retreival framework Question Answering Relative order of events in multi-document summarization

TimeBank Corpus Characteristics News reports annotated using the TimeML specification 186 documents, with a total of 68.5K words. 10% of the corpus is held out as test data TimeML annotations  EVENT: typically verbs  TIMEX3: temporal expressions  TLINK: relates events using temporal relations modeled after Allen’s Interval Algebra + James F. Allen: Maintaining knowledge about temporal intervals. In: Communications of the ACM., 1983

Example Unordered Events in a Document New evidence is suggesting that a series of bombings in Atlanta and last month’s explosion at an Alabama women's clinic might be related In 1996, a bomb blast shocks the Olympic games One person is killed

Simplified Sample TimeML Annotation A bomb blast shocks the Olympic games. <TLINK relType= "BEFORE" eventID="e138" relatedToEvent="e11"/>

Methodology Infer partial order by learning the relation between event pairs in a document ◦ Collapsed labels used:  BEFORE = {IBEFORE, BEFORE}  AFTER = {IAFTER, AFTER}  OVERLAPS = {SIMULTANEOUS, INCLUDES, INCLUDED_BY, DURING, BEGINS, ENDS, ENDED_BY, BEGUN_BY, IDENTITY} ◦ For instance, in document d1  e2 BEFORE e3  e2 AFTER e1  e3 OVERLAPS e4 Infer global temporal order using the proposed approaches ◦ d1: e1, e2, e3

Event Pairs Classification: Feature Set Training data: 3000 event pairs Testing data: 481 event pairs Features: ◦ Event Class: Occurrence (bombing, discovered), Reporting (say) ◦ Tense: Present, Past etc. ◦ Aspect: Progressive, Perfective etc. ◦ Polarity: Positive, Negative ◦ Event Phrase ◦ Temporal Expression occurring in the same sentence as the event ◦ Same aspect, Same tense

Event Pair Classification Results Event-Event Relation using 13 Labels Event-Event Relation using 3 Labels ClassifierPrecision (%)Recall(%)Accuracy (%) Naïve Bayes SVM MaxEnt ClassifierPrecision (%)Recall(%)Accuracy (%) Naïve Bayes SVM MaxEnt

Event Pair Classification Results MaxEnt, Overall Accuracy 56.1% ◦ (Majority Classifier 52.4%) Other Experiments ◦ Experiments in Mani et. al use 6 disjunctive labels. Overall accuracy 62.5% ◦ Collapsing BEFORE and AFTER into the same category will increase accuracy ClassifierPrecision (%)Recall(%)F1 BEFORE AFTER OVERLAPS

Event Pair Classification Results TimeBank + Aquaint Corpus (6234 Event-Event pairs) 6 labels ◦ (BEGINS, SIMULTANEOUS,BEFORE, IBEFORE,ENDS, INCLUDES) ◦ MaxEnt Overall accuracy labels ◦ (BEFORE, OVERLAPS) ◦ MaxEnt Overall accuracy ClassifierPrecision (%)Recall(%)F1 BEFORE OVERLAPS

Inferring Global Temporal Order Ordering of events as a Temporal Directed Acyclic Graph (TDAG) Nodes: Events Edges: Temporal relation between events Cycles are prohibited ◦ Since the graph encodes order Coarse annotation scheme ◦ Does not capture overlap ◦ Only captures precedence relations

Problem Given a partial ordering of event pairs, how do we generate a TDAG to establish global ordering?

Greedy Approach Greedy Algorithm (1) Sort edges according to scores. (2) Start with an empty graph. (3) Add the current largest edge into the graph. (4) Apply transitive closure and constraints. (5) Repeat (3) and (4) until all edges are considered.

Integer Linear Programming For a document with N event pairs, each pair (i, j) can be related in the graph as ◦ i BEFORE j ◦ i AFTER j ◦ i not connected to j Given the probability scores for the relation assigned to each event pair Objective: ◦ Optimize the score of a TDAG by maximizing the sum of the scores of all edges in the graph

ILP Constraints No cycles Enforce transitivity Connectivity constraint

Reference TDAG

Inferring Global Temporal Order TDAG generated using ILP

Observations ILP generates some feasible solution, but not necessarily optimal In certain cases, it recognized the presence of a link, but is not able to accurately predict its direction A single wrongly inferred relation may lead to generation of multiple wrong inferences For the reference TDAG, ◦ ILP gives us 80% accuracy ◦ Greedy gives 60% accuracy

Conclusions Accuracy for 6 disjunctive labels matches the baseline by Mani et al. for event pair relation classification Global ordering helps infer new relations between events This could also be used to increase the size of training data and learn on an increased corpus.

References 1. Philip Bramsen, Pawan Deshpande, Yoong Keok, Lee, Regina Barzilay, Inducing Temporal Graphs. EMNLP (2006) 2. Inderjeet Mani, Marc Verhagen, Ben Wellner, Chong Min Lee and James Pustejovsky, Machine Learning of Temporal Relations. ACL (2006) 3. J. Pustejovsky, J. Castano, R. Ingria, R. Sauri, R. Gauzauskas, A. Setzer, G. Katz, TimeML: Robust Specification of Event and Temporal Expression in Text. IWCS (2003) 4. J. F. Allen. Towards a general theory of action and time. Artificial Intelligence, July Mixed Integer Programming Solver: CPLEX 7. Modeling tool: AMPL