Event Detection Aliaksei Antonau alant4741@mail.ru 1 5. Juli 2016 1.

Slides:



Advertisements
Similar presentations
Arnd Christian König Venkatesh Ganti Rares Vernica Microsoft Research Entity Categorization Over Large Document Collections.
Advertisements

DOMAIN DEPENDENT QUERY REFORMULATION FOR WEB SEARCH Date : 2013/06/17 Author : Van Dang, Giridhar Kumaran, Adam Troy Source : CIKM’12 Advisor : Dr. Jia-Ling.
WSCD INTRODUCTION  Query suggestion has often been described as the process of making a user query resemble more closely the documents it is expected.
LEDIR : An Unsupervised Algorithm for Learning Directionality of Inference Rules Advisor: Hsin-His Chen Reporter: Chi-Hsin Yu Date: From EMNLP.
Search Engines and Information Retrieval
1 ETR 520 Introduction to Educational Research Dr. M C. Smith.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
Application of Apriori Algorithm to Derive Association Rules Over Finance Data Set Presented By Kallepalli Vijay Instructor: Dr. Ruppa Thulasiram.
Learning syntactic patterns for automatic hypernym discovery Rion Snow, Daniel Jurafsky and Andrew Y. Ng Prepared by Ang Sun
Citances and What should our UI look like? Marti Hearst SIMS, UC Berkeley Supported by NSF DBI and a gift from Genentech.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
BILINGUAL CO-TRAINING FOR MONOLINGUAL HYPONYMY-RELATION ACQUISITION Jong-Hoon Oh, Kiyotaka Uchimoto, Kentaro Torisawa ACL 2009.
Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.
Search Engines and Information Retrieval Chapter 1.
Probabilistic Model for Definitional Question Answering Kyoung-Soo Han, Young-In Song, and Hae-Chang Rim Korea University SIGIR 2006.
Detecting Semantic Cloaking on the Web Baoning Wu and Brian D. Davison Lehigh University, USA WWW 2006.
On the Issue of Combining Anaphoricity Determination and Antecedent Identification in Anaphora Resolution Ryu Iida, Kentaro Inui, Yuji Matsumoto Nara Institute.
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali Vasileios Hatzivassiloglou The University.
Unit 1 Lesson 2 Scientific Investigations
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
Opinion Holders in Opinion Text from Online Newspapers Youngho Kim, Yuchul Jung and Sung-Hyon Myaeng Reporter: Chia-Ying Lee Advisor: Prof. Hsin-Hsi Chen.
Unit 1 Lesson 3 Scientific Investigations Copyright © Houghton Mifflin Harcourt Publishing Company.
STORYFIGHTER: A Common Sense Storytelling Game Chian Chuu and Hana Kim MAS.964 Final Project December 12, 2002.
Automatic Identification of Pro and Con Reasons in Online Reviews Soo-Min Kim and Eduard Hovy USC Information Sciences Institute Proceedings of the COLING/ACL.
Summarizing Encyclopedic Term Descriptions on the Web from Coling 2004 Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media.
Emerging Trend Detection Shenzhi Li. Introduction What is an Emerging Trend? –An Emerging Trend is a topic area for which one can trace the growth of.
Matwin Text classification: In Search of a Representation Stan Matwin School of Information Technology and Engineering University of Ottawa
Named Entity Disambiguation on an Ontology Enriched by Wikipedia Hien Thanh Nguyen 1, Tru Hoang Cao 2 1 Ton Duc Thang University, Vietnam 2 Ho Chi Minh.
Ranking Definitions with Supervised Learning Methods J.Xu, Y.Cao, H.Li and M.Zhao WWW 2005 Presenter: Baoning Wu.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Psychiatric document retrieval using a discourse-aware model Presenter : Wu, Jia-Hao Authors : Liang-Chih.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Answer Mining by Combining Extraction Techniques with Abductive Reasoning Sanda Harabagiu, Dan Moldovan, Christine Clark, Mitchell Bowden, Jown Williams.
Acquisition of Categorized Named Entities for Web Search Marius Pasca Google Inc. from Conference on Information and Knowledge Management (CIKM) ’04.
Generating Query Substitutions Alicia Wood. What is the problem to be solved?
Extracting and Ranking Product Features in Opinion Documents Lei Zhang #, Bing Liu #, Suk Hwan Lim *, Eamonn O’Brien-Strain * # University of Illinois.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
An Effective Statistical Approach to Blog Post Opinion Retrieval Ben He, Craig Macdonald, Jiyin He, Iadh Ounis (CIKM 2008)
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Data mining in web applications
Unit 1 Lesson 3 Scientific Investigations
Information Retrieval in Practice
Web-based acquisition of Japanese katakana variants
Recommendation in Scholarly Big Data
What do these mean? Your time is up Ready for anything (Red E)
Chapter 2: Hypothesis development: Where research questions come from.
Review of Related Literature
The Scientific Method “..a logical approach to solving problems by observing and collecting data, developing a hypothesis, conducting experiments to test.
HOW TO WRITE A RESEARCH PAPER
HOW TO WRITE A RESEARCH PAPER
9. Introduction to signal detection
Generating Natural Answers by Incorporating Copying and Retrieving Mechanisms in Sequence-to-Sequence Learning Shizhu He, Cao liu, Kang Liu and Jun Zhao.
A Library Partnership By Krista Jorgensen and Lisa Nowak
Social Knowledge Mining
Thanks to Bill Arms, Marti Hearst
Unit 1 Lesson 2 Scientific Investigations
How will discussion days/note taking work in science class this year?
Presented by: Prof. Ali Jaoua
KDD Reviews 周天烁 2018年5月9日.
Introduction Task: extracting relational facts from text
Unit 1 Lesson 2 Scientific Investigations
Automatic Detection of Causal Relations for Question Answering
Do Now: Answer the following in your Science Notebook using complete sentences.
Facilitating Navigation on Linked Data through Top-K Link Patterns
Web Mining Research: A Survey
Rachit Saluja 03/20/2019 Relation Extraction with Matrix Factorization and Universal Schemas Sebastian Riedel, Limin Yao, Andrew.
Kostas Kolomvatsos, Christos Anagnostopoulos
Extracting Why Text Segment from Web Based on Grammar-gram
Presentation transcript:

Event Detection Aliaksei Antonau alant4741@mail.ru 1 5. Juli 2016 1

Learning to Extract Local Events from the Web John Foley, Michael Bendersky and Vanja Josifovski Aliaksei Antonau 5. Juli 2016

Introduction Aliaksei Antonau 5. Juli 2016

Introduction Aliaksei Antonau 5. Juli 2016

Schema.org “Schema.org is a collaborative, community activity with a mission to create, maintain, and promote schemas for structured data on the Internet, on web pages, in email messages, and beyond.” Aliaksei Antonau 5. Juli 2016

Schema.org Aliaksei Antonau 5. Juli 2016

Event Extraction Model Aliaksei Antonau 5. Juli 2016

Experimental Setup Aliaksei Antonau 5. Juli 2016

Collecting Judgments Aliaksei Antonau 5. Juli 2016

Event Prediction Results Aliaksei Antonau 5. Juli 2016

Precision Evaluation Aliaksei Antonau 5. Juli 2016

Aliaksei Antonau 5. Juli 2016

Aliaksei Antonau 5. Juli 2016

Conclusion In this paper authors introduce task: to retrieve and recommend events that users might want to attend. They focus on the identification and extraction of local events. Authors show, that using semantic web technologies, and specifically Schema.org microdata can be useful for training approaches to extraction problems like the one they explored. Aliaksei Antonau 5. Juli 2016

15 Aliaksei Antonau 5. Juli 2016

Generating Event Causality Hypotheses through Semantic Relations Chikara Hashimoto, Kentaro Torisawa, Julien Kloetzer and Jong-Hoon Oh Aliaksei Antonau 5. Juli 2016

Introduction Goal: develop method of generating plausible event causality hypotheses from other event causalities extracted from the web. Aliaksei Antonau 5. Juli 2016

Introduction Application Reason for generating NEW hypotheses: Future event prediction Why-question answering Future scenario generation Reason for generating NEW hypotheses: It is unlikely that all the event causalities that we recognize in this world are written in corpora Aliaksei Antonau 5. Juli 2016

Proposed Method Semantic relation database records which binary pattern, e.g., A CAUSES B, which indicates a semantic relation, co-occurs with which noun pairs. Binary patterns: Causation. A CAUSES B (deforestation and global warming) Material. B IS MADE OF A (plutonium and atomic bomb) Necessity. B REQUIRES A (ability to think and verbal aptitude) Use. A IS USED FOR B (e-mailer and exchanges of e-mail messages) Prevention. A PREVENTS B (a mosquito net and malaria) Aliaksei Antonau 5. Juli 2016

Hypotheses candidates generation Proposed Method Event Causalities, extracted from the web Extraction of phrase pairs as event causality candidates from single sentences in 600 million web pages. Replacement of the original noun pair of a source event causality with other noun pairs from web. Plausible hypothesis candidates are identified by an SVM classifier. Hypotheses candidates generation Hypotheses Ranking Aliaksei Antonau 5. Juli 2016

Proposed Method Generated event causality Deploy a mosquito net → avoid malaria On phrase pair level On noun pair level novelty hypotheses it novelty hypotheses it will be will be REJECTED IF Source includes: Deploy a → avoid Use a → prevent mosquito net malaria mosquito net malaria Aliaksei Antonau 5. Juli 2016

Experimental Setup Source for extracting event causalities: 600 million web pages 132,528,706 event causality candidates extracted 2,451,254 event causalities after applying filters These 2.4M events were used for generating hypotheses All pages and events in Japanese language Aliaksei Antonau 5. Juli 2016

Results in noun pair level novelty setting Aliaksei Antonau 5. Juli 2016

Results in phrase pair level novelty setting Aliaksei Antonau 5. Juli 2016

Conclusion Authors proposed a method of hypothesizing plausible event causality hypotheses from event causalities extracted from the web by using semantic relations. With 70% precision, proposed method generated 347,093 noun pair level novelty hypotheses and 302,350 phrase pair level novelty hypotheses from the 2.4M event causalities extracted from the web. Aliaksei Antonau 5. Juli 2016

Thank You! Questions? Aliaksei Antonau 5. Juli 2016