Temporal Event Map Construction For Event Search Qing Li Department of Computer Science City University of Hong Kong.

Slides:



Advertisements
Similar presentations
A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.
Advertisements

DISCOVERING EVENT EVOLUTION GRAPHS FROM NEWSWIRES Christopher C. Yang and Xiaodong Shi Event Evolution and Event Evolution Graph: We define event evolution.
A probabilistic model for retrospective news event detection
Personalized Query Classification Bin Cao, Qiang Yang, Derek Hao Hu, et al. Computer Science and Engineering Hong Kong UST.
Improved TF-IDF Ranker
Designing Multimedia with Fuzzy Logic Enrique Diaz de Leon * Rene V. Mayorga ** Paul D. Guild *** * ITESM, Guadalajara Campus, Mexico ** Faculty of Engineering,
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC.
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Introduction to IR Research ChengXiang Zhai Department of Computer.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Systems Engineering and Engineering Management The Chinese University of Hong Kong Parameter Free Bursty Events Detection in Text Streams Gabriel Pui Cheong.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Web Mining Research: A Survey
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
On Burstiness-Aware Search for Document Sequences Theodoros Lappas Benjamin Arai Manolis Platakis Dimitrios Kotsakos Dimitrios Gunopulos SIGKDD 2009.
Building Knowledge-Driven DSS and Mining Data
Alert Correlation for Extracting Attack Strategies Authors: B. Zhu and A. A. Ghorbani Source: IJNS review paper Reporter: Chun-Ta Li ( 李俊達 )
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
Citation Recommendation 1 Web Technology Laboratory Ferdowsi University of Mashhad.
 An important problem in sponsored search advertising is keyword generation, which bridges the gap between the keywords bidded by advertisers and queried.
Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
Automatically Extracting Data Records from Web Pages Presenter: Dheerendranath Mundluru
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Frame an IR Research Problem and Form Hypotheses ChengXiang Zhai Department.
25/03/2003CSCI 6405 Zheyuan Yu1 Finding Unexpected Information Taken from the paper : “Discovering Unexpected Information from your Competitor’s Web Sites”
Lesley Charles November 23, 2009.
11 A Hybrid Phish Detection Approach by Identity Discovery and Keywords Retrieval Reporter: 林佳宜 /10/17.
Web Mining Class Nam Hoai Nguyen Hiep Tuan Nguyen Tri Survey on Web Structure Mining
CS 533 Information Retrieval Systems.  Introduction  Connectivity Analysis  Kleinberg’s Algorithm  Problems Encountered  Improved Connectivity Analysis.
Mining fuzzy domain ontology based on concept Vector from wikipedia category network.
Wei Feng , Jiawei Han, Jianyong Wang , Charu Aggarwal , Jianbin Huang
Online Kinect Handwritten Digit Recognition Based on Dynamic Time Warping and Support Vector Machine Journal of Information & Computational Science, 2015.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
1 Web-Page Summarization Using Clickthrough Data* JianTao Sun, Yuchang Lu Dept. of Computer Science TsingHua University Beijing , China Dou Shen,
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
MODEL ADAPTATION FOR PERSONALIZED OPINION ANALYSIS MOHAMMAD AL BONI KEIRA ZHOU.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Psychiatric document retrieval using a discourse-aware model Presenter : Wu, Jia-Hao Authors : Liang-Chih.
Liangjie Hong and Brian D. Davison Department of Computer Science and Engineering Lehigh University SIGIR 2009.
Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,
Supporting Knowledge Discovery: Next Generation of Search Engines Qiaozhu Mei 04/21/2005.
Discovering Evolutionary Theme Patterns from Text - An Exploration of Temporal Text Mining Qiaozhu Mei and ChengXiang Zhai Department of Computer Science.
Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
2005/09/13 A Probabilistic Model for Retrospective News Event Detection Zhiwei Li, Bin Wang*, Mingjing Li, Wei-Ying Ma University of Science and Technology.
A Multilingual Hierarchy Mapping Method Based on GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of.
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
Discovering Evolutionary Theme Patterns from Text -An exploration of Temporal Text Mining KDD’05, August 21–24, 2005, Chicago, Illinois, USA. Qiaozhu Mei.
MINING DEEP KNOWLEDGE FROM SCIENTIFIC NETWORKS
Introduction to IR Research
A Paper Presentation Vikram Singh Dept. of Computer Engineering ,
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Associative Query Answering via Query Feature Similarity
Video Summarization by Spatial-Temporal Graph Optimization
Finding Story Chains in Newswire Articles
Applying Key Phrase Extraction to aid Invalidity Search
Cross-library API Recommendation Using Web Search Engines
Introduction of KNS55 Platform
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
Searching with context
Block Matching for Ontologies
Web Mining Department of Computer Science and Engg.
Web Mining Research: A Survey
Topic: Semantic Text Mining
Presentation transcript:

Temporal Event Map Construction For Event Search Qing Li Department of Computer Science City University of Hong Kong

Outline  Introduction  Problem Formulation  Temporal Event Map Construction For Event Search  Algebra operations devised on TEM  Conclusions and Future Work

Outline  Introduction  Problem Formulation  Temporal Event Map Construction For Event Search  Algebra operations devised on TEM  Conclusions and Future Work

Introduction  Many news articles report events on the WWW  For an event, it may consist of several component events, i.e., episodes There are relationship between component events Some component events are more important than others  For example, “Toyota vehicle recalls”

 What people want from news articles? Not a sole news article, but the events reported by some related news articles Dependent relationships between component events Which component events play important roles in the event evolution or development

 They are interested in the whole picture of an event evolution or development along a time line includes the dependent relationships between component events Event importance in the event evolution or development  Read news is an important way to know what happen Too many news Too many topics  Time consuming job to read news to find what user wants Understand what happens What is important component events What is the relations between component events Temporal event map Provide a convenient way to browse the event evolution

Introduction  Current search engine Keyword search List of web pages Can not provide a map of an event  Necessary and useful to provide temporal event map (TEM)

Previous Work  Qiaozhu Mei and Chengxiang Zhai. Discovering Evolutionary Theme Patterns from Text: An Exploration of Temporal Text Mining. In Proceeding of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, pp ,  Christopher C. Yang, Xiaodong Shi, and Chih-Ping Wei. Tracing the Event Evolution of Terror Attacks from On-Line News. In Proceeding of the ISI  Jiangtao Qiu et al. Timeline Analysis of Web News Events. In Proceeding of the ADMA  Christopher C. Yang, Xiaodong Shi, and Chih-Ping Wei. Discovering Event Evolution Graphs From News Corpora. IEEE Trans. Sys. Man Cyber. Part A  Jin, Peiquan et al. TISE: A Temporal Search Engine for Web Contents. Proceedings of the 2008 Second International Symposium on Intelligent Information Technology Application  A. Feng and J. Allan. Incident threading for news passages. In CIKM ’09: Proceeding of the 18th ACM conference on Information and knowledge management, pages 1307–1316, New York, NY, USA, ACM.

Limitations of Previous Work  Only take time sequence and content similarity between two component events into consideration  Using such two factors are not enough to identify dependent relationships  Do not provide a way to measure event importance so as to identify milestone events that are more interested by most users  Do not provide a convenient way to browse event evolution

Our Work  Formalize the problem of event search  Propose a framework to search events based on users’ queries  Characteristics and contributions of our work

Content dependent relationship - mutual information VS. content similarity Event reference relationship- some news articles of an event may refer to (mention) other events Adopt three kinds of event relationships which are temporal relationship, content dependent relationship and event reference relationship to identify a dependent relationship between two events

 Characteristics and contributions of our work Define some algebra operations to assist user browsing TEM The search results are organized by a temporal event map (TEM) which is a whole picture about an event’s evolution or development along a time line Propose a method to measure event importance degrees so as to rank events based on their importance degrees Experiment results show that our method outperforms baselines in discovering event dependent relationships and ranking events based on event importance

Outline  Introduction  Problem Formulation  Temporal Event Map Construction For Event Search  Algebra operations devised on TEM  Conclusions and Future Work

Event Modeling  Event – reported by documents  Document - A story talking about an event including the happen time, places and content of the event Timestamp Places Content

Event Modeling

 Related document set of an event - a set of documents talking about the event  Life cycle  Begin time the earliest timestamp among all timestamps of related documents of the event  End time latest timestamp among all timestamps of related documents of the event  Place set

Event Modeling  Example: SARS epidemic The life cycle of this event is from November 2002 to May The places of the event includes China, Canada, Singapore and so on. There are many news from the world wide web which reported such an event. We can extract keywords from the set of documents to describe the event such as SARS, flu-like, fever, treatment and so on.

An Example of TEM

Our Work  Input User can search events by time, places and interested content  Output A temporal event map  Event evolution  Fuzzy relations of events  Important events  Some algebra operations

Outline  Introduction  Problem Formulation  Temporal Event Map Construction For Event Search  Algebra operations devised on TEM  Conclusions and Future Work

Steps of Constructing Temporal Event Map For Temporal Event Search  Identify related document set of target event  Event Discovering  Content Dependent Relationship Analysis  Event Reference Relationship Analysis  Event Ranking  Temporal Event Map Construction

Identify related document set of target event  Identify related document set of target event The input information could be considered as the search requirements of the user and corresponds to a target event which satisfy all the requirements. The related document set of the target event can be obtained by a function.

Identify related document set of target event  Special cases of input Partially input - part of (I t, I p, I f ) We consider these input as three kinds of requirements and only take the input requirements into consideration R tf R tpf

Event discovering  For each target event a corresponding to an input I and its related document set Ra, we can detect several sub-vents from Ra.  We do not aim at event detection  Adopt the topic-model based method to detect the events  A function

Content Dependent Relationship Analysis  Event Representation – salient feature vector

Content Dependent Relationship Analysis  Previous works - content similarity  Some keywords in two events are dependent but not exactly matched  Calculating mutual information to measure the dependence between features, and then use an aggregation of all mutual information between features in events

Event Reference Relationship Analysis

Event Ranking  some component events are more important than others  ranking function to rank all the sub-events E1E3 E4 E2 E6 E

Temporal Event Map Construction

Outline  Introduction  Problem Formulation  Temporal Event Map Construction For Event Search  Algebra Operations Devised on TEM  Conclusions and Future Work

Algebra Operations Devised on TEM  Projection Based on Content Dependency Relation  Projection Based on Reference Relation  Projection on Happening Places  Projection on A Time Period  Zoom In  Zoom out

An example of the whole TEM about the event “2011 Japan Earthquake”

An example of Projection Based on Content Dependency Relation about the event “2011 Japan Earthquake”

An example of Projection Based on Reference Relation on TEM about the event “2011 Japan Earthquake”

Projection on Happening Places on TEM about the event “2011 Japan Earthquake”

Projection on Time Period on TEM about the event “2011 Japan Earthquake”

Zoom In Operation

Sub-TEM of Zoom In “The earthquake and tsunami caused a number of nuclear accidents.”

Outline  Introduction  Problem Formulation  Temporal Event Map Construction For Event Search  Evaluation  Conclusions and Future Work

Conclusion and future work  Conclusion Formulate the temporal event search problem Propose a framework to search events according to users’ queries. Define three kinds of relationships and use them to identify event dependent relationships The search results is represented by a temporal event map (TEM) A method to measure event importance degree  Future work Try to handle different kinds of input and discuss their scalability Discover the important and burst periods of an event To achieve personalization News recommendation to let user know the event more clearly and completely

Thanks!