Exploiting Temporal References in Text Retrieval Irem Arikan advised by: Srikanta Bedathur, Klaus Berberich.

Slides:



Advertisements
Similar presentations
Information Retrieval and Organisation Chapter 12 Language Models for Information Retrieval Dell Zhang Birkbeck, University of London.
Advertisements

Traditional IR models Jian-Yun Nie.
Metadata in Carrot II Current metadata –TF.IDF for both documents and collections –Full-text index –Metadata are transferred between different nodes Potential.
1 Language Models for TR (Lecture for CS410-CXZ Text Info Systems) Feb. 25, 2011 ChengXiang Zhai Department of Computer Science University of Illinois,
SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
Information Retrieval Models: Probabilistic Models
Hinrich Schütze and Christina Lioma Lecture 12: Language Models for IR
Chapter 7 Retrieval Models.
1 Language Model CSC4170 Web Intelligence and Social Computing Tutorial 8 Tutor: Tom Chao Zhou
IR Challenges and Language Modeling. IR Achievements Search engines  Meta-search  Cross-lingual search  Factoid question answering  Filtering Statistical.
Incorporating Language Modeling into the Inference Network Retrieval Framework Don Metzler.
A Markov Random Field Model for Term Dependencies Donald Metzler and W. Bruce Croft University of Massachusetts, Amherst Center for Intelligent Information.
Introduction to Information Retrieval Introduction to Information Retrieval Hinrich Schütze and Christina Lioma Lecture 12: Language Models for IR.
Introduction to Information Retrieval Introduction to Information Retrieval Hinrich Schütze and Christina Lioma Lecture 11: Probabilistic Information Retrieval.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Question Answering using Language Modeling Some workshop-level thoughts.
Language Models for TR Rong Jin Department of Computer Science and Engineering Michigan State University.
Evaluating the Performance of IR Sytems
ITCS 6010 Natural Language Understanding. Natural Language Processing What is it? Studies the problems inherent in the processing and manipulation of.
Scalable Text Mining with Sparse Generative Models
Language Modeling Approaches for Information Retrieval Rong Jin.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Multi-Style Language Model for Web Scale Information Retrieval Kuansan Wang, Xiaolong Li and Jianfeng Gao SIGIR 2010 Min-Hsuan Lai Department of Computer.
İrem Arıkan, Srikanta Bedathur, Klaus Berberich Time Will Tell: Leveraging Temporal Expressions in IR.
Language Models for IR Debapriyo Majumdar Information Retrieval Indian Statistical Institute Kolkata Spring 2015 Credit for several slides to Jimmy Lin.
Topic Models in Text Processing IR Group Meeting Presented by Qiaozhu Mei.
Estimating Importance Features for Fact Mining (With a Case Study in Biography Mining) Sisay Fissaha Adafre School of Computing Dublin City University.
A Comparative Study of Search Result Diversification Methods Wei Zheng and Hui Fang University of Delaware, Newark DE 19716, USA
Improved search for Socially Annotated Data Authors: Nikos Sarkas, Gautam Das, Nick Koudas Presented by: Amanda Cohen Mostafavi.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Modern Information Retrieval: A Brief Overview By Amit Singhal Ranjan Dash.
Bayesian Extension to the Language Model for Ad Hoc Information Retrieval Hugo Zaragoza, Djoerd Hiemstra, Michael Tipping Presented by Chen Yi-Ting.
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
Context-Sensitive Information Retrieval Using Implicit Feedback Xuehua Shen : department of Computer Science University of Illinois at Urbana-Champaign.
Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
Probabilistic Models of Novel Document Rankings for Faceted Topic Retrieval Ben Cartrette and Praveen Chandar Dept. of Computer and Information Science.
Autumn Web Information retrieval (Web IR) Handout #1:Web characteristics Ali Mohammad Zareh Bidoki ECE Department, Yazd University
Lecture 1: Overview of IR Maya Ramanath. Who hasn’t used Google? Why did Google return these results first ? Can we improve on it? Is this a good result.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
Positional Relevance Model for Pseudo–Relevance Feedback Yuanhua Lv & ChengXiang Zhai Department of Computer Science, UIUC Presented by Bo Man 2014/11/18.
A Language Modeling Approach to Information Retrieval 한 경 수  Introduction  Previous Work  Model Description  Empirical Results  Conclusions.
Language Model in Turkish IR Melih Kandemir F. Melih Özbekoğlu Can Şardan Ömer S. Uğurlu.
Performance Measurement. 2 Testing Environment.
Relevance-Based Language Models Victor Lavrenko and W.Bruce Croft Department of Computer Science University of Massachusetts, Amherst, MA SIGIR 2001.
Language Modeling Putting a curve to the bag of words Courtesy of Chris Jordan.
CpSc 881: Information Retrieval. 2 Using language models (LMs) for IR ❶ LM = language model ❷ We view the document as a generative model that generates.
Toward Semantic Search: RDFa based facet browser Jin Guang Zheng Tetherless World Constellation.
Automatic Labeling of Multinomial Topic Models
Relevance Models and Answer Granularity for Question Answering W. Bruce Croft and James Allan CIIR University of Massachusetts, Amherst.
CONTEXTUAL SEARCH AND NAME DISAMBIGUATION IN USING GRAPHS EINAT MINKOV, WILLIAM W. COHEN, ANDREW Y. NG SIGIR’06 Date: 2008/7/17 Advisor: Dr. Koh,
A Generation Model to Unify Topic Relevance and Lexicon-based Sentiment for Opinion Retrieval Min Zhang, Xinyao Ye Tsinghua University SIGIR
Automatic Labeling of Multinomial Topic Models Qiaozhu Mei, Xuehua Shen, and ChengXiang Zhai DAIS The Database and Information Systems Laboratory.
Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.
Introduction to Information Retrieval Probabilistic Information Retrieval Chapter 11 1.
Introduction to Information Retrieval Introduction to Information Retrieval Lecture 14: Language Models for IR.
Bayesian Extension to the Language Model for Ad Hoc Information Retrieval Hugo Zaragoza, Djoerd Hiemstra, Michael Tipping Microsoft Research Cambridge,
Lecture 13: Language Models for IR
CSCI 5417 Information Retrieval Systems Jim Martin
Language Models for Information Retrieval
Murat Açar - Zeynep Çipiloğlu Yıldız
John Lafferty, Chengxiang Zhai School of Computer Science
Language Model Approach to IR
Navigation-Aided Retrieval
Topic Models in Text Processing
INF 141: Information Retrieval
Conceptual grounding Nisheeth 26th March 2019.
Language Models for TR Rong Jin
Presentation transcript:

Exploiting Temporal References in Text Retrieval Irem Arikan advised by: Srikanta Bedathur, Klaus Berberich

Motivation  users’ information needs often have a temporal dimension,  but traditional information retrieval systems do not exploit the temporal content in documents.  query: PM United Kingdom 2000  search engine is not aware that 2000 is actually mentioned implicitly by the document an approach which recognizes and exploits temporal references in documents to yield better search results !

Example Temporal Queries Broad Queries British colony 17 th century Economic situtation Germany 1920s President assasination 1950 – 2000 Specific Queries US president October 1962 Pope 1940s Academy awards best actor 1975 Ambiguous Queries George Bush 1990 vs. George Bush 2007 Gulf war 1991 vs. Gulf war 2005

 Language Modeling for Information Retrieval  Time Modeling for Temporal Information Retrieval  Combining Text Relevance with Temporal Relevance  Experimental Results Outline

Language Model: a statistical model to generate text Language Modeling: the task of estimating the statistical parameters of a language model Language Modeling for IR: the problem of estimating the likelihood that a query and a document could have been generated by the same language model In practical IR approaches: Unigram Language Model words occur independently Language Modeling for Information Retrieval

1)document : a sample from a language model assume an underlying multinomial probability distribution over words for each document estimate statistics of this distribution: P[word] 2)estimate the likelihood that the query is generated by this distribution 3)rank the documents by P(q | d ) document infer M d : P [ word | M d ] Language Modeling for IR

General approach  similar to LM approach  based on a generative model which generates temporal references  temporal model  splits query into 2 parts: text query and temporal query Probabilistic mechanism for producing temporal content of the document  each time reference generated by a different generative temporal model  for generating a time reference 1)first choose a temporal model 2)then generate a time reference using this temporal model Temporal Modeling for Temporal Retrieval

Estimating temporal query likelihood  Infer a temporal model from each temporal reference in the document  Estimate the likelihood that the temporal query is generated by one of the models which generated the temporal content of the document Temporal query generation probability Temporal Modeling

 A probabilistic model to generate temporal references  What kind of distribution?  How can we estimate its parameters? What is a temporal model?

Temporal Modeling  A probabilistic model to generate temporal references  What kind of distribution?  How can we estimate its parameters? Formalize the problem in a goal-oriented way,  We should infer a temporal model from each time interval (sample time interval)  This temporal model should be able to generate all time intervals which are relevant to the sample interval What is a temporal model?

Assumptions: only relevant if they intersect the generative model inferred should be able to produce subintervals, superintervals, overlapping intervals of the interval in the document probability of generating an intersecting time interval should be proportional to the length of intersection query: 1980 – – 1989 is more relevant than 23 March 1984 Appropriate probabilistic model: 2 underlying triangular distributions one for start, one for end, sub2 lOverlap s t sub1 sup1 sup2 rOverlap e 1. Approach

Parameters Support Triangular Distribution

se +1 u elq s Approach r1r2r3r4 nonzero probability for intersecting intervals r1 – r3 : left overlaps r1 – r4 : super intervals r2 – r3 : subintervals r2 - r4 : right overlaps interval [s,e] has the highest probability probability decreases to the left and right resulting in lower probability for intervals which have smaller intersection lengths

se +1 u el q s Approach r1r2 r3r4

Assumptions:  Only relevant if they are positioned closely to each other on the time axis and have similar lengths  | start1 – start2 | < a  | length1 – length2 | < b  The generative model inferred should be able to produce temporal intervals in some neighbourhood on the time axis s l t ∆s ∆l 2. Approach

ss+as -a ll+b l-b 2. Approach  Temporal interval x = s, y = l has the highest probability  Probability decreases as start point moves away from s and as length moves away from l

ss+as -a ll+b l-b 2. Approach

Text relevance Combining Text Relevance with Temporal Relevance

Text relevance Combining Text Relevance with Temporal Relevance Temporal relevance  Filter and re-rank search results by weighting text relevance score by temporal relevance

Information Retrieval (IR) with Temporal Extension IR System Query Index Result Set System Architecture Temporal Query Index for temporal references Temporal Retrieval Result Set

Query: Spanish painter 18th century Experimental Results-1 TerrierBooleanOur Method Art_in_Puerto_RicoAgustín_EsteveJosé_del_Castillo Spanish_artAcislo_Antonio_Palomino_ de_Castro_y_Velasco Agustín_Esteve Palazzo_Bianco_(Genoa)AlvarezRoybal CaprichosAgostino_Scilla_00e6Maldonado List_of_people_from_Antw erp BassanoLuis_Egidio_Meléndez

Query: Chancellor Germany 1955 Experimental Results-2 TerrierBooleanOur Method Federal_Minister_for_Speci al_Affairs_of_Germany Basic_Law_for_the_Federal _Republic_of_Germany Occupation_statute Otto_GesslerBonn-Paris_conventionsSecond_German_Bundestag Bonn-Paris_conventionsBavaria_PartyWest_Germany Occupation_statuteAll-German_Bloc_League_ of_Expellees_and_Deprived _of_Rights Bonn-Paris_conventions Petersberg_AgreementAnschlussKonrad_Adenauer

Query: George Bush 1990 Experimental Results-3 TerrierBooleanOur Method George_W._Bush_insider_tr ading_allegations Bush_familyPresident_Bush Bush_familyBush_administration Early_life_of_George_W._B ush Andrew_CardPresident's Council of Advisors on Science and Technology George_H._W._BushApproval_ratingGeorge_H._W._Bush C_Boyden_GrayBrent_ScowcroftArbusto_Energy

Thanks!