Generating Impact-Based Summaries for Scientific Literature Qiaozhu Mei, ChengXiang Zhai University of Illinois at Urbana-Champaign 1.

Slides:

Advertisements

Similar presentations

Critical Reading Strategies: Overview of Research Process

Advertisements

Information Retrieval and Organisation Chapter 12 Language Models for Information Retrieval Dell Zhang Birkbeck, University of London.

Language Models Naama Kraus (Modified by Amit Gross) Slides are based on Introduction to Information Retrieval Book by Manning, Raghavan and Schütze.

1 Language Models for TR (Lecture for CS410-CXZ Text Info Systems) Feb. 25, 2011 ChengXiang Zhai Department of Computer Science University of Illinois,

Language Models Hongning Wang

Cumulative Progress in Language Models for Information Retrieval Antti Puurula 6/12/2013 Australasian Language Technology Workshop University of Waikato.

Comparing Twitter Summarization Algorithms for Multiple Post Summaries David Inouye and Jugal K. Kalita SocialCom May 10 Hyewon Lim.

Information Retrieval Models: Probabilistic Models

Language Model based Information Retrieval: University of Saarland 1 A Hidden Markov Model Information Retrieval System Mahboob Alam Khalid.

Generative Topic Models for Community Analysis

Chapter 7 Retrieval Models.

IR Challenges and Language Modeling. IR Achievements Search engines  Meta-search  Cross-lingual search  Factoid question answering  Filtering Statistical.

Incorporating Language Modeling into the Inference Network Retrieval Framework Don Metzler.

1 Ranked Queries over sources with Boolean Query Interfaces without Ranking Support Vagelis Hristidis, Florida International University Yuheng Hu, Arizona.

Language Models for TR Rong Jin Department of Computer Science and Engineering Michigan State University.

1 LM Approaches to Filtering Richard Schwartz, BBN LM/IR ARDA 2002 September 11-12, 2002 UMASS.

Scalable Text Mining with Sparse Generative Models

Language Modeling Approaches for Information Retrieval Rong Jin.

JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 30, (2014) BERLIN CHEN, YI-WEN CHEN, KUAN-YU CHEN, HSIN-MIN WANG2 AND KUEN-TYNG YU Department of Computer.

Query session guided multidocument summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.

CS344: Introduction to Artificial Intelligence Vishal Vachhani M.Tech, CSE Lecture 34-35: CLIR and Ranking in IR.

Multi-Style Language Model for Web Scale Information Retrieval Kuansan Wang, Xiaolong Li and Jianfeng Gao SIGIR 2010 Min-Hsuan Lai Department of Computer.

Topic Models in Text Processing IR Group Meeting Presented by Qiaozhu Mei.

IRDM WS Chapter 4: Advanced IR Models 4.1 Probabilistic IR 4.2 Statistical Language Models (LMs) Principles and Basic LMs Smoothing.

1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.

Language Models Hongning Wang Two-stage smoothing [Zhai & Lafferty 02] c(w,d) |d| P(w|d) = +  p(w|C) ++ Stage-1 -Explain unseen words -Dirichlet.

Topical Crawlers for Building Digital Library Collections Presenter: Qiaozhu Mei.

Mining the Web to Create Minority Language Corpora Rayid Ghani Accenture Technology Labs - Research Rosie Jones Carnegie Mellon University Dunja Mladenic.

A Machine Learning Approach to Sentence Ordering for Multidocument Summarization and Its Evaluation D. Bollegala, N. Okazaki and M. Ishizuka The University.

A General Optimization Framework for Smoothing Language Models on Graph Structures Qiaozhu Mei, Duo Zhang, ChengXiang Zhai University of Illinois at Urbana-Champaign.

Context-Sensitive Information Retrieval Using Implicit Feedback Xuehua Shen : department of Computer Science University of Illinois at Urbana-Champaign.

Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.

LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.

Information Retrieval at NLC Jianfeng Gao NLC Group, Microsoft Research China.

Positional Relevance Model for Pseudo–Relevance Feedback Yuanhua Lv & ChengXiang Zhai Department of Computer Science, UIUC Presented by Bo Man 2014/11/18.

Yuya Akita , Tatsuya Kawahara

Gravitation-Based Model for Information Retrieval Shuming Shi, Ji-Rong Wen, Qing Yu, Ruihua Song, Wei-Ying Ma Microsoft Research Asia SIGIR 2005.

Lower-Bounding Term Frequency Normalization Yuanhua Lv and ChengXiang Zhai University of Illinois at Urbana-Champaign CIKM 2011 Best Student Award Paper.

Language Model in Turkish IR Melih Kandemir F. Melih Özbekoğlu Can Şardan Ömer S. Uğurlu.

Comparative Experiments on Sentiment Classification for Online Product Reviews Hang Cui, Vibhu Mittal, and Mayur Datar AAAI 2006.

Dependence Language Model for Information Retrieval Jianfeng Gao, Jian-Yun Nie, Guangyuan Wu, Guihong Cao, Dependence Language Model for Information Retrieval,

Language Modeling Putting a curve to the bag of words Courtesy of Chris Jordan.

Discriminative Models for Information Retrieval Ramesh Nallapati UMass SIGIR 2004.

NTNU Speech Lab Dirichlet Mixtures for Query Estimation in Information Retrieval Mark D. Smucker, David Kulp, James Allan Center for Intelligent Information.

Active Feedback in Ad Hoc IR Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.

1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:

Using Social Annotations to Improve Language Model for Information Retrieval Shengliang Xu, Shenghua Bao, Yong Yu Shanghai Jiao Tong University Yunbo Cao.

Discovering Evolutionary Theme Patterns from Text - An Exploration of Temporal Text Mining Qiaozhu Mei and ChengXiang Zhai Department of Computer Science.

Automatic Labeling of Multinomial Topic Models

Relevance Models and Answer Granularity for Question Answering W. Bruce Croft and James Allan CIIR University of Massachusetts, Amherst.

A Generation Model to Unify Topic Relevance and Lexicon-based Sentiment for Opinion Retrieval Min Zhang, Xinyao Ye Tsinghua University SIGIR

Indri at TREC 2004: UMass Terabyte Track Overview Don Metzler University of Massachusetts, Amherst.

1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.

Automatic Labeling of Multinomial Topic Models Qiaozhu Mei, Xuehua Shen, and ChengXiang Zhai DAIS The Database and Information Systems Laboratory.

A Study of Poisson Query Generation Model for Information Retrieval

Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.

Recent Paper of Md. Akmal Haidar Meeting before ICASSP 2013 報告者：郝柏翰 2013/05/23.

A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval Chengxiang Zhai, John Lafferty School of Computer Science Carnegie.

Language Modeling Again So are we smooth now? Courtesy of Chris Jordan.

Bayesian Extension to the Language Model for Ad Hoc Information Retrieval Hugo Zaragoza, Djoerd Hiemstra, Michael Tipping Microsoft Research Cambridge,

A Formal Study of Information Retrieval Heuristics

Statistical Language Models

An Empirical Study of Learning to Rank for Entity Search

Murat Açar - Zeynep Çipiloğlu Yıldız

John Lafferty, Chengxiang Zhai School of Computer Science

Topic Models in Text Processing

CS590I: Information Retrieval

INF 141: Information Retrieval

Conceptual grounding Nisheeth 26th March 2019.

Language Models for TR Rong Jin

Presentation transcript:

Generating Impact-Based Summaries for Scientific Literature Qiaozhu Mei, ChengXiang Zhai University of Illinois at Urbana-Champaign 1

Motivation Fast growth of publications – >100k papers in DBLP; > 10 references per paper Summarize a scientific paper –Author’s view: Abstracts, introductions May not be what the readers received May change over time –Reader’s view: impact of the paper Impact Factor: numeric Summary of the content? Author’s view: Proof of xxx; new definition of xxx; apply xxx technique State-of-the-art algorithm; Evaluation metric Reader’s view 20 years later 2

What should an impact summary look like?

Citation Contexts  Impact, but… Describes how other authors view/comment on the paper –Implies the impact Similar to anchor text on web graph, but: Usually more than one sentences (informative). Usually mixed with discussions/comparison about other papers (noisy). … They have been also successfully used in part of speech tagging [7], machine translation [3, 5], information retrieval [4, 20], transliteration [13] and text summarization [14].... For example, Ponte and Croft [20] adopt a language modeling approach to information retrieval. … 4

Our Definition of Impact Summary Solution: Citation context  infer impact; Original content  summary Abstract:…. Introduction: ….. Content: …… References: …. … Ponte and Croft [20] adopt a language modeling approach to information retrieval. … … probabilistic models, as well as to the use of other recent models [19, 21], the statistical properties … Author picked sentences: good for summary, but doesn’t reflect the impact Reader composed sentences: good signal of impact, but too noisy to be used as summary Citation Contexts Target: extractive summary (pick sentences) of the impact of a paper 5

Rest of this Talk An Feasibility study: A Language modeling based approach –Sentence retrieval Estimation of impact language models Experiments Conclusion 6

Language Modeling in Information Retrieval d1d1 d2d2 dNdN Doc LM Documents q Query LM Rank with neg. KL Divergence Smooth using collection LM 7

Impact-based Summarization as Sentence Retrieval s1s1 s2s2 sNsN Sent LM Sentences D Impact LM Rank with neg. KL Divergence D c1c1 c2c2 cMcM Use top ranked sentences as a summary Key problem: estimate θ I 8

Estimating Impact Language Models Interpolation of document language model and citation language model D c1c1 c2c2 cMcM Constant coefficient: Dirichlet smoothing: Set λ j with features of c j : f 1 (c j ) = |c j |, and… 9

Specific Feature – Citation-based Authority Assumption: High authority paper has more trustable comments (citation context) Weight more in impact language model Authority  pagerank on the citation graph d1d1 d2d2 10

Specific Feature – Citation Context Proximity Weight citation sentences according to the proximity to the citation label k  distance to the citation label … There has been a lot of effort in applying the notion of language modeling and its variations to other problems. For example, Ponte and Croft [20] adopt a language modeling approach to information retrieval. They argue that much of the difficulty for IR lies in the lack of an adequate indexing model. Instead of making prior parametric assumptions about the similarity of documents, they propose a non-parametric approach to retrieval based probabilistic language modeling. Empirically, their approach significantly outperforms traditional tf*idf weighting on two different collections and query sets. … 11

Experiments Gold standard: –human generated summary –14 most cited papers in SIGIR Baselines: –Random; LEAD (likely to cover abs/intro.); –MEAD – Single Doc; –MEAD – Doc + Citations; (multi-document) Evaluation Metric: –ROUGE-1, ROUGE-L (unigram cooccurrence; longest common sequence) 12

Basic Results LengthMetricRandomLEADMEAD- Doc MEAD- Doc +Cite LM (KL-Div) 3R (+7.3%) 3R-L (+12.8%) 5R (+16.5%) 5R-L (+22.7%) 10R (+12.9%) 10R-L (+16.2%) 15R (+6.6%) 15R-L (+8.5%) 13

Component Study Impact language model: –Document LM << Citation Context LM << Interpolation (Doc LM, Cite LM) –Dirichlet interpolation > constant coefficient 14 MetricImpact LM = Doc LM Impact LM = Citation LM Interpolation ConstCoefDirichlet ROUGE ROUGE-L

Component Study (Cont.) Authority and Proximity –Both Pagerank and Proximity improves –Pagerank + Proximity improves marginally –Q: How to combine pagerank and proximity? 15 PageRankProximity = OffPr(s) = 1/α k Off On

Non-impact-based Summary Paper = “A study of smoothing methods for language models applied to ad hoc information retrieval” 1. Language modeling approaches to information retrieval are attractive and promising because they connect the problem of retrieval with that of language model estimation, which has been studied extensively in other application areas such as speech recognition. 2. The basic idea of these approaches is to estimate a language model for each document, and then rank documents by the likelihood of the query according to the estimated language model. 3. On the one hand, theoretical studies of an underlying model have been developed; this direction is, for example, represented by the various kinds of logic models and probabilistic models (e.g., [14, 3, 15, 22]). 16 Good big picture of the field (LMIR), but not about contribution of the paper (smoothing in LMIR)

Impact-based Summary Paper = “A study of smoothing methods for language models applied to ad hoc information retrieval” 1. Figure 5: Interpolation versus backoff for Jelinek- Mercer (top), Dirichlet smoothing (middle), and absolute discounting (bottom). 2. Second, one can de-couple the two different roles of smoothing by adopting a two stage smoothing strategy in which Dirichlet smoothing is first applied to implement the estimation role and Jelinek-Mercer smoothing is then applied to implement the role of query modeling 3. We find that the backoff performance is more sensitive to the smoothing parameter than that of interpolation, especially in Jelinek-Mercer and Dirichlet prior. 17 Specific to smoothing LM in IR; especially for the concrete smoothing techniques (Dirichlet and JM)

Related Work Text summarization (extractive) –E.g., Luhn ’58; McKeown and Radev ’95; Goldstein et al. ’99; Kraaij et al. ’01 (using language modeling) Technical paper summarization –Paice and Jones ’93; Saggion and Lapalme ’02; Teufel and Moens ’02 Citation context –Ritchie et al. ’06; Schwartz et al. ’07 Anchor text and hyperlink structure Language Modeling for information retrieval –Ponte and Croft ’98; Zhai and Lafferty ’01; Lafferty and Zhai ’01 18

Conclusion Novel problem of Impact-based Summarization Language Modeling approach –Citation context  Impact language model –Accommodating authority and proximity features Feasibility study rather than optimizing Future work –Optimize features/methods –Large scale evaluation 19

Thanks! 20

Feature Study 21 What we have explored: –Unigram language models - doc; citation context; –Length features –Authority features; –Proximity features; –Position-based re-ranking; What we haven’t done: –Redundancy removal (Diversity); –Deeper NLP features; ngram features; –Learning to weight features;

Scientific Literature with Citations … They have been also successfully used in part of speech tagging [7], machine translation [3, 5], information retrieval [4, 20], transliteration [13] and text summarization [14].... For example, Ponte and Croft [20] adopt a language modeling approach to information retrieval. … … While the statistical properties of text corpora are fundamental to the use of probabilistic models, as well as to the use of other recent models [19, 21], the statistical properties … paper Citation paper Citation Citation context 22

Language Modeling in Information Retrieval Estimate document language models –Unigram multinomial distribution of words –θ d : {P(w|d)} Ranking documents with query likelihood –R(doc, Q) ~ P(q|d), a special case of –negative KL-divergence: R(doc, Q) ~ -D(θ q || θ d ) Smooth the document language model –Interpolation-based (p(w|d) ~ p ML (w|d) + p(w|REF)) –Dirichlet smoothing empirically performs well 23