L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011.

Slides:



Advertisements
Similar presentations
Accurately Interpreting Clickthrough Data as Implicit Feedback Joachims, Granka, Pan, Hembrooke, Gay Paper Presentation: Vinay Goel 10/27/05.
Advertisements

Learning to Suggest: A Machine Learning Framework for Ranking Query Suggestions Date: 2013/02/18 Author: Umut Ozertem, Olivier Chapelle, Pinar Donmez,
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
L EARNING TO D IVERSIFY USING IMPLICIT FEEDBACK Karthik Raman, Pannaga Shivaswamy & Thorsten Joachims Cornell University 1.
Catching the Drift: Learning Broad Matches from Clickthrough Data Sonal Gupta, Mikhail Bilenko, Matthew Richardson University of Texas at Austin, Microsoft.
Contextual Advertising by Combining Relevance with Click Feedback D. Chakrabarti D. Agarwal V. Josifovski.
Toward Whole-Session Relevance: Exploring Intrinsic Diversity in Web Search Date: 2014/5/20 Author: Karthik Raman, Paul N. Bennett, Kevyn Collins-Thompson.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
A Quality Focused Crawler for Health Information Tim Tang.
Evaluating Search Engine
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
Learning to Rank: New Techniques and Applications Martin Szummer Microsoft Research Cambridge, UK.
Nisha Ranga TURNING DOWN THE NOISE IN BLOGOSPHERE.
Authoritative Sources in a Hyperlinked Environment Jon M. Kleinberg Presented By: Talin Kevorkian Summer June
Affinity Rank Yi Liu, Benyu Zhang, Zheng Chen MSRA.
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Search Engines and Information Retrieval Chapter 1.
SEEKING STATEMENT-SUPPORTING TOP-K WITNESSES Date: 2012/03/12 Source: Steffen Metzger (CIKM’11) Speaker: Er-gang Liu Advisor: Dr. Jia-ling Koh 1.
1 Wikification CSE 6339 (Section 002) Abhijit Tendulkar.
A Comparative Study of Search Result Diversification Methods Wei Zheng and Hui Fang University of Delaware, Newark DE 19716, USA
Newsjunkie: Providing Personalized Newsfeeds via Analysis of Information Novelty Gabrilovich et.al WWW2004.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
1 University of Qom Information Retrieval Course Web Search (Link Analysis) Based on:
Mining the Web to Create Minority Language Corpora Rayid Ghani Accenture Technology Labs - Research Rosie Jones Carnegie Mellon University Dunja Mladenic.
Similar Document Search and Recommendation Vidhya Govindaraju, Krishnan Ramanathan HP Labs, Bangalore, India JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE.
A Machine Learning Approach to Sentence Ordering for Multidocument Summarization and Its Evaluation D. Bollegala, N. Okazaki and M. Ishizuka The University.
Google News Personalization: Scalable Online Collaborative Filtering
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
INTERESTING NUGGETS AND THEIR IMPACT ON DEFINITIONAL QUESTION ANSWERING Kian-Wei Kor, Tat-Seng Chua Department of Computer Science School of Computing.
Improving Cloaking Detection Using Search Query Popularity and Monetizability Kumar Chellapilla and David M Chickering Live Labs, Microsoft.
Wei Feng , Jiawei Han, Jianyong Wang , Charu Aggarwal , Jianbin Huang
Probabilistic Models of Novel Document Rankings for Faceted Topic Retrieval Ben Cartrette and Praveen Chandar Dept. of Computer and Information Science.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
How Useful are Your Comments? Analyzing and Predicting YouTube Comments and Comment Ratings Stefan Siersdorfer, Sergiu Chelaru, Wolfgang Nejdl, Jose San.
Positional Relevance Model for Pseudo–Relevance Feedback Yuanhua Lv & ChengXiang Zhai Department of Computer Science, UIUC Presented by Bo Man 2014/11/18.
Linking Organizational Social Networking Profiles PROJECT ID: H JEROME CHENG ZHI KAI (A H ) 1.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
Carnegie Mellon Novelty and Redundancy Detection in Adaptive Filtering Yi Zhang, Jamie Callan, Thomas Minka Carnegie Mellon University {yiz, callan,
Q&A sites Martice Pitts PUR About 57.9 million visits a month About 57.9 million visits a month Ask.com is a free site Formally known as Ask Jeeves.
Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning Rong Jin, Joyce Y. Chai Michigan State University Luo Si Carnegie.
+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:
How to write a useful abstract By Janis Ramey Report by Heidi Christensen.
Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.
The Loquacious ( 愛說話 ) User: A Document-Independent Source of Terms for Query Expansion Diane Kelly et al. University of North Carolina at Chapel Hill.
11 A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 1, Michael R. Lyu 1, Irwin King 1,2 1 The Chinese.
10 th Grade Research Paper. Instructions  Read over the instructions.  Are there any questions about what is on the paper?
Micro Topics, Subtopics & Argument in Miel-Con Paragraphs.
Leveraging Knowledge Bases for Contextual Entity Exploration Categories Date:2015/09/17 Author:Joonseok Lee, Ariel Fuxman, Bo Zhao, Yuanhua Lv Source:KDD'15.
By Pamela Drake SEARCH ENGINE OPTIMIZATION. WHAT IS SEO? Search engine optimization (SEO) is the process of affecting the visibility of a website or a.
LEARNING FROM THE PAST: ANSWERING NEW QUESTIONS WITH PAST ANSWERS Date: 2012/11/22 Author: Anna Shtok, Gideon Dror, Yoelle Maarek, Idan Szpektor Source:
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
A Framework to Predict the Quality of Answers with Non-Textual Features Jiwoon Jeon, W. Bruce Croft(University of Massachusetts-Amherst) Joon Ho Lee (Soongsil.
GENERATING RELEVANT AND DIVERSE QUERY PHRASE SUGGESTIONS USING TOPICAL N-GRAMS ELENA HIRST.
Identifying Spam Web Pages Based on Content Similarity Sole Pera CS 653 – Term paper project.
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.
Ariel Fuxman, Panayiotis Tsaparas, Kannan Achan, Rakesh Agrawal (2008) - Akanksha Saxena 1.
Opinion spam and Analysis 소프트웨어공학 연구실 G 최효린 1 / 35.
QUERY-PERFORMANCE PREDICTION: SETTING THE EXPECTATIONS STRAIGHT Date : 2014/08/18 Author : Fiana Raiber, Oren Kurland Source : SIGIR’14 Advisor : Jia-ling.
Evaluation Anisio Lacerda.
Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan
Evaluating Information Retrieval Systems
Citation-based Extraction of Core Contents from Biomedical Articles
A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 22, Feb, 2010 Department of Computer.
Google Analytics Data for “Going Out There” website
Menghindari Penolakan Editors dan Reviewers
Presentation transcript:

L EARNING TO M ODEL R ELATEDNESS FOR N EWS R ECOMMENDATION Author: Yuanhua Lv and et al. UIUC, Yahoo! labs Presenter: Robbie 1 WWW 2011

O UTLINE Introduction and Motivation Model Relatedness Experiment Conclusion 2

I NTRODUCTION Post-Click news Recommendation Seed news Candidate news 3

M OTIVATION Promote users’ navigation on the visited website Yahoo!, Google focus on initial clicks, post- click news recommendation largely under- explored Mainly depend on editors’ manual effort No existing method proposed to model relatedness directly 4

M ODEL RELATEDNESS Four aspects Relevance Novelty Connection clarity Transition smoothness 5

R ELEVANCE AND N OVELTY Similar but not duplicate Novelty often in contrast to relevance Use same set of features to measure them cosine similarity BM25 language models with Dirichlet prior smoothing language models with Jelinek-Mercer smoothing 6

C ONNECTION C LARITY Relevance and novelty can only model word overlap between two articles s and d Example: s: White House: Obamas earn $5.5 million in 2009 d: Obama’s oil spill bill seeks $118 million, oil company s and d must be topically cohesive Connection clarity defines topical cohesion of two news 7

C ONNECTION C LARITY 8

T RANSITION S MOOTHNESS Example: s: Toyota dismisses account of runaway Prius d1: What to do if your car suddenly accelerates d2: Toyota to build Prius at 3 rd Japan plant: report Definition: Measures how well a user’s reading interests can transit from s to d Transition smoothness from s-d to d-s, i.e. from “known” to “novel” 9 Smooth(s, d1) >smooth(s, d2)

T RANSITION S MOOTHNESS 10

L EARNING A RELATEDNESS F UNCTION 11

C ONSTRUCTING T EST C OLLECTION Yahoo! News articles from March 1 st to June 30 th 2010 Each run, randomly generate 549 seed news from June 10 th to June 20 th with at least 2000 visits Perform redundancy detection 12

E DITORIAL J UDGMENTS A group of professional news editors from a commercial online news website 4 point relatedness scale “very related”, “somewhat related”, “redundant”, “unrelated”, any document with two different judgments, select a judgment with a higher ratio High agreement in relative relatedness (80.8%),inspire to learn relatedness functions from pair-wise preference information 13

E XPERIMENTS : C OMPARING I NDIVIDUAL R ETRIEVAL M ODELS 14 “Body” is the best ---- title and abstract may lose information Cosine similarity as well as or even better than language models in some cases, but NDCG1 is worst Effective for redundancy detection, which brings redundant documents to the top

E XPERIMENTS : C OMPARING M ACHINE - L EARNED R ELATEDNESS M ODELS 15

E XPERIMENTS : A NALYZING THE U NIFIED R ELATEDNESS M ODEL 16 Cosine similarity significantly worse than BM25 as individual relatedness function, but the most important in the unified model Connection clarity and transition smoothness contribute 7/15 together

CONCLUSIONS First attempt at post-click news recommendation Propose 4 aspects to characterize news relatedness Future work Incorporate into the unified relatedness function non- content features Document and user adaptive measures will be more accurate 17

RELEVANCE AND NOVELTY Problem: Top ranked documents may be redundant and unrelated articles Solution : Passage retrieval 18

E XPERIMENTS : P ASSAGE R ETRIEVAL E VALUATION 19 o Fixed-length(250 empirically) arbitrary passage retrieval o Passage retrieval doesn’t help in most cases o Improve NDCG1 clearly. --- Probably relaxes the concern of ranking redundant documents on top