Learning to Personalize Query Auto-Completion

Slides:



Advertisements
Similar presentations
Context-Sensitive Query Auto-Completion AUTHORS:NAAMA KRAUS AND ZIV BAR-YOSSEF DATE OF PUBLICATION:NOVEMBER 2010 SPEAKER:RISHU GUPTA 1.
Advertisements

Active Learning with Feedback on Both Features and Instances H. Raghavan, O. Madani and R. Jones Journal of Machine Learning Research 7 (2006) Presented.
Temporal Query Log Profiling to Improve Web Search Ranking Alexander Kotov (UIUC) Pranam Kolari, Yi Chang (Yahoo!) Lei Duan (Microsoft)
DQR : A Probabilistic Approach to Diversified Query recommendation Date: 2013/05/20 Author: Ruirui Li, Ben Kao, Bin Bi, Reynold Cheng, Eric Lo Source:
Introduction to Information Retrieval
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
SIGIR 2013 Recap September 25, 2013.
Time-sensitive Personalized Query Auto-Completion
WSCD INTRODUCTION  Query suggestion has often been described as the process of making a user query resemble more closely the documents it is expected.
Contextual Advertising by Combining Relevance with Click Feedback D. Chakrabarti D. Agarwal V. Josifovski.
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Optimizing Estimated Loss Reduction for Active Sampling in Rank Learning Presented by Pinar Donmez joint work with Jaime G. Carbonell Language Technologies.
Evaluating Search Engine
Using Trees to Depict a Forest Bin Liu, H. V. Jagadish EECS, University of Michigan, Ann Arbor Presented by Sergey Shepshelvich 1.
Learning to Rank: New Techniques and Applications Martin Szummer Microsoft Research Cambridge, UK.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
1 Statistical correlation analysis in image retrieval Reporter : Erica Li 2004/9/30.
1 Ranked Queries over sources with Boolean Query Interfaces without Ranking Support Vagelis Hristidis, Florida International University Yuheng Hu, Arizona.
Re-ranking Documents Segments To Improve Access To Relevant Content in Information Retrieval Gary Madden Applied Computational Linguistics Dublin City.
Collaborative Ordinal Regression Shipeng Yu Joint work with Kai Yu, Volker Tresp and Hans-Peter Kriegel University of Munich, Germany Siemens Corporate.
Online Learning for Web Query Generation: Finding Documents Matching a Minority Concept on the Web Rayid Ghani Accenture Technology Labs, USA Rosie Jones.
The Relevance Model  A distribution over terms, given information need I, (Lavrenko and Croft 2001). For term r, P(I) can be dropped w/o affecting the.
Cohort Modeling for Enhanced Personalized Search Jinyun YanWei ChuRyen White Rutgers University Microsoft BingMicrosoft Research.
Adapting Deep RankNet for Personalized Search
Quality-aware Collaborative Question Answering: Methods and Evaluation Maggy Anastasia Suryanto, Ee-Peng Lim Singapore Management University Aixin Sun.
1 A Discriminative Approach to Topic- Based Citation Recommendation Jie Tang and Jing Zhang Presented by Pei Li Knowledge Engineering Group, Dept. of Computer.
MINING RELATED QUERIES FROM SEARCH ENGINE QUERY LOGS Xiaodong Shi and Christopher C. Yang Definitions: Query Record: A query record represents the submission.
 An important problem in sponsored search advertising is keyword generation, which bridges the gap between the keywords bidded by advertisers and queried.
Understanding and Predicting Graded Search Satisfaction Tang Yuk Yu 1.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
PERSONALIZED SEARCH Ram Nithin Baalay. Personalized Search? Search Engine: A Vital Need Next level of Intelligent Information Retrieval. Retrieval of.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
« Pruning Policies for Two-Tiered Inverted Index with Correctness Guarantee » Proceedings of the 30th annual international ACM SIGIR, Amsterdam 2007) A.
Interpreting Advertiser Intent in Sponsored Search BHANU C VATTIKONDA, SANTHOSH KODIPAKA, HONGYAN ZHOU, VACHA DAVE, SAIKAT GUHA, ALEX C SNOEREN 1.
Presenter: Lung-Hao Lee ( 李龍豪 ) January 7, 309.
RecBench: Benchmarks for Evaluating Performance of Recommender System Architectures Justin Levandoski Michael D. Ekstrand Michael J. Ludwig Ahmed Eldawy.
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
AN EFFECTIVE STATISTICAL APPROACH TO BLOG POST OPINION RETRIEVAL Ben He Craig Macdonald Iadh Ounis University of Glasgow Jiyin He University of Amsterdam.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Post-Ranking query suggestion by diversifying search Chao Wang.
More Than Relevance: High Utility Query Recommendation By Mining Users' Search Behaviors Xiaofei Zhu, Jiafeng Guo, Xueqi Cheng, Yanyan Lan Institute of.
ASSOCIATIVE BROWSING Evaluating 1 Jinyoung Kim / W. Bruce Croft / David Smith for Personal Information.
Finding the Right Facts in the Crowd: Factoid Question Answering over Social Media J. Bian, Y. Liu, E. Agichtein, and H. Zha ACM WWW, 2008.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
Multi-Aspect Query Summarization by Composite Query Date: 2013/03/11 Author: Wei Song, Qing Yu, Zhiheng Xu, Ting Liu, Sheng Li, Ji-Rong Wen Source: SIGIR.
DISTRIBUTED INFORMATION RETRIEVAL Lee Won Hee.
TO Each His Own: Personalized Content Selection Based on Text Comprehensibility Date: 2013/01/24 Author: Chenhao Tan, Evgeniy Gabrilovich, Bo Pang Source:
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
Federated text retrieval from uncooperative overlapped collections Milad Shokouhi, RMIT University, Melbourne, Australia Justin Zobel, RMIT University,
LEARNING IN A PAIRWISE TERM-TERM PROXIMITY FRAMEWORK FOR INFORMATION RETRIEVAL Ronan Cummins, Colm O’Riordan (SIGIR’09) Speaker : Yi-Ling Tai Date : 2010/03/15.
Information Retrieval in Practice
Learning Software Behavior for Automated Diagnosis
Tingdan Luo 05/02/2016 Interactively Optimizing Information Retrieval Systems as a Dueling Bandits Problem Tingdan Luo
Accounting for the relative importance of objects in image retrieval
Search Pages and Results
Author: Kazunari Sugiyama, etc. (WWW2004)
Weakly Learning to Match Experts in Online Community
Agenda What is SEO ? How Do Search Engines Work? Measuring SEO success ? On Page SEO – Basic Practices? Technical SEO - Source Code. Off Page SEO – Social.
Example: Academic Search
Date : 2013/1/10 Author : Lanbo Zhang, Yi Zhang, Yunfei Chen
A novel Web usage mining approach for search engines
Relevance and Reinforcement in Interactive Browsing
Topic: Semantic Text Mining
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
Ranking using Multiple Document Types in Desktop Search
CoXML: A Cooperative XML Query Answering System
Presentation transcript:

Learning to Personalize Query Auto-Completion Milad Shokouhi PAPER PRESENTATION Sandesh Sanjay Gade

Query auto-completion (QAC) QAC is one of the most prominent features of modern search engines. List of query candidates generated according to the prefix entered by the user in the search box and is updated on each new stroke. Two step process: Filtering Facilitated by using data structures such as prefix-trees (tries) that allow efficient lookups by prefix matching. Ranking Results of the filtering process that match the prefix are then ordered according to their expected likelihood.

Traditional Pipeline filtered relevant list ranked relevant list prefix RANK FILTER PARAMETERS PARAMETERS

i filtered relevant ranked list Ikea RANK Imdb prefix FILTER Instagram Ipad Ipod … Imdb Ikea Instagram ipad i

Query Auto-completion Query Suggestion Query Auto-completion Personalized Search

Personalized Search Query Suggestion Query Auto-completion Maximum Likelihood Estimation

Personalized Auto-Completion Learning to personalize auto-completion ranking of query suggestions for prefixes, is analogous to learning to personalize search results. Aim: Using a training dataset that consists of a set of labeled query-document pairs, the goal is to learn a ranking model by optimizing a cost function that is expected to be correlated with user satisfaction.

Experiment Setup Winner of the Yahoo! 2010 Learning to Rank Challenge. TRAINING Lambda-MART Rank algorithm [Burge at al., 2006] as the ranking algorithm. Winner of the Yahoo! 2010 Learning to Rank Challenge. BASELINE MostPopularCompletion(MPC) method [Bar-Yossef and Kraus, 2011] used as a baseline. EVALUATION METRIC Mean-Reciprocal-Rank. The top-10 candidates returned by the MPC model as input to the ranker. Work’s personalized auto-completion ranker is built on top of the MPC algorithm.

Datasets AOL testbed Bing testbed March 1, 2006 – May 31, 2006 QUERY SAMPLE DURATION March 1, 2006 – May 31, 2006 January 1, 2013 – January 9, 2013 UNIQUE QUERIES 128,620 699,862 UNIQUE USERS 657,426 196,190

List of features used

d Short History Features filtered relevant list ranked relevant list prefix dictionary Driving directions Deal or no deal dell RANK dell dictionary Driving directions Deal or no deal FILTER d The personalization model has picked up on the high lexical similarity between dell (candidate) and dell computer (2nd last query in the session) for re-ranking.

n Long History Features filtered Relevant list ranked Relevant list nascar netflix nicks.com nascar.com nextel northwest airlines netflix nascar northwest airlines nicks.com nascar.com nextel prefix FILTER RANK n Compared to the previous experiment based on short-term features, the MRR gains are higher. Personalization model realizes that Netflix has appeared twice in user’s search history before and boosts it to position one.

Age* Gender Region * Biggest mover in personalized auto-completion rankings

Evaluation Summary Long history feature (Bing and AOL) BEST PERFORMING FEATURE Long history feature (Bing and AOL) SECOND BEST Location features BEST COMBINATION (BING TESTBED) BEST COMBINATION (AOL TESTBED) All user-specific and demographic features Short and long-history features

Conclusions