Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.

Slides:

Advertisements

Similar presentations

Recommender Systems & Collaborative Filtering

Advertisements

Google News Personalization: Scalable Online Collaborative Filtering

Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.

Prediction Modeling for Personalization & Recommender Systems Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.

1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.

A Machine Learning Approach for Improved BM25 Retrieval

Search Engines Information Retrieval in Practice All slides ©Addison Wesley, 2008.

Catching the Drift: Learning Broad Matches from Clickthrough Data Sonal Gupta, Mikhail Bilenko, Matthew Richardson University of Texas at Austin, Microsoft.

Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.

Searchable Web sites Recommendation Date : 2012/2/20 Source : WSDM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh Jia-ling 1.

Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.

Evaluating Search Engine

Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan Susan T.Dumains Eric Horvitz MIT,CSAILMicrosoft Researcher Microsoft.

COMP 630L Paper Presentation Javy Hoi Ying Lau. Selected Paper “A Large Scale Evaluation and Analysis of Personalized Search Strategies” By Zhicheng Dou,

Topic-Sensitive PageRank Taher H. Haveliwala. PageRank Importance is propagated A global ranking vector is pre-computed.

Recommender Systems; Social Information Filtering.

University of Kansas Department of Electrical Engineering and Computer Science Dr. Susan Gauch April 2005 I T T C Dr. Susan Gauch Personalized Search Based.

Scalable Text Mining with Sparse Generative Models

A Search-based Method for Forecasting Ad Impression in Contextual Advertising Defense.

Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.

Overview of Web Data Mining and Applications Part I

Cohort Modeling for Enhanced Personalized Search Jinyun YanWei ChuRyen White Rutgers University Microsoft BingMicrosoft Research.

Improving web image search results using query-relative classifiers Josip Krapacy Moray Allanyy Jakob Verbeeky Fr´ed´eric Jurieyy.

Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.

Personalized Information Retrieval in Context David Vallet Universidad Autónoma de Madrid, Escuela Politécnica Superior,Spain.

A Comparative Study of Search Result Diversification Methods Wei Zheng and Hui Fang University of Delaware, Newark DE 19716, USA

1 Cross-Lingual Query Suggestion Using Query Logs of Different Languages SIGIR 07.

Group Recommendations with Rank Aggregation and Collaborative Filtering Linas Baltrunas, Tadas Makcinskas, Francesco Ricci Free University of Bozen-Bolzano.

A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.

Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.

Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.

EMIS 8381 – Spring Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.

CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.

Personalized Search Cheng Cheng (cc2999) Department of Computer Science Columbia University A Large Scale Evaluation and Analysis of Personalized Search.

Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor ： Jia Ling, Koh Speaker ： SHENG HONG, CHUNG.

Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.

Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.

Google News Personalization: Scalable Online Collaborative Filtering

Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.

Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.

Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.

1 Web-Page Summarization Using Clickthrough Data* JianTao Sun, Yuchang Lu Dept. of Computer Science TsingHua University Beijing , China Dou Shen,

Personalizing Web Search using Long Term Browsing History Nicolaas Matthijs, Cambridge Filip Radlinski, Microsoft In Proceedings of WSDM

Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI

Personalization with user’s local data Personalizing Search via Automated Analysis of Interests and Activities 1 Sungjick Lee Department of Electrical.

Algorithmic Detection of Semantic Similarity WWW 2005.

Chapter 8 Evaluating Search Engine. Evaluation n Evaluation is key to building effective and efficient search engines  Measurement usually carried out.

COLLABORATIVE SEARCH TECHNIQUES Submitted By: Shikha Singla MIT-872-2K11 M.Tech(2 nd Sem) Information Technology.

+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.

Post-Ranking query suggestion by diversifying search Chao Wang.

Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.

Web Search Personalization with Ontological User Profile Advisor: Dr. Jai-Ling Koh Speaker: Shun-hong Sie.

Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.

11 A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 1, Michael R. Lyu 1, Irwin King 1,2 1 The Chinese.

Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.

Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:

Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.

Usefulness of Quality Click- through Data for Training Craig Macdonald, ladh Ounis Department of Computing Science University of Glasgow, Scotland, UK.

University Of Seoul Ubiquitous Sensor Network Lab Query Dependent Pseudo-Relevance Feedback based on Wikipedia 전자전기컴퓨터공학 부 USN 연구실 G

User Modeling for Personal Assistant

Recommender Systems & Collaborative Filtering

Personalized Social Image Recommendation

Author: Kazunari Sugiyama, etc. (WWW2004)

Martin Rajman, EPFL Switzerland & Martin Vesely, CERN Switzerland

Movie Recommendation System

Recommender Systems Copyright: Dietmar Jannah, Markus Zanker and Gerhard Friedrich (slides based on their IJCAI talk „Tutorial: Recommender Systems”)

Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,

Relevance and Reinforcement in Interactive Browsing

INF 141: Information Retrieval

Presentation transcript:

Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng Gao, Shujie Li Research Department, GenieKnows.com September 17, 2009

About GenieKnows.com Based in Halifax, Nova Scotia, Canada Established in 1999 ~35 People Online Advertising Network -100 to 150 million searches per day Search Engines (local, health, games) Content Portals

3

About Tony Abou-Assaleh Director of Research at GenieKnows -Since Build search engines -Other internal R&D initiatives Lecturer at Brock University, St. Catharines, Canada – 2006 GNU grep official maintainer

Agenda Introduction Related Work Our Approach Experiments Conclusion & Future Work

Agenda Introduction Related Work Our Approach Experiments Conclusion & Future Work

Introduction Local Search -What? Why? Personalization -What? How? Why? Assumptions Objectives

What is Local Search? Local Search vs. Business Directory Contains: -Internet Yellow Pages (IYP) Business Directory -Enhanced business listings -Map -Ratings and Reviews -Articles and editorials -Pictures and rich media -Social Networking

9

Why Local Search? Good for end users Good for businesses Good for our company Interesting research problems No market leader Could be the next big thing

What is Personalization? No personalization: -Everybody gets the same results Personalization: -User may see different results Personalization vs. customization

What to Personalize? Ranking Snippets Presentation Collection Recommendations

How to Personalize? Search history Click history User profiles – interests Collaborative filtering

Why Personalization? One size does not fit all Ambiguity of short queries Improve per-user precision Improve user experience Targeted advertising $$$

Assumptions Interests are location dependent Long-term interests Implicit relevance feedback Relevance in location dependent Relevance is category dependent User cooperation Single-user personalization

Objectives General framework for personalization of spatial- keyword queries User profile representation Personalized ranking Improve over baseline system

Agenda Introduction Related Work Our Approach Experiments Conclusion & Future Work

Related Work User Profile Modeling Personalized Ranking

User Profile Modeling Topic based (Liu et al, 2002) -Vector of interests -Explicit: how to collect data? -Implicit: relevance feedback Click based (Li et al, 2008) -Implicit feedback from click through data -Require a lot of data Ontological profiles (Sieg et al, 2007) Hierarchical representations (Huete et al, 2008)

Personalized Ranking Web, desktop, and enterprise search Local search? Strategies: -Implicit -Clicks as relevance feedback -Query topic identification -Collaborative filtering -Learning algorithms

Agenda Introduction Related Work Our Approach Experiments Conclusion & Future Work

Our Approach Problem formulation Ranking Function Decomposition Business Features User Profile User Interest Function Business-specific Preference Function

Problem Formulation Query: keywords + spatial (geographic) context Ranking function: Relevant Results ✕ User Profiles ✕ Location  Real Number Online personalized ranking: -Optimization of an objective function over rank scores

Ranking Function Decomposition Final rank = weighted combination of: -Baseline rank -User rank -Business rank

Ranking Function Decomposition Final rank = weighted combination of: -Baseline rank -User rank -Business rank

Baseline Rank Okapi BM25F on textual fields Distance from query centre Other non-textual features

Business Features List of categories -18 top level, 275 second level Terms -Vector-space model Location -Geocoded address Meta data -Year established, number of employees, languages, etc

User Profile Local Profile -For each geographic region (city) -For each category -Needs at least 1 query Global Profile -Aggregation of local profiles -Used for new city and category combination

Local Profile Category interest score -Fraction of queries in this category -Fraction of clicks in this category Number of queries Terms vector-space model Clicks (business, timestamp)

Global Profile Estimated global category interest score -Aggregated over all cities -Weighted combination of interest scores -Weights derived from query volume -Estimated using a Dirichlet Distribution

Ranking Function Decomposition Final rank = weighted combination of: -Baseline rank -User rank -Business rank

User Interest Function Rank (business, user, query) = Category interest score ✕ Term similarity ✕ Click count Averaged over all categories of the business Term similarity: cosine similarity Click count: capture navigational queries

Ranking Function Decomposition Final rank = weighted combination of: -Baseline rank -User rank -Business rank

Business-specific Preference Function Rank (business, user, city, category) = Sum of query dependent click scores + Sum of query independent click scores Click scores are time discounted -1 year windows -1 week intervals Parameter to control relative importance of query- dependency

Agenda Introduction Related Work Our Approach Experiments Conclusion & Future Work

Experiments Data Procedure Results Discussion

Data 22 Million businesses 30 participants Only 12 with sufficient queries 2388 queries 1653 unique queries

Procedure Types of tasks: -Navigational, browsing, information seeking 5-point explicit relevance feedback Ranking algorithm -Baseline vs. personalized -Alternates every 2 minutes -Identical interface -No bootstrapping phase

Results Measures: -Mean Average Precision – MAP -Mean Reciprocal Rank – MRR -Normalized Discounted Cumulative Gain – nDCG

Results

Results Welch two-sample t-test: -Significant improvement -MAP: 95% confidence, p= MRR: 95% confidence, p=

Results 16 randomly selected queries Not significant

Agenda Introduction Related Work Our Approach Experiments Conclusion & Future Work

Contributions Personalization framework for spatial-keyword queries Model for user profiles Local and global profiles Address data sparseness problem Personalized ranking function -Interests, clicks, terms Empirical evaluation -Significant improvement over the baseline system

Future Work Modeling of short-term interests Modeling of recurring interests “Learning to Rank” algorithms Multi-user personalization -Recommender system Incorporate on

Thanks you!

Questions Can I access your data? Did you do parameter tuning? Did users try to test/cheat the system? What is the computational complexity? Any confounding variables?