Personalizing Search Jaime Teevan, MIT Susan T. Dumais, MSR and Eric Horvitz, MSR.

Slides:



Advertisements
Similar presentations
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Advertisements

Chapter 5: Introduction to Information Retrieval
Optimizing search engines using clickthrough data
Personalization and Search Jaime Teevan Microsoft Research.
1 Learning User Interaction Models for Predicting Web Search Result Preferences Eugene Agichtein Eric Brill Susan Dumais Robert Ragno Microsoft Research.
Information Retrieval in Practice
Search Engines and Information Retrieval
Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan Susan T.Dumains Eric Horvitz MIT,CSAILMicrosoft Researcher Microsoft.
PROBLEM BEING ATTEMPTED Privacy -Enhancing Personalized Web Search Based on:  User's Existing Private Data Browsing History s Recent Documents 
Seesaw Personalized Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
Chapter 5: Query Operations Baeza-Yates, 1999 Modern Information Retrieval.
Re-ranking Documents Segments To Improve Access To Relevant Content in Information Retrieval Gary Madden Applied Computational Linguistics Dublin City.
Information Access Douglas W. Oard College of Information Studies and Institute for Advanced Computer Studies Design Understanding.
Online Learning for Web Query Generation: Finding Documents Matching a Minority Concept on the Web Rayid Ghani Accenture Technology Labs, USA Rosie Jones.
Affinity Rank Yi Liu, Benyu Zhang, Zheng Chen MSRA.
University of Kansas Department of Electrical Engineering and Computer Science Dr. Susan Gauch April 2005 I T T C Dr. Susan Gauch Personalized Search Based.
Information Retrieval
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Chapter 5: Information Retrieval and Web Search
Overview of Search Engines
Cohort Modeling for Enhanced Personalized Search Jinyun YanWei ChuRyen White Rutgers University Microsoft BingMicrosoft Research.
Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan.
Information Re-Retrieval Repeat Queries in Yahoo’s Logs Jaime Teevan (MSR), Eytan Adar (UW), Rosie Jones and Mike Potts (Yahoo) Presented by Hugo Zaragoza.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Search Engines and Information Retrieval Chapter 1.
Georg Buscher German Research Center for Artificial Intelligence (DFKI) Knowledge Management Department Kaiserslautern, Germany SIGIR 07 Doctoral Consortium.
MPI Informatik 1/17 Oberseminar AG5 Result merging in a Peer-to-Peer Web Search Engine Supervisors: Speaker : Sergey Chernov Prof. Gerhard Weikum Christian.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Tag Data and Personalized Information Retrieval 1.
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Hao Wu Nov Outline Introduction Related Work Experiment Methods Results Conclusions & Next Steps.
Implicit Acquisition of Context for Personalization of Information Retrieval Systems Chang Liu, Nicholas J. Belkin School of Communication and Information.
Personalized Search Xiao Liu
Chapter 6: Information Retrieval and Web Search
UCAIR Project Xuehua Shen, Bin Tan, ChengXiang Zhai
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
Search Engine Architecture
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
Less is More Probabilistic Models for Retrieving Fewer Relevant Documents Harr Chen, David R. Karger MIT CSAIL ACM SIGIR 2006 August 9, 2006.
Discovering and Using Groups to Improve Personalized Search Jaime Teevan, Merrie Morris, Steve Bush Microsoft Research.
IR Theory: Relevance Feedback. Relevance Feedback: Example  Initial Results Search Engine2.
 Examine two basic sources for implicit relevance feedback on the segment level for search personalization. Eye tracking Display time.
Personalizing Web Search using Long Term Browsing History Nicolaas Matthijs, Cambridge Filip Radlinski, Microsoft In Proceedings of WSDM
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Personalization with user’s local data Personalizing Search via Automated Analysis of Interests and Activities 1 Sungjick Lee Department of Electrical.
Qi Guo Emory University Ryen White, Susan Dumais, Jue Wang, Blake Anderson Microsoft Presented by Tetsuya Sakai, Microsoft Research.
CiteSight: Contextual Citation Recommendation with Differential Search Avishay Livne 1, Vivek Gokuladas 2, Jaime Teevan 3, Susan Dumais 3, Eytan Adar 1.
Implicit User Modeling for Personalized Search Xuehua Shen, Bin Tan, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
COLLABORATIVE SEARCH TECHNIQUES Submitted By: Shikha Singla MIT-872-2K11 M.Tech(2 nd Sem) Information Technology.
Post-Ranking query suggestion by diversifying search Chao Wang.
Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.
1 Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan, MIT Susan T. Dumais, Microsoft Eric Horvitz, Microsoft SIGIR 2005.
Seesaw Personalized Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
Potential for Personalization Transactions on Computer-Human Interaction, 17(1), March 2010 Data Mining for Understanding User Needs Jaime Teevan, Susan.
SEARCH AND CONTEXT Susan Dumais, Microsoft Research INFO 320.
Information Retrieval in Practice
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Search Engine Architecture
Simultaneous Support for Finding and Re-Finding
Search Engine Architecture
Augmenting (personal) IR
SIS: A system for Personal Information Retrieval and Re-Use
Author: Kazunari Sugiyama, etc. (WWW2004)
CS246: Leveraging User Feedback
Presentation transcript:

Personalizing Search Jaime Teevan, MIT Susan T. Dumais, MSR and Eric Horvitz, MSR

Relevant result “pia workshop” Query:

Outline Approaches to personalization The PS algorithm Evaluation Results Future work

Approaches to Personalization Content of user profile  Long-term interests Liu, et al. [14], Compass Filter [13]  Short-term interests Query refinement [2,12,15], Watson [4] How user profile is developed  Explicit Relevance feedback [19], query refinement [2,12,15]  Implicit Query history [20, 22], browsing history [16, 23] Very rich user profile

PS Search Engine query

PS Search Engine query dog cat monkey banana food baby infant child boy girl forest hiking walking gorp baby infant child boy girl csail mit artificial research robot web search retrieval ir hunt

PS Search Engine query Search results page web search retrieval ir hunt 1.3

Calculating a Document’s Score Based on standard tf.idf Score = Σ tf i * w i web search retrieval ir hunt 1.3

Calculating a Document’s Score Based on standard tf.idf Score = Σ tf i * w i Σ

N nini Calculating a Document’s Score Based on standard tf.idf Score = Σ tf i * w i World (N) (n i ) w i = log Σ 1.3

N nini Calculating a Document’s Score Based on standard tf.idf Score = Σ tf i * w i (N) (n i ) w i = log World riri R (r i +0.5)(N-n i -R+r i +0.5) (n i -r i +0.5)(R-r i +0.5) w i = log † † From Sparck Jones, Walker and Roberson, 1998 [21]. ’ ’ Where: N = N+R, n i = n i +r i ’ ’ Client (r i +0.5)(N-n i -R+r i +0.5) (n i -r i +0.5)(R-r i +0.5) w i = log ’

Finding the Parameter Values Corpus representation (N, n i )  How common is the term in general?  Web vs. result set User representation (R, r i )  How well does it represent the user’s interest?  All vs. recent vs. Web vs. queries vs. none Document representation  What terms to sum over?  Full document vs. snippet web search retrieval ir hunt

Building a Test Bed 15 evaluators x ~10 queries  131 queries total Personally meaningful queries  Selected from a list  Queries issued earlier (kept diary) Evaluate 50 results for each query  Highly relevant / relevant / irrelevant Index of personal information

Evaluating Personalized Search Measure algorithm quality  DCG(i) = { Look at one parameter at a time  67 different parameter combinations!  Hold other parameters constant and vary one Look at best parameter combination  Compare with various baselines Gain(i), DCG (i–1) + Gain(i)/log(i), if i = 1 otherwise

Analysis of Parameters User

Analysis of Parameters CorpusUserDocument

PS Improves Text Retrieval No model Relevance Feedback Personalized Search

Text Features Not Enough

Take Advantage of Web Ranking PS+Web

Summary Personalization of Web search  Result re-ranking  User’s documents as relevance feedback Rich representations important  Rich user profile particularly important  Efficiency hacks possible  Need to incorporate features beyond text

Further Exploration Improved non-text components  Usage data  Personalized PageRank Learn parameters  Based on individual  Based on query  Based on results UIs for user control

User Interface Issues Make personalization transparent Give user control over personalization  Slider between Web and personalized results  Allows for background computation Exacerbates problem with re-finding  Results change as user model changes  Thesis research – Re:Search Engine

Thank you!

Much Room for Improvement Group ranking  Best improves on Web by 23%  More people  Less improvement Personal ranking  Best improves on Web by 38%  Remains constant Potential for Personalization

Evaluating Personalized Search Query selection  Chose from 10 pre-selected queries  Previously issued query cancer Microsoft traffic … bison frise Red Sox airlines … Las Vegas rice McDonalds … Pre-selected 53 pre-selected (2-9/query) Total: 137 Joe Mary

Making PS Practical Learn most about personalization by deploying a system Best algorithm reasonably efficient Merging server and client  Query expansion Get more relevant results in the set to be re-ranked  Design snippets for personalization