Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.


Copyright 2008 by CEBT

Introduction
• Search and recommendation systems use contextual information to model users' interests effectively
• This paper examines the effectiveness of five sources of contextual information for user interest modeling
  - Social, historic, task, collection, and user interaction
• The paper evaluates the utility of these sources and of the overlaps between them
  - Overlapping contexts outperform any single source in isolation

IDS Lab. Seminar, Center for E-Business Technology

Introduction
• Contextual information
  - Interaction: recent interaction behavior preceding the current page
  - Collection: pages with hyperlinks to the current page
  - Task: pages related to the current page by sharing the same search queries
  - Historic: the long-term interests of the current user
  - Social: the combined interests of other users who also visit the current page

Log Data
• Browse trails
  - Extracted from user logs (August 2008 to November 2008)
  - Each consists of a temporally ordered sequence of URLs visited by a user within one Web browser instance or browser tab
  - A trail terminates on either of:
    - A period of user inactivity of 30 or more minutes
    - Termination of the browser instance or tab
• Context trails
  - Extracted from the set of browse trails
  - Each comprises a terminal URL u_t and the list of the five Web pages preceding u_t in the browse trail (u_{t-5}, ..., u_{t-1})
  - The five pages form the immediate session-based interaction context
• T_h: the set of terminal URLs
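The trail rules above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the `(timestamp, url)` input shape, and the choice to treat every page with five predecessors as a candidate terminal URL are hypothetical, since the slides do not say how terminal URLs were selected. Browser or tab termination is assumed to be handled upstream by passing one instance's visits at a time.

```python
from datetime import datetime, timedelta

INACTIVITY = timedelta(minutes=30)  # a trail ends after 30 or more minutes of inactivity
CONTEXT_SIZE = 5                    # number of pages preceding the terminal URL


def split_into_trails(visits):
    """Split one browser instance's visits into browse trails.

    visits: list of (timestamp, url) tuples sorted by timestamp (assumed shape).
    """
    trails, current, last_time = [], [], None
    for ts, url in visits:
        # Start a new trail when the gap since the previous visit is too long.
        if last_time is not None and ts - last_time >= INACTIVITY:
            trails.append(current)
            current = []
        current.append(url)
        last_time = ts
    if current:
        trails.append(current)
    return trails


def context_trails(trail):
    """Return (five preceding pages, terminal URL) pairs for one browse trail.

    Assumption: every page with at least five predecessors is a candidate u_t.
    """
    return [(trail[i - CONTEXT_SIZE:i], trail[i])
            for i in range(CONTEXT_SIZE, len(trail))]
```

A trail with fewer than six pages yields no context trail under this reading, because u_t needs five predecessors.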

User Interest Models
• All pages extracted from the contexts (interaction, collection, historic, task, and social) are classified into Web categories from the Open Directory Project (ODP)
  - User interests are represented as lists of ODP category labels
  - The ODP labels in each list are ranked by each label's frequency in the context

User Interest Models
• No context (only u_t)
  - One ODP label is assigned to the terminal URL
• Interaction context (u_{t-5}, ..., u_{t-1})
  - One ODP label is assigned to each of the five pages
  - The label frequencies are used to create a ranked list of labels
  - The ranked list is the interest model for the interaction context of u_t
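Building a frequency-ranked label list can be illustrated as follows. This is a minimal sketch: the function name is hypothetical, the labels are assumed to come from an existing page classifier, and the alphabetical tie-break is an assumption, since the slides do not say how ties are resolved.

```python
from collections import Counter


def rank_labels(labels):
    """Rank ODP labels by how often they occur in a context.

    labels: one ODP label per context page (assumed to come from a classifier).
    Ties are broken alphabetically, which is an assumption of this sketch.
    """
    counts = Counter(labels)
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [label for label, _ in ordered]
```

For the interaction context, `labels` would hold the five labels assigned to u_{t-5}, ..., u_{t-1}.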

User Interest Models
• Task context
  - Created from the ODP labels assigned to Web pages visited by other users with the same query (or similar tasks)
  - u_t and a related page u_r share common queries
  - The ranked list of ODP labels is regarded as the task context

User Interest Models
• Collection context
  - Created from the Web pages containing hyperlinks that refer to u_t (the in-links of each u_t)
  - An ODP label is assigned to each in-link
• Historic context
  - Created for each user from their long-term interaction history
  - To create a user's historic context, all Web pages the user visited are classified and assigned ODP labels
• Social context
  - Users who have also visited u_t are found, and their interest models (historic contexts) are combined into a ranked list of ODP labels
  - This list forms the interest model for the social context of u_t

Data Preparation
• Interest model effectiveness may vary with the temporal distance from u_t to some future time point
  - Short: within one hour of u_t
  - Medium: within one day of u_t
  - Long: within one week of u_t
• The futures overlap (e.g., medium contains short)
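The nesting of the three futures can be made concrete with a small sketch. The function name, the `(timestamp, url)` input shape, and the inclusive upper bound on each window are assumptions made for illustration only.

```python
from datetime import datetime, timedelta

# Window lengths as given on the slide; the names are used as dictionary keys.
FUTURE_WINDOWS = {
    "short": timedelta(hours=1),
    "medium": timedelta(days=1),
    "long": timedelta(weeks=1),
}


def future_pages(visits, t_terminal):
    """Collect the pages a user visits after u_t within each future window.

    visits: list of (timestamp, url) tuples (assumed shape).
    t_terminal: timestamp of the visit to the terminal URL u_t.
    """
    return {
        name: [url for ts, url in visits
               if t_terminal < ts <= t_terminal + span]
        for name, span in FUTURE_WINDOWS.items()
    }
```

Because each window starts at u_t, every page in the short future also appears in the medium and long futures.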

Evaluation Methodology
• Find the short-, medium-, and long-term futures and build a ground-truth interest model for each of them (the correct interest models)
• Build user interest models for the different context sources
• Determine the accuracy of the context-based models in predicting the ground truth

Measures
• Whether the top predicted category label pl_1 for a context trail matches its top actual label l_1
• Whether the top predicted category label pl_1 for a context trail matches any of its top three actual labels l_1, l_2, l_3
• Mean reciprocal rank (MRR)
  - If l_1 matches pl_i, the score assigned is the reciprocal of the prediction rank position, 1/i
  - The scores are averaged to compute the final MRR
• Normalized discounted cumulative gain
  - Emphasizes highly relevant ODP labels appearing early in the result list
• F1
  - Harmonic mean of precision and recall
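The first three measures can be sketched directly from their definitions. The function names are hypothetical, and scoring a trail as 0 when l_1 never appears among the predictions is an assumption the slide does not state.

```python
def reciprocal_rank(predicted, top_actual):
    """Return 1/i where i is the rank at which l_1 appears in the predictions."""
    for i, label in enumerate(predicted, start=1):
        if label == top_actual:
            return 1.0 / i
    return 0.0  # assumption: no credit when l_1 is not predicted at all


def mean_reciprocal_rank(trails):
    """Average the reciprocal rank over context trails.

    trails: non-empty list of (predicted_labels, actual_labels) pairs,
    each a ranked list with the top label first.
    """
    scores = [reciprocal_rank(pred, actual[0]) for pred, actual in trails]
    return sum(scores) / len(scores)


def top_match(predicted, actual, k):
    """True if the top prediction pl_1 is among the top k actual labels."""
    return bool(predicted) and predicted[0] in actual[:k]
```

With `k=1` and `k=3`, `top_match` corresponds to the first two measures on the slide.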

Results
• Context source comparison
  - Different sources of contextual information may be suited to different tasks
    - To predict immediate user interests, u_t, the interaction context, and the task context can be used
    - To predict long-term interests, the historic and social contexts can be used

Results
• Handling near misses
  - Near miss: two ODP labels that differ can still be treated as the same label, with a slight loss in precision
    - E.g., /Sports/golf/instruction/golf school and /Sports/golf/instruction
  - One-level back-off means converting all ODP labels to their top level (e.g., /Sports/)
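Label back-off can be illustrated with simple path truncation. This is a sketch under assumptions: the slide mentions both a pair of labels one level apart and conversion to the top level, so both operations are shown, and all function names and the matching rule are hypothetical. Labels are assumed to be non-empty paths such as /Sports/golf.

```python
def back_off(label, levels=1):
    """Drop the last `levels` components of an ODP label, keeping at least one."""
    parts = [p for p in label.split("/") if p]
    kept = parts[:max(1, len(parts) - levels)]
    return "/" + "/".join(kept)


def top_level(label):
    """Reduce an ODP label to its top-level category."""
    parts = [p for p in label.split("/") if p]
    return "/" + parts[0]


def near_match(a, b):
    """True if the labels are equal or one is the direct parent of the other."""
    return a == b or back_off(a) == b or back_off(b) == a
```

Under this reading, the slide's example pair counts as a match, while labels from different top-level categories never do.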

Results
• Combining contexts
  - 57 context combinations were tested, and the top 10 combinations are displayed
    - Combinations that differ significantly from the best-performing model in the context source comparison are marked
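The slides do not say how the 57 combinations were formed. One reading that reproduces the count, offered here only as an assumption, is every subset of two or more models drawn from six: u_t alone plus the five contexts. The source names and the function below are hypothetical, and the sketch does not show how the combined models were merged.

```python
from itertools import combinations

# Assumed pool of six models: the current page plus the five context sources.
SOURCES = ["u_t", "interaction", "task", "collection", "historic", "social"]


def context_combinations(sources=SOURCES):
    """Enumerate every combination of two or more sources."""
    return [combo
            for size in range(2, len(sources) + 1)
            for combo in combinations(sources, size)]
```

The count follows from 15 + 20 + 15 + 6 + 1 = 57 subsets of sizes two through six.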

Conclusion
• A variety of user interest models were built from the current page, the contextual variants, and overlaps between contexts
• The interest models were required to predict short-, medium-, and long-term interests
  - The predictive value of each contextual source varies with the time span of the prediction