Enhancing Web Search by Promoting Multiple Search Engine Use Ryen W. W., Matthew R. Mikhail B. (Microsoft Research) Allison P. H (Rice University) SIGIR.

Slides:



Advertisements
Similar presentations
Introduction to Information Retrieval
Advertisements

Temporal Query Log Profiling to Improve Web Search Ranking Alexander Kotov (UIUC) Pranam Kolari, Yi Chang (Yahoo!) Lei Duan (Microsoft)
Optimizing search engines using clickthrough data
A Machine Learning Approach for Improved BM25 Retrieval
Search Engines Information Retrieval in Practice All slides ©Addison Wesley, 2008.
An Unsupervised Framework for Extracting and Normalizing Product Attributes from Multiple Web Sites Center for E-Business Technology Seoul National University.
WSCD INTRODUCTION  Query suggestion has often been described as the process of making a user query resemble more closely the documents it is expected.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Optimizing Estimated Loss Reduction for Active Sampling in Rank Learning Presented by Pinar Donmez joint work with Jaime G. Carbonell Language Technologies.
A Quality Focused Crawler for Health Information Tim Tang.
Evaluating Search Engine
Automatic Discovery and Classification of search interface to the Hidden Web Dean Lee and Richard Sia Dec 2 nd 2003.
Ryen W. White, Microsoft Research Jeff Huang, University of Washington.
INFO 624 Week 3 Retrieval System Evaluation
J. Chen, O. R. Zaiane and R. Goebel An Unsupervised Approach to Cluster Web Search Results based on Word Sense Communities.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Evaluation of Image Retrieval Results Relevant: images which meet user’s information need Irrelevant: images which don’t meet user’s information need Query:
AdWords Instructor: Dawn Rauscher. Quality Score in Action 0a2PVhPQhttp:// 0a2PVhPQ.
Evaluation David Kauchak cs458 Fall 2012 adapted from:
Evaluation David Kauchak cs160 Fall 2009 adapted from:
1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Automatically Identifying Localizable Queries Center for E-Business Technology Seoul National University Seoul, Korea Nam, Kwang-hyun Intelligent Database.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
1 Applying Collaborative Filtering Techniques to Movie Search for Better Ranking and Browsing Seung-Taek Park and David M. Pennock (ACM SIGKDD 2007)
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
Ruirui Li, Ben Kao, Bin Bi, Reynold Cheng, Eric Lo Speaker: Ruirui Li 1 The University of Hong Kong.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Ryen W. White, Matthew Richardson, Mikhail Bilenko Microsoft Research Allison Heath Rice University.
Math Information Retrieval Zhao Jin. Zhao Jin. Math Information Retrieval Examples: –Looking for formulas –Collect teaching resources –Keeping updated.
User Behavior Analysis of Location Aware Search Engine Third international Conference of MDM, 2002 Takahiko Shintani, Iko Pramudiono NTT Information Sharing.
GoodRelations : An Ontology for Describing Web Offers Primer and User’s Guide Draft Presented by Jae-won Lee.
EigenRank: A Ranking-Oriented Approach to Collaborative Filtering IDS Lab. Seminar Spring 2009 강 민 석강 민 석 May 21 st, 2009 Nathan.
Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.
Diversifying Search Result WSDM 2009 Intelligent Database Systems Lab. School of Computer Science & Engineering Seoul National University Center for E-Business.
BEHAVIORAL TARGETING IN ON-LINE ADVERTISING: AN EMPIRICAL STUDY AUTHORS: JOANNA JAWORSKA MARCIN SYDOW IN DEFENSE: XILING SUN & ARINDAM PAUL.
Center for E-Business Technology Seoul National University Seoul, Korea Social Ranking: Uncovering Relevant Content Using Tag-based Recommender Systems.
Chapter 8 Evaluating Search Engine. Evaluation n Evaluation is key to building effective and efficient search engines  Measurement usually carried out.
Query Segmentation Using Conditional Random Fields Xiaohui and Huxia Shi York University KEYS’09 (SIGMOD Workshop) Presented by Jaehui Park,
Adish Singla, Microsoft Bing Ryen W. White, Microsoft Research Jeff Huang, University of Washington.
Retroactive Answering of Search Queries Beverly Yang Glen Jeh.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
CoOL: A Context Ontology Language to Enable Contextual Interoperability Thomas Strang, Claudia Linnhoff-Popien, and Korbinian Frank German Aerospace Centor.
Learning to Rank From Pairwise Approach to Listwise Approach.
COLLABORATIVE SEARCH TECHNIQUES Submitted By: Shikha Singla MIT-872-2K11 M.Tech(2 nd Sem) Information Technology.
A Novel Visualization Model for Web Search Results Nguyen T, and Zhang J IEEE Transactions on Visualization and Computer Graphics PAWS Meeting Presented.
Post-Ranking query suggestion by diversifying search Chao Wang.
More Than Relevance: High Utility Query Recommendation By Mining Users' Search Behaviors Xiaofei Zhu, Jiafeng Guo, Xueqi Cheng, Yanyan Lan Institute of.
Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.
DivQ: Diversification for Keyword Search over Structured Databases Elena Demidova, Peter Fankhauser, Xuan Zhou and Wolfgang Nejfl L3S Research Center,
Searching for the Best Engine Presented by Gong GI Hyun, IDS Lab., Seoul National University.
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
To Personalize or Not to Personalize: Modeling Queries with Variation in User Intent Presented by Jaime Teevan, Susan T. Dumais, Daniel J. Liebling Microsoft.
1 Context-Aware Ranking in Web Search (SIGIR 10’) Biao Xiang, Daxin Jiang, Jian Pei, Xiaohui Sun, Enhong Chen, Hang Li 2010/10/26.
Why You Should Optimize Your Website Content. Optimizing a website's content, in order to obtain a high search engine ranking is what Search Engine Optimization.
Federated text retrieval from uncooperative overlapped collections Milad Shokouhi, RMIT University, Melbourne, Australia Justin Zobel, RMIT University,
Sampath Jayarathna Cal Poly Pomona
Evaluation of IR Systems
Personalizing Search on Shared Devices
Intent-Aware Semantic Query Annotation
ISWC 2013 Entity Recommendations in Web Search
Feature Selection for Ranking
INF 141: Information Retrieval
Presentation transcript:

Enhancing Web Search by Promoting Multiple Search Engine Use Ryen W. W., Matthew R. Mikhail B. (Microsoft Research) Allison P. H (Rice University) SIGIR 2008 Presented by Jae-won Lee

Copyright  2008 by CEBT Introduction  Users are generally loyal to one search engine Even though it may not satisfy their needs  A given search engine performs well for some queries and poorly for others Excessive loyalty can hinder search effectiveness  Our Goal Support engine switching by recommending the most effective search engine for a given query IDS Lab. Seminar - 2Center for E-Business Technology

Copyright  2008 by CEBT Related Work  Meta search engines such as Clusty and Dogpile Merge search results  Switching search engine is more attractive Strong brand loyalty may discourage users from migrating to meta search engine Meta search engine eliminates the benefits of interface feature of each engine Hurts source engine brand awareness  We lets users keep their default engine Suggest an alternative engine if it performs better for the current query IDS Lab. Seminar - 3Center for E-Business Technology

Copyright  2008 by CEBT Does Switching Help Users?  Potential Benefits of Switching We quantify the benefits of multiple engine use – Normalized Discounted Cumulative Gain (NDCG) Measure a topical relevance of results to a given query – Click-through rate for search results Reasonable estimation of search result utility NDCG – A measure to evaluate the Web search engine performance Where N i : a constant for normalization r(i) : relevance score of the i th result, 0 (bad) ~ 5 (perfect) IDS Lab. Seminar - 4Center for E-Business Technology

Copyright  2008 by CEBT Does Switching Help Users?  Potential Benefits of Switching (cont’d) X, Y, and Z are anonymous notations of Google, Yahoo, and Live Search Number of queries for which engine performs best Engine choice for particular query is important IDS Lab. Seminar - 5Center for E-Business Technology Search engineRelevance (NDCG)Result click-through rate X952 (19.3%)2,777 (56.4%) Y1,136 (23.1%)1,226 (24.9%) Z789 (16.1%)892 (18.1%) No difference2,044 (41.5%)26 (0.6%)

Copyright  2008 by CEBT Switching as Classification - Query Processing IDS Lab. Seminar - 6Center for E-Business Technology Query Feature Extractor Classifier (offline Training) Recommend a Search Engine Features Search Engines Result sets

Copyright  2008 by CEBT Classifier Features  Classifier must recommend an engine in real-time Derive features from result pages, a query and query-result matching  Features Features from result pages Features from the query Features from the query-result page match IDS Lab. Seminar - 7Center for E-Business Technology

Copyright  2008 by CEBT Classifier Features IDS Lab. Seminar - 8Center for E-Business Technology

Copyright  2008 by CEBT Classifier  Notation q : query R : result page of original engine R’ : result page of target engine R* = {(d 1,s 1 ), …, (d k,s k )} : Human-judged result set - d k (result page), s k (score) Utility of each engine : U(R) = NDCG R* (R), U(R’) = NDCG R* (R’)  Training Each training instance D = {(x, y)} – x = f(q, R, R’) ; comprised of features derived from the query and result page – y = 1 iff NDCG R* (R’) >= NDCG R* (R) + margin  Switching engine if utility is higher by at least some margin IDS Lab. Seminar - 9Center for E-Business Technology

Copyright  2008 by CEBT Experiments  Evaluate accuracy of switching support to determine its viability  Data Set From Google, Yahoo, Live Search logs IDS Lab. Seminar - 10Center for E-Business Technology Total number of queries17,111 Total number of judged pages4,254,730 Total number of judged pages labeled Fair or higher1,378,011

Copyright  2008 by CEBT Precision – Recall Results  The proposed method can achieve high accuracy  Therefore, the method can be used for providing useful search engine suggestions to users IDS Lab. Seminar - 11Center for E-Business Technology

Copyright  2008 by CEBT Avoiding Querying the Alternative Engine  Evaluating the utility of target engine is undesirable to some users due to the network traffic  So, only use the features from the current engine’s result pages IDS Lab. Seminar - 12Center for E-Business Technology

Copyright  2008 by CEBT Contribution of Features  All sets of features contribute to accuracy  Features obtained from result pages seems to provide the most benefit IDS Lab. Seminar - 13Center for E-Business Technology

Copyright  2008 by CEBT Conclusion  Demonstrated potential benefit of switching  Described a method for automatically determining when to switch engines for a given query  Evaluated the method and illustrated good performance, especially at usable recall IDS Lab. Seminar - 14Center for E-Business Technology

Copyright  2008 by CEBT Pros. & Cons.  Pros. Propose a new research area of IR by switching support Good explanation for user behavior  Cons. Poor explanation for equations No analysis for the experiment results IDS Lab. Seminar - 15Center for E-Business Technology