Ryen W. White, Dan Morris Microsoft Research, Redmond, USA {ryenw,

Slides:



Advertisements
Similar presentations
Beliefs & Biases in Web Search
Advertisements

Predicting User Interests from Contextual Information
Ryen W. White Microsoft Research Steven M. Drucker Microsoft Live Labs
Struggling or Exploring? Disambiguating Long Search Sessions
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Patterns in Online Shopping Behavior (and possible links to call centers) Peter S. Fader Professor of Marketing The Wharton School, University of Pennsylvania.
This tutorial will take approximately 15 minutes. Click here to advance. Click here to go back.
Studies of the Onset & Persistence of Medical Concerns in Search Logs Ryen White and Eric Horvitz Microsoft Research, Redmond
Searchable Web sites Recommendation Date : 2012/2/20 Source : WSDM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh Jia-ling 1.
1 Learning User Interaction Models for Predicting Web Search Result Preferences Eugene Agichtein Eric Brill Susan Dumais Robert Ragno Microsoft Research.
C R E S S T / U C L A Paper presented at AERA New Orleans—April 2000 Using Technology to Assess Students’ Web Expertise Davina C. D. Klein CRESST/UCLA.
Amanda Spink : Analysis of Web Searching and Retrieval Larry Reeve INFO861 - Topics in Information Science Dr. McCain - Winter 2004.
Ryen W. White, Microsoft Research Jeff Huang, University of Washington.
LinkSelector: A Web Mining Approach to Hyperlink Selection for Web Portals Xiao Fang University of Arizona 10/18/2002.
Ryen White, Susan Dumais, Jaime Teevan Microsoft Research {ryenw, sdumais,
LinkSelector: Select Hyperlinks for Web Portals Prof. Olivia Sheng Xiao Fang School of Accounting and Information Systems University of Utah.
© Anselm Spoerri Lecture 13 Housekeeping –Term Projects Evaluations –Morse, E., Lewis, M., and Olsen, K. (2002) Testing Visual Information Retrieval Methodologies.
Topic-Sensitive PageRank Taher H. Haveliwala. PageRank Importance is propagated A global ranking vector is pre-computed.
JavaScript, Third Edition
Maryam Karimzadehgan (U. Illinois Urbana-Champaign)*, Ryen White (MSR), Matthew Richardson (MSR) Presented by Ryen White Microsoft Research * MSR Intern,
Consumers on the Web: Identification of usage patterns Consumers on the Web: Identification of usage patterns by Nina Koiso-Kanttila
Cohort Modeling for Enhanced Personalized Search Jinyun YanWei ChuRyen White Rutgers University Microsoft BingMicrosoft Research.
Modeling Long-Term Search Engine Usage Ryen White, Ashish Kapoor & Susan Dumais Microsoft Research.
Section 2: Finding and Refinding Jaime Teevan Microsoft Research 1.
Information Re-Retrieval Repeat Queries in Yahoo’s Logs Jaime Teevan (MSR), Eytan Adar (UW), Rosie Jones and Mike Potts (Yahoo) Presented by Hugo Zaragoza.
CAMBRIDGE COMMUNITY TELEVISION Web Analytics Review & Recommendations.
TwitterSearch : A Comparison of Microblog Search and Web Search
Ryen W. White, Mikhail Bilenko, Silviu Cucerzan Microsoft Research, Redmond, USA {ryenw, mbilenko,
From Devices to People: Attribution of Search Activity in Multi-User Settings Ryen White, Ahmed Hassan, Adish Singla, Eric Horvitz Microsoft Research,
23. Juli By Benjamin Riedel Collaborative Web.
Web 2.0: Concepts and Applications 4 Organizing Information.
System for Administration, Training, and Educational Resources for NASA SATERN Overview for Learners May 2006.
Springerlink.com Introduction to SpringerLink springerlink.com.
User Browsing Graph: Structure, Evolution and Application Yiqun Liu, Yijiang Jin, Min Zhang, Shaoping Ma, Liyun Ru State Key Lab of Intelligent Technology.
Understanding and Predicting Graded Search Satisfaction Tang Yuk Yu 1.
COMPREHENSIVE Windows Tutorial 4 Working with the Internet and .
Introduction to eChalk For Students. What is eChalk? eChalk’s unique online learning environment provides your school with its own electronic “town square”
Support.ebsco.com My EBSCOhost Tutorial Tutorial.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
Fan Guo 1, Chao Liu 2 and Yi-Min Wang 2 1 Carnegie Mellon University 2 Microsoft Research Feb 11, 2009.
Hao Wu Nov Outline Introduction Related Work Experiment Methods Results Conclusions & Next Steps.
Understanding and Predicting Personal Navigation Date : 2012/4/16 Source : WSDM 11 Speaker : Chiu, I- Chih Advisor : Dr. Koh Jia-ling 1.
Ryen White Microsoft Research research.microsoft.com/~ryenw/talks/ppt/WhiteIMT542E.ppt.
System for Administration, Training, and Educational Resources for NASA SATERN Overview for Users December 2009.
BUSINESS ANALYTICS AND DATA VISUALIZATION
Lecture 2 Jan 15, 2008 Social Search. What is Social Search? Social Information Access –a stream of research that explores methods for organizing users’
Personalized Course Navigation Based on Grey Relational Analysis Han-Ming Lee, Chi-Chun Huang, Tzu- Ting Kao (Dept. of Computer Science and Information.
Qi Guo Emory University Ryen White, Susan Dumais, Jue Wang, Blake Anderson Microsoft Presented by Tetsuya Sakai, Microsoft Research.
CS 347Notes101 CS 347 Parallel and Distributed Data Processing Distributed Information Retrieval Hector Garcia-Molina Zoltan Gyongyi.
Search Tools and Search Engines Searching for Information and common found internet file types.
Adish Singla, Microsoft Bing Ryen W. White, Microsoft Research Jeff Huang, University of Washington.
Retroactive Answering of Search Queries Beverly Yang Glen Jeh.
 Who Uses Web Search for What? And How?. Contribution  Combine behavioral observation and demographic features of users  Provide important insight.
Xinyu Xing, Wei Meng, Dan Doozan, Georgia Institute of Technology Alex C. Snoeren, UC San Diego Nick Feamster, and Wenke Lee, Georgia Institute of Technology.
Understanding and Predicting Personal Navigation.
Post-Ranking query suggestion by diversifying search Chao Wang.
User ID Feature GAP Mini-Summit
Measuring the value of search trails in web logs Presentation by Maksym Taran & Scott Breyfogle Research by Ryen White & Jeff Huang.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
To Personalize or Not to Personalize: Modeling Queries with Variation in User Intent Presented by Jaime Teevan, Susan T. Dumais, Daniel J. Liebling Microsoft.
Assessing the Scenic Route: Measuring the Value of Search Trails in Web Logs Ryen W. White1 Jeff Huang2 1Microsoft Research 1University of Washington.
Search User Behavior: Expanding The Web Search Frontier
The Knowledge Center.
Author: Kazunari Sugiyama, etc. (WWW2004)
Struggling and Success in Web Search
Ryen White, Ahmed Hassan, Adish Singla, Eric Horvitz
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
Efficient Multiple-Click Models in Web Search
Presentation transcript:

Ryen W. White, Dan Morris Microsoft Research, Redmond, USA {ryenw,

Motivation Some people are more expert at searching than others Search expertise, not domain expertise We study characteristics of these “advanced search engine users” in an effort to better understand how these users search If we can better understand what advanced searchers are doing maybe we can improve the search experience for everyone

Study We define advanced users as searchers who use advanced query operators in their query statements, Used plus (+), minus (-), quotes (“”), and “site:” e.g., “sigir 2007” e.g., microsoft +office site:microsoft.com Is there a relationship between the use of advanced syntax and: Queries and result clicks? Post-query browsing? Search success?

Data Interaction logs of 586K opt-in users English U.S. users 13-week period from January to April 2006 Complete browsing history Search engine queries (Multiple engines) All page visits Relevance judgments for 11K queries 6-level judgments

Characterizing Advanced Users Four advanced operators used: +, -, “”, and “site:” ~1% of submitted queries contained at least one operator 51K users (9%) of users used query operators at least once p advanced used to denote the percentage of a user’s queries that contain advanced operators Non-advanced users (p advanced = 0%) Advanced users (p advanced > 0%) Included users who issued > 50 queries ~38K (20%) advanced users ~151K (80%) non-advanced users

Do advanced users query and click on results differently than novices?

Query and Result-Click Features Give overview of subjects’ direct interactions with search engines FeatureMeaning Query Repeat Rate (QRR)Fraction of queries that are repeats Query Word Length (QWL)Avg. number of words in query Queries Per Day (QPD)Avg. number of queries per day Queries Per Second (QPS) Avg. number of queries per second between initial query and end-of-session Avg. Click Position (ACP)Avg. rank of clicked results Click Probability (CP)Ratio of result clicks to queries Avg. Seconds To Click (ASC)Avg. search to result click interval

Findings: Query/Result-click Advanced users: Repeat queries more often Compose longer queries Submit more queries/day Query less/second Click further down the result list Less likely to click a result Feature p advanced 0%> 0%≥ 25%≥ 50%≥ 75% QRR QWL QPD QPS ACP CP ASC % Users 79.90%20.10%.79%.18%.04% Non-advancedMore advanced  Advanced

Findings: Query/Result-click Factor analysis to study the relationships among the dependent variables Factor analysis revealed two factors that could account for ~84% of the variance: Factor A = Querying Query properties associated with position of clicks in result list Factor B = Result-click Querying frequency associated with the likelihood that user will click on a search result and click latency

Do advanced users browse differently than novices?

Search Session Session Query  Timeout Query trail Query  End trail event Another query Type URL Visit homepage Check Web-based or logon to online service Close browser Session timeout canon.com amazon.com S1 S3 S4 S3 S5 dpreview.com howstuffworks.com S2 S6 S5 S8 S6 S9 S1 S10S11 S10 S12 S13 S14 amazon pmai.org digitalcamera-hq.com digital cameras digital camera canon S7 S6 canon lenses S2

Post-Query Browsing Features Based on search sessions and search trails extracted from interaction logs 12.5 million search trails extracted Median number of trails per user was 30 Median number of steps in the trails was 3

Post-Query Browsing Features FeatureMeaning Session SecondsAverage session length (in seconds) Trail SecondsAverage query trail length (in seconds) Display SecondsAverage display time for each page on the trail (in seconds) Num. Steps Average number of steps from the page following the results page to the end of the trail Num. RevisitsAverage number of “back” operations Num. BranchesAverage number of branches

Findings – Post-query browsing Advanced users: Traverse trails faster Spend less time viewing each Web page Follow query trails with fewer steps Revisit pages less often “Branch” less often Featurep advanced 0%> 0%≥ 25%≥ 50%≥ 75% Session Secs Trail Secs Display Secs Num. Steps Num. Revisits Num. Branches % Trails 72.14%27.86%.83%.23%.05% % Users 79.90%20.10%.79%.18%.04% Non-advancedMore advanced  Advanced

Findings – Post-query browsing Greater the proportion of queries with advanced syntax the more focused their search interactions become Shorter query trails Less “branchy” query trails Session time increases but search time drops with increases in p advanced Perhaps more advanced users are multitasking between search and other activities

Are advanced users more successful than novices?

Search Success Human relevance judgment available for 11K queries Extract corresponding query trails from our logs Relevance judgments for 56% of pages on those trails We use these judgments to compute several metrics for search success MetricMeaning FirstJudgment assigned to the first page in the trail LastJudgment assigned to the last page in the trail AverageAverage judgment across all pages in the trail MaximumMaximum judgment across all pages in the trail

Findings – Search Success Feature p advanced 0%> 0%≥ 25%≥ 50%≥ 75% FirstM SD LastM SD MaxM SD Avg.M SD Average relevance judgment, Min = 1, Max = 6 More advanced users are more likely to have success Non-advancedMore advanced  Advanced

Summary Conducted log-based study of search behavior Classified users according to their use of advanced query syntax Demonstrated that use of advanced search syntax is correlated with other aspects of search behavior such as queries and result clicks, post-query navigation, and search success Next steps Use the interactions of advanced users for improved document ranking, page recommendation, training

Thank you! Ryen W. White, Dan Morris {ryenw, Questions/comments?