 Who Uses Web Search for What? And How?. Contribution  Combine behavioral observation and demographic features of users  Provide important insight.

Slides:



Advertisements
Similar presentations
Struggling or Exploring? Disambiguating Long Search Sessions
Advertisements

Temporal Query Log Profiling to Improve Web Search Ranking Alexander Kotov (UIUC) Pranam Kolari, Yi Chang (Yahoo!) Lei Duan (Microsoft)
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Back to Table of Contents
WSCD INTRODUCTION  Query suggestion has often been described as the process of making a user query resemble more closely the documents it is expected.
Studies of the Onset & Persistence of Medical Concerns in Search Logs Ryen White and Eric Horvitz Microsoft Research, Redmond
1 Web Search and Web Search Overlap: What the Deal? Amanda Spink Queensland University of Technology.
IPEDS C ollege O pportunities O n- L ine COOL.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Amanda Spink : Analysis of Web Searching and Retrieval Larry Reeve INFO861 - Topics in Information Science Dr. McCain - Winter 2004.
Web queries classification Nguyen Viet Bang WING group meeting June 9 th 2006.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Search engines. The number of Internet hosts exceeded in in in in in
1 Automatic Identification of User Goals in Web Search Uichin Lee, Zhenyu Liu, Junghoo Cho Computer Science Department, UCLA {uclee, vicliu,
Interactive Advertising & Promotional Communication Class 8/9 Targeting the Internet Consumer Kuen-Hee Ju-Pak CSUF.
Consumers on the Web: Identification of usage patterns Consumers on the Web: Identification of usage patterns by Nina Koiso-Kanttila
Query Log Analysis Naama Kraus Slides are based on the papers: Andrei Broder, A taxonomy of web search Ricardo Baeza-Yates, Graphs from Search Engine Queries.
From Devices to People: Attribution of Search Activity in Multi-User Settings Ryen White, Ahmed Hassan, Adish Singla, Eric Horvitz Microsoft Research,
Web Usage Mining with Semantic Analysis Date: 2013/12/18 Author: Laura Hollink, Peter Mika, Roi Blanco Source: WWW’13 Advisor: Jia-Ling Koh Speaker: Pei-Hao.
1 Computers, the Internet, and the 50+ Population Shereen Remez Robert Prisuta AARP Knowledge Management March, 2003.
Searching the Web Dr. Frank McCown Intro to Web Science Harding University This work is licensed under Creative Commons Attribution-NonCommercial 3.0Attribution-NonCommercial.
Accessing the Deep Web Bin He IBM Almaden Research Center in San Jose, CA Mitesh Patel Microsoft Corporation Zhen Zhang computer science at the University.
Google Confidential and Proprietary 1 Google University Google Analytics and Website Optimiser Dyana Najdi, Customer Analytics Manager, EMEA Lee Hunter,
 Search Engine Search Engine  Steps to Search for webpages pertaining to a specific information Steps to Search for webpages pertaining to a specific.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
SEARCH ENGINES Jaime Ma, Vancy Truong & Victoria Fry.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Hao Wu Nov Outline Introduction Related Work Experiment Methods Results Conclusions & Next Steps.
Understanding and Predicting Personal Navigation Date : 2012/4/16 Source : WSDM 11 Speaker : Chiu, I- Chih Advisor : Dr. Koh Jia-ling 1.
Internet Unlimited LLC. Trolley Square, Suite 19c, Wilmington, DE 19806, USA Local Search & SEO Evaluation For ‘HangersRestaurant.com’ Local Directory.
Personalizing Search on Shared Devices Ryen White and Ahmed Hassan Awadallah Microsoft Research, USA Contact:
Log files presented to : Sir Adnan presented by: SHAH RUKH.
Ryen W. White, Dan Morris Microsoft Research, Redmond, USA {ryenw,
CHAPTER 8 Segmenting and Analyzing the Target Market.
Analysis of Topic Dynamics in Web Search Xuehua Shen (University of Illinois) Susan Dumais (Microsoft Research) Eric Horvitz (Microsoft Research) WWW 2005.
Discovering Computers Fundamentals, Third Edition CGS 1000 Introduction to Computers and Technology Spring 2007.
WHAT AND HOW CHILDREN SEARCH ON THE WEB Sergio Duarte Torres, Ingmar Weber.
Meet the web: First impressions How big is the web and how do you measure it? How many people use the web? How many use search engines? What is the shape.
Endangered Species A Collaborative Teaching Unit.
Adish Singla, Microsoft Bing Ryen W. White, Microsoft Research Jeff Huang, University of Washington.
© 2009 Experian Limited. All rights reserved. Experian and the marks used herein are service marks or registered trademarks of Experian Limited. Other.
Understanding User Goals in Web Search University of Seoul Computer Science Database Lab. Min Mi-young.
Social Tag Prediction Paul Heymann, Daniel Ramage, and Hector Garcia- Molina Stanford University SIGIR 2008.
Over 2.7 Million Users Per Month Over 16.8 Million Page Views Per Month.
Web Directories: Group 5 Jack Baker Laura Bingham Morgan Stewart.
Web-Mining …searching for the knowledge on the Internet… Marko Grobelnik Institut Jožef Stefan.
Web Mining Issues Size Size –>350 million pages –Grows at about 1 million pages a day Diverse types of data Diverse types of data.
Post-Ranking query suggestion by diversifying search Chao Wang.
Glossary of Terms Sessions - (old name: Visits) Users - (old name: Unique Visitors) Pageviews Pages/Session Avg. Session Duration Bounce Rate %New Sessions.
Ms. Smith. Have you ever identified a target market? Well….if you have ever seen a commercial or made a purchase, you have already seen target markets.
Why Decision Engine Bing Demos Search Interaction model Data-driven Research Problems Q & A.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
Fabricio Benevenuto, Gabriel Magno, Tiago Rodrigues, and Virgilio Almeida Universidade Federal de Minas Gerais Belo Horizonte, Brazil ACSAC 2010 Fabricio.
Introduction Web analysis includes the study of users’ behavior on the web Traffic analysis – Usage analysis Behavior at particular website or across.
Usefulness of Quality Click- through Data for Training Craig Macdonald, ladh Ounis Department of Computing Science University of Glasgow, Scotland, UK.
A Large Scale Study of Wireless Search Behavior: Google Mobile Search By Maryam Kamvar, Shumeet Baluja Presented by Prashanth Kumar Muthoju, Aditya Varakantam.
Discovering Computers Fundamentals, 2011 Edition Living in a Digital World.
Search Engine Marketing Science Writers Conference 2009.
CS 115: COMPUTING FOR THE SOCIO-TECHNO WEB FINDING INFORMATION WITH SEARCH ENGINES.
Quiz Show Review The Marketing Concept.
Learning Profiles from User Interactions
Internet Marketing Web Business Models.
Web Mining Ref:
Personalizing Search on Shared Devices
Understanding the Features of a Web Site
Ryen White, Ahmed Hassan, Adish Singla, Eric Horvitz
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
Web Mining Research: A Survey
Using Link Information to Enhance Web Page Classification
Presentation transcript:

 Who Uses Web Search for What? And How?

Contribution  Combine behavioral observation and demographic features of users  Provide important insight on search behavior  Design decision  Search results  Information flows

Key Ideas  Who is searching  Analyzing user’s demographics  What are they searching for  Analyzing query topics  How they are searching  Analyzing session information

User Modeling  Query topics (what?): what are the topics that user issues queries on?  Y! Directory classification for topics  User demographics (who?): What is the demographics profile of the user?  Used a mix of user-provided information (age and gender) and information derived from user’s zip code  Session characteristics (how?) : Does a user have many/few, short/long and navigational/information sessions?  Session length  No. of queries per session

Data  web search query log of Yahoo! Search engine (2.3 million)  Log from U.S. Yahoo! site  Registered U.S. Yahoo users with an identifiable cookie  Only active users 1. Issue at least 100 queries over the sample period 2. Remove users with more than 100,000 queries issues 3. Remove users who clicked fewer than 1/100 of their queries 4. Remove User who clicked more than 100 time per queries

“Who?” Data  Using U.S Census with zip code Per-capita income Level of education Ethnicity  Using 2008 U.S. presidential election with zip code to obtain result

“Who?” Data

“What?” Data  Topic distribution of the queries issued by user  Use top 10 Yahoo! search results obtained for a given query  Use unique proprietary classification to classify into 71 topics  Classified queries issued at least 30 times  10 million distinct queries

“Who?” Data

“How?” Data  Session length and no. of clicks per session  Count no. of queries issued within session interval of 30 mins  Classified queries into Navigational- seek a single website Informational - queries that cover a board topic Transactional- queries that reflect the intent of user to perform a particular action (Click Entropy) H(D|q) = ∑ d,q –p(d|q) log 2 p(d|q)  Compute click entropy for queries issued >= 20 times  H(D|q) ≤ 1 – focused query  H(D|q) ≥ 3 – diverse query

“How?” Data  370k distinct queries for both focused and diverse queries  Focused queries – navigational  Diverse queries - informational

Method  Unsupervised K-mean clustering  K ranging from 8 to 20  Clustered users using topics distributions  Using the result to induced the “Who” and “How”

Informational Users  Used search engine as research engine to find information on a wide range of topics  “Who” Well educated with above-average income  “What” Do research on a wide range of topics with little interest in adult content  “How” More likely to issue non-navigational queries Less likely to have a single-click session More likely to make use of the suggested query alternatives

Navigational Users  Used search engine as a replacement for web page bookmarking to navigate to URLs that he already knows exist  “Who” Background averages of topical cluster under consideration  “What” Dominated by topic of popular website e.g FB, gmail  “How” More likely to issue navigational queries More likely to click only on a single result within a session Less likely to make use of unnecessary suggested query alternatives

Transactional Users  Used search engine to take him to some URL where he can perform desired transaction Little benefit in learning more about a subject URL generally not known in advance  “Who” Depend heavily on the kind of transaction  “What” Predominant topics are shopping, adult content and gaming sites  “How” Diverse clicks

Close-Up  Baby Boomers Avg age of 50 yrs old Simple navigational queries related to online banking and interested in finance  Liberal Females Most likely to voted for Democrat in 2008 elections Biggest single query is shopping with longer session  White Conservatives Voted for Republican in 2008 elections Search for automotive related topics, business pages & home and garden information

Conclusion  Overall finding are stereotypical  Future work Fine grained analysis in term of categories and search strategy Closer look at long-tail queries