A Taxonomy of Web Searches Andrei Broder, SIGIR Forum, 2002 Ahmet Yenicag Ceyhun Karbeyaz.

Slides:



Advertisements
Similar presentations
A taxonomy of web search Andrei Broder IBM Research.
Advertisements

Natural Language Processing WEB SEARCH ENGINES August, 2002.
Urchin & Website Usability. Usability Study Usability study is a repetitive process that involves testing a site and then using the test results to change.
Dave Krause ANRCS Web Action Team.  Data is collected from a web site based on what the user does during the visit.
The PageRank Citation Ranking “Bringing Order to the Web”
By Andrei Broder, IBM Research 1 A Taxonomy of Web Search Presented By o Onur Özbek o Mirun Akyüz.
Automatic Discovery and Classification of search interface to the Hidden Web Dean Lee and Richard Sia Dec 2 nd 2003.
Web Search – Summer Term 2006 III. Web Search - Introduction (Cont.) - Jeff Dean, Google's Systems Lab:
Web Algorithmics Web Search Engines. Retrieve docs that are “relevant” for the user query Doc : file word or pdf, web page, , blog, e-book,... Query.
Web queries classification Nguyen Viet Bang WING group meeting June 9 th 2006.
Using Local Information for Personalized Search Haward Jie CS 290C.
1 Automatic Identification of User Goals in Web Search Uichin Lee, Zhenyu Liu, Junghoo Cho Computer Science Department, UCLA {uclee, vicliu,
Behavior of Web Searchers Katie Greenstreet Computer Science 49S Duke Spring 2007.
Web Search – Summer Term 2006 VII. Selected Topics - PageRank (closer look) (c) Wolfgang Hürst, Albert-Ludwigs-University.
Query Log Analysis Naama Kraus Slides are based on the papers: Andrei Broder, A taxonomy of web search Ricardo Baeza-Yates, Graphs from Search Engine Queries.
Searching the Web Dr. Frank McCown Intro to Web Science Harding University This work is licensed under a Creative Commons Attribution-NonCommercial- ShareAlike.
Algorithms for Information Retrieval Prologue. References Managing gigabytes A. Moffat, T. Bell e I. Witten, Kaufmann Publisher A bunch of scientific.
WEB ANALYTICS Prof Sunil Wattal. Business questions How are people finding your website? What pages are the customers most interested in? Is your website.
Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.
What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.
Aardvark Anatomy of a Large-Scale Social Search Engine.
What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.
AnswerBus Question Answering System Zhiping Zheng School of Information, University of Michigan HLT 2002.
Accessing the Deep Web Bin He IBM Almaden Research Center in San Jose, CA Mitesh Patel Microsoft Corporation Zhen Zhang computer science at the University.
Dr. Susan Gauch When is a rock not a rock? Conceptual Approaches to Personalized Search and Recommendations Nov. 8, 2011 TResNet.
Tag Data and Personalized Information Retrieval 1.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
Search Engine Interfaces search engine modus operandi.
 Search Engine Search Engine  Steps to Search for webpages pertaining to a specific information Steps to Search for webpages pertaining to a specific.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
Objective Understand concepts used to web-based digital media. Course Weight : 5%
Understanding and Predicting Personal Navigation Date : 2012/4/16 Source : WSDM 11 Speaker : Chiu, I- Chih Advisor : Dr. Koh Jia-ling 1.
MIS 424 Professor Sandvig. Overview  Why Analytics?  Two major approaches:  Server logs  Google Analytics.
Michel Oldroyd – SEO Strategist Search Engine Optimisation Demystified.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
Google Analytics Top 10 Metrics By: Errett Cord Website:
Measuring the Size of the Web Dongwon Lee, Ph.D. IST 501, Fall 2014 Penn State.
Personalized Search Xiao Liu
Query trends CS 349 Presentation December 2 nd, 2008 Catherine Grevet.
Networked Information Resources Online Retrieval.
Detecting Dominant Locations from Search Queries Lee Wang, Chuang Wang, Xing Xie, Josh Forman, Yansheng Lu, Wei-Ying Ma, Ying Li SIGIR 2005.
Lecture 6 Title: Web Planning, Designing, Developing for E-Marketing By: Mr Hashem Alaidaros MKT 445.
CSM06 Information Retrieval Lecture 1a – Introduction Dr Andrew Salway
Search Engine Architecture
A Statistical Comparison of Tag and Query Logs Mark J. Carman, Robert Gwadera, Fabio Crestani, and Mark Baillie SIGIR 2009 June 4, 2010 Hyunwoo Kim.
Personalization with user’s local data Personalizing Search via Automated Analysis of Interests and Activities 1 Sungjick Lee Department of Electrical.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Meet the web: First impressions How big is the web and how do you measure it? How many people use the web? How many use search engines? What is the shape.
Retroactive Answering of Search Queries Beverly Yang Glen Jeh.
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 3.1 Chapter 3 : The Problem of Web Navigation.
Understanding User Goals in Web Search University of Seoul Computer Science Database Lab. Min Mi-young.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
Diversifying Search Results Rakesh AgrawalSreenivas GollapudiSearch LabsMicrosoft Research Alan HalversonSamuel.
 Who Uses Web Search for What? And How?. Contribution  Combine behavioral observation and demographic features of users  Provide important insight.
Web Search – Summer Term 2006 VII. Web Search - Indexing: Structure Index (c) Wolfgang Hürst, Albert-Ludwigs-University.
Recommendation systems Paolo Ferragina Dipartimento di Informatica Università di Pisa Slides only!
TO Each His Own: Personalized Content Selection Based on Text Comprehensibility Date: 2013/01/24 Author: Chenhao Tan, Evgeniy Gabrilovich, Bo Pang Source:
Starter (June 2011) Explain two methods by which someone could find a website on the internet. [4]
CPS 49S Google: The Computer Science Within and its Impact on Society Shivnath Babu Spring 2008.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
SEO services company in India
How Magento eCommerce Enterprises Works and Manage Zaptech Solutions.
Heat-seeking Honeypots: Design and Experience John P. John, Fang Yu, Yinglian Xie, Arvind Krishnamurthy and Martin Abadi WWW 2011 Presented by Elias P.
Information Architecture
Marking the Most of the Web’s Resources
Search Engine Architecture
Identify Different Chinese People with Identical Names on the Web
Search Engine Architecture
Query Type Classification for Web Document Retrieval
Development of Search engine optimization for Crowdfunding site
Presentation transcript:

A Taxonomy of Web Searches Andrei Broder, SIGIR Forum, 2002 Ahmet Yenicag Ceyhun Karbeyaz

Abstract Classic IR is inherently predicated on users search for their information need, that leads them to use an IR system. But the need behind a web search is not only informational. Navigational Transactional In this paper: Analyze the taxonomy of web searches Present some statistics about this taxonomy Show evolution of search engines in light of this taxonomy. 12/9/2015 2/8 CS 533, Information Retrieval, Karbeyaz & Yenicag

Taxonomy of Web Searches Classify web queries according to their intent: Navigational: Give me the URL of the site I want to reach. Bilkent => Informational: Find information assumed to be available on the web and no further interaction is predicted except reading. Closest to classical IR What is IR? => IR is … Transactional: Show me sites where I can perform a certain transaction. Shopping, downloading media files (mp3, video) 12/9/2015 CS 533, Information Retrieval, Karbeyaz & Yenicag 3/8

Statistics Prevalence of queries are determined by two methods : - User survey - AltaVista query log analysis 12/9/2015 CS 533, Information Retrieval, Karbeyaz & Yenicag 4/8

Statistics (continued) User survey: Self selection Most of the queries are non-navigational Hard to distinguish between transactional and informational queries Queries that are neither transactional, nor navigational are assumed to be informational Estimation: Transactional queries are around 36% 12/9/2015 CS 533, Information Retrieval, Karbeyaz & Yenicag 5/8

Statistics (continued) Log analysis: AltaVista log is analyzed for 1000 random queries. Only English queries are taken into consideration: 400 queries Figure 4. Query Classification 12/9/2015 CS 533, Information Retrieval, Karbeyaz & Yenicag 6/8

Evolution of Search Engines Three stages: First generation: uses mostly on-page data and very close to classic IR. (informational) AltaVista Second generation: uses off-page, web specific data such as link analysis. (informational and navigational) Google Third generation: Recently emerging, attempts to blend data from multiple sources to answer “the need behind the query”. (informational, transactional and navigational) Ankara => direct links to a hotel reservation page for Ankara 12/9/2015 CS 533, Information Retrieval, Karbeyaz & Yenicag 7/8

Conclusion The need behind a query might be: informational, navigational or transactional. Search engines need to deal with all of them. Understanding taxonomy is important for successful web search development. Current engines lack of supporting transactional queries directly. Third generation engines will solve this problem mostly via semantic analyses (understanding what the query is about). 12/9/2015 CS 533, Information Retrieval, Karbeyaz & Yenicag 8/8