Web Search and Web Search Overlap: What the Deal? Amanda Spink, Queensland University of Technology

Presentation transcript:

1 Web Search and Web Search Overlap: What the Deal? Amanda Spink Queensland University of Technology

2 My Talk Today: Web search challenges. How do people search the Web? What do we know about Web search overlap? Web search trends and challenges.

3 Electronic Information Challenge: Electronic information management is a challenging area for organisations and people. How to collect, organise, maintain and provide access to electronic information? Poor information systems cost billions of dollars in lost productivity. We are still learning about people’s information behaviours.

4 Web Search Challenge: Many people search the Web daily. Web search engine usage is projected to soon outstrip email as the primary Internet application. Web search is now a major social issue: people are expected to search effectively.

5 Web Search Engines: Web search engines are generally commercial products with brand identities, e.g., Google. Desktop wars: MSN versus Google. Studies show that people are satisfied with their searching, but know little about Web search engines.

6 Web Search Engine Overlap: For any Web query, what is the overlap on the first page of results for Yahoo, Google, MSN and Ask Jeeves? 3%: very low overlap for the same query across different Web search engines. Implication: different Web search engines index different Websites.
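
The slide does not say how the 3% figure was computed. As a minimal sketch of one plausible measure, assuming overlap means the share of distinct first-page URLs returned by every engine, the following counts how many engines return each URL. The engine names and URLs are made up for illustration.

```python
from collections import Counter

def first_page_overlap(results_by_engine):
    """Map k -> fraction of distinct URLs that appear on exactly k engines' first pages."""
    engines_per_url = Counter()
    for urls in results_by_engine.values():
        for url in set(urls):  # ignore duplicates within one engine's page
            engines_per_url[url] += 1
    total = len(engines_per_url)
    if total == 0:
        return {}
    histogram = Counter(engines_per_url.values())
    return {k: histogram[k] / total for k in range(1, len(results_by_engine) + 1)}

# Hypothetical first-page results for a single query.
sample = {
    "engine_a": ["u1", "u2", "u3"],
    "engine_b": ["u1", "u4", "u5"],
    "engine_c": ["u1", "u2", "u6"],
}
overlap_shares = first_page_overlap(sample)
all_engines_share = overlap_shares[len(sample)]  # share found by every engine
```

In this toy input one of six distinct URLs is returned by all three engines. Averaging such a share over many queries would give a single overlap percentage, though the study behind the slide may define it differently.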

7 Web Search Engine Overlap: No Web search engine covers the complete Web. Web search engines have different crawling policies: how frequently and how deeply they crawl a Website, and which Websites they crawl. Single versus meta-search engines: meta-search engines allow you to search multiple Web search engines simultaneously, e.g., Dogpile, Metacrawler, Hotbot, Mamma, etc.
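
To illustrate the meta-search idea, here is a minimal sketch that merges ranked result lists from several engines. Round-robin interleaving with de-duplication is an assumption chosen for simplicity; it is not a description of how Dogpile or any other named meta-search engine actually ranks results.

```python
from itertools import zip_longest

def merge_round_robin(ranked_lists):
    """Interleave ranked lists rank by rank, keeping the first copy of each URL."""
    merged, seen = [], set()
    # zip_longest pads shorter lists with None so every rank tier is visited.
    for tier in zip_longest(*ranked_lists):
        for url in tier:
            if url is not None and url not in seen:
                seen.add(url)
                merged.append(url)
    return merged

# Hypothetical ranked results from three engines for the same query.
merged = merge_round_robin([["a", "b", "c"], ["b", "d"], ["e", "a", "f"]])
```

Because first-page overlap is low, most merged entries come from only one engine, which is the practical argument for querying several engines at once.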

8 How People Search the Web: Track Web search trends. Identify characteristics of Web searching: search length, query length, and use of advanced features. Examine the distribution of query topics, terms, queries, sources, and languages used on Web search engines. Implications for society, business, Web services, interfaces and systems design.

9 Web Query Datasets: Web query transaction logs collected in 1997, 1999, 2001, 2003, 2004, 2005 & Combined dataset of 25 million+ Web searches

10 Web Search Engines: Excite.com, Ask.com, AlltheWeb.com, AltaVista.com, Vivisimo.com, Dogpile.com, AOL. No Google, Yahoo or MSN data.

11 Queries Per Web Search Sessions are SHORT in queries, but getting longer

12 Terms Per Web Query SHORT QUERIES: But increasing towards 3 terms per query
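
The two measures on slides 11 and 12 can be sketched from a transaction log as follows. The log layout (a session identifier plus the query text) and the sample rows are assumptions for illustration. Real logs typically carry timestamps and other fields, and terms are split on whitespace here.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical log rows: (session_id, query_text).
log = [
    ("s1", "cheap flights"),
    ("s1", "cheap flights brisbane"),
    ("s2", "weather"),
    ("s3", "queensland university of technology"),
]

def session_and_query_lengths(rows):
    """Return (mean queries per session, mean terms per query)."""
    queries_per_session = defaultdict(int)
    terms_per_query = []
    for session_id, query in rows:
        queries_per_session[session_id] += 1
        terms_per_query.append(len(query.split()))
    return mean(list(queries_per_session.values())), mean(terms_per_query)

mean_queries, mean_terms = session_and_query_lengths(log)
```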

13 Web Search Duration (Minutes): < 1 minute: 56.6%; 1 to < 5 minutes: 15.5%; 5 to < 10 minutes: 6.8%; 10 minutes or more: 20%.
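
A distribution like the one on this slide could be produced by bucketing session durations. The sketch below uses the slide's four bins. The input durations are made up, and how the original study measured a session's duration is not stated in the slides.

```python
from collections import Counter

def duration_bucket(minutes):
    """Assign a session duration (in minutes) to one of the slide's four bins."""
    if minutes < 1:
        return "< 1 minute"
    if minutes < 5:
        return "1 to < 5 minutes"
    if minutes < 10:
        return "5 to < 10 minutes"
    return "10 minutes or more"

def bucket_shares(durations_minutes):
    """Return the percentage of sessions falling in each bin."""
    counts = Counter(duration_bucket(m) for m in durations_minutes)
    total = sum(counts.values())
    return {bucket: 100 * count / total for bucket, count in counts.items()}

# Hypothetical session durations in minutes.
duration_shares = bucket_shares([0.5, 0.2, 3.0, 7.5, 12.0])
```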

14 Use of Advanced Web Search Features: Many advanced search features are used incorrectly. Poor spelling.

15 Results Pages Viewed Most users view VERY FEW pages beyond the first or first two pages. 14% of users view Web pages for less than 30 seconds

16 Search Term Distribution

17 Top 10 Web Search Terms

18 Top 10 Co-Occurring Terms
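
The charts for slides 16 to 18 are not in the transcript. As a rough sketch of how top terms and co-occurring term pairs can be counted from a query log, assuming whitespace tokenisation, lower-casing, and counting each term at most once per query (all illustrative choices, with made-up queries):

```python
from collections import Counter
from itertools import combinations

def top_terms_and_pairs(queries, n=10):
    """Return the n most frequent terms and the n most frequent term pairs."""
    term_counts, pair_counts = Counter(), Counter()
    for query in queries:
        # Sorting makes each pair order-independent: ("free", "music") only.
        terms = sorted(set(query.lower().split()))
        term_counts.update(terms)
        pair_counts.update(combinations(terms, 2))
    return term_counts.most_common(n), pair_counts.most_common(n)

# Hypothetical queries.
queries = ["free music", "free music download", "music download", "free games"]
top_terms, top_pairs = top_terms_and_pairs(queries)
```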

19 Web Query Topics

20 Web Query Topics
1. Commerce, Travel, Employment or Economy: 21%
2. Indiscernible or Non-English: 19%
3. People, Places or Things: 15%
4. Computers or Internet: 13%
5. Social, Culture, Ethnic or Religion: 9%
6. Health or Sciences: 6%
7. Education or Humanities: 5%
8. Sex or Pornography: 4%
9. Performing or Fine Arts: 3%
10. Government: 3%
11. Entertainment or Recreation: 2%

21 Web Query Topics
1. Commerce, Travel, Employment or Economy: 30.4%
2. Indiscernible or Non-English: 13.2%
3. People, Places or Things: 16%
4. Computers or Internet: 13%
5. Social, Culture, Ethnic or Religion: 9%
6. Health or Sciences: 6%
7. Education or Humanities: 5%
8. Sex or Pornography: 4%
9. Performing or Fine Arts: 3%
10. Government: 3%
11. Entertainment or Recreation: 2%

22 Typical Web Searches: Short Web searches, but getting longer. Shift from entertainment queries to e-commerce/people queries. Sex / pornography queries > 4%. Increasing number of non-English queries. More query reformulation. Fewer results and Web pages viewed.

23 Implications: Finding electronic information is often difficult and time-consuming, which affects productivity. Proliferation of Web search tools. Most people are not trained to search. Incremental change: no silver bullet!

24 New Web Technologies: Visualization interfaces. Clustering techniques. Larger query boxes. More systems assistance for users. Multitasking support. Longitudinal searching.

25 Systems Design Challenges: Support information behaviors, not just the behaviors of technologists. Design to support/help humans bridge their training gap. Help humans express and resolve complex information problems. Measure how systems impact human information behaviors.

26 User Challenges: No technology silver bullet. Need to improve information behaviors and search expertise. People learn and think more about their information behaviors. People learning information tools and training.

27 Conclusions: Web search is changing incrementally. Some user characteristics are relatively stable and some are changing incrementally. Better usability testing of Web search engine systems and interfaces? Continue research to identify trends in Web search with Dogpile and AOL data.

28 THANK YOU