Characterising Browsing Strategies in the World Wide Web Lara D. Catledge & James E. Pitkow Presented by: Mat Mannion, Dean Love, Nick Forrington & Andrew Ingram

Presentation transcript:

Characterising Browsing Strategies in the World Wide Web Lara D. Catledge & James E. Pitkow Presented by: Mat Mannion, Dean Love, Nick Forrington & Andrew Ingram

Overview Summary of Paper Further research Relevance to today’s browsing strategies Conclusions

Part 1: Paper Summary

Summary of the paper Context of the paper Age Experimental population

Browsing Strategies
1. Search Browsing: directed search; the goal is known
2. General Purpose Browsing: following links that have a high likelihood of leading to interesting content
3. Serendipitous: completely random
[Diagram labels: More Focused Browsing Intention; Drunk]

Browsing vs. Searching
There is a tension between designing for browsers and designing for searchers.
A hierarchical searchable database may work well for searchers, but is less useful for those just browsing for unexpected results.
The first step in solving the problem was to determine what strategies are in use.
This information was obtained experimentally.

Data Collection
A special version of XMosaic was coded to log user interface events at Georgia Institute of Technology's College of Computing
Study conducted over a 3-week period in August 1994
Final log file comprised over 43,000 events, each uniquely identifiable

Results
80% of requests used http
Following hyperlinks accounted for 52% of all requests
“Back” command closely followed with 41%
Most users didn’t make use of hotlist (bookmarks) and history functions
Only 10% of events were activated via the keyboard
Popular sites didn’t match the sites put on the hotlist

Results Continued
Pattern Detection Module (PDM) algorithm used to identify repeating sequences of site and document accesses
–For example, a user following the same sequence of three pages a total of 7 times would be identified as a path of length 3 with a frequency of 7
An analysis of the lengths of paths within each site visited was performed for each user.
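The path/frequency idea on this slide can be illustrated with a small sketch. This is not the PDM algorithm itself, whose details the slides do not give; it simply counts every contiguous page sequence in one user's log, and the function name, parameters and page labels are all hypothetical.

```python
from collections import Counter

def repeated_paths(visits, min_length=2, min_frequency=2):
    """Count contiguous page sequences that recur in one user's visit log.

    Returns {path_tuple: frequency} for paths of at least min_length pages
    that occur at least min_frequency times (overlapping occurrences count).
    This is an illustrative stand-in, not the PDM algorithm from the paper.
    """
    counts = Counter()
    for length in range(min_length, len(visits) + 1):
        for start in range(len(visits) - length + 1):
            counts[tuple(visits[start:start + length])] += 1
    return {path: n for path, n in counts.items() if n >= min_frequency}

# Hypothetical log: the user walks A -> B -> C seven times in a row.
log = ["A", "B", "C"] * 7
paths = repeated_paths(log)
print(len(("A", "B", "C")), paths[("A", "B", "C")])  # path length, frequency
```

On this made-up log the sequence A, B, C is reported as a path of length 3 with a frequency of 7, matching the slide's example. The brute-force enumeration is quadratic in the log length, which is acceptable only for a short illustration.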

Frequency and Path Length Analysis
Previous graph gives a linear fit of frequency against path length, with a slope of -0.24
Using the 3 characterisations, the following classifications can be made:
–Serendipitous Browser (slope < -0.24): avoids repetition of long sequences
–General Purpose Browser (slope = -0.24): users perform as expected, with a 1 in 4 chance of repeating complex navigation
–Searcher (slope > -0.24): performs long sequences often
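A rough sketch of how a single user might be placed in one of these three classes: fit a straight line to that user's (path length, frequency) points and compare its slope with -0.24. The least-squares fit, the tolerance band around -0.24 and the sample data are assumptions made for illustration; the slides do not say how the comparison was carried out.

```python
def fit_slope(points):
    """Ordinary least-squares slope of frequency (y) against path length (x)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx

def classify(slope, reference=-0.24, tolerance=0.02):
    """Map a slope to a browsing strategy.

    reference is the slope quoted on the slide; tolerance is a hypothetical
    band, since a fitted slope is never exactly equal to -0.24.
    """
    if slope < reference - tolerance:
        return "serendipitous browser"
    if slope > reference + tolerance:
        return "searcher"
    return "general purpose browser"

# Made-up (path length, frequency) points for two users.
steep_user = [(2, 5), (3, 4), (4, 3), (5, 2)]     # frequency falls quickly
flat_user = [(2, 3.0), (4, 2.8), (6, 2.6)]        # long paths still repeated

print(classify(fit_slope(steep_user)))
print(classify(fit_slope(flat_user)))
```

With these invented points the first user's slope is -1 (serendipitous browser) and the second's is about -0.1 (searcher).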

Within-Site Navigation
Backtracking (via the “Back” command) is heavily used and can be visualised as hub-based browsing
Users rarely reach a depth of more than a few pages before returning to the entry point
Paper suggests sites should be organised to exploit this user behaviour
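The hub-based pattern can be made concrete by replaying a session and tracking how deep the user is below the entry page. The event encoding below (a page name for a followed link, the string "back" for the Back command) and the sample session are hypothetical; the study's actual log format is not shown in the slides.

```python
def navigation_depths(events):
    """Replay link/back events and return the depth after each event.

    Depth 0 is the entry page. Any event other than "back" is treated as
    following a link one page deeper; "back" returns to the previous page.
    """
    stack = ["entry"]
    depths = []
    for event in events:
        if event == "back":
            if len(stack) > 1:      # Back on the entry page is a no-op here
                stack.pop()
        else:
            stack.append(event)
        depths.append(len(stack) - 1)
    return depths

# Made-up hub-and-spoke session: short excursions, then back to the hub.
session = ["page1", "back", "page2", "page2a", "back", "back", "page3", "back"]
depths = navigation_depths(session)
print(depths)                       # depth after each event
print(max(depths), depths.count(0)) # deepest point, returns to entry page
```

In this invented session the user never goes more than two pages deep and returns to the entry page three times, which is the shape of behaviour the slide describes.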

Conclusions of the paper
That there are three different types of browsing strategy
These strategies can be matched to empirical results
That these strategies were valid for pre-web hypertext browsing, and remain so
Website designers should design with these user groups in mind

Part 2: Subsequent Research

Linda Tauscher and Saul Greenberg
Two separate papers
–How people revisit web pages: empirical findings and implications for the design of history systems, 1997
  Partially based on Catledge and Pitkow’s data
–Revisitation Patterns in World Wide Web Navigation, 1997
  Made conclusions on patterns of information seeking

Web page revisits
58% of web pages in a single session are revisits
–Changing information on pages
–Further exploration of a page
–Special purpose pages (search engine, home page)
Information seeking is influenced by browser ergonomics
–It is very easy to go “Back” in a web browser
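A revisit percentage like the one on this slide can be computed from a session log as the share of page visits that go to a page already seen. The sketch below assumes that definition and uses a made-up session; it is not taken from Tauscher and Greenberg's analysis code.

```python
def revisit_rate(visits):
    """Fraction of visits that go to a page already seen in this session."""
    seen = set()
    revisits = 0
    for url in visits:
        if url in seen:
            revisits += 1
        else:
            seen.add(url)
    return revisits / len(visits) if visits else 0.0

# Hypothetical session: the home page acts as a hub that is revisited.
session_pages = ["home", "a", "home", "b", "home", "a"]
rate = revisit_rate(session_pages)
print(f"{rate:.0%} of visits were revisits")
```

Here three of the six visits (the second and third visits to "home" and the second visit to "a") are revisits, giving 50%.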

Seven browsing patterns
1. First-time visits to a cluster of pages
2. Revisits to pages
3. Page authoring (refreshing changes)
4. Web-based applications
5. Hub-and-spoke (navigating to each new page around a central page)
6. Guided tour
7. Depth-first search (link paths followed without necessarily returning to the source pages)

Huberman, Pirolli, Pitkow and Lukose
Strong Regularities in World Wide Web Surfing, 1998
–Regularities of surfing patterns
–Mathematical law of surfing
–Massively larger sample than previously studied
In a single day:
–23,692 AOL users
–3,247,054 pages
–1,090,168 unique pages
Also studied statistics of Xerox external site
–Can predict number of visitors to a single page from its link topology
–Implications for e-commerce websites

Choo, Detlor and Turnbull
Information Seeking on the Web: An Integrated Model of Browsing and Searching, 2000
–Many different strategies for information seeking
–Sophisticated search techniques
–Implications for brand building
  Brands are a substitute for information
  Information searching strategies make information more freely available

Episodes of information seeking on the web

Part 3: Relevance today

Are the initial results valid today? Mosaic provided basis for modern browsers –Missing a lot of features compared to newer browsers –but browsing experience is similar

XMosaic

Mozilla Firefox

New Features that could affect browsing strategy
Search engines
–Not widely used at the time of study
–Didn’t crawl
Forms
–Not supported by all browsers at the time of the study
Dynamic Pages / Personalisation
–Pages can be generated for a particular user
Scripting
–Webpages becoming more like applications
Really Simple Syndication (RSS)
–Used to keep track of sites with changing content (news, blogs etc.)

Evaluation of findings
Heavy use of back button; can now use…
–Mouse gestures
–Additional mouse & keyboard buttons
Paper envisioned page designers creating different “views” for different types of users (browsers & searchers)
Instead, we have search engines for searchers and directories for browsing

Evaluation of findings
Paper suggested sites could “alter page design on the fly based on accesses by users”
–Seen today in lists of most popular products in online stores
–Personalisation of websites (e.g. recommendations) was not predicted in the paper
–A “guided tour based on paths most travelled” isn’t commonplace

Evaluation of findings
Paper suggested image maps
–Not widely used
“Document designers need to be cognizant of the classification of expected visitors”
–This is generally not the case with most websites
–Designers generally more aware of different classifications in terms of browser type, screen size etc.
Problem of searching vs. browsing
–With more advanced search engines, we don’t really need to worry about this anymore

Part 4: Conclusions

Conclusions
The web has changed significantly in ten years
The three browsing strategies still exist
An entire industry has formed to cater for search browsing, e.g. Google
Designers no longer have to cater for this: your information can always be found

Conclusions continued
While the three strategies exist, users switch between them so often that offering different views for different users is not commonplace
Many sites do, however, offer user registration to target content more effectively

The End… … go home. Oops, there’s another presentation first. Sorry. Go home in half an hour. Or go to the next lecture, we don’t care. That’s really the end. Applaud now.