1 Web Analytics: A Brief Tutorial by Dr. Robert J. Boncella Professor of Information Systems & Technology School of Business Washburn University Presented.

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

The Internet and the Web
Data Preparation for Web Usage Analysis
© Minder Chen, Web Architecture - 1 The Architecture of Internet and WWW Web Browser Client Web Server End User HTTP TCP/IP HTML documents Internet.
Chapter 12: Web Usage Mining - An introduction
1 Internet Privacy - At Home and At Work: A Tutorial Presented by Dr. Robert J. Boncella Professor of CIS CIS Department and School of Business Washburn.
Master’s course Bioinformatics Data Analysis and Tools Lecture 6: Internet Basics Centre for Integrative Bioinformatics.
Jacob Boston Josh Pfeifer. Definition of HyperText Transfer Protocol How HTTP works How Websites work GoDaddy.com OSI Model Networking.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
1 The HyperText Transfer Protocol: HTTP Nick Smith Stuart Alley Tara Tjaden.
SESSION 9 THE INTERNET AND THE NEW INFORMATION NEW INFORMATIONTECHNOLOGYINFRASTRUCTURE.
Browsing the World Wide Web. Spring 2002Computer Networks Applications Browsing Service Allows one to conveniently obtain and display information that.
1 The World Wide Web. 2  Web Fundamentals  Pages are defined by the Hypertext Markup Language (HTML) and contain text, graphics, audio, video and software.
Web Programming Language Dr. Ken Cosh Week 1 (Introduction)
A global, public network of computer networks. The largest computer network in the world. Computer Network A collection of computing devices connected.
Christopher M. Pascucci Basic Structural Concepts of.NET Browser – Server Interaction.
WEB ANALYTICS Prof Sunil Wattal. Business questions How are people finding your website? What pages are the customers most interested in? Is your website.
Prof. Vishnuprasad Nagadevara Indian Institute of Management Bangalore
Sys Prog & Scripting - HW Univ1 Systems Programming & Scripting Lecture 15: PHP Introduction.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
1.Understand the decision-making process of consumer purchasing online. 2.Describe how companies are building one-to-one relationships with customers.
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
CS 401 Paper Presentation Praveen Inuganti
1 Web Database Processing. Web Database Applications Static Report Publishing a report is prepared from a database application and exported to HTML DB.
CSCI 323 – Web Development Chapter 1 - Setting the Scene We’re going to move through the first few chapters pretty quick since they are a review for most.
INTRODUCTION TO WEB DATABASE PROGRAMMING
Chapter 16 The World Wide Web Chapter Goals ( ) Compare and contrast the Internet and the World Wide Web Describe general Web processing.
Chapter 16 The World Wide Web Chapter Goals Compare and contrast the Internet and the World Wide Web Describe general Web processing Describe several.
Lecturer: Ghadah Aldehim
Internet Basics Dr. Norm Friesen June 22, Questions What is the Internet? What is the Web? How are they different? How do they work? How do they.
Postacademic Interuniversity Course in Information Technology – Module C1p1 Contents Data Communications Applications –File & print serving –Mail –Domain.
XHTML Introductory1 Linking and Publishing Basic Web Pages Chapter 3.
Advanced Web Forms with Databases Programming Right from the Start with Visual Basic.NET 1/e 13.
The Internet  Internet Hardware connected together Creates a massive worldwide network  Hardware Computers Communication lines  Interlinked collection.
Chapter 1: The Internet and the WWW CIS 275—Web Application Development for Business I.
Web Analytics Unit 4-1(2005 Fall) Managing the Digital Enterprise By Professor Michael Rappa.
Internet Protocol B Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
Log files presented to : Sir Adnan presented by: SHAH RUKH.
Chapter 12: Web Usage Mining - An introduction Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher, M.
Srivastava J., Cooley R., Deshpande M, Tan P.N.
INTRODUCTION TO WEB APPLICATION Chapter 1. In this chapter, you will learn about:  The evolution of the Internet  The beginning of the World Wide Web,
ECEN “Internet Protocols and Modeling”, Spring 2012 Course Materials: Papers, Reference Texts: Bertsekas/Gallager, Stuber, Stallings, etc Class.
1 UNIT 13 The World Wide Web Lecturer: Kholood Baselm.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
The Problem of State. We will look at… Sometimes web development is just plain weird! Internet / World Wide Web Aspects of their operation The role of.
Web-Mining …searching for the knowledge on the Internet… Marko Grobelnik Institut Jožef Stefan.
CONTENTS  Definition And History  Basic services of INTERNET  The World Wide Web (W.W.W.)  WWW browsers  INTERNET search engines  Uses of INTERNET.
Organisations and Data Management 1 Data Collection: Why organisations & individuals acquire data & supply data via websites 2Techniques used by organisations.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Web Server.
27.1 Chapter 27 WWW and HTTP Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Internet Applications (Cont’d) Basic Internet Applications – World Wide Web (WWW) Browser Architecture Static Documents Dynamic Documents Active Documents.
ASP-2-1 SERVER AND CLIENT SIDE SCRITPING Colorado Technical University IT420 Tim Peterson.
Web Design Terminology Unit 2 STEM. 1. Accessibility – a web page or site that address the users limitations or disabilities 2. Active server page (ASP)
COMPUTER NETWORKS Hwajung Lee. Image Source:
1 UNIT 13 The World Wide Web. Introduction 2 Agenda The World Wide Web Search Engines Video Streaming 3.
1 UNIT 13 The World Wide Web. Introduction 2 The World Wide Web: ▫ Commonly referred to as WWW or the Web. ▫ Is a service on the Internet. It consists.
6/28/ A global mesh of interconnected networks (internetworks) meets these human communication needs. Some of these interconnected networks are.
Web Analytics Fundamentals Presented by Tejaswi, Chandrika, Sunil.
Some from Chapter 11.9 – “Web” 4 th edition and SY306 Web and Databases for Cyber Operations Cookies and.
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
E-Business Infrastructure PRESENTED BY IKA NOVITA DEWI, MCS.
Distributed Control and Measurement via the Internet
CISC103 Web Development Basics: Web site:
Web Development Web Servers.
Warm Handshake with Websites, Servers and Web Servers:
Some Common Terms The Internet is a network of computers spanning the globe. It is also called the World Wide Web. World Wide Web It is a collection of.
Chapter 27 WWW and HTTP.
Web Page Concept and Design :
Unit# 5: Internet and Worldwide Web
Presentation transcript:

1 Web Analytics: A Brief Tutorial by Dr. Robert J. Boncella Professor of Information Systems & Technology School of Business Washburn University Presented March 2008 To SAIS 2008

2 Introduction Web analytics is the study of the behavior of website visitors. In a commercial context, web analytics refers to the use of data collected from a web site to determine which aspects of the website achieve the business objectives Tutorial Outline –Web Analytics: Context –Web Analytics: Technology & Terminology –Web Analytics: Tools and Case Studies

3 Context for Web Analytics DSS – Decision Support System –A conceptual framework for a process of supporting managerial decision- making, usually by modeling problems and employing quantitative models for solution analysis BI - Business Intelligence subset of DSS –An umbrella term that combines architectures, tools, databases, applications, and methodologies BA - Business Analytics subset of BI –The application of models directly to business data –Assists in making strategic decisions WA - Web Analytics subset of BA –The application of business analytics activities to Web-based processes, including e-commerce

4 Web Analytics - Details Relevant Technology –Internet & TCP/IP –Client / Server Computing –HTTP (HyperText Transfer Protocol) –Server Log Files & Cookies –Web Bugs Data Collection –The Clickstream Server Log Files Page Tagging Data Analysis –Data Preparation –Pattern Discovery –Pattern Analysis

5 Client Server This is a response This is a request Client/Server Computing

Internet & TCP/IP The Internet –The infrastructure that provides for the delivery of data between computer based processes TCP/IP –The protocols that provides for reliable delivery of data on The Internet 6

7 HTTP Protocol Client sends a request to a server Server sends a response to client Connectionless –Client: Opens connection to server Sends request –Server Responds to request Closes connection Stateless –Client/Server have no memory of prior connections –Server cannot distinguish one client request from another client

8 Cookies Used to solve the “Statelessness” of the HTTP Protocol Used to store and retrieve user-specific information on the web When an HTTP server responds to a request it may send additional information that is stored by the client - “state information” When client makes a request to this server the client will return the “cookie” that contains its state information State information may be a client ID that can be used as an index to a client data record on the server

9 Client Browser My_Brwsr Server B Server C W BS Server A Cookie: My_Brwsr Pg A - Server A Pg B - Server B Pg C - Server C 1. Render page 2. Click on URL Page B cnts - URLs & Img Src - WebBug WBS. TRKSTRM.COM Page A cnts - URLs & Img Src - WebBug WBS. TRKSTRM.COM Page C cnts - URLs & Img Src - WebBug WBS. TRKSTRM.COM Req : Page _ B.html Req: Page_A.html Res: Page_A.html Req: WebBug IMG -Referer Header - Any cookie for TRKSTRM.com Res: WebBug Img -Cookie to client Browser on 1st Req. Res: Page_B.html Res : Page _ C.html Req: Page_C.html Web Bug Process

10 Common Clickstream Data Sources Server Log Files –Passive data collection –Normal part of web browser/ web server transaction Page Tagging –Active data collection –Often requires a third party to implement – a vendor

11 Server Log Files The name & IP address of the client computer The time of the request The URL that was requested The time it took to send the resource If HTTP authentication used; the username of the user of the client will be recorded Any errors that occurred The referer link The kind of web browser that was used Each time a client requests a resource the server of that resource may record the following in its log files:

12 Server Log Files Example – frank [10/Oct/2000:13:55: ] "GET /apache_pb.gif HTTP/1.0" – Remote host frank - user name [10/Oct/2000:13:55: ] - date & time "GET /apache_pb.gif HTTP/1.0" - request status bytes

13 Server Log Files Technical issues for server log data –Data Preparation –Pageview Identification –User Identification –Session Identification

14 Page Tags as Data Source Provided by Third Party - Vendor –Vendor Supplies Page Tags –Vendor Collects the Data –Vendor Analyzes the Data –Business Accesses the Data Online or Reports sent to Business

15 Web Data Abstractions Abstractions concerning Web usage, Content, and Structure Establishes precise semantics for the concepts –Web site –Users or Visitors –User Sessions –Server Sessions or Visits –Pageviews –Clickstreams

16 Data Abstractions Web Site - collection of interlinked Web pages, including a host page, residing at the same network location. User or Visitors - principal using a client to interactively retrieve and render resources or resource manifestations –an individual that is accessing files from a Web server, using a browser. User Session - a delimited set of user clicks across one or more Web servers

17 Data Abstractions Server Session or Visit - a collection of user clicks to a single Web server during a user session Pageview - the visual rendering of a Web page in a specific environment at a specific point in time –a pageview consists of several items frames, text, graphics, and scripts that construct a single Web page Clickstream - a sequential series of pageview requests made from a single user

18 Web Data Abstractions (High Level) Abstractions concerning Visitors Establishes precise semantics for the concepts –Unique Visitor –Conversion Rate –Abandonment Rate –Attrition –Loyalty –Frequency –Recency

19 Data Abstractions Unique Visitor –A unique visitor is counted when a human being uses a web browser to visit a web site. –A visitor may be “unique” for different periods of time. –The individual is defined by a cookie in the visitor’s web browser

20 Data Abstractions Conversion Rate –A conversion rate is the number of “completers” divided by the number of “starters” for any online activity that is more than one logical step in length –Starting and finishing any activity Purchase Download a research article Etc.

21 Data Abstractions Abandonment Rate –The abandonment rate for any step in a multi-step process is one minus the number of units that make it to “step n+1” divided by those at “step n” –The formula is (1 – ((n+1)/n) –Consider a 10 step process to acquire a resource How any quit after step 1 or 2 or 3 or 4 or … –Consider a 5 step process to acquire a resource How any quit after step 1 or 2 or 3 or 4 or …

22 Data Abstractions Attrition –Attrition is a measurement of people you have been able to successfully convert but are unable to retain to convert again –Consider e-bay web site vs. web site for technical information

23 Data Abstractions Loyalty –Loyalty is a measure of the number of visits any visitor is likely to make over their lifetime as a visitor –Reported as number of visits per visitor 100 visitors made 3 visits each, 87 visitors made 4, etc. Avoid double counting (i.e. do not count the 87 in with the 100)

24 Data Abstractions Frequency –Frequency is a measure of the activity a visitor generates on a web site in terms of time between visits –Measured in terms of “days between visits”

25 Data Abstractions Recency –Recency is the number of days since the last visit (or purchase) –Reported as the number of visitors who returned after “n” days.

26 Pyramid Model of Web Analytics Data Hits Page Views Visits Unique Visitors Uniquely Identified Visitors Volume of Available Data Increasing Value of Data

27 Web Usage Mining Web usage mining is to apply statistical and data mining techniques to the processed server log data, in order to discover useful patterns Data mining methods and algorithms that have been adapted for the Web domain –Association rules –Sequential pattern discovery –Clustering –Classification

28 Web Usage Data Mining After discovering patterns from usage data, a further analysis has to be conducted. Common ways of analyzing such patterns –Using a query mechanism on a database where the results are stored –Loading the results into a data cube and then performing OLAP operations –Visualization techniques are used for an easier interpretation of the results Using these results in association with content and structure information concerning the Web site there can be extracted useful knowledge for modifying the site according to the correlation between user and content groups.

29 Web Analytics: Tools and Case Studies Tools –VisiStat - Web Analytics Case Studies –Communications Provider - TuVox.com –Online Retailer - TicketsByInternet.comTicketsByInternet.com –Winery & Entertainment Venue - The Mountain WineryThe Mountain Winery –Non-Profit Organization - SFBallet.orgSFBallet.org –Public Relations & Media Agency - BLASTmediaBLASTmedia –Technology Provider for Real Estate Professionals - Pullan.comPullan.com –Real Estate Agency - Intero Real EstateIntero Real Estate –Start-Up Online Business - GuruPrint.comGuruPrint.com