Prof. Vishnuprasad Nagadevara Indian Institute of Management Bangalore

Slides:



Advertisements
Similar presentations
RP Designs Semi-Custom e-Commerce Package. Overview RP Designs semi- custom e-commerce package is a complete website solution. Visitors can browse a catalog.
Advertisements

WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?
Advanced Web Metrics with Google Analytics By: Carley Brown.
Digital Marketing Analytics v10. Introduction  Name / job role  What company are you with  How much experience do you have using Webtrends  Create.
Back to Table of Contents
What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.
XP Browser and Basics1. XP Browser and Basics2 Learn about Web browser software and Web pages The Web is a collection of files that reside.
Tutorial 6 Working with Web Forms
ASP.NET 2.0 Chapter 6 Securing the ASP.NET Application.
Tutorial 6 Working with Web Forms. XP Objectives Explore how Web forms interact with Web servers Create form elements Create field sets and legends Create.
PubMed Search Options (Basic Course: Module 6). Table of Contents  History  Advanced Search  Accessing full text articles from HINARI/PubMed  Failure.
Browser and Basics Tutorial 1. Learn about Web browser software and Web pages The Web is a collection of files that reside on computers, called.
The Premier Software Usage Analysis and Reporting Toolset Maximizing Value for Software Users.
Application Layer. Applications A program or group of programs designed for end users. A program or group of programs designed for end users. Software.
Chapter 10 Publishing and Maintaining Your Web Site.
Christopher M. Pascucci Basic Structural Concepts of.NET Browser – Server Interaction.
Chapter 9 Collecting Data with Forms. A form on a web page consists of form objects such as text boxes or radio buttons into which users type information.
WEB ANALYTICS Prof Sunil Wattal. Business questions How are people finding your website? What pages are the customers most interested in? Is your website.
HTML Comprehensive Concepts and Techniques Intro Project Introduction to HTML.
HTTP: cookies and advertising Concepts to cover:  web page content (including ads) from multiple site: composition at client  cookies  third-party cookies:
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
UNIT 14 Lecturer: Ghadah Aldehim 1 Websites. Introduction 2.
Fall 2006 Davison/LinCSE 197/BIS 197: Search Engine Strategies 6-1 Module II Overview PLANNING: Things to Know BEFORE You Start… Why SEM? Goal Analysis.
Web Design, 4 th Edition 7 Promoting and Maintaining a Web Site.
Getting started on informaworld™ How do I register my institution with informaworld™? How is my institution’s online access activated? What do I do if.
Chapter 16 The World Wide Web. 2 The Web An infrastructure of information combined and the network software used to access it Web page A document that.
About Dynamic Sites (Front End / Back End Implementations) by Janssen & Associates Affordable Website Solutions for Individuals and Small Businesses.
Programming with Microsoft Visual Basic 2012 Chapter 12: Web Applications.
System for Administration, Training, and Educational Resources for NASA SATERN Overview for Learners May 2006.
Strategies for improving Web site performance Google Webmaster Tools + Google Analytics Marshall Breeding Director for Innovative Technologies and Research.
Web Site Performance An analytical approach for benchmarking and tuning.
XHTML Introductory1 Linking and Publishing Basic Web Pages Chapter 3.
XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.
UNIT 14 1 Websites. Introduction 2 A website is a set of related webpages stored on a web server. Webmaster: is a person who sets up and maintains a.
C# Tutorial -1 ASP.NET Web Application with Visual Studio 2005.
Module 3: Administrator Set-Up Intuit Financial Services University Internet Banking Certification Training.
Portal User Group Meeting June 13, Agenda I. Welcome II. Updates on the following: –Migration Status –New Templates –DB Breakup –Keywords –Streaming.
Sustainability: Web Site Statistics Marieke Napier UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by: URL
Near East University Department of Computer Engineering E-COMMERCE FOR LAPTOPS SELLING COMPANY Abdul Halim Abu Kuwaik
Chapter 9 Publishing and Maintaining Your Site. 2 Principles of Web Design Chapter 9 Objectives Understand the features of Internet Service Providers.
System for Administration, Training, and Educational Resources for NASA SATERN Overview for Users December 2009.
Log files presented to : Sir Adnan presented by: SHAH RUKH.
Tutorial 6 Working with Web Forms. XP Objectives Explore how Web forms interact with Web servers Create form elements Create field sets and legends Create.
Tutorial 6 Working with Web Forms. 2New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition Objectives Explore how Web forms interact with.
EVALUATE YOUR SITE’S PERFORMANCE. Web site statistics Affiliate Sales Figures.
ECEN “Internet Protocols and Modeling”, Spring 2012 Course Materials: Papers, Reference Texts: Bertsekas/Gallager, Stuber, Stallings, etc Class.
Module 7: Marketing Tools Intuit Financial Services University Internet Banking Certification Training.
EOSDIS FY2009 Annual Metrics Report Prepared By: Hyo Duck Chang Adnet, Inc. Brian Krupp Adnet, Inc. Lalit Wanchoo Adnet, Inc. March 2010.
Registration Solutions for your Event Management.
Tutorial 6 Working with Web Forms. 2New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition Objectives Explore how Web forms interact with.
Glossary of Terms Sessions - (old name: Visits) Users - (old name: Unique Visitors) Pageviews Pages/Session Avg. Session Duration Bounce Rate %New Sessions.
JavaScript and Ajax (Internet Background) Week 1 Web site:
Microsoft Office 2008 for Mac – Illustrated Unit D: Getting Started with Safari.
Introduction Web analysis includes the study of users’ behavior on the web Traffic analysis – Usage analysis Behavior at particular website or across.
Web Design Terminology Unit 2 STEM. 1. Accessibility – a web page or site that address the users limitations or disabilities 2. Active server page (ASP)
COMPUTER NETWORKS Hwajung Lee. Image Source:
Website Design and Construction Services and Standards.
Web Analytics Fundamentals Presented by Tejaswi, Chandrika, Sunil.
Introduction to Digital Analytics Keith MacDonald Guest Presentation.
E-Business Infrastructure PRESENTED BY IKA NOVITA DEWI, MCS.
Chapter Objectives Explain how to test a website before it is published Describe how to publish a website to a web server Identify ways to promote a published.
3.02H Publishing a Website 3.02 Develop webpages..
PIWIK JUNIOR TIDAL ASSOCIATE PROF., WEB SERVICES & MULTIMEDIA LIBRARIAN NEW YORK CITY COLLEGE OF TECHNOLOGY, CUNY.
Strategies for improving Web site performance
Latest Updates on BlackHawk Mines Music : Privacy Policy
PubMed Database Interface (Basic Course Module 4 Part A)
Chapter 27 WWW and HTTP.
4.02 Develop web pages using various layouts and technologies.
CMP Creating Your Personal and Small Business Web Sites
Presentation transcript:

Prof. Vishnuprasad Nagadevara Indian Institute of Management Bangalore

Definition Web Analytics as defined by Web Analytics Association : “ Web Analytics is the measurement, collection, analysis and reporting of Internet data for the purposes of understanding and optimizing Web usage.” Clickstream as defined by Internet Advertising Bureau (IAB) : “The electronic path a user takes while navigating from site to site, and from page to page within a site. It is a comprehensive body of data describing the sequence of activity between a user’s browser and any other Internet resource, such as a Web site or third party ad server”

Information from Web Analytics How many visitors visit the page daily? Who are the regular visitors? What percentage of the visitors to the page are registered users? What are the top pages that are visited on the web page? What is the average visit time on the website? How often does the visitor return to the site? What is the average page depth of a visitor? What is the geographic distribution of users of the website?

Measures Clicks: The interaction between the user and the web server is measured by the click of a mouse. Visits: The number of times a user visits a specific web site. Every new session is counted as a new visit. Hits: Total number of server requests serviced by the server Exits: Site exits, counted by site inactivity for more than 30 minutes Unique Visitors: A Unique User who accesses the site in a specified period of time. Repeated Visitor: The average number of times a user returns to a site over a specific time period. Page views: The view of any page by the user. A page may contain text, images, and other online elements and may be statically or dynamically generated and could contain single or multiple frames or screens. Sessions: IAB defines it to be an “A sequence of Internet activity made by one user at one site. If a user makes no request from a site during a 30 minute period of time, the next content or ad request would then constitute the beginning of a new visit “ Unique authenticated visitors: A unique visitor who logs on to a site via a registration method using his/her user id and password.

Metrics Page views per visit: Average number of page views per visit. Page views per session: Average number of page views per session. Page views per hour/day: Average number of page views per hour/day. Clicks per session: Average number or clicks per session. Clicks per hour: Average number of clicks per hour. Time between clicks: The average duration of time spent between two clicks. Hits per hour: Average number of hits to the web server per hour. Busy hour of the day: The highest number of hits to the web server in a particular hour of a day.

IMPLEMENTING WEB ANALYTICS Define your business objectives Define the KPIs that are important for your business based on objectives and goals of business. Identify the data that needs to be collected. Identify the process to collect the data Prepare the data, analyze and interpret the data Design and implement the plan of action Monitor the data for continuous feedback

Objectives of the Study The objectives of this study are to Explore Web analytics and its usefulness to web based business. Identify the techniques used in click stream analysis. Identify the application of click stream analysis through analyzing click stream data obtained from a particular website using appropriate click stream analysis techniques.

Methodology This study analyzes the click stream data obtained from a web site, which specializes in an online information exchange service to facilitate identification of suitable partners, in India and other countries. The site has a very different revenue model. The visitors are allowed to browse through the site without any initial payment. The visitors are allowed to look at the profiles of prospective partners free of charge. The visitors will have to become members by making a one-time payment only when they need to contact the prospective brides or grooms. Users can search for profiles through advanced search options on the site on various preferences ranging from basic details of preferred partner to lifestyle, career, education, profession etc.

Methodology Members can make initial contact with each other through services available via Chat, SMS, and . Users can avail free registration on the website and are assured of exclusive privacy and confidentiality. The website allows the users to create their profiles, search for other profiles, and express interest in other profiles and contact others. Registration and creating a profile is free of cost. Registered users can become paid members that will allow them to contact others, view contact details of other members, write personalized messages, initiate chats and let other members view their contact details. Paid memberships are provided for a specified duration.

Methodology The click stream data is analyzed to identify different paths taken by the visitors and the sequence of pages that lead to payment of membership fee. Based on this analysis, specific strategies are recommended to maximize the revenue for the website.

DATA PREPARATION Problem : Format of data Clickstream data files are neither delimited nor fixed length files Solution: Used the date in the clickstream as the delimiter to import data to database Have to perform string handling in database to separate out the fields , [23/May/2008:00:00: ] "GET /billing/billing.php?user=&cid= da14a61c43512fa025b59578i HTTP/1.0" [23/May/2008:00:00: ] "GET /profile/js/common.js HTTP/1.1" [23/May/2008:00:00: ] "GET /P/css/comm_style.css HTTP/1.1" [23/May/2008:00:00: ] "GET /P/search.php?checksum=&searchchecksum= &j=300&newsearch=&inf_checksum=&castemapping=&crmback=&searchorder =T&label_select_no=&savesearch=&from_index=&viewall=&save_search_redirect=&hide_search_bar=y HTTP/1.1" [23/May/2008:00:00: ] "GET /P/css/homestyle.css HTTP/1.1" [23/May/2008:00:00: ] "GET /profile/mainmenu.php?checksum= f9d75933dfa9ac9005d|i|537f26ca181f05c ab261i HTTP/1.1" , [23/May/2008:00:00: ] "GET /P/css/homestyle.css HTTP/1.0" , , [23/May/2008:00:00: ] "GET /profile/mainmenu.php?checksum= HTTP/1.1"

Data Data is obtained from the site in the form of click stream records. Each record consists of the details of clicks by the visitors and each record contains the following details: Server IP Client IP Time stamp with Date Status: HTTP Status code URL requested: has three subfields namely The request method, resource requested and the protocol used No. of bytes transferred The country of origin for a specific request is identified using the IP address.

Data URL is used to identify the information/web page browsed by the visitors. Time stamp of each click is used to sequence the movement of the visitors across different pages in the website. Identifying a unique user session is an important step in the analysis of click stream data. Inactivity for more than 30 minutes is considered as a break of session. This is an approximation since there could be multiple users accessing from the same IP, or the same user accessing from different IPs. Due to lack of more data available we consider hits from each unique IP as belonging to a unique user for a unique session.

No of Sessions Day Number of sessions Number of clicks Day 123,440460,211 Day 222,717453,977 Day 324,694461,518

Countries By Hour

Exit Points

Different Pages Accessed

Web Diagram – Freq ≥ 19,000

Web Diagram – Freq ≥ 1,000

Associations Consequent Antecedent 1 Antecedent 2Antecedent 3Antecedent 4 Support % Confidence % Payment = T Photorequest =T memcomp=T Payment = T Country = India Photorequest= T memcomp=T8073 Payment = TLogin=T Photorequest= T memcomp=T6073 Payment = T ViewProfile= T Photorequest= T memcomp=T Payment = T ViewProfile= T Login=TPhotorequest=Tmemcomp=T Payment = T Country = India ViewProfile=TPhotorequest=Tmemcomp=T Payment = T Mmshowmsg = T Photorequest= T memcomp=T Payment = TViewProfile= T Mmshowmsg = T Photorequest=Tmemcomp=T5066.4

Summary and Conclusions Usage of the website by time of the day. This will help busy hour identification, and provide information of the server capacity required for the website, and when maintenance window can be scheduled. Usage of website from different geographic location. This can provide the data of the distribution of users across geographical locations Exit screens provide information on where the users exit from the website. This input can help redesign the webpage if it provides information on which pages are breaking the flow of the user session.

Summary and Conclusions Most accessed and least accessed pages This can be used for variable pricing of advertisings on the web page. This can also be used for better user interface design and space utilization, by removing or repositioning the links that are infrequently accessed. Associations Provide information on unique actions on the website and the sequence in which the user has performed these actions. This can be used in better user interface design. Web diagrams Gives information on co-occurrence of actions on the webpage and their significance – also provides inputs on user interface design.

Questions? Suggestions? Comments?