Web Usage Mining - W hat, W hy, ho W Presented by:Roopa Datla Jinguang Liu.

Slides:



Advertisements
Similar presentations
E-Business and e-Commerce. e-commerce and e-business e-commerce refers to aspects of online business involving exchanges among customers, business partners.
Advertisements

Web Mining.
Web Usage Mining Web Usage Mining (Clickstream Analysis) Mark Levene (Follow the links to learn more!)
WEB USAGE MINING FRAMEWORK FOR MINING EVOLVING USER PROFILES IN DYNAMIC WEBSITE DONE BY: AYESHA NUSRATH 07L51A0517 FIRDOUSE AFREEN 07L51A0522.
Digital Experience Analytics v10. Agenda Digital Experience REAN Model.
Getting Results with WebTrends How to Improve Your Business with Web Analytics.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
Customer information: Server log file and clickstream analysis; data mining MARK 430 Week 3.
Search Engines & Search Engine Optimization (SEO) Presentation by Saeed El-Darahali 7 th World Congress on the Management of e-Business.
Chapter 12: Web Usage Mining - An introduction
Introduction to Web Analytics Web analytics is the measurement, collection, analysis and reporting of internet data for purposes of understanding and optimizing.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Web Usage Mining: Processes and Applications
The Web is perhaps the single largest data source in the world. Due to the heterogeneity and lack of structure, mining and integration are challenging.
Web Metrics October 26, 2006 Steven Schwartz President, PowerWebResults.com Southeastern Massachusetts E-Commerce Network University of Massachusetts –
Automatic Data Collection: Server Logs As with all methods, have to ask: What are the goals for your system? –What constitutes success, or good quality.
Web Mining Research: A Survey
Insight on Google Analytics Features - Suresh. K.
Search Engine Optimization (SEO)
Measuring and Monitoring Social Media Presence Measuring and Monitoring Social Media Presence Rim Dakelbab.
The Internet & Web Browsers Business Webpage Design Kelly Seale.
WEB ANALYTICS Prof Sunil Wattal. Business questions How are people finding your website? What pages are the customers most interested in? Is your website.
Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web logs Data Engineering Lab 성 유 진.
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
CS 401 Paper Presentation Praveen Inuganti
Online Search Marketing OMI Certification Course – Discovery Documentation.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
Chapter 7 DATA, TEXT, AND WEB MINING Pages , 311, Sections 7.3, 7.5, 7.6.
Copyright © 2009 Pearson Education, Inc. Slide 6-1 Chapter 6 E-commerce Marketing Concepts.
Strategies for improving Web site performance Google Webmaster Tools + Google Analytics Marshall Breeding Director for Innovative Technologies and Research.
CSE Data Mining, 2002Lecture 11.1 Data Mining - CSE5230 Web Mining CSE5230/DMS/2002/11.
Search Engines & Search Engine Optimization (SEO).
February 21, 2009 Helen M. Overland | PodCamp Toronto 2009 H ELEN M. O VERLAND PodCamp Toronto, 2009 Increasing Audience Share Using Google Analytics.
Google Confidential and Proprietary 1 Google University Google Analytics and Website Optimiser Dyana Najdi, Customer Analytics Manager, EMEA Lee Hunter,
Analytics. Is your site working?  Lots of ways to measure this.  User feedback  Functional tests (output is what you expect)  But, can we measure.
MIS 424 Professor Sandvig. Overview  Why Analytics?  Two major approaches:  Server logs  Google Analytics.
1 Search Engine Optimization An introduction to optimizing your web site for best possible search engine results.
Data Mining By Dave Maung.
Log files presented to : Sir Adnan presented by: SHAH RUKH.
Track Your Keyword Rankings and Competitors Rankings - Live.
Chapter 12: Web Usage Mining - An introduction Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher, M.
Copyright © 2013 Customer Paradigm, All Rights Reserved.Web Marketing How Companies Track You.
1 Tools for Website Effectiveness. What is your site producing? Sales PR Expanding client base Brand awareness Feedback.
Srivastava J., Cooley R., Deshpande M, Tan P.N.
EVALUATE YOUR SITE’S PERFORMANCE. Web site statistics Affiliate Sales Figures.
Web-Mining …searching for the knowledge on the Internet… Marko Grobelnik Institut Jožef Stefan.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Academic Year 2014 Spring Academic Year 2014 Spring.
Week 1 Introduction to Search Engine Optimization.
WEB USAGE MINING Web Usage Mining 1. Contents Web Usage Mining 2  Web Mining  Web Mining Taxonomy  Web Usage Mining  Web analysis tools  Pattern.
© Prentice Hall1 DATA MINING Web Mining Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University Companion slides.
Web mining is the use of data mining techniques to automatically discover and extract information from Web documents/services
Web Analytics and Reporting Michal Neuwirth Product Manager – Kentico Software.
 by Başak Meral.  Introduction  Web Personalization  User Profiling  Log Analysis and Web Usage Mining  Research Initiatives  Conclusion.
A WEB USAGE MINING FRAMEWORK FOR MINING EVOLVING USER PROFILES IN DYNAMIC WEB SITES.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
Week-6 (Lecture-1) Publishing and Browsing the Web: Publishing: 1. upload the following items on the web Google documents Spreadsheets Presentations drawings.
Web Analytics Fundamentals Presented by Tejaswi, Chandrika, Sunil.
Presentation by Sunitha SEO Company in India- KG Tech
Data mining in web applications
SEARCH ENGINE OPTIMIZATION.
Search Engine Optimization
PIWIK JUNIOR TIDAL ASSOCIATE PROF., WEB SERVICES & MULTIMEDIA LIBRARIAN NEW YORK CITY COLLEGE OF TECHNOLOGY, CUNY.
Getting Results with WebTrends
Automated ad placement
Web Mining Ref:
SEARCH ENGINE OPTIMIZATION SEO. What is SEO? It is the process of optimizing structure, design and content of your website in order to increase traffic.
1 SEO is short for search engine optimization. Search engine optimization is a methodology of strategies, techniques and tactics used to increase the amount.
Web Mining Department of Computer Science and Engg.
Presentation transcript:

Web Usage Mining - W hat, W hy, ho W Presented by:Roopa Datla Jinguang Liu

Agenda  What is Web Mining?  Why Web Usage Mining?  How to perform Web Usage Mining?

What is Web Mining?  Web Mining: – can be broadly defined as discovery and analysis useful information from the WWW –Consists of two major types: Web Content Mining Web Usage Mining

Why Web Usage Mining?  Explosive growth of E-commerce –Provides an cost-efficient way doing business –Amazon.com: “online Wal-Mart”  Hidden Useful information –Visitors’ profiles can be discovered –Measuring online marketing efforts, launching marketing campaigns, etc.

How to perform Web Usage Mining  Obtain web traffic data from –Web server log files –Corporate relational databases –Registration forms  Apply data mining techniques and other Web mining techniques  Two categories: –Pattern Discovery Tools –Pattern Analysis Tools

Pattern Analysis Tools  Answer Questions like: –“How are people using this site?” –“which Pages are being accessed most frequently?”  This requires the analysis of the structure of hyperlinks and the contents of the pages

Pattern Analysis Tools O/P of Analysis  The frequency of visits per document  Most recent visit per document  Frequency of use of each hyperlink  Most recent use of each hyperlink Techniques:  Visualization techniques  OLAP techniques  Data & Knowledge Querying  Usability analysis

Pattern Discovery Tools  Data Pre-processing –Filtering/clean Web log files eliminate outliers and irrelevant items –Integration of Web Usage data from: Web Server Logs Referral logs Registration file Corporate Database

Pattern Discovery Techniques  Converting IP addresses to Domain Names –Domain Name System does the conversion –Discover information from visitors’ domain names: Ex:.ca(Canada),.cn(China), etc  Converting URLs to Page Titles –Page Title: between and

Pattern Discovery Techniques  Path Analysis –Uses Graph Model –Provide insights to navigational problems –Example of info. Discovered by Path analysis: 78% “company”-> “what’s new”->“sample”-> “order” 60% left sites after 4 or less page references => most important info must be within the first 4 pages of site entry points.

Filter Pattern Discovery Techniques  Grouping –Groups similar info. to help draw higher-level conclusions –Ex: all URLs containing the word “Yahoo”…  Filtering –Allows to answer specific questions like: how many visitors to the site in this week ?

Pattern Discovery Techniques  Dynamic Site Analysis –Dynamic html links to the database, and requires parameters appended to URLs – in/search?search=Federal+Tax+Return+Form&c p=ntserchhttp://search.netscape.com/cgi- in/search?search=Federal+Tax+Return+Form&c p=ntserch –Knowledge: What the visitors looked for What keywords S/B purchased from Search engineer

Pattern Discovery Techniques  Cookies –Randomly assigned ID by web server to browser –Cookies are beneficial to both web site developers and visitors –Cookie field entry in log file can be used by Web traffic analysis software to track repeat visitors  loyal customers.

Pattern Discovery Techniques  Association Rules –help find spending patterns on related products –30% who accessed/company/products/bread.html, also accessed /company/products/milk.htm.  Sequential Patterns –help find inter-transaction patterns –50% who bought items in /pcworld/computers/, also bought in /pcworld/accessories/ within 15 days

Pattern Discovery Techniques  Clustering –Identifies visitors with common characteristics based on visitors’ profiles –50% who applied discover platinum card in /discovercard/customerService/newcard, were in the age group, with annual income between $40,000 – 50,000.

Pattern Discovery Techniques  Decision Trees –a flow chart of questions leading to a decision –Ex: car buying decision tree What Brand? What Year? What Type? 2000 Model Honda Accord EX …

Summary  E-commerce means more than just build up a web site, then sit back and relax;  Web Mining systems need to be implemented to: –Understand visitors’ profiles –Identify company’s strengths and weaknesses –Measure the effectiveness of online marketing efforts  Web Mining support on-going, continuous improvements for E-businesses

Thank You!