On Improving Website Connectivity by Using Web-Log Data Streams

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

C. Mastroianni, D. Talia, O. Verta - A Super-Peer Model for Resource Discovery Services in Grids A Super-Peer Model for Building Resource Discovery Services.
Web Usage Mining Web Usage Mining (Clickstream Analysis) Mark Levene (Follow the links to learn more!)
Clickstream analysis - data collection, preprocessing and mining using the LISp-Miner system Effective placement of on-line advertisments Tomáš Kliegr.
Mining Frequent Patterns II: Mining Sequential & Navigational Patterns Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Data e Web Mining Paolo Gobbo
SIM348. “ConfigMgr appeared in Gartner client buying decisions more frequently than any other product in the market in 2010.”
Experiments on Query Expansion for Internet Yellow Page Services Using Log Mining Summarized by Dongmin Shin Presented by Dongmin Shin User Log Analysis.
Location Management in Cellular Networks By Priyanka Patel ( ) Instructor: Prof. Ivan Stojmenovic.
LinkSelector: A Web Mining Approach to Hyperlink Selection for Web Portals Xiao Fang University of Arizona 10/18/2002.
What ’ s Hot and What ’ s Not: Tracking Most Frequent Items Dynamically G. Cormode and S. Muthukrishman Rutgers University ACM Principles of Database Systems.
LinkSelector: Select Hyperlinks for Web Portals Prof. Olivia Sheng Xiao Fang School of Accounting and Information Systems University of Utah.
The Web is perhaps the single largest data source in the world. Due to the heterogeneity and lack of structure, mining and integration are challenging.
Progress Report 11/1/01 Matt Bridges. Overview Data collection and analysis tool for web site traffic Lets website administrators know who is on their.
1 End-to-End Detection of Shared Bottlenecks Sridhar Machiraju and Weidong Cui Sahara Winter Retreat 2003.
Web Usage Mining - W hat, W hy, ho W Presented by:Roopa Datla Jinguang Liu.
Update Log Dissemination in Mobile Ad Hoc Networks Hideki HAYASHI Hitachi, Ltd., Central Research Laboratory (Grad. School of Info. Science and Tech.,
T EMPORAL -I NFORMATICS R ESEARCH Eytan Adar University of Michigan, School of Information and Computer Science & Engineering April 23, 2010.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα
Sensys 2009 Speaker:Lawrence.  Introduction  Overview & Challenges  Algorithm  Travel Time Estimation  Evaluation  Conclusion.
By Ravi Shankar Dubasi Sivani Kavuri A Popularity-Based Prediction Model for Web Prefetching.
The Research of Applying Random Early Blocking strategy to Dynamic Lightpath Routing National Yunlin University of Science & Technology.
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
CS 401 Paper Presentation Praveen Inuganti
Fall 2006 Davison/LinCSE 197/BIS 197: Search Engine Strategies 6-1 Module II Overview PLANNING: Things to Know BEFORE You Start… Why SEM? Goal Analysis.
CS 712 | Fall 2007 Using Mobile Relays to Prolong the Lifetime of Wireless Sensor Networks Wei Wang, Vikram Srinivasan, Kee-Chaing Chua. National University.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
Strategies for improving Web site performance Google Webmaster Tools + Google Analytics Marshall Breeding Director for Innovative Technologies and Research.
Generating Intelligent Links to Web Pages by Mining Access Patterns of Individuals and the Community Benjamin Lambert Omid Fatemieh CS598CXZ Spring 2005.
The probability that a person visits a location. ~ means that two locations are equivalent.  Grids -> close to the border of two different.
Hao Wu Nov Outline Introduction Related Work Experiment Methods Results Conclusions & Next Steps.
MINING FREQUENT ITEMSETS IN A STREAM TOON CALDERS, NELE DEXTERS, BART GOETHALS ICDM2007 Date: 5 June 2008 Speaker: Li, Huei-Jyun Advisor: Dr. Koh, Jia-Ling.
Efficient mining and prediction of user behavior patterns in mobile web systems Vincent S. Tseng, Kawuu W. Lin Information and Software Technology 48 (2006)
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Web usage mining: extracting unexpected periods from web.
TECHNION – Israel Institute of Technology Department of Electrical Engineering The Computer Network Laboratory Crankback Prediction in ATM According to.
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
EVALUATE YOUR SITE’S PERFORMANCE. Web site statistics Affiliate Sales Figures.
1 Murat Ali Bayır Middle East Technical University Department of Computer Engineering Ankara, Turkey A New Reactive Method for Processing Web Usage Data.
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
Web-Mining …searching for the knowledge on the Internet… Marko Grobelnik Institut Jožef Stefan.
HTML 5 Tutorial Chapter 6 Web Storage. Storing Data on The Client HTML5 offers two new objects for storing data on the client: localStorage - stores data.
Web Mining Issues Size Size –>350 million pages –Grows at about 1 million pages a day Diverse types of data Diverse types of data.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Self-Organized Web Usage Regularities. Problems of foraging information on WWW Slow accession Difficulty in finding useful information is related to balkanization.
Cooperative Mobile Live Streaming Considering Neighbor Reception SPEAKER: BO-YU HUANG ADVISOR: DR. HO-TING WU 2015/10/15 1.
Secondary Evidence for User Satisfaction With Community Information Systems Gregory B. Newby University of North Carolina at Chapel Hill ASIS Midyear Meeting.
How Web Database Architectures Work CPS181s April 8, 2003.
Exploring Traversal Strategy for Web Forum Crawling Yida Wang, Jiang-Ming Yang, Wei Lai, Rui Cai Microsoft Research Asia, Beijing SIGIR
An Energy-Efficient Approach for Real-Time Tracking of Moving Objects in Multi-Level Sensor Networks Vincent S. Tseng, Eric H. C. Lu, & Kawuu W. Lin Institute.
for all Hyperion video tutorial/Training/Certification/Material Essbase Optimization Techniques by Amit.
Web Analytics Xuejiao Liu INF 385F: WIRED Fall 2004.
A Sequential Pattern Query Language for Supporting Instant Data Mining for e-Services Reza Sadri Carlo Zaniolo Amir.
© Prentice Hall1 DATA MINING Web Mining Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University Companion slides.
Improvement of Apriori Algorithm in Log mining Junghee Jaeho Information and Communications University,
Data mining in web applications
Patterns in Sequences and Data Streams
T-Share: A Large-Scale Dynamic Taxi Ridesharing Service
Strategies for improving Web site performance
Online Frequent Episode Mining
Yi-Chia Wang LTI 2nd year Master student
EMarketing: The Essential Guide to Marketing in a Digital World Data Analytics What you’ll learn How to set solid objectives, goals and measurements.
Lin Lu, Margaret Dunham, and Yu Meng
Automation in IMS Can it help the shrinking talent pool
The Anatomy of a Large-Scale Hypertextual Web Search Engine
Degree-aware Hybrid Graph Traversal on FPGA-HMC Platform
Web Mining Department of Computer Science and Engg.
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
Family Networks Web Transition Planning
Discovery of Significant Usage Patterns from Clickstream Data
Presentation transcript:

On Improving Website Connectivity by Using Web-Log Data Streams Edmond HaoCun Wu, Michael KwokPo Ng, Joshua ZheXue Huang From DASFAA 2004, LNCS 2973 Advisor:Jia-Ling Koh Speaker:Chun-Wei Hsieh 07/14/2004

Problem Statement Excessive web pages requested make web servers ineffectively. User have the low Website connective speed and even cannot access the Website. The Website linkage design affects the Website access efficiency. User spend a lot of time on searching the contents.

Problem Solving Decrease the redundant pages requested at the peak hours will be of great help to improve the system performance.

Website Topology Website topology is the structure of a Website.

Website Topology If users want to sequentially visit {A,E,F} ,the shortest traversal path is {A,C,H,E,H,C,A,B,F}. The access sequence is S = {AC,CH,HE,EH,HC,CA,AB,BF}

Non-target accesses There are two types of access redundancy. 1.Jump-track access 2.Backtrack access S = {AC,CH,HE,EH,HC,CA,AB,BF} AC,CH,HE,EH,HC,CA,AB,BF AC,CH,HE,EH,HC,CA,AB,BF

UAE & SAE User Access Efficiency Server Access Efficiency S: a user access session A: the jump-track access session B: the backtrack access session

UAE & SAE (2) Given a access session database D, the UAE and SAE of D are calculated as follows:

Mining Access Patterns Three factors: 1.page transitive probability 2.page access frequency 3.page staying time The new index : Access Interest (AI)

Topology Probability Model h: the number of hyperlinks w: the weighting parameter m: assume the shortest path passes through m Web pages

Counting Page Sequential Accesses Given a user access session dataset D={S1,S2,…., Sn}, for each S D, S={P1,P2,….,Pr} is the number of accesses from to

The Temporal Factor is the staying time of visiting and : the average staying time on page

Access Interest (AI) can identify frequent but not efficient access patterns. A access pattern with higher AI value has more necessity to improve its access efficiency.

Dynamic Website Connectivity Enhancement 1.Convert Web-log data streams into access sessions. 2.Calculate the access efficiency (UAE and SAE) of current user access patterns.If Access efficiency is lower than the threshold, the monitoring system give a warning signal.Otherwise, keep on monitoring. 3.Calculate the AI value of current access patterns, then select access patterns with high AI value for connectivity enhancement. 4.Recompute the access efficiency of access patterns,if the access efficiency increases, it indicates the Website connectivity has been improved.

Experiments

Experiments .

Experiments

Experiments

Conclusion Proposed new measures for evaluating access efficiency in Website. Proposed an efficient method aggregating access sessions and mining access patterns. In the future, extend the Website optimization model in the CRM.