Download presentation
Presentation is loading. Please wait.
Published byNeal Hart Modified over 9 years ago
1
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Web usage mining: extracting unexpected periods from web logs Presenter: Hsin-Yi Huang Authors: F.Masseglia, P. Poncelet, M. Teisseire, A. Marascu 12007.DMKD.27.
2
N.Y.U.S.T. I. M. Intelligent Database Systems Lab 2008/05/27 2 Outline Motivation Objective Methodology Stable Period PERIO Heuristic Sequence alignment Experiment Conclusion Comments
3
N.Y.U.S.T. I. M. Intelligent Database Systems Lab 2008/05/27 3 Motivation Existing Web usage mining techniques are currently based on an arbitrary division of the data or guided by presumed results. First, they depend on the above-mentioned arbitrary organization of data. Second, they cannot automatically extract “seasonal peaks” from among the stored data. Request 網頁 01/31~02/01 one log per month 200,000 navigations
4
N.Y.U.S.T. I. M. Intelligent Database Systems Lab 2008/05/27 4 Objective The authors propose a specific data mining process to reveal “dense periods” in the short range automatically. Furthermore, their method can extracts the frequent sequential patterns related to the extracted periods.
5
N.Y.U.S.T. I. M. Intelligent Database Systems Lab Stable Period 2008/05/27 5 ?
6
N.Y.U.S.T. I. M. Intelligent Database Systems Lab 2008/05/27 6 PERIO heuristic … PnPn C1C2C3CnC1C2C3Cn DB=C Pn FI=Frequent Items(C Pn ) FIXFI Candidates Evaluated Candidates Frequents Operators 1 2 Log
7
N.Y.U.S.T. I. M. Intelligent Database Systems Lab 2008/05/27 7 Sequence alignment
8
N.Y.U.S.T. I. M. Intelligent Database Systems Lab 2008/05/27 8 Experiment Dataset INRIA Sophia Antipolis From January 2004 to March 2005 3.5 million sequences (users)
9
N.Y.U.S.T. I. M. Intelligent Database Systems Lab 2008/05/27 9 Experiment (cont.)
10
N.Y.U.S.T. I. M. Intelligent Database Systems Lab 2008/05/27 10 Conclusion The method can handle a Web log of any size, with no need to divide it and identify interesting periods in the log. The method can extract a frequent even if it is frequent only for a very short period, or frequent over a period that is not included in a standard division of time.
11
N.Y.U.S.T. I. M. Intelligent Database Systems Lab 2008/05/27 11 Comments Advantage a lot of illustrations Drawback … Application Web usage mining
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.