Mining General Temporal Association Rules for Items with Different Exhibition Cheng-Yue Chang, Ming-Syan Chen, Chang-Hung Lee, Proc. of the 2002 IEEE international.

Slides:



Advertisements
Similar presentations
Association Rule Mining
Advertisements

Recap: Mining association rules from large datasets
Frequent Itemset Mining Methods. The Apriori algorithm Finding frequent itemsets using candidate generation Seminal algorithm proposed by R. Agrawal and.
Association Rule Mining. 2 The Task Two ways of defining the task General –Input: A collection of instances –Output: rules to predict the values of any.
TEMPORAL ASSOCIATION RULE MINING
Association rules The goal of mining association rules is to generate all possible rules that exceed some minimum user-specified support and confidence.
Sampling Large Databases for Association Rules ( Toivenon’s Approach, 1996) Farzaneh Mirzazadeh Fall 2007.
A Fast High Utility Itemsets Mining Algorithm Ying Liu,Wei-keng Liao,and Alok Choudhary KDD’05 Advisor : Jia-Ling Koh Speaker : Tsui-Feng Yen.
Data Mining Association Analysis: Basic Concepts and Algorithms Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Data Mining Association Analysis: Basic Concepts and Algorithms
Spring 2003Data Mining by H. Liu, ASU1 5. Association Rules Market Basket Analysis and Itemsets APRIORI Efficient Association Rules Multilevel Association.
Spring 2005CSE 572, CBS 598 by H. Liu1 5. Association Rules Market Basket Analysis and Itemsets APRIORI Efficient Association Rules Multilevel Association.
Mining Time-Series Databases Mohamed G. Elfeky. Introduction A Time-Series Database is a database that contains data for each point in time. Examples:
Association Analysis: Basic Concepts and Algorithms.
Association Rule Mining. Generating assoc. rules from frequent itemsets  Assume that we have discovered the frequent itemsets and their support  How.
Data Mining Association Analysis: Basic Concepts and Algorithms
Is Sampling Useful in Data Mining? A Case in the Maintenance of Discovered Association Rules S.D. Lee, D. W. Cheung, B. Kao Department of Computer Science.
6/23/2015CSE591: Data Mining by H. Liu1 Association Rules Transactional data Algorithm Applications.
© Vipin Kumar CSci 8980 Fall CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance Computing Research Center Department of Computer.
2/8/00CSE 711 data mining: Apriori Algorithm by S. Cha 1 CSE 711 Seminar on Data Mining: Apriori Algorithm By Sung-Hyuk Cha.
Is Sampling Useful in Data Mining? A Case in the Maintenance of Discovered Association Rules S.D. Lee David W. Cheung Ben Kao The University of Hong Kong.
Mining Association Rules
1 Fast Algorithms for Mining Association Rules Rakesh Agrawal Ramakrishnan Srikant Slides from Ofer Pasternak.
Mining Association Rules
Performance and Scalability: Apriori Implementation.
Mining Association Rules in Large Databases. What Is Association Rule Mining?  Association rule mining: Finding frequent patterns, associations, correlations,
Mining Association Rules between Sets of Items in Large Databases presented by Zhuang Wang.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining By Tan, Steinbach, Kumar Lecture.
Modul 7: Association Analysis. 2 Association Rule Mining  Given a set of transactions, find rules that will predict the occurrence of an item based on.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Association Rules. CS583, Bing Liu, UIC 2 Association rule mining Proposed by Agrawal et al in Initially used for Market Basket Analysis to find.
Efficient Data Mining for Calling Path Patterns in GSM Networks Information Systems, accepted 5 December 2002 SPEAKER: YAO-TE WANG ( 王耀德 )
9/03Data Mining – Association G Dong (WSU) 1 5. Association Rules Market Basket Analysis APRIORI Efficient Mining Post-processing.
CSE4334/5334 DATA MINING CSE4334/5334 Data Mining, Fall 2014 Department of Computer Science and Engineering, University of Texas at Arlington Chengkai.
1 On Mining General Temporal Association Rules in a Publication Database Chang-Hung Lee, Cheng-Ru Lin and Ming-Syan Chen, Proceedings of the 2001 IEEE.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Association Analysis This lecture node is modified based on Lecture Notes for.
1 AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Hong.
1 Mining Sequential Patterns with Constraints in Large Database Jian Pei, Jiawei Han,Wei Wang Proc. of the 2002 IEEE International Conference on Data Mining.
Association Analysis (3)
Mining Progressive Confident Rules M. Zhang, W. Hsu and M.L. Lee Int'l Conf on Data Mining (ICDM),2006 IEEE Advisor : Jia-Ling Koh Speaker : Tsui-Feng.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
A Scalable Association Rules Mining Algorithm Based on Sorting, Indexing and Trimming Chuang-Kai Chiou, Judy C. R Tseng Proceedings of the Sixth International.
1 The Strategies for Mining Fault-Tolerant Patterns Jia-Ling Koh Department of Information and Computer Education National Taiwan Normal University.
Reducing Number of Candidates Apriori principle: – If an itemset is frequent, then all of its subsets must also be frequent Apriori principle holds due.
Data Mining Association Rules Mining Frequent Itemset Mining Support and Confidence Apriori Approach.
1 Data Mining Lecture 6: Association Analysis. 2 Association Rule Mining l Given a set of transactions, find rules that will predict the occurrence of.
1 Mining the Smallest Association Rule Set for Predictions Jiuyong Li, Hong Shen, and Rodney Topor Proceedings of the 2001 IEEE International Conference.
CS685: Special Topics in Data Mining The UNIVERSITY of KENTUCKY Frequent Itemset Mining II Tree-based Algorithm Max Itemsets Closed Itemsets.
Rapid Association Rule Mining Amitabha Das, Wee-Keong Ng, Yew-Kwong Woon, Proc. of the 10th ACM International Conference on Information and Knowledge Management(CIKM’01),2001.
Reducing Number of Candidates
Data Mining Association Analysis: Basic Concepts and Algorithms
Data Mining: Concepts and Techniques
Association Rules Repoussis Panagiotis.
Frequent Pattern Mining
Chang-Hung Lee, Jian Chih Ou, and Ming Syan Chen, Proc
Dynamic Itemset Counting
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Rule Mining
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Rule Mining
An Efficient Algorithm for Incremental Mining of Association Rules
A Parameterised Algorithm for Mining Association Rules
Data Mining Association Analysis: Basic Concepts and Algorithms
Amer Zaheer PC Mohammad Ali Jinnah University, Islamabad
Farzaneh Mirzazadeh Fall 2007
Frequent-Pattern Tree
Association Analysis: Basic Concepts
Presentation transcript:

Mining General Temporal Association Rules for Items with Different Exhibition Cheng-Yue Chang, Ming-Syan Chen, Chang-Hung Lee, Proc. of the 2002 IEEE international Conference on Data Mining(ICDM’02) Adviser: Jia-Ling Koh Speaker: Yu-ting Kung

2 Introduction In this paper, explore a new model of mining general temporal association rules from large database where the exhibition periods of the items are allowed to be different from one to another. (see next page)

3 Introduction (Cont.) What’s wrong on conventional mining algorithm applied in this database? For example: Min_support = 30%, min_conf= 75% By conventional mining, only {A}, {B}, {C} and {F} are frequent itemsets No association rule discovered But some rules do exist in this database!!

4 Introduction (Cont.) What’s the problem of conventional mining algorithm? It doesn’t take the individual exhibition periods of items into consideration.

5 Introduction (Cont.) For allowing to have different exhibition periods, now define three basic definition: Maximal common exhibition period (MCP) MCP(X) = [p, q] For example: (in Figure1) MCP(BC) = [2,3] itemset Latest-exhibition-start time earliest-exhibition-end-time

6 Introduction (Cont.) Relative support For example: (in Figure1) Confidence For example: (in Figure1)

7 Introduction (Cont.) Based on the definition above, the frequent general temporal association rules in this database are:

8 Introduction (Cont.) In this model, the “downward closure” property is no longer valid. For example: (In Figure1) itemset BCD is frequent in [2,2], but BC, BD and CD are “not” all frequent in their corresponding MCP!! ex: BC’s relative support is only 25% (< 30%)

9 Problem Description Maximal temporal itemset For example: BCD 2,2 ( ) BD 2,2 ( ) BC 2,2 ( X ) Temporal sub-itemset of the maximal temporal itemset For example: BCD 2,2 is a maximal temporal itemset  BD 2,2, BC 2,2 and CD 2,2 are the temporal sub-itemset of BCD 2,2

10 Problem Description (Cont.) Maximal temporal itemset is frequent For example: (X MCP(X) is a maximal TI) If supp(X MCP(X) ) >= min_supp, then X MCP(X) is a frequent Property: All temporal sub-itemsets of a frequent maximal temporal itemset are frequent General temporal association rule It will be frequent iff

11 Mining General Temporal Association Rule ─ SPF Algorithm SPF consists of “two” major procedures: Segmentation (ProcSG) Progressively Filtering (ProcPF) First, SPF divide the database into partitions according to the time granularity imposed. Second, SPF employs ProcSG Third, SPF utilizes ProcPF Then, generate all candidate k-itemsets from (k-1)- itemset  transform to TIs, generate SIs Finally, scan database to determine all frequent TIs and SIs

12 SPF Algorithm ─ ProcSG Segment the database into sub-database that items in each will have either the common starting time or the common ending time db 1,6  db 1,3, db 4,4 and db 5,6

13 SPF Algorithm ─ ProcPF After the entire database is segmented by ProcSG, ProcPF is to progressivly filter candidate 2-itemsets from one partition to another in each sub-database

14 An Illustrative Example (SPF) Illustrative Example: Figure1 Min_supp = 30%, min_conf=75% Use ProcSG: database  sub-databases db 1,4  db 1,2 and db 3,4 (two sub-segments)

15 An Illustrative Example (SPF) Use ProcPF: progressively filter the candidate 2- itemsets

16 An Illustrative Example (SPF) After the 1st database scan, C2= {AB, BC, BD, CD, CF, EF} Generate C3, C3={BCD} Transform to TI and generate SI After the 2nd database scan, Frequent T1={AB 2,4, BD 2,2, CF 1,3, EF 3,3 BCD 2,2 }

17 Experiment Data |D| = the number of transactions |T| = average size in each transaction |N| = the number of different items |L| = the number of potential frequent itemsets Algorithms to compare SPF Apriori IP

18 Experiment (Cont.)

19 Experiment (Cont.)