A new Informative Generic Base of Association Rules Gh. Gasmi, S. Ben Yahia, E. Mephu Nguifo, and Y. Slimani PAKDD’05 Advisor : Jia-Ling Koh Speaker :

Slides:



Advertisements
Similar presentations
Brief Introduction to Logic. Outline Historical View Propositional Logic : Syntax Propositional Logic : Semantics Satisfiability Natural Deduction : Proofs.
Advertisements

Recap: Mining association rules from large datasets
Salvatore Ruggieri SIGKDD2010 Frequent Regular Itemset Mining 2010/9/2 1.
Association Analysis (2). Example TIDList of item ID’s T1I1, I2, I5 T2I2, I4 T3I2, I3 T4I1, I2, I4 T5I1, I3 T6I2, I3 T7I1, I3 T8I1, I2, I3, I5 T9I1, I2,
Association Rule Mining. 2 The Task Two ways of defining the task General –Input: A collection of instances –Output: rules to predict the values of any.
Association rules The goal of mining association rules is to generate all possible rules that exceed some minimum user-specified support and confidence.
A Fast High Utility Itemsets Mining Algorithm Ying Liu,Wei-keng Liao,and Alok Choudhary KDD’05 Advisor : Jia-Ling Koh Speaker : Tsui-Feng Yen.
Data Mining Techniques So Far: Cluster analysis K-means Classification Decision Trees J48 (C4.5) Rule-based classification JRIP (RIPPER) Logistic Regression.
Data Mining Association Analysis: Basic Concepts and Algorithms Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Association Analysis (2). Example TIDList of item ID’s T1I1, I2, I5 T2I2, I4 T3I2, I3 T4I1, I2, I4 T5I1, I3 T6I2, I3 T7I1, I3 T8I1, I2, I3, I5 T9I1, I2,
Data Mining Association Analysis: Basic Concepts and Algorithms
Brief Introduction to Logic. Outline Historical View Propositional Logic : Syntax Propositional Logic : Semantics Satisfiability Natural Deduction : Proofs.
Data Mining Association Analysis: Basic Concepts and Algorithms
Generating Non-Redundant Association Rules Mohammed J. Zaki.
Association Rules presented by Zbigniew W. Ras *,#) *) University of North Carolina – Charlotte #) Warsaw University of Technology.
Association Rules. 2 Customer buying habits by finding associations and correlations between the different items that customers place in their “shopping.
Introduction to Normalization CPSC 356 Database Ellen Walker Hiram College.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining By Tan, Steinbach, Kumar Lecture.
Modul 7: Association Analysis. 2 Association Rule Mining  Given a set of transactions, find rules that will predict the occurrence of an item based on.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
A Personalized Recommender System Based on Users’ Information In Folksonomies Date: 2013/12/18 Author: Mohamed Nader Jelassi, Sadok Ben Yahia, Engelbert.
Association Rules. CS583, Bing Liu, UIC 2 Association rule mining Proposed by Agrawal et al in Initially used for Market Basket Analysis to find.
Efficient Data Mining for Calling Path Patterns in GSM Networks Information Systems, accepted 5 December 2002 SPEAKER: YAO-TE WANG ( 王耀德 )
MINING FREQUENT ITEMSETS IN A STREAM TOON CALDERS, NELE DEXTERS, BART GOETHALS ICDM2007 Date: 5 June 2008 Speaker: Li, Huei-Jyun Advisor: Dr. Koh, Jia-Ling.
2006/12/06Chen Yi-Chun1 Mining Positive and Negative Association Rules: An Approach for Confined Rules Maria-Luiza Antonie, Osmar R. Zaiane PKDD2004.
Apriori Algorithms Feapres Project. Outline 1.Association Rules Overview 2.Apriori Overview – Apriori Advantage and Disadvantage 3.Apriori Algorithms.
Mining Approximate Frequent Itemsets in the Presence of Noise By- J. Liu, S. Paulsen, X. Sun, W. Wang, A. Nobel and J. Prins Presentation by- Apurv Awasthi.
CSE4334/5334 DATA MINING CSE4334/5334 Data Mining, Fall 2014 Department of Computer Science and Engineering, University of Texas at Arlington Chengkai.
Privacy-preserving rule mining. Outline  A brief introduction to association rule mining  Privacy preserving rule mining Single party  Perturbation.
Mining Quantitative Association Rules in Large Relational Tables ACM SIGMOD Conference 1996 Authors: R. Srikant, and R. Agrawal Presented by: Sasi Sekhar.
1 What is Association Analysis: l Association analysis uses a set of transactions to discover rules that indicate the likely occurrence of an item based.
Minimum Cover of F. Minimal Cover for a Set of FDs Minimal cover G for a set of FDs F: –Closure of F = closure of G. –Right hand side of each FD in G.
1/24 Novel algorithm for mining high utility itemsets Shankar, S. Purusothaman, T. Jayanthi, S. International Conference on Computing, Communication and.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Association Analysis This lecture node is modified based on Lecture Notes for.
1 AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Hong.
Association Rules presented by Zbigniew W. Ras *,#) *) University of North Carolina – Charlotte #) ICS, Polish Academy of Sciences.
Functional Dependencies CIS 4301 Lecture Notes Lecture 8 - 2/7/2006.
Mining Progressive Confident Rules M. Zhang, W. Hsu and M.L. Lee Int'l Conf on Data Mining (ICDM),2006 IEEE Advisor : Jia-Ling Koh Speaker : Tsui-Feng.
Data Mining Association Rules Mining Frequent Itemset Mining Support and Confidence Apriori Approach.
1 Data Mining Lecture 6: Association Analysis. 2 Association Rule Mining l Given a set of transactions, find rules that will predict the occurrence of.
1 Mining the Smallest Association Rule Set for Predictions Jiuyong Li, Hong Shen, and Rodney Topor Proceedings of the 2001 IEEE International Conference.
Searching for Pattern Rules Guichong Li and Howard J. Hamilton Int'l Conf on Data Mining (ICDM),2006 IEEE Advisor : Jia-Ling Koh Speaker : Tsui-Feng Yen.
Mining General Temporal Association Rules for Items with Different Exhibition Cheng-Yue Chang, Ming-Syan Chen, Chang-Hung Lee, Proc. of the 2002 IEEE international.
A LAZY APPROACH TO ASSOCIATIVE CLASSIFICATION Elena Baralis, Silvia Chiusano, Paolo Garza Dipartimento di Automatica e Informatica, Politecnico di Torino,
Mining Sequential Patterns With Item Constraints
CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
Mining Dependent Patterns
Data Mining Association Analysis: Basic Concepts and Algorithms
Indirect Argument: Contradiction and Contraposition
Knowledge discovery & data mining Association rules and market basket analysis--introduction UCLA CS240A Course Notes*
Frequent Pattern Mining
Association Rules.
Association Rules Zbigniew W. Ras*,#) presented by
Chang-Hung Lee, Jian Chih Ou, and Ming Syan Chen, Proc
Dynamic Itemset Counting
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Rule Mining
Data Mining Association Analysis: Basic Concepts and Algorithms
CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
DIRECT HASHING AND PRUNING (DHP) ALGORITHM
Mining Association Rules from Stars
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Analysis: Basic Concepts and Algorithms
Graduate Course DataMining
DENSE ITEMSETS JOUNI K. SEPPANEN, HEIKKI MANNILA SIGKDD2004
CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
Association Analysis: Basic Concepts
Presentation transcript:

A new Informative Generic Base of Association Rules Gh. Gasmi, S. Ben Yahia, E. Mephu Nguifo, and Y. Slimani PAKDD’05 Advisor : Jia-Ling Koh Speaker : Tsui-Feng Yen

Outline Introduction Mathematical background Related work on generic bases of association rules -Work of Bastide et al. -Work of Phan New generic base Conclusion

Introduction The problem of the relevance and the usefulness of extracted association rules is becoming of primary importance, since an overwhelming number of association rules may be derived In this paper, we introduce a novel generic base of association rules, The novel generic base is sound and informative

Mathematical background Basic notions -Formal context A formal context is a triplet K = (O,A,R), where O represents a finite set of objects (or transactions), A is a finite set of attributes and R is a binary relation (i.e., R ⊆ O×A). Each couple (o, a) ∈ R expresses that the transaction o ∈ O contains the attribute a ∈ A, objects are denoted by numbers and attributes by letters.

Mathematical background(conti) Basic notions -Formal context define two functions: note: 找出現在這幾個 transaction 中, 所有共同的出現的 item note: 找有出現 A 這個 itemset 的所有的 transactions -both compound operators of Φand Ψ are closure operators, in particular ω = Φ 。 Ψ is a closure operator.

Mathematical background Basic notions -Frequent closed itemset : Closed itemset note :沒有其他的 itemset 跟他的 support 一樣且又包含他的 - Formal concept: - Minimal generator:

Mathematical background Basic notions frequent closed itemset C 、 AC 、 BE 、 BCE 、 ABCE ex : AC=Φ(Ψ(AC))= Φ(1 、 3 、 5)=AC ,且 sup=3/5 minimal generator A 、 B 、 C 、 E 、 AB 、 AE 、 BC 、 CE

Mathematical background Derivation of association rules : - Association rule R is a relation between itemsets of the form R : X ⇒ (Y-X), in which X and Y are frequent itemsets, and X ⊂ Y - The valid association rules are those whose the strength metric conf(R)=, is greater than or equal to the minimal threshold of confidence minconf. - If conf(R)=1 then R is called exact association rule (ER), otherwise it is called approximative association rule (AR).

Related work The concept of rule redundancy can be considered as follows : Ex:R 1 : A=>CD R: AB=>C (X 1 ) (Y 1 ) (X) (Y)

Related work Work of Bastide et al. In one paper, the couple (GBE,GBA) of generic bases form a sound and informative generic base. In another,the authors presented sound inference axioms for both (GBE and GBA) bases, permitting to derive all association rules from generic bases of association rules.

Related work Work of Bastide et al.

Work of Phan

For example: ∅ => AC, {A => AC, C=> AC, and A => C}

Drawbacks of Work of Phan the presented generic base is not informative, i.e., it may exist a derivable rule for which it is impossible to determine exactly both its support and confidence For example : the association rule BE=>C is derivable from the generic rule E=>ABCE. However, it is impossible to derive the exact confidence and support of the derivable rule from the GB Phan generic base.

New Generic Base : IGB

Proposition 3. The IGB generic base is informative. proof: - sufficient to show that it contains all the necessary information to determine the support of an itemset in a derived rule. -means that we have to be able to reconstitute all closed itemset by concatenation of the premise and the conclusion of a generic rule.

New Generic Base : IGB -The algorithm considers all the discovered frequent closed itemsets. Hence for a given frequent closed itemset, say c, it tries to find the smallest minimal generator, say gs, associated to frequent closed itemsets subsumed by c and fulfilling the minsup constraint -Therefore, the algorithm generates the following generic rule gs => c. Since gs ⊂ c, then gs ∪ c=c. Conclusion: The IGB generic base is informative.

Conclusion presented a critical survey of the reported approaches for defining generic bases of association rules. introduced a novel generic base, which is sound and informative. provided a set of sound inference axioms for deriving all association rules from the introduced generic base of association rules.