Action Association Rules Mining

Slides:



Advertisements
Similar presentations
Huffman Codes and Asssociation Rules (II) Prof. Sin-Min Lee Department of Computer Science.
Advertisements

Association Analysis (Data Engineering). Type of attributes in assoc. analysis Association rule mining assumes the input data consists of binary attributes.
Mining Multiple-level Association Rules in Large Databases
Action Rules Discovery /Lecture I/ by Zbigniew W. Ras UNC-Charlotte, USA.
Association Rule Mining. 2 The Task Two ways of defining the task General –Input: A collection of instances –Output: rules to predict the values of any.
Chase Repp.  knowledge discovery  searching, analyzing, and sifting through large data sets to find new patterns, trends, and relationships contained.
ICDM'06 Panel 1 Apriori Algorithm Rakesh Agrawal Ramakrishnan Srikant (description by C. Faloutsos)
Association rules The goal of mining association rules is to generate all possible rules that exceed some minimum user-specified support and confidence.
Data Mining Association Analysis: Basic Concepts and Algorithms
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Association Mining Data Mining Spring Transactional Database Transaction – A row in the database i.e.: {Eggs, Cheese, Milk} Transactional Database.
Association Rules l Mining Association Rules between Sets of Items in Large Databases (R. Agrawal, T. Imielinski & A. Swami) l Fast Algorithms for.
Association Analysis. Association Rule Mining: Definition Given a set of records each of which contain some number of items from a given collection; –Produce.
Data Mining Techniques So Far: Cluster analysis K-means Classification Decision Trees J48 (C4.5) Rule-based classification JRIP (RIPPER) Logistic Regression.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
ICS 421 Spring 2010 Data Mining 1 Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 4/6/20101Lipyeow Lim.
Data Mining Association Analysis: Basic Concepts and Algorithms
Chapter 16 Parallel Data Mining 16.1From DB to DW to DM 16.2Data Mining: A Brief Overview 16.3Parallel Association Rules 16.4Parallel Sequential Patterns.
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Rules Presented by: Anilkumar Panicker Presented by: Anilkumar Panicker.
DATA MINING -ASSOCIATION RULES-
Mining Tree-Query Associations in a Graph Bart Goethals University of Antwerp, Belgium Eveline Hoekx Jan Van den Bussche Hasselt University, Belgium.
© Vipin Kumar CSci 8980 Fall CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance Computing Research Center Department of Computer.
Fast Algorithms for Association Rule Mining
Research Project Mining Negative Rules in Large Databases using GRD.
Mining Association Rules
SEG Tutorial 2 – Frequent Pattern Mining.
Association Discovery from Databases Association rules are a simple formalism for expressing positive connections between columns in a 0/1 matrix. A classical.
Mining Association Rules between Sets of Items in Large Databases presented by Zhuang Wang.
Apriori algorithm Seminar of Popular Algorithms in Data Mining and Machine Learning, TKK Presentation Lauri Lahti.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Modul 7: Association Analysis. 2 Association Rule Mining  Given a set of transactions, find rules that will predict the occurrence of an item based on.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Association Rules. CS583, Bing Liu, UIC 2 Association rule mining Proposed by Agrawal et al in Initially used for Market Basket Analysis to find.
ASSOCIATION RULE DISCOVERY (MARKET BASKET-ANALYSIS) MIS2502 Data Analytics Adapted from Tan, Steinbach, and Kumar (2004). Introduction to Data Mining.
Data Mining By Dave Maung.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
CSE4334/5334 DATA MINING CSE4334/5334 Data Mining, Fall 2014 Department of Computer Science and Engineering, University of Texas at Arlington Chengkai.
Towards Using Grid Services for Mining Fuzzy Association Rules Mihai Gabroveanu, Ion Iancu, Mirel Cosulschi, Nicolae Constantinescu Faculty of Mathematics.
Association rule mining Goal: Find all rules that satisfy the user-specified minimum support (minsup) and minimum confidence (minconf). Assume all data.
1 What is Association Analysis: l Association analysis uses a set of transactions to discover rules that indicate the likely occurrence of an item based.
Association Rule Mining
CS Data Mining1 Data Mining The Extraction of useful information from data The automated extraction of hidden predictive information from (large)
Elsayed Hemayed Data Mining Course
Data Mining Association Rules Mining Frequent Itemset Mining Support and Confidence Apriori Approach.
Optimization of Association Rules Extraction Through Exploitation of Context Dependent Constraints Arianna Gallo, Roberto Esposito, Rosa Meo, Marco Botta.
1 Data Mining Lecture 6: Association Analysis. 2 Association Rule Mining l Given a set of transactions, find rules that will predict the occurrence of.
1 Mining the Smallest Association Rule Set for Predictions Jiuyong Li, Hong Shen, and Rodney Topor Proceedings of the 2001 IEEE International Conference.
Chapter 3 Data Mining: Classification & Association Chapter 4 in the text box Section: 4.3 (4.3.1),
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
DATA MINING © Prentice Hall.
Data Mining Association Analysis: Basic Concepts and Algorithms
Frequent Pattern Mining
William Norris Professor and Head, Department of Computer Science
Waikato Environment for Knowledge Analysis
Chang-Hung Lee, Jian Chih Ou, and Ming Syan Chen, Proc
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Rule Mining
Data Mining Association Analysis: Basic Concepts and Algorithms
Targeted Association Mining in Time-Varying Domains
Data Mining Association Rules Assoc.Prof.Songül Varlı Albayrak
Data Mining Association Rules: Advanced Concepts and Algorithms
Mining Association Rules from Stars
Data Mining Association Analysis: Basic Concepts and Algorithms
Market Basket Analysis and Association Rules
Lecture 11 (Market Basket Analysis)
Association Analysis: Basic Concepts
Hansheng Lei Univ. of Texas Rio Grande Valley
Presentation transcript:

Action Association Rules Mining Project proposal for Fall 2016 Jian Chen & Jennifer Lavergne

Project Goals To modify the itemset tree algorithm: Discovering action sets Generating action rules

Background

Application: Customer Attrition Facts: On average, most US corporations lose half of their customers every five years Longer a customer stays with the organization, the more profitable he or she becomes The cost of attracting new customers is five to ten times more than retaining existing ones About 14% to 17% of the accounts are closed for reasons that can be controlled like price or service Action: Reducing the outflow of the customers by 5% can double a typical company’s profit

Association Rules Association rule - an implication {X ⇒ Y, support, confidence}. Where X and Y are subsets of the itemset I and X∩Y = Ø Example: {{bread, milk} ⇒ {cheese}, 30%, 75%} Support = #occurrences of I in database/#rows in database Minsup – The minimum support threshold for an itemset I to be considered frequent Confidence = Support(X ⋃ Y)/Support(X) for itemset I = X ⋃ Y. Minconf – a user specified threshold that indicates the interestingness of a candidate rule I: conf(I) > minconf

Itemset Trees A data structure which aids in users querying for a specific itemset and it’s support: Targeted Association Mining Item mapped to numeric values: {bread} = {1}, {cheese} = {2} Numbers must be in ascending order within the itemset Ex: I = {1, 2, 56, 120} Note: Can be used to find all or specific rules within a dataset.

Rule generation

Rule Generation minsup = 10% and minconf = 30% Rules generated using this method: {query} ⇒ {I - query} {2, 4} ⇒ {1}, support = 1/7 = 14%, confidence = 1/3 = 33% {2, 4} ⇒ {8}, support = 2/7 = 29%, confidence = 2/3 = 66%

Action association rules

Action Association Rules Action Rule - Association rules with flexible and stable attributes. Flexible attribute – objects which can potentially change from one state to another: (interest rate, low → high) Stable attribute – objects which will remain stable and not change: (date of birth) Action rule example: (a, a1), (b, b1 → b2) ⇒ (d, d2 → d1)(c, c1)

Action Set Discovery A B C D x1 a1 b1 c1 d1 x2 a2 x3 b2 d2 x4 x5 x6 x7 (category, item) – support count (A, a1) - support 2 (A, a2) - support 6 (B, b1) - support 4 (B, b2) - support 4 (C, c1) - support 6 (C, c2) - support 2 (D, d1) - support 4 (D, d2) - support 4 B and D are flexible A and C are stable MinSup = 3

Action Set Discovery A B C D x1 a1 b1 c1 d1 x2 a2 x3 b2 d2 x4 x5 x6 x7 (category, item change) – support count (B, b1 → b2) - support 4 (B, b2 → b1) - support 4 (D, d1 → d2) - support 4 (D, d2 → d1) - support 4 The minimum of the two supports is kept and compared against MinSup. Support(b1 → b2) = min(support(b1), support(b2)) B and D are flexible A and C are stable MinSup = 3

Action Set Discovery A B C D x1 a1 b1 c1 d1 x2 a2 x3 b2 d2 x4 x5 x6 x7 (A, a2) ・ (B, b1) - support 3 (A, a2) ・ (B, b2) - support 3 (A, a2) ・ (C, c1) - support 4 (A, a2) ・ (D, d1) - support 2 (A, a2) ・ (D, d2) - support 4 (B, b1) ・ (C, c1) - support 3 (B, b1) ・ (D, d1) - support 3 (B, b1) ・ (D, d2) - support 1 (B, b2) ・ (C, c1) - support 2 (B, b2) ・ (D, d1) - support 1 (B, b2) ・ (D, d2) - support 3 (C, c1) ・ (D, d1) - support 3 (C, c1) ・ (D, d2) - support 2 B and D are flexible A and C are stable MinSup = 3

Action Set Discovery A B C D x1 a1 b1 c1 d1 x2 a2 x3 b2 d2 x4 x5 x6 x7 (A, a2) ・ (B, b1 → b2) - support 3 (A, a2) ・ (B, b2 → b1) - support 3 (A, a2) ・ (D, d2 → d1) - support 2 (A, a2) ・ (D, d1 → d2) - support 2 (B, b1 → b2)・(D, d1) - support 1 (B, b1 → b2)・(D, d2) - support 1 (B, b1 → b2)・(D, d2 → d1) - support 1 (B, b1 → b2)・ (D, d1 → d2) - support 3 (C, c2)・(D, d2 → d1) - support 1 (C, c2)・(D, d1 → d2) - support 1 And so on…. B and D are flexible A and C are stable MinSup = 3

Action Rule Generation Selection of rules generated from this dataset using MinSup and MinConf thresholds: (B, b1 → b2)・ (D, d1 → d2) - support 3 (B, b1 → b2) ⇒ (D, d1 → d2) (D, d1 → d2) ⇒ (B, b1 → b2) There has to be a transition on each side of the association rule.

Project Description

Project description Modify the ordered itemset tree algorithm: Discovering action sets Generating action rules Modify existing code or write your own. Needs to be “attribute aware.” Needs to keep track of flexible/stable Read papers on itemset tree and action association rules (provided)

Project description Need to map datasets to integer values without losing track of flexible and stable. Modifications to search: Additions to keep track of the flexible/stable attributes during search Ability to search for itemsets as well as transitions or both simultaneously Mark nodes and use subtrees to find transitions

Project description Modifications to rule generation: Produce rules with both stable and flexible attributes Has to have a transition on each side of the rule (a, a2) (d, d1 → d2) ⇒ (b, b1 → b2) (d, d1 → d2) ⇒ (a, a2) (b, b1 → b2) (b, b1 → b2)⇒(d, d1 → d2) All flexible attributes don’t have to have a transition

References M. Kubat, A. Hafez, V. V. Raghavan, J. Lekkala, and W. K. Chen, “Itemset trees for targeted association mining”, IEEE Trans. on Knowledge and Data Engineering, 2002 Z.W. Ras, A. Dardzinska, L.-S. Tsay, H. Wasyluk, “Association Action Rules”, IEEE/ICDM Workshop on Mining Complex Data (MCD 2008), Pisa, Italy, ICDM Workshops Proceedings, IEEE Computer Society, 2008, 283-290 S. Im, Z.W. Ras, "Action rule extraction from a decision table: ARED“, in "Foundations of Intelligent Systems", Proceedings of ISMIS'08, A. An et al. (Eds.), Toronto, Canada, LNAI, Vol. 4994, Springer, 2008, 160-168 Z.W. Ras, A. Dardzinska, "Action Rules Discovery without pre-existing classification rules“ in Proceedings of RSCTC 2008 Conference, in Akron, Ohio, LNAI 5306, Springer, 2008, 181-190 D. Difallah, R. Benton, T. Johnsten and V. Raghavan, "FAARM: Frequent Association Action Rules Mining Using FP-Tree", in Workshop on Domain Driven Data Mining, part of 11th IEEE International Conference on Data Mining Workshops, Vancouver, Canada, pp. 398-404, December 11, 2011.

Questions? Jian Chen, Ph.D. jchen@louisiana.edu Jennifer Lavergne, Ph.D. jjslavernge@louisiana.edu