Tutorial 3: Using XLMiner for Association Rule Mining

Slides:



Advertisements
Similar presentations
Identifying Interesting Association Rules with Genetic Algorithms
Advertisements

DATA MINING Association Rule Discovery. AR Definition aka Affinity Grouping Common example: Discovery of which items are frequently sold together at a.
Association Rule Mining. 2 The Task Two ways of defining the task General –Input: A collection of instances –Output: rules to predict the values of any.
MIS2502: Data Analytics Association Rule Mining. Uses What products are bought together? Amazon’s recommendation engine Telephone calling patterns Association.
Sampling Large Databases for Association Rules ( Toivenon’s Approach, 1996) Farzaneh Mirzazadeh Fall 2007.
NA-MIC National Alliance for Medical Image Computing [Tutorial Name] [List of authors] [Institution] [ of the first author]
Association Rules l Mining Association Rules between Sets of Items in Large Databases (R. Agrawal, T. Imielinski & A. Swami) l Fast Algorithms for.
Mining Association Rules. Association rules Association rules… –… can predict any attribute and combinations of attributes … are not intended to be used.
1 Mining Quantitative Association Rules in Large Relational Database Presented by Jin Jin April 1, 2004.
6/23/2015CSE591: Data Mining by H. Liu1 Association Rules Transactional data Algorithm Applications.
Statistical Analysis of Transaction Dataset Data Visualization Homework 2 Hongli Li.
Mining Negative Rules in Large Databases using GRD Dhananjay R Thiruvady Supervisor: Professor Geoffrey Webb.
Research Project Mining Negative Rules in Large Databases using GRD.
Copyright © 2013 Pearson Education, Inc. publishing as Prentice Hall12-1.
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
MIS 451 Building Business Intelligence Systems Association Rule Mining (1)
Chapter 13 – Association Rules
Business Analytics: Methods, Models, and Decisions, 1 st edition James R. Evans Copyright © 2013 Pearson Education, Inc. publishing as Prentice Hall12-1.
Data Mining By Fu-Chun (Tracy) Juang. What is Data Mining? ► The process of analyzing LARGE databases to find useful patterns. ► Attempts to discover.
ASSOCIATION RULE DISCOVERY (MARKET BASKET-ANALYSIS) MIS2502 Data Analytics Adapted from Tan, Steinbach, and Kumar (2004). Introduction to Data Mining.
Introduction of Data Mining and Association Rules cs157 Spring 2009 Instructor: Dr. Sin-Min Lee Student: Dongyi Jia.
Association Rule Mining Data Mining and Knowledge Discovery Prof. Carolina Ruiz and Weiyang Lin Department of Computer Science Worcester Polytechnic Institute.
1 FINDING FUZZY SETS FOR QUANTITATIVE ATTRIBUTES FOR MINING OF FUZZY ASSOCIATE RULES By H.N.A. Pham, T.W. Liao, and E. Triantaphyllou Department of Industrial.
Association Rule.. Association rule mining  It is an important data mining model studied extensively by the database and data mining community.  Assume.
Association rule mining Goal: Find all rules that satisfy the user-specified minimum support (minsup) and minimum confidence (minconf). Assume all data.
Evaluation of Recommender Algorithms for an Internet Information Broker based on Simple Association Rules and on the Repeat-Buying Theory WEBKDD 2002 Edmonton,
1 What is Association Analysis: l Association analysis uses a set of transactions to discover rules that indicate the likely occurrence of an item based.
ASSOCIATION RULES (MARKET BASKET-ANALYSIS) MIS2502 Data Analytics Adapted from Tan, Steinbach, and Kumar (2004). Introduction to Data Mining.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Design of an Expert System for Enhancing.
Associations and Frequent Item Analysis. 2 Outline  Transactions  Frequent itemsets  Subset Property  Association rules  Applications.
C OMPARING A SSOCIATION R ULES AND D ECISION T REES FOR D ISEASE P REDICTION Carlos Ordonez.
Charles Tappert Seidenberg School of CSIS, Pace University
DATA MINING Using Association Rules by Andrew Williamson.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Comparing Association Rules and Decision Trees for Disease.
Data Analytics CMIS Short Course part II Day 1 Part 1: Clustering Sam Buttrey December 2015.
Chap 6: Association Rules. Rule Rules!  Motivation ~ recent progress in data mining + warehousing have made it possible to collect HUGE amount of data.
Data Mining Association Rules Mining Frequent Itemset Mining Support and Confidence Apriori Approach.
MIS2502: Data Analytics Association Rule Mining David Schuff
Searching for Pattern Rules Guichong Li and Howard J. Hamilton Int'l Conf on Data Mining (ICDM),2006 IEEE Advisor : Jia-Ling Koh Speaker : Tsui-Feng Yen.
Chapter 14 – Association Rules and Collaborative Filtering © Galit Shmueli and Peter Bruce 2016 Data Mining for Business Analytics (3rd ed.) Shmueli, Bruce.
Chapter 13 – Association Rules DM for Business Intelligence.
MIS2502: Data Analytics Association Rule Mining Jeremy Shafer
P Left half of rt half ? false  Left half pure1? false  Whole is pure1? false  0 5. Rt half of right half? true  1.
Chapter 10 Introduction to Data Mining
Data Mining – Association Rules
Mining Dependent Patterns
Data Science Algorithms: The Basic Methods
COMP 1942 PCA TA: Harry Chan COMP1942.
Clustering 1 (Introduction and kmean)
Classification 3 (Nearest Neighbor Classifier)
Association Rule Mining (Data Mining Tool)
Chapter 6 Tutorial.
Eco 6380 Predictive Analytics For Economists Spring 2016
Market Basket Many-to-many relationship between different objects
Big Data Analytics: HW#2
Association Rule Mining
DIRECT HASHING AND PRUNING (DHP) ALGORITHM
A Parameterised Algorithm for Mining Association Rules
Transactional data Algorithm Applications
MIS Professor Sandvig MIS 324 Professor Sandvig
Frequent patterns and Association Rules
MIS2502: Data Analytics Association Rule Mining
MIS2502: Data Analytics Association Rule Mining
Association Rules :A book store case
MIS2502: Data Analytics Association Rule Learning
Chapter 14 – Association Rules
Statistics Review (It’s not so scary).
Summary of the Rules A>B -A B -(A>B) A -B --A A AvB A B
Data Mining CSCI 307, Spring 2019 Lecture 18
Presentation transcript:

Tutorial 3: Using XLMiner for Association Rule Mining COMP 1942 Tutorial 3: Using XLMiner for Association Rule Mining TA: Harry Chan Email: khchanak@cse.ust.hk COMP1942

Outline Review Data Source Mine Association Rule with XLMiner Binary matrix Item list COMP1942

Review Transaction Itemset Support (of an itemset {a, b, c}) A set of items, e.g. a, b, c, d Itemset A set of items, e.g. {a, b}, {a, b, c} Support (of an itemset {a, b, c}) Number of transactions that contain the itemset {a, b, c} COMP1942

Review (cont.) Association rule Confidence Lift ratio Antecedent -> Consequent, e.g., {a, b} -> c Confidence support (of the {antecedent, consequent}) / support of {antecedent} E.g., support (of {a, b, c}) / support (of {a, b}) Lift ratio Confidence / Expected confidence COMP1942

Summary (Lift=Conf./Expected Conf.) Support Confidence Lift ratio Confidence, # of rules  Support , # of rules  # of rules COMP1942

Outline Review Data Source Mine Association Rule with XLMiner Binary matrix Item list COMP1942

Data source Dataset is a set of transactions Two formats A transaction is a set of items Two formats Binary matrix Item list COMP1942

Data source formats Binary matrix Item list Transaction: {A, D} COMP1942

Outline Review Data Source Mine Association Rule with XLMiner Binary matrix Item list COMP1942

Mine Association Rule in XLMiner Two ways to access association rule “Add-ins” Tag  XLMiner  Affinity  Association Rules “XLMiner Platform” Tag  Associate  Association Rules COMP1942

Steps Step 1: Specify the data range. Step 2: Specify the data format. Step 3: Specify the parameters. Step 4: Analyze the mining results. COMP1942

Binary matrix Example Data source: rule.xls. Data range: $B$1:$F$6. Data format: Binary matrix. Parameters: Minimum Support = 3 Minimum Confidence = 50% COMP1942

Binary matrix Example: Steps 1-3 Data range Parameters Data format COMP1942

Binary matrix Example: Step 4 Rule 1:D  A COMP1942

Item list Example Data source: Shopping-Items.xls. Data range: $A$3:$G$1003. Data format: Item list. Parameters: Minimum Support = 200 Minimum Confidence = 80% COMP1942

Item list Example: Steps 1-3 Data range Parameters Data format COMP1942

Item list Example: Step 4 Rule 1: { heineken, soda } cracker COMP1942

Exercise Data source: Shopping-Items.xls. Data range: $A$3:$G$1003. Data format: Item list. Parameters: Minimum Support = 150 Minimum Confidence = 90% COMP1942