Mining Negative Rules in Large Databases using GRD Dhananjay R Thiruvady Supervisor: Professor Geoffrey Webb.

Slides:



Advertisements
Similar presentations
Data Mining Techniques Association Rule
Advertisements

DATA MINING Association Rule Discovery. AR Definition aka Affinity Grouping Common example: Discovery of which items are frequently sold together at a.
Data Mining (Apriori Algorithm)DCS 802, Spring DCS 802 Data Mining Apriori Algorithm Spring of 2002 Prof. Sung-Hyuk Cha School of Computer Science.
Mining Frequent Patterns II: Mining Sequential & Navigational Patterns Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Query Optimization of Frequent Itemset Mining on Multiple Databases Mining on Multiple Databases David Fuhry Department of Computer Science Kent State.
Frequent Closed Pattern Search By Row and Feature Enumeration
Association Rules Spring Data Mining: What is it?  Two definitions:  The first one, classic and well-known, says that data mining is the nontrivial.
10 -1 Lecture 10 Association Rules Mining Topics –Basics –Mining Frequent Patterns –Mining Frequent Sequential Patterns –Applications.
Zeev Dvir – GenMax From: “ Efficiently Mining Frequent Itemsets ” By : Karam Gouda & Mohammed J. Zaki.
Nadia Andreani Dwiyono DESIGN AND MAKE OF DATA MINING MARKET BASKET ANALYSIS APLICATION AT DE JOGLO RESTAURANT.
Association rules The goal of mining association rules is to generate all possible rules that exceed some minimum user-specified support and confidence.
Sampling Large Databases for Association Rules ( Toivenon’s Approach, 1996) Farzaneh Mirzazadeh Fall 2007.
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Rules l Mining Association Rules between Sets of Items in Large Databases (R. Agrawal, T. Imielinski & A. Swami) l Fast Algorithms for.
Rakesh Agrawal Ramakrishnan Srikant
Association Analysis. Association Rule Mining: Definition Given a set of records each of which contain some number of items from a given collection; –Produce.
Data Mining Techniques So Far: Cluster analysis K-means Classification Decision Trees J48 (C4.5) Rule-based classification JRIP (RIPPER) Logistic Regression.
Data Mining Association Analysis: Basic Concepts and Algorithms Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction.
Mining Association Rules. Association rules Association rules… –… can predict any attribute and combinations of attributes … are not intended to be used.
A Robust System Architecture For Mining Semi-structured Data By Aby M Mathew CSE
Generating Non-Redundant Association Rules Mohammed J. Zaki.
Association Rules Presented by: Anilkumar Panicker Presented by: Anilkumar Panicker.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Association Rule Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Sai Moturu. Introduction Current approaches to microarray data analysis –Analysis of experimental data followed by a posterior process where biological.
© Vipin Kumar CSci 8980 Fall CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance Computing Research Center Department of Computer.
Research Project Mining Negative Rules in Large Databases using GRD.
Fast Vertical Mining Using Diffsets Mohammed J. Zaki Karam Gouda
1 Fast Algorithms for Mining Association Rules Rakesh Agrawal Ramakrishnan Srikant Slides from Ofer Pasternak.
© Vipin Kumar CSci 8980 Fall CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance Computing Research Center Department of Computer.
MIS 451 Building Business Intelligence Systems Association Rule Mining (1)
Association Rules. 2 Customer buying habits by finding associations and correlations between the different items that customers place in their “shopping.
1 Apriori Algorithm Review for Finals. SE 157B, Spring Semester 2007 Professor Lee By Gaurang Negandhi.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining By Tan, Steinbach, Kumar Lecture.
Modul 7: Association Analysis. 2 Association Rule Mining  Given a set of transactions, find rules that will predict the occurrence of an item based on.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Association Rules. CS583, Bing Liu, UIC 2 Association rule mining Proposed by Agrawal et al in Initially used for Market Basket Analysis to find.
Efficient Data Mining for Calling Path Patterns in GSM Networks Information Systems, accepted 5 December 2002 SPEAKER: YAO-TE WANG ( 王耀德 )
ASSOCIATION RULE DISCOVERY (MARKET BASKET-ANALYSIS) MIS2502 Data Analytics Adapted from Tan, Steinbach, and Kumar (2004). Introduction to Data Mining.
Introduction of Data Mining and Association Rules cs157 Spring 2009 Instructor: Dr. Sin-Min Lee Student: Dongyi Jia.
Fast Algorithms for Mining Association Rules Rakesh Agrawal and Ramakrishnan Srikant VLDB '94 presented by kurt partridge cse 590db oct 4, 1999.
Association Rule Mining Data Mining and Knowledge Discovery Prof. Carolina Ruiz and Weiyang Lin Department of Computer Science Worcester Polytechnic Institute.
Association Rule Mining
CURE Clustering Using Representatives Handles outliers well. Hierarchical, partition First a constant number of points c, are chosen from each cluster.
 Frequent Word Combinations Mining and Indexing on HBase Hemanth Gokavarapu Santhosh Kumar Saminathan.
Elsayed Hemayed Data Mining Course
Overview Definition of Apriori Algorithm
Chapter 8 Association Rules. Data Warehouse and Data Mining Chapter 10 2 Content Association rule mining Mining single-dimensional Boolean association.
Chap 6: Association Rules. Rule Rules!  Motivation ~ recent progress in data mining + warehousing have made it possible to collect HUGE amount of data.
Data Mining Association Rules Mining Frequent Itemset Mining Support and Confidence Apriori Approach.
COMP53311 Association Rule Mining Prepared by Raymond Wong Presented by Raymond Wong
1 Data Mining Lecture 6: Association Analysis. 2 Association Rule Mining l Given a set of transactions, find rules that will predict the occurrence of.
CS548 Spring 2016 Association Rules Showcase by Shijie Jiang, Yuting Liang and Zheng Nie Showcasing work by C.J. Carmona, S. Ramírez-Gallego, F. Torres,
MIS2502: Data Analytics Association Rule Mining David Schuff
DATA MINING: ASSOCIATION ANALYSIS (2) Instructor: Dr. Chun Yu School of Statistics Jiangxi University of Finance and Economics Fall 2015.
P Left half of rt half ? false  Left half pure1? false  Whole is pure1? false  0 5. Rt half of right half? true  1.
Mining Association Rules in Large Database This work is created by Dr. Anamika Bhargava, Ms. Pooja Kaul, Ms. Priti Bali and Ms. Rajnipriya Dhawan and licensed.
Data Mining – Association Rules
Data Mining Association Analysis: Basic Concepts and Algorithms
Tutorial 3: Using XLMiner for Association Rule Mining
CARPENTER Find Closed Patterns in Long Biological Datasets
Market Basket Analysis and Association Rules
Market Basket Many-to-many relationship between different objects
Data Mining Association Analysis: Basic Concepts and Algorithms
DIRECT HASHING AND PRUNING (DHP) ALGORITHM
Data Mining Association Rules Assoc.Prof.Songül Varlı Albayrak
Association Analysis: Basic Concepts and Algorithms
Closed Itemset Mining CSCI-7173: Computational Complexity & Algorithms, Final Project - Spring 16 Supervised By Dr. Tom Altman Presented By Shahab Helmi.
MIS2502: Data Analytics Association Rule Learning
Association Analysis: Basic Concepts
Presentation transcript:

Mining Negative Rules in Large Databases using GRD Dhananjay R Thiruvady Supervisor: Professor Geoffrey Webb

Overview Aims Association Rule Discovery Generalized Rule Discovery Tidsets and Diffsets Conclusion

Aims 1. To mine negative rules in a large database using GRD 2. To assess whether the negative rules are of potential interest to a user

Association Rule Discovery Rule: A => B (e.g. tea => coffee) A is the antecedent B is the consequent Aim: Searches database to find strong associations between itemsets Itemsets are subsets of the dataset e.g. tea in a supermarket

Association Rule Discovery (Contd.) Support of Tea => Coffee: Transactions with Tea or Coffee / |Data space| Confidence of Tea => Coffee : Transactions with Tea or Coffee /Transactions with Tea

Association Rule Discovery (Contd.) Generates rules based on minimum support (frequent itemsets) Further constraints can be applied, e.g. confidence (interest)

Generalized Rule Discovery An alternative Association Rule Discovery Uses The OPUS Algorithm for an unordered Search [Webb, 95] Generates large number of rules based on user specified constraints. Constraints include minimum support, confidence, etc.

Tidsets and Diffsets [Zaki, Gouda, 01] Every itemset is stored with it’s corresponding transaction set (Tidsets) Using Vertical Mining has proved to be more efficient than Horizontal Mining. TeaCoffeeMilk

Tidsets and Diffsets (contd.) Diffsets are a set of transactions that the itemset does not appear in. Diffsets are Tidsets for a negative association of an itemset. TeaCoffeeMilkDiffset (Tea) Diffset (Coffee) Diffset (Milk)

Tidsets and Diffsets (contd.) GRD calculates Tidsets for an Itemset Therefore Diffsets for an Itemset can be computed with very little extra cost. ABC~A~B~C

Conclusion Find negative correlations between Itemsets in a database. Rule: tea => ~coffee, ~tea => coffee, ~tea => ~ coffee This will be achieved by extending the GRD technique.

Conclusion (Contd.) Using diffsets: tidsets A = diffset ~A Negative associations can be calculated with very little additional computational overheads Assess whether the results of negative correlations are potentially interesting or not

Any questions?