DATA MINING Using Association Rules by Andrew Williamson.



What is Data Mining?
Also known as Knowledge Discovery in Databases (KDD).
“Data mining is the automated extraction of hidden predictive information from databases. Data mining software allows users to analyze large databases. This can enable you to solve business decision problems, for data mining as an extension of statistics; while it doesn't solve your problems, it is the technology that can find your problems. You can build a predictive model of an ideal customer from your own databases and, using this information, build your marketing strategy or even determine the creation of new products and services.” -

In short
Data mining can be defined as “the non-trivial extraction of implicit, previously unknown, and potentially useful information from data”.

An example of Association
“A simple example of data mining is its use in a retail sales department. If a store tracks the purchases of a customer and notices that a customer buys a lot of silk shirts, the data mining system will make a correlation between that customer and silk shirts…

Association example cont.
…The sales department will look at that information and may begin direct mail marketing of silk shirts to that customer, or it may alternatively attempt to get the customer to buy a wider range of products. In this case, the data mining system used by the retail store discovered new information about the customer that was previously unknown to the company.” - wikipedia.org

Association Rules
An association rule, X → Y, helps to identify shopping trends over a given period of time. Two measures are used to evaluate a rule:
1) Support
2) Confidence

Association Rules cont.
1) Support
- The ratio of transactions that contain both item X and item Y to all transactions: P(X ∪ Y), where X ∪ Y denotes a transaction containing every item of X and of Y.
- Measures the significance of the given rule.

Association Rules cont.
2) Confidence
- The ratio of transactions that contain both X and Y to the transactions that contain X: P(X ∪ Y) / P(X).
- Measures the strength of the correlation expressed by the given rule.
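
The two measures can be illustrated with a minimal Python sketch. The toy transactions and the function names `support` and `confidence` are assumptions made for illustration; they do not come from the presentation.

```python
# Hypothetical toy data: each transaction is the set of items in one basket.
TRANSACTIONS = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "butter"}),
    frozenset({"bread", "milk", "butter"}),
    frozenset({"milk"}),
]


def support(itemset, transactions):
    """Fraction of all transactions that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(x, y, transactions):
    """Among transactions containing X, the fraction that also contain Y."""
    x, y = frozenset(x), frozenset(y)
    support_x = support(x, transactions)
    if support_x == 0:
        return 0.0  # the rule never applies, so report zero confidence
    return support(x | y, transactions) / support_x


rule_support = support({"bread", "milk"}, TRANSACTIONS)
rule_confidence = confidence({"bread"}, {"milk"}, TRANSACTIONS)
```

For the rule bread → milk in this toy data, two of the four baskets contain both items and three contain bread, so the support is 2/4 and the confidence is 2/3.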

Association Rules cont.
An association rule X → Y whose support and confidence are both 50% or more is considered meaningful and is kept; otherwise the rule is discarded or ignored.
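
As a rough sketch of this filtering step, the snippet below applies the 50% thresholds to a few candidate rules. The rule names and their support and confidence values are invented for illustration, and `keep_rule` is a hypothetical helper.

```python
MIN_SUPPORT = 0.5      # threshold stated on the slide
MIN_CONFIDENCE = 0.5   # threshold stated on the slide


def keep_rule(support, confidence,
              min_support=MIN_SUPPORT, min_confidence=MIN_CONFIDENCE):
    """A rule is kept only if it meets both thresholds."""
    return support >= min_support and confidence >= min_confidence


# Hypothetical candidate rules: (X, Y) -> (support, confidence).
candidate_rules = {
    ("bread", "milk"): (0.50, 0.67),
    ("milk", "butter"): (0.25, 0.33),
    ("butter", "bread"): (0.50, 1.00),
}

kept = {rule for rule, (s, c) in candidate_rules.items() if keep_rule(s, c)}
```

Here milk → butter is discarded because it falls below both thresholds, while the other two rules are kept.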

2 Phases of Association Rules
When mining association rules, follow these 2 phases:
Phase 1
- Collect all itemsets with a high support ratio.
- These are known as the frequent itemsets.
Phase 2
- From the frequent itemsets, collect all rules with a high confidence ratio.
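
The two phases can be sketched in Python as follows. This is a brute-force illustration that enumerates every item combination, not an optimized algorithm such as Apriori; the toy transactions and the function names are assumptions.

```python
from itertools import combinations

# Hypothetical toy data: each transaction is the set of items in one basket.
TRANSACTIONS = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "butter"}),
    frozenset({"bread", "milk", "butter"}),
    frozenset({"milk"}),
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def frequent_itemsets(transactions, min_support):
    """Phase 1: map every itemset meeting `min_support` to its support."""
    items = sorted(set().union(*transactions))
    frequent = {}
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            itemset = frozenset(combo)
            s = support(itemset, transactions)
            if s >= min_support:
                frequent[itemset] = s
    return frequent


def confident_rules(frequent, min_confidence):
    """Phase 2: derive rules X -> Y from frequent itemsets, keep confident ones."""
    rules = []
    for itemset, s in frequent.items():
        for size in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), size):
                x = frozenset(lhs)
                # Every subset of a frequent itemset is itself frequent,
                # so its support is already stored in `frequent`.
                conf = s / frequent[x]
                if conf >= min_confidence:
                    rules.append((x, itemset - x, s, conf))
    return rules


frequent = frequent_itemsets(TRANSACTIONS, min_support=0.5)
rules = confident_rules(frequent, min_confidence=0.7)
```

With these toy baskets, five itemsets pass phase 1 at 50% support, and only butter → bread passes phase 2 at the stricter 70% confidence chosen for this example.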

Extended Association Rules
Standard association rules express a correlation between values of a single-dimensional schema. More meaningful associations may be discovered by incorporating multi-dimensional schemas.

Extended Association Rules cont.
A simple example:
- Sleeping bags → Tents: on its own, this seems obvious and not useful.
- Sleeping bags → Tents (region=north, season=summer): more meaningful, because it incorporates the region and season of the transactions.

Extended Association Rules cont.
More meaningful information may be obtained by augmenting the original association rule with a condition:
- X → Y (Z)
- Transactions that satisfy Z and contain X are more likely to contain Y as well.
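
One way to picture an extended rule X → Y (Z) is to compute the confidence only over transactions that satisfy Z. The sketch below does this for the sleeping bag example; the transaction records, the field names `items`, `region` and `season`, and the `confidence` helper are all hypothetical.

```python
# Hypothetical transactions carrying extra dimensions (region, season).
TRANSACTIONS = [
    {"items": {"sleeping bag", "tent"}, "region": "north", "season": "summer"},
    {"items": {"sleeping bag", "tent"}, "region": "north", "season": "summer"},
    {"items": {"sleeping bag"}, "region": "south", "season": "winter"},
    {"items": {"sleeping bag"}, "region": "south", "season": "summer"},
    {"items": {"tent"}, "region": "north", "season": "winter"},
]


def confidence(x, y, transactions, condition=lambda t: True):
    """Confidence of X -> Y among transactions that satisfy `condition` (Z)."""
    matching = [t for t in transactions if condition(t) and x <= t["items"]]
    if not matching:
        return 0.0  # no transaction satisfies Z and contains X
    return sum(1 for t in matching if y <= t["items"]) / len(matching)


def north_summer(t):
    """The condition Z: region=north and season=summer."""
    return t["region"] == "north" and t["season"] == "summer"


plain = confidence({"sleeping bag"}, {"tent"}, TRANSACTIONS)
extended = confidence({"sleeping bag"}, {"tent"}, TRANSACTIONS, north_summer)
```

In this invented data the plain rule holds for half of the sleeping bag purchases, while under the condition Z it holds for all of them, which is the kind of sharper association the extended rule is meant to surface.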
