Eco 6380 Predictive Analytics For Economists Spring 2016

Slides:



Advertisements
Similar presentations
Eco 5385 Predictive Analytics For Economists Spring 2014 Professor Tom Fomby Director, Richard B. Johnson Center for Economic Studies Department of Economics.
Advertisements

Mining Association Rules from Microarray Gene Expression Data.
DATA MINING Association Rule Discovery. AR Definition aka Affinity Grouping Common example: Discovery of which items are frequently sold together at a.
IT 433 Data Warehousing and Data Mining Association Rules Assist.Prof.Songül Albayrak Yıldız Technical University Computer Engineering Department
10 -1 Lecture 10 Association Rules Mining Topics –Basics –Mining Frequent Patterns –Mining Frequent Sequential Patterns –Applications.
MIS2502: Data Analytics Association Rule Mining. Uses What products are bought together? Amazon’s recommendation engine Telephone calling patterns Association.
Chapter 16 Parallel Data Mining 16.1From DB to DW to DM 16.2Data Mining: A Brief Overview 16.3Parallel Association Rules 16.4Parallel Sequential Patterns.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Association Rule Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Association Rules Hawaii International Conference on System Sciences (HICSS-40) January 2007 David L. Olson Yanhong Li.
Association Rules Olson Yanhong Li. Fuzzy Association Rules Association rules mining provides information to assess significant correlations in large.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Chapter 5 Data mining : A Closer Look.
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
MIS 451 Building Business Intelligence Systems Association Rule Mining (1)
Chapter 13 – Association Rules
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Basic Data Mining Techniques
1 Apriori Algorithm Review for Finals. SE 157B, Spring Semester 2007 Professor Lee By Gaurang Negandhi.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining By Tan, Steinbach, Kumar Lecture.
Modul 7: Association Analysis. 2 Association Rule Mining  Given a set of transactions, find rules that will predict the occurrence of an item based on.
ASSOCIATION RULE DISCOVERY (MARKET BASKET-ANALYSIS) MIS2502 Data Analytics Adapted from Tan, Steinbach, and Kumar (2004). Introduction to Data Mining.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Introduction of Data Mining and Association Rules cs157 Spring 2009 Instructor: Dr. Sin-Min Lee Student: Dongyi Jia.
Evaluation of Recommender Algorithms for an Internet Information Broker based on Simple Association Rules and on the Repeat-Buying Theory WEBKDD 2002 Edmonton,
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Charles Tappert Seidenberg School of CSIS, Pace University
Association Rules presented by Zbigniew W. Ras *,#) *) University of North Carolina – Charlotte #) ICS, Polish Academy of Sciences.
DATA MINING Using Association Rules by Andrew Williamson.
Eco 6380 Predictive Analytics For Economists Spring 2016 Professor Tom Fomby Department of Economics SMU.
Data Analytics CMIS Short Course part II Day 1 Part 1: Clustering Sam Buttrey December 2015.
Eco 6380 Predictive Analytics For Economists Spring 2016 Professor Tom Fomby Department of Economics SMU.
Eco 6380 Predictive Analytics For Economists Spring 2016 Professor Tom Fomby Department of Economics SMU.
Chapter 3 Data Mining: Classification & Association Chapter 4 in the text box Section: 4.3 (4.3.1),
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Eco 6380 Predictive Analytics For Economists Spring 2016 Professor Tom Fomby Department of Economics SMU.
MIS2502: Data Analytics Association Rule Mining David Schuff
Chapter 14 – Association Rules and Collaborative Filtering © Galit Shmueli and Peter Bruce 2016 Data Mining for Business Analytics (3rd ed.) Shmueli, Bruce.
Chapter 13 – Association Rules DM for Business Intelligence.
MIS2502: Data Analytics Association Rule Mining Jeremy Shafer
Profiling: What is it? Notes and reflections on profiling and how it could be used in process mining.
Supplemental Chapter: Business Intelligence Information Systems Development.
What are digital skills?
Entity- Relationship (ER) Model
7.2 Batch Processing All data are collected into groups and sorted into sequence before being processed together. Computer Studies Today Chapter.
A Research Oriented Study Report By :- Akash Saxena
Eco 6380 Predictive Analytics For Economists Spring 2016
Knowledge discovery & data mining Association rules and market basket analysis--introduction UCLA CS240A Course Notes*
Tutorial 3: Using XLMiner for Association Rule Mining
Association Rule Mining (Data Mining Tool)
Data Structures Recursion CIS265/506: Chapter 06 - Recursion.
William Norris Professor and Head, Department of Computer Science
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Rule Mining
Data Mining Association Analysis: Basic Concepts and Algorithms
An Excel-based Data Mining Tool
Eco 6380 Predictive Analytics For Economists Spring 2014
Data Mining Association Rules Assoc.Prof.Songül Varlı Albayrak
Data Mining Association Analysis: Basic Concepts and Algorithms
Eco 6380 Predictive Analytics For Economists Spring 2016
Eco 6380 Predictive Analytics For Economists Spring 2016
MIS2502: Data Analytics Association Rule Mining
MIS2502: Data Analytics Association Rule Mining
Data Science in Industry
Association Rules :A book store case
MIS2502: Data Analytics Association Rule Learning
Chapter 14 – Association Rules
Association Analysis: Basic Concepts
Charles Tappert Seidenberg School of CSIS, Pace University
Presentation transcript:

Eco 6380 Predictive Analytics For Economists Spring 2016 Professor Tom Fomby Department of Economics SMU

Presentation 16 Association Rules (aka Affinity Analysis) Chapter 13 in SPB

First go read the pdf file “Association Rules_v3 First go read the pdf file “Association Rules_v3.pdf” on the Class Website for a detailed discussion of Association Rules and an illustrative example of the method. In this PowerPoint we only provide a set of short notes on the basic concepts of Association Rule Analysis.

Some Basics Primarily viewed as an Unsupervised Learning Method: Neither prediction or classification. A Data Discovery Tool. However, in the case of sequential time use of the method, it can be viewed as a predictive tool and thus be considered a supervised learning method. Applications: Amazon.com sidebars of additional books to consider. Grocery Store shelving. Sequential Association Rules are useful for analyzing what is a likely outcome given a time sequence of previous outcomes. For example, one might use it to look for precursors of an outbreak of an epidemic. Pr 𝐶 𝐴 C = consequent items, A = antecedent items Transaction File: Binary Format versus Item List Format Confidence of a rule = 𝑠𝑢𝑝𝑝𝑜𝑟𝑡(𝐴∩𝐶) 𝑠𝑢𝑝𝑝𝑜𝑟𝑡 𝑜𝑓 𝐴 = Pr 𝐶 𝐴 = Pr(𝐶∩𝐴) Pr(𝐴) Lift = Pr C A 𝑃𝑟(𝐶) : 1 ≤𝐿𝑖𝑓𝑡< ∞ ; Lift = 1 when C and A are independent 𝑃𝑟 𝐶 𝐴 = absolute measure ; Lift = relative measure

The Apriori Algorithm (Agrawal and Srikant (1994)) In the first phase build up the item sets that meet the minimum support requirement. First, collect together all of the one-item sets that have minimum support. Then form two-item sets built from the previous minimum support one-item sets and then retain the two-item sets that have minimum support. Then form three-item sets built from the retained one-item and two-item sets and then choose those three-item sets that have minimum support. One continues this process building up larger and larger minimum support item sets until at say the K-item set level none of the K-items sets meet the minimum support requirement. Then the first phase is complete. Then in the second phase one examines all of the retained one, two, … (K-1) item sets for possible Association Rules that have minimum confidence. In the making of the Association Rules it must be remember that the consequent (C) and antecedent(A) item sets must be disjoint. Two tuning parameters: a) minimum support (t) of k-item sets b) minimum confidence of Association Rule The choice of “top” Rules (Associations) is sensitive to the choices of the above tuning parameters. The larger (smaller) the choices of the tuning parameters, the more simple (complex) the resulting Rules. You should examine the robustness of the “top” Rules over various choices of tuning parameter values. Rules need to make sense given our domain-specific knowledge, otherwise you shouldn’t use them.

Classroom Exercise: Exercise 12