Data Mining Quantitative Values

Presentation transcript:

Data Mining Quantitative Values. By Noah Clemons and Andrew Seidel.

Association Rule Mining: Data is in market basket format: each "basket" is a list of the items (integers) present. The mining tool returns rules based on those items. The rules are useful for discovering trends.
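To make the basket format concrete, here is a minimal sketch of how support and confidence for a single rule can be counted over integer baskets. The function names and the five toy baskets are invented for illustration; they are not the authors' tool or data.

```python
from typing import FrozenSet, List

Basket = FrozenSet[int]


def support_count(baskets: List[Basket], itemset: Basket) -> int:
    """Number of baskets that contain every item in `itemset`."""
    return sum(1 for basket in baskets if itemset <= basket)


def confidence(baskets: List[Basket], antecedent: Basket, consequent: Basket) -> float:
    """support(antecedent and consequent together) / support(antecedent)."""
    antecedent_count = support_count(baskets, antecedent)
    if antecedent_count == 0:
        return 0.0
    return support_count(baskets, antecedent | consequent) / antecedent_count


# Toy baskets (made-up values): each basket lists the integer items present.
baskets = [frozenset(b) for b in ([1, 2, 3], [1, 2], [2, 3], [1, 2, 3, 4], [1, 3])]

rule_support = support_count(baskets, frozenset({1, 2, 3}))
rule_confidence = confidence(baskets, frozenset({1, 2}), frozenset({3}))
print(rule_support, round(rule_confidence, 4))
```

A rule such as {1, 2} ==> {3} is kept only if both numbers clear the chosen support and confidence thresholds.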

Quantitative Data

Problem: The data is not in market basket format. How do we fit the data to the necessary format? Convert the data.

Conversion table
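One way a conversion table could work is sketched below: each quantitative column gets a list of value ranges (buckets), and every value is replaced by an item label such as AB_551_558 or 3B_2, the label style seen in the sample rules. How the authors chose their bucket boundaries is not stated; splitting the sorted distinct values into roughly equal chunks is an assumption, and the helper names and input numbers are hypothetical.

```python
from bisect import bisect_right
from typing import Dict, List, Tuple

Bucket = Tuple[int, int]  # inclusive (low, high) range


def build_buckets(values: List[int], n_buckets: int) -> List[Bucket]:
    """Split the sorted distinct values into at most n_buckets chunks (assumed strategy)."""
    distinct = sorted(set(values))
    n = min(n_buckets, len(distinct))
    buckets = []
    for i in range(n):
        low = distinct[i * len(distinct) // n]
        high = distinct[(i + 1) * len(distinct) // n - 1]
        buckets.append((low, high))
    return buckets


def item_label(column: str, value: int, buckets: List[Bucket]) -> str:
    """Map a value to the label of the bucket whose lower bound is closest below it."""
    lows = [low for low, _ in buckets]
    index = max(bisect_right(lows, value) - 1, 0)  # clamp values below the first bucket
    low, high = buckets[index]
    return f"{column}_{low}" if low == high else f"{column}_{low}_{high}"


def convert_record(record: Dict[str, int], table: Dict[str, List[Bucket]]) -> List[str]:
    """Turn one quantitative record into a market basket of item labels."""
    return [item_label(col, val, table[col]) for col, val in record.items()]


# Made-up column values, used only to build an example conversion table.
table = {
    "AB": build_buckets([450, 500, 551, 558, 600, 637], 3),
    "HR": build_buckets([2, 5, 5, 9, 12, 20, 37, 51], 4),
}
print(convert_record({"AB": 555, "HR": 40}, table))
```

The labels can then be mapped to integers if the mining tool expects integer items.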

Approaches To Conversion: Static Approach. Convert the data before using the association mining tool. Good when doing many runs on one dataset with one conversion table. Speed depends on the tools used to convert.

Approaches To Conversion: Dynamic Approach. Convert the data as it is used by the association mining tool. Can be much faster than the static approach. Good for changing datasets or conversion tables.
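The difference between the two approaches can be sketched as follows. The static path writes every converted basket to an intermediate file that the miner reads afterwards; the dynamic path hands converted baskets to the miner one at a time. The `to_items` helper is a hypothetical stand-in for a real conversion table (fixed-width buckets, chosen only for brevity), and the sketch makes no attempt to reproduce the timings reported in the slides.

```python
import io
from typing import Dict, Iterable, Iterator, List, TextIO

Record = Dict[str, int]


def to_items(record: Record, width: int = 10) -> List[str]:
    """Hypothetical conversion: fixed-width buckets labelled COLUMN_low_high."""
    items = []
    for column, value in record.items():
        low = (value // width) * width
        items.append(f"{column}_{low}_{low + width - 1}")
    return items


def convert_static(records: Iterable[Record], out: TextIO) -> None:
    """Static approach: convert everything up front and store the baskets."""
    for record in records:
        out.write(" ".join(to_items(record)) + "\n")


def read_baskets(src: TextIO) -> Iterator[List[str]]:
    """Read baskets back from the stored, already converted file."""
    for line in src:
        yield line.split()


def convert_dynamic(records: Iterable[Record]) -> Iterator[List[str]]:
    """Dynamic approach: convert each record only when the miner asks for it."""
    for record in records:
        yield to_items(record)


records = [{"AB": 555, "HR": 40}, {"AB": 612, "HR": 7}]  # made-up records

buffer = io.StringIO()  # stands in for the intermediate file
convert_static(records, buffer)
buffer.seek(0)
static_baskets = list(read_baskets(buffer))
dynamic_baskets = list(convert_dynamic(records))
print(static_baskets == dynamic_baskets)
```

Both paths yield the same baskets; only the point at which conversion happens differs, which is why the dynamic form suits conversion tables that change between runs.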

Static vs. Dynamic: Speed of 16 static runs: 769.05 seconds. Speed of 16 dynamic runs: 27.53 seconds. Static is 27.9 times slower.

Rules: Run with 20 buckets, 0.1% support, 80% confidence: 646 rules. Sample rules:
AB_551_558 RBI_116_147 ==> HR_37_51 (0.866667, 13)
BB_35_37 H_193_226 ==> AB_637_689 (0.846154, 11)
IBB_18_31 SO_136_180 R_112_137 ==> RBI_116_147 (0.833333, 5)
GIDP_5 AB_543_550 ==> 3B_2 (0.833333, 5)

Rules: Run with 80 buckets, 0.1% support, 80% confidence: 60 rules. Sample rules:
H_112_114 2B_22 ==> 3B_2 (0.833333, 5)
AB_465_469 SH_3 ==> 3B_4 (0.833333, 5)
SB_25 HBP_4 ==> GIDP_8 (0.833333, 5)
H_200_205 IBB_4 ==> CS_4 (1, 5)
BB_57 SB_1 ==> CS_1 (0.833333, 5)
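For post-processing output like the sample rules, a small parser can split each rule into its parts. The sketch below assumes the trailing pair is (confidence, number of supporting records); the slides do not label those two numbers, so the field names `confidence` and `count` are a guess, and `parse_rule` is a hypothetical helper rather than part of the authors' tool.

```python
import re
from typing import List, NamedTuple


class Rule(NamedTuple):
    antecedent: List[str]
    consequent: List[str]
    confidence: float  # assumed meaning of the first number
    count: int         # assumed meaning of the second number


# Matches: items ==> items (number, integer)
_RULE_RE = re.compile(
    r"^(?P<lhs>.+?)\s*==>\s*(?P<rhs>.+?)\s*\((?P<conf>[\d.]+),\s*(?P<count>\d+)\)\s*$"
)


def parse_rule(line: str) -> Rule:
    """Parse one rule of the form 'A B ==> C (0.83, 5)'."""
    match = _RULE_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not a rule line: {line!r}")
    return Rule(
        antecedent=match["lhs"].split(),
        consequent=match["rhs"].split(),
        confidence=float(match["conf"]),
        count=int(match["count"]),
    )


rule = parse_rule("AB_551_558 RBI_116_147 ==> HR_37_51 (0.866667, 13)")
print(rule.antecedent, rule.consequent, rule.confidence, rule.count)
```

Parsed rules can then be sorted or filtered, for example to drop rules whose antecedent and consequent come from closely related columns.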

Problems Encountered: It is hard to pick good values for support, confidence, and the conversion table. Many values are related (At Bats, Games, etc.), which leads to large rules.

Future Work: Use correlated mining to find items. Create a tool to find optimum values for support, confidence, and the conversion table.