A Comparison of Capabilities of Data Mining Tools

Slides:



Advertisements
Similar presentations
Data Mining: Potentials and Challenges Rakesh Agrawal & Jeff Ullman.
Advertisements

Data Mining: What? WHY? HOW?
Association Rules Spring Data Mining: What is it?  Two definitions:  The first one, classic and well-known, says that data mining is the nontrivial.
Nadia Andreani Dwiyono DESIGN AND MAKE OF DATA MINING MARKET BASKET ANALYSIS APLICATION AT DE JOGLO RESTAURANT.
1. Abstract 2 Introduction Related Work Conclusion References.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Chapter Extension 14 Database Marketing © 2008 Pearson Prentice Hall, Experiencing MIS, David Kroenke.
An overview of The IBM Intelligent Miner for Data By: Neeraja Rudrabhatla 11/04/1999.
Basic Data Mining Techniques Chapter Decision Trees.
Basic Data Mining Techniques
Chapter Extension 12 Database Marketing.
IBM SPSS Modeler 14.2 Data Mining Concepts Introduction to Undirected Data Mining: Association Analysis Prepared by David Douglas, University of ArkansasHosted.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Microsoft Enterprise Consortium Data Mining Concepts Introduction to Directed Data Mining: Decision Trees Prepared by David Douglas, University of ArkansasHosted.
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
Beyond Opportunity; Enterprise Miner Ronalda Koster, Data Analyst.
Introduction to Directed Data Mining: Decision Trees
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
Microsoft Enterprise Consortium Data Mining Concepts Introduction: The essential background Prepared by David Douglas, University of ArkansasHosted by.
Information Retrieval from Data Bases for Decisions Dr. Gábor SZŰCS, Ph.D. Assistant professor BUTE, Department Information and Knowledge Management.
Data Mining Using IBM Intelligent Miner Presented by: Qiyan (Jennifer ) Huang.
Chapter 5: Data Mining for Business Intelligence
Data Mining Techniques
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
DATA MINING Team #1 Kristen Durst Mark Gillespie Banan Mandura University of DaytonMBA APR 09.
Data Mining Chun-Hung Chou
Introduction: The essential background
Chapter 13 Genetic Algorithms. 2 Data Mining Techniques So Far… Chapter 5 – Statistics Chapter 6 – Decision Trees Chapter 7 – Neural Networks Chapter.
1 Research Groups : KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems SCI 2 SMetrology and Models Intelligent.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Copyright © 2010, SAS Institute Inc. All rights reserved. Applied Analytics Using SAS ® Enterprise Miner™
Final Exam Review. The following is a list of items that you should review in preparation for the exam. Note that not every item in the following slides.
Chapter 11 Business Intelligence Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall 11-1.
M. Sulaiman Khan Dept. of Computer Science University of Liverpool 2009 COMP527: Data Mining Association Rule Mining March 5, 2009.
Guest Lecture Introduction to Data Mining Dr. Bhavani Thuraisingham September 17, 2010.
1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Multi-Relational Data Mining: An Introduction Joe Paulowskey.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
Copyright © 2010 SAS Institute Inc. All rights reserved. Decision Trees Using SAS Sylvain Tremblay SAS Canada – Education SAS Halifax Regional User Group.
Association Rule Mining
An Evaluation of Commercial Data Mining Proposed and Presented by Emily Davis Supervisor: John Ebden.
1 Chapter 8: Introduction to Pattern Discovery 8.1 Introduction 8.2 Cluster Analysis 8.3 Market Basket Analysis (Self-Study)
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
MIS2502: Data Analytics Advanced Analytics - Introduction.
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
Ramakrishnan and Gehrke. Database Management Systems, 3 rd Edition. Definition Data mining is the exploration and analysis of large quantities of data.
Academic Year 2014 Spring Academic Year 2014 Spring.
DECISION TREES Asher Moody, CS 157B. Overview  Definition  Motivation  Algorithms  ID3  Example  Entropy  Information Gain  Applications  Conclusion.
Data Mining: Confluence of Multiple Disciplines Data Mining Database Systems Statistics Other Disciplines Algorithm Machine Learning Visualization.
Supplemental Chapter: Business Intelligence Information Systems Development.
Visualization in Process Mining
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
A Research Oriented Study Report By :- Akash Saxena
Data and Applications Security Introduction to Data Mining
Association Rule Mining (Data Mining Tool)
I. Association Market Basket Analysis.
©Jiawei Han and Micheline Kamber
Introduction Data Mining for Business Analytics.
MIS5101: Data Analytics Advanced Analytics - Introduction
Analysis of Customer Behavior and Service Modeling
Supporting End-User Access
(Understanding customer behavior Using data mining skills)
I. Association Market Basket Analysis.
Comparisons of Clustering Detection and Neural Network in E-Miner, Clementine and I-Miner Jong-Hee Lee and Yong-Seok Choi.
Association Rues Analysis .Event A -> Event ?
Presentation transcript:

A Comparison of Capabilities of Data Mining Tools Jong-Hee Lee, Yong-Seok Choi Dept. of Statistics Pusan National University

1.Introduction The exploration and analysis of large quantities of data in order to discover meaningful patterns and rules (Berry and Linoff, 1997, p. 5) Data Mining(DM)

Data Mining Tools for DM There are many commercial DM tools (http://kdnuggets.com). DM tools vendors increase. DM tools are updated rapidly. Some comparisons : Abbott et al.(1998) , Elder and Abbott(1998): out of date ! Some companies : not objective ! Rapidly-updated tools are needed objective comparisons..

Tools Selected for our comparison Product Company Version Tested Clementine SPSS 5.2 Enterprise Miner SAS Institute 3.0 Intelligent Miner IBM 6.1

Comparisons of DM tools 2. Constitution of Main Windows and Nodes 3. Techniques 3.1 Market Basket Analysis 3.2 Decision Tree

2. Constitution of Main Windows and Nodes Comparative Criteria Visualization using Streams Main Windows Nodes In order of using According to purpose of modeling

Visualization using Streams Main Window of Clementine In order of using Visualization using Streams

Visualization using Streams Main Window of Enterprise Miner In order of using Visualization using Streams

Main Window of Intelligent Miner According to purpose of modeling In order of using

Conclusion of comparison for Constitution of Main Windows and Nodes (excellent) Clementine (poor) Intelligent Miner Main Windows (excellent) Intelligent Miner Nodes

3. Techniques 3.1 Market Basket Analysis 3.2 Decision Tree For the other techniques , see J.H.Lee(2000).

3.1 Market Basket Analysis (MBA) gives insight into the merchandise by telling us which products tend to be purchased together (Berry and Linoff, 1997, p.124)

Comparative Criteria for MBA Market Basket Analysis Algorithm option Result

Algorithm of MBA Mining Tool Clementine Algorithm Intelligent Miner Enterprise ○ Association Rule × Sequential Pattern ○  

Market Basket Analysis Mining Tool Clementine Option Intelligent Miner Enterprise × Minimum of Confidence ○ Minimum of Support ○ Minimum of Coverage × ○ Maximum of Items × Item Constraints ○

Market Basket Analysis Mining Tool Clementine Result Intelligent Miner Enterprise ○ Confidence × Support Lift ○ Coverage × × Textual Display ○ ○ Visualization ×

Data Format in MBA ID 132 A 132 428 ID A B C D ... 132 B Y Y 428 C Y Y < Horizontal Format > 132 A 132 428 ID A B C D ... 132 B Y Y 428 C Y Y Y 428 < Vertical Format > D 428 B Upper tables cited at the xore web site(http://www. Exclusiveore.com/index.html)

Conclusion of comparison for MBA (poor) Clementine Algorithm (excellent) Intelligent Miner (poor) Clementine Option (excellent) Intelligent Miner (poor) Clementine Result

3.2 Decision Tree Decision Tree is for classification and prediction

Comparative Criteria for Decision Tree Algorithm option Result

Decision Tree Mining Tool Clementine Algorithm Intelligent Miner Enterprise × CHAID ○ CART C4.5 or C5.0 ○ ID3 × × SPRINT ○

Misclassification Costs Decision Tree Mining Tool Clementine Option Intelligent Miner Enterprise × Misclassification Costs ○ Priors ○ Pruning Severity ○ Stopping Rule ○ Missing Value

Decision Tree Mining Tool Clementine Result Intelligent Miner Enterprise × Tree View ○ × Confusion Matrix ○

Conclusion of comparison for Decision Tree (excellent) Enterprise Miner Algorithm (excellent) Enterprise Miner (poor) Clementine Option (poor) Clementine Result

Constitution of Main Windows 4.Concluding Remarks Constitution of Main Windows (excellent) Clementine (poor) Intelligent Miner Constitution of Nodes (excellent) Intelligent Miner

Market Basket Analysis (excellent) Intelligent Miner (poor) Clementine Decision Tree (excellent) Enterprise Miner (poor) Clementine

Enterprise and University A Comparison of Capabilities of DM tools in this study potential Purchasers: Enterprise and University DM companies