An overview of The IBM Intelligent Miner for Data By: Neeraja Rudrabhatla 11/04/1999.

Slides:



Advertisements
Similar presentations
Decision Trees Decision tree representation ID3 learning algorithm
Advertisements

Decision Tree Approach in Data Mining
Decision Tree Algorithm (C4.5)
Classification Techniques: Decision Tree Learning
Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 3 of Data Mining by I. H. Witten, E. Frank and M. A. Hall.
Data Mining: A Closer Look Chapter Data Mining Strategies.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Week 9 Data Mining System (Knowledge Data Discovery)
Induction of Decision Trees
Data Mining.
Basic concepts of Data Mining, Clustering and Genetic Algorithms Tsai-Yang Jea Department of Computer Science and Engineering SUNY at Buffalo.
Data Mining – Intro.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Machine Learning Lecture 10 Decision Trees G53MLE Machine Learning Dr Guoping Qiu1.
Data Mining: A Closer Look
Data Mining: A Closer Look Chapter Data Mining Strategies 2.
Data Mining CS 157B Section 2 Keng Teng Lao. Overview Definition of Data Mining Application of Data Mining.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Dr. Awad Khalil Computer Science Department AUC
Data Mining Techniques
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
Data Mining Chun-Hung Chou
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
Data Mining and Application Part 1: Data Mining Fundamentals Part 2: Tools for Knowledge Discovery Part 3: Advanced Data Mining Techniques Part 4: Intelligent.
Overview of Data Mining Methods Data mining techniques What techniques do, examples, advantages & disadvantages.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Lecture 7. Outline 1. Overview of Classification and Decision Tree 2. Algorithm to build Decision Tree 3. Formula to measure information 4. Weka, data.
1 Knowledge Discovery Transparencies prepared by Ho Tu Bao [JAIST] ITCS 6162.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
1 1 Slide Using Weka. 2 2 Slide Data Mining Using Weka n What’s Data Mining? We are overwhelmed with data We are overwhelmed with data Data mining is.
Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 12 Discovering New Knowledge – Data Mining.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
Chapter 20 Data Analysis and Mining. 2 n Decision Support Systems  Obtain high-level information out of detailed information stored in (DB) transaction-processing.
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
Elsayed Hemayed Data Mining Course
Data Mining Copyright KEYSOFT Solutions.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
COM24111: Machine Learning Decision Trees Gavin Brown
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
Data Mining Practical Machine Learning Tools and Techniques Chapter 6.3: Association Rules Rodney Nielsen Many / most of these slides were adapted from:
Data Mining is the process of analyzing data and summarizing it into useful information Data Mining is usually used for extremely large sets of data It.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
Oracle Advanced Analytics
Data Mining – Intro.
DATA MINING © Prentice Hall.
Prepared by: Mahmoud Rafeek Al-Farra
Decision Trees: Another Example
Mining Time-Changing Data Streams
Data Mining Concept Description
I don’t need a title slide for a lecture
Privacy Preserving Data Mining
כריית נתונים.
Machine Learning: Lecture 3
Play Tennis ????? Day Outlook Temperature Humidity Wind PlayTennis
Data Mining: Introduction
Decision Trees Decision tree representation ID3 learning algorithm
Artificial Intelligence 9. Perceptron
Kenneth C. Laudon & Jane P. Laudon
A task of induction to find patterns
A task of induction to find patterns
Presentation transcript:

An overview of The IBM Intelligent Miner for Data By: Neeraja Rudrabhatla 11/04/1999

Mining Features supported by the Data Miner: Association Rules Clustering - Demographic, Neural networks Predicting classifications - Neural Networks, Decision Trees Predicting values Discovering sequential patterns Discovering similar time sequences

Steps for mining data using the Data Miner: Creation of data Analyze and prepare data for mining Mine the data using one or a combination of mining techniques Visualize mining results using advanced graphical techniques

Main Window of the Data Miner:

Database used for mining association rules: Store ID Customer # Date(yymmdd) Transaction # ItemID

Name Mapping:

Results of mining for associations:

Results on the automobile Database:

Another view:

Database used for Clustering: Gender Age Siblings Income Type Product female red 2 female green 3 male red 4 female green 5 male blue 6 female blue 7 female green8 female pink 1 female red 2 female pink3 female green 4 male blue 5 male blue 6 female pink 7 female green 8 male blue 1 male blue 2 female green 3

Clustering - Demographic: Max #clusters: 9 Accuracy: 5% Max #clusters: 9 Accuracy: 5%

Details of Cluster 7:

Detailed pie-chart for attribute Type:

Detailed bar-graph of attribute Age:

Output obtained with Clustering using Neural Networks:

Details of Cluster 6:

Database used for Classification: Day Outlook Temperature Humidity Wind PlayTennis D1 Sunny Hot High Weak No D2 Sunny Hot High Strong No D3 Overcast Hot High Weak Yes D4 Rain Mild High Weak Yes D5 Rain Cool Normal Weak Yes D6 Rain Cool Normal Strong No D7 Overcast Cool Normal Strong Yes D8 Sunny Mild High Weak No D9 Sunny Cool Normal Weak Yes D10 Rain Mild Normal Weak Yes D11 Sunny Mild Normal Strong Yes D12 Overcast Mild High Strong Yes D13 Overcast Hot Normal Weak Yes D14 Rain Mild High Strong No

Classification using Decision Tree:

A view of a leaf node of the decision tree:

Classification using neural network: In-sample: 4 Out-Sample: 1 Accuracy: 80 Error: 10 Learning Rate: 0.1 Momentum: 0.9 In-sample: 4 Out-Sample: 1 Accuracy: 80 Error: 10 Learning Rate: 0.1 Momentum: 0.9

Viewing the results in bar-graphs:

Database for Value Prediction: D1 Sunny 80 High Weak No D2 Sunny 75 High Strong No D3 Overcast 70 High Weak Yes D4 Rain 55 High Weak Yes D5 Rain 32 Normal Weak Yes D6 Rain 35 Normal Strong No D7 Overcast 40 Normal Strong Yes D8 Sunny 60 High Weak No D9 Sunny 20 Normal Weak Yes D10 Rain 67 Normal Weak Yes D11 Sunny 62 Normal Strong Yes D12 Overcast 58 High Strong Yes D13 Overcast 74 Normal Weak Yes D14 Rain 61 High Strong No

Results of PlayTennis: In-sample: 2 Out-sample: 1 In-sample: 2 Out-sample: 1

One partition of the PlayTennis-Prediction:

Textual Representation of a single partition:

Sequential Patterns Mining and Time Sequence Mining: Sequential patterns are used to find predictable patterns of behavior over a period of time. (A certain behavior at a given time is likely to produce another behavior or a sequence of behaviors within a certain time-span) Time sequences help find all occurrences of similar subsequences in a database of time sequences.

Sequences: Combine several objects into a single object that you can run The benefit is that you can combine several steps into one step If you combine several functions into a sequence, you need run only the sequence, which then runs each of the objects within it

Applications: The Intelligent Miner offerings are intended for use by Data Analysts and Business Technologists in the following areas: Perform database marketing Streamline business and manufacturing processes Detect potential cases of fraud Helps in customer relationship management