Classification as data mining tool Classification as data mining tool Done by William Hellela William Hellela Rauf Gadar Alex Prewett.

Slides:



Advertisements
Similar presentations
Data Mining Tools Overview Business Intelligence for Managers.
Advertisements

Data Mining Tri Nguyen. Agenda Data Mining As Part of KDD Decision Tree Association Rules Clustering Amazon Data Mining Examples.
Cluster Analysis Measuring latent groups. Cluster Analysis - Discussion Definition Vocabulary Simple Procedure SPSS example ICPSR and hands on.
Brown, Suter, and Churchill Basic Marketing Research (8 th Edition) © 2014 CENGAGE Learning Basic Marketing Research Customer Insights and Managerial Action.
Data Mining: A Closer Look Chapter Data Mining Strategies.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Week 9 Data Mining System (Knowledge Data Discovery)
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
University of Minnesota
Building Knowledge-Driven DSS and Mining Data
Business Intelligence
What is Predictive Modeling?. Predictive Modeling encompasses a variety of techniques to analyze current and historical facts to make predictions about.
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
Data Mining: A Closer Look
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
Discriminant Analysis Testing latent variables as predictors of groups.
Data Mining: An Introduction Wing Kee Ho Xiaohua Luan.
Data Mining CS 157B Section 2 Keng Teng Lao. Overview Definition of Data Mining Application of Data Mining.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
4.00 Understand promotion and intermediate uses of marketing- information.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Enterprise systems infrastructure and architecture DT211 4
CSCI 347 / CS 4206: Data Mining Module 01: Introduction Topic 03: Stages in Data Mining.
1 Data Mining DT211 4 Refer to Connolly and Begg 4ed.
Data Mining Solutions (Westphal & Blaxton, 1998) Dr. K. Palaniappan Dept. of Computer Engineering & Computer Science, UMC.
© Copyright McGraw-Hill CHAPTER 1 The Nature of Probability and Statistics.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Multivariate Data Analysis CHAPTER seventeen.
Chapter Fourteen Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Business Plug-In B18 Business Intelligence.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Banking on Analytics Dr A S Ramasastri Director, IDRBT.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
APPLICATION OF DATAMINING TOOL FOR CLASSIFICATION OF ORGANIZATIONAL CHANGE EXPECTATION Şule ÖZMEN Serra YURTKORU Beril SİPAHİ.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
What is Data Mining? process of finding correlations or patterns among dozens of fields in large relational databases process of finding correlations or.
Week 1 - An Introduction to Machine Learning & Soft Computing
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 5-1 Data Mining Methods: Classification Most frequently used DM method Employ supervised.
MIS2502: Data Analytics Advanced Analytics - Introduction.
DR. SATISH NARGUNDKAR GEORGIA STATE UNIVERSITY Analytics Overview.
Decision Tree Algorithms Rule Based Suitable for automatic generation.
CHAPTER 8 DATA MINING BASICS.
Computational Biology Group. Class prediction of tumor samples Supervised Clustering Detection of Subgroups in a Class.
Data Mining Copyright KEYSOFT Solutions.
Customer Relationship Management (CRM) Chapter 4 Customer Portfolio Analysis Learning Objectives Why customer portfolio analysis is necessary for CRM implementation.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Introduction Exploring Categorical Variables Exploring Numerical Variables Exploring Categorical/Numerical Variables Selecting Interesting Subsets of Data.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
By Arijit Chatterjee Dr
Business Intelligence
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
Business Research Methods William G. Zikmund
Lecture 3: Linear Regression (with One Variable)
Introduction to Data Mining
Dr. Satish Nargundkar Georgia State University
Fuzzy Clustering.
MIS2502: Data Analytics Clustering and Segmentation
MIS2502: Data Analytics Clustering and Segmentation
MIS2502: Data Analytics Introduction to Advanced Analytics
Kenneth C. Laudon & Jane P. Laudon
CSE591: Data Mining by H. Liu
Presentation transcript:

Classification as data mining tool Classification as data mining tool Done by William Hellela William Hellela Rauf Gadar Alex Prewett

Definition Classification is the process of dividing a dataset into mutually exclusive groups so that the members of each group are as "close" as possible to one another, and different groups are as "far" as possible from one another, where distance is measured with respect to specific variable (s) we are trying to predict. Classification is the process of dividing a dataset into mutually exclusive groups so that the members of each group are as "close" as possible to one another, and different groups are as "far" as possible from one another, where distance is measured with respect to specific variable (s) we are trying to predict.

Business Examples A typical classification problem is to divide a database of companies into groups that are as homogeneous as possible with respect to a creditworthiness variable with values "Good" and "Bad“ A typical classification problem is to divide a database of companies into groups that are as homogeneous as possible with respect to a creditworthiness variable with values "Good" and "Bad“ Predicting the response to direct marketing campaign [will/ will not respond] Predicting the response to direct marketing campaign [will/ will not respond]

Classification vs. Regression What distinguishes classification from regression is the type of output that is predicted. Classification, as the name implies, predicts class membership. What distinguishes classification from regression is the type of output that is predicted. Classification, as the name implies, predicts class membership. For example, a classification model predicts that a potential customer will respond to an offer. For example, a classification model predicts that a potential customer will respond to an offer.

Classification vs. Regression (cont) However, regression model predicts a specific value. For example, a model predicts that a certain customer profitability will be $854 However, regression model predicts a specific value. For example, a model predicts that a certain customer profitability will be $854

Predictive vs. Descriptive Models Predictive: forecast explicit values, based on patterns determined from known results. Predictive: forecast explicit values, based on patterns determined from known results. Descriptive: describe the patterns in existing data, and are generally used to create meaningful subgroups. Descriptive: describe the patterns in existing data, and are generally used to create meaningful subgroups.

Data Mining Models

Classification VS Other tools Classification is used to answer questions form a finite set of classes Classification is used to answer questions form a finite set of classes Estimation is used to answer questions from an unknown, continuous set of answers Estimation is used to answer questions from an unknown, continuous set of answers Clustering also does not require a finite set of predefined classes Clustering also does not require a finite set of predefined classes Prediction is a task of learning a pattern from examples using a developed model. Prediction is a task of learning a pattern from examples using a developed model.

Questions or comments Any Questions ? Any Questions ?