Prepared by: Mahmoud Rafeek Al-Farra

Slides:



Advertisements
Similar presentations
Data Mining Tools Overview Business Intelligence for Managers.
Advertisements

Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation Lecture Notes for Chapter 4 Part I Introduction to Data Mining by Tan,
Classification: Definition Given a collection of records (training set ) –Each record contains a set of attributes, one of the attributes is the class.
Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation Lecture Notes for Chapter 4 Introduction to Data Mining by Tan, Steinbach,
Data Mining Classification: Alternative Techniques
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
Introduction to Data Mining with XLMiner
Data Mining Classification: Naïve Bayes Classifier
Lecture Notes for Chapter 4 Introduction to Data Mining
SAK 5609 DATA MINING Prof. Madya Dr. Md. Nasir bin Sulaiman
Data Mining with Decision Trees Lutz Hamel Dept. of Computer Science and Statistics University of Rhode Island.
Classification.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
DATA MINING : CLASSIFICATION. Classification : Definition  Classification is a supervised learning.  Uses training sets which has correct answers (class.
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
Data Mining – Day 2 Fabiano Dalpiaz Department of Information and Communication Technology University of Trento - Italy
Knowledge Discovery and Data Mining Evgueni Smirnov.
Chapter 8 Discriminant Analysis. 8.1 Introduction  Classification is an important issue in multivariate analysis and data mining.  Classification: classifies.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Basic Data Mining Technique
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Knowledge Discovery and Data Mining Evgueni Smirnov.
1 Knowledge Discovery Transparencies prepared by Ho Tu Bao [JAIST] ITCS 6162.
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
Classification and Prediction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot Readings: Chapter 6 – Han and Kamber.
Classification Techniques: Bayesian Classification
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
Prepared by: Mahmoud Rafeek Al-Farra
Chapter 6 Classification and Prediction Dr. Bernard Chen Ph.D. University of Central Arkansas.
Classification And Bayesian Learning
Machine Learning: Decision Trees Homework 4 assigned courtesy: Geoffrey Hinton, Yann LeCun, Tan, Steinbach, Kumar.
1 Classification: predicts categorical class labels (discrete or nominal) classifies data (constructs a model) based on the training set and the values.
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Introduction to Data Mining Clustering & Classification Reference: Tan et al: Introduction to data mining. Some slides are adopted from Tan et al.
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
Jawad Tahsin Danish Mustafa Zaidi Kazim Zaidi Zulfiqar Hadi.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
Data Mining: Confluence of Multiple Disciplines Data Mining Database Systems Statistics Other Disciplines Algorithm Machine Learning Visualization.
CSE 4705 Artificial Intelligence
Who am I? Work in Probabilistic Machine Learning Like to teach 
Semi-Supervised Clustering
Introduction Machine Learning 14/02/2017.
School of Computer Science & Engineering
Prepared by: Mahmoud Rafeek Al-Farra
Chapter 6 Classification and Prediction
Chapter 11: Learning Introduction
Data Mining: Concepts and Techniques Course Outline
Classification and Prediction
Classification Techniques: Bayesian Classification
Prepared by: Mahmoud Rafeek Al-Farra
Prepared by: Mahmoud Rafeek Al-Farra
Machine Learning with Weka
Data Mining 資料探勘 分群分析 (Cluster Analysis) Min-Yuh Day 戴敏育
Classification & Prediction
Supervised vs. unsupervised Learning
Classification and Prediction
CSCI N317 Computation for Scientific Applications Unit Weka
Prepared by: Mahmoud Rafeek Al-Farra
©Jiawei Han and Micheline Kamber
A task of induction to find patterns
Practice Project Overview
Instructor: Vincent Conitzer
COSC 4368 Intro Supervised Learning Organization
Presentation transcript:

Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining Chapter 4: Classification Methods (Definition) Prepared by: Mahmoud Rafeek Al-Farra 2013 www.cst.ps/staff/mfarra

Course’s Out Lines Introduction Data Preparation and Preprocessing Data Representation Classification Methods Evaluation Clustering Methods Mid Exam Association Rules Knowledge Representation Special Case study : Document clustering Discussion of Case studies by students

Out Lines Definition of Classification Learning Supervised vs Unsupervised Classification vs. Prediction How does the classification work ?!

Definition of Classification Classification is (Techniques used to predict group membership for data instances). For example, you may wish to use classification to predict whether the weather on a particular day will be “sunny”, “rainy” or “cloudy”.

Definition of Classification Classification is a classic data mining task, with roots in machine learning. A typical application is : "Given past records of customers who switched to another supplier, predict which current customers are likely to do the same."

Classes are pre-defined Learning This type of learning called supervised learning Example: Classification model may be built to categorize bank loan applications as either safe or risky. Classes are pre-defined

Supervised vs Unsupervised Supervised learning (classification) The set of possible classes is known in advance. New data is classified based on the training set Unsupervised learning (clustering) Set of possible classes is not known. After classification we can try to assign a name to that class. Unsupervised classification is called clustering.

Classification vs. Prediction predicts categorical class labels (discrete or nominal) classifies data based on the training set and the values (class labels) in a classifying attribute and uses it in classifying new data Prediction: models continuous-valued functions, i.e., predicts unknown or missing values

How ?! Given a collection of records (training set ) Each record contains a set of attributes, one of the attributes is the class. Find a model for class attribute as a function of the values of other attributes. Goal: previously unseen records should be assigned a class as accurately as possible. A test set is used to determine the accuracy of the model. Usually, the given data set is divided into training and test sets, with training set used to build the model and test set used to validate it.

How ?! Example: Process 1 Model Construction Classification Algorithms Training Data Classifier (Model) IF rank = ‘professor’ OR years > 6 THEN tenured = ‘yes’

How ?! Tenured? Process 2 Using the Model in Prediction Classifier Testing Data Unseen Data (Jeff, Professor, 4) Tenured?

How ?! 12

Next … Machine learning techniques Decision Trees Neural Networks k-Nearest Neighbors Naïve Bayesian Classifiers

Thanks