R for Classification Jennifer Broughton Shimadzu Research Laboratory Manchester, UK 2 nd May 2013.

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

Slide 1 Insert your own content. Slide 2 Insert your own content.
1 Chapter 40 - Physiology and Pathophysiology of Diuretic Action Copyright © 2013 Elsevier Inc. All rights reserved.
Combining Like Terms. Only combine terms that are exactly the same!! Whats the same mean? –If numbers have a variable, then you can combine only ones.
A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions Jing Gao Wei Fan Jiawei Han Philip S. Yu University of Illinois.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
0 - 0.
ALGEBRAIC EXPRESSIONS
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
ADDING INTEGERS 1. POS. + POS. = POS. 2. NEG. + NEG. = NEG. 3. POS. + NEG. OR NEG. + POS. SUBTRACT TAKE SIGN OF BIGGER ABSOLUTE VALUE.
MULTIPLICATION EQUATIONS 1. SOLVE FOR X 3. WHAT EVER YOU DO TO ONE SIDE YOU HAVE TO DO TO THE OTHER 2. DIVIDE BY THE NUMBER IN FRONT OF THE VARIABLE.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
ALGEBRAIC EXPRESSIONS
Ensemble Learning – Bagging, Boosting, and Stacking, and other topics
O X Click on Number next to person for a question.
© S Haughton more than 3?
1 Directed Depth First Search Adjacency Lists A: F G B: A H C: A D D: C F E: C D G F: E: G: : H: B: I: H: F A B C G D E H I.
1 CS 391L: Machine Learning: Rule Learning Raymond J. Mooney University of Texas at Austin.
Take from Ten First Subtraction Strategy -9 Click on a number below to go directly to that type of subtraction problems
Linking Verb? Action Verb or. Question 1 Define the term: action verb.
Energy & Green Urbanism Markku Lappalainen Aalto University.
Classification.. continued. Prediction and Classification Last week we discussed the classification problem.. – Used the Naïve Bayes Method Today..we.
Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.
Limits (Algebraic) Calculus Fall, What can we do with limits?
Addition 1’s to 20.
25 seconds left…...
Bab /44 Bab 4 Classification: Basic Concepts, Decision Trees & Model Evaluation Part 1 Classification With Decision tree Muhamad Arief Hidayat
Test B, 100 Subtraction Facts
11 = This is the fact family. You say: 8+3=11 and 3+8=11
Week 1.
O X Click on Number next to person for a question.
Homework 1. A Letter Recognition Task (Frey & Slate, 1991) Instances: –20,000 unique letter images were generated by randomly distorting pixel images.
Does one size really fit all? Evaluating classifiers in Bag-of-Visual-Words classification Christian Hentschel, Harald Sack Hasso Plattner Institute.
Classification Classification Examples
Information Extraction Lecture 7 – Linear Models (Basic Machine Learning) CIS, LMU München Winter Semester Dr. Alexander Fraser, CIS.
Pattern Recognition: Readings: Ch 4: , , 4.13
1 Pattern Recognition Pattern recognition is: 1. The name of the journal of the Pattern Recognition Society. 2. A research area in which patterns in data.
Lazy Learning k-Nearest Neighbour Motivation: availability of large amounts of processing power improves our ability to tune k-NN classifiers.
Decision Tree Classifier for Signature Recognition and State Classification in Intrusion Detection IEE591C Presentation Xiangyang Li, Qiang Chen and Yebin.
An Extended Introduction to WEKA. Data Mining Process.
Chapter 7 Neural Networks in Data Mining Automatic Model Building (Machine Learning) Artificial Intelligence.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Business Intelligence and Decision Modeling Week 9 Customer Profiling Decision Trees (Part 2) CHAID CRT.
BAGGING ALGORITHM, ONLINE BOOSTING AND VISION Se – Hoon Park.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
Pattern Recognition 1 Pattern recognition is: 1. The name of the journal of the Pattern Recognition Society. 2. A research area in which patterns in data.
Data Mining – Best Practices Part #2 Richard Derrig, PhD, Opal Consulting LLC CAS Spring Meeting June 16-18, 2008.
© 2013 WESTERN DIGITAL TECHNOLOGIES, INC. ALL RIGHTS RESERVED Machine Learning and Failure Prediction in Hard Disk Drives Dr. Amit Chattopadhyay Director.
A Brief Introduction and Issues on the Classification Problem Jin Mao Postdoc, School of Information, University of Arizona Sept 18, 2015.
Competition II: Springleaf Sha Li (Team leader) Xiaoyan Chong, Minglu Ma, Yue Wang CAMCOS Fall 2015 San Jose State University.
Combining multiple learners Usman Roshan. Decision tree From Alpaydin, 2010.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Source: Procedia Computer Science(2015)70:
Estimating Link Signatures with Machine Learning Algorithms
CS 8520: Artificial Intelligence
Basic machine learning background with Python scikit-learn
CS548 Fall 2017 Decision Trees / Random Forest Showcase by Yimin Lin, Youqiao Ma, Ran Lin, Shaoju Wu, Bhon Bunnag Showcasing work by Cano,
Features & Decision regions
Machine Learning Week 1.
Introduction to Data Mining, 2nd Edition
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Basics of ML Rohan Suri.
Machine Learning with Clinical Data
Advisor: Dr.vahidipour Zahra salimian Shaghayegh jalali Dec 2017
What is Artificial Intelligence?
Presentation transcript:

R for Classification Jennifer Broughton Shimadzu Research Laboratory Manchester, UK 2 nd May 2013

Classification? Automatic Identification of Type (Class) of Object from Measured Variables (Features) Object TypeFeature1Feature2Feature3…….Feature n Label 1val[1,1]val[1,2]val[1,3]……. val[1,n] Label 2val[2,1]val[2,2]val[2,3]…….val[2,n] ………….…….…….…….……… Label mval[m,1]val[m.2]val[m,3]…….val[m,n] 2 of 17

Example Data 3 of 17

Data Preparation & Investigation EDA Technique Box Plots PCA Decision Trees Clustering Training Set Best features to distinguish between classes Relationships between features Feature reduction 4 of 17

Box Plots PCA & Multivariate Analysis: ade4 FactoMineR 5 of 17

Example Classifier 6 of 17

Classification Algorithms in R Rattle: R Analytical Tool to Learn Easily ( Rattle: A Data Mining GUI for R, Graham J Williams, The R Journal, 1(2):45-55 ) 7 of 17

SVM 8 of 17

Ensemble Algorithm 9 of 17

Training and Testing Classification Algorithm: Neural Network Support Vector Machine Random Forest Training Set (labelled) Test Set (unlabelled) Trained Classifier Classification Results Prediction Results + Labels Assess Predictions: Confusion Matrix ROC Curve (2 categories) …. 10 of 17

Using Classifiers in R Select Training Data Build Classifier Run Classifier classifier  algorithm(formula, data, options) (boosting and nnet) classifier.pred  predict(classifier, newdata, options) 11 of 17

SVM & Neural Net Tuning 12 of 17

Classifier Feedback print(classifier) plot(classifier) high Gini Coefficient = high dispersion 13 of 17

Classifier Prediction Results predict(type = “class”) predict(type = “prob”) confusion matrix 14 of 17

False Negative False Negative True Positive True Positive True Negative True Negative False Positive False Positive Binary Classification Results Y N Class Present? Class Detected? Y N   15 of 17

ROC Curves in R ROCR package 16 of 17

Example Results 17 of 17