Diagonal is sum of variances In general, these will be larger when “within” class variance is larger (a bad thing) Sw(iris[,1:4],iris[,5]) Sepal.Length.

Slides:



Advertisements
Similar presentations
SPEED IS: Distance traveled by an object in a given amount of time.
Advertisements

1 Peter Fox Data Analytics – ITWS-4963/ITWS-6965 Week 6a, February 25, 2014, SAGE 3101 kNN, K-Means, Clustering and Bayesian Inference.
Decision Trees in R Arko Barman With additions and modifications by Ch. Eick COSC 4335 Data Mining.
Three-dimensional co-occurrence matrices & Gabor filters: Current progress Gray-level co-occurrence matrices Carl Philips Gabor filters Daniel Li Supervisor:
Variability 2011, 10, 4. Learning Topics  Variability of a distribution: The extent to which values vary –Range –Variance** –Standard Deviation**
Neural Nets How the brain achieves intelligence 10^11 1mhz cpu’s.
Rules for means Rule 1: If X is a random variable and a and b are fixed numbers, then Rule 2: If X and Y are random variables, then.
Standard Deviation A measure of variability
A Short Introduction to Weka Natural Language Processing Thursday, September 25th.
1 Cluster Analysis EPP 245 Statistical Analysis of Laboratory Data.
Supervised Learning & Classification, part I Reading: W&F ch 1.1, 1.2, , 3.2, 3.3, 4.3, 6.1*
Arko Barman Slightly edited by Ch. Eick COSC 6335 Data Mining
A Short Introduction to Weka Natural Language Processing Thursday, September 27 Frank Enos and Andrew Rosenberg.
X = =2.67.
1 Multivariate Analysis and Discrimination EPP 245 Statistical Analysis of Laboratory Data.
1 Multivariate Analysis and Discrimination EPP 245 Statistical Analysis of Laboratory Data.
Properties of Poisson The mean and variance are both equal to. The sum of independent Poisson variables is a further Poisson variable with mean equal to.
ADDING INTEGERS Positive + Positive = Positive Positive + Positive = Positive ( +3) + (+2) = +5 ( +3) + (+2) = +5 When a number is positive, you do not.
Tree-Based Methods (V&R 9.1) Demeke Kasaw, Andreas Nguyen, Mariana Alvaro STAT 6601 Project.
A quick introduction to R prog. 淡江統計 陳景祥 (Steve Chen)
1 Measures of Variability Chapter 5 of Howell (except 5.3 and 5.4) People are all slightly different (that’s what makes it fun) Not everyone scores the.
R-Graphics Day 2 Stephen Opiyo. Basic Graphs One of the main reasons data analysts turn to R is for its strong graphic capabilities. R generates publication-ready.
Right Triangles And the Pythagorean Theorem. Legs of a Right Triangle Leg -the two sides of a right triangle that form the right angle Leg.
Univariate Graphs III Review Create histogram from Commands Window. Multipanel histogram. Quantile Plots Quantile-Normal Plots Quantile-Quantile Plots.
Find the mean, variance, and standard deviation of these values: 6.9, 8.7, 7.6, 4.8, 9.0 Sum=
Last time we talked about how the number of individuals in a population changes over time. Who remembers what this is called?
A Brief Introduction to R Programming Darren J. Fitzpatrick, PhD The Bioinformatics Support Team 27/08/2015.
Figure 1.1 Rules for the contact lens data.. Figure 1.2 Decision tree for the contact lens data.
Lecture 6 Statistical Lecture ─ Cluster Analysis.
1 Running Clustering Algorithm in Weka Presented by Rachsuda Jiamthapthaksin Computer Science Department University of Houston.
List a zero pair. Fill in the blank. 7 + ____ = 0.
ITSC/University of Alabama in Huntsville ADaM version 4.0 (Eagle) Tutorial Information Technology and Systems Center University of Alabama in Huntsville.
Weka Just do it Free and Open Source ML Suite Ian Witten & Eibe Frank University of Waikato New Zealand.
Machine Learning (ML) with Weka Weka can classify data or approximate functions: choice of many algorithms.
The Substitution Method Objectives: To solve a system of equations by substituting for a variable.
Neural networks – Hands on
1 Statistics & R, TiP, 2011/12 Multivariate Methods  Multivariate data  Data display  Principal component analysis Unsupervised learning technique 
How are features distributed?
Decision Tree Lab. Load in iris data: Display iris data as a sanity.
Data Mining CH6 Implementation: Real machine learning schemes(2) Reporter: H.C. Tsai.
Lesson Topic: The Mean Absolute Deviation (MAD) Lesson Objective: I can…  I can calculate the mean absolute deviation (MAD) for a given data set.  I.
Addition Flashcards Sum of
How can I subtract by using addition? Isn’t addition the opposite of subtraction?
How far apart are these points?. A (6,4) B (6,0) A B 4 units.
2.4 Measures of Variation The Range of a data set is simply: Range = (Max. entry) – (Min. entry)
Wed 5/25 Lesson 11 – 7 Learning Objective: To find standard deviation & variance Hw: Pg. 722 #6 – 9, 13, 21.
Rodney Nielsen Many of these slides were adapted from: I. H. Witten, E. Frank and M. A. Hall Data Science Output: Knowledge Representation WFH: Data Mining,
Common Linear & Classification for Machine Learning using Microsoft R
Clustering CSC 600: Data Mining Class 21.
Evaluating-Ayasdi’s-Topological-Data-Analysis-For-Big-Data_HKim2015
Discriminant Analysis
Consecutive Integer Problems
Even More on Dimensions
Peter Fox and Greg Hughes Data Analytics – ITWS-4600/ITWS-6600
Consecutive Integers: Numbers are one apart
מיחזור במערכת החינוך.
K-Means Lab.
مفهوم التربية المدنية.
Weka Free and Open Source ML Suite Ian Witten & Eibe Frank
أهداف الفصل الفرق بين الموازنة المرنة والثابتة .
DataMining, Morgan Kaufmann, p Mining Lab. 김완섭 2004년 10월 27일
Experimental Design.
Magic Squares   10   X.
10.2 Variance Math Models.
Nonlinear Fitting.
Objective- To use the Third Side Property to find the third side of a triangle given the other two. 5 ft. 7 ft. X What could the value of x be?
H0: m = m0 Ha: m > m0 Ha: m < m0 Ha: m  m0 “EXTREME”
Copyright: Martin Kramer
Arko Barman COSC 6335 Data Mining
Presentation transcript:

Diagonal is sum of variances In general, these will be larger when “within” class variance is larger (a bad thing) Sw(iris[,1:4],iris[,5]) Sepal.Length Sepal.Width Petal.Length Petal.Width Sepal.Length Sepal.Width Petal.Length Petal.Width

When clusters are far apart their means are far from the global mean The differences between the means grow