MSV 33: Measures of Spread

Slides:



Advertisements
Similar presentations
Unit 16: Statistics Sections 16AB Central Tendency/Measures of Spread.
Advertisements

Dual Tragedies in the B-ham Paper. Module 2 Simple Descriptive Statistics and Univariate Displays of Data A Tale of Three Cities George Howard, DrPH.
Measures of Dispersion or Measures of Variability
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 3: Central Tendency And Dispersion.
1 Chapter 4: Variability. 2 Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure.
Variability Ibrahim Altubasi, PT, PhD The University of Jordan.
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
Measures of Central Tendency
May 28, 2008Stat Lecture 3 - Numerical Summaries 1 Exploring Data Numerical Summaries of One Variable Statistics Lecture 3.
Summary statistics Using a single value to summarize some characteristic of a dataset. For example, the arithmetic mean (or average) is a summary statistic.
Exploration of Mean & Median Go to the website of “Introduction to the Practice of Statistics”website Click on the link to “Statistical Applets” Select.
Statistics Recording the results from our studies.
The Sample Variance © Chistine Crisp Edited by Dr Mike Hughes.
Basic Statistics Concepts Marketing Logistics. Basic Statistics Concepts Including: histograms, means, normal distributions, standard deviations.
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Chapter 3 Basic Statistics Section 2.2: Measures of Variability.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Measures of Variability. Variability Measure of the spread or dispersion of a set of data 4 main measures of variability –Range –Interquartile range –Variance.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Part II  igma Freud & Descriptive Statistics Chapter 3 Viva La Difference: Understanding Variability.
CHS Statistics 2.5: Measures of Spread
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
1 Review Mean—arithmetic average, sum of all scores divided by the number of scores Median—balance point of the data, exact middle of the distribution,
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Page 1 Chapter 3 Variability. Page 2 Central tendency tells us about the similarity between scores Variability tells us about the differences between.
Measures of Position. ● The standard deviation is a measure of dispersion that uses the same dimensions as the data (remember the empirical rule) ● The.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Measures of Dispersion How far the data is spread out.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Averages and Variation.
DATA ANALYSIS n Measures of Central Tendency F MEAN F MODE F MEDIAN.
INVESTIGATION 1.
Statistics and parameters. To find out about a population we take a sample.
Unit 3 Lesson 2 (4.2) Numerical Methods for Describing Data
Practice Page 65 –2.1 Positive Skew Note Slides online.
Measures of Spread 1. Range: the distance from the lowest to the highest score * Problem of clustering differences ** Problem of outliers.
Variability Pick up little assignments from Wed. class.
Numerical Measures of Variability
Chapter 3 Looking at Data: Distributions Chapter Three
Chapter 3: Averages and Variation Section 2: Measures of Dispersion.
Chapter 4: Variability. Variability Provides a quantitative measure of the degree to which scores in a distribution are spread out or clustered together.
Statistics and Modelling Topic 1: Introduction to statistical analysis Purpose – To revise and advance our understanding of descriptive statistics.
Measures of variability: understanding the complexity of natural phenomena.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
Thinking Mathematically Statistics: 12.3 Measures of Dispersion.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
1 Research Methods in Psychology AS Descriptive Statistics.
Chapter 1 Lesson 7 Variance and Standard Deviation.
One-Variable Statistics. Descriptive statistics that analyze one characteristic of one sample  Where’s the middle?  How spread out is it?  How do different.
Descriptive statistics
One-Variable Statistics
Practice Page Practice Page Positive Skew.
Mathematical Presentation of Data Measures of Dispersion
Reasoning in Psychology Using Statistics
IB Psychology Today’s Agenda: Turn in:
CHAPTER 1 Exploring Data
Central tendency and spread
Chapter 3.
Chapter 5: Describing Distributions Numerically
Please take out Sec HW It is worth 20 points (2 pts
11.1 Measures of Center and Variation
Chapter 3 Section 4 Measures of Position.
Summary descriptive statistics: means and standard deviations:
Describing Quantitative Data with Numbers
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Presentation transcript:

MSV 33: Measures of Spread www.making-statistics-vital.co.uk MSV 33: Measures of Spread

The Bee Academy

‘And our topic today, my fellow bees, is spread!’ Professor Zzub ‘Mmm...’

‘I’m lost. Example please...’ ‘No, no, no, Millie! I mean, How can we measure how spread out a data set is!’ ‘I’m lost. Example please...’ ‘The data sets 1, 3, 5, 7, 9 and 3, 4, 5, 6, 7 have the same mean, but the first set is clearly more spread out than the second.’

‘Nice idea, Ding – and this measure is used! It’s called the RANGE. ‘So you are asking how we could measure that – how about the top number take away the bottom for each set? If the spread is big, that’ll be big!’ 1, 3, 5, 7, 9 and 3, 4, 5, 6, 7 ‘Nice idea, Ding – and this measure is used! It’s called the RANGE. So the range for our first set is 9 - 1 = 8, while the range for our second set is 7 - 3 = 4.’

‘Let me guess – there’s more to it than that.’ ‘Sadly, Brenda, the range is badly affected by extreme values or outliers. It can give a rather misleading picture of the data.’ 1, 3, 5, 7, 9, 11, 13 and 3, 4, 5, 6, 7, 8, 20 Range = 12 Range = 17

‘Okay, then, don’t take all the data; chuck away the lowest quarter, and the highest quarter, and THEN take the range. Just taking the middle 50%, you’ve got rid of all those extreme values.‘ ‘1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 ‘Great idea, Paul – so for example with this small data set, we can add the quartiles, Q1, Q2 (the median) and Q3...’ ‘... And the Interquartile Range is Q3 – Q1 = 6, the range of the middle 50% of the data.

‘I’ve got another idea!’ ‘What’s that, Millie?’ ‘1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 ‘Go back to this data set again. We could find the mean, then find the difference of each of these numbers from the mean, and then add the differences together. If the numbers are spread out, then this will be big!’

‘That is nearly a great idea, Millie, but watch what happens...’ ‘1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 ‘So the differences add to 0. Always.’ ‘But that is easily fixed...’

‘Find the POSITIVE difference of each of these numbers from the mean, and then add these differences together. It won’t be 0 now!’ ‘Indeed, Virender, the sum now is 30. But is that a fair measure of spread?’

‘Surely you have to divide by the total number of numbers you have - to take an average!’ ‘Excellent, Ding! And this takes us to what is called ‘the mean deviation from the mean’. If we write it in symbols, we have

‘There is still a problem, however – The modulus function is not always easy to handle mathematically. It is true that |ab|=|a||b|, but it is not generally true that |a + b| = |a|+|b|.’ ‘Well, there are other ways to make the differences from the mean all positive. You could square the differences, for example!’ ‘Great idea, Millie. So we can find the square of the difference of each of these numbers from the mean, and then add these together. Then divide by the total number of numbers we have.’

‘This is called the MSD, or ‘the population variance’. If we multiply out, we get an alternative formulation that is usually easier to calculate, especially if the mean is not a whole number.’

Is this the measure of spread ‘As before.’ ‘So have we got it now? Is this the measure of spread we generally use?’ ‘We are very nearly there, Brenda. There is, sadly, a problem with the MSD. Most of the time we are taking a SAMPLE from a population. We would like the expectation of our variance statistic to be the variance of the population. But in order for that to happen...

‘We have to take our MSD statistic... ‘And divide by n-1 rather than n.’ ‘This statistic is called ‘the ‘sample variance’ or simply the ‘variance’. The expected value of this is the population variance. As with the population variance statistics, there is an alternative form... ‘Which is often easier to use.’

‘So is that all the measures of spread we need to know?’ ‘I should add, Virender, that we do use the square root of the MSD (called RMSD) and the square root of the variance (called the Standard Deviation) as measures of spread too. The advantages of the RMSD and the SD are that they are measured in the same units as the random variable we are interested in.’ ‘So to summarise...’

Interquartile range (IQR)= Q3  Q1, where the quartiles Q1, Q2 and Q3 divide the data set into four groups of equal size. Range = Top value – bottom value.

Mean Square Deviation (population variance). Root Mean Square Deviation = RMSD. Variance (or sample variance). Standard Deviation.

is written by Jonny Griffiths With thanks to pixabay.com www.making-statistics-vital.co.uk is written by Jonny Griffiths hello@jonny-griffiths.net