Measures of Dispersion

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

DESCRIPTIVE STATISTICS Chapter 2 BASED ON SCHAUM’S Outline of Probability and Statistics BY MURRAY R. SPIEGEL, JOHN SCHILLER, AND R. ALU SRINIVASAN ABRIDGMENT,
Statistical Reasoning for everyday life
Lecture 2 Describing Data II ©. Summarizing and Describing Data Frequency distribution and the shape of the distribution Frequency distribution and the.
Descriptive Statistics
Measures of Dispersion or Measures of Variability
Calculating & Reporting Healthcare Statistics
DESCRIBING DATA: 2. Numerical summaries of data using measures of central tendency and dispersion.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
Descriptive Statistics
Analysis of Research Data
Measures of Variability
VARIABILITY. PREVIEW PREVIEW Figure 4.1 the statistical mode for defining abnormal behavior. The distribution of behavior scores for the entire population.
Data observation and Descriptive Statistics
Standard Scores & Correlation. Review A frequency curve either normal or otherwise is simply a line graph of all frequency of scores earned in a data.
Learning Objectives In this chapter you will learn about the importance of variation how to measure variation range variance standard deviation.
SHOWTIME! STATISTICAL TOOLS IN EVALUATION DESCRIPTIVE VALUES MEASURES OF VARIABILITY.
Measures of Variability: Range, Variance, and Standard Deviation
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
Z Scores & Correlation Greg C Elvers.
1 Measures of Central Tendency Greg C Elvers, Ph.D.
Measures of Central Tendency
Overview Summarizing Data – Central Tendency - revisited Summarizing Data – Central Tendency - revisited –Mean, Median, Mode Deviation scores Deviation.
Smith/Davis (c) 2005 Prentice Hall Chapter Six Summarizing and Comparing Data: Measures of Variation, Distribution of Means and the Standard Error of the.
Chapter 4 Variability. Variability In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Descriptive Statistics Measures of Variation. Essentials: Measures of Variation (Variation – a must for statistical analysis.) Know the types of measures.
Variability.
Part II  igma Freud & Descriptive Statistics Chapter 3 Viva La Difference: Understanding Variability.
Measures of Dispersion & The Standard Normal Distribution 2/5/07.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
By: Amani Albraikan 1. 2  Synonym for variability  Often called “spread” or “scatter”  Indicator of consistency among a data set  Indicates how close.
Page 1 Chapter 3 Variability. Page 2 Central tendency tells us about the similarity between scores Variability tells us about the differences between.
Measures of Dispersion & The Standard Normal Distribution 9/12/06.
Objectives The student will be able to: find the variance of a data set. find the standard deviation of a data set.
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Measures of Dispersion
Measures of Dispersion How far the data is spread out.
INVESTIGATION 1.
Practice Page 65 –2.1 Positive Skew Note Slides online.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
Variability Pick up little assignments from Wed. class.
3 common measures of dispersion or variability Range Range Variance Variance Standard Deviation Standard Deviation.
Central Tendency & Dispersion
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Part II Sigma Freud and Descriptive Statistics Chapter 3 Vive La Différence: Understanding Variability.
Summary Statistics: Measures of Location and Dispersion.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
1.  In the words of Bowley “Dispersion is the measure of the variation of the items” According to Conar “Dispersion is a measure of the extent to which.
LIS 570 Summarising and presenting data - Univariate analysis.
Numerical descriptions of distributions
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
1 Day 1 Quantitative Methods for Investment Management by Binam Ghimire.
Descriptive Statistics(Summary and Variability measures)
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Chapter Six Summarizing and Comparing Data: Measures of Variation, Distribution of Means and the Standard Error of the Mean, and z Scores PowerPoint Presentation.
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Practice Page Practice Page Positive Skew.
Chapter 4 Fundamental statistical characteristics II: Dispersion and form measurements.
Descriptive Statistics
Measures of Dispersion
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Measures of Dispersion
Presentation transcript:

Measures of Dispersion Greg C Elvers, Ph.D.

Definition Measures of dispersion are descriptive statistics that describe how similar a set of scores are to each other The more similar the scores are to each other, the lower the measure of dispersion will be The less similar the scores are to each other, the higher the measure of dispersion will be In general, the more spread out a distribution is, the larger the measure of dispersion will be

Measures of Dispersion Which of the distributions of scores has the larger dispersion? The upper distribution has more dispersion because the scores are more spread out That is, they are less similar to each other

Measures of Dispersion There are three main measures of dispersion: The range The semi-interquartile range (SIR) Variance / standard deviation

The Range The range is defined as the difference between the largest score in the set of data and the smallest score in the set of data, XL - XS What is the range of the following data: 4 8 1 6 6 2 9 3 6 9 The largest score (XL) is 9; the smallest score (XS) is 1; the range is XL - XS = 9 - 1 = 8

When To Use the Range The range is used when you have ordinal data or you are presenting your results to people with little or no knowledge of statistics The range is rarely used in scientific work as it is fairly insensitive It depends on only two scores in the set of data, XL and XS Two very different sets of data can have the same range: 1 1 1 1 9 vs 1 3 5 7 9

The Semi-Interquartile Range The semi-interquartile range (or SIR) is defined as the difference of the first and third quartiles divided by two The first quartile is the 25th percentile The third quartile is the 75th percentile SIR = (Q3 - Q1) / 2

SIR Example What is the SIR for the data to the right? 25 % of the scores are below 5 5 is the first quartile 25 % of the scores are above 25 25 is the third quartile SIR = (Q3 - Q1) / 2 = (25 - 5) / 2 = 10

When To Use the SIR The SIR is often used with skewed data as it is insensitive to the extreme scores

Variance Variance is defined as the average of the square deviations:

What Does the Variance Formula Mean? First, it says to subtract the mean from each of the scores This difference is called a deviate or a deviation score The deviate tells us how far a given score is from the typical, or average, score Thus, the deviate is a measure of dispersion for a given score

What Does the Variance Formula Mean? Why can’t we simply take the average of the deviates? That is, why isn’t variance defined as: This is not the formula for variance!

What Does the Variance Formula Mean? One of the definitions of the mean was that it always made the sum of the scores minus the mean equal to 0 Thus, the average of the deviates must be 0 since the sum of the deviates must equal 0 To avoid this problem, statisticians square the deviate score prior to averaging them Squaring the deviate score makes all the squared scores positive

What Does the Variance Formula Mean? Variance is the mean of the squared deviation scores The larger the variance is, the more the scores deviate, on average, away from the mean The smaller the variance is, the less the scores deviate, on average, from the mean

Standard Deviation When the deviate scores are squared in variance, their unit of measure is squared as well E.g. If people’s weights are measured in pounds, then the variance of the weights would be expressed in pounds2 (or squared pounds) Since squared units of measure are often awkward to deal with, the square root of variance is often used instead The standard deviation is the square root of variance

Standard Deviation Standard deviation = variance Variance = standard deviation2

Computational Formula When calculating variance, it is often easier to use a computational formula which is algebraically equivalent to the definitional formula: 2 is the population variance, X is a score,  is the population mean, and N is the number of scores

Computational Formula Example

Computational Formula Example

Variance of a Sample Because the sample mean is not a perfect estimate of the population mean, the formula for the variance of a sample is slightly different from the formula for the variance of a population: s2 is the sample variance, X is a score, X is the sample mean, and N is the number of scores

Measure of Skew Skew is a measure of symmetry in the distribution of scores Normal (skew = 0) Positive Skew Negative Skew

Measure of Skew The following formula can be used to determine skew:

Measure of Skew If s3 < 0, then the distribution has a negative skew If s3 > 0 then the distribution has a positive skew If s3 = 0 then the distribution is symmetrical The more different s3 is from 0, the greater the skew in the distribution

Kurtosis (Not Related to Halitosis) Kurtosis measures whether the scores are spread out more or less than they would be in a normal (Gaussian) distribution Mesokurtic (s4 = 3) Leptokurtic (s4 > 3) Platykurtic (s4 < 3)

Kurtosis When the distribution is normally distributed, its kurtosis equals 3 and it is said to be mesokurtic When the distribution is less spread out than normal, its kurtosis is greater than 3 and it is said to be leptokurtic When the distribution is more spread out than normal, its kurtosis is less than 3 and it is said to be platykurtic

Measure of Kurtosis The measure of kurtosis is given by:

s2, s3, & s4 Collectively, the variance (s2), skew (s3), and kurtosis (s4) describe the shape of the distribution