Measures of Dispersion Advanced Higher Geography Statistics.

Slides:



Advertisements
Similar presentations
Measures of Dispersion or Measures of Variability
Advertisements

1. 2 BIOSTATISTICS TOPIC 5.4 MEASURES OF DISPERSION.
Measures of Dispersion
Week 7: Means, SDs & z-scores problem sheet (answers)
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Measures of Central Tendency
Study of Measures of Dispersion and Position. DATA QUALITATIVEQUANTITATIVE DISCRETE CONTINUOUS.
Objective To understand measures of central tendency and use them to analyze data.
May 06th, Chapter - 7 INFORMATION PRESENTATION 7.1 Statistical analysis 7.2 Presentation of data 7.3 Averages 7.4 Index numbers 7.5 Dispersion from.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
Anthony J Greene1 Dispersion Outline What is Dispersion? I Ordinal Variables 1.Range 2.Interquartile Range 3.Semi-Interquartile Range II Ratio/Interval.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Chapter 3 Descriptive Measures
Statistics Recording the results from our studies.
Measures of Spread Chapter 3.3 – Tools for Analyzing Data I can: calculate and interpret measures of spread MSIP/Home Learning: p. 168 #2b, 3b, 4, 6, 7,
Measures of Dispersion CUMULATIVE FREQUENCIES INTER-QUARTILE RANGE RANGE MEAN DEVIATION VARIANCE and STANDARD DEVIATION STATISTICS: DESCRIBING VARIABILITY.
Why statisticians were created Measure of dispersion FETP India.
Applied Quantitative Analysis and Practices LECTURE#08 By Dr. Osman Sadiq Paracha.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Objectives Vocabulary
Thinking About Psychology: The Science of Mind and Behavior 2e Charles T. Blair-Broeker Randal M. Ernst.
Table of Contents 1. Standard Deviation
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Warm up The following graphs show foot sizes of gongshowhockey.com users. What shape are the distributions? Calculate the mean, median and mode for one.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Holt McDougal Algebra 2 Measures of Central Tendency and Variation Check It Out! Example 3 Make a box-and-whisker plot of the data. Find the interquartile.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Measures of Dispersion
DATA ANALYSIS n Measures of Central Tendency F MEAN F MODE F MEDIAN.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Data and Variation.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
Variability Pick up little assignments from Wed. class.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Introduction to statistics I Sophia King Rm. P24 HWB
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Chapter 2 Describing and Presenting a Distribution of Scores.
1 Research Methods in Psychology AS Descriptive Statistics.
Descriptive Statistics(Summary and Variability measures)
Introduction Dispersion 1 Central Tendency alone does not explain the observations fully as it does reveal the degree of spread or variability of individual.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 2 Describing and Presenting a Distribution of Scores.
One-Variable Statistics. Descriptive statistics that analyze one characteristic of one sample  Where’s the middle?  How spread out is it?  How do different.
Descriptive Statistics
Descriptive statistics
INTRODUCTION TO STATISTICS
One-Variable Statistics
Objectives The student will be able to:
Describing Distributions Numerically
Measures of Central Tendency & Center of Spread
Introduction to Summary Statistics
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Description of Data (Summary and Variability measures)
Introduction to Summary Statistics
Measures of Central Tendency & Center of Spread
Introduction to Summary Statistics
Introduction to Summary Statistics
Introduction to Summary Statistics
Displaying and Summarizing Quantitative Data
Introduction to Summary Statistics
Warm Up # 3: Answer each question to the best of your knowledge.
Summary (Week 1) Categorical vs. Quantitative Variables
Introduction to Summary Statistics
Introduction to Summary Statistics
The data sets {19, 20, 21} and {0, 20, 40} have the same mean and median, but the sets are very different. The way that data are spread out from the mean.
Presentation transcript:

Measures of Dispersion Advanced Higher Geography Statistics

Introduction So far we have looked at ways of summarising data by showing some sort of average (central tendency). But it is often useful to show how much these figures differ from the average. This measure is called dispersion.

Types of dispersion There are five ways of showing dispersion: –Range –Inter-quartile range –Standard deviation –Coefficient of variation –The standard error of the mean

The Range The range is the difference between the maximum and minimum values. The range is quite limited in statistics apart from using it to say the range is quite large or quite small. 10 – 25 – 45 – 47 – 49 – 51 – 52 – 52 – 54 – 56 – 57 – 58 – 60 – 62 – 66 – Range But most results were between

The Inter-Quartile Range The inter-quartile range is the range of the middle half of the values. It is a better measurement to use than the range because it only refers to the middle half of the results. Basically, the extremes are omitted and cannot affect the answer.

To calculate the inter-quartile range we must first find the quartiles. There are three quartiles, called Q1, Q2 & Q3. We do not need to worry about Q2 (this is just the median). Q1 is simply the middle value of the bottom half of the data and Q3 is the middle value of the top half of the data.

We calculate the inter quartile range by taking Q1 away from Q3 (Q3 – Q1). 10 – 25 – 45 – 47 – 49 – 51 – 52 – 52 – 54 – 56 – 57 – 58 – 60 – 62 – 66 – 68 – Remember data must be placed in order Because there is an even number of values (18) we can split them into two groups of 9. Q1Q3 IR = Q3 – Q1, IR = 62 – 49. IR = 13

Your turn Read the information on pages 1 and 2 of the note ‘Measures of Deviation, Dispersion and Variability’ Read pages 74 and 75 of the note ‘Measures of Dispersion’ and complete the exercises on page 76

Standard Deviation The standard deviation is one of the most important measures of dispersion. It is much more accurate than the range or inter quartile range. It takes into account all values and is not unduly affected by extreme values.

What does it measure? It measures the dispersion (or spread) of figures around the mean. A large number for the standard deviation means there is a wide spread of values around the mean, whereas a small number for the standard deviation implies that the values are grouped close together around the mean.

The formula You may need to sit down for this! σ = √{∑ (x - ẍ ) 2 / n} This is the symbol for the standard deviation

Semi-worked example We are going to try and find the standard deviation of the minimum temperatures of 10 weather stations in Britain on a winters day. The temperatures are: 5, 9, 3, 2, 7, 9, 8, 2, 2, 3 (˚Centigrade)

To calculate the standard deviation we construct a table like this one: (x - ẍ ) 2 ∑(x - ẍ ) 2 = ∑(x - ẍ ) 2 /n = √∑(x - ẍ ) 2 /n = (x - ẍ ) ẍ x ∑x = ẍ = ∑x/n = There should be enough space here to fit in the number of values. Eg: there are 10 temperatures so leave 10 lines. x = temperature --- ẍ = mean temperature --- √ = square root ∑ = total of = squared --- n = number of values

x = temperature --- ẍ = mean temperature --- √ = square root ∑ = total of = squared --- n = number of values To calculate the standard deviation we construct a table like this one: (x - ẍ ) 2 ∑(x - ẍ ) 2 = ∑(x - ẍ ) 2 /n = √∑(x - ẍ ) 2 /n = (x - ẍ ) ẍ x ∑x = ẍ = ∑x/n = Next we write the values (temperatures) in column x (they can be in any order)

(x - ẍ ) 2 ∑(x - ẍ ) 2 = ∑(x - ẍ ) 2 /n = √∑(x - ẍ ) 2 /n = (x - ẍ ) ẍ x ∑x = ẍ = ∑x/n = x = temperature --- ẍ = mean temperature --- √ = square root ∑ = total of = squared --- n = number of values Add them up (∑x) Calculate the mean ( ẍ) 50/10 = 5 50

(x - ẍ ) 2 ∑(x - ẍ ) 2 = ∑(x - ẍ ) 2 /n = √∑(x - ẍ ) 2 /n = (x - ẍ ) ẍ x ∑x = ẍ = ∑x/n = x = temperature --- ẍ = mean temperature --- √ = square root ∑ = total of = squared --- n = number of values /10 = Write the mean temperature ( ẍ ) in every row in the second column.

(x - ẍ ) 2 ∑(x - ẍ ) 2 = ∑(x - ẍ ) 2 /n = √∑(x - ẍ ) 2 /n = (x - ẍ ) ẍ x ∑x = ẍ = ∑x/n = x = temperature --- ẍ = mean temperature --- √ = square root ∑ = total of = squared --- n = number of values /10 = Subtract each value (temperature) from the mean. It does not matter if you obtain a negative number

(x - ẍ ) 2 ∑(x - ẍ ) 2 = ∑(x - ẍ ) 2 /n = √∑(x - ẍ ) 2 /n = (x - ẍ ) ẍ x ∑x = ẍ = ∑x/n = x = temperature --- ẍ = mean temperature --- √ = square root ∑ = total of = squared --- n = number of values /10 = Square ( 2 ) all of the figures you obtained in column 3 to get rid of the negative numbers

(x - ẍ ) 2 ∑(x - ẍ ) 2 = ∑(x - ẍ ) 2 /n = √∑(x - ẍ ) 2 /n = (x - ẍ ) ẍ x ∑x = ẍ = ∑x/n = x = temperature --- ẍ = mean temperature --- √ = square root ∑ = total of = squared --- n = number of values /10 = Add up all of the figures that you calculated in column 4 to get ∑ (x - ẍ ) 2. 80

(x - ẍ ) 2 ∑(x - ẍ ) 2 = ∑(x - ẍ ) 2 /n = √∑(x - ẍ ) 2 /n = (x - ẍ ) ẍ x ∑x = ẍ = ∑x/n = x = temperature --- ẍ = mean temperature --- √ = square root ∑ = total of = squared --- n = number of values /10 = Divide ∑(x - ẍ ) 2 by the total number of values (in this case 10 – weather stations) 8

(x - ẍ ) 2 ∑(x - ẍ ) 2 = ∑(x - ẍ ) 2 /n = √∑(x - ẍ ) 2 /n = (x - ẍ ) ẍ x ∑x = ẍ = ∑x/n = x = temperature --- ẍ = mean temperature --- √ = square root ∑ = total of = squared --- n = number of values /10 = Take the square root (√) of the figure to obtain the standard deviation. (Round your answer to the nearest decimal place) 8

Answer 2.8°C

Why? Standard deviation is much more useful. For example our 2.8 means that there is a 68% chance of the temperature falling within ± 2.8°C of the mean temperature of 5°C. That is one standard deviation away from the mean. Normally, values are said to lie between one, two or three standard deviations from the mean.

Where did the 68% come from? This is a normal distribution curve. It is a bell-shaped curve with most of the data cluster around the mean value and where the data gradually declines the further you get from the mean until very few data appears at the extremes.

For Example – peoples height Most people are near average height. Some are short Some are tall But few are very short And few are very tall.

If you look at the graph you can see that most of the data (68%) is located within 1 standard deviation on either side of the mean, even more (95%) is located within 2 standard deviations on either side of the mean, and almost all (99%) of the data is located within 3 standard deviations on either side of the mean.

Your turn Read page 32 – 33 in the blue book (It re-caps what we have just been talking about). Complete task 4 on page 33.

The coefficient of variation (This will seem easy compared to the standard deviation!)

Coefficient of variation The coefficient of variation indicates the spread of values around the mean by a percentage. Coefficient of variation = Standard Deviation x 100 mean

Things you need to know The higher the Coefficient of Variation the more widely spread the values are around the mean. The purpose of the Coefficient of Variation is to let us compare the spread of values between different data sets. 2 woodlands 3 sand dunes 7 rivers

Your Turn Read page 34 & 35 of the blue book about the ‘standard error of the mean’. Do you think you could use this in your study? Answer task 5 (page36).