Recap All about measures of location Mean Median Mode

Slides:



Advertisements
Similar presentations
Quantitative Methods in HPELS 440:210
Advertisements

Measures of Central Tendency and Variation 11-5
CHAPTER 1 Exploring Data
Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range.
Measures of Dispersion
VARIABILITY. PREVIEW PREVIEW Figure 4.1 the statistical mode for defining abnormal behavior. The distribution of behavior scores for the entire population.
CHAPTER 2: Describing Distributions with Numbers
Chapter 2 Describing Data with Numerical Measurements
Describing distributions with numbers
Working with one variable data. Spread Joaquin’s Tests Taran’s Tests: 76, 45, 83, 68, 64 67, 70, 70, 62, 62 What can you infer, justify and conclude about.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
CHAPTER 2: Describing Distributions with Numbers ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Objectives 1.2 Describing distributions with numbers
Methods for Describing Sets of Data
Chapter 3 Averages and Variations
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 4 Variability. Variability In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Table of Contents 1. Standard Deviation
Describing distributions with numbers
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Measures of Dispersion How far the data is spread out.
Numerical Statistics Given a set of data (numbers and a context) we are interested in how to describe the entire set without listing all the elements.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
Unit 3 Lesson 2 (4.2) Numerical Methods for Describing Data
Quantitative data. mean median mode range  average add all of the numbers and divide by the number of numbers you have  the middle number when the numbers.
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
What are the effects of outliers on statistical data?
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Numerical descriptions of distributions
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
CHAPTER 1 Exploring Data
Measures of Central Tendency and Variation
Notes 13.2 Measures of Center & Spread
Measures of dispersion
Chapter 6 ENGR 201: Statistics for Engineers
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
1.2 Describing Distributions with Numbers
CHAPTER 1 Exploring Data
Describing Quantitative Data with Numbers
Measures of Central Tendency and Variation 8-1
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
MATH 2400 – Ch. 2 Vocabulary Mean: the average of a set of data sum/n
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

Recap All about measures of location Mean Median Mode measures of centre Mean Median Mode measures of Any Position Percentiles You should be able to calculate these from grouped and raw data You should also be able to draw a box and whisker plot MH-Variance -Kuwait

This week Measures of Spread Sample of Heights of peoples in Coventry and Norwich We need more then the mean to compare data sets We need a numerical measure representing how the data varies MH-Variance -Kuwait

Measures of Spread Range Inter Quartile Range Variance Standard Deviation This hour lesson we concentrate on how to calculate the following two measures MH-Variance -Kuwait

Range = largest value - smallest value MH-Variance -Kuwait

Interquartile Range The interquartile range of a data set is the difference between the third quartile and the first quartile. It is the range for the middle 50% of the data. It overcomes the sensitivity to extreme data values. 375 400 425 450 475 500 525 550 575 600 625

Interquartile Range = Q3 - Q1 = 525 - 445 = 80 L25= (n+1)*25/100 71/4 = 17.75 18th value L75= (n+1)*75/100 71*3/4 = 53.25 53th value 3rd Quartile (Q3) = 525 1st Quartile (Q1) = 445 Interquartile Range = Q3 - Q1 = 525 - 445 = 80 MH-Variance -Kuwait

Basic Notation As we will be working with formulas we need to make sure about some notation Data set “X” 10, 30, 301 , 46, 18, 21, 19, 83, 4, .............., 88 x1 x2 x3 x4 x5 x6 x5 x6 x7 xn We often refer to a data set with an upper case letter like X, In which case the numbers in the data set are called elements (x1, x2, ..., xn) “n” or “N” is the number of elements or observations MH-Variance -Kuwait

Net deviations from the mean will always sum to zero So “total distance” from the mean is zero Because +ve and –ve contributions cancel MH-Variance -Kuwait

Measures of data Spread But we want a measure that will represent these net deviations somehow. One way to ensure a non-zero result is to square each deviation before adding it. We can then average these deviations by dividing by their number “n” and use this compare data sets OR, we can average and take the square root of the above This latter approach will have the same units as the underlying data. Variance Units squared Standard deviation Units of Units MH-Variance -Kuwait

Calculate the Variance for the following data set This data relates to Measures of distance travelled to work in units of (miles) 10 3.5 27 12 2 -0.9 -7.4 16.1 1.1 -8.9 0.81 54.76 259.21 1.21 79.21 395.2 This is the population variance (miles2) This is the population standard deviation (miles) Mean is 10.9 n=5 Units in miles MH-Variance -Kuwait

Population Variance for Grouped Data Mi is calls midpoint our Xi MH-Variance -Kuwait

Variance for Grouped Data For sample data For population data Sample variance s2 is commonly referred to by σ2n-1 Sample Standard Deviation s is commonly referred to by σn-1 So why is the sample measure divided by (n-1) ? – will deal with this soon! MH-Variance -Kuwait

Formulae RAW DATA Sample Variance RAW DATA Population Variance GROUPED DATA Sample Variance GROUPED DATA Population Variance MH-Variance -Kuwait

Things will now do 1- Understand why the following two formulas are the same and appreciate that the second form is much quicker to calculate than the first form 2- I would like you think of calculating variance as or Where Sxx can be calculated in different ways and can be divided appropriately dependent on whether we have a sample or population 3- We should investigate why we average , S2 , by (n-1) when we are dealing with a sample We will deal with this third and unusual point next!! MH-Variance -Kuwait

Why we divide by (n-1) Population Sample We take a random sample from the population and use it to estimate σ2 v v Sample v v MH-Variance -Kuwait

We are trying to estimate the true population mean σ2 In the real world we take a sample and use it Population Sample I am going to show you that S2 will be the better estimator of the true population variance, σ2 MH-Variance -Kuwait

Taking Lots of Samples of fixed size n & Build distributions of S2 and σ2 MH-Variance -Kuwait

Calculating s2 and σ2 of many samples , grouping and counting we can build distributions for S2 dist’n σs2 dist’n <σ2 σ2 RED distribution is centered around the real population variance MH-Variance -Kuwait

Showing = σ2 I will generate a Population of numbers AVG(S2) AVG(σs2) Row 1 Sample 1 S2 σs2 Row 2- Sample 2 S2 σs2 I will generate a Population of numbers And calculate the Pop Var (σ2) Row 3 Sample 3 S2 σs2 Row 4 Sample 4 S2 σS2 Then show that AVG(S2) = σ2 AVG(σs2) < σ2 Therefore E(S2)= σ2 Row 100 Sample 100 S2 σs2 MH-Variance -Kuwait

Summary Some questions We have looked at the formula for calculating Variance and Its square root Std- Deviation We have noted that we average by n or n-1 depending on whether or not we are working with a sample or population We have noted that that we can write Sxx = in different ways that are faster to calculate. We should work these different ways through shortly But first Some questions MH-Variance -Kuwait