Variance Variance: Standard deviation:

Slides:



Advertisements
Similar presentations
Unit 16: Statistics Sections 16AB Central Tendency/Measures of Spread.
Advertisements

Measures of Dispersion and Standard Scores
The standard error of the sample mean and confidence intervals
Descriptive Statistics Statistical Notation Measures of Central Tendency Measures of Variability Estimating Population Values.
Measures of Spread The Range, Variance, and Standard Deviation.
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
Chapter 1 The mean, the number of observations, the variance and the standard deviation.
Ignore parts with eye-ball estimation & computational formula
Wednesday, October 3 Variability. nominal ordinal interval.
VARIABILITY. PREVIEW PREVIEW Figure 4.1 the statistical mode for defining abnormal behavior. The distribution of behavior scores for the entire population.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
The Basics  A population is the entire group on which we would like to have information.  A sample is a smaller group, selected somehow from.
1 Chapter 4: Variability. 2 Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure.
Variability Ibrahim Altubasi, PT, PhD The University of Jordan.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Central Tendency and Variability
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
MEASURES OF CENTRAL TENDENCY & DISPERSION Research Methods.
PXGZ6102 BASIC STATISTICS FOR RESEARCH IN EDUCATION Chap 3 - Measures of Variability – Standard Deviation, Variance.
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Chapter 6: Sampling Distributions
Quiz 2 Measures of central tendency Measures of variability.
Measures of Dispersion Week 3. What is dispersion? Dispersion is how the data is spread out, or dispersed from the mean. The smaller the dispersion values,
CRIM 483 Measuring Variability. Variability  Variability refers to the spread or dispersion of scores  Variability captures the degree to which scores.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Chapter 3 Descriptive Measures
Statistics 1 Measures of central tendency and measures of spread.
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Chapter 3 Basic Statistics Section 2.2: Measures of Variability.
Measures of Variability. Variability Measure of the spread or dispersion of a set of data 4 main measures of variability –Range –Interquartile range –Variance.
Module 11: Standard Deviations and the Like This module describes measures of dispersion or unlikeness, including standard deviations, variances,
Chapter 3 Central Tendency and Variability. Characterizing Distributions - Central Tendency Most people know these as “averages” scores near the center.
SECTION 12-3 Measures of Dispersion Slide
Statistics 11 The mean The arithmetic average: The “balance point” of the distribution: X=2 -3 X=6+1 X= An error or deviation is the distance from.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Math 3680 Lecture #2 Mean and Standard Deviation.
Educ 200C Wed. Oct 3, Variation What is it? What does it look like in a data set?
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Variability Pick up little assignments from Wed. class.
Standard Deviation Lecture 18 Sec Tue, Feb 15, 2005.
Measures of Location INFERENTIAL STATISTICS & DESCRIPTIVE STATISTICS Statistics of location Statistics of dispersion Summarise a central pointSummarises.
Chapter 4 Variability. Introduction Purpose of measures of variability Consider what we know if we know that the mean test score was 75 Single score to.
Chapter 4: Variability. Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability.
Standard Deviation. Two classes took a recent quiz. There were 10 students in each class, and each class had an average score of 81.5.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 4-6 Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor © 2013.
Sampling Distributions Chapter 9 First, a word from our textbook A statistic is a numerical value computed from a sample. EX. Mean, median, mode, etc.
Monday, September 27 More basics.. _ “Life is a series of samples, you can infer the truth from the samples but you never see the truth.”
Chapter 1 Lesson 7 Variance and Standard Deviation.
Chapter 2 The Mean, Variance, Standard Deviation, and Z Scores.
Measures of Dispersion
Measures of Central Tendency
Descriptive Statistics (Part 2)
Distribution of the Sample Means
Reasoning in Psychology Using Statistics
Central Tendency and Variability
Summary descriptive statistics: means and standard deviations:
Central Limit Theorem General version.
More basics: central tendency, variability, populations and samples.
Central Tendency.
Statistics Variability
Summary descriptive statistics: means and standard deviations:
Preview Bem, 2011, claimed that through nine experiments he had demonstrated the existence of precognition Failure to replicate: “Across seven experiments.
Political Science 30 Political Inquiry
Friday, October 2 Variability.
Standard Deviation How many Pets?.
Sample Standard Deviation
Introduction to Sampling Distributions
Lecture 4 Psyc 300A.
Standard Deviation Mean - the average of a set of data
The Mean Variance Standard Deviation and Z-Scores
Presentation transcript:

Variance Variance: Standard deviation: The average squared distance of observations from the mean. Its corresponding measure of central tendency is the mean. Standard deviation: The square root of the variance. Variance is in squared units, so can be hard to interpret. Standard deviation is in the original units of the measurements. These measure how much the data deviate from the mean. They use all the data in their calculations. Draw distributions with large and small variances on the board.

Variance Two versions depending on the context of data Population: all subjects of interest Every single person in the United States Entire UCM student body Calculation: 𝜎 2 = 𝑖=1 𝑁 𝑥 𝑖 −𝜇 2 𝑁 = 𝑆𝑆 𝑁 Population standard deviation: 𝜎= 𝜎 2 Sample: subset of population 100 random people in the US 100 random students from UCM 𝑠 2 = 𝑖=1 𝑛 𝑥 𝑖 − 𝑥 2 𝑛−1 = 𝑆𝑆 𝑛−1 Sample standard deviation: 𝑠= 𝑠 2 Point out what a deviation is. Make note that SS stands for sum of squares. Make a slight allusion to why s^2 is calculated with n-1 instead of n.

Population Standard Deviation (σ) Measure of how spread out the values are in a data set consisting of the entire population of interest (i.e., how much do the population values deviate from the mean?) Step 1: Calculate the deviation (distance) of each value (𝒙 𝒊 ) from the population mean (µ) 1 2 3 4 5 µ = 3 Step 2: Square each deviation, then add them up (−2) 2 + (−1) 2 + (0) 2 +(1) 2 +(2) 2 =10 Step 3: Divide the sum of squared deviations by the number of observations (N) 10/5=2 (population variance = σ 2 ) Step 4: Take the square root of the “average squared deviation” (population variance) 2 =1.41=𝝈 (3-3) = 0 (2-3) = -1 (4-3) = 1 (1-3) = -2 (5-3) = 2 You square these values because otherwise they will add up to 0 BY DEFINITION. Show why that is.

𝜎= (1−3) 2 + (2−3) 2 + (3−3) 2 + (4−3) 2 + (5−3) 2 5 =1.41 Formulas Population Standard Deviation 𝜎= 𝑖=1 𝑁 ( 𝑥 𝑖 −𝜇) 2 𝑁 𝜎= (1−3) 2 + (2−3) 2 + (3−3) 2 + (4−3) 2 + (5−3) 2 5 =1.41

Sample Standard Deviation (s) Measure of how spread out the values are in a data set consisting of a sample of the population of interest (i.e., how much do the sample values deviate from the mean?) Step 1: Calculate the deviation (distance) of each value ( 𝒙 𝒊 ) from the sample mean ( 𝒙 ) 1 2 3 4 5 𝑥 =3 Step 2: Square each deviation, then add them up (−2) 2 + (−1) 2 + (0) 2 +(1) 2 +(2) 2 =10 Step 3: Divide the sum of squared deviations by number of observations minus 1 (n-1) 10/(5−1)=10/4=2.5 (sample variance = 𝑠 2 ) Step 4: Take the square root of the “corrected average squared deviation” (sample variance) 2.5 =1.58=𝒔 (3-3) = 0 (2-3) = -1 (4-3) = 1 (1-3) = -2 (5-3) = 2

𝑠= (1−3) 2 + (2−3) 2 + (3−3) 2 + (4−3) 2 + (5−3) 2 5−1 =1.58 Formulas Sample Standard Deviation 𝑠= 𝑖=1 𝑛 ( 𝑥 𝑖 − 𝑥 ) 2 𝑛−1 𝑠= (1−3) 2 + (2−3) 2 + (3−3) 2 + (4−3) 2 + (5−3) 2 5−1 =1.58 Write the computational formula/proof on the board.

Simplified Formula Using the rules for summations, a simpler formula for the standard deviation can be found. 𝑠= 𝑖=1 𝑛 ( 𝑥 𝑖 − 𝑥 ) 2 𝑛−1 = 𝑖=1 𝑛 ( 𝑥 𝑖 2 −2 𝑥 𝑖 𝑥 + 𝑥 2 ) 𝑛−1 = 𝑖=1 𝑛 𝑥 𝑖 2 − 𝑖=1 𝑛 2 𝑥 𝑥 𝑖 + 𝑖=1 𝑛 𝑥 2 𝑛−1 = 𝑖=1 𝑛 𝑥 𝑖 2 −2 𝑥 𝑖=1 𝑛 𝑥 𝑖 +𝑛 𝑥 2 𝑛−1 = 𝑖=1 𝑛 𝑥 𝑖 2 −2𝑛 𝑥 2 +𝑛 𝑥 2 𝑛−1 = 𝑖=1 𝑛 𝑥 𝑖 2 −𝑛 𝑥 2 𝑛−1 𝑠= ( 1 2 + 2 2 + 3 2 + 4 2 + 5 2 )−5× 3 2 5−1 =1.58

Another Example Suppose we have a sample of 6 scores on a quiz: 5, 10, 3, 7, 2, 3 What is the mode for these data? 3 What is the median for these data? 4 What is the mean for these data? 5 What is the sample variance for these data? 46 5 =9.2 What is the sample standard deviation for these data? 9.2 ≈3.033

Population versus Sample SD The sample SD seeks to estimate the population SD, in the same way that the sample mean seeks to estimate the population mean The distribution of the sample mean is such that it may incorrectly estimate the population mean, but it’s no more likely to overestimate than it is to underestimate It is unbiased The sample SD, on the other hand, will consistently underestimate the population SD, so we divide it by a smaller number (n-1 instead of n) to correct for this underestimation error. Without correction, the sample SD and sample variance would be biased estimators