Basic Statistics Measures of Variability.

Slides:



Advertisements
Similar presentations
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Advertisements

Numerically Summarizing Data
Descriptive Statistics
Statistics.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 3 Chicago School of Professional Psychology.
Sampling Distributions
Biostatistics Unit 2 Descriptive Biostatistics 1.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
VARIABILITY. PREVIEW PREVIEW Figure 4.1 the statistical mode for defining abnormal behavior. The distribution of behavior scores for the entire population.
1 Chapter 4: Variability. 2 Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure.
Variability Ibrahim Altubasi, PT, PhD The University of Jordan.
Measures of Variability: Range, Variance, and Standard Deviation
Exploring Marketing Research William G. Zikmund
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
A Look at Means, Variances, Standard Deviations, and z-Scores
Intra-Individual Variability Intra-individual variability is greater among older adults (Morse 1993) –May be an indicator of the functioning of the central.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Basic Statistics Measures of Variability The Range Deviation Score The Standard Deviation The Variance.
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Chapter 4 Variability. Variability In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Variability. Statistics means never having to say you're certain. Statistics - Chapter 42.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
Variability Pick up little assignments from Wed. class.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Chapter 4: Variability. Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Variability. What Do We Mean by Variability?  Variability provides a quantitative measure of the degree to which scores in a distribution are spread.
Descriptive Statistics(Summary and Variability measures)
Descriptive Statistics Printing information at: Class website:
Outline Sampling Measurement Descriptive Statistics:
Essentials of Marketing Research William G. Zikmund
Statistics in Forensics
Dependent-Samples t-Test
Measures of Dispersion
INF397C Introduction to Research in Information Studies Spring, Day 12
2.5: Numerical Measures of Variability (Spread)
Basic Practice of Statistics - 5th Edition
Descriptive Statistics: Overview
Introductory Statistics
Central Tendency and Variability
Descriptive Statistics: Presenting and Describing Data
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Summary descriptive statistics: means and standard deviations:
Chapter 12 Using Descriptive Analysis, Performing
Numerical Descriptive Measures
Descriptive Statistics
MEASURES OF CENTRAL TENDENCY
Introduction Second report for TEGoVA ‘Assessing the Accuracy of Individual Property Values Estimated by Automated Valuation Models’ Objective.
Measures of Location Statistics of location Statistics of dispersion
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Teacher Introductory Statistics Lesson 2.4 D
Summary descriptive statistics: means and standard deviations:
Preview Bem, 2011, claimed that through nine experiments he had demonstrated the existence of precognition Failure to replicate: “Across seven experiments.
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Measures of Dispersion (Spread)
Numerical Descriptive Measures
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Presentation transcript:

Basic Statistics Measures of Variability

Measures of Variability The Range Deviation Score The Standard Deviation The Variance

STRUCTURE OF STATISTICS TABULAR Continuing with numerical approaches. DESCRIPTIVE GRAPHICAL NUMERICAL STATISTICS NUMERICAL CONFIDENCE INTERVALS INFERENTIAL TESTS OF HYPOTHESIS

STRUCTURE OF STATISTICS NUMERICAL DESCRIPTIVE MEASURES TABULAR CENTRAL TENDENCY DESCRIPTIVE GRAPHICAL NUMERICAL VARIABILITY SYMMETRY

STRUCTURE OF STATISTICS NUMERICAL DESCRIPTIVE MEASURES CENTRAL TENDENCY RANGE VARIABILITY VARIANCE NUMERICAL SYMMETRY STANDARD DEVIATION

Do you think your project will succeed ? We need the variability of IQs in the class! You are an elementary school teacher who has been assigned a class of fifth graders whose mean IQ is 115. Because children with IQ of 115 can handle more complex, abstract material, you plan many sophisticated projects for the year. Do you think your project will succeed ? General population 85% 55 70 85 100 115 130 145

Can you conclude that the pay for two occupations is equal? Is the average salary enough? We need the variability! Having graduated from college, you are considering two offers of employment. One in sales and the other in management. The pay is about same for both. After checking out the statistics for salespersons and managers at the library, you find that those who have been working for 5 years in each type of job also have similar averages. Can you conclude that the pay for two occupations is equal? Sales management Much more Much less $20,000

Single score Group of scores Central Tendency measures Mean IQ=118 IQ of 100 students

? ? ? More homogeneous Central Tendency Measures Measures of Central Tendency do not tell you the differences that exist among the scores

Same Mean---Different Variability So What? 4 How many are out here? 2 1 3 60 Central Tendency

1. The Range = The difference between the largest (Xmax) and the smallest (Xmin). 25, 21, 22, 23, 28, 26, 24, 29 21 22 23 24 25 26 28 29 Range = 29 –21 = 8 A large range means there is a lot of variability in data.

? Range = 29 –10 = 19 Drawbacks of The Range Range = 29 –26=3 10, 28, 26, 27, 29 ? Range = 29 –26=3 10 26 27 28 29 Range = 29 –10 = 19 The Range depends on only the two extreme scores

Range and Extreme Observations Because the range is determined by just two scores in the group, it ignores the spread of all scores except the largest and smallest. One aberrant score or outlier can be greatly increase the range

Before you determine the Range, all scores must be arranged in order Range and Measurement Scales Before you determine the Range, all scores must be arranged in order Country Code SES F Age 1, 2, 3, 1, 2, 3, 1, 2, 3, 1, 2, 3, 3-1=2 3-1=2 3-1=2 3-1=2 1=American 2=Asian 3=Mexican 1=Upper 2=Middle 3=Lower

Differences among Scores 3. The Variance Differences among Scores 78 49 49 41 88 35 66 53 44 66 49 27 83 53 95 27 Differences among Scores 81 35 42 62 72 57 41 81 49 49 63 35 41 53 ? 77 35 78 88 66

How can you determine the variability of each individual in group? Total Variability = Sum of Individual Variability How can you determine the variability of each individual in group? 72 22 22 55 70 3 12 67 The amount of Individual difference entirely depends on comparison criteria.

Can you figure out total amount of differences among scores ? Can you figure out how much each score is different from other scores ? Can you figure out total amount of differences among scores ?

You need a Common Criteria for computing Total Variability 46 48 47 Mean Score 53 ? 49 Reference score 51 50 45 52

You need a Common Criteria for computing Total Variability -3 46 -1 48 -2 47 Deviation Scores 49 +4 53 ? 49 Reference score +2 51 50 +1 45 -4 +3 52 A Deviation score tells you that a particular score deviate, or differs from the mean

DEVIATION SCORE = (Xi - Mean) A score a great distance from the mean will have large deviation score. 2 1 3 Mean A B C D E F

Sum of Deviation Scores Total amount of variability?! Sum of all distance values! mathematically No way! conceptually

The idea makes sense…but If you compute the sum of the deviation scores, the sum of the deviation scores equals zero! Sum of Deviation scores = (-4) + (-3) + (-2) + 0 + (1) + (2) + (3) + (4) = 0

The Sum of Absolute Deviation Scores The sum of absolute deviations is rarely used as a measure of variability because the process of taking absolute values does not provide meaningful information for inferential statistics.

Sum of Squares of deviation scores “SS” Conceptually And Mathematically

SS= Sum of Squares of Deviation Scores, SS Instead of working with the absolute values of deviation scores, it is preferable to (1) square each deviation score and (2) sum them to obtain a quantity know as the Sum of Squares. SS=(-4)+(-3)+(-2)+(-1)+0+(1)+(2)+(3)+(4) =16+9+4+1+0+1+2+9+16 =60 2 2 2 2 2 2 2 2 2 SS= i

So ! Group of scores “B” Group of scores “A” SS(A)=30 SS(B)=40 Can you say that the variability of the data in Group B is greater than the data in Group A?

3, 4 3, 4 =.50 What happens to SS when we look at some data? Group A Group B Mean = 3.5 Mean = 3.5 2 2 2 2 SS = (3 - 3.5) + (4 - 3.5) =.50 SS = (3 - 3.5) + (4 - 3.5) + (3 - 3.5) + (4 - 3.5) =1.00 2 2

SS tends to increase as number of data(N) increase. Mean i i=1 SS tends to increase as number of data(N) increase. SS is not appropriate for comparing variability among groups having unequal sample size. How can you overcome the limitation of SS

The resulting value will be Mean of the Deviation Scores (Mean Square) If SS is divided by N The resulting value will be Mean of the Deviation Scores (Mean Square) VARIANCE

3, 4 3, 4 Group A Group B Mean = 3.5 Mean = 3.5 2 2 2 2 V = (3 - 3.5) + (4 - 3.5) =.50/2 = .25 V = (3 - 3.5) + (4 - 3.5) + (3 - 3.5) + (4 - 3.5) = 1.00/4 = 2.5 2 2

Variance Population Variance Sample Variance

POPULATION VARIANCE Individual value Population mean Sigma Square Population size

SAMPLE VARIANCE Sample variance Sample size-1 Degree of freedom Sample Mean Individual value Sample variance Sample size-1 Degree of freedom The sample variance (S2) is used to estimate the population variance (2)

Why n-1 instead n ?

population 100 = Sampling n Sampling error sample ?<100<?

Ideally, a sample variance would be based on (x - m)2 Ideally, a sample variance would be based on (x - m)2. This is impossible since m is not known if one has only a sample of n cases. m is substituted by . The value of the squared deviations is less from X than from any other score . Hence, in a sample, the value of (X-X) n would be less than n. > n n

> Ideal sample variance One could correct for this bias by dividing by a factor somewhat less than n

7 sample n=5 We know that ? must equal 25. If we know that the mean is equal to 5, and the first 4 scores add to 18, then the last score MUST equal 7. We know that ? must equal 25. n-1 are free to change Degree of freedom

4. Standard Deviation SD Positive square root of the variance Population Sample

The Standard Deviation and the Mean with Normal Distribution

Relationship between m and s Normal Distribution m m-3s m-2s m-1s m+1s m+2s m+3s Relationship between m and s

S 68% 95% 99.9% Relationship between and Normal Distribution -3S -2S

EMPIRICAL RULE For any symmetrical, bell-shaped distribution, approximately 68% of the observations will lie within  1 standard deviation of the mean; approximately 98% within  2 standard deviations of the mean; and approximately 99.9% within  3 standard deviations of the mean.

You can approximately reproduce your data! If a set of data has a Mean=50 and SD=10, then… 68% 95% 99% 20 30 40 50 60 70 80