Download presentation
Presentation is loading. Please wait.
Published byRosalyn Cannon Modified over 9 years ago
1
Descriptive Statistics: Overview Measures of Center Mode Median Mean * Measures of Symmetry Skewness Measures of Spread Range Inter-quartile Range Variance Standard deviation * * Measures of Position Percentile Deviation Score Z-score * *
2
Central tendency Seeks to provide a single value that best represents a distribution
3
Central tendency
6
Seeks to provide a single value that best represents a distribution Typical measures are –mode –median –mean
7
Mode the most frequently occurring score value corresponds to the highest point on the frequency distribution For a given sample N=16: 33 35 36 37 38 38 38 39 39 39 39 40 40 41 41 45 The mode = 39
8
Mode The mode is not sensitive to extreme scores. For a given sample N=16: 33 35 36 37 38 38 38 39 39 39 39 40 40 41 41 50 The mode = 39
9
Mode a distribution may have more than one mode For a given sample N=16: 34 34 35 35 35 35 36 37 38 38 39 39 39 39 40 40 The modes = 35 and 39
10
Mode there may be no unique mode, as in the case of a rectangular distribution For a given sample N=16: 33 33 34 34 35 35 36 36 37 37 38 38 39 39 40 40 No unique mode
11
Median the score value that cuts the distribution in half (the “middle” score) 50th percentile For N = 15 the median is the eighth score = 37
12
Median For N = 16 the median is the average of the eighth and ninth scores = 37.5
13
Mean this is what people usually have in mind when they say “average” the sum of the scores divided by the number of scores Changing the value of a single score may not affect the mode or median, but it will affect the mean. For a population:For a sample:
14
Mean X=7.07 In many cases the mean is the preferred measure of central tendency, both as a description of the data and as an estimate of the parameter. __ In order for the mean to be meaningful, the variable of interest must be measures on an interval scale. 0 1 2 3 4 5 Buddhist Protestant Catholic Jewish Muslim Score Frequency X=2.4 __
15
Mean The mean is sensitive to extreme scores and is appropriate for more symmetrical distributions. X=36.8 __ X=36.5 __ X=93.2 __
16
a symmetrical distribution exhibits no skewness in a symmetrical distribution the Mean = Median = Mode Symmetry
17
Skewness refers to the asymmetry of the distribution Skewed distributions A positively skewed distribution is asymmetrical and points in the positive direction. Mode = 70,000$ Median = 88,700$ Mean = 93,600$ modemean median mode < median < mean
18
A negatively skewed distribution Skewed distributions mode > median > mean mode mean median
19
Measures of central tendency +- Mode quick & easy to compute useful for nominal data poor sampling stability Median not affected by extreme scoressomewhat poor sampling stability Mean sampling stability related to variance inappropriate for discrete data affected by skewed distributions
20
Distributions Center: mode, median, mean Shape: symmetrical, skewed Spread
21
Measures of Spread the dispersion of scores from the center a distribution of scores is highly variable if the scores differ wildly from one another Three statistics to measure variability –range –interquartile range –variance
22
Range largest score minus the smallest score these two have same range (80) but spreads look different says nothing about how scores vary around the center greatly affected by extreme scores (defined by them)
23
Interquartile range the distance between the 25th percentile and the 75th percentile Q3-Q1 = 70 - 30 = 40 Q3-Q1 = 52.5 - 47.5 = 5 effectively ignores the top and bottom quarters, so extreme scores are not influential dismisses 50% of the distribution
24
Deviation measures Might be better to see how much scores differ from the center of the distribution -- using distance Scores further from the mean have higher deviation scores ScoreDeviation Amy10-40 Theo20-30 Max30-20 Henry40-10 Leticia500 Charlotte6010 Pedro7020 Tricia8030 Lulu9040 AVERAGE50
25
Deviation measures To see how ‘deviant’ the distribution is relative to another, we could sum these scores But this would leave us with a big fat zero ScoreDeviation Amy10-40 Theo20-30 Max30-20 Henry40-10 Leticia500 Charlotte6010 Pedro7020 Tricia8030 Lulu9040 SUM0
26
Deviation measures So we use squared deviations from the mean ScoreDeviation Sq. Deviation Amy10-401600 Theo20-30900 Max30-20400 Henry40-10100 Leticia5000 Charlotte6010100 Pedro7020400 Tricia8030900 Lulu90401600 SUM06000 This is the sum of squares (SS) SS= ∑(X-X) 2 __
27
Variance We take the “average” squared deviation from the mean and call it VARIANCE (to correct for the fact that sample variance tends to underestimate pop variance) For a population: For a sample:
28
Variance 1.Find the mean. 2.Subtract the mean from every score. 3.Square the deviations. 4.Sum the squared deviations. 5.Divide the SS by N or N-1. ScoreDev’nSq. Dev. Amy10-401600 Theo20-30900 Max30-20400 Henry40-10100 Leticia5000 Charlotte6010100 Pedro7020400 Tricia8030900 Lulu90401600 SUM060006000/8 =750
29
The standard deviation is the square root of the variance The standard deviation measures spread in the original units of measurement, while the variance does so in units squared. Variance is good for inferential stats. Standard deviation is nice for descriptive stats. Standard deviation
30
Example N = 28 X = 50 s 2 = 555.55 s = 23.57 N = 28 X = 50 s 2 = 140.74 s = 11.86
31
Descriptive Statistics: Quick Review Measures of Center Mode Median Mean ** Measures of Symmetry Skewness Measures of Spread Range Inter-quartile Range Variance Standard deviation ** **
32
Descriptive Statistics: Quick Review For a population:For a sample: MeanVariance Standard Deviation
33
Treat this little distribution as a sample and calculate: –Mode, median, mean –Range, variance, standard deviation Exercise
34
Descriptive Statistics: Overview Measures of Center Mode Median Mean * Measures of Symmetry Skewness Measures of Spread Range Inter-quartile Range Variance Standard deviation * * Measures of Position Percentile Deviation Score Z-score * *
35
Measures of Position How to describe a data point in relation to its distribution
36
Quantile Deviation Score Z-score Measures of Position
37
Quantiles Quartile Divides ranked scores into four equal parts 25% (minimum)(maximum) (median)
38
Quantiles 10% Divides ranked scores into ten equal parts Decile
39
Quantiles Divides ranked scores into 100 equal parts Percentile rank of score x = 100 number of scores less than x total number of scores Percentile rank
40
Deviation Scores ScoreDeviation Amy10-40 Theo20-30 Max30-20 Henry40-10 Leticia500 Charlotte6010 Pedro7020 Tricia8030 Lulu9040 Average50 For a population: For a sample:
41
What if we want to compare scores from distributions that have different means and standard deviations? Example –Nine students scores on two different tests –Tests scored on different scales
42
Nine Students on Two Tests Test 1Test 2 Amy101 Theo202 Max303 Henry404 Leticia505 Charlotte606 Pedro707 Tricia808 Lulu909 Average505
43
Nine Students on Two Tests Test 1Test 2 Deviation Score 1 Deviation Score 2 Amy101-40-4 Theo202-30-3 Max303-20-2 Henry404-10 Leticia50500 Charlotte606101 Pedro707202 Tricia808303 Lulu909404 Average505
44
Z-Scores Z-scores modify a distribution so that it is centered on 0 with a standard deviation of 1 Subtract the mean from a score, then divide by the standard deviation For a population:For a sample:
45
Z-Scores Test 1Test 2Z- Score 1Z-Score 2 Amy101-1.5 Theo202-1.2 Max303-.77 Henry404-.34 Leticia50500 Charlotte606.34 Pedro707.77 Tricia8081.2 Lulu9091.5 Average50500 St Dev25.82.5811
46
A distribution of Z-scores… Z-Scores Always has a mean of zero Always has a standard deviation of 1 Converting to standard or z scores does not change the shape of the distribution: z scores cannot normalize a non-normal distribution A Z-score is interpreted as “number of standard deviations above/below the mean”
47
Exercise Test 3Z-Score Amy52 Theo39 Max-1.5 Henry1.3 On their third test, the class average was 45 and the standard deviation was 6. Fill in the rest.
48
Descriptive Statistics: Quick Review For a population:For a sample: Mean Variance Z-score Standard Deviation
49
:If you add or subtract a constant from each value in a distribution, then the mean is increased/decreased by that amount the standard deviation is unchanged the z-scores are unchanged 6 If you multiply or divide each value in a distribution by a constant, then the mean is multiplied/divided by that amount the standard deviation is multiplied/divided by that amount the z-scores are unchanged Messing with Units
50
Example ScoreDev’sSq devZ-score Theo 51-1.5 Max 3-39-.5 Henry 51.5 Leticia 711.5 Charlotte 7111.0 Pedro 824 Tricia 4-241.5 Lulu 939-.5 MEAN 6 STDEV 1.94
51
Adding 1 ScoreDev’sSq devZ-score Theo 61-1.5 Max 4-39-.5 Henry 61.5 Leticia 811.5 Charlotte 8111.0 Pedro 924 Tricia 5-241.5 Lulu 1039-.5 MEAN 7 STDEV 1.94
52
Example ScoreDev’sSq devZ-score Theo 51-1.5 Max 3-39-.5 Henry 51.5 Leticia 711.5 Charlotte 7111.0 Pedro 824 Tricia 4-241.5 Lulu 939-.5 MEAN 6 STDEV 1.94
53
Multiplying by 10 ScoreDev’sSq devZ-score Theo 50-10100-1.5 Max 30-30900-.5 Henry 50-10100.5 Leticia 7010100.5 Charlotte 70101001.0 Pedro 8020400 Tricia 40-204001.5 Lulu 9030900-.5 MEAN 60 STDEV 19.4
54
Other Standardized Distributions The Z distribution is not the only standardized distribution. You can easily create others (it’s just messing with units, really).
55
Score Theo5 Max3 Henry5 Leticia7 Charlotte7 Pedro8 Tricia4 Lulu9 Average6 St Dev1.94 Example: Let’s change these test scores into ETS type scores (mean 500, stdev 100) Other Standardized Distributions
56
ScoreZ-Score ETS type score Theo3-1.5350 Max5-.5450 Henry7.5550 Leticia7.5550 Charlotte81.0600 Pedro4400 Tricia91.5650 Lulu5-.5450 Average60500 St Dev1.941100 Here’s How: Convert to Z scores Multiply by 100 to increase the st dev Add 500 to increase the mean Other Standardized Distributions
57
Exercise ScorePercentile Deviation ScoreZ-Score IQ type score (Mean 100 Stdev 10) Theo20 Max18 Henry13 Leticia17 Charlotte19 Pedro16 Tricia11 Lulu9
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.