Download presentation
Presentation is loading. Please wait.
Published byDominic Allison Modified over 9 years ago
1
Inquiry 1 written and oral reports are due in lab the week of 9/29. Today: Statistics This weekend, on 6th street. Pecan Street Arts Festival (www.oldpecanstreetfestival.com)
2
Most statistical work can be done, and more easily done, by computer using programs such as: MS Excel SPSS SAS
3
Most statistical work can be done, and more easily done, by computer using programs such as: MS Excel is the most common. Available from UT for cheap, ~$30. If you have not used it, start practicing now.
4
The Basics: Mean, median, and mode
5
Mean- aka the average. Sum of all numbers divided by the number of data points. (14+17+7+6+4+11+8)/7 = 9.57
6
Median- the middle number of a group of ordered numbers 14 17 7 6 4 11 8 4 6 7 8 11 14 17 median is 8
7
Median- the middle number of a group of ordered numbers 14 17 7 6 4 11 8 46 7 8 11 14 17 median is 8 What about 4 6 7 11 14 17?
8
Median- the middle number of a group of ordered numbers 14 17 7 6 4 11 8 46 7 8 11 14 17 median is 8 What about 4 6 7 11 14 17? Median is 9.
9
Mode- the most common value in a group. 9, 8, 3, 4, 5, 2, 4, 5, 2, 3, 6, 1, 6, 2, 3, 9, 2, 6 Mode is 2
10
Why are there 3 ways to analyze a group of numbers?
11
The mean is the most common form of analysis.
12
Why are there 3 ways to analyze a group of numbers? The mean is the most common form of analysis. 2, 3, 2, 4, 2, 7, 2, 5, 3, 2, 5, 4, 3, 5, 6, 121, 130 Mean = 18
13
Why are there 3 ways to analyze a group of numbers? 2, 3, 2, 4, 2, 7, 2, 5, 3, 2, 5, 4, 3, 5, 6, 121, 130 Mean = 18 Is this an accurate representation of these numbers?
14
Why are there 3 ways to analyze a group of numbers? 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7, 121, 130 Median = 4 Mean = 18 Median can be more accurate when there are a few especially large or small numbers.
15
What is your favorite color?
16
Mode can be used with non-numerical data.
17
Is there a numerical way to determine the accuracy of our analysis? 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7, 121, 130 Median = 4 Mean = 18
18
Is there a numerical way to determine the accuracy of our analysis? 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7, 121, 130 Median = 4 Mean = 18 Standard Deviation is a measure of variability.
19
Standard deviation is a measure of variability. The standard deviation is the root mean square (RMS) deviation of the values from their arithmetic mean. where ∑ = Sum of X = Individual score M = Mean of all scores N = Sample size (Number of scores)
20
Is there a numerical way to determine the accuracy of our analysis? 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7, 121, 130 Mean = 18 Standard deviation = 40.5 Standard deviation is a measure of variability.
21
Is there a numerical way to determine the accuracy of our analysis? 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 Mean = 3.67 Standard deviation = 1.6 Standard deviation is a measure of variability.
22
Is there a numerical way to determine the accuracy of our analysis? 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 (121, 130) Mean = 3.67 Median was 4
23
Is there a numerical way to determine the accuracy of our analysis? 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 Mean = 3.67 ±1.6 Standard deviation is a measure of variability.
24
Percent of data at 1, 2, or 3 standard deviations from the mean http://www.westgard.com/lesson34.htm 3.7 2.15.3 -1.1 0.56.9 8.5
25
http://www.westgard.com/lesson34.htm 3.7 2.15.3 -1.1 0.56.9 8.5 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 8/15=53% 14/15=93% 15/15=100%
26
Standard deviation can also be used to determine if 2 data sets are statistically different.
27
Standard deviation can also be used to determine if 2 data sets are statistically different… Set 1= 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 Mean = 3.67 ±1.6 And Set 2= 8, 6, 7, 8, 9, 5, 6, 7, 9, 8, 9, 5 Mean = 7.25 ± 1.48
28
Do the standard deviations overlap? Set 1= 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 Mean = 3.67 ±1.6 range = 2.07 to 5.27 And Set 2= 8, 6, 7, 8, 9, 5, 6, 7, 9, 8, 9, 5 Mean = 7.25 ± 1.48 range = 5.77 to 8.73 No overlap, might be different
29
Do the standard deviations overlap? Set 1= 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 Mean = 3.67 ±1.6 range = 2.07 to 5.27 And Set 2= 8, 6, 7, 8, 9, 5, 6, 7, 9, 8, 4, 5 Mean = 6.83 ± 1.64 range = 5.19 to 8.47 Overlap, different means, but might not be a statistically significant difference
30
How significant of a difference is this? Set 1= 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 Mean = 3.67 ±1.6 range = 2.07 to 5.27 And Set 2= 8, 6, 7, 8, 9, 5, 6, 7, 9, 8, 9, 5 Mean = 7.25 ± 1.48 range = 5.77 to 8.73 No overlap, might be different
31
The ‘Students’ T-test is a method to assign a numerical value of statistical difference.
32
X 1 –X 2 T= n1n1 Sx 1 22 + n2n2 Sx 2
33
The ‘Students’ T-test is a method to assign a numerical value of statistical difference. X 1 –X 2 T= n1n1 Sx 1 22 + n2n2 Sx 2 T is then used to look up the P-value from a table. Also need ‘degrees of freedom’ = (n 1 +n 2 )-1.
34
P-value Df0.050.020.01 112.7131.8263.66 24.3036.9659.925 33.1824.5415.841 42.7763.7474.604 52.5713.3654.032 62.4473.1433.707 72.3652.9983.499 82.3062.8963.355 92.2622.8213.250 102.2282.7643.169 T Partial table for determining P from T
35
How significant of a difference is this? Using MS Excel to get a P value = 3.44x10 -6. Set 1= 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 Mean = 3.67 ±1.6 And Set 2= 8, 6, 7, 8, 9, 5, 6, 7, 9, 8, 9, 5 Mean = 7.25 ± 1.48
36
How significant of a difference is this? MS Excel P value = 3.44x10 -6. So the chance that these 2 sets of data are significantly different is 1 - 3.44x10 -6 or 0.999996559 Set 1= 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 Mean = 3.67 ±1.6 And Set 2= 8, 6, 7, 8, 9, 5, 6, 7, 9, 8, 9, 5 Mean = 7.25 ± 1.48
37
How significant of a difference is this? MS Excel P value = 3.44x10 -6. So the chance that these 2 sets of data are significantly different is 1 - 3.44x10 -6 or 0.999996559 We can be 99.9996559% certain that the difference is statistically significant. Set 1= 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 Mean = 3.67 ±1.6 Set 2= 8, 6, 7, 8, 9, 5, 6, 7, 9, 8, 9, 5 Mean = 7.25 ± 1.48
38
Overlap, different means, but might not be a statistically significant difference Set 1= 2, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7 Mean = 3.67 ±1.6 range = 2.07 to 5.27 Set 2= 8, 6, 7, 8, 9, 5, 6, 7, 9, 8, 4, 5 Mean = 6.83 ± 1.64 range = 5.19 to 8.47 MS Excel P-value = 4.41 x 10 -5
39
What is, or is not, a statistically significant difference? 20% random difference : 80% confidence 10% random difference : 90% confidence 5% random difference : 95% confidence 1% random difference : 99% confidence 0.1% random difference : 99.9% confidence
40
Generally a P-value of 0.5 or less is considered a statistically significant difference. 20% random difference : 80% confidence 10% random difference : 90% confidence 5% random difference : 95% confidence 1% random difference : 99% confidence 0.1% random difference : 99.9% confidence
41
Standard deviation is NOT a valid method for determining statistical signifigance. T-test is one valid and accurate method for determining if 2 means have a statistically significant difference, or if the difference is merely by chance.
42
Inquiry 1 written and oral reports are due in lab the week of 9/29. more Statistics next week This weekend, on 6th street. Pecan Street Arts Festival (www.oldpecanstreetfestival.com)
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.