Download presentation
Presentation is loading. Please wait.
Published byHandoko Jayadi Modified over 6 years ago
3
What if. . . You were asked to determine if psychology and sociology majors have significantly different class attendance (i.e., the number of days a person misses class) You would simply do a two-sample t-test two-tailed Easy!
4
But, what if. . . You were asked to determine if psychology, sociology, and biology majors have significantly different class attendance You can’t do a two-sample t-test You have three samples No such thing as a three sample t-test!
5
One-Way ANOVA ANOVA = Analysis of Variance
This is a technique used to analyze the results of an experiment when you have more than two groups
6
Example You measure the number of days 7 psychology majors, 7 sociology majors, and 7 biology majors are absent from class You wonder if the average number of days each of these three groups was absent is significantly different from one another
7
Hypothesis Alternative hypothesis (H1)
H1: The three population means are not all equal
8
Hypothesis Alternative hypothesis (H1) socio = bio
9
Hypothesis Alternative hypothesis (H1) socio = psych
10
Hypothesis Alternative hypothesis (H1) psych = bio
11
Hypothesis Alternative hypothesis (H1) psych = bio = soc
12
Hypothesis Alternative hypothesis (H1)
Notice: It does not say where this difference is at!!
13
Hypothesis Null hypothesis (H0) psych = socio = bio
In other words, all three means are equal to one another (i.e., no difference between the means)
14
Results X = 3.00 X = 2.00 X = 1.00
15
Logic Is the same as t-tests
1) calculate a variance ratio (called an F; like t-observed) 2) Find a critical value 3) See if the the F value falls in the critical area
16
Between and Within Group Variability
Two types of variability Between / Treatment the differences between the mean scores of the three groups The more different these means are, the more variability!
17
Between Variability Compute S2 on the means X = 3.00 X = 2.00 X = 1.00
18
Between Variability S2 = 1 X = 3.00 X = 2.00 X = 1.00
19
Between Variability + 5 X = 3.00 X = 2.00 X = 1.00
20
Between Variability X = 8.00 X = 2.00 X = 1.00
21
Between Variability Compute S2 on the means X = 8.00 X = 2.00 X = 1.00
22
Between Variability S2 = 14.33 X = 8.00 X = 2.00 X = 1.00
23
Between Group Variability
What causes this variability to increase? 1) Effect of the variable (college major) 2) Sampling error
24
Between and Within Group Variability
Two types of variability Within / Error the variability of the scores within each group
25
Within Variability Compute S2 within each group X = 3.00 X = 2.00
26
Within Variability S2 =.67 S2 =1.67 S2 =.67 X = 3.00 X = 2.00 X = 1.00
27
Within Group Variability
What causes this variability to increase? 1) Sampling error
28
Between and Within Group Variability
Between-group variability Within-group variability
29
Between and Within Group Variability
sampling error + effect of variable sampling error
30
Between and Within Group Variability
sampling error + effect of variable sampling error Thus, if null hypothesis was true this would result in a value of 1.00
31
Between and Within Group Variability
sampling error + effect of variable sampling error Thus, if null hypothesis was not true this value would be greater than 1.00
32
Calculating this Variance Ratio
33
Calculating this Variance Ratio
34
Calculating this Variance Ratio
35
Degrees of Freedom dfbetween dfwithin dftotal
dftotal = dfbetween + dfwithin
36
Degrees of Freedom dfbetween = k - 1 (k = number of groups)
dfwithin = N - k (N = total number of observations) dftotal = N - 1 dftotal = dfbetween + dfwithin
37
Degrees of Freedom dfbetween = k - 1 3 - 1 = 2
dfwithin = N - k = 18 dftotal = N = 20 20 =
38
Sum of Squares SSBetween SSWithin SStotal
SStotal = SSBetween + SSWithin
39
Sum of Squares SStotal
40
Sum of Squares SSWithin
41
Sum of Squares SSBetween
43
Sum of Squares Ingredients: X X2 Tj2 N n
44
To Calculate the SS
45
X Xs = 21 Xp = 14 XB = 7
46
X X = 42 Xs = 21 Xp = 14 XB = 7
47
X2 X = 42 Xs = 21 Xp = 14 XB = 7 X2s = 67 X2P = 38 X2B = 11
48
X2 X = 42 X2 = 116 Xs = 21 Xp = 14 XB = 7 X2s = 67 X2P = 38
49
T2 = (X)2 for each group X = 42 X2 = 116 Xs = 21 Xp = 14 XB = 7
T2P = 196 T2B = 49 T2s = 441
50
Tj2 X = 42 X2 = 116 Tj2 = 686 Xs = 21 Xp = 14 XB = 7 X2s = 67
T2P = 196 T2B = 49 T2s = 441
51
N X = 42 X2 = 116 Tj2 = 686 N = 21 Xs = 21 Xp = 14 XB = 7
T2P = 196 T2B = 49 T2s = 441
52
n X = 42 X2 = 116 Tj2 = 686 N = 21 n = 7 Xs = 21 Xp = 14 XB = 7
T2P = 196 T2B = 49 T2s = 441
53
Ingredients X = 42 X2 = 116 Tj2 = 686 N = 21 n = 7
54
Calculate SS X = 42 X2 = 116 Tj2 = 686 N = 21 n = 7 SStotal
55
Calculate SS 42 32 116 21 SStotal X = 42 X2 = 116 Tj2 = 686 N = 21
56
Calculate SS X = 42 X2 = 116 Tj2 = 686 N = 21 n = 7 SSWithin
57
Calculate SS 686 18 116 7 SSWithin X = 42 X2 = 116 Tj2 = 686 N = 21
58
Calculate SS X = 42 X2 = 116 Tj2 = 686 N = 21 n = 7 SSBetween
59
Calculate SS 14 686 42 7 21 SSBetween X = 42 X2 = 116 Tj2 = 686
60
Sum of Squares SSBetween SSWithin SStotal
SStotal = SSBetween + SSWithin
61
Sum of Squares SSBetween = 14 SSWithin = 18 SStotal = 32 32 =
62
Calculating the F value
63
Calculating the F value
64
Calculating the F value
14 7 2
65
Calculating the F value
7
66
Calculating the F value
7 18 1 18
67
Calculating the F value
7 7 1
68
How to write it out
69
Significance Is an F value of 7.0 significant at the .05 level?
To find out you need to know both df
70
Degrees of Freedom Dfbetween = k - 1 (k = number of groups)
dfwithin = N - k (N = total number of observations)
71
Degrees of Freedom Dfbetween = k - 1 3 - 1 = 2
dfwithin = N - k = 18 Use F table Dfbetween are in the numerator Dfwithin are in the denominator Write this in the table
72
Critical F Value F(2,18) = 3.55 The nice thing about the F distribution is that everything is a one- tailed test
73
Decision Thus, if F value > than F critical
Reject H0, and accept H1 If F value < or = to F critical Fail to reject H0
74
Current Example F value = 7.00 F critical = 3.55
Thus, reject H0, and accept H1
75
Alternative hypothesis (H1)
H1: The three population means are not all equal In other words, psychology, sociology, and biology majors do not have equal IQs Notice: It does not say where this difference is at!!
76
How to write it out
80
Six Easy Steps for an ANOVA
1) State the hypothesis 2) Find the F-critical value 3) Calculate the F-value 4) Decision 5) Create the summary table 6) Put answer into words
81
Example Want to examine the effects of feedback on self-esteem. Three different conditions -- each have five subjects 1) Positive feedback 2) Negative feedback 3) Control Afterward all complete a measure of self-esteem that can range from 0 to 10.
82
Example: Question: Is the type of feedback a person receives significantly (.05) related their self-esteem?
83
Results
84
Step 1: State the Hypothesis
H1: The three population means are not all equal H0: pos = neg = cont
85
Step 2: Find F-Critical Step 2.1
Need to first find dfbetween and dfwithin Dfbetween = k (k = number of groups) dfwithin = N - k (N = total number of observations) dftotal = N - 1 Check yourself dftotal = Dfbetween + dfwithin
86
Step 2: Find F-Critical Step 2.1
Need to first find dfbetween and dfwithin Dfbetween = (k = number of groups) dfwithin = (N = total number of observations) dftotal = 14 Check yourself 14 =
87
Step 2: Find F-Critical Step 2.2
Look up F-critical using table F on pages F (2,12) = 3.88
88
Step 3: Calculate the F-value
Has 4 Sub-Steps 3.1) Calculate the needed ingredients 3.2) Calculate the SS 3.3) Calculate the MS 3.4) Calculate the F-value
89
Step 3.1: Ingredients X X2 Tj2 N n
90
Step 3.1: Ingredients
91
X X = 85 Xp = 40 Xn = 25 Xc = 20
92
X2 X = 85 X2 = 555 Xp = 40 Xn = 25 Xc = 20 X2n = 135 X2c = 90
93
T2 = (X)2 for each group X = 85 X2 = 555 Xp = 40 Xn = 25 Xc = 20
T2n = 625 T2c = 400 T2p = 1600
94
Tj2 X = 85 X2 = 555 Tj2 = 2625 Xp = 40 Xn = 25 Xc = 20
T2n = 625 T2c = 400 T2p = 1600
95
N X = 85 X2 = 555 Tj2 = 2625 N = 15 Xp = 40 Xn = 25 Xc = 20
T2n = 625 T2c = 400 T2p = 1600
96
n X = 85 X2 = 555 Tj2 = 2625 N = 15 n = 5 Xp = 40 Xn = 25
Xc = 20 X2n = 135 X2c = 90 X2p = 330 T2n = 625 T2c = 400 T2p = 1600
97
Step 3.2: Calculate SS SStotal X = 85 X2 = 555 Tj2 = 2625 N = 15
98
Step 3.2: Calculate SS 85 73.33 555 15 SStotal X = 85 X2 = 555
Tj2 = 2625 N = 15 n = 5 Step 3.2: Calculate SS SStotal 85 73.33 555 15
99
Step 3.2: Calculate SS SSWithin X = 85 X2 = 555 Tj2 = 2625 N = 15
100
Step 3.2: Calculate SS 2625 30 555 5 SSWithin X = 85 X2 = 555
Tj2 = 2625 N = 15 n = 5 Step 3.2: Calculate SS SSWithin 2625 30 555 5
101
Step 3.2: Calculate SS SSBetween X = 85 X2 = 555 Tj2 = 2625 N = 15
102
Step 3.2: Calculate SS 43.33 2625 85 5 15 SSBetween X = 85 X2 = 555
Tj2 = 2625 N = 15 n = 5 Step 3.2: Calculate SS SSBetween 43.33 2625 85 5 15
103
Step 3.2: Calculate SS Check! SStotal = SSBetween + SSWithin
104
Step 3.2: Calculate SS Check! 73.33 =
105
Step 3.3: Calculate MS
106
Step 3.3: Calculate MS 43.33 21.67 2
107
Calculating this Variance Ratio
108
Step 3.3: Calculate MS 30 2.5 12
109
Step 3.4: Calculate the F value
110
Step 3.4: Calculate the F value
21.67 8.67 2.5
111
Step 4: Decision If F value > than F critical
Reject H0, and accept H1 If F value < or = to F critical Fail to reject H0
112
Step 4: Decision If F value > than F critical
Reject H0, and accept H1 If F value < or = to F critical Fail to reject H0
113
Step 5: Create the Summary Table
114
Step 6: Put answer into words
Question: Is the type of feedback a person receives significantly (.05) related their self-esteem? H1: The three population means are not all equal The type of feedback a person receives is related to their self-esteem
115
SPSS
117
Practice You are interested in comparing the performance of three models of cars. Random samples of five owners of each car were used. These owners were asked how many times their car had undergone major repairs in the last 2 years.
118
Results
119
Practice Is there a significant (.05) relationship between the model of car and repair records?
120
Step 1: State the Hypothesis
H1: The three population means are not all equal H0: V = F = G
121
Step 2: Find F-Critical Step 2.1
Need to first find dfbetween and dfwithin Dfbetween = (k = number of groups) dfwithin = (N = total number of observations) dftotal = 14 Check yourself 14 =
122
Step 2: Find F-Critical Step 2.2
Look up F-critical using table F on pages F (2,12) = 3.88
123
Step 3.1: Ingredients X = 60 X2 = 304 Tj2 = 1400 N = 15 n = 5
124
Step 3.2: Calculate SS SStotal X = 60 X2 = 304 Tj2 = 1400 N = 15
125
Step 3.2: Calculate SS 60 64 304 15 SStotal X = 60 X2 = 304
Tj2 = 1400 N = 15 n = 5 Step 3.2: Calculate SS SStotal 60 64 304 15
126
Step 3.2: Calculate SS SSWithin X = 60 X2 = 304 Tj2 = 1400 N = 15
127
Step 3.2: Calculate SS 1400 24 304 5 SSWithin X = 60 X2 = 304
Tj2 = 1400 N = 15 n = 5 Step 3.2: Calculate SS SSWithin 1400 24 304 5
128
Step 3.2: Calculate SS SSBetween X = 60 X2 = 304 Tj2 = 1400 N = 15
129
Step 3.2: Calculate SS 40 1400 60 5 15 SSBetween X = 60 X2 = 304
Tj2 = 1400 N = 15 n = 5 Step 3.2: Calculate SS SSBetween 40 1400 60 5 15
130
Step 3.2: Calculate SS Check! SStotal = SSBetween + SSWithin
131
Step 3.2: Calculate SS Check! 64 =
132
Step 3.3: Calculate MS
133
Step 3.3: Calculate MS 40 20 2
134
Calculating this Variance Ratio
135
Step 3.3: Calculate MS 24 2 12
136
Step 3.4: Calculate the F value
137
Step 3.4: Calculate the F value
20 10 2
138
Step 4: Decision If F value > than F critical
Reject H0, and accept H1 If F value < or = to F critical Fail to reject H0
139
Step 4: Decision If F value > than F critical
Reject H0, and accept H1 If F value < or = to F critical Fail to reject H0
140
Step 5: Create the Summary Table
141
Step 6: Put answer into words
Question: Is there a significant (.05) relationship between the model of car and repair records? H1: The three population means are not all equal There is a significant relationship between the type of car a person drives and how often the car is repaired
143
A way to think about ANOVA
Make no assumption about Ho The populations the data may or may not have equal means
144
A way to think about ANOVA
145
A way to think about ANOVA
The samples can be used to estimate the variance of the population Assume that the populations the data are from have the same variance It is possible to use the same variances to estimate the variance of the populations
146
S2 = .50 S2 = .50 S2 = 5.0
147
A way to think about ANOVA
148
A way to think about ANOVA
Assume about Ho is true The population mean are not different from each other They are three samples from the same population All have the same variance and the same mean
150
2,1,2,3,2,5,4,3,4,4,9,6,3,7,5
151
2,1,2,3,2,5,4,3,4,4,9,6,3,7,5
152
A way to think about ANOVA
Central Limit Theorem For any population of scores, regardless of form, the sampling distribution of the mean will approach a normal distribution a N (sample size) get larger. Furthermore, the sampling distribution of the mean will have a mean equal to and a standard deviation equal to / N
153
A way to think about ANOVA
Central Limit Theorem For any population of scores, regardless of form, the sampling distribution of the mean will approach a normal distribution a N (sample size) get larger. Furthermore, the sampling distribution of the mean will have a mean equal to and a standard deviation equal to / N
154
A way to think about ANOVA
Central Limit Theorem (remember) The variance of the means drawn from the same population equals the variance of the population divided by the sample size.
155
A way to think about ANOVA
Can estimate population variance from the sample means with the formula *This only works if the means are from the same population
156
A way to think about ANOVA
S2 = 4.00
157
A way to think about ANOVA
158
A way to think about ANOVA
*Estimates population variance only if the three means are from the same population
159
A way to think about ANOVA
*Estimates population variance regardless if the three means are from the same population
160
What do all of these numbers mean?
161
Why do we call it “sum of squares”?
SStotal SSbetween SSwithin Sum of squares is the sum the squared deviations about the mean
162
Why do we use “sum of squares”?
SS are additive Variances and MS are only additive if df are the same
163
Another way to think about ANOVA
Think in “sums of squares” Represents the SS of all observations, regardless of the treatment.
164
Another way to think about ANOVA
Overall Mean= 4
165
Another way to think about ANOVA
166
Another way to think about ANOVA
Think in “sums of squares” Represents the SS deviations of the treatment means around the grand mean Its multiplied by n to give an estimate of the population variance (Central limit theorem)
167
Overall Mean= 4
168
Another way to think about ANOVA
169
Another way to think about ANOVA
Think in “sums of squares” Represents the SS deviations of the observations within each group
170
Overall Mean= 4
171
Another way to think about ANOVA
172
Sum of Squares SStotal SSbetween SSwithin
The total deviation in the observed scores SSbetween The total deviation in the scores caused by the grouping variable and error SSwithin The total deviation in the scores not caused by the grouping variable (error)
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.