Download presentation
Presentation is loading. Please wait.
Published bySophie Sharp Modified over 9 years ago
1
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every unit change in x Y-intercept or the value of y when x = 0.
2
Interpretation of parameters The regression slope is the average change in Y when X increases by 1 unit The intercept is the predicted value for Y when X = 0 If the slope = 0, then X does not help in predicting Y (linearly)
3
General ANOVA Setting Comparisons of 2 or more means Investigator controls one or more independent variables –Called factors (or treatment variables) –Each factor contains two or more levels (or groups or categories/classifications) Observe effects on the dependent variable –Response to levels of independent variable Experimental design: the plan used to collect the data
4
Logic of ANOVA Each observation Mean is different from the Grand (total sample) Mean by some amount There are two sources of variance from the mean: –1) That due to the treatment or independent variable –2) That which is unexplained by our treatment
5
One-Way Analysis of Variance Evaluate the difference among the means of two or more groups Examples: Accident rates for 1 st, 2 nd, and 3 rd shift Expected mileage for five brands of tires Assumptions –Populations are normally distributed –Populations have equal variances –Samples are randomly and independently drawn
6
Hypotheses of One-Way ANOVA -All population means are equal -i.e., no treatment effect (no variation in means among groups) -At least one population mean is different -i.e., there is a treatment effect -Does not mean that all population means are different (some pairs may be the same) - ANOVA does not tell you where the difference lies. For this reason,you need another test, either the Scheffe' or Tukey post ANOVA test.
7
One-Factor ANOVA All Means are the same: The Null Hypothesis is True (No Treatment Effect)
8
One-Factor ANOVA At least one mean is different: The Null Hypothesis is NOT true (Treatment Effect is present) or (continued)
9
Partitioning the Variation Total variation can be split into two parts: SST = Total Sum of Squares (Total variation) SSA = Sum of Squares Among Groups (Among-group variation) SSW = Sum of Squares Within Groups (Within-group variation) SST = SSA + SSW
10
Partitioning the Variation Total Variation = the aggregate dispersion of the individual data values across the various factor levels (SST) Within-Group Variation = dispersion that exists among the data values within a particular factor level (SSW) Among-Group Variation = dispersion between the factor sample means (SSA) SST = SSA + SSW (continued)
11
Partition of Total Variation Variation Due to Factor (SSA) Variation Due to Random Sampling (SSW) Total Variation (SST) Commonly referred to as: Sum of Squares Within Sum of Squares Error Sum of Squares Unexplained Within-Group Variation Commonly referred to as: Sum of Squares Between Sum of Squares Among Sum of Squares Explained Among Groups Variation = + d.f. = n – 1 d.f. = c – 1d.f. = n – c
12
Among-Group Variation Variation Due to Differences Among Groups Mean Square Among = SSA/degrees of freedom (continued)
13
Within-Group Variation Summing the variation within each group and then adding over all groups Mean Square Within = SSW/degrees of freedom (continued)
14
One-Way ANOVA Table Source of Variation dfSS MS (Variance) Among Groups SSAMSA = Within Groups n - cSSWMSW = Totaln - 1 SST = SSA+SSW c - 1 MSA MSW F ratio c = number of groups n = sum of the sample sizes from all groups df = degrees of freedom SSA c - 1 SSW n - c F =
15
One-Way ANOVA: F Test Statistic Test statistic MSA is mean squares among groups MSW is mean squares within groups Degrees of freedom –df 1 = c – 1 (c = number of groups) –df 2 = n – c (n = sum of sample sizes from all populations) H 0 : μ 1 = μ 2 = … = μ c H 1 : At least two population means are different
16
Interpreting One-Way ANOVA F Statistic The F statistic is the ratio of the among estimate of variance and the within estimate of variance –The ratio must always be positive – df 1 = c -1 will typically be small – df 2 = n - c will typically be large Decision Rule: Reject H 0 if F > F U, otherwise do not reject H 0 0 =.05 Reject H 0 Do not reject H 0 FUFU
17
One-Way ANOVA : F Test Example You want to see if cholesterol level is different in three groups. You randomly select five patients. Measure their cholesterol levels. At the 0.05 significance level, is there a difference in mean cholesterol? Gp 1 Gp 2 Gp 3 254 234 200 263 218 222 241 235 197 237 227 206 251 216 204
18
One-Way ANOVA Example: Scatter Diagram 270 260 250 240 230 220 210 200 190 Cholesterol Gp 1 Gp 2 Gp 3 254 234 200 263 218 222 241 235 197 237 227 206 251 216 204 Groups 1 2 3
19
One-Way ANOVA Example Computations Gp 1 Gp 2 Gp 3 254 234 200 263 218 222 241 235 197 237 227 206 251 216 204 X 1 = 249.2 X 2 = 226.0 X 3 = 205.8 X = 227.0 n 1 = 5 n 2 = 5 n 3 = 5 n = 15 c = 3 SSA = 5 (249.2 – 227) 2 + 5 (226 – 227) 2 + 5 (205.8 – 227) 2 = 4716.4 SSW = (254 – 249.2) 2 + (263 – 249.2) 2 +…+ (204 – 205.8) 2 = 1119.6 MSA = 4716.4 / (3-1) = 2358.2 MSW = 1119.6 / (15-3) = 93.3
20
F = 25.275 One-Way ANOVA Example Solution H 0 : μ 1 = μ 2 = μ 3 H 1 : μ j not all equal = 0.05 df 1 = 2 df 2 = 12 Test Statistic: Decision: Conclusion: Reject H 0 at = 0.05 There is evidence that at least one μ j differs from the rest 0 =.05 F U = 3.89 Reject H 0 Do not reject H 0 Critical Value: F U = 3.89
21
Significant and Non-significant Differences Significant: Between > Within Non-significant: Within > Between
22
ANOVA (summary) Null hypothesis is that there is no difference between the means. Alternate hypothesis is that at least two means differ. Use the F statistic as your test statistic. It tests the between-sample variance (difference between the means) against the within-sample variance (variability within the sample). The larger this is the more likely the means are different. Degrees of freedom for numerator is k-1 (k is the number of treatments) Degrees of freedom for the denominator is n-k (n is the number of responses) If test F is larger than critical F, then reject the null. If p-value is less than alpha, then reject the null.
23
ANOVA (summary) WHEN YOU REJECT THE NULL For an one-way ANOVA after you have rejected the null, you may want to determine which treatment yielded the best results. Must do follow-on analysis to determine if the difference between each pair of means if significant (post ANOVA test).
24
One-way ANOVA (example) The study described here is about measuring cortisol levels in 3 groups of subjects : Healthy (n = 16) Depressed: Non-melancholic depressed (n = 22) Depressed: Melancholic depressed (n = 18)
25
Results Results were obtained as follows Source DF SS MS F P Grp. 2 164.7 82.3 6.61 0.003 Error 53 660.0 12.5 Total 55 824.7 Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev -+---------+---------+---------+----- 1 16 9.200 2.931 (------*------) 2 22 10.700 2.758 (-----*-----) 3 18 13.500 4.674 (------*------) -+---------+---------+---------+----- Pooled StDev = 3.529 7.5 10.0 12.5 15.0
26
Multiple Comparison of the Means - 1 Several methods are available depending upon whether one wishes to compare means with a control mean (Dunnett) or just overall comparison (Tukey and Fisher) Dunnett's comparisons with a control Critical value = 2.27 Control = level (1) of Grp. Intervals for treatment mean minus control mean Level Lower Center Upper -----+---------+---------+---------+-- 2 -1.127 1.500 4.127 (----------*----------) 3 1.553 4.300 7.047 (----------*----------) -----+---------+---------+---------+-- -1.0 1.5 4.0 7.0
27
Multiple Comparison of Means - 2 Tukey's pair wise comparisons Intervals for (column level mean) − (row level mean) 1 2 2 -4.296 1.296 3 -7.224 -5.504 -1.376 -0.096 Fisher's pair wise comparisons Intervals for (column level mean) − (row level mean) 1 2 2 -3.826 0.826 3 -6.732 -5.050 -1.868 -0.550 The End
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.