Analysis of Variance ANOVA Anwar Ahmad. ANOVA Samples from different populations (treatment groups) Any difference among the population means? Null hypothesis:

Slides:



Advertisements
Similar presentations
Analysis of Variance Compares means to determine if the population distributions are not similar Uses means and confidence intervals much like a t-test.
Advertisements

Comparing k Populations Means – One way Analysis of Variance (ANOVA)
PSY 307 – Statistics for the Behavioral Sciences
Independent Sample T-test Formula
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
ANOVA Analysis of Variance: Why do these Sample Means differ as much as they do (Variance)? Standard Error of the Mean (“variance” of means) depends upon.
Analysis of Variance: Inferences about 2 or More Means
ANalysis Of VAriance (ANOVA) Comparing > 2 means Frequently applied to experimental data Why not do multiple t-tests? If you want to test H 0 : m 1 = m.
PSY 307 – Statistics for the Behavioral Sciences
Intro to Statistics for the Behavioral Sciences PSYC 1900
Confidence Limits on Mean Sample mean is a point estimateSample mean is a point estimate We want interval estimateWe want interval estimate  Probability.
Lecture 9: One Way ANOVA Between Subjects
Testing for differences between 2 means Does the mean weight of cats in Toledo differ from the mean weight of cats in Cleveland? Do the mean quiz scores.
Analysis of Differences Between Two Groups Between Multiple Groups Independent Groups Dependent Groups Independent Groups Dependent Groups Independent.
One-way Between Groups Analysis of Variance
Chapter 9 - Lecture 2 Computing the analysis of variance for simple experiments (single factor, unrelated groups experiments).
Today Concepts underlying inferential statistics
Chapter 14 Inferential Data Analysis
Richard M. Jacobs, OSA, Ph.D.
COURSE: JUST 3900 Tegrity Presentation Developed By: Ethan Cooper Final Exam Review.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
Analysis of Variance (ANOVA) Quantitative Methods in HPELS 440:210.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
AM Recitation 2/10/11.
Selecting the Correct Statistical Test
QNT 531 Advanced Problems in Statistics and Research Methods
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
January 31 and February 3,  Some formulae are presented in this lecture to provide the general mathematical background to the topic or to demonstrate.
t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
Chapter 13 Analysis of Variance (ANOVA) PSY Spring 2003.
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,
Copyright © 2004 Pearson Education, Inc.
Testing Hypotheses about Differences among Several Means.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
The Completely Randomized Design (§8.3)
ANALYSIS OF VARIANCE (ANOVA) BCT 2053 CHAPTER 5. CONTENT 5.1 Introduction to ANOVA 5.2 One-Way ANOVA 5.3 Two-Way ANOVA.
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
General Linear Model 2 Intro to ANOVA.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
1 Experimental Statistics - week 9 Chapter 17: Models with Random Effects Chapter 18: Repeated Measures.
1 Experimental Statistics Spring week 6 Chapter 15: Factorial Models (15.5)
Experimental Statistics - week 3
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
Experimental Statistics - week 9
The p-value approach to Hypothesis Testing
Chapter 13 Understanding research results: statistical inference.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
1 Experimental Statistics - week 8 Chapter 17: Mixed Models Chapter 18: Repeated Measures.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 10 Introduction to the Analysis.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Chapter 12 Introduction to Analysis of Variance
Factorial Experiments
Comparing Three or More Means
Comparing Populations
Chapter 10 Introduction to the Analysis of Variance
Presentation transcript:

Analysis of Variance ANOVA Anwar Ahmad

ANOVA Samples from different populations (treatment groups) Any difference among the population means? Null hypothesis: no difference among the means

ANOVA Examples Effect of different lots of vaccine on antibody titer Effect of different measurement techniques on serum cholesterol determination from the same pool of serum

ANOVA Examples Water samples drawn at various location in a city Effect of antihypertensive drugs and placebo on mean systolic blood pressure

ANOVA Partitioning of the sum of squares The fundamental technique is a partitioning of the total sum of squares into components related to the effects used in the model.

Analysis of Variance ANOVA is a technique to differentiate between sample means to draw inferences about the presence or absence of variations between populations means.

ANOVA The key statistic in ANOVA is the F-test of difference of group means, testing if the means of the groups formed by values of the independent variable (or combinations of values for multiple independent variables) are different enough not to have occurred by chance.

ANOVA If the group means do not differ significantly then it is inferred that the independent variable(s) did not have an effect on the dependent variable. If the F test shows that overall the independent variable(s) is (are) related to the dependent variable, then multiple comparison tests of significance are used to explore just which values of the independent(s) have the most to do with the relationship.

ANOVA The overall test for differences among means. Used when we wish to determine significance among two or more means. H o =       

Analysis of Variance Analysis of variance is a technique for testing the null hypothesis that one or more samples were drawn at random from the same population. Like “t” or “z” the analysis of variance provides us with a test of significance. The “F” test provides an estimate of the experimental effect and an estimate of the error terms.

Analysis of Variance A procedure for determining how much of the total variability among scores to attribute to various sources of variation and for testing hypotheses concerning some of the sources.

Analysis of Variance A ratio is then made of the two independent variance estimates. This ratio is then compared with the critical f-ratio found in the F table.

One way-Analysis of Variance Consider the following experimental design with one experimental variable – dietary intervention to reduce body weight. ANOVA to evaluate the reduction in weight obtained when volunteer were given 4 dietary treatments. Using COMPLETELY RANDOMIZED DESIGN. 1 classification variable (dietary intervention). Randomly assign 5 volunteers to each of the 4 treatments for a total of 20.

Assumptions of ANOVA Assume: –Observations normally distributed within each population –Population (treatment) variances are equal Homogeneity of variance or homoscedasticity –Observations are independent

Assumptions--cont. Analysis of variance is generally robust –A robust test is one that is not greatly affected by violations of assumptions.

Logic of Analysis of Variance Null hypothesis (H o ): Population means from different conditions are equal –m 1 = m 2 = m 3 = m 4 Alternative hypothesis: H 1 –Not all population means equal.

Visualize total amount of variance in the Experiment Between Group Differences (Mean Square Group) Error Variance (Individual Differences + Random Variance) Mean Square Error Total Variance = Mean Square Total F ratio is a proportion of the MS group/MS Error. The larger the group differences, the bigger the F The larger the error variance, the smaller the F

Logic--cont. Create a measure of variability among treatment group means –MS group Create a measure of variability within treatment groups –MS error

Logic--cont. Form ratio of MS group /MS error –Ratio approximately 1 if null true –Ratio significantly larger than 1 if null false

Calculations Sum of Squares (SS) SS total SS groups SS error Compute degrees of freedom (df ) Compute mean squares and F-ratio Cont.

Degrees of Freedom (df ) Number of “observations” free to vary –df total = N - 1 N observations –df groups = g - 1 g means –df error = (n - 1)-(g-1) n observations in each group = n - 1 df times g groups

ANOVA Example Efforts to reduce body weight: 4 treatment groups: 1.control; 2.diet; 3.physical activity; 4.diet plus physical activity After 3 months body weight loss in lbs.

Example Trt gp wt loss in lbs T i x i. T 2 i T 2 i /5 T1: 5 – = T2: = T3: = T4: = T T 2 / Treatment Mean

ANOVA COMPUTATION

Example 5  x ij = T1 = 8; T2 = 30; T3 = 18; T4 = 51 j=1 x i. = 8/5=1.6; 30/5=6; 18/5=3.6; 51/5=10.2 = T 1 2 = 64; T 2 2 = 900; T 3 2 = 324; T 4 2 = 2601 T = 107 T 2 = 11, 449; T 2 /20 =   x 2 ij = 5 2 +(-2 2 ) = 963 Overall Mean

Example 4 5  x 2 ij = 963  T 2 i /5 = i=1 SS among = – = SS within = 963 – = SS y = 963 – 572 = 391 Treatment Mean Overall Mean Squared values

ANOVA TABLE Sourced.f.SSMSF-ratiop Among gp <.05 Within gp Total19 F.95(3,16) = 3.2 F calculated, 5.7 is bigger than F tabulated,3.2 therefore, reject null hypothesis with less than 5% chance of Type I error.

When there are more than two groups Significant F only shows that not all groups are equal –what groups are different??? –Food for Thought

Analysis of Differences Between Two Groups Between Multiple Groups Independent Groups Dependent Groups Independent Groups Dependent Groups Independent Samples t-test Repeated Measures t-test Independent Samples ANOVA Repeated Measures ANOVA Frequency CHI Square Nominal / Ordinal Data Some kinds of Regression Correlation: Pearson Regression Analysis of Relationships Multiple Predictors Correlation: Spearman Multiple Regression One Predictor Interval Data Type of Data Ordinal Regression

One Factor-ANOVA (Gill, p148) Fixed Treatment Effects: Y ij = μ + τ i + E (i)j An experiment was designed to compare t = 5 different media (treatments) for ability to support the growth of fibroblast cells of mice tissue culture. For replication, r = 5 bottles were used for each medium with same number of cells implanted into each bottle and total cell protein (Y) determined after seven days. The results (yij = μg protein nitrogen) are given in the table:

Growth of fibroblast cells in 5 tissue culture media (μg) One Factor-ANOVA (Gill, p148)

One Factor-ANOVA (Gill, p148) SS y = ( … ) – [ …116) 2 /25] = 279,985 – 279,418 = 567 SS T = [( …101) 2 /5 +( …+102) 2 /5+…] = 279,820 – 279,418 = 402 SS E = 567 – 402 = 165

One Factor-ANOVA (Gill, p148) Sourced.fSSMSFP ≤ Media Error Total f.01,4,20 =

One Factor-ANOVA (Gill, p150) Random Treatment Effects:Y ij = μ + T i + E (i)j Consider the data on daily weight gains, kg, of steer calves sired by 4 different bulls. T = 4 bulls (treatments).

Random Treatment Effects:Y ij = μ + T i + E (i)j

Random Treatment Effects:Y ij = μ + T i + E (i)j SS y = ( … ) – [ …1.10) 2 /29] = – = SS T = [( …1.15) 2 /6 +( …+0.97) 2 /8+…] = – = SS E = – =

Random Treatment Effects:Y ij = μ + T i + E (i)j Sourced.fSSMSFP ≤ Bulls Error Total f.05,3,25 =

Data STEER; INPUT BULLS $ WTGAINK; CARDS; B11.46 B11.23 B11.12 B11.23 B11.02 B11.15 B21.17 B21.08 B21.20 B21.08 B21.01 B20.86 B21.19 B20.97 B30.98 B31.06 B31.15 B31.11 B30.83 B30.86 B30.99 B40.95 B41.10 B41.07 B41.11 B40.89 B41.12 B41.15 B41.10 ; RUN; PROC PRINT DATA = STEER; RUN; PROC MEANS DATA = STEER; RUN; PROC SORT DATA = STEER OUT = BULLSORT; BY BULLS; RUN; PROC MEANS DATA = BULLSORT; BY BULLS; VAR WTGAINK; RUN; PROC GLM; CLASS BULLS; MODEL WTGAINK = BULLS; MEANS BULLS/TUKEY; RUN; QUIT;

The SAS System The GLM Procedure Dependent Variable: WTGAINK Sum of Source DF Squares Mean Square F Value Pr > F Model Error Corrected Total R-Square Coeff Var Root MSE WTGAINK Mean Source DF Type I SS Mean Square F Value Pr > F BULLS SAS OUT PUT

ANOVA-3stage Nested Models Gill p201 Fixed effects of treatments: Y ij = μ + τ i + E (i)j + U (ij)k An animal behavior trial was designed to study the potential depressant effects of 2 pharmaceutical products to stimulate response. Thirty (n) rats were randomly assigned, ten (r) to each product and to a control group that received a placebo. On two occasions (u), an observed response was recorded for each animal. The results are given in the table.

Rat no./gpTreatment 1Treatment 2Treatment 3 133, 3537,3340,42 239, 3831,3052,50 329, 3143,4545,44 441, 4136,3851,53 534, 3630,3944,41 626, 2338,3950,52 740, 3743,4643,43 849, 4632,3556,53 929, 3244,4651, , 3830,2941,43

Y ij = μ + τ i + E (i)j + U (ij)k SS y = ( …+43 2 ) – (33+35+…+43) 2 /60 = – = 3471 SS T = (33+35+…+38) 2 /20 + (37+33+…+29) 2 /60 +(40+42+…+43) 2 / = = 1572 SS E =(33+35) 2 /2 +(39+38) 2 /2 +…+(41+43) 2 / = – = 1788 SS U = 3471 – = 111

ANOVA RESPONSE TO STIMULUS Source of vardfSSMSF Treatments2 t /66.2 = 11.9 Exp error (rats/trt) 27 t(r-1) Samples/rats30 tr(u-1) Total f.001,2,27 =9.02 Tru-1 3*10*2 =60

2-way ANOVA

2-way ANOVA Example 4 vaccines 6 additives Response antibody titer in mouse 4*6 = 24 treatment combinations 72 mouse randomly divided into 24 groups of three mouse each.

AdditiveR i x i.. VaccineIIIIIIIVVVI ∑ µ A B C D ∑(C j ) (T) µ( x. i. ) (x)

Cell Total (T ij ) Additive VaccineIIIIIIIVVVI A B C D

∑ R i 2 /CM = 87 2 / / / /18 = 1489 ∑ T 2 /N = /72 = 1449 SS R = = 40 MS R = 40/3 = ∑ C j 2 /RM = ( ) /12 = 1482 SS C = = 33/5 = 6.61

∑ T ij 2 /M = …11 2 = 1560 SS I = =38 MS I = 38/15 = 2.52 Within cell = ∑ ∑ ∑x 2 ijk = …4 2 = 1711 SS within = =151 MS within = 151/48 = 3.15

2-way ANOVA Table Sourced.f.SS MSF-ratio p Vaccines * Additives NS VaccAdd Int NS Within cells F.95(5,48) = 2.45 F calculated, 2.1 is smaller than F tabulated,2.45 therefore, accept null hypothesis.

DATA ABTITER; INPUT VACCINES $ ADDITIVES MOUSE ABTITER; DATALINES; A A A A A A ; RUN; PROC ANOVA; CLASS VACCINES ADDITIVES MOUSE ; MODEL ABTITER = VACCINES ADDITIVES MOUSE VACCINES*ADDITIVES; MEANS VACCINES ADDITIVES /DUNNETT; RUN; PROC TABULATE; TITLE '2-WAY ANOVA WITH VACCINES AND ADDITIVES MAIN EFFECTS'; CLASS VACCINES ADDITIVES MOUSE ; VAR ABTITER; TABLE VACCINES ADDITIVES MOUSE VACCINES*ADDITIVES, ABTITER*MEAN; RUN; QUIT; SAS DATA SET

2-WAY ANOVA WITH VACCINES AND ADDITIVES MAIN EFFECTS The ANOVA Procedure Class Level Information Class Levels Values VACCINES 4 A B C D ADDITIVES Number of Observations Read 72 Number of Observations Used 72

The ANOVA Procedure Dependent Variable: ABTITER Sum of Source DF Squares Mean Square F Value Pr > F Model Error Corrected Total R-Square Coeff Var Root MSE ABTITER Mean Source DF Anova SS Mean Square F Value Pr > F VACCINES ADDITIVES VACCINES*ADDITIVES

The ANOVA Procedure Dunnett's t Tests for ABTITER NOTE: This test controls the Type I experimentwise error for comparisons of all treatments against a control. Alpha 0.05 Error Degrees of Freedom 48 Error Mean Square Critical Value of Dunnett's t Minimum Significant Difference Comparisons significant at the 0.05 level are indicated by ***. Difference VACCINES BetweenSimultaneous 95% Comparison Means Confidence Limits C - A B - A D - A ***

Two Factor, Fixed Effects Y ijk = μ + α i + β j + (αβ) ij E (ij)k Effects of sex and stage of gestation on the activity of fructose-1-phosphate aldolase (n- moles substrate metabolized/min/mg protein) in the upper third of the intestinal mucosa of calves taken by Cesarean section from 18 Holstein heifers undergoing first gestations. The data are shown: (Gill, p225)

Sex (A)90 d Stage of 180 d Gestation 270 d (B) Total Males subtotal Females subtotal Total

Y ijk = μ + α i + β j + (αβ) ij E (ij)k SS y = ( … ) – …+110.7) 2 /18 = 115,379 – 94,569 = 20, 810 SS A = ( ) /9 – 94,569 = 1122 SS B = ( ) / 6 – 94,569 = 7604 SS AB = ( … )/3 – 94,569 – 1122 – 7604 = 3010 SS E = 20,810 – 1122 – 7604 – 3010 = 9075

Two Factor, Fixed Effects ANOVA Source of variation dfSSMSF ratio Sex (A) ns f.05,1,12=4.75 Gestation (B) * f.05,2,12=3.89 Interaction (AB) ns f.05,2,12=3.89 Expt. Error Total denom.

DATA SEXGESTATION; INPUT SEX $ GESTATION $ F1P; DATALINES; M M M M M M M M M F F F F F F F F F ; RUN; PROC MEANS DATA = SEXGESTATION; PROC SORT DATA = SEXGESTATION OUT = SORT; BY SEX GESTATION; PROC MEANS DATA = SORT; BY SEX GESTATION; VAR F1P; PROC ANOVA; CLASS SEX GESTATION; MODEL F1P = SEX GESTATION SEX*GESTATION; MEANS SEX GESTATION /DUNNETT; RUN; PROC TABULATE; TITLE '2-WAY ANOVA WITH VACCINES AND ADDITIVES MAIN EFFECTS'; CLASS SEX GESTATION; VAR F1P; TABLE SEX GESTATION SEX*GESTATION, F1P*MEAN; RUN; QUIT;

2-WAY ANOVA WITH VACCINES AND ADDITIVES MAIN EFFECTS The ANOVA Procedure Dependent Variable: F1P Sum ofMean Source DF Squares Square F Value Pr > F Model Error Corrected Total R-Square Coeff Var Root MSE F1P Mean Source DF Anova SS Mean Square F Value Pr > F SEX GESTATION SEX*GESTATION