 The idea of ANOVA  Comparing several means  The problem of multiple comparisons  The ANOVA F test 1.

Slides:



Advertisements
Similar presentations
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Advertisements

Chapter 11 Analysis of Variance
ANOVA: Analysis of Variation
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Variance Chapter 16.
© 2010 Pearson Prentice Hall. All rights reserved The Complete Randomized Block Design.
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
Copyright ©2011 Brooks/Cole, Cengage Learning Analysis of Variance Chapter 16 1.
Part I – MULTIVARIATE ANALYSIS
Analysis of Variance: Inferences about 2 or More Means
Statistics Are Fun! Analysis of Variance
Chapter 3 Analysis of Variance
PSY 307 – Statistics for the Behavioral Sciences
Lecture 9: One Way ANOVA Between Subjects
Two Groups Too Many? Try Analysis of Variance (ANOVA)
One-way Between Groups Analysis of Variance
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Statistical Methods in Computer Science Hypothesis Testing II: Single-Factor Experiments Ido Dagan.
Chapter 12: Analysis of Variance
F-Test ( ANOVA ) & Two-Way ANOVA
11. Analysis of Variance (ANOVA). Analysis of Variance Review of T-Test ✔ The basic ANOVA situation How ANOVA works One factor ANOVA model ANCOVA and.
Analysis of Variance or ANOVA. In ANOVA, we are interested in comparing the means of different populations (usually more than 2 populations). Since this.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 13 Experimental Design and Analysis of Variance nIntroduction to Experimental Design.
STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
Lecture Presentation Slides SEVENTH EDITION STATISTICS Moore / McCabe / Craig Introduction to the Practice of Chapter 12 One-Way Analysis of Variance.
ANOVA One Way Analysis of Variance. ANOVA Purpose: To assess whether there are differences between means of multiple groups. ANOVA provides evidence.
t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.
Sociology 5811: Lecture 14: ANOVA 2
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
One-way ANOVA: - Inference for one-way ANOVA IPS chapter 12.1 © 2006 W.H. Freeman and Company.
One-Way ANOVA Ch 12.  Recall:  A categorical variable = factor.  Its values =levels  ANOVA in general studies the effect of categorical variables.
ANOVA (Analysis of Variance) by Aziza Munir
Testing Hypotheses about Differences among Several Means.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
Comparing Three or More Means ANOVA (One-Way Analysis of Variance)
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
ANOVA: Analysis of Variance. The basic ANOVA situation Two variables: 1 Nominal, 1 Quantitative Main Question: Do the (means of) the quantitative variables.
Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics ANalysis Of VAriance: ANOVA.
Chapter 13 Multiple Regression
Lecture 9-1 Analysis of Variance
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
One-way ANOVA: - Comparing the means IPS chapter 12.2 © 2006 W.H. Freeman and Company.
The Analysis of Variance. One-Way ANOVA  We use ANOVA when we want to look at statistical relationships (difference in means for example) between more.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Objectives (IPS Chapter 12.1) Inference for one-way ANOVA  Comparing means  The two-sample t statistic  An overview of ANOVA  The ANOVA model  Testing.
Chapter 4 Analysis of Variance
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
One-way ANOVA Example Analysis of Variance Hypotheses Model & Assumptions Analysis of Variance Multiple Comparisons Checking Assumptions.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
Analysis of Variance STAT E-150 Statistical Methods.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
The basic ANOVA situation Two variables: 1 Categorical, other variable interval or ratio Main Question: Do the (means of) the quantitative variables depend.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
One-Way ANOVA: One-Way Analysis of Variance PBS Chapter 14.1 © 2009 W.H. Freeman and Company.
ANOVA: Analysis of Variation
ANOVA: Analysis of Variation
ANOVA: Analysis of Variation
ANOVA: Analysis of Variation
Basic Practice of Statistics - 5th Edition
Hypothesis testing using contrasts
Statistics Analysis of Variance.
Post Hoc Tests on One-Way ANOVA
ANOVA: Analysis of Variance
Presentation transcript:

 The idea of ANOVA  Comparing several means  The problem of multiple comparisons  The ANOVA F test 1

 Values of a quantitative random variable are of interest ◦ score, cholesterol level, etc.  We will compare several groups described or labeled by values of one (typically) categorical random variable ◦ gender, age group, etc.  This is One-Way ANOVA  Compare means of several groups  Question: The difference between the group means, are they statistically significant?

The Idea of ANOVA When comparing different populations or treatments, the data are subject to sampling variability. We can pose the question for inference in terms of the mean response. Analysis of variance (ANOVA) is the technique used to compare several means. One-way ANOVA is used for situations in which there is only one factor, or only one way to classify the populations of interest. Two-way ANOVA is used to analyze the effect of two factors. 3

Example 1: Numbers of days for healing a standard wound (in an animal) for several treatments Example 2: Wages of different ethnic groups in a company Example 3: Lifetimes of different brands of tires  If comparing means of two groups, ANOVA is equivalent to a 2-sample (two-sided) pooled t-test  ANOVA allows for 3 or more groups

 Goal: test H 0 : μ 1 = μ 2 against H a : μ 1 ≠ μ 2  σ 1, σ 2 are unknown but assumed to be equal ◦ The pooled t test statistic is given in 7.2 (p. 499)  Then the ANOVA test statistic (denoted by an F) is the square of the t-statistic.

 Graphical investigation: ◦ side-by-side box plots ◦ multiple histograms  Whether the differences between the groups are significant depends on ◦ the difference in the means ◦ the standard deviations of each group ◦ the sample sizes

Are the means significantly different?

Difference in means more plausible. (Still depends on sample sizes.)

 ANOVA tests the following hypotheses: ◦ H 0 :  1 =  2 =  3 = … =  I (the means of all the groups are equal) ◦ H a : Not all the means are equal  Does not say how or which ones differ  Can follow up with “multiple comparisons”, to check which pairs differ  We refer to the sub-populations as “groups”

N = number of individuals all together I = number of groups = (grand) mean for entire data set  Group i has ◦ n i = # of individuals in group i ◦ x ij = value for individual j in group i ◦ = sample mean for group i ◦ s i = sample standard deviation for group i

DATA=FIT+RESIDUAL  µ i = population mean for group i  Note: the standard deviations are assumed to be equal among all groups

 Rule for examining SD for ANOVA: ◦ If the largest standard deviation (s) is less than twice the smallest standard deviation than we may still assume that SD (σ) is constant among groups (and we can use ANOVA).  Pooled variance and standard deviation:

 A study of reading comprehension in children compared three methods of instruction. Several pretest variables were measured before any instruction was given. One purpose of the pretest was to see if the three groups of children were similar in their comprehension skills. The three methods or instruction are called basal, DRTA, and strategies. We use Basal, DRTA, and Strat as values for the categorical variable indicating which method each student received.

 Are (were) the groups the same with regard to reading ability before instruction?  Hypotheses:  First check ANOVA assumption that data is normally distributed:

 Are the groups’ means significantly different?

 Multiple comparisons (between all pairs of groups) greatly increase the probability of false rejection of a null hypothesis. There are 3 comparisons (instead of 1 for ANOVA).  Estimation of standard deviation (s p ): ANOVA technique uses all information of data and usually gives higher precision  We will use ANOVA and an F-test  F=Fisher

ANOVA table  Three categories of calculations  within  between  total  Three things to calculate  SS (sum of squares)  df (degrees of freedom)  MS (mean square)  We will use computer output for homework and exams

Variability df degrees of freedom SS sum of squares MS mean square Between Groups (called Model) DFM=I-1 (num) SSM (from data) SSG/DFG Within groups (called Error) DFE=N-I (den) SSE (from data) SSE/DFE TotalDFT=N-1 SST (from data) ANOVA table

 Test statistic is F = MSM/MSE.  Under H 0, F has an F distribution with DFM, DFE  In Table E, read P-values for F given DFM, DFE ◦ larger values of F than the critical value will reject H 0  “Degrees of freedom in the numerator" = DFM  “Degrees of freedom in the denominator" = DFE

p Fs Fs

 Note that table E has a different layout than Table D or F.  You have two kinds of DF to look up.  No multiplication of P-value by 2 or subtraction from 1, whatever you note directly from Table E is the P-value.

 Considered groups of elementary school students do not differ significantly (p-value = 0.33) in reading scores before instruction.  This also means that later differences will be due to the effect of the (three) teaching methods subsequently applied to these children.  Important that the SDs for the groups were similar.

 After months of instruction (by three different methods) another reading test was given in each class. The score will be denoted COMP.  Do the methods differ in effectiveness? State hypotheses:…

 The groups DRTA and Strat were taught by innovative methods (of similar philosophy).  Basal was the standard method used in schools.  Now that we know there is a difference, can we compare the groups pairwise? ◦ Use multiple comparison tests (pg 661 in book) ◦ In this class, we will use the Bonferroni procedure  There are other multiple comparison procedures  Namely, Tukey, LSD, Scheffe, SNK

Means with the same letter are not significantly different. Bon Grouping Mean N Group A Basal A A DRTA A A Strat  So for the pretest, none of the groups are significantly different. This should not be surprising since we failed to reject H 0 for the ANOVA F-test.

Means with the same letter are not significantly different. Bon Grouping Mean N Group A DRTA A B A Strat B B Basal  So for the post test, only DRTA and Basal are significantly different.

 On SAS output R-square: R 2 =SSM/SST  Shows what part of the total variation is due to differences among groups.  Before instruction the groups explained only 3% of the variation in score, R 2 =  After instruction the groups explain 12% of the variation in score, R 2 =  For the comp scores we are seeing a lot more of the difference in the groups than we did with the pretest.

 Since we assume the standard deviation is the same for all the groups, we have “constant variance” (the normal curve is the same shape for all 3 groups)  It would be nice to have an estimate for this variation ◦ The MSE is an estimate for this variance, σ 2.  To get an estimate for just σ, we take the square root ◦ The Root MSE is an estimate for σ, see SAS output.

 Inference procedure for comparing means of several groups.  Two kinds of variance: Between group and within group.  If Between variation much larger than Within variation, there exists a difference among the group means.  Using F-statistic and Table E.