Analysis of Variance and Multiple Comparisons Comparing more than two means and figuring out which are different.

Slides:



Advertisements
Similar presentations
Week 2 – PART III POST-HOC TESTS. POST HOC TESTS When we get a significant F test result in an ANOVA test for a main effect of a factor with more than.
Advertisements

ANOVA and Linear Models. Data Data is from the University of York project on variation in British liquids. Data is from the University of York project.
Analysis of Variance (ANOVA) ANOVA methods are widely used for comparing 2 or more population means from populations that are approximately normal in distribution.
Model Adequacy Checking in the ANOVA Text reference, Section 3-4, pg
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
© 2010 Pearson Prentice Hall. All rights reserved The Complete Randomized Block Design.
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Nemours Biomedical Research Statistics April 16, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Analysis of Variance: Inferences about 2 or More Means
Lecture 14 – Thurs, Oct 23 Multiple Comparisons (Sections 6.3, 6.4). Next time: Simple linear regression (Sections )
Comparing Means.
Analysis of Variance (ANOVA) MARE 250 Dr. Jason Turner.
POST HOC COMPARISONS What is the Purpose?
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Lecture 9: One Way ANOVA Between Subjects
8. ANALYSIS OF VARIANCE 8.1 Elements of a Designed Experiment
Finals Schedule n Section 1: 9:00 AM Monday, May 15.
Statistics for the Social Sciences Psychology 340 Spring 2005 Analysis of Variance (ANOVA)
One-way Between Groups Analysis of Variance
Nemours Biomedical Research Statistics March 26, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Comparing Several Means: One-way ANOVA Lesson 14.
Today Concepts underlying inferential statistics
Chapter 14 Inferential Data Analysis
Administrata New final exam schedule: Handed out Tues, Dec 3
Linear Contrasts and Multiple Comparisons (Chapter 9)
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Chapter 12: Analysis of Variance
Analysis of Variance (ANOVA) Quantitative Methods in HPELS 440:210.
1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.
QNT 531 Advanced Problems in Statistics and Research Methods
Intermediate Applied Statistics STAT 460
1 Multiple Comparison Procedures Once we reject H 0 :   =   =...  c in favor of H 1 : NOT all  ’s are equal, we don’t yet know the way in which.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
STATISTICAL INFERENCE PART IX HYPOTHESIS TESTING - APPLICATIONS – MORE THAN TWO POPULATION.
January 31 and February 3,  Some formulae are presented in this lecture to provide the general mathematical background to the topic or to demonstrate.
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
Between-Groups ANOVA Chapter 12. >When to use an F distribution Working with more than two samples >ANOVA Used with two or more nominal independent variables.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
I. Statistical Tests: A Repetive Review A.Why do we use them? Namely: we need to make inferences from incomplete information or uncertainty þBut we want.
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, October 15, 2013 Analysis of Variance (ANOVA)
STA MCP1 Multiple Comparisons: Example Study Objective: Test the effect of six varieties of wheat to a particular race of stem rust. Treatment:
Chapter 15 – Analysis of Variance Math 22 Introductory Statistics.
ANOVA: Analysis of Variance.
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
One-way ANOVA: - Comparing the means IPS chapter 12.2 © 2006 W.H. Freeman and Company.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: One-way ANOVA Marshall University Genomics Core.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Chapter 13 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 13: Multiple Comparisons Experimentwise Alpha (α EW ) –The probability.
Psy 230 Jeopardy Related Samples t-test ANOVA shorthand ANOVA concepts Post hoc testsSurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
One-way ANOVA Example Analysis of Variance Hypotheses Model & Assumptions Analysis of Variance Multiple Comparisons Checking Assumptions.
University of Ottawa - Bio 4158 – Applied Biostatistics © Antoine Morin and Scott Findlay 20/02/ :23 PM 1 Multiple comparisons What are multiple.
Topic 22: Inference. Outline Review One-way ANOVA Inference for means Differences in cell means Contrasts.
Chapters Way Analysis of Variance - Completely Randomized Design.
O A post-hoc test is needed after we complete an ANOVA in order to determine which groups differ from each other. o Do not conduct a post-hoc test unless.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
ANalysis Of VAriance (ANOVA) Used for continuous outcomes with a nominal exposure with three or more categories (groups) Result of test is F statistic.
Stats/Methods II JEOPARDY. Jeopardy Estimation ANOVA shorthand ANOVA concepts Post hoc testsSurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400.
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Posthoc Comparisons finding the differences. Statistical Significance What does a statistically significant F statistic, in a Oneway ANOVA, tell us? What.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Chapter 12 Introduction to Analysis of Variance
STAT Single-Factor ANOVA
Hypothesis testing using contrasts
Data Analysis and Interpretation
Analysis of Variance (ANOVA)
I. Statistical Tests: Why do we use them? What do they involve?
Estimating the Variance of the Error Terms
Presentation transcript:

Analysis of Variance and Multiple Comparisons Comparing more than two means and figuring out which are different

Analysis of Variance (ANOVA) Despite the name, the procedures compares the means of two or more groups Null hypothesis is that the group means are all equal Widely used in experiments, it is less common in anthropology

ANOVA in Rcmdr Statistics | Means | One-way ANOVA –Accept or change the model name –Select a group (only factors are listed here) –Select a response variable (only numeric variables are listed here) –Check Pairwise comparison of means

> AnovaModel.1 <- aov(Area ~ Segment, data=Snodgrass) > summary(AnovaModel.1) Df Sum Sq Mean Sq F value Pr(>F) Segment e-15 *** Residuals Signif. codes: 0 '***' '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 > numSummary(Snodgrass$Area, groups=Snodgrass$Segment, + statistics=c("mean", "sd")) mean sd n

Results Since the ANOVA statistic is less than our critical value (.05), we reject the null hypothesis that the mean Areas of Segments 1 = 2 = 3 But we usually want to know more Since we did not make predictions in advance our comparisons are post hoc

Multiple Comparisons To find out which means are different from each other we have to compare the various combinations: 1 with 2, 1 with 3, and 2 with 3 (we could also perform other comparisons such as 1 and 2 with 3 but they are rare in anthropology

More Kinds of Errors Our statistical tests have focused on setting the Type I error rate at.05 – the comparisonwise error rate But this error rate holds for a single test. If we do many tests, the chance that we will commit at least one Type 1 error will be higher – the experimentwise error rate

Calculating Errors If the probability of a Type I error is.05, the probability of not making a Type I error is (1 -.05) =.95 The probability of not making a Type I error twice is.95 2 =.9025, three times =.8574, four times =.8145

Calculating Errors The probability of making at least one Type I error is –Twice – ( ) =.0975 –Thrice – ( ) =.1426 –Four times – ( ) =.1855 The probability of making at least one Type I error increases with each additional test

curve((1-(1-.05)^x), 1, 50, 50, yaxp=c(0,.9, 9), xaxp=c(0, 50, 10), xlab="Number of Comparisons", ylab="Type I Error Rate", las=1, main="Experimentwise Error Rate") curve((1-(1-.01)^x), 1, 50, 50, lty=2, add=TRUE) text(30,.92, expression(p == 1-(1-.05)^x), pos=4) text(30,.37, expression(p == 1-(1-.01)^x), pos=4) abline(h=seq(.1,.9,.1), v=seq(0, 50, 5), lty=3, col="gray") legend("topleft", c("Comparisonwise p =.05", "Comparisonwise p =.01"), lty=c(1, 2), bg="white")

Multiple Comparisons Multiple Comparisons procedures take experimentwise error into account when comparing the group means There are a number of methods available, but we’ll stick with Tukey’s Honestly Significant Differences (aka Tukey’s range test)

Tukey’s HSD One of the few multiple comparison tests that can adjust for different sample sizes among the groups You requested this test in Rcmdr when you checked “Pairwise comparison of the means”

>.Pairs <- glht(AnovaModel.1, linfct = mcp(Segment = "Tukey")) > summary(.Pairs) # pairwise tests Simultaneous Tests for General Linear Hypotheses Multiple Comparisons of Means: Tukey Contrasts Fit: aov(formula = Area ~ Segment, data = Snodgrass) Linear Hypotheses: Estimate Std. Error t value Pr(>|t|) == <1e-04 *** == <1e-04 *** == Signif. codes: 0 '***' '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 (Adjusted p values reported -- single-step method)

> confint(.Pairs) # confidence intervals Simultaneous Confidence Intervals Multiple Comparisons of Means: Tukey Contrasts Fit: aov(formula = Area ~ Segment, data = Snodgrass) Quantile = % family-wise confidence level Linear Hypotheses: Estimate lwr upr == == ==

NonParametric ANOVA The non-parametric alternative to ANOVA is the Kruskal-Wallis Rank Sum Test The null hypothesis is that the medians of the groups are equal If the test is significant, a multiple comparison method is available to identify which groups are different

Kruskal-Wallis in Rcmdr Statistics | Nonparametric tests | Kruskal-Wallis test –Select a group (only factors are listed here) –Select a response variable (only numeric variables are listed here)

Multiple Comparisons If there are significant differences the function kruskalmc() in package pgirmess will tell you what groups are different

> kruskal.test(Area ~ Segment, data=Snodgrass) Kruskal-Wallis rank sum test data: Area by Segment Kruskal-Wallis chi-squared = , df = 2, p- value = 1.113e-11 library(pgirmess) > kruskalmc(Area ~ Segment, data=Snodgrass) Multiple comparison test after Kruskal-Wallis p.value: 0.05 Comparisons obs.dif critical.dif difference TRUE TRUE FALSE