One-way ANOVA: - Comparing the means IPS chapter 12.2 © 2006 W.H. Freeman and Company.

Slides:



Advertisements
Similar presentations
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Advertisements

CHAPTER 25: One-Way Analysis of Variance Comparing Several Means
The Analysis of Variance
Topics Today: Case I: t-test single mean: Does a particular sample belong to a hypothesized population? Thursday: Case II: t-test independent means: Are.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Significance Testing Chapter 13 Victor Katch Kinesiology.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
PSY 307 – Statistics for the Behavioral Sciences
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
ANOVA Determining Which Means Differ in Single Factor Models Determining Which Means Differ in Single Factor Models.
Analysis of Variance: Inferences about 2 or More Means
Chapter 3 Analysis of Variance
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Lecture 9: One Way ANOVA Between Subjects
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
One-way Between Groups Analysis of Variance
Chapter 11: Inference for Distributions
Chapter 9 Hypothesis Testing.
IPS Chapter 12 © 2010 W.H. Freeman and Company  Inference for On-Way Analysis of Variance  Comparing the Means One-Way Analysis of Variance.
Hypothesis Testing Using The One-Sample t-Test
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Chapter 12: Analysis of Variance
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
Inference for Distributions
ANOVA Greg C Elvers.
Education 793 Class Notes T-tests 29 October 2003.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
Comparing Two Population Means
One-Way Analysis of Variance Comparing means of more than 2 independent samples 1.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 24 Comparing Means.
 The idea of ANOVA  Comparing several means  The problem of multiple comparisons  The ANOVA F test 1.
Lecture Presentation Slides SEVENTH EDITION STATISTICS Moore / McCabe / Craig Introduction to the Practice of Chapter 12 One-Way Analysis of Variance.
t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.
One-way ANOVA: - Inference for one-way ANOVA IPS chapter 12.1 © 2006 W.H. Freeman and Company.
One-Way ANOVA Ch 12.  Recall:  A categorical variable = factor.  Its values =levels  ANOVA in general studies the effect of categorical variables.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Statistics (cont.) Psych 231: Research Methods in Psychology.
1 Section 9-4 Two Means: Matched Pairs In this section we deal with dependent samples. In other words, there is some relationship between the two samples.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.
AP Statistics Chapter 24 Comparing Means.
Ch11: Comparing 2 Samples 11.1: INTRO: This chapter deals with analyzing continuous measurements. Later, some experimental design ideas will be introduced.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Chapter 10 The t Test for Two Independent Samples
CHAPTER 27: One-Way Analysis of Variance: Comparing Several Means
Business Statistics: A First Course (3rd Edition)
© Copyright McGraw-Hill 2004
Objectives (IPS Chapter 12.1) Inference for one-way ANOVA  Comparing means  The two-sample t statistic  An overview of ANOVA  The ANOVA model  Testing.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
Comparing Means Chapter 24. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Statistics (cont.) Psych 231: Research Methods in Psychology.
Statistics 24 Comparing Means. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side.
Inferential Statistics Psych 231: Research Methods in Psychology.
Objectives (PSLS Chapter 18) Comparing two means (σ unknown)  Two-sample situations  t-distribution for two independent samples  Two-sample t test 
One-Way ANOVA: One-Way Analysis of Variance PBS Chapter 14.1 © 2009 W.H. Freeman and Company.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 15 Analysis of Variance. The article “Could Mean Platelet Volume be a Predictive Marker for Acute Myocardial Infarction?” (Medical Science Monitor,
Basic Practice of Statistics - 5th Edition
Hypothesis testing using contrasts
I. Statistical Tests: Why do we use them? What do they involve?
Psych 231: Research Methods in Psychology
Presentation transcript:

One-way ANOVA: - Comparing the means IPS chapter 12.2 © 2006 W.H. Freeman and Company

Objectives (IPS chapter 12.2) Comparing the means  Contrasts: planned comparisons  Multiple comparisons  Power of the one-way ANOVA test

You have calculated a p-value for your ANOVA test. Now what? If you found a significant result, you still need to determine which treatments were different from which others.  You will gain insight just by looking back at the boxplots.  There are several tests of statistical significance designed specifically for multiple tests. You can choose apriori contrasts, or aposteriori multiple comparisons.  You can find the confidence interval for each mean  or a confidence interval for the difference of any pair of means.

 Contrasts can be used only when there are clear expectations BEFORE starting an experiment, and these are reflected in the experimental design. Contrasts are planned comparisons.  Patients are given either drug A, drug B, or a placebo. The three treatments are not symmetrical. The placebo is meant to provide a baseline against which the other drugs can be compared.  Multiple comparisons should be used when there are no justified expectations. Those are aposteriori, pair-wise tests of significance.  We compare gas mileage for eight brands of SUVs. We have no prior knowledge to expect any brand to perform differently from the rest. Pair- wise comparisons should be performed here, but only if an ANOVA test on all eight brands reached statistical significance first. It is NOT appropriate to use a contrast test when suggested comparisons appear only after the data is collected.

Contrasts: planned comparisons When an experiment is planned to test a specific hypothesis that involves showing a particular difference, we can use contrasts to test for significant differences between those treatments.  Contrasts are more powerful than multiple comparisons because they are more specific. They are more likely to pick up a significant difference if it exists.  You can use a t-test on the contrasts or calculate a t-confidence interval.  The results are valid regardless of the results of your multiple sample ANOVA test (you are still testing a valid hypothesis).

A contrast is a combination of population means of the form : Where the coefficients a i have sum 0. The corresponding sample contrast is : The standard error of c is : To test the null hypothesis H 0 :  = 0 use the t-statistic: With degrees of freedom DFE that is associated with sp. The alternative hypothesis can be one- or two-sided. A level C confidence interval for the difference  is : Where t* is the critical value defining the middle C% of the t distribution with DFE degrees of freedom.

Contrasts are not always readily available in statistical software packages (when they are, you need to assign the coefficients “a i ”), or may be limited to comparing each sample to a control. If your software doesn’t provide an option for contrasts, you can test your contrast hypothesis with a regular t-test using the formulas we just highlighted. Remember to use the pooled variance and degrees of freedom as they reflect your better estimate of the population variance. Then you can look up your p-value in a table of t-distribution.

Do nematodes affect plant growth? A botanist prepares 16 identical planting pots and adds different numbers of nematodes into the pots. Seedling growth (in mm) is recorded two weeks later. Nematodes and plant growth One group contains no nematode at all. If the botanist planned this group as a baseline/control, then a contrast of all the nematode groups against the control would be valid.

Nematodes: planned comparison Contrast of all the nematode groups against the control: Combined contrast hypotheses: H 0 : µ 1 = 1/3 (µ 2 + µ 3 + µ 4 ) vs. H a : µ 1 > 1/3 (µ 2 + µ 3 + µ 4 )  one tailed Contrast coefficients: (+1 −1/3 −1/3 −1/3) or (+3 −1 −1 −1) The presence of nematodes result in significantly shorter seedlings (  1%). x¯ i s i G1: 0 nematode G2: 1,000 nematodes G3: 5,000 nematodes G4: 1,0000 nematodes

Multiple comparisons Multiple comparison tests are variants on the two-sample t-test.  They use the pooled standard deviation s p = √ MSE.  They use pooled degrees of freedom DFE.  And they compensate for the fact that we are making many comparisons. We compute the t-statistic for all pairs of means: A given test is significant (µ i and µ j significantly different), when |t ij | ≥ t** (df = DFE). The value of t** depends on which procedure you choose to use.

The Bonferroni procedure performs a number of pair-wise comparisons with t-tests and then multiplies each p-value by the number of comparisons made. This ensures that the probability of making any false rejection among all comparisons made is no greater than the chosen significance level α. As a consequence, the higher the number of pair-wise comparisons you make, the more difficult it will be to show statistical significance for each test. But the chance of committing a type I error also increases with the number of tests made. The Bonferroni procedure lowers the working significance level of each test to compensate for the increased chance of type I errors among all tests performed. The Bonferroni procedure

Simultaneous confidence intervals We can also calculate simultaneous level C confidence intervals for all pair-wise differences (µ i − µ j ) between population means:  s p is the pooled variance, MSE.  t** is the t critical with degrees of freedom DFE = N – I, adjusted for multiple, simultaneous comparisons (e.g., Bonferroni procedure).

The ANOVA test is significant (α 5%): we have found evidence that the three methods do not all yield the same population mean reading comprehension score. Teaching methods A study compares the reading comprehension (“COMP,” a test score) of children randomly assigned to one of three teaching methods: basal, DRTA, and strategies. We test: H 0 : µ Basal = µ DRTA = µ Strat vs. H a : H 0 not true

What do you conclude? The three methods do not yield the same results: We found evidence of a significant difference between DRTA and basal methods (DRTA gave better results on average), but the data gathered does not support the claim of a difference between the other methods (DRTA vs. strategies or basal vs. strategies).

Power The power, or sensitivity, of a one-way ANOVA is the probability that the test will be able to detect a difference among the groups (i.e. reach statistical significance) when there really is a difference. Estimate the power of your test while designing your experiment to select sample sizes appropriate to detect an amount of difference between means that you deem important.  Too small a sample is a waste of experiment, but too large a sample is also a waste of resources.  A power of at least 80% is often suggested.

Power computations ANOVA power is affected by  The significance level   The sample sizes and number of groups being compared  The real differences between group means µ i  The population standard deviation You need to decide what alternative H a you would consider important detect statistically for the means µ i and to approximate the common standard deviation σ (from similar studies or preliminary work). The power computations then require calculating a noncentrality paramenter λ to arrive at the power of the test.