STAT 3130 Statistical Methods I Session 2 One Way Analysis of Variance (ANOVA)

Slides:



Advertisements
Similar presentations
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Advertisements

CHAPTER 25: One-Way Analysis of Variance Comparing Several Means
Chapter 14, part D Statistical Significance. IV. Model Assumptions The error term is a normally distributed random variable and The variance of  is constant.
Inference for Regression
ANOVA: Analysis of Variation
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Variance Chapter 16.
Analysis and Interpretation Inferential Statistics ANOVA
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
ANALYSIS OF VARIANCE.
Copyright ©2011 Brooks/Cole, Cengage Learning Analysis of Variance Chapter 16 1.
1 Analysis of Variance This technique is designed to test the null hypothesis that three or more group means are equal.
Independent Sample T-test Formula
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE.
Statistics Are Fun! Analysis of Variance
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide Are the Means of Several Groups Equal? Ho:Ha: Consider the following.
5-3 Inference on the Means of Two Populations, Variances Unknown
ANOVA 3/19/12 Mini Review of simulation versus formulas and theoretical distributions Analysis of Variance (ANOVA) to compare means: testing for a difference.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
AM Recitation 2/10/11.
F-Test ( ANOVA ) & Two-Way ANOVA
PS 225 Lecture 15 Analysis of Variance ANOVA Tables.
CHAPTER 3 Analysis of Variance (ANOVA) PART 1
Chapter 13: Inference in Regression
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.
T-Tests and Chi2 Does your sample data reflect the population from which it is drawn from?
More About Significance Tests
Comparing Two Population Means
T tests comparing two means t tests comparing two means.
One-Way Analysis of Variance Comparing means of more than 2 independent samples 1.
 The idea of ANOVA  Comparing several means  The problem of multiple comparisons  The ANOVA F test 1.
ANOVA One Way Analysis of Variance. ANOVA Purpose: To assess whether there are differences between means of multiple groups. ANOVA provides evidence.
ANOVA (Analysis of Variance) by Aziza Munir
Testing Hypotheses about Differences among Several Means.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Comparing Three or More Means ANOVA (One-Way Analysis of Variance)
Statistics for the Social Sciences Psychology 340 Fall 2012 Analysis of Variance (ANOVA)
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
ANOVA: Analysis of Variance. The basic ANOVA situation Two variables: 1 Nominal, 1 Quantitative Main Question: Do the (means of) the quantitative variables.
PPA 415 – Research Methods in Public Administration Lecture 7 – Analysis of Variance.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
ANALYSIS OF VARIANCE STAT 200. OUTLINE Introduction of concepts without numbers, notation or details Motivation The four steps: Hypothesis in words The.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
The Analysis of Variance. One-Way ANOVA  We use ANOVA when we want to look at statistical relationships (difference in means for example) between more.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
ANOVA P OST ANOVA TEST 541 PHL By… Asma Al-Oneazi Supervised by… Dr. Amal Fatani King Saud University Pharmacy College Pharmacology Department.
The Analysis of Variance ANOVA
Analysis of Variance STAT E-150 Statistical Methods.
T tests comparing two means t tests comparing two means.
STAT 3120 Statistical Methods I Lecture Notes 6 Analysis of Variance (ANOVA)
While you wait: Enter the following in your calculator. Find the mean and sample variation of each group. Bluman, Chapter 121.
Analysis of Variance ANOVA - method used to test the equality of three or more population means Null Hypothesis - H 0 : μ 1 = μ 2 = μ 3 = μ k Alternative.
CHAPTER 10: ANALYSIS OF VARIANCE(ANOVA) Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Oneway ANOVA comparing 3 or more means. Overall Purpose A Oneway ANOVA is used to compare three or more average scores. A Oneway ANOVA is used to compare.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Chapter 15 Analysis of Variance. The article “Could Mean Platelet Volume be a Predictive Marker for Acute Myocardial Infarction?” (Medical Science Monitor,
Data Analysis Module: One Way Analysis of Variance (ANOVA)
CHAPTER 3 Analysis of Variance (ANOVA) PART 1
Bivariate Testing (ANOVA)
CHAPTER 3 Analysis of Variance (ANOVA)
Basic Practice of Statistics - 5th Edition
Bivariate Testing (ANOVA)
Presentation transcript:

STAT 3130 Statistical Methods I Session 2 One Way Analysis of Variance (ANOVA)

ANOVA In Statistical Methods I, we covered several different forms of t-tests. For example, we used ttests to answer questions such as: Are cars going over the speed limit on this residential street? Did students who took a preparatory course, score better on the standardized exam than students who did not take the course?

ANOVA The general setting for all of these questions is that some quantitative variable (speed, test scores) has been measured for one or two categories of subjects. What if we have more than two categories across which we want to compare the value of some quantitative variable? For example, lets say that we wanted to compare the mean weight loss of subjects who were put on one of four diet plans. For ease of discussion, lets call these plans A, B, C and D.

ANOVA The following approach would be tempting… H 0 :  Plan A =  Plan B H 1 :  plan A   Plan B H 0 :  Plan B =  Plan C H 1 :  plan B   Plan C H 0 :  Plan C =  Plan D H 1 :  plan C   Plan D H 0 :  Plan A =  Plan C H 1 :  plan A   Plan C H 0 :  Plan A =  Plan D H 1 :  plan A   Plan D H 0 :  Plan B =  Plan D H 1 :  plan B   Plan D

ANOVA …but wrong. Apart from being very cumbersome, there is a critical problem – we are inflating our probability of making a type 1 error. Think about that – lets use alpha =.05. If we ran 6 separate tests, that would generate a cumulative probability of a type 1 error of.3. We could lower the alpha value to.05/6 – I hear you saying. But this has its own problems – what happens if the number of tests increase to 8 or 10? Our alpha value would become so low, we would almost never reject the null (recall Power).

ANOVA What we need is a single test which will allow us to evaluate all of the relationships simultaneously while using a reasonable alpha level. The test needs to be able to provide information regarding differences in the mean values of multiple subject groupings. To accomplish this, we use the Analysis of Variance or ANOVA procedure.

ANOVA Lets discuss how to use ANOVA to test a hypothesis by returning to our dieters… In this instance there are four levels (diet plans) to a single factor (weight loss). The hypothesis statements would look like this: H 0 : All level means are equal. In other words, all four of the diet plans generate approximately the same amount of weight loss. H 1 : Not all of the level means are equal. In other words, at least one of the plans’ weight loss mean is statistically significant different from the other plans’ means.

ANOVA Prior to executing the test, we must check for three important assumptions about our data: 1.All the groups are normally distributed. 2.All the populations sampled have approximately equal variance (you can check this by generating side-by-side boxplots). The rule of thumb is that the largest std is <2x the smallest std. 3.The samples of the groups are independent of each other and subjects within the groups were randomly selected. As with most, but not all, statistical tests, if our samples are large, we can relax our assumptions and work around non normal data.

ANOVA Lets examine the hypothesis statements in more detail: H 0 : µ a = µ b = µ c = µ d H 1 : µ a ≠ µ b ≠ µ c ≠ µ d Consider – what would the hypothesized distributions look like under H 0 and H 1 ?

ANOVA Ok. We understand the concept, we have the hypotheses, we have the assumptions – we need a test statistic. In ANOVA, we use the F-distribution. In the science of statistics, whenever you need to evaluate a ratio of variances you will be using an F-statistic. The ratio in question here is: The variation BETWEEN the groups The variation WITHIN the groups Question – what kind of value would indicate difference versus no difference?

ANOVA The result of this ratio is the F-statistic. You can see the F- Distribution on page 675 of your book. As the number of groups and observations increases, the distribution will start to appear normal. Lets start working with an example…

ANOVA PLAN Mean PLAN A PLAN B PLAN C PLAN D OVERALL MEAN31.33 Returning to the diet plans…

ANOVA Our hypotheses statements would be: H 0 : The four diets plans have the same results (the mean weight loss is the same) H 1 : At least one of the diet plans has a different result (the mean weight loss is different) We will now calculate our test statistic: The variation BETWEEN the groups The variation WITHIN the groups

ANOVA To calculate the F-Statistic, we use the following table (refer to page 682 in your book): SOURCESUM OF SQUARES DEGREES OF FREEDOM MEAN SQUARE F-stat BETWEEN SSB# levels – 1 SSB/(# levels – 1) {SSB/(# levels – 1)} {SSW(n- # levels)} WITHIN SSWn- # levels SSW/(n- # levels) TOTAL SST (SSB + SSW) n-1

ANOVA For those who are interested: SST = SSW + SSB  i  j (X ij -X) 2 =  i  j (X ij -X j ) 2 + n  j (X j -X) 2 _ _ _

ANOVA For the present problem: SOURCESUM OF SQUARES DEGREES OF FREEDOM MEAN SQUARE F-stat BETWEEN WITHIN TOTAL SSB = 6( ) 2 SSW = ( ) =

ANOVA Now…what to do with an F-statistic of 24.33? This is a fairly strong statistic – recall that as the variance ratio approaches 1, the null is true. As the variance ratio grows larger than 1, we can more confidently reject the null. As with all test statistics, this result will translate into a p-value. The p-value associated with this statistic is less than.001. Based upon this result, we can confidently reject the null hypothesis and conclude that at least one of the results is different. Lets execute this same problem in SAS…