Introduction to ANOVA.

Slides:



Advertisements
Similar presentations
Chapter 10 Hypothesis Testing Using Analysis of Variance (ANOVA)
Advertisements

Chapter 11 Analysis of Variance
PTP 560 Research Methods Week 9 Thomas Ruediger, PT.
Hypothesis Testing IV Chi Square.
Analysis and Interpretation Inferential Statistics ANOVA
T-Tests.
Independent Sample T-test Formula
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
ANOVA Analysis of Variance: Why do these Sample Means differ as much as they do (Variance)? Standard Error of the Mean (“variance” of means) depends upon.
Chapter 10 Hypothesis Testing III (ANOVA). Basic Logic  ANOVA can be used in situations where the researcher is interested in the differences in sample.
Analysis of Variance: Inferences about 2 or More Means
Chapter 3 Analysis of Variance
Intro to Statistics for the Behavioral Sciences PSYC 1900
Lecture 9: One Way ANOVA Between Subjects
One-way Between Groups Analysis of Variance
Statistical Methods in Computer Science Hypothesis Testing II: Single-Factor Experiments Ido Dagan.
Introduction to Analysis of Variance (ANOVA)
Inferential Statistics
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Chap 10-1 Analysis of Variance. Chap 10-2 Overview Analysis of Variance (ANOVA) F-test Tukey- Kramer test One-Way ANOVA Two-Way ANOVA Interaction Effects.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Statistics: A Tool For Social Research
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
Chapter 10 Analysis of Variance.
ANOVA (Analysis of Variance) by Aziza Munir
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Analysis of Variance.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
One-Way Analysis of Variance
1 G Lect 11a G Lecture 11a Example: Comparing variances ANOVA table ANOVA linear model ANOVA assumptions Data transformations Effect sizes.
General Linear Model 2 Intro to ANOVA.
Lecture 9-1 Analysis of Variance
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
Chapter 17 Comparing Multiple Population Means: One-factor ANOVA.
Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.
Chapter 10 The t Test for Two Independent Samples
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
Chapter 10 Hypothesis Testing III (ANOVA). Chapter Outline  Introduction  The Logic of the Analysis of Variance  The Computation of ANOVA  Computational.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
Chapter 9 Introduction to the Analysis of Variance Part 1: Oct. 22, 2013.
Chapter 13 Understanding research results: statistical inference.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Chapter 12 Introduction to Analysis of Variance
Chapter 11 Analysis of Variance
Chapter 10 Two-Sample Tests and One-Way ANOVA.
SEMINAR ON ONE WAY ANOVA
INF397C Introduction to Research in Information Studies Spring, Day 12
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Statistics for Managers Using Microsoft Excel 3rd Edition
Factorial Experiments
Statistics: A Tool For Social Research
Hypothesis Testing Review
CHAPTER 12 ANALYSIS OF VARIANCE
Econ 3790: Business and Economic Statistics
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2017 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
INTEGRATED LEARNING CENTER
Chapter 9 Hypothesis Testing.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2018 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Chapter 11 Analysis of Variance
Analysis of Variance (ANOVA)
Inferences about Population Means
Some statistics questions answered:
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Introduction to ANOVA

Questions ANOVA makes assumptions about error for significance tests. What are the assumptions? What might happen (why would it be a problem) if the assumption of {normality, equality of error, independence of error} turned out to be false? What is an expected mean square? Why is it important? Why do we use the F test to decide whether means are equal in ANOVA?

Questions (2) Correctly interpret ANOVA summary tables. Find correct values of critical F from tabled values for a given test. Suppose someone has worked out that a one-way ANOVA with 6 levels has a power of .80 for the overall F test. What does this mean? Describe (make up) a concrete example of a one-way ANOVA where it makes sense to use an overall F test. Explain why ANOVA (not t, chi-square or something else) is the best method for the analysis.

ANOVA Assumptions Recall we can partition total SS into between (treatment) and within (error) SS. No assumptions needed. To conduct tests about population effects, have to make assumptions: Within cells (treatments) error is normal. Homogeneity of error variance. Independent errors.

Assumptions Normality – sampling distribution of means,variances; not bad if N is large; e.g. reaction time Homogeneity – pooled estimate of population value. Where are means different? Assumed equal error for each. E.g., ceiling effects in training. Independence – sampling distribution again; e.g., cheating on exam, nesting (schools, labs)

Mean Square Between Groups Mean square = SS/df = = variance estimate. MS between = E(MS between) = If there is no treatment effect, MS between = error variance. If there is a treatment effect, MS between is bigger than error variance. (J treatments)

Mean Square Within Groups (N is total sample size and J is number of groups.) MS within = E(MS within) = Expected mean square for error is . Expected mean square for treatment is same plus treatment effect: . When there is no treatment effect, between and within estimate same thing.

Review ANOVA makes assumptions about error for significance tests. What are the assumptions? What might happen (why would it be a problem) if the assumption of {normality, equality of error, independence of error} turned out to be false? What is an expected mean square? Why is it important?

The F Test (1) Suppose The null is equivalent to: If the null is true, then for all j for some j The ratio of the two variance estimates will be distributed as F with J-1 and N-J degrees of freedom. Note that the p-value will be a statement of conditional probability (if the null is true, then…)

The F Test (2) This is a big deal because we can use variance estimates to test the hypothesis that any number of population means are equal. Equality of means is same as testing population treatment effect(s). For a treatment effect to be detected, F must be larger than 1. F is one-tailed in the tables which show upper tail values of F given the two df.

F Table – Critical Values Numerator df: dfB dfW 1 2 3 4 5 5 5% 1% 6.61 16.3 5.79 13.3 5.41 12.1 5.19 11.4 5.05 11.0 10 5% 4.96 10.0 4.10 7.56 3.71 6.55 3.48 5.99 3.33 5.64 12 5% 4.75 9.33 3.89 6.94 3.49 5.95 3.26 3.11 5.06 14 5% 4.60 8.86 3.74 6.51 3.34 5.56 5.04 2.96 4.70

Review Why do we use the F test to decide whether means are equal in ANOVA? Suppose we have an ANOVA design with 3 cells and 5 people per cell. What is the critical value of F at alpha = .05?

Calculating F – 1 Way ANOVA Sums of squares (squared deviations from the mean) tell the story of variance. The simple ANOVA designs have 3 sums of squares. The total sum of squares comes from the distance of all the scores from the grand mean. This is the total; it’s all you have. The within-group or within-cell sum of squares comes from the distance of the observations to the cell means. This indicates error. The between-cells or between-groups sum of squares tells of the distance of the cell means from the grand mean. This indicates IV effects.

Computational Example: Caffeine on Test Scores G1: Control G2: Mild G3: Jolt Test Scores 75 80 70 77 82 72 79 84 74 81 86 76 83 88 78 Means SDs (N-1) 3.16

Total Sum of Squares G1 75 79 16 Control 77 4 M=79 SD=3.16 81 83 G2 80 SD=3.16 81 83 G2 80 1 M=84 82 9 84 25 86 49 88 G3 70 M=74 72 74 76 78 Sum 370 Total Sum of Squares

Within Sum of Squares G1 75 79 16 Control 77 4 M=79 SD=3.16 81 83 G2 SD=3.16 81 83 G2 80 84 M=84 82 86 88 G3 70 74 M=74 72 76 78 Sum 120 Within Sum of Squares

Between Sum of Squares G1 79 Control M=79 SD=3.16 G2 84 25 M=84 G3 74 Control M=79 SD=3.16 G2 84 25 M=84 G3 74 M=74 Sum 250 Between Sum of Squares

ANOVA Source (Summary) Table SS df MS F Between Groups 250 J-1= 3-1=2 SS/df 250/2= 125 =MSB F = MSB/MSW = 125/10 =12.5 Within Groups 120 N-J= 15-3=12 120/12 = 10 = MSW Total 370 N-1= 15-1=14

ANOVA Summary Calculate SS (total, between, within) Each SS has associated df to calculate MS F is ratio of MSb to MSw Compare obtained F (12.5) to critical value (3.89). Significant if obtained F is larger than critical. One-tailed test makes sense for F.

R code simple ANOVA Download the .R file Caffeine and .csv file from Canvas. Run the problem you saw in the slide show. Compare your results to what you see in the slides.

Review Suppose we have 4 groups and 10 people per group. We find that SSB = 60 and SSW = 40. Construct an ANOVA summary table and test for significance of the overall effect.

ANOVA Descriptive Stats Because SStot = SSb+SSw we can figure proportion of total variance due to treatment. Proportion of total variance due to treatment is: R2= SSb/SStot. Varies from 0 (no effect) to 1 (no error). Sample value is biased (too large).

Estimating Power Power for what? For one-way ANOVA, power usually means for the overall F, i.e., at least 1 group mean is different from the others. Howell uses noncentral F for sample size calculation. Where k is the number of treatment goups; n is sample size per group. Variance of error is MSE in the population (variance of DV within cells). Mu(j) are treatment means; mu is grand mean.

SAS Power calculation SAS will compute sample size requirements for a given scenario. You input the expected means and a common (within cell) standard deviation, (along with alpha and desired power) and it will tell you the sample size you need.

SAS Input proc power ; onewayanova run; SAS Input ********************************************************** * Power computation example from Howell, 2010, p. 350. * Note the standard deviation is the square root of the * provided MSE: sqrt(240.35) = ~ 15.5. **********************************************************; proc power ; onewayanova groupmeans = 34 | 50.8 | 60.33 | 48.5 | 38.1 stddev = 15.5 alpha = 0.05 npergroup = . power = .8;

SAS Output The POWER Procedure Overall F Test for One-Way ANOVA Fixed Scenario Elements Method Exact Alpha 0.05 Group Means 34 50.8 60.33 48.5 38.1 Standard Deviation 15.5 Nominal Power 0.8 Computed N Per Group Actual N Per Power Group 0.831 8

Review Suppose someone has worked out that a one-way ANOVA with 6 levels has a power of .80 for the overall F test. What does this mean? Describe (make up) a concrete example of a one-way ANOVA where it makes sense to use an overall F test. Explain why ANOVA (not t, chi-square or something else) is the best method for the analysis.