Chapter 12: Introduction to Analysis of Variance

Slides:



Advertisements
Similar presentations
Chapter 10: The t Test For Two Independent Samples
Advertisements

INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Chapter 9 Introduction to the t-statistic
Chapter 11: The t Test for Two Related Samples
Chapter 10 Analysis of Variance (ANOVA) Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
PSY 307 – Statistics for the Behavioral Sciences
Chapter 9: Introduction to the t statistic
1 Chapter 13: Introduction to Analysis of Variance.
Analysis of Variance (ANOVA) Quantitative Methods in HPELS 440:210.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
QNT 531 Advanced Problems in Statistics and Research Methods
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
Chapter 12: Introduction to Analysis of Variance
t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Copyright © 2004 Pearson Education, Inc.
Testing Hypotheses about Differences among Several Means.
Chapter 14 Repeated Measures and Two Factor Analysis of Variance
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Psy 230 Jeopardy Related Samples t-test ANOVA shorthand ANOVA concepts Post hoc testsSurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
The Analysis of Variance ANOVA
Statistics for Political Science Levin and Fox Chapter Seven
1 Chapter 14: Repeated-Measures Analysis of Variance.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
Oneway/Randomized Block Designs Q560: Experimental Methods in Cognitive Science Lecture 8.
Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
1 Statistics for the Behavioral Sciences (5 th ed.) Gravetter & Wallnau Chapter 13 Introduction to Analysis of Variance (ANOVA) University of Guelph Psychology.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Stats/Methods II JEOPARDY. Jeopardy Estimation ANOVA shorthand ANOVA concepts Post hoc testsSurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Chapter 10: The t Test For Two Independent Samples.
Chapter 12 Introduction to Analysis of Variance
Chapter 11 Analysis of Variance
Statistical Inferences for Population Variances
Chapter 11 Created by Bethany Stubbe and Stephan Kogitz.
CHAPTER 3 Analysis of Variance (ANOVA) PART 1
One-Way Between-Subjects Design and Analysis of Variance
Analysis of Variance . Chapter 12.
SEMINAR ON ONE WAY ANOVA
Characteristics of F-Distribution
Applied Business Statistics, 7th ed. by Ken Black
Comparing Three or More Means
Statistics Analysis of Variance.
Post Hoc Tests on One-Way ANOVA
Inferential Statistics and Probability a Holistic Approach
Statistics for Business and Economics (13e)
Econ 3790: Business and Economic Statistics
Chapter 14 Repeated Measures
Kin 304 Inferential Statistics
Analysis of Variance (ANOVA)
Chapter 14: Analysis of Variance One-way ANOVA Lecture 8
Analysis of Variance.
One-Way Analysis of Variance
Chapter 14: Two-Factor Analysis of Variance (Independent Measures)
Chapter 13: Repeated-Measures Analysis of Variance
Psych 231: Research Methods in Psychology
1 Chapter 8: Introduction to Hypothesis Testing. 2 Hypothesis Testing The general goal of a hypothesis test is to rule out chance (sampling error) as.
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Chapter 12: Introduction to Analysis of Variance

The Logic and the Process of Analysis of Variance Chapter 12 presents the general logic and basic formulas for the hypothesis testing procedure known as analysis of variance (ANOVA). The purpose of ANOVA is much the same as the t tests presented in the preceding chapters: the goal is to determine whether the mean differences that are obtained for sample data are sufficiently large to justify a conclusion that there are mean differences between the populations from which the samples were obtained.

independent vs quasi-independent variable independent variable: manipulated variable  experimental study, e.g. with/without phone quasi-independent variable: non-manipulated variable  non-experimental study, e.g. age both are called “factor” in ANOVA factor can be independent-measures (separate groups) or repeated-measures (same group)

ch12 - ch14 ch 12: 1-factor independent-measures designs ch 13: 1-factor repeated-measures designs ch14: 2-factor designs (factorial designs)

The Logic and the Process of Analysis of Variance (cont'd.) The difference between ANOVA and the t tests is that ANOVA can be used in situations where there are two or more means being compared, whereas the t tests are limited to situations where only two means are involved. ANOVA is necessary to protect researchers from excessive risk of a Type I error in situations where a study is comparing more than two population means.

The Logic and the Process of Analysis of Variance (cont'd.) These situations would require a series of several t tests to evaluate all of the mean differences. (Remember, a t test can compare only two means at a time.) Although each t test can be done with a specific α-level (risk of Type I error), the α-levels accumulate over a series of tests so that the final experimentwise α-level can be quite large.

test on 3 samples ANOVA: H0: μ1 = μ2 = μ3 , H1: not H0 (mutually exclusive) α=5%: testwise alpha level t test for H0: μ1 = μ2 = μ3  pairwise testing test 1: H0: μ1 = μ2 , then test 2: H0: μ1 = μ3 , then test 3: H0: μ2 = μ3 , done! α=5%P(failed to reject H0|H0 is true)=0.95 P(no error at all tests) = (0.95)3 = 0.857375 P(at least one mistakes) = 0.142625 : experimentwise alpha level > 0.05

The Logic and the Process of Analysis of Variance (cont'd.) ANOVA allows researcher to evaluate all of the mean differences in a single hypothesis test using a single α-level and, thereby, keeps the risk of a Type I error under control no matter how many different means are being compared. Although ANOVA can be used in a variety of different research situations, this chapter presents only independent-measures designs involving only one independent variable. i.e. 1-factor independent-measures designs

The Logic and the Process of Analysis of Variance (cont'd.)

The Logic and the Process of Analysis of Variance (cont'd.) The test statistic for ANOVA is an F-ratio, which is a ratio of two sample variances. In the context of ANOVA, the sample variances are called mean squares, or MS values. The top of the F-ratio, MSbetween, measures the size of mean differences between samples. The bottom of the ratio, MSwithin, measures the magnitude of differences that would be expected without any treatment effects.

The Logic and the Process of Analysis of Variance (cont'd.) Thus, the F-ratio has the same basic structure as the independent-measures t statistic presented in Chapter 10. obtained mean differences (including treatment effects) MSbetween F = ────────────────────────────────────── = ─────── differences expected by chance (without treatment effects) MSwithin

The Logic and the Process of Analysis of Variance (cont'd.)

The Logic and the Process of Analysis of Variance (cont'd.) A large value for the F-ratio indicates that the obtained sample mean differences are greater than would be expected if the treatments had no effect. Each of the sample variances, MS values, in the F-ratio is computed using the basic formula for sample variance: SS sample variance = S2 = ── df

The Logic and the Process of Analysis of Variance (cont'd.) To obtain the SS and df values, you must go through an analysis that separates the total variability for the entire set of data into two basic components: within-treatment variability (which will be the denominator: MSE) and between-treatment variability (which will become the numerator of the F-ratio: MST).

The Logic and the Process of Analysis of Variance (cont'd.) The two components of the F-ratio can be described as follows: Within-Treatments Variability: MSwithin measures the size of the differences that exist inside each of the samples. (MSE) Because all the individuals in a sample receive exactly the same treatment, any differences (or variance) within a sample cannot be caused by different treatments.

The Logic and the Process of Analysis of Variance (cont'd.) Thus, these differences are caused by only one source: (MSE) Chance or Error: The unpredictable differences that exist between individual scores are not caused by any systematic factors and are simply considered to be random chance or error.

The Logic and the Process of Analysis of Variance (cont'd.) Between-Treatments Variability: MSbetween measures the size of the differences between the sample means. For example, suppose that three treatments, each with a sample of n = 5 subjects, have means of M1 = 1, M2 = 2, and M3 = 3. (MST) Notice that the three means are different; that is, they are variable.

The Logic and the Process of Analysis of Variance (cont'd.) By computing the variance for the three means we can measure the size of the differences. Although it is possible to compute a variance for the set of sample means, it usually is easier to use the total, T, for each sample instead of the mean, and compute variance for the set of T values.

The Logic and the Process of Analysis of Variance (cont'd.) Logically, the differences (or variance) between means can be caused by two sources: Treatment Effects: If the treatments have different effects, this could cause the mean for one treatment to be higher (or lower) than the mean for another treatment. Chance or Sampling Error: If there is no treatment effect at all, you would still expect some differences between samples. Mean differences from one sample to another are an example of random, unsystematic sampling error.

The Logic and the Process of Analysis of Variance (cont'd.) Considering these sources of variability, the structure of the F-ratio becomes: treatment effect + random differences F = ────────────────────── random differences

The Logic and the Process of Analysis of Variance (cont'd.) When the null hypothesis is true and there are no differences between treatments, the F-ratio is balanced. That is, when the "treatment effect" is zero, the top and bottom of the F-ratio are measuring the same variance. In this case, you should expect an F-ratio near 1.00. When the sample data produce an F-ratio near 1.00, we will conclude that there is no significant treatment effect.

The Logic and the Process of Analysis of Variance (cont'd.) On the other hand, a large treatment effect will produce a large value for the F-ratio. Thus, when the sample data produce a large F-ratio we will reject the null hypothesis and conclude that there are significant differences between treatments. To determine whether an F-ratio is large enough to be significant, you must select an α-level, find the df values for the numerator and denominator of the F-ratio, and consult the F-distribution table to find the critical value.

The Logic and the Process of Analysis of Variance (cont'd.)

p. 397 SSTotal = SST + SSE k treatments: j = 1,2,...,k, N = Σnj , G = ΣTj , Tj = ΣiXij SSTotal = ΣX2 - (G2/N) SST = Σ(T2j/nj ) - (G2/N)  MST = SST/(k-1) SSE = SSTotal – SST  MSE = SSE/(N-k) SSE = Σ SSj  SST = SSTotal - SSE F = MST / MSE

Analysis of Variance and Post Tests The null hypothesis for ANOVA states that for the general population there are no mean differences among the treatments being compared; H0: μ1 = μ2 = μ3 = . . . When the null hypothesis is rejected, the conclusion is that there are significant mean differences. However, the ANOVA simply establishes that differences exist, it does not indicate exactly which treatments are different.

ANOVA table Source SS df MS F Between SST k-1 MST Within SSE N-k MSE Total SSTotal N-1

Characteristics of a F-Distribution LO12-1 LO12-1 Characteristics of a F-Distribution There is a “family” of F- distributions. A particular member of the family is determined by two parameters: the degrees of freedom in the numerator and the degrees of freedom in the denominator. The F-distribution is continuous. A F-value cannot be negative i.e.:F>0. The F-distribution is positively skewed. It is asymptotic. As F  , the curve approaches the X-axis but never touches it. 12-*

F has 2 df, (df1:numerator,df2 :denominator), when df↑F is less skewed F(df1,df2) (30,30) (12,12) (5,5) F–distribution

F - distribution 1/2 F1-α(df1, df2) Fα(df1, df2) Figure: Fα and F1-α

F - distribution 2/2 F1-α(df1,df2) : F-value with (df1,df2) and type I error =1-α , i.e. F1-α : (10-30) Note:F(df1, df2) and F(df2, df1) are not from the same F distribution, i.e. F(df1, df2) and F(df2, df1) cannot be shown in the same distribution. Notice that in the previous ppt: Fα and F1-α have the same (df1, df2)

Measuring Effect Size for an Analysis of Variance As with other hypothesis tests, an ANOVA evaluates the significance of the sample mean differences; that is, are the differences bigger than would be reasonable to expect just by chance. With large samples, however, it is possible for relatively small mean differences to be statistically significant. Thus, the hypothesis test does not necessarily provide information about the actual size of the mean differences.

F test k ↑  df1 ↑  F* ↓  more likely to reject H0 N ↑  df2 ↑  F* ↓  more likely to reject H0

Measuring Effect Size for an Analysis of Variance (cont'd.) To supplement the hypothesis test, it is recommended that you calculate a measure of effect size. For an analysis of variance the common technique for measuring effect size is to compute the percentage of variance that is accounted for by the treatment effects.

Measuring Effect Size for an Analysis of Variance (cont'd.) For the t statistics, this percentage was identified as r2, but in the context of ANOVA the percentage is identified as η2 (the Greek letter eta, squared). The formula for computing effect size is: SSbetween treatments η2 = ─────────── SStotal

Post Hoc Tests With more than two treatments, this creates a problem. Specifically, you must follow the ANOVA with additional tests, called post hoc tests, to determine exactly which treatments are different and which are not. The Tukey’s HSD and Scheffé test are examples of post hoc tests. These tests are done after an ANOVA where H0 is rejected with more than two treatment conditions. The tests compare the treatments, two at a time, to test the significance of the mean differences.