© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 12: Analysis of Variance: Differences among Means of Three or More Groups.

Slides:



Advertisements
Similar presentations
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Advertisements

ANOVA: Analysis of Variance
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
ANOVA Analysis of Variance: Why do these Sample Means differ as much as they do (Variance)? Standard Error of the Mean (“variance” of means) depends upon.
Statistics Are Fun! Analysis of Variance
Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Lecture 9: One Way ANOVA Between Subjects
Chapter 9 Hypothesis Testing.
Statistical Methods in Computer Science Hypothesis Testing II: Single-Factor Experiments Ido Dagan.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Introduction to Analysis of Variance (ANOVA)
Correlation and Linear Regression
Leedy and Ormrod Ch. 11 Gray Ch. 14
Example of Simple and Multiple Regression
Statistics: A Tool For Social Research
PS 225 Lecture 15 Analysis of Variance ANOVA Tables.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Copyright © 2012 by Nelson Education Limited. Chapter 8 Hypothesis Testing II: The Two-Sample Case 8-1.
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.
Sociology 5811: Lecture 14: ANOVA 2
© Copyright McGraw-Hill CHAPTER 12 Analysis of Variance (ANOVA)
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 10. Hypothesis Testing II: Single-Sample Hypothesis Tests: Establishing the Representativeness.
Between-Groups ANOVA Chapter 12. >When to use an F distribution Working with more than two samples >ANOVA Used with two or more nominal independent variables.
Testing Hypotheses about Differences among Several Means.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 15: Correlation and Regression Part 2: Hypothesis Testing and Aspects of a Relationship.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
I. Statistical Tests: A Repetive Review A.Why do we use them? Namely: we need to make inferences from incomplete information or uncertainty þBut we want.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Chapter 13 Multiple Regression
MARKETING RESEARCH CHAPTER 18 :Correlation and Regression.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 11: Bivariate Relationships: t-test for Comparing the Means of Two Groups.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
© The McGraw-Hill Companies, Inc., Chapter 13 Analysis of Variance (ANOVA)
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
McGraw-Hill, Bluman, 7th ed., Chapter 12
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
The Analysis of Variance ANOVA
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 5. Measuring Dispersion or Spread in a Distribution of Scores.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
Chapter 9 Introduction to the Analysis of Variance Part 1: Oct. 22, 2013.
Analysis of Variance 11/6. Comparing Several Groups Do the group means differ? Naive approach – Independent-samples t-tests of all pairs – Each test doesn't.
While you wait: Enter the following in your calculator. Find the mean and sample variation of each group. Bluman, Chapter 121.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 10 Introduction to the Analysis.
MM570Sec02 Zrotowski Unit 8, Unit 8, Chapter 9 Chapter 9 1 ANOVA 1 ANOVA.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
CHAPTER 10: ANALYSIS OF VARIANCE(ANOVA) Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Chapter 12 Introduction to Analysis of Variance
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Bivariate Association. Introduction This chapter is about measures of association This chapter is about measures of association These are designed to.
I. Statistical Tests: Why do we use them? What do they involve?
Chapter 13 Group Differences
Chapter 10 Introduction to the Analysis of Variance
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 12: Analysis of Variance: Differences among Means of Three or More Groups

© 2008 McGraw-Hill Higher Education Analysis of Variance (ANOVA) ANOVA is used to compare three or more group means Instead of comparing each group mean to the others (as with a t-test), ANOVA compares each group mean to the grand mean, which is the mean for all cases in the sample

© 2008 McGraw-Hill Higher Education Main Effects In ANOVA, the difference between each group mean and the grand mean is a test effect, which are called main effects When the main effects are zero, this indicates that there are no differences among the means

© 2008 McGraw-Hill Higher Education The ANOVA Hypothesis Test For the ANOVA test, the H 0 states that the population means of the groups are equal The H 0 can also be stated as “the main effects are equal to zero,” or “there is no difference among the means”

© 2008 McGraw-Hill Higher Education The Idea Behind ANOVA ANOVA hypothesizes about differences among means, but its calculation is based on explaining variance around the grand mean E.G., suppose that the overall or “grand” mean of socioeconomic status (SES) of all household heads is 45. Urban residents, however, average 50. The 5-point difference we call the main effect of the category urban

© 2008 McGraw-Hill Higher Education The Idea Behind ANOVA (cont.) Shaneka, an urban dweller, scores 60. This is 15 SES points more than the grand mean of 45. This 15 SES points is her deviation score, the difference between her raw score and the overall mean ANOVA determines whether it is feasible to say that 5 SES points of her 15-point deviation score are due to the fact that she is an urban resident

© 2008 McGraw-Hill Higher Education The Idea Behind ANOVA (cont.) The focus with ANOVA is on explaining deviation scores Deviation scores when squared, summed, and averaged for a group of scores make up the variance. Hence the name “analysis of variance”

© 2008 McGraw-Hill Higher Education The Idea Behind ANOVA (cont.) With ANOVA we are asserting that the spread of scores is due to the main effects of the groups, as illustrated in Figure 12-2 in the text Can scores be explained by differences between group classifications? If so, then scores will cluster around group means rather than the grand mean, and this suggests a difference among means

© 2008 McGraw-Hill Higher Education The General Linear Model The general linear model is a useful framework for understanding ANOVA The general linear model states that the best prediction of an individual’s score on a dependent variable is the overall mean plus an adjustment for the effects of group membership on an independent variable

© 2008 McGraw-Hill Higher Education Applying the General Linear Model For Shaneka, the urban resident with a SES of 60, we decompose her score into 45 points for the grand mean and 5 points explained by urban resident (the main effect of urban). The remaining 10 points are unexplained error

© 2008 McGraw-Hill Higher Education Calculating ANOVA Statistics ANOVA calculations are summarized in a source table To obtain variances, we calculate three parts of the variation (or sums of squares) of the interval/ratio dependent variable and divide them by degrees of freedom

© 2008 McGraw-Hill Higher Education Sums of Squares The three types of sums of squares for ANOVA are: 1.the total sum of squares (SS T ) 2.the between-group or “explained” sum of squares (SS B ), and 3.the within-group or unexplained sum of squares (SS W )

© 2008 McGraw-Hill Higher Education Calculating the SS T The total sum of squares (SS T ) is calculated by summing the squared deviation scores for all cases The SS T is the same sum of squares calculated for the standard deviation (Chapter 5)

© 2008 McGraw-Hill Higher Education Calculating the SS B The between-group or explained sum of squares (SS B ) is calculated by squaring the main effect of each case and summing these squares The SS B is explained in the sense that it is accounted for by differences among the group means, as measured by main effects

© 2008 McGraw-Hill Higher Education Calculating the SS W The within-group or unexplained sum of squares (SS W ) is that part of the squared deviation scores that is not accounted for by main effects. It is unexplained error in the prediction of scores The SS W is most easily calculated by subtracting the between-group sum of squares from the total sum of squares

© 2008 McGraw-Hill Higher Education Calculating the Mean Square Variance (MSV) After sums of squares are computed, to account for sample size and the number of groups, these sums are divided by their degrees of freedom. The resulting variances are called mean square variances (MSV) MSW B = the mean square variance between groups MSW W = the mean square variance within groups

© 2008 McGraw-Hill Higher Education Calculating the F-Ratio Test Statistic The test statistic for ANOVA is the F-ratio statistic This is the ratio of the mean square variance between groups to the mean square variance within groups: F = MSV B / MSV W The p-value is determined using F- distribution curves, Appendix B, Tables D and E

© 2008 McGraw-Hill Higher Education When to Use the F-ratio Test In general, we use ANOVA and the F- ratio when testing a hypothesis between a nominal/ordinal independent variable with three or more categories, and an interval/ratio dependent variable ANOVA is a difference of means test and a cousin of the t-test

© 2008 McGraw-Hill Higher Education When to Use the F-ratio Test (cont.) 1. Number of variables, samples, and populations: a) One population with a single interval/ratio dependent variable, comparing means for three or more groups of a single nominal/ordinal independent variable. Each group’s sample must be representative of its subpopulation, or b) a single interval/ratio dependent variable whose mean is compared for three or more populations using representative samples

© 2008 McGraw-Hill Higher Education When to Use the F-ratio Test (cont.) 2) Sample size: generally no requirements. However, the dependent interval/ratio variable should not be highly skewed within any group sample. Moreover, range tests are unreliable unless sample sizes of groups are about equal. These restrictions are less important when group sample sizes are large

© 2008 McGraw-Hill Higher Education When to Use the F-ratio Test (cont.) 3) Variances (and standard deviations) of the groups are equal. This is the same restraint for the t-test (see equality of variances, Chapter 11)

© 2008 McGraw-Hill Higher Education Existence and Direction of the Relationship for ANOVA Existence: Determined by using the F-ratio to test the null hypothesis of equal group means Direction: Not applicable (because the independent variable is nominal)

© 2008 McGraw-Hill Higher Education Strength of the Relationship for ANOVA Strength: A strong relationship is one in which a high proportion of the total variance in the dependent interval/ratio variable is accounted for by the group variable The correlation ratio, ε 2 (epsilon squared) is a conservative measure that is unlikely to overinflate the strength of the relationship

© 2008 McGraw-Hill Higher Education Nature of the Relationship for ANOVA To assess the nature for ANOVA: 1)Make best estimates at the group level by reporting the grand mean, group means, and main effects 2)Provide examples of best estimates for individuals using the general linear model 3)Use range tests to specify which group means are significantly different from others

© 2008 McGraw-Hill Higher Education Range Tests With ANOVA, rejection of the null hypothesis merely indicates that at least two group means are significantly different Range tests determine which means differ, by establishing the range of differences between means that is statistically significant Tukey’s Highly Significant Difference (HSD) is a conservative range test, unlikely to mistakenly tell us that a difference exists when in fact it does not

© 2008 McGraw-Hill Higher Education Statistical Follies Care must be taken not to apply a group finding to individuals The “ecological fallacy,” drawing conclusions about individuals on the basis of analysis of group units, such as communities, is an extreme case of misapplying statistical findings