Analysis of Variance (ANOVA)

Slides:



Advertisements
Similar presentations
Multiple-choice question
Advertisements

2  How to compare the difference on >2 groups on one or more variables  If it is only one variable, we could compare three groups with multiple ttests:
Smith/Davis (c) 2005 Prentice Hall Chapter Thirteen Inferential Tests of Significance II: Analyzing and Interpreting Experiments with More than Two Groups.
Analysis and Interpretation Inferential Statistics ANOVA
Independent Sample T-test Formula
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Analysis of Variance: ANOVA. Group 1: control group/ no ind. Var. Group 2: low level of the ind. Var. Group 3: high level of the ind var.
Analysis of Variance: Inferences about 2 or More Means
Intro to Statistics for the Behavioral Sciences PSYC 1900
Chi-Square and Analysis of Variance (ANOVA) Lecture 9.
Lecture 9: One Way ANOVA Between Subjects
One-way Between Groups Analysis of Variance
Comparing Several Means: One-way ANOVA Lesson 14.
Administrata New final exam schedule: Handed out Tues, Dec 3
ANOVA Chapter 12.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
PS 225 Lecture 15 Analysis of Variance ANOVA Tables.
Part IV Significantly Different: Using Inferential Statistics
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
One-Way Analysis of Variance Comparing means of more than 2 independent samples 1.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.
One-way Analysis of Variance 1-Factor ANOVA. Previously… We learned how to determine the probability that one sample belongs to a certain population.
ANOVA (Analysis of Variance) by Aziza Munir
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.
I. Statistical Tests: A Repetive Review A.Why do we use them? Namely: we need to make inferences from incomplete information or uncertainty þBut we want.
CLASS 5 Normal Distribution Hypothesis Testing Test between means of related groups Test between means of different groups Analysis of Variance (ANOVA)
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
1 Analysis of Variance (ANOVA) Educational Technology 690.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
Research Methods and Data Analysis in Psychology Spring 2015 Kyle Stephenson.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
EDUC 200C Section 9 ANOVA November 30, Goals One-way ANOVA Least Significant Difference (LSD) Practice Problem Questions?
Chapter 13 Understanding research results: statistical inference.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
I231B QUANTITATIVE METHODS Analysis of Variance (ANOVA)
Independent Samples ANOVA. Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.The Equal Variance Assumption 3.Cumulative.
DSCI 346 Yamasaki Lecture 4 ANalysis Of Variance.
Six Easy Steps for an ANOVA 1) State the hypothesis 2) Find the F-critical value 3) Calculate the F-value 4) Decision 5) Create the summary table 6) Put.
Psych 200 Methods & Analysis
I. ANOVA revisited & reviewed
The Chi Square Test A statistical method used to determine goodness of fit Chi-square requires no assumptions about the shape of the population distribution.
Chapter 13 f distribution and 0ne-way anova
Comparing Multiple Groups:
Hypothesis Testing Review
Qualitative data – tests of association
Comparing Multiple Groups: Analysis of Variance ANOVA (1-way)
Kin 304 Inferential Statistics
What if. . . You were asked to determine if psychology and sociology majors have significantly different class attendance (i.e., the number of days a person.
Inferential Statistics
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
The Chi-Square Distribution and Test for Independence
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
Last class Tutorial 1 Census Overview
I. Statistical Tests: Why do we use them? What do they involve?
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
Inferential Stat Week 13.
Statistics for the Behavioral Sciences
The Analysis of Variance
Parametric versus Nonparametric (Chi-square)
Conceptual Understanding
How do you know if the variation in data is the result of random chance or environmental factors? O is the observed value E is the expected value.
CLASS 6 CLASS 7 Tutorial 2 (EXCEL version)
ANOVA: Analysis of Variance
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Analysis of Variance (ANOVA)

Agenda Lab Stuff Questions about Chi-Square? Intro to Analysis of Variance (ANOVA)

This Thursday: Lab 4 Final lab will be distributed on Thursday Very similar to lab 3, but with different data You will be expected to find appropriate variables for three major tests (correlation, t-test, chi-square test of independence) You will be expected to interpret the findings from each test (one short paragraph per test). We will use the first 15 minutes of class to return lab 3 and discuss common issues and questions

Example Crosstab: Gender x Student   Student Not Student Total Males 46 (40.97) 71 (76.02) 117 Females 37 (42.03) 83 (77.97) 120 154 237 Observed Expected

ANOVA

Analysis of Variance In its simplest form, it is used to compare means for three or more categories. Example: Income (metric) and Marital Status (many categories) Relies on the F-distribution Just like the t-distribution and chi-square distribution, there are several sampling distributions for each possible value of df.

What is ANOVA? If we have a categorical variable with 3+ categories and a metric/scale variable, we could just run 3 t-tests. One problem is that the 3 tests would not be independent of each other (i.e., all of the information is known). As number of comparisons grow, likelihood of some differences are expected– but do not necessarily indicate an overall difference. A better approach: compare the variability between groups (treatment variance + error) to the variability within the groups (error) Think of it as an additive probability of each t-test (even though its slightly less than this).

The F-ratio MS = mean square bg = between groups wg = within groups The numerator and denominator have their own degrees of freedom df = # of categories – 1 (k-1) In the simplest terms, it is a ratio of the overall variance between the means of all groups, over the overall variance within the groups. Named after R.A. Fisher

Interpreting the F-ratio Generally, an f-ratio is a measure of how different the means are relative to the variability within each sample Larger values  greater likelihood that the difference between means are not just due to chance alone

Null Hypothesis in ANOVA If there is no difference between the means, then the between-group sum of squares should = the within-group sum of squares. Why do we use the sum of squares? Because…whenever we have three or more numerical values, the measure of their variability is equivalent to the measure of their aggregate differences. If this were a t-test we would have just taken mean differences.

F-distribution A right-skewed distribution It is a ratio of two chi-square distributions Why is the ratio of two chi-square distributions an appropriate distribution to use? Or, what does the F-ratio have to do with chi-square? Remember, Chi-square is the sum of squared deviates for randomly distributed variables… the F-distribution is the ratio of two sum of squared deviates (or, one Chi-Square divided by another)

F-distribution F-test for ANOVA is a one-tailed test.

Visual ANOVA and f-ratio http://tinyurl.com/271ANOVA

ANOVA and t-test How do we know where the differences exist once we know that we have an overall difference between groups? t-tests become important after an ANOVA so that we can find out which pairs are significantly different (post-hoc tests). Certain ‘corrections’ can be applied to such post-hoc t-tests so that we account for multiple comparisons (e.g., Bonferroni correction, which divides p-value by the number of comparisons being made) There are many means comparisons test available (Tukey, Sidak, Bonferroni, etc). All are basically modified means comparisons.

Logic of the ANOVA Conceptual Intro to ANOVA Class Example: anova.do GSS96_small.dta