Chi-Square and Analysis of Variance (ANOVA) Lecture 9.



The Chi-Square Distribution and Test for Independence
Hypothesis testing between two or more categorical variables

Chi-square Test of Independence
Tests the association between two nominal (categorical) variables.
Null hypothesis: the two variables are independent.
It is really just a comparison between the expected frequencies and the observed frequencies in the cells of a crosstabulation table.

Example crosstab: gender x binary question (expected frequencies in parentheses)

          Yes          No           Total
Males     46 (40.97)   71 (76.03)   117
Females   37 (42.03)   83 (77.97)   120
Total     83           154          237
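The expected frequencies in the crosstab can be reproduced from the row and column totals. The following is a minimal sketch in plain Python, assuming only the four observed counts from the example; the function names are illustrative and not from the lecture.

```python
# Observed counts from the example crosstab
# (rows: Males, Females; columns: Yes, No).
observed = [[46, 71],
            [37, 83]]

def expected_frequencies(table):
    """Expected count per cell under independence: row total * column total / N."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]

def chi_square_statistic(table):
    """Sum of (observed - expected)^2 / expected over all cells."""
    expected = expected_frequencies(table)
    return sum((o - e) ** 2 / e
               for obs_row, exp_row in zip(table, expected)
               for o, e in zip(obs_row, exp_row))

expected = expected_frequencies(observed)
chi2 = chi_square_statistic(observed)

print([[round(e, 2) for e in row] for row in expected])
# [[40.97, 76.03], [42.03, 77.97]]
print(round(chi2, 2))
# 1.87
```

With df = 1, the critical value at the .05 level is 3.841, so a statistic of about 1.87 would not lead us to reject the null hypothesis of independence for this table.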

Degrees of freedom
Chi-square degrees of freedom: df = (r-1)(c-1), where r = # of rows and c = # of columns.
Thus, in any 2x2 contingency table, the degrees of freedom = 1.
As the degrees of freedom increase, the distribution shifts to the right and the critical values of chi-square become larger.
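As a small illustration of the df rule, the sketch below computes the degrees of freedom for a few table shapes and looks up the .05 critical value. The critical values are copied from a standard chi-square table; the dictionary and function names are assumptions made for this example.

```python
# Critical values of chi-square at alpha = 0.05, copied from a standard table.
CRITICAL_05 = {1: 3.841, 2: 5.991, 3: 7.815, 4: 9.488}

def chi_square_df(n_rows, n_cols):
    """Degrees of freedom for a crosstab: (r - 1) * (c - 1)."""
    return (n_rows - 1) * (n_cols - 1)

for shape in [(2, 2), (2, 3), (3, 3)]:
    df = chi_square_df(*shape)
    print(shape, "df =", df, "critical value =", CRITICAL_05[df])
```

The printed critical values grow with df, which matches the statement that the distribution shifts to the right as the degrees of freedom increase.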

Chi-Square Distribution
The chi-square distribution results when independent variables with standard normal distributions are squared and summed. The number of variables summed is the degrees of freedom.
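This definition can be checked by simulation. The sketch below is a hedged illustration: it draws standard normal values with Python's `random` module, squares and sums them, and compares the sample mean with the degrees of freedom (the mean of a chi-square distribution equals its df). The function name, seed, and sample size are arbitrary choices for the example.

```python
import random

def simulate_chi_square(df, n_draws, seed=0):
    """Each draw is the sum of `df` squared standard normal values."""
    rng = random.Random(seed)
    return [sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(df))
            for _ in range(n_draws)]

draws = simulate_chi_square(df=3, n_draws=20000)
mean = sum(draws) / len(draws)

# The sample mean should land close to the degrees of freedom (3 here).
print(round(mean, 2))
```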

Requirements for Chi-Square test
Must be a random sample from the population
Data must be in raw frequencies (counts, not percentages)
Observations must be independent of one another
Categories for each variable must be mutually exclusive and exhaustive

Using the Chi-Square Test
Often used with contingency tables (i.e., crosstabulations), e.g., gender x race.
Basically, the chi-square test of independence tests whether the columns are contingent on the rows in the table.
In this case, the null hypothesis is that there is no relationship between row and column frequencies.

Practical Example: Expected frequencies versus observed frequencies
General Social Survey Example

ANOVA and the F-distribution
Hypothesis testing between a categorical variable with 3+ categories and a metric variable

Analysis of Variance
In its simplest form, it is used to compare means for three or more categories.
Example: Life Happiness scale and Marital Status (married, never married, divorced)
Relies on the F-distribution.
Just like the t-distribution and chi-square distribution, there is a different sampling distribution for each possible value of df.

What is ANOVA?
If we have a categorical variable with 3+ categories and a metric/scale variable, we could just run 3 t-tests.
The problem is that the 3 tests would not be independent of each other, since each group's data is reused in two of the comparisons, and running several tests inflates the overall Type I error rate.
A better approach: compare the variability between groups (treatment variance + error) to the variability within the groups (error).

The F-ratio
F = MS_bg / MS_wg
MS = mean square, bg = between groups, wg = within groups
Numerator is the “effect” and denominator is the “error”
Numerator df = # of categories - 1 (k - 1); denominator df = # of cases - # of categories (N - k)

Between-Group Sum of Squares (Numerator)
Between-group sum of squares = total variability - residual variability
Total variability is quantified as the sum of the squares of the differences between each value and the grand mean. Also called the total sum-of-squares.
Variability within groups is quantified as the sum of squares of the differences between each value and its group mean. Also called the residual sum-of-squares.
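The sums of squares and the F-ratio can be worked through on a small data set. The sketch below uses made-up happiness scores for three marital-status groups; the numbers are hypothetical and chosen only so the arithmetic is easy to follow, and the function name is illustrative.

```python
# Hypothetical happiness scores (invented for illustration, not survey data).
groups = {
    "married":       [8, 9, 7, 8],
    "never married": [6, 7, 5, 6],
    "divorced":      [4, 5, 3, 4],
}

def one_way_anova(groups):
    """Return SS_total, SS_within, SS_between, and the F-ratio."""
    all_values = [x for values in groups.values() for x in values]
    grand_mean = sum(all_values) / len(all_values)

    # Total sum of squares: each value versus the grand mean.
    ss_total = sum((x - grand_mean) ** 2 for x in all_values)

    # Within-group (residual) sum of squares: each value versus its group mean.
    ss_within = 0.0
    for values in groups.values():
        group_mean = sum(values) / len(values)
        ss_within += sum((x - group_mean) ** 2 for x in values)

    # Between-group sum of squares = total - residual.
    ss_between = ss_total - ss_within

    k = len(groups)        # number of categories
    n = len(all_values)    # number of cases
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n - k)
    return ss_total, ss_within, ss_between, ms_between / ms_within

ss_total, ss_within, ss_between, f_ratio = one_way_anova(groups)
print(ss_total, ss_within, ss_between, f_ratio)
# 38.0 6.0 32.0 24.0
```

Here the group means are 8, 6, and 4, so most of the total variability lies between groups, and the F-ratio is far above 1.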

Null Hypothesis in ANOVA
If there is no difference between the means, then the between-group mean square should be about equal to the within-group mean square, so the F-ratio should be close to 1.

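F-distribution
F-test is always a one-tailed test. Why?
Because F is a ratio of two variance estimates, it cannot be negative, and only large values of F count as evidence against the null hypothesis.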

Logic of the ANOVA
Conceptual Intro to ANOVA

Bringing it all together: Choosing the appropriate bivariate statistic

Reminder About Causality
Remember from earlier lectures: bivariate statistics do not test causal relationships; they only show that there is a relationship.
Even if you plan to use more sophisticated causal tests, you should always run simple bivariate statistics on your key variables to understand their relationships.

Choosing the Appropriate Statistical Test
General rules for choosing a bivariate test:
Two categorical variables: Chi-square (crosstabulations)
Two metric variables: Correlation
One 3+ categorical variable, one metric variable: ANOVA
One binary categorical variable, one metric variable: T-test
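The rules above can be written as a small lookup. This is only a sketch of the lecture's rule of thumb; the function name, the "categorical"/"metric" labels, and the `n_categories` parameter are assumptions introduced for the example.

```python
def choose_bivariate_test(kind_a, kind_b, n_categories=None):
    """Pick a bivariate test from the variable types.

    kind_a, kind_b: "categorical" or "metric".
    n_categories: number of categories of the categorical variable,
                  needed only when one variable is categorical and one is metric.
    """
    kinds = sorted([kind_a, kind_b])
    if kinds == ["categorical", "categorical"]:
        return "chi-square"
    if kinds == ["metric", "metric"]:
        return "correlation"
    if kinds == ["categorical", "metric"]:
        if n_categories is None or n_categories < 2:
            raise ValueError("n_categories must be 2 or more")
        # Binary grouping variable: t-test; three or more groups: ANOVA.
        return "t-test" if n_categories == 2 else "ANOVA"
    raise ValueError("kinds must be 'categorical' or 'metric'")

print(choose_bivariate_test("categorical", "categorical"))            # chi-square
print(choose_bivariate_test("metric", "metric"))                      # correlation
print(choose_bivariate_test("metric", "categorical", n_categories=3)) # ANOVA
print(choose_bivariate_test("categorical", "metric", n_categories=2)) # t-test
```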

Assignment #2
Online (course website)
Due next Monday in class (April 10th)