Chi-Square Non-parametric test (distribution- free) Nominal level dependent measure.

Slides:



Advertisements
Similar presentations
1 2 Test for Independence 2 Test for Independence.
Advertisements

Overview of Lecture Parametric vs Non-Parametric Statistical Tests.
Elementary Statistics
Statistics for Business and Economics
Chapter 16 Goodness-of-Fit Tests and Contingency Tables
Chi-Square and Analysis of Variance (ANOVA)
© The McGraw-Hill Companies, Inc., Chapter 12 Chi-Square.
Chapter 18: The Chi-Square Statistic
 2 test Chi-square  2 test  2  2 is the most popular discrete data hypothesis testing method. It is a non-parametric test of statistical significance.
CHI-SQUARE(X2) DISTRIBUTION
Chapter 16: Chi Square PSY —Spring 2003 Summerfelt.
Chi Square Example A researcher wants to determine if there is a relationship between gender and the type of training received. The gender question is.
Chapter 19 Chi-Square Fundamental Statistics for the Behavioral Sciences, 5th edition David C. Howell © 2003 Brooks/Cole Publishing Company/ITP.
Basic Statistics The Chi Square Test of Independence.
Hypothesis Testing IV Chi Square.
Statistical Inference for Frequency Data Chapter 16.
Analysis of frequency counts with Chi square
Chi-square Basics. The Chi-square distribution Positively skewed but becomes symmetrical with increasing degrees of freedom Mean = k where k = degrees.
© 2010 Pearson Prentice Hall. All rights reserved The Chi-Square Test of Independence.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 12 Chicago School of Professional Psychology.
Chi Square: A Nonparametric Test PSYC 230 June 3rd, 2004 Shaun Cook, ABD University of Arizona.
Chi Square Test Dealing with categorical dependant variable.
1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.7.
Phi Coefficient Example A researcher wishes to determine if a significant relationship exists between the gender of the worker and if they experience pain.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Two Way Tables and the Chi-Square Test ● Here we study relationships between two categorical variables. – The data can be displayed in a two way table.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Chi Square Classifying yourself as studious or not. YesNoTotal Are they significantly different? YesNoTotal Read ahead Yes.
Chi-Square Test James A. Pershing, Ph.D. Indiana University.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
4 normal probability plots at once par(mfrow=c(2,2)) for(i in 1:4) { qqnorm(dataframe[,1] [dataframe[,2]==i],ylab=“Data quantiles”) title(paste(“yourchoice”,i,sep=“”))}
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chi-Square Test (χ 2 ) χ – greek symbol “chi”. Chi-Square Test (χ 2 ) When is the Chi-Square Test used? The chi-square test is used to determine whether.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Basic Statistics The Chi Square Test of Independence.
The Chi-square Statistic
CHAPTER 26 Comparing Counts.
Chi-square Basics.
Chapter 9: Non-parametric Tests
Chapter 11 Chi-Square Tests.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Chapter Fifteen McGraw-Hill/Irwin
Hypothesis Testing Review
Chapter 12 Tests with Qualitative Data
Qualitative data – tests of association
What goes in a results section?
Chapter 11 Goodness-of-Fit and Contingency Tables
Chi-Square Test.
The Chi-Square Distribution and Test for Independence
Is a persons’ size related to if they were bullied
Consider this table: The Χ2 Test of Independence
Chi-Square Test.
Is a persons’ size related to if they were bullied
Chapter 10 Analyzing the Association Between Categorical Variables
Contingency Tables: Independence and Homogeneity
Chapter 11 Chi-Square Tests.
Inference on Categorical Data
Chi-Square Test.
Fundamental Statistics for the Behavioral Sciences, 4th edition
Karl L. Wuensch Department of Psychology East Carolina University
Presentation transcript:

Chi-Square Non-parametric test (distribution- free) Nominal level dependent measure

Categorical Variables Generally the count of objects falling in each of several categories.Generally the count of objects falling in each of several categories. Examples:Examples: Xnumber of fraternity, sorority, and nonaffiliated members of a class Xnumber of students choosing answers: 1, 2, 3, 4, or 5 Emphasis on frequency in each categoryEmphasis on frequency in each category

One-way Classification Observations sorted on only one dimensionObservations sorted on only one dimension Example:Example: XObserve children and count red, green, yellow, or orange Jello choices XAre these colors chosen equally often, or is there a preference for one over the other? Cont.

One-way--cont. Want to compare observed frequencies with frequencies predicted by null hypothesisWant to compare observed frequencies with frequencies predicted by null hypothesis Chi-square test used to compare expected and observedChi-square test used to compare expected and observed Called goodness-of-fit chi-square ( 2 ) Called goodness-of-fit chi-square ( 2 )

Goodness-of-Fit Chi-square Fombonne (1989) Season of birth and childhood psychosisFombonne (1989) Season of birth and childhood psychosis Are children born at particular times of year more likely to be diagnosed with childhood psychosisAre children born at particular times of year more likely to be diagnosed with childhood psychosis He knew the % normal children born in each monthHe knew the % normal children born in each month Xe.g..8.4% normal children born in January

Fombonnes Data

Chi-Square ( 2 ) Compare Observed (O) with Expected (E)Compare Observed (O) with Expected (E) Take size of E into accountTake size of E into account XWith large E, a large (O-E) is not unusual. XWith small E, a large (O-E) is unusual.

Calculation of (11) = 19.68

Conclusions Obtained 2 = 14.58Obtained 2 = df = c - 1, where c = # categoriesdf = c - 1, where c = # categories Critical value of 2 on 11 df = 19.68Critical value of 2 on 11 df = Since > 14.58, do not reject H 0Since > 14.58, do not reject H 0 Conclude that birth month distribution of children with psychoses doesnt differ from normal.Conclude that birth month distribution of children with psychoses doesnt differ from normal.

Jello Choices Red GreenYellowOrangeRed GreenYellowOrange Is there a significant preference for one color of jello over other colors?Is there a significant preference for one color of jello over other colors?

RedGreen YellowOrangeRedGreen YellowOrange O: E: X 2 = (35-25) 2 /25 + (20-25) 2 /25 + (25-25) 2 /25 + (20-25) 2 /25= 6 There was not one jello color chosen significantly more often than any other jello color, X 2 (3, N= 100) = 6, p >.05

Contingency Tables Two independent variablesTwo independent variables XAre men happier than women? Male vs. Female X Happy vs Not HappyMale vs. Female X Happy vs Not Happy XIntimacy (Yes/No) X Depression/Nondepression

Intimacy and Depression Everitt & Smith (1979)Everitt & Smith (1979) Asked depressed and non-depressed women about intimacy with boyfriend/husbandAsked depressed and non-depressed women about intimacy with boyfriend/husband Data on next slideData on next slide

Data

Chi-Square on Contingency Table Same formulaSame formula Expected frequenciesExpected frequencies XE = RT X CT GT RT = Row total, CT = Column total, GT = Grand totalRT = Row total, CT = Column total, GT = Grand total

Expected Frequencies E 11 = 37*138/419 = 12.19E 11 = 37*138/419 = E 12 = 37*281/419 = 24.81E 12 = 37*281/419 = E 21 = 382*138/419 = E 21 = 382*138/419 = E 22 = 382*281/419 = E 22 = 382*281/419 = Enter on following tableEnter on following table

Observed and Expected Freq.

Chi-Square Calculation

Degrees of Freedom For contingency table, df = (R - 1)(C - 1)For contingency table, df = (R - 1)(C - 1) For our example this is (2 - 1)(2 - 1) = 1For our example this is (2 - 1)(2 - 1) = 1 XNote that knowing any one cell and the marginal totals, you could reconstruct all other cells.

Conclusions Since > 3.84, reject H 0Since > 3.84, reject H 0 Conclude that depression and intimacy are not independent.Conclude that depression and intimacy are not independent. XHow one responds to satisfaction with intimacy depends on whether they are depressed. XCould be depression-->dissatisfaction, lack of intimacy --> depression, depressed people see world as not meeting needs, etc.

Larger Contingency Tables Jankowski & Leitenberg(pers. comm.)Jankowski & Leitenberg(pers. comm.) XDoes abuse continue? Do adults who are, and are not, being abused differ in childhood history of abuse?Do adults who are, and are not, being abused differ in childhood history of abuse? One variable = adult abuse (yes or no)One variable = adult abuse (yes or no) Other variable = number of abuse categories (out of 4) suffered as childrenOther variable = number of abuse categories (out of 4) suffered as children XSexual, Physical, Alcohol, or Personal violence

Chi-Square Calculation

Conclusions > > 7.82 XReject H 0 XConclude that adult abuse is related to childhood abuse XIncreasing levels of childhood abuse are associated with greater levels of adult abuse. e.g. Approximately 10% of nonabused children are later abused as adults.e.g. Approximately 10% of nonabused children are later abused as adults. Cont.

Nonindependent Observations We require that observations be independent.We require that observations be independent. XOnly one score from each respondent XSum of frequencies must equal number of respondents If we dont have independence of observations, test is not valid.If we dont have independence of observations, test is not valid.

Small Expected Frequencies Rule of thumb: E > 5 in each cellRule of thumb: E > 5 in each cell XNot firm rule XViolated in earlier example, but probably not a problem More of a problem in tables with few cells.More of a problem in tables with few cells. Never have expected frequency of 0.Never have expected frequency of 0. Collapse adjacent cells if necessary.Collapse adjacent cells if necessary. Cont.

Expected Frequencies--cont. More of a problem in tables with few cells.More of a problem in tables with few cells. Never have expected frequency of 0.Never have expected frequency of 0. Collapse adjacent cells if necessary.Collapse adjacent cells if necessary.

Effect Size Phi and Cramers PhiPhi and Cramers Phi XDefine N and k XNot limited to 2X2 tables Cont.

Effect Sizecont. Everitt & cc dataEveritt & cc data Cont.

Effect SizeOdss Ratio. Odds Dep|Lack IntimacyOdds Dep|Lack Intimacy X26/112 =.232 Odds Dep | No LackOdds Dep | No Lack X11/270 =.041 Odds Ratio =.232/.041 = 5.69Odds Ratio =.232/.041 = 5.69 Odds Depressed = 5.69 times great if experiencing lack of intimacy.Odds Depressed = 5.69 times great if experiencing lack of intimacy.

Effect SizeRisk Ratio. Risk Depression/Lack IntimacyRisk Depression/Lack Intimacy X26/138 =.188 Risk Depression | No LackRisk Depression | No Lack X11/281 =.039 Odds Ratio =.188/.039 = 4.83Odds Ratio =.188/.039 = 4.83 Risk of Depressed = 4.83 times greater if experiencing lack of intimacy.Risk of Depressed = 4.83 times greater if experiencing lack of intimacy.