1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.

Presentation transcript:

1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008

2 Test for Association Used to determine whether two variables are associated (related). Both variables are categorical: they can be nominal, ordinal, or even cardinal scores divided into intervals. H0: the variables are independent. Ha: the variables are associated.

3 Example We will begin with an example where the variables are ‘type of tree’ (deciduous or evergreen) and ‘condition of tree’ (normal, diseased, or has parasites). We sample 310 trees from a forest and note both the type of each tree and its condition. Each tree must fall into one and only one of the six cells of the table (we will assume that a tree can’t both be diseased and have parasites at the same time).

4 As our variables are categorical in nature, the only thing we can really do with the data is to count how many trees fall into each category (e.g. it makes no sense to find the mean condition of the trees). The data are given below; these are our observed frequencies. Table: Observed Frequencies

5 Expected Frequencies We have our observed frequencies; next we need to determine what the frequencies would look like if H0 were true and the variables were independent (i.e. not associated). Then we can use Chi Square to see if our obtained frequencies differ significantly from the frequencies we would expect to get if H0 were true.

6 Independence If H0 is true and our variables are independent then that means that knowing in which category a tree falls in one variable is of no help in predicting in which category it falls in the other variable. In other words, if the variables are independent then knowing what type of tree it is (deciduous or evergreen) does not help us predict what condition the tree is in (normal, diseased, parasitic). And, knowing what condition the tree is in does not help us predict which type of tree it is. Let’s see what the frequencies would look like if the variables were independent.

7 First, in this table I have calculated the total number of trees that were normal (100), diseased (120), and parasitic (90), which add up to 310 (the total number of trees). I have also calculated the total number of trees that were deciduous (186) and evergreen (124) which also add up to 310.

8 Second, if we ignore the variable ‘type of tree’ we can see that overall 100 of the 310 trees were ‘normal’, so we can say that the proportion of trees that were normal is .32, or 32%. We can also see that .39 (39%) of the trees were diseased, and .29 (29%) had parasites.
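The proportions quoted on this slide can be checked directly from the column totals. The following is a minimal sketch; the variable names are illustrative and not part of the original slides.

```python
# Column totals for 'condition of tree', as given on slide 7.
condition_totals = {"normal": 100, "diseased": 120, "parasites": 90}

# Total number of trees sampled (310).
n_trees = sum(condition_totals.values())

# Proportion of all trees in each condition, ignoring 'type of tree'.
proportions = {cond: count / n_trees for cond, count in condition_totals.items()}

for cond, p in proportions.items():
    print(f"{cond}: {p:.2f}")
```

Rounded to two decimals, these are the .32, .39, and .29 used on the slide.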

9 Independence Third, this table shows what the proportions would look like if the two variables were independent. We can see that knowing which type of tree it is does not change the chances of it being normal, diseased, or parasitic.

10 Expected Frequencies Fourth, the expected frequencies are those we would expect to get in each cell if H0 were true and the variables were independent. So the next step is to use the expected proportions (repeated below) to compute the expected frequencies if H0 were true (next slide).

11 If the variables are independent then 32% of the 186 deciduous trees would be normal, and 32% of the 124 evergreen trees would be normal, and so on. These are what the frequencies would be if the variables were independent. Note the number of deciduous trees still adds up to 186 and the number of evergreen to 124.
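The expected frequencies can be reproduced from the marginal totals alone, since under independence each cell is (row total × column total) / N. This is a minimal sketch using the totals from slide 7; the dictionary layout and names are assumptions made for illustration.

```python
# Marginal totals from slide 7.
type_totals = {"deciduous": 186, "evergreen": 124}
condition_totals = {"normal": 100, "diseased": 120, "parasites": 90}
n_trees = sum(type_totals.values())  # 310

# Under independence, each expected cell count is the product of its
# two marginal totals divided by the overall N.
expected = {
    (tree_type, cond): type_total * cond_total / n_trees
    for tree_type, type_total in type_totals.items()
    for cond, cond_total in condition_totals.items()
}

for cell, freq in expected.items():
    print(cell, freq)
```

Using the unrounded proportion 100/310 rather than the rounded 32% gives exactly 60 expected normal deciduous trees (32% of 186 would give 59.52). The expected counts for each type of tree still sum to 186 and 124.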

12 Observed Frequencies (our actual data) and Expected Frequencies (if H0 true). The obtained frequencies differ somewhat from the frequencies we would expect if H0 were true. Do they differ enough to reject H0?

13 Chi Square Test for Association This is the same formula as for ‘goodness of fit’. This time we apply it to the observed and expected frequencies from each cell. The formula for degrees of freedom for the test for association is: df = (# of rows – 1)(# of cols – 1), which in this example would be: df = (3 – 1)(2 – 1) = 2.
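For reference, the goodness-of-fit formula mentioned here is Pearson's chi-square statistic. One common way to write it, together with the degrees of freedom for the test for association, is given below; the symbols f_o, f_e, r, and c are assumed notation and may differ from what the slide used.

```latex
\[
\chi^2 = \sum_{\text{cells}} \frac{(f_o - f_e)^2}{f_e},
\qquad
df = (r - 1)(c - 1)
\]
```

Here f_o and f_e are the observed and expected frequencies in a cell, and r and c are the numbers of rows and columns in the table.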

14 Chi Square Test for Association If H0 is true, the mean value of χ² equals df, which is 2 here. If H0 is false, the value of χ² is expected to be greater than 2. How large does χ² have to be to reject H0? With two degrees of freedom, χ² critical = 5.99 (at α = .05). As χ² obtained = 26.185, we easily reject H0 and conclude that there is a relationship (association) between the two variables ‘type of tree’ and ‘condition of tree’. In the standard format the results would be χ²(2)=26.185, p<.001
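The decision on this slide can be checked without statistical tables, because for exactly 2 degrees of freedom the upper-tail probability of the chi-square distribution has the closed form exp(−x/2). The sketch below relies on that special case (it does not generalize to other df) and assumes a significance level of .05, which the slide does not state explicitly; the function names are illustrative.

```python
import math


def chi2_upper_tail_df2(x):
    """P(chi-square > x) for exactly 2 degrees of freedom."""
    return math.exp(-x / 2)


def chi2_critical_df2(alpha):
    """Value of chi-square cut off by an upper tail of size alpha (df = 2 only)."""
    return -2 * math.log(alpha)


chi2_obtained = 26.185  # value reported on the slide
alpha = 0.05            # assumed significance level

critical = chi2_critical_df2(alpha)
p_value = chi2_upper_tail_df2(chi2_obtained)
reject_h0 = chi2_obtained > critical

print(f"critical value = {critical:.3f}")
print(f"p = {p_value:.2e}")
print(f"reject H0: {reject_h0}")
```

The obtained value is far beyond the critical value, and the p value is well below .001, in line with the result reported in the standard format.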

15 ‘Eyeballing’ an Association While Chi Square works with frequencies, it is not all that easy to look at a table of frequencies and guess whether the variables are associated or not. Table of Frequencies

16 ‘Eyeballing’ an Association It is much easier to view the proportions or percents. If the sample exactly fits the null hypothesis then the columns would be identical. Here are the percentages from our example. The columns are not all the same, so the variables may be associated (get a p value to make sure). Table of Percents

17 Effect Size: Cramer’s V The value of χ² obtained and its corresponding p value are affected by both the strength of the association between the two variables and the size of N, and thus are not direct indications of how strongly the two variables are associated. A measure that removes the effect of N, leaving just a measure of the strength of the relationship between the two variables, is Cramer’s V.

18 Cramer’s V The formula for computing Cramer’s V is quite simple: This will result in a value of V that is between 0 (no association between the variables) and 1 (the strongest possible association between the variables). V is a pure measure of strength of association (having removed the effect of N). By the way, why not use a formula that will result in a value between –1 and 1, as in correlation? Think about it.
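For reference, the standard definition of Cramer's V, which is presumably the formula this slide displays, can be written as follows; the symbol k is assumed notation.

```latex
\[
V = \sqrt{\frac{\chi^2}{N\,(k - 1)}},
\qquad
k = \min(\text{number of rows},\ \text{number of columns})
\]
```

Dividing by N(k − 1) scales χ² by its largest possible value for a table of that size, which is why V cannot exceed 1.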

19 Cramer’s V in our Example
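As a rough check for the tree example, V can be computed from the values reported earlier (χ² = 26.185, N = 310, a 3 × 2 table). This sketch assumes the standard definition V = sqrt(χ² / (N(k − 1))), with k the smaller of the number of rows and columns; the function name and parameters are illustrative.

```python
import math


def cramers_v(chi2, n, n_rows, n_cols):
    """Cramer's V from a chi-square value, sample size, and table dimensions."""
    k = min(n_rows, n_cols)
    return math.sqrt(chi2 / (n * (k - 1)))


# Values reported on slides 13 and 14 for the tree example.
v = cramers_v(chi2=26.185, n=310, n_rows=3, n_cols=2)
print(f"V = {v:.2f}")
```

Under these assumptions V comes out at about .29 on the 0 to 1 scale described on the previous slide.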

20 Strength of Association Examples

V=0
        a1    a2    total
b1
b2
total   60    120

V=1.00
        a1    a2    total
b1      60    0
b2      0     120
total   60    120
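The two extremes can be reproduced with a short script that computes χ² and Cramer's V directly from a table of counts. The V = 1.00 table uses the cell counts shown on the slide. The cell counts for the V = 0 table are not available here, so the values used below are hypothetical ones chosen to be exactly proportional while matching the column totals of 60 and 120. The function names are illustrative.

```python
import math


def chi_square(table):
    """Pearson chi-square statistic and N for a table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    return stat, n


def cramers_v(table):
    """Cramer's V, assuming the standard definition sqrt(chi2 / (N * (k - 1)))."""
    stat, n = chi_square(table)
    k = min(len(table), len(table[0]))
    return math.sqrt(stat / (n * (k - 1)))


# Rows are b1, b2; columns are a1, a2.
perfect_association = [[60, 0], [0, 120]]   # cell counts from the slide
no_association = [[20, 40], [40, 80]]       # hypothetical cell counts

v_perfect = cramers_v(perfect_association)
v_none = cramers_v(no_association)
print(f"V (perfect association) = {v_perfect:.2f}")
print(f"V (no association) = {v_none:.2f}")
```

In the first table, knowing the row tells you the column with certainty, so V reaches its maximum. In the second, both rows split across the columns in the same 1:2 ratio, so the observed counts equal the expected counts and V is 0.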