An explanation of the Chi-Square Test for Independence Jeffrey Marks Bhavisha Talsania California State University San Marcos.

Slides:



Advertisements
Similar presentations
What is Chi-Square? Used to examine differences in the distributions of nominal data A mathematical comparison between expected frequencies and observed.
Advertisements

CHI-SQUARE(X2) DISTRIBUTION
Bivariate Analysis Cross-tabulation and chi-square.
Statistical Inference for Frequency Data Chapter 16.
Chapter 13: The Chi-Square Test
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
CJ 526 Statistical Analysis in Criminal Justice
Chi Square Test Dealing with categorical dependant variable.
Chi-square Test of Independence
Statistics 303 Chapter 9 Two-Way Tables. Relationships Between Two Categorical Variables Relationships between two categorical variables –Depending on.
Crosstabs and Chi Squares Computer Applications in Psychology.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
1 of 27 PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2013, Michael Kalsher Michael J. Kalsher Department of Cognitive Science Adv. Experimental.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, November 19 Chi-Squared Test of Independence.
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.7.
1 Inference for Categorical Data William P. Wattles, Ph. D. Francis Marion University.
CJ 526 Statistical Analysis in Criminal Justice
Week 10 Chapter 10 - Hypothesis Testing III : The Analysis of Variance
Tests of Significance June 11, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Two Variable Statistics
1 Chi-Square Heibatollah Baghi, and Mastee Badii.
Chi-Square. All the tests we’ve learned so far assume that our data is normally distributed z-test t-test We test hypotheses about parameters of these.
Two Way Tables and the Chi-Square Test ● Here we study relationships between two categorical variables. – The data can be displayed in a two way table.
Chapter 26 Chi-Square Testing
Lecture 8 Chi-Square STAT 3120 Statistical Methods I.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Essential Statistics Chapter 161 Review Part III_A_Chi Z-procedure Vs t-procedure.
Chi-square Test of Independence
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Reasoning in Psychology Using Statistics Psychology
Inferential Statistics. Coin Flip How many heads in a row would it take to convince you the coin is unfair? 1? 10?
4 normal probability plots at once par(mfrow=c(2,2)) for(i in 1:4) { qqnorm(dataframe[,1] [dataframe[,2]==i],ylab=“Data quantiles”) title(paste(“yourchoice”,i,sep=“”))}
Chapter Outline Goodness of Fit test Test of Independence.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 12. The Chi-Square Test.
Non-parametric Tests e.g., Chi-Square. When to use various statistics n Parametric n Interval or ratio data n Name parametric tests we covered Tuesday.
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
Inferential Statistics. Explore relationships between variables Test hypotheses –Research hypothesis: a statement of the relationship between variables.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
ContentFurther guidance  Hypothesis testing involves making a conjecture (assumption) about some facet of our world, collecting data from a sample,
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.
Bullied as a child? Are you tall or short? 6’ 4” 5’ 10” 4’ 2’ 4”
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Ch 13: Chi-square tests Part 2: Nov 29, Chi-sq Test for Independence Deals with 2 nominal variables Create ‘contingency tables’ –Crosses the 2 variables.
Statistics 300: Elementary Statistics Section 11-3.
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
Introduction to Marketing Research
Chi-Square (Association between categorical variables)
Hypothesis Testing Review
Qualitative data – tests of association
The Chi-Square Distribution and Test for Independence
Is a persons’ size related to if they were bullied
Consider this table: The Χ2 Test of Independence
Inference for Categorical Data
Reasoning in Psychology Using Statistics
Reasoning in Psychology Using Statistics
Chapter 26 Comparing Counts.
Reasoning in Psychology Using Statistics
Presentation transcript:

An explanation of the Chi-Square Test for Independence Jeffrey Marks Bhavisha Talsania California State University San Marcos

Research Question: Do veteran students take the same majors as non-veteran students? Special Considerations: --Focus on STEM majors (will help us group the variable). --Results must be understandable and explainable.

Data and Choice of Statistical Tests Data Type: Veteran Status and Major are Categorical. Major data grouped into college categories: (STEM, Social Sciences, Arts & Humanities, Business, Health and Education. Would like to see if a relationship exists between two categorical variables (nominal). Pearson’s Chi-square test for Independence.

Karl Pearson ( ) Credited with Establishing Mathematical Statistics. Linear Regression, Correlation, Standard Deviation, Kurtosis. Classification of Probability Distributions. Chi-Square Distribution Rediscovery Introduced in July 1900.

Karl Pearson, 1890 and 1910

Descriptive Statistics Results Veterans tend to be male and transfer students. Freshman (3) and Postbacc veterans (11) excluded in the analysis. Choice of comparison group data: Student status (freshman vs. transfer) important, gender not. VetStatsF2013.xlsx

Chi-square Test for Independence Overview and Setup Hypotheses: H o : Veteran Status is Independent of Major. (No significant relationship between major and veteran Status). H a : Veteran Status is NOT Independent of Major. (A significant relationship exists between major and veteran status, meaning veterans take different majors than non-veterans). P-value is compared to Alpha or Calculated χ² is compared to a Critical χ² Cutoff Value from a table and determines if we reject or fail to reject Ho. Degrees of Freedom = (#rows-1)(#columns-1).

Chi Square Distribution, Statistic

Chi-Square Setup Table setup: ChiSQMajorData.xlsxChiSQMajorData.xlsx These are the Observed or Actual Values Row totals Column Totals Grand Total Expected Values for Each Cell: Row Total x Column Total Grand Total

Chi-Square Considerations Are the Data Accurate? Can You Independently Verify? 80% Rule for Expected Values. Nature/Extent of of Relationship Not Clear. When Sample Sizes Differ, Size of χ² not comparable. Hard to Compare Tables of Different Dimensions. Cramer’s V attempts to Adjust for the Above. Effect Size:.10 Low,.30 Medium,.50 Large effect size is a quantitative measure of the strength of a phenomenon.

χ² Calculation For each Cell, (8 in both Examples) we do a calculation. For All F13 Transfer Students Cell 1,1 (Row 1, Column 1): χ² = ( )² = ∑ χ² = = , this is the Chi-Square Calculated Statistic.

χ² Calculation, continued. Calculated χ² for all F13 Transfer Students: Calculated χ² for all New F13 Transfer Students: ∑ χ² = = 9.035, this is the Chi-Square Calculated Statistic. χ² Degrees of Freedom = (row-1)(column-1) = (1)(3)=3 3 df in χ² Table with Alpha =.05 gives us

Results Compare the Calculated χ² value vs. the Critical χ², Alternately compare the p-Value vs. Alpha. If the Calculated Value Exceeds the Critical Value we Reject H o. If the p-value is < Alpha (.05), we reject the H o. In this case, if we Reject H o, then there is a significant relationship between Veteran Status and Major. The expected and observed values were far enough apart to calculate a large χ² statistic which exceeds the critical value. Results are just an example- Do for different groups, longitudinally, compare different semesters.

Results, All Transfer Students For All Transfer Students, Calculated χ² = Critical χ² = Therefore we reject the H o that Veteran Status is Independent of Major. We can say that the variables are NOT Independent, therefore a statistically significant relationship exists between Major and Veteran Status. Veterans choose different majors this semester. χ² (3, N=4177) = , p <.05.

Results, New Transfer Students For NEW Transfer Students, Calculated χ² = Critical χ² = Therefore we reject the H o that Veteran Status is Independent of Major. We can say that the variables are NOT Independent, therefore a statistically significant relationship exists between Major and Veteran Status. Veterans choose different majors this semester. χ² (3, N=1614) = 9.035, p <.05.

SPSS Results, All Transfers To do Chi-Square in SPSS use the Crosstabs function. Analyze-Descriptives-CrossTabs Be sure to select under Statistics: Chi-Square and Cramer’s V. Results of Veteran Status vs. Major, All Fall 2013 Transfer Students ValuedfSignificance 2-sided Pearson Chi-Square Cramer’s V N of Valid Cases4177

SPSS Results, New Transfers G:\2014Pres\ChiSqMajVVet.spv ValuedfSignificance 2-sided Pearson Chi-Square Cramer’s V N of Valid Cases1614

What do we do with the Results? Potential Issues: How are major changes tracked? Is there a lag? Data Correctness and Completeness. STEM Center Faculty– very interested. Veterans Coordinator- present at meeting, she can use to help veterans get jobs.. Share with Colleagues.

Questions or Comments? How do you share your findings with others? Who do you share them with? Data concerns or considerations?

Pearson, 1930