Chi-square Basics.

Slides:



Advertisements
Similar presentations
Contingency Table Analysis Mary Whiteside, Ph.D..
Advertisements

Chapter 11 Other Chi-Squared Tests
 2 test for independence Used with categorical, bivariate data from ONE sample Used to see if the two categorical variables are associated (dependent)
Chi-square A very brief intro. Distinctions The distribution The distribution –Chi-square is a probability distribution  A special case of the gamma.
Chi square.  Non-parametric test that’s useful when your sample violates the assumptions about normality required by other tests ◦ All other tests we’ve.
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Basic Statistics The Chi Square Test of Independence.
The Chi-Square Test for Association
Chi-square Basics. The Chi-square distribution Positively skewed but becomes symmetrical with increasing degrees of freedom Mean = k where k = degrees.
Chi Square Test Dealing with categorical dependant variable.
1 SOC 3811 Basic Social Statistics. 2 Reminder  Hand in your assignment 5  Remember to pick up your previous homework  Final exam: May 12 th (Saturday),
Presentation 12 Chi-Square test.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
Phi Coefficient Example A researcher wishes to determine if a significant relationship exists between the gender of the worker and if they experience pain.
Week 10 Chapter 10 - Hypothesis Testing III : The Analysis of Variance
1 Measuring Association The contents in this chapter are from Chapter 19 of the textbook. The crimjust.sav data will be used. cjsrate: RATE JOB DONE: CJ.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
1 Chi-Square Heibatollah Baghi, and Mastee Badii.
Chi-square test Chi-square test or  2 test Notes: Page Goodness of Fit 2.Independence 3.Homogeneity.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
CHI SQUARE TESTS.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Chapter Outline Goodness of Fit test Test of Independence.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chi Square & Correlation
Chapter 13. The Chi Square Test ( ) : is a nonparametric test of significance - used with nominal data -it makes no assumptions about the shape of the.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Chi Square Test for Goodness of Fit Determining if our sample fits the way it should be.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Chi Square Test Dr. Asif Rehman.
Comparing Counts Chi Square Tests Independence.
Cross Tabulation with Chi Square
Basic Statistics The Chi Square Test of Independence.
The Chi-square Statistic
Other Chi-Square Tests
Chi-Square (Association between categorical variables)
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chi-Square hypothesis testing
Presentation 12 Chi-Square test.
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Chapter 11 Chi-Square Tests.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Chi-square test or c2 test
Extra Brownie Points! Lottery To Win: choose the 5 winnings numbers from 1 to 49 AND Choose the "Powerball" number from 1 to 42 What is the probability.
Hypothesis Testing Review
Qualitative data – tests of association
1) A bicycle safety organization claims that fatal bicycle accidents are uniformly distributed throughout the week. The table shows the day of the week.
Chapter 11 Goodness-of-Fit and Contingency Tables
The Chi-Square Distribution and Test for Independence
Different Scales, Different Measures of Association
Chi Square Two-way Tables
Part IV Significantly Different Using Inferential Statistics
Contingency Tables: Independence and Homogeneity
Statistical Analysis Chi-Square.
Chapter 13 Goodness-of-Fit Tests and Contingency Analysis
Chi-square test or c2 test
Chapter 11 Chi-Square Tests.
Analyzing the Association Between Categorical Variables
Parametric versus Nonparametric (Chi-square)
Fundamental Statistics for the Behavioral Sciences, 4th edition
Chapter 13 Goodness-of-Fit Tests and Contingency Analysis
CLASS 6 CLASS 7 Tutorial 2 (EXCEL version)
Chapter 11 Chi-Square Tests.
Presentation transcript:

Chi-square Basics

The Chi-square distribution Positively skewed but becomes symmetrical with increasing degrees of freedom Mean = k where k = degrees of freedom Variance = 2k Assuming a normally distributed dataset and sampling a single z2 value at a time 2(1) = z2 If more than one… 2(N) =

Why used? Chi-square analysis is primarily used to deal with categorical (frequency) data We measure the “goodness of fit” between our observed outcome and the expected outcome for some variable With two variables, we test in particular whether they are independent of one another using the same basic approach.

One-dimensional Suppose we want to know how people in a particular area will vote in general and go around asking them. How will we go about seeing what’s really going on? Republican Democrat Other 20 30 10

Hypothesis: Dems should win district Solution: chi-square analysis to determine if our outcome is different from what would be expected if there was no preference

Plug in to formula 20 30 10 Observed Expected Republican Democrat Other Observed 20 30 10 Expected

Reject H0 The district will probably vote democratic However…

Conclusion Note that all we really can conclude is that our data is different from the expected outcome given a situation Although it would appear that the district will vote democratic, really we can only conclude they were not responding by chance Regardless of the position of the frequencies we’d have come up with the same result In other words, it is a non-directional test regardless of the prediction

More complex What do stats kids do with their free time? 30 40 20 10 TV Nap Worry Stare at Ceiling Males 30 40 20 10 Females

Example for males TV: (100*50)/200 = 25 Is there a relationship between gender and what the stats kids do with their free time? Expected = (Ri*Cj)/N Example for males TV: (100*50)/200 = 25 TV Nap Worry Stare at Ceiling Total Males 30 40 20 10 100 Females 50 70 60 200

df = (R-1)(C-1) R = number of rows C = number of columns 30 (25) TV Nap Worry Stare at Ceiling Total Males (E) 30 (25) 40 (35) 20 (30) 10 (10) 100 Females (E) 20 (25) 30 (35) 40 (30) 50 70 60 20 200

Interpretation Reject H0, there is some relationship between gender and how stats students spend their free time

Other Important point about the non-directional nature of the test, the chi-square test by itself cannot speak to specific hypotheses about the way the results would come out Not useful for ordinal data because of this

Assumptions Normality Inclusion of non-occurences Independence Rule of thumb is that we need at least 5 for our expected frequencies value Inclusion of non-occurences Must include all responses, not just those positive ones Independence Not that the variables are independent or related (that’s what the test can be used for), but rather as with our t-tests, the observations (data points) don’t have any bearing on one another. To help with the last two, make sure that your N equals the total number of people who responded

Measures of Association Contingency coefficient Phi Cramer’s Phi Odds Ratios Kappa These were discussed in 5700