Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 17: Chi-Square.



Chi-Square Analyses
Chi-Square tests are used to analyze categorical (as opposed to continuous or ranked) data.
Both independent and dependent variables are on nominal scales.
Data in cells represent frequencies as opposed to measured scores on variables.

One Classification Variable: Chi-Square Goodness-of-Fit Test
Sometimes, we may be interested in determining if a specific category for a nominal variable occurs more frequently than would be expected by chance alone.
For example, are people more likely to be right-handed than left-handed? Is there a significant preference for salty as opposed to sweet or spicy snacks?
We can answer such questions by comparing observed frequencies with theoretically predicted ones.

Example
We have a sample of 99 participants and ask each of them to choose one of 3 snacks (salty, sweet, spicy).
The null hypothesis would be that no divergent preferences exist: each option is equally likely to be selected.
Expected frequencies are the number of observations expected in each category if the null is true.
Here, that means an expected frequency of 99 / 3 = 33 for each type of snack.

Example
We can then compare the actual versus predicted preferences.
Observed: 45 (Salty), 26 (Sweet), 28 (Spicy)
Expected: 33 (Salty), 33 (Sweet), 33 (Spicy)
Our task now is to determine if the deviation from expected frequencies is unlikely to represent sampling error.

Chi-Square Test
The logic of the Chi-Square test is straightforward.
For each category, we square the deviation of the observed frequency from the expected frequency and scale it by that category's expected frequency; the scaled values are then summed.
For example, if we had expected only 10 observations and found 20, that is a large discrepancy. If we had expected 100 and found 110, the same difference of 10 is much less consequential.

Chi-Square Test
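The formula on this slide did not survive extraction. The standard goodness-of-fit statistic is the sum over categories of (observed - expected)^2 / expected, and the sketch below applies it to the snack data from the example. The function name `chi_square_gof` is an illustrative choice, not something from the lecture.

```python
def chi_square_gof(observed, expected):
    """Chi-square goodness-of-fit statistic: sum of (O - E)^2 / E."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Snack example from the slides: 99 participants, 3 equally likely options.
observed = [45, 26, 28]                     # salty, sweet, spicy
n = sum(observed)                           # 99
expected = [n / len(observed)] * len(observed)  # 33 for each snack

chi2 = chi_square_gof(observed, expected)
print(round(chi2, 2))  # 6.61

# Scaling illustration from the previous slide: the same raw difference of 10
# contributes far more when the expected count is small.
print(chi_square_gof([20], [10]))    # 10.0
print(chi_square_gof([110], [100]))  # 1.0
```

The snack data give (12^2 + 7^2 + 5^2) / 33 = 218 / 33, or about 6.61.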

Is it Significant?
Of course, we now have to determine how likely a value this large would be if the null were true. We do so by referring to the Chi-Square distribution.
df = #groups - 1
Like t and F, the Chi-Square distribution is a family of distributions whose shape changes as a function of df.
It is positively skewed, especially for small df.

Is it Significant?
We can see that the critical value for a df = 2 test at alpha = .05 is 5.99.
The snack data give a chi-square of 218 / 33, or about 6.61, which exceeds this value. We can reject the null and state that there seems to be a significant preference for salty snacks.
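The table lookup can be checked by hand for this particular case. With exactly 2 degrees of freedom, the upper-tail probability of the Chi-Square distribution has the closed form exp(-x / 2), so the critical value is -2 ln(alpha). The sketch below relies on that shortcut, which holds only for df = 2; other df values need a table or a statistics library. The function names are illustrative.

```python
import math


def chi2_upper_tail_df2(x):
    """P(chi-square >= x) when df = 2. Valid only for df = 2."""
    return math.exp(-x / 2)


def chi2_critical_df2(alpha):
    """Critical value for a df = 2 test at the given alpha. Valid only for df = 2."""
    return -2 * math.log(alpha)


critical = chi2_critical_df2(0.05)
print(round(critical, 2))  # 5.99

# Obtained value for the snack data: (12^2 + 7^2 + 5^2) / 33
chi2_obtained = 218 / 33
p_value = chi2_upper_tail_df2(chi2_obtained)
reject_null = chi2_obtained > critical

print(round(p_value, 3))  # 0.037
print(reject_null)        # True
```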

Two Classification Variables
A more common use occurs with 2 variables (often an IV and a DV).
For example, does a political advertisement that makes you angry result in more votes for a candidate than a more neutral one?
Have participants watch one type of ad and then record their voting behavior.

Data
The formula for calculating chi-square is the same as in the one-variable case.
The expected frequency for each cell is the product of its row total and its column total (i.e., the marginal totals) divided by the total sample size N.

Results
df = (R-1)(C-1), where R is the number of rows and C is the number of columns in the table.
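The data table and results from these slides were not preserved, so the sketch below uses made-up counts purely to illustrate the procedure: expected frequencies from the marginal totals, the same chi-square sum as before, and df = (R-1)(C-1). The counts and function names are assumptions, not the lecture's data.

```python
def expected_frequencies(table):
    """Expected count per cell: row total * column total / N."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def chi_square_independence(table):
    """Return (chi-square, df) for a two-way table of observed counts."""
    expected = expected_frequencies(table)
    chi2 = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


# HYPOTHETICAL counts (not from the lecture).
# Rows: angry ad, neutral ad. Columns: voted for candidate, did not.
table = [[35, 15],
         [25, 25]]

expected = expected_frequencies(table)
chi2, df = chi_square_independence(table)

print(expected)        # [[30.0, 20.0], [30.0, 20.0]]
print(round(chi2, 2))  # 4.17
print(df)              # 1
```

For df = 1 at alpha = .05 the critical value is 3.84, so these invented counts would lead to rejecting the null of independence.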

Effect Size
The most common measure is Cramér's phi (also reported as Cramér's V).
Cramér's phi squared gives an index of the amount of variance explained (similar to eta squared).
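The formula on this slide was lost in extraction. Cramér's phi is usually defined as the square root of chi-square / (N(k - 1)), where k is the smaller of the number of rows and columns. The sketch below assumes that definition; the input values are hypothetical, not results from the lecture.

```python
import math


def cramers_phi(chi2, n, n_rows, n_cols):
    """Cramér's phi: sqrt(chi2 / (N * (k - 1))), with k = min(rows, columns)."""
    k = min(n_rows, n_cols)
    if k < 2:
        raise ValueError("table needs at least 2 rows and 2 columns")
    return math.sqrt(chi2 / (n * (k - 1)))


# HYPOTHETICAL result: chi-square of 25/6 from a 2 x 2 table with N = 100.
phi = cramers_phi(chi2=25 / 6, n=100, n_rows=2, n_cols=2)
phi_squared = phi ** 2

print(round(phi, 3))          # 0.204
print(round(phi_squared, 3))  # 0.042
```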

Chi-Square and Proportions
Chi-Square tests can be used to analyze proportions if you convert the proportions to actual frequencies.
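As a minimal sketch of that conversion, each proportion is multiplied by its group size to recover the counts that go into the table. The proportions and group sizes below are invented for illustration.

```python
def proportion_to_counts(proportion, n):
    """Convert a 'yes' proportion in a group of size n to [yes, no] counts."""
    yes = round(proportion * n)
    return [yes, n - yes]


# HYPOTHETICAL summary data: 60% of 50 people in group A and 40% of 50 people
# in group B answered "yes".
table = [
    proportion_to_counts(0.60, 50),
    proportion_to_counts(0.40, 50),
]
print(table)  # [[30, 20], [20, 30]]
```

The resulting table of frequencies can then be analyzed with the usual chi-square formula. The group sizes are essential: the same proportions based on larger samples yield a larger chi-square.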

Chi-Square Assumptions
All data are independent.
No participant can be included more than once.
As a rule of thumb, the expected frequencies for all cells should be no smaller than 5.
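The rule of thumb applies to expected frequencies, not observed ones, so it can be checked before running the test. The sketch below does this for a two-way table; the counts and function names are hypothetical.

```python
def smallest_expected(table):
    """Smallest expected cell count (row total * column total / N)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return min(r * c / n for r in row_totals for c in col_totals)


def meets_rule_of_thumb(table, minimum=5):
    """True if every expected cell frequency is at least `minimum`."""
    return smallest_expected(table) >= minimum


# HYPOTHETICAL small sample: expected counts are 5.5 and 4.5 in each row.
small_table = [[8, 2],
               [3, 7]]
print(smallest_expected(small_table))    # 4.5
print(meets_rule_of_thumb(small_table))  # False
```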