Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.

Slides:



Advertisements
Similar presentations
Chapter 18: The Chi-Square Statistic
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Chi square.  Non-parametric test that’s useful when your sample violates the assumptions about normality required by other tests ◦ All other tests we’ve.
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Hypothesis Testing IV Chi Square.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 12 Chicago School of Professional Psychology.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 17: Chi-Square.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
CHI-SQUARE statistic and tests
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Chapter 14 Inferential Data Analysis
Richard M. Jacobs, OSA, Ph.D.
COURSE: JUST 3900 Tegrity Presentation Developed By: Ethan Cooper Final Exam Review.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
AM Recitation 2/10/11.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Statistics for the Behavioral Sciences (5 th ed.) Gravetter & Wallnau Chapter 17 The Chi-Square Statistic: Tests for Goodness of Fit and Independence University.
Overview of Statistical Hypothesis Testing: The z-Test
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
Chapter 15 Correlation and Regression
230 Jeopardy Unit 4 Chi-Square Repeated- Measures ANOVA Factorial Design Factorial ANOVA Correlation $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500.
Chapter 6 Probability PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick J. Gravetter and Larry.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Chapter 8 Introduction to Hypothesis Testing
Chapter 2 Frequency Distributions
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
1 Chi-Square Heibatollah Baghi, and Mastee Badii.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,
Chapter 16 The Chi-Square Statistic
Statistics for the Behavioral Sciences, Sixth Edition by Frederick J. Gravetter and Larry B. Wallnau Copyright © 2004 by Wadsworth Publishing, a division.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter 14 Repeated Measures and Two Factor Analysis of Variance
CHI SQUARE TESTS.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
© aSup-2007 CHI SQUARE   1 The CHI SQUARE Statistic Tests for Goodness of Fit and Independence.
Chapter 10 The t Test for Two Independent Samples
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Chapter 14 Correlation and Regression
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Chapter 11 The t-Test for Two Related Samples
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Chi Square Tests Chapter 17. Assumptions for Parametrics >Normal distributions >DV is at least scale >Random selection Sometimes other stuff: homogeneity,
Chapter 13 Understanding research results: statistical inference.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Chapter 12 Introduction to Analysis of Variance
Appendix I A Refresher on some Statistical Terms and Tests.
Chapter 14 Repeated Measures and Two Factor Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh.
Chapter 9 Introduction to the t Statistic
Cross Tabulation with Chi Square
Part Four ANALYSIS AND PRESENTATION OF DATA
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Chapter 14 Repeated Measures
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Analyzing the Association Between Categorical Variables
Chapter 18: The Chi-Square Statistic
Presentation transcript:

Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick J. Gravetter and Larry B. Wallnau

Chapter 15 Learning Outcomes Explain when chi-square test is appropriate 2 Test hypothesis about shape of distribution using chi-square goodness of fit 3 Test hypothesis about relationship of variables using chi-square test of independence 4 Evaluate effect size using phi coefficient or Cramer’s V

Tools You Will Need Proportions (math review, Appendix A) Frequency distributions (Chapter 2)

15.1 Parametric and Nonparametric Statistical Tests Hypothesis tests used thus far tested hypotheses about population parameters Parametric tests share several assumptions Normal distribution in the population Homogeneity of variance in the population Numerical score for each individual Nonparametric tests are needed if research situation does not meet all these assumptions

Chi-Square and Other Nonparametric Tests Do not state the hypotheses in terms of a specific population parameter Make few assumptions about the population distribution (“distribution-free” tests) Participants usually classified into categories Nominal or ordinal scales are used Nonparametric test data may be frequencies

Circumstances Leading to Use of Nonparametric Tests Sometimes it is not easy or possible to obtain interval or ratio scale measurements Scores that violate parametric test assumptions High variance in the original scores Undetermined or infinite scores cannot be measured—but can be categorized

15.2 Chi-Square Test for “Goodness of Fit” Uses sample data to test hypotheses about the shape or proportions of a population distribution Tests the fit of the proportions in the obtained sample with the hypothesized proportions of the population

Figure 15.1 Grade Distribution for a Sample of n = 40 Students FIGURE 15.1 A distribution of grades for a sample of n = 40 students. The same frequency distribution is shown as a bar graph, as a table, and with the frequencies written in a series of boxes.

Goodness of Fit Null Hypothesis Specifies the proportion (or percentage) of the population in each category Rationale for null hypotheses: No preference (equal proportions) among categories, OR No difference in specified population from the proportions in another known population

Goodness of Fit Alternate Hypothesis Could be as terse as “Not H0” Often equivalent to “…population proportions are not equal to the values specified in the null hypothesis…”

Goodness of Fit Test Data Individuals are classified (counted) in each category, e.g., grades; exercise frequency; etc. Observed Frequency is tabulated for each measurement category (classification) Each individual is counted in one and only one category (classification)

Expected Frequencies in the Goodness of Fit Test Goodness of Fit test compares the Observed Frequencies from the data with the Expected Frequencies predicted by null hypothesis Construct Expected Frequencies that are in perfect agreement with the null hypothesis Expected Frequency is the frequency value that is predicted from H0 and the sample size; it represents an idealized sample distribution

Chi-Square Statistic Notation Chi-Square Statistic χ2 is the lower-case Greek letter Chi fo is the Observed Frequency fe is the Expected Frequency Chi-Square Statistic

Chi-Square Distribution Null hypothesis should Not be rejected when the discrepancy between the Observed and Expected values is small Rejected when the discrepancy between the Observed and Expected values is large Chi-Square distribution includes values for all possible random samples when H0 is true All chi-square values ≥ 0. When H0 is true, sample χ2 values should be small

Chi-Square Degrees of Freedom Chi-square distribution is positively skewed Chi-square is a family of distributions Distributions determined by degrees of freedom Slightly different shape for each value of df Degrees of freedom for Goodness of Fit Test df = C – 1 C is the number of categories

Figure 15.2 The Chi-Square Distribution Critical Region FIGURE 15.2 Chi-square distributions are positively skewed. The critical region is placed in the extreme tail, which reflects large chi-square values.

Figure 15.3 Chi-square Distributions with Different df FIGURE 15.3 The shape of the chi-square distribution for different values of df. As the number of categories increases, the peak (mode) of the distribution has a larger chi-square value.

Locating the Chi-Square Distribution Critical Region Determine significance level (alpha) Locate critical value of chi-square in a table of critical values according to Value for degrees of freedom (df) Significance level chosen

Figure 15.4 Critical Region for Example 15.1 FIGURE 15.4 For Example 15.1 , the critical region begins at a chi-square value of 7.81.

In the Literature Report should describe whether there were significant differences between category preferences Report should include χ2 df, sample size (n) and test statistic value Significance level E.g., χ2 (3, n = 50) = 8.08, p < .05

“Goodness of Fit” Test and the One Sample t Test Nonparametric versus parametric test Both tests use data from one sample to test a hypothesis about a single population Level of measurement determines test: Numerical scores (interval / ratio scale) make it appropriate to compute a mean and use a t test Classification in non-numerical categories (ordinal or nominal scale) make it appropriate to compute proportions or percentages to do a chi-square test

Learning Check The expected frequencies in a chi-square test _____________________________________ A are always whole numbers B can contain fractions or decimal values C can contain both positive and negative values D can contain fractions and negative numbers

Learning Check - Answer The expected frequencies in a chi-square test _____________________________________ A are always whole numbers B can contain fractions or decimal values C can contain both positive and negative values D can contain fractions and negative numbers

Learning Check Decide if each of the following statements is True or False T/F In a Chi-Square Test, the Observed Frequencies are always whole numbers A large value for Chi-square will tend to retain the null hypothesis

Learning Check - Answers True Observed frequencies are just frequency counts, so there can be no fractional values False Large values of chi-square indicate observed frequencies differ a lot from null hypothesis predictions

15.3 Chi-Square Test for Independence Chi-Square Statistic can test for evidence of a relationship between two variables Each individual jointly classified on each variable Counts are presented in the cells of a matrix Design may be experimental or non-experimental Frequency data from a sample is used to test the evidence of a relationship between the two variables in the population using a two-dimensional frequency distribution matrix

Null Hypothesis for Test of Independence Null hypothesis: the two variables are independent (no relationship exists) Two versions Single population: No relationship between two variables in this population. Two separate populations: No difference between distribution of variable in the two populations Variables are independent if there is no consistent predictable relationship Note 1: under the second version of the null hypothesis, the null hypothesis does NOT say that the two distributions are identical; instead it says they have the same proportions. Note 2: stating that there is no relationship between variables (version 1) is equivalent to stating that the distributions have equal proportions.

Observed and Expected Frequencies Frequencies in the sample are the Observed frequencies for the test Expected frequencies are based on the null hypothesis prediction of the same proportions in each category (population) Expected frequency of any cell is jointly determined by its column proportion and its row proportion

Computing Expected Frequencies Frequencies computed by same method for each cell in the frequency distribution table fc is frequency total for the column fr is frequency total for the row

Computing Chi-Square Statistic for Test of Independence Same equation as the Chi-Square Test of Goodness of Fit Chi-Square Statistic Degrees of freedom (df) = (R-1)(C-1) R is the number of rows C is the number of columns

Chi-Square Compared to Other Statistical Procedures Hypotheses may be stated in terms of relationships between variables (version 1) or differences between groups (version 2) Chi-square test for independence and Pearson correlation both evaluate the relationships between two variables Depending on the level of measurement, Chi-square, t test or ANOVA could be used to evaluate differences between various groups

Chi-Square Compared to Other Statistical Procedures (cont.) Choice of statistical procedure determined primarily by the level of measurement Could choose to test the significance of the relationship Chi-square t test ANOVA Could choose to evaluate the strength of the relationship with r2

15.4 Measuring Effect Size for Chi-Square A significant Chi-square hypothesis test shows that the difference did not occur by chance Does not indicate the size of the effect For a 2x2 matrix, the phi coefficient (Φ) measures the strength of the relationship So Φ2 would provide proportion of variance accounted for just like r2

Effect size in a larger matrix For a larger matrix, a modification of the phi-coefficient is used: Cramer’s V df* is the smaller of (R-1) or (C-1)

Interpreting Cramer’s V Small Effect Medium Effect Large Effect For df* = 1 0.10 0.30 0.50 For df* = 2 0.07 0.21 0.35 For df* = 3 0.06 0.17 0.29

15.5 Chi-Square Test Assumptions and Restrictions Independence of observations Each observed frequency is generated by a different individual Size of expected frequencies Chi-square test should not be performed when the expected frequency of any cell is less than 5

Learning Check A basic assumption for a chi-square hypothesis test is ______________________ A the population distribution(s) must be normal B the scores must come from an interval or ratio scale C the observations must be independent D None of the other choices are assumptions for chi-square

Learning Check - Answer A basic assumption for a chi-square hypothesis test is ______________________ A the population distribution(s) must be normal B the scores must come from an interval or ratio scale C the observations must be independent D None of the other choices are assumptions for chi-square

Learning Check Decide if each of the following statements is True or False T/F The value of df for a chi-square test does not depend on the sample size (n) A positive value for the chi-square statistic indicates a positive correlation between the two variables

Learning Check - Answers True The value of df for a chi-square test depends only on the number of rows and columns in the observation matrix False Chi-square cannot be a negative number, so it cannot accurately show the type of correlation between the two variables

Figure 15.5: Example 15.2 SPSS Output—Chi-square Test for Independence FIGURE 15.5. The SPSS output for the chi-square test for independence in Example 15.2

Any Questions? Concepts? Equations?