Qualitative data – tests of association

Presentation transcript:

Qualitative data – tests of association. The Chi-Square Distribution and Test for Independence: hypothesis testing between two or more categorical variables. Sporiš Goran, PhD. http://kif.hr/predmet/mki http://www.science4performance.com/

Chi-Square Distribution: The chi-square distribution results when independent variables with standard normal distributions are squared and summed.

Chi-square degrees of freedom: $df = (r-1)(c-1)$, where $r$ = number of rows and $c$ = number of columns. Thus, in any 2 x 2 contingency table, the degrees of freedom = 1. As the degrees of freedom increase, the distribution shifts to the right and the critical values of chi-square become larger.
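The two statements above (a chi-square variable is a sum of squared standard normal variables, and the distribution shifts right as the degrees of freedom grow) can be checked with a small simulation. This is a minimal sketch, not part of the original slides: the function names, the number of draws, and the seed are arbitrary choices for illustration, and here df is simply the number of squared variables being summed. It relies on the standard fact that the mean of a chi-square distribution equals its degrees of freedom.

```python
import random


def simulate_chi_square(df, n_draws=20000, seed=0):
    """Return n_draws values, each the sum of df squared standard normal variates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    return [
        sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(df))
        for _ in range(n_draws)
    ]


def sample_mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


draws_df1 = simulate_chi_square(df=1)
draws_df5 = simulate_chi_square(df=5)
mean_df1 = sample_mean(draws_df1)
mean_df5 = sample_mean(draws_df5)

# The simulated means should land close to 1 and 5, so the df = 5
# distribution sits further to the right than the df = 1 distribution.
print(f"df=1: simulated mean = {mean_df1:.2f}")
print(f"df=5: simulated mean = {mean_df5:.2f}")
```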

Chi-Square Test of Independence

Using the Chi-Square Test: Often used with contingency tables (i.e., crosstabulations), e.g., gender x student. The chi-square test of independence tests whether the columns are contingent on the rows in the table. In this case, the null hypothesis is that there is no relationship between row and column frequencies. H0: The 2 variables are independent.

Requirements for Chi-Square test: Must be a random sample from the population. Data must be in raw frequencies (counts, not percentages). Observations must be independent of one another. Categories for each variable must be mutually exclusive and exhaustive.

Example Crosstab: Gender x Student. Each cell shows the observed count, with the expected count in parentheses.
            Student       Not Student    Total
Males       46 (40.97)    71 (76.02)     117
Females     37 (42.03)    83 (77.97)     120
Total       83            154            237
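The expected counts in the crosstab can be reproduced from the margins. The sketch below is an illustration rather than the presenter's own calculation: it assumes the standard rule expected = (row total x column total) / grand total, applies no continuity correction, and uses hypothetical function names.

```python
# Observed counts from the Gender x Student crosstab.
# Rows: Males, Females. Columns: Student, Not Student.
observed = [[46, 71], [37, 83]]


def expected_counts(table):
    """Expected count per cell under independence: row total * column total / n."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def chi_square_independence(table):
    """Return (chi-square statistic, degrees of freedom) for a two-way table."""
    expected = expected_counts(table)
    chi2 = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


expected = expected_counts(observed)
chi2, df = chi_square_independence(observed)
print([[round(e, 3) for e in row] for row in expected])
print(f"chi-square = {chi2:.3f}, df = {df}")
```

For this table the statistic comes out at roughly 1.87 with df = 1, which is below the df = 1, α = .05 critical value of 3.84, so these data would not lead to rejecting the null hypothesis of independence.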

Special Cases. Fisher’s Exact Test: used when you have a 2 x 2 table with expected frequencies less than 5. Strength of Association: some use Cramer’s V (for any two nominal variables) or Phi (for 2 x 2 tables) to give a value of association between the variables.
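As a rough illustration of the strength-of-association measures, the sketch below computes Cramer's V for the Gender x Student counts (46, 71, 37, 83). It assumes the usual definition V = sqrt(chi-square / (n x min(r - 1, c - 1))), which for a 2 x 2 table gives the same magnitude as Phi. The function names are hypothetical and the slides do not report this value.

```python
import math


def chi_square_statistic(table):
    """Pearson chi-square statistic for a two-way table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return sum(
        (table[i][j] - row_totals[i] * col_totals[j] / n) ** 2
        / (row_totals[i] * col_totals[j] / n)
        for i in range(len(table))
        for j in range(len(table[0]))
    )


def cramers_v(table):
    """Cramer's V: sqrt(chi2 / (n * min(rows - 1, cols - 1)))."""
    n = sum(sum(row) for row in table)
    k = min(len(table) - 1, len(table[0]) - 1)
    return math.sqrt(chi_square_statistic(table) / (n * k))


# Rows: Males, Females. Columns: Student, Not Student.
gender_by_student = [[46, 71], [37, 83]]
v = cramers_v(gender_by_student)
print(f"Cramer's V = {v:.3f}")
```

A value this close to 0 (about 0.09) would indicate a very weak association between gender and student status in this sample.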

Two chi square tests. Goodness of fit: one variable; determines how well the sample proportions match a pre-specified distribution. Independence: two variables; determines whether there is a relationship between two variables.

Steps in hypothesis testing: (1) State the hypotheses (null and research). (2) Select an alpha level and determine the critical value. (3) Compute the test statistic. (4) Make a decision.

Test for goodness of fit: forms of the null hypothesis. No preference: there is no difference in proportions among the categories; participants do not prefer one category over another. Example: Pepsi 50%, Coke 50%. No difference from a comparison population: there is no difference between the sample distribution and a known (population) distribution. Example: ND: 20% Bl, 75% Br, 5% R; US: 20% Bl, 75% Br, 5% R.

Test for goodness of fit. Null hypothesis: specifies a distribution of proportions. Research hypothesis: specifies that the distribution will be different from that indicated in the null hypothesis.

Calculating the test statistic. Observed frequencies ($f_o$): the number of individuals from the sample who are classified in a particular category. Expected frequencies ($f_e$): the number of individuals from the sample who are expected to be classified in a particular category.

Calculating the test statistic. Coin flip: What percentage of people will predict heads? Tails?
               Heads   Tails
Percentages    50%     50%
Proportions    .5      .5

Calculating the test statistic. Expected frequency: $f_e = pn$, where $p$ is the proportion specified by the null hypothesis and $n = 50$ is the sample size, so $f_e = .5 \times 50 = 25$.
Expected       Heads   Tails
Proportions    .5      .5
Frequencies    25      25

Calculating the test statistic. Question: The last five flips were tails. What do you predict for the next flip?
            Heads   Tails
Observed    35      15
Expected    25      25

Calculating the test statistic.
            Heads   Tails
Observed    35      15
Expected    25      25
$\chi^2 = \sum \frac{(f_o - f_e)^2}{f_e}$
Steps: (1) find the difference between $f_o$ and $f_e$ for each category; (2) square the difference; (3) divide the squared difference by $f_e$; (4) sum the values from all categories.

                        Heads   Tails
Observed ($f_o$)        35      15
Expected ($f_e$)        25      25
$f_o - f_e$             10      -10
$(f_o - f_e)^2$         100     100
$(f_o - f_e)^2 / f_e$   4       4
$\chi^2 = \sum \frac{(f_o - f_e)^2}{f_e} = 4 + 4 = 8$
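The four calculation steps can be followed in a few lines of code using the coin-flip data (35 heads, 15 tails, null proportions of .5 each, n = 50). This is a minimal sketch; the variable and function names are chosen for illustration only.

```python
# Observed predictions and the proportions specified by the null hypothesis.
observed = {"Heads": 35, "Tails": 15}
null_proportions = {"Heads": 0.5, "Tails": 0.5}


def goodness_of_fit(observed, proportions):
    """Return (chi-square statistic, degrees of freedom, expected frequencies)."""
    n = sum(observed.values())
    # Expected frequency for each category: f_e = p * n
    expected = {cat: proportions[cat] * n for cat in observed}
    chi2 = 0.0
    for cat in observed:
        diff = observed[cat] - expected[cat]   # step 1: f_o - f_e
        squared = diff ** 2                    # step 2: square the difference
        chi2 += squared / expected[cat]        # steps 3 and 4: divide by f_e, then sum
    df = len(observed) - 1                     # df = C - 1
    return chi2, df, expected


chi2, df, expected = goodness_of_fit(observed, null_proportions)
print(expected)
print(f"chi-square = {chi2}, df = {df}")
```

Each category contributes 100 / 25 = 4 to the sum, which gives the total of 8 shown in the table.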

[Figure: chi square distribution. Horizontal axis: $\chi^2$, running from low chi square to high chi square, with the critical range marked.]

Critical values for chi square distribution Table B.8

Test for goodness of fit. The greater the number of categories, the greater the likelihood of a large observed chi square value. Degrees of freedom (df): the number of values that are free to vary. $df = C - 1$, where $C$ = the number of categories.

Chi square distribution

Critical values for chi square distribution

Chi square distribution: the upper 5% of the distribution lies beyond $\chi^2 = 3.84$. Critical value (df = 1, α = .05) = 3.84.

Goodness of fit: make a decision. Critical value = 3.84 with df = 1 and α = .05. Observed chi square = 8.0. Since 8.0 > 3.84, the observed chi square is greater than the critical value, so we reject the null hypothesis and conclude that the category frequencies are different. People were more likely to predict heads than tails.
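The decision rule can also be checked numerically. The sketch below covers only the df = 1 case and rests on the fact that a chi-square variable with 1 degree of freedom is the square of a standard normal variable, so its upper-tail probability can be written with the complementary error function. The function names are hypothetical, and other degrees of freedom would need a table or a statistics library.

```python
import math


def chi_square_tail_df1(x):
    """Upper-tail probability P(chi-square > x) for df = 1 only."""
    # chi-square(1) = Z^2, so P(Z^2 > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    return math.erfc(math.sqrt(x / 2.0))


def decide(observed_chi2, critical_value):
    """Reject the null hypothesis when the statistic exceeds the critical value."""
    return "reject H0" if observed_chi2 > critical_value else "fail to reject H0"


critical_value = 3.84   # df = 1, alpha = .05
observed_chi2 = 8.0     # coin-flip prediction example

tail_at_critical = chi_square_tail_df1(critical_value)
p_value = chi_square_tail_df1(observed_chi2)
decision = decide(observed_chi2, critical_value)

print(f"P(chi-square > 3.84) = {tail_at_critical:.3f}")
print(f"P(chi-square > 8.0)  = {p_value:.4f}")
print(decision)
```

The tail area beyond 3.84 comes out at about .05, matching the tabled critical value, and the tail area beyond 8.0 is well under .01, which agrees with the decision to reject the null hypothesis.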