Psy B07 Chapter 6: CATEGORICAL DATA & χ²

Presentation transcript:

Psy B07 Chapter 6 Slide 1 CATEGORICAL DATA & χ²

Psy B07 Chapter 6 Slide 2 A Quick Look Back
• Reminder about hypothesis testing:
1) Assume what you believe (H₁) is wrong.
• Construct H₀ and accept it as a default.
2) Show that some event is of sufficiently low probability given H₀ ***.
3) Reject H₀.
*** In order to do this, we need to know the distribution associated with H₀, because we use that distribution as the basis for our probability calculation.

Psy B07 Chapter 6 Slide 3 z-score
• Use when we have acquired some data set and then want to ask questions concerning the probability of certain specific data values (e.g., do certain values seem extreme?).
• In this case, the distribution associated with H₀ is described by the mean (X̄) and variance (S²) because the data points reflect a continuous variable that is normally distributed.

Psy B07 Chapter 6 Slide 4 Chi-square (χ²) test
• The chi-square test is a general purpose test for use with discrete variables.
• It has a number of uses, including the detection of bizarre outcomes given some a priori probability, for both binomial and multinomial situations.

Psy B07 Chapter 6 Slide 5 Chi-square (χ²) test
• In addition, it allows us to go beyond questions of bizarreness and move into the question of whether pairs of variables are related. For example:
• It does so by mapping the discrete variables onto a continuous distribution assuming H₀, the chi-square distribution.

Psy B07 Chapter 6 Slide 6 The chi-square distribution
• Let’s reconsider a simple binomial problem. Say we have a batter who hits .300 [i.e., P(Hit) = 0.30], and we want to know whether it is abnormal for him to go 6 for 10 (i.e., 6 hits in 10 at bats).
• We could do this using the binomial stuff that I did not cover in Chapter 5 (and for which you are not responsible).
• But we can also do it with a chi-square test.
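For comparison with the chi-square approach, the exact binomial route mentioned on this slide can be sketched as follows. This is only an illustration: the function name `binomial_upper_tail` is made up here, and the slides do not show this calculation.

```python
from math import comb

def binomial_upper_tail(k: int, n: int, p: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

# A .300 hitter going at least 6 for 10 (values taken from the slide).
p_six_or_more = binomial_upper_tail(6, 10, 0.30)
print(f"P(6 or more hits in 10 at bats) = {p_six_or_more:.4f}")
```

The result is a one-sided probability (6 or more hits), so it need not match the chi-square probability exactly.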

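Psy B07 Chapter 6 Slide 7 The way of the chi²
• We can put our values into a contingency table as follows (expected values are Np: 10 × .30 and 10 × .70):

            Hits   Outs   Total
Observed      6      4     10
Expected      3      7     10

• Then consider the distribution of the following formula given H₀:
$\chi^2 = \sum \frac{(O - E)^2}{E}$, where O is an observed frequency and E is the corresponding expected frequency.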

Psy B07 Chapter 6 Slide 8 The way of the chi²

Psy B07 Chapter 6 Slide 9 The way of the chi²

Psy B07 Chapter 6 Slide 10 The way of the chi²
In-Class Example:
• Note that while the observed values are discrete, the derived score is continuous.
• If we calculated enough of these derived scores, we could plot a frequency distribution which would be a chi-square distribution with 1 degree of freedom, or χ²(1).
• Given this distribution and appropriate tables, we can then find the probability associated with any particular χ² value.
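The idea of "calculating enough of these derived scores" can be tried out with a small simulation. The sketch below assumes the baseball setup from Slide 6 (10 at bats, P(Hit) = 0.30); the function name, the seed, and the number of simulated samples are arbitrary choices made for illustration.

```python
import random

def chi_square_two_outcomes(hits: int, n: int, p: float) -> float:
    """Chi-square score for hits vs. outs against expected values Np and N(1 - p)."""
    expected_hits = n * p
    expected_outs = n * (1 - p)
    outs = n - hits
    return ((hits - expected_hits) ** 2 / expected_hits
            + (outs - expected_outs) ** 2 / expected_outs)

rng = random.Random(0)           # fixed seed so the run is repeatable
n_at_bats, p_hit, n_sims = 10, 0.30, 20000

simulated = []
for _ in range(n_sims):
    # Simulate one set of at bats under H0 and record its derived score.
    hits = sum(rng.random() < p_hit for _ in range(n_at_bats))
    simulated.append(chi_square_two_outcomes(hits, n_at_bats, p_hit))

# Score produced by the observed 6-for-10 outcome.
observed_score = chi_square_two_outcomes(6, n_at_bats, p_hit)

mean_score = sum(simulated) / n_sims
share_as_extreme = sum(v >= observed_score - 1e-9 for v in simulated) / n_sims
print(f"mean simulated score: {mean_score:.3f}")
print(f"share of scores at least as large as the 6-for-10 score: {share_as_extreme:.3f}")
```

With only 10 at bats there are just 11 possible outcomes, so the simulated distribution is lumpy and follows χ²(1) only roughly. Note also that going 0 for 10 gives the same score as going 6 for 10, so both outcomes count toward the share.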

Psy B07 Chapter 6 Slide 11 The way of the chi²
Continuing the Baseball Example:
So if the probability of obtaining a χ² of 4.29 or greater is less than α, then the observed outcome can be considered bizarre (i.e., the result of something other than a .300 hitter getting lucky).

Psy B07 Chapter 6 Slide 12 The way of the chi²
• Just like the t-test, the chi² distribution is based on degrees of freedom.
• With 1 df and α = .05, the critical value of χ² is 3.84.
• Thus, since our obtained χ² value of 4.29 is greater than 3.84, we can reject H₀ and assume that hitting 6 of 10 reflects more than just chance performance.
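The baseball calculation can be reproduced in a few lines. This is a sketch using the numbers from the slides (6 hits in 10 at bats, P(Hit) = 0.30, critical value 3.84). The p-value line relies on the fact that a χ² variable with 1 df is a squared standard normal score, which is an addition not shown on the slides.

```python
from math import erfc, sqrt

observed = {"hit": 6, "out": 4}
p_null = {"hit": 0.30, "out": 0.70}      # probabilities under H0

n = sum(observed.values())
expected = {k: n * p for k, p in p_null.items()}   # Np for each outcome

# Sum of (O - E)^2 / E over the two outcomes.
chi_square = sum((observed[k] - expected[k]) ** 2 / expected[k] for k in observed)

critical_value = 3.84                    # alpha = .05, 1 df (from the slide)
reject_null = chi_square > critical_value

# Upper-tail probability for 1 df only: P(chi2 >= x) = P(|Z| >= sqrt(x)).
p_value = erfc(sqrt(chi_square / 2))

print(f"chi-square obtained = {chi_square:.2f}, reject H0: {reject_null}")
print(f"approximate p-value = {p_value:.3f}")
```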

Psy B07 Chapter 6 Slide 13 The way of the chi²
Going a Step Further:
• Suppose we complicate the previous example by taking walks and hit by pitches into account. That is, suppose the average batter gets a hit with a probability of 0.28, gets walked with a probability of 0.08, gets hit by a pitch (HBP) with a probability of 0.02, and gets out the rest of the time.

Psy B07 Chapter 6 Slide 14 The way of the chi²
• Now we ask, can you reject H₀ (that this batter is typical of the average batter) given the following outcomes from 50 at bats?
1) Calculate expected values (Np).
2) Calculate χ² obtained.
3) Figure out the appropriate df (C − 1).
4) Find χ² critical and compare χ² obtained to it.
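The four steps can be sketched in code. The probabilities and the 50 at bats come from the slides, but the observed counts did not survive in this transcript, so the counts below are hypothetical and chosen only for illustration. The critical value 7.815 (α = .05, 3 df) is the standard table value and is not quoted on the slides.

```python
def goodness_of_fit(observed, probabilities):
    """Return (expected values, chi-square obtained, df) for a one-way table."""
    n = sum(observed)
    expected = [n * p for p in probabilities]                       # step 1: Np
    chi_square = sum((o - e) ** 2 / e
                     for o, e in zip(observed, expected))           # step 2
    df = len(observed) - 1                                          # step 3: C - 1
    return expected, chi_square, df

# Order of categories: hit, walk, HBP, out (out = 1 - 0.28 - 0.08 - 0.02).
probabilities = [0.28, 0.08, 0.02, 0.62]
observed = [18, 6, 2, 24]        # HYPOTHETICAL counts from 50 at bats

expected, chi_square, df = goodness_of_fit(observed, probabilities)

critical_value = 7.815           # alpha = .05, 3 df (standard table value)
reject_null = chi_square > critical_value                           # step 4

print("expected:", [round(e, 2) for e in expected])
print(f"chi-square obtained = {chi_square:.2f} on {df} df, reject H0: {reject_null}")
```

Two of the expected values here (walks and HBP) are quite small, so the result should be read with caution.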

Psy B07 Chapter 6 Slide 15 The way of the chi²

Psy B07 Chapter 6 Slide 16 Two types of chi² tests
• So far, all the tests have been to assess whether some observation or set of observations seems out-of-line with some expected distribution. This is also known as the goodness-of-fit chi-square test.
• However, the logic of the chi-square test can be extended to examine the issue of whether two variables are independent (i.e., not systematically related) or dependent (i.e., systematically related).

Psy B07 Chapter 6 Slide 17 χ² test for independence
• Consider the following data set again:
• Are the variables of gender and opinion concerning the legalization of marijuana independent?

Psy B07 Chapter 6 Slide 18 χ² test for independence

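Psy B07 Chapter 6 Slide 19 χ² test for independence
• If these two variables are independent, then by the multiplicative law, we expect that:
$E_{ij} = N \cdot \frac{R_i}{N} \cdot \frac{C_j}{N} = \frac{R_i C_j}{N}$, where $R_i$ is the total for row i, $C_j$ is the total for column j, and N is the grand total.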

Psy B07 Chapter 6 Slide 20 χ² test for independence
• If we do this for all four cells, we get:
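Computing the expected value for every cell (row total times column total, divided by the grand total) can be sketched as below. The 2 × 2 counts are hypothetical because the data table from the slide did not survive in this transcript. The function name `expected_counts` is likewise made up.

```python
def expected_counts(table):
    """Expected cell frequencies under independence: row total * column total / N."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]

# HYPOTHETICAL counts: rows = gender, columns = opinion (for / against).
observed = [[30, 20],
            [20, 30]]

expected = expected_counts(observed)
for row in expected:
    print(row)
```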

Psy B07 Chapter 6 Slide 21 χ² test for independence
• Are the observed values different enough from the expected values to reject the notion that the differences are due to chance variation?

Psy B07 Chapter 6 Slide 22 χ² test for independence
• The df associated with two-variable contingency tables can be calculated using the formula:
$df = (R - 1)(C - 1)$
• where C is the number of columns and R is the number of rows.

Psy B07 Chapter 6 Slide 23 χ² test for independence
• Thus, to finish our previous example, the χ² critical with alpha equal to .05 and 1 df equals 3.84. Since our χ² obtained (i.e., 3.6) is not bigger than that, we cannot reject H₀.
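Putting the pieces of the independence test together, a minimal sketch might look like this. The counts are hypothetical (they are not the gender-by-opinion data from the slides, which are missing from this transcript), so the obtained value differs from the 3.6 quoted above. Only the critical value 3.84 is taken from the slide.

```python
def chi_square_independence(table):
    """Return (chi-square obtained, df) for an R x C contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi_square = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n   # multiplicative law
            chi_square += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)            # (R - 1)(C - 1)
    return chi_square, df

observed = [[28, 22],
            [22, 28]]            # HYPOTHETICAL 2 x 2 counts

chi_square, df = chi_square_independence(observed)
critical_value = 3.84            # alpha = .05, 1 df (from the slide)
reject_null = chi_square > critical_value
print(f"chi-square obtained = {chi_square:.2f} on {df} df, reject H0: {reject_null}")
```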

Psy B07 Chapter 6 Slide 24 Assumptions of χ²
Independence of observations:
• Chi-square analyses are only valid when the actual observations within the cells are independent.
• This independence of observations is different from the issue of whether the variables are independent; the latter is what the chi-square is testing.

Psy B07 Chapter 6 Slide 25 Assumptions of χ²
Independence of observations:
• You know your observations are not independent when the grand total is larger than the number of subjects.
• Example: The activity level of 5 rats was tested over 4 days, producing these values:

Psy B07 Chapter 6 Slide 26 Assumptions of χ²
Normality:
• Use of the chi-square distribution for finding critical values assumes that the observed frequencies are approximately normally distributed around the expected values (i.e., Np).
• This assumption breaks down when the expected values are small (specifically, the distribution of observed frequencies around Np becomes more and more positively skewed as Np gets small).

Psy B07 Chapter 6 Slide 27 Assumptions of χ²
Normality:
• Thus, one should be cautious using the chi-square test when the expected values are small.
• How small? This is debatable, but if expected values are as low as 5, you should be worried.
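A quick way to act on this caution is to list the cells whose expected values are at or below the threshold. The sketch below reuses the probabilities and the 50 at bats from Slides 13 and 14. The threshold of 5 is the rule of thumb from this slide, and the function name is made up for illustration.

```python
def small_expected_cells(expected, threshold=5.0):
    """Return the (label, expected value) pairs at or below the threshold."""
    return [(label, e) for label, e in expected.items() if e <= threshold]

n_at_bats = 50
probabilities = {"hit": 0.28, "walk": 0.08, "HBP": 0.02, "out": 0.62}
expected = {label: n_at_bats * p for label, p in probabilities.items()}

flagged = small_expected_cells(expected)
for label, e in flagged:
    print(f"expected value for {label} is only {e:.1f}")
```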

Psy B07 Chapter 6 Slide 28 Assumptions of χ²
Inclusion of Non-Occurrences:
• The chi-square test assumes that all outcomes (occurrences and non-occurrences) are considered in the contingency table.
• As an example of a failure to include a non-occurrence, see page 160 of the text.

Psy B07 Chapter 6 Slide 29 A tale of tails
• We only reject H₀ when χ² obtained is larger than χ² critical.
• This suggests that the χ² test is always one-tailed and, in terms of the rejection region, it is.
• In a different sense, however, the test is actually multiple tailed.

Psy B07 Chapter 6 Slide 30 A tale of tails
• Reconsider the following “marking scheme” example:
• If we do not specify how we expect the results to fall out, then any outcome with a high enough χ² obtained can be used to reject H₀.
• However, if we specify our outcome, we are allowed to increase our alpha; in the example, we can increase alpha to 0.30 if we specified (in advance) the exact ordering that was observed.

Psy B07 Chapter 6 Slide 31 Measures of Association
• The chi-square test only tells us whether two variables are independent; it does not say anything about the magnitude of the dependency if one is found to exist.
• Stealing from the book, consider the following two cases, both of which produce a significant χ² obtained, but which imply different strengths of relation:

Psy B07 Chapter 6 Slide 32 Measures of Association

Psy B07 Chapter 6 Slide 33 Measures of Association
• There are a number of ways to quantify the strength of a relation (see sections in the text on the contingency coefficient, Phi, & Odds Ratios), but the two most relevant to psychologists are Cramér’s Phi and Cohen’s Kappa.

Psy B07 Chapter 6 Slide 34 Measures of Association
• Cramér’s Phi (φc) can be used with any contingency table and is calculated as:
$\phi_c = \sqrt{\frac{\chi^2}{N(k - 1)}}$, where N is the grand total and k is the smaller of the number of rows and the number of columns.
• Values of φc range from 0 to 1. The values for the tables on the previous page are 0.12 and 0.60 respectively, indicating a much stronger relation in the second example.
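As a rough sketch, Cramér’s Phi can be computed from a χ² value using the standard definition, φc = sqrt(χ² / (N(k − 1))), with k the smaller of the number of rows and columns. The input numbers below are hypothetical; they are not the book tables referred to on this slide.

```python
from math import sqrt

def cramers_phi(chi_square: float, n: int, n_rows: int, n_cols: int) -> float:
    """Cramér's Phi: sqrt(chi-square / (N * (k - 1))), k = min(rows, columns)."""
    k = min(n_rows, n_cols)
    return sqrt(chi_square / (n * (k - 1)))

# HYPOTHETICAL input: chi-square of 4.0 from a 2 x 2 table with N = 100.
phi_c = cramers_phi(chi_square=4.0, n=100, n_rows=2, n_cols=2)
print(f"Cramér's Phi = {phi_c:.2f}")
```

For the same χ², a larger N gives a smaller φc, which is why a significant χ² alone says little about the strength of the relation.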

Psy B07 Chapter 6 Slide 35 Measures of Association
• Often, in psychology, we will ask some “judge” to categorize things into specific categories.
• For example, imagine a beer brewing competition where we asked a judge to categorize beers as Yucky, OK, or Yummy.
• Obviously, we are eventually interested in knowing something about the beers after they are categorized.

Psy B07 Chapter 6 Slide 36 Measures of Association
• However, one issue that arises is the judges’ ability to tell the difference between the beers.
• One way around this is to get two judges and show that a given beer is reliably rated across the judges (i.e., that both judges tend to categorize things in a similar way).

Psy B07 Chapter 6 Slide 37 Measures of Association
• Such a finding would suggest that the judges are sensitive to some underlying quality of the beers as opposed to just guessing.

Psy B07 Chapter 6 Slide 38 Measures of Association
• Note that if you just looked at the proportion of decisions that Judge 2 and I agreed on, it looks like we are doing OK: P(Agree) = 21/30 = 0.70, or 70%.

Psy B07 Chapter 6 Slide 39 Measures of Association
• There is a problem here, however: both judges are biased to judge a beer as OK. Even if they were guessing, the agreement would seem high, because both would guess OK on a lot of trials and would therefore agree a lot.
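Cohen’s Kappa, named on Slide 33, corrects for exactly this kind of chance agreement. The sketch below uses the standard definition, kappa = (P(Agree) − P(Chance)) / (1 − P(Chance)). The judges’ table from the slides is missing from this transcript, so the 3 × 3 counts here are hypothetical, constructed only so that 21 of 30 beers fall on the diagonal as in the slide.

```python
def cohens_kappa(table):
    """Return (observed agreement, chance agreement, kappa) for a square table."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    # Observed agreement: proportion of cases on the diagonal.
    p_observed = sum(table[i][i] for i in range(len(table))) / n
    # Chance agreement: what the marginal totals alone would produce.
    p_chance = sum(r * c for r, c in zip(row_totals, col_totals)) / n**2
    kappa = (p_observed - p_chance) / (1 - p_chance)
    return p_observed, p_chance, kappa

# HYPOTHETICAL ratings: rows = Judge 1, columns = Judge 2,
# categories in the order Yucky, OK, Yummy.
ratings = [[4, 2, 0],
           [1, 13, 3],
           [0, 3, 4]]

p_observed, p_chance, kappa = cohens_kappa(ratings)
print(f"P(Agree) = {p_observed:.2f}, chance agreement = {p_chance:.2f}, kappa = {kappa:.2f}")
```

Because both hypothetical judges favour OK, a sizeable share of the 70% raw agreement is expected by chance alone, and kappa comes out noticeably lower than 0.70.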

Psy B07 Chapter 6 Slide 40 Measures of Association
• Such a finding would suggest that the judges are sensitive to some underlying quality of the beers as opposed to just guessing.

Psy B07 Chapter 6 Slide 41