Chapter 13: Categorical Data Analysis

Slides:



Advertisements
Similar presentations
© 2011 Pearson Education, Inc
Advertisements

McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chi-Square Tests Chapter 12.
Categorical Data Analysis
Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
Inference about the Difference Between the
Discrete (Categorical) Data Analysis
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Chapter 16 Chi Squared Tests.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.
Chapter 11: Applications of Chi-Square. Chapter Goals Investigate two tests: multinomial experiment, and the contingency table. Compare experimental results.
Chapter 16 – Categorical Data Analysis Math 22 Introductory Statistics.
Chapter 13: Categorical Data Analysis Statistics.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Introduction Many experiments result in measurements that are qualitative or categorical rather than quantitative. Humans classified by ethnic origin Hair.
AP STATS EXAM REVIEW Chapter 8 Chapter 13 and 14 Chapter 11 and 12 Chapter 9 and Chapter 10 Chapter 7.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Copyright © 2010 Pearson Education, Inc. Slide
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Chapter Outline Goodness of Fit test Test of Independence.
Copyright © Cengage Learning. All rights reserved. Chi-Square and F Distributions 10.
Dan Piett STAT West Virginia University Lecture 12.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Statistics 300: Elementary Statistics Section 11-2.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
©2006 Thomson/South-Western 1 Chapter 12 – Analysis of Categorical Data Slides prepared by Jeff Heyl Lincoln University ©2006 Thomson/South-Western Concise.
Comparing Counts Chi Square Tests Independence.
Keller: Stats for Mgmt & Econ, 7th Ed Chi-Squared Tests
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
Chi-Square hypothesis testing
Chapter 9: Non-parametric Tests
Chi-square test or c2 test
Chapter 12 Chi-Square Tests.
CHAPTER 11 CHI-SQUARE TESTS
Chapter 4. Inference about Process Quality
Chapter 12 Tests with Qualitative Data
Chapter 25 Comparing Counts.
Goodness of Fit Tests The goal of χ2 goodness of fit tests is to test is the data comes from a certain distribution. There are various situations to which.
Chapter 11 Goodness-of-Fit and Contingency Tables
SA3202 Statistical Methods for Social Sciences
Inference on Categorical Data
The Analysis of Categorical Data and Chi-Square Procedures
Inferential Statistics and Probability a Holistic Approach
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Econ 3790: Business and Economics Statistics
Chapter 11: Inference for Distributions of Categorical Data
Overview and Chi-Square
Chapter 13 – Applications of the Chi-Square Statistic
Inference on Categorical Data
The Analysis of Categorical Data and Goodness of Fit Tests
Paired Samples and Blocks
Analyzing the Association Between Categorical Variables
CHAPTER 11 CHI-SQUARE TESTS
The Analysis of Categorical Data and Goodness of Fit Tests
Chapter 26 Comparing Counts.
The Analysis of Categorical Data and Goodness of Fit Tests
Section 11-1 Review and Preview
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Inference for Two Way Tables
The Analysis of Categorical Data and Goodness of Fit Tests
Chapter 26 Comparing Counts.
Inference for Two-way Tables
Chapter Outline Goodness of Fit test Test of Independence.
St. Edward’s University
Presentation transcript:

Chapter 13: Categorical Data Analysis Statistics Chapter 13: Categorical Data Analysis

McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis Where We’ve Been Presented methods for making inferences about the population proportion associated with a two-level qualitative variable (i.e., a binomial variable) Presented methods for making inferences about the difference between two binomial proportions McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis Where We’re Going Discuss qualitative (categorical) data with more than two outcomes Present a chi-square hypothesis test for comparing the category proportions associated with a single qualitative variable – called a one-way analysis Present a chi-square hypothesis test relating two qualitative variables – called a two-way analysis McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.1: Categorical Data and the Multinomial Experiment Properties of the Multinomial Experiment The experiment consists of n identical trials. There are k possible outcomes (called classes, categories or cells) to each trial. The probabilities of the k outcomes, denoted by p1, p2, …, pk, where p1+ p2+ … + pk = 1, remain the same from trial to trial. The trials are independent. The random variables of interest are the cell counts n1, n2, …, nk of the number of observations that fall into each of the k categories. McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table Suppose three candidates are running for office, and 150 voters are asked their preferences. Candidate 1 is the choice of 61 voters. Candidate 2 is the choice of 53 voters. Candidate 3 is the choice of 36 voters. Do these data suggest the population may prefer one candidate over the others? McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table Candidate 1 is the choice of 61 voters. Candidate 2 is the choice of 53 voters. Candidate 3 is the choice of 36 voters. n =150 McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table Reject the null hypothesis McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table Test of a Hypothesis about Multinomial Probabilities: One-Way Table H0: p1 = p1,0, p2 = p2,0, … , pk = pk,0 where p1,0, p2,0, …, pk,0 represent the hypothesized values of the multinomial probabilities Ha: At least one of the multinomial probabilities does not equal its hypothesized value where Ei = np1,0, is the expected cell count given the null hypothesis. McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table Conditions Required for a Valid 2 Test: One-Way Table A multinomial experiment has been conducted. The sample size n will be large enough so that, for every cell, the expected cell count E(ni) will be equal to 5 or more. McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table Example 13.2: Distribution of Opinions About Marijuana Possession Before Television Series has Aired Legalization Decriminalization Existing Law No Opinion 7% 18% 65% 10% Table 13.2: Distribution of Opinions About Marijuana Possession After Television Series has Aired Legalization Decriminalization Existing Law No Opinion 39 99 336 26 McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table Expected Distribution of 500 Opinions About Marijuana Possession After Television Series has Aired Legalization Decriminalization Existing Law No Opinion 500(.07)=35 500(.18)=90 500(.65)=325 500(.10)=50 McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table Expected Distribution of 500 Opinions About Marijuana Possession After Television Series has Aired Legalization Decriminalization Existing Law No Opinion 500(.07)=35 500(.18)=90 500(.65)=325 500(.10)=50 McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table Expected Distribution of 500 Opinions About Marijuana Possession After Television Series has Aired Legalization Decriminalization Existing Law No Opinion 500(.07)=35 500(.18)=90 500(.65)=325 500(.10)=50 Reject the null hypothesis McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table Inferences can be made on any single proportion as well: 95% confidence interval on the proportion of citizens in the viewing area with no opinion is McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.3: Testing Categorical Probabilities: Two-Way Table Chi-square analysis can also be used to investigate studies based on qualitative factors. Does having one characteristic make it more/less likely to exhibit another characteristic? McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.3: Testing Categorical Probabilities: Two-Way Table The columns are divided according to the subcategories for one qualitative variable and the rows for the other qualitative variable. Column 1 2  c Row Totals n11 n12 n1c R1 Row n21 n22 n2c R2  r nr1 nr2 nrc Rr Column Totals C1 n McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.3: Testing Categorical Probabilities: Two-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.3: Testing Categorical Probabilities: Two-Way Table The results of a survey regarding marital status and religious affiliation are reported below (Example 13.3 in the text). Religious Affiliation A B C D None Totals Divorced 39 19 12 28 18 116 Married, never divorced 172 61 44 70 37 384 211 80 56 98 55 500 Marital Status H0: Marital status and religious affiliation are independent Ha: Marital status and religious affiliation are dependent McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.3: Testing Categorical Probabilities: Two-Way Table The expected frequencies (see Figure 13.4) are included below: Religious Affiliation A B C D None Totals Divorced 39 (48.95) 19 (18.56) 12 (12.99) 28 (27.74) 18 (12.76) 116 Married, never divorced 172 (162.05) 61 (61.44) 44 (43.01) 70 (75.26) 37 (42.24) 384 211 80 56 98 55 500 Marital Status The chi-square value computed with SAS is 7.1355, with p-value = .1289. Even at the  = .10 level, we cannot reject the null hypothesis. McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.3: Testing Categorical Probabilities: Two-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.4: A Word of Caution About Chi-Square Tests Relative ease of use Widespread applications Misuse and misinterpretation McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.4: A Word of Caution About Chi-Square Tests Sample is from the correct population Expected counts are ≥ 5 Avoid Type II errors by not accepting non-rejected null hypotheses Avoid mistaking dependence with causation To produce (possibly) valid 2 results Be sure McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis