Chapter 13: Categorical Data Analysis Statistics.

Slides:



Advertisements
Similar presentations
© 2011 Pearson Education, Inc
Advertisements

Categorical Data Analysis
Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
Inference about the Difference Between the
Discrete (Categorical) Data Analysis
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Chapter 16 Chi Squared Tests.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Chi-Square and F Distributions Chapter 11 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.
Chapter 11: Applications of Chi-Square. Chapter Goals Investigate two tests: multinomial experiment, and the contingency table. Compare experimental results.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Chi-square test or c2 test
Chapter 16 – Categorical Data Analysis Math 22 Introductory Statistics.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
Two Way Tables and the Chi-Square Test ● Here we study relationships between two categorical variables. – The data can be displayed in a two way table.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Chapter 26 Chi-Square Testing
1 Pertemuan 11 Uji kebaikan Suai dan Uji Independen Mata kuliah : A Statistik Ekonomi Tahun: 2010.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
1 1 Slide Chapter 11 Comparisons Involving Proportions n Inference about the Difference Between the Proportions of Two Populations Proportions of Two Populations.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Introduction Many experiments result in measurements that are qualitative or categorical rather than quantitative. Humans classified by ethnic origin Hair.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
© 2000 Prentice-Hall, Inc. Statistics The Chi-Square Test & The Analysis of Contingency Tables Chapter 13.
Statistical Significance for a two-way table Inference for a two-way table We often gather data and arrange them in a two-way table to see if two categorical.
AP Statistics Section 14.. The main objective of Chapter 14 is to test claims about qualitative data consisting of frequency counts for different categories.
Copyright © 2010 Pearson Education, Inc. Slide
Introduction to Probability and Statistics Thirteenth Edition Chapter 13 Analysis of Categorical Data.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Chapter Outline Goodness of Fit test Test of Independence.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Copyright © Cengage Learning. All rights reserved. Chi-Square and F Distributions 10.
Dan Piett STAT West Virginia University Lecture 12.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Statistics 300: Elementary Statistics Section 11-2.
Chapter 12 The Analysis of Categorical Data and Goodness of Fit Tests.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Statistics 300: Elementary Statistics Section 11-3.
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
Goodness-of-Fit and Contingency Tables Chapter 11.
©2006 Thomson/South-Western 1 Chapter 12 – Analysis of Categorical Data Slides prepared by Jeff Heyl Lincoln University ©2006 Thomson/South-Western Concise.
Comparing Counts Chi Square Tests Independence.
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
Chapter 12 Tests with Qualitative Data
Chapter 25 Comparing Counts.
Chapter 13: Categorical Data Analysis
Overview and Chi-Square
Chapter 13 – Applications of the Chi-Square Statistic
Inference on Categorical Data
Chapter 26 Comparing Counts.
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Chapter 26 Comparing Counts.
Presentation transcript:

Chapter 13: Categorical Data Analysis Statistics

McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 2 Where We’ve Been Presented methods for making inferences about the population proportion associated with a two-level qualitative variable (i.e., a binomial variable) Presented methods for making inferences about the difference between two binomial proportions

Where We’re Going McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 3 Discuss qualitative (categorical) data with more than two outcomes Present a chi-square hypothesis test for comparing the category proportions associated with a single qualitative variable – called a one-way analysis Present a chi-square hypothesis test relating two qualitative variables – called a two-way analysis

13.1: Categorical Data and the Multinomial Experiment Properties of the Multinomial Experiment 1. The experiment consists of n identical trials. 2. There are k possible outcomes (called classes, categories or cells) to each trial. 3. The probabilities of the k outcomes, denoted by p 1, p 2, …, p k, where p 1 + p 2 + … + p k = 1, remain the same from trial to trial. 4. The trials are independent. 5. The random variables of interest are the cell counts n 1, n 2, …, n k of the number of observations that fall into each of the k categories. McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 4

13.2: Testing Categorical Probabilities: One-Way Table Suppose three candidates are running for office, and 150 voters are asked their preferences.  Candidate 1 is the choice of 61 voters.  Candidate 2 is the choice of 53 voters.  Candidate 3 is the choice of 36 voters. Do these data suggest the population may prefer one candidate over the others? McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 5

13.2: Testing Categorical Probabilities: One-Way Table Candidate 1 is the choice of 61 voters. Candidate 2 is the choice of 53 voters. Candidate 3 is the choice of 36 voters. n =150 McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 6

13.2: Testing Categorical Probabilities: One-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 7 Reject the null hypothesis

Test of a Hypothesis about Multinomial Probabilities: One-Way Table H 0 : p 1 = p 1,0, p 2 = p 2,0, …, p k = p k,0 where p 1,0, p 2,0, …, p k,0 represent the hypothesized values of the multinomial probabilities H a : At least one of the multinomial probabilities does not equal its hypothesized value where E i = np 1,0, is the expected cell count given the null hypothesis. 13.2: Testing Categorical Probabilities: One-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 8

Conditions Required for a Valid  2 Test: One-Way Table 1. A multinomial experiment has been conducted. 2. The sample size n will be large enough so that, for every cell, the expected cell count E ( n i ) will be equal to 5 or more. 13.2: Testing Categorical Probabilities: One-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 9

LegalizationDecriminalizationExisting LawNo Opinion 7%18%65%10% 13.2: Testing Categorical Probabilities: One-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 10 Example 13.2: Distribution of Opinions About Marijuana Possession Before Television Series has Aired Table 13.2: Distribution of Opinions About Marijuana Possession After Television Series has Aired LegalizationDecriminalizationExisting LawNo Opinion

13.2: Testing Categorical Probabilities: One-Way Table 11McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.2: Testing Categorical Probabilities: One-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 12 Expected Distribution of 500 Opinions About Marijuana Possession After Television Series has Aired LegalizationDecriminalizationExisting LawNo Opinion 500(.07)=35500(.18)=90500(.65)=325500(.10)=50

13.2: Testing Categorical Probabilities: One-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 13 Expected Distribution of 500 Opinions About Marijuana Possession After Television Series has Aired LegalizationDecriminalizationExisting LawNo Opinion 500(.07)=35500(.18)=90500(.65)=325500(.10)=50

13.2: Testing Categorical Probabilities: One-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 14 Expected Distribution of 500 Opinions About Marijuana Possession After Television Series has Aired LegalizationDecriminalizationExisting LawNo Opinion 500(.07)=35500(.18)=90500(.65)=325500(.10)=50 Reject the null hypothesis

13.2: Testing Categorical Probabilities: One-Way Table Inferences can be made on any single proportion as well:  95% confidence interval on the proportion of citizens in the viewing area with no opinion is McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 15

13.3: Testing Categorical Probabilities: Two-Way Table Chi-square analysis can also be used to investigate studies based on qualitative factors.  Does having one characteristic make it more/less likely to exhibit another characteristic? McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 16

13.3: Testing Categorical Probabilities: Two-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 17 Column 12  cRow Totals 1n 11 n 12  n 1c R1R1 Row2n 21 n 22  n 2c R2R2  rn r1 n r2  n rc RrRr Column TotalsC1C1 C1C1 C1C1 n The columns are divided according to the subcategories for one qualitative variable and the rows for the other qualitative variable.

13.3: Testing Categorical Probabilities: Two-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 18

13.3: Testing Categorical Probabilities: Two-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 19 The results of a survey regarding marital status and religious affiliation are reported below (Example 13.3 in the text). ABCDNoneTotals Divorced Married, never divorced Totals Marital Status Religious Affiliation H 0 : Marital status and religious affiliation are independent H a : Marital status and religious affiliation are dependent

13.3: Testing Categorical Probabilities: Two-Way Table McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 20 The expected frequencies (see Figure 13.4) are included below: ABCDNoneTotals Divorced39 (48.95) 19 (18.56) 12 (12.99) 28 (27.74) 18 (12.76) 116 Married, never divorced 172 (162.05) 61 (61.44) 44 (43.01) 70 (75.26) 37 (42.24) 384 Totals Marital Status Religious Affiliation The chi-square value computed with SAS is , with p-value = Even at the  =.10 level, we cannot reject the null hypothesis.

13.3: Testing Categorical Probabilities: Two-Way Table 21McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis

13.4: A Word of Caution About Chi-Square Tests Relative ease of use Widespread applications Misuse and misinterpretation McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 22

13.4: A Word of Caution About Chi-Square Tests McClave, Statistics, 11th ed. Chapter 13: Categorical Data Analysis 23 Sample is from the correct population Expected counts are ≥ 5 Avoid Type II errors by not accepting non-rejected null hypotheses Avoid mistaking dependence with causation To produce (possibly) valid  2 results Be sure