Inference for Distributions of Categorical Variables (C26 BVD)

Slides:



Advertisements
Similar presentations
Chapter 11 Other Chi-Squared Tests
Advertisements

Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Chapter 13: Inference for Distributions of Categorical Data
Copyright ©2011 Brooks/Cole, Cengage Learning More about Inference for Categorical Variables Chapter 15 1.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Categorical Variables Chapter 15.
Chapter 26: Comparing Counts
CHAPTER 11 Inference for Distributions of Categorical Data
Ch. 28 Chi-square test Used when the data are frequencies (counts) or proportions for 2 or more groups. Example 1.
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
Chi-square Goodness of Fit Test
Chi-Square Analysis Test of Independence. We will now apply the principles of Chi-Square analysis to determine if two variables are independent of one.
Presentation 12 Chi-Square test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 15 Inference for Counts:
Chapter 11: Inference for Distributions of Categorical Data.
Chi-square test or c2 test
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Chapter 26 Chi-Square Testing
Chapter 11 Inference for Tables: Chi-Square Procedures 11.1 Target Goal:I can compute expected counts, conditional distributions, and contributions to.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
CHAPTER 11 SECTION 2 Inference for Relationships.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Non-Parametric Statistics Part I: Chi-Square .
Chapter 11 The Chi-Square Test of Association/Independence Target Goal: I can perform a chi-square test for association/independence to determine whether.
FPP 28 Chi-square test. More types of inference for nominal variables Nominal data is categorical with more than two categories Compare observed frequencies.
Chapter 22: Comparing Two Proportions. Yet Another Standard Deviation (YASD) Standard deviation of the sampling distribution The variance of the sum or.
+ Chi Square Test Homogeneity or Independence( Association)
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Essential Statistics Chapter 161 Review Part III_A_Chi Z-procedure Vs t-procedure.
Chapter 14: Chi-Square Procedures – Test for Goodness of Fit.
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
Copyright © 2010 Pearson Education, Inc. Slide
Comparing Counts.  A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a.
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Chapter Outline Goodness of Fit test Test of Independence.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
AGENDA:. AP STAT Ch. 14.: X 2 Tests Goodness of Fit Homogeniety Independence EQ: What are expected values and how are they used to calculate Chi-Square?
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Chapter 11: Additional Topics Using Inferences 11.1 – Chi-Square: Tests of Independence 11.2 – Chi-Square: Goodness of Fit 11.3 – Testing a Single Variance.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
DICE!  You are going to make your own and then we are going to test them (later) to see if they are fair!
Bullied as a child? Are you tall or short? 6’ 4” 5’ 10” 4’ 2’ 4”
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
Section 13.2 Chi-Squared Test of Independence/Association.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Statistics 300: Elementary Statistics Section 11-3.
11.1 Chi-Square Tests for Goodness of Fit Objectives SWBAT: STATE appropriate hypotheses and COMPUTE expected counts for a chi- square test for goodness.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.
12.2 Tests for Homogeneity and Independence in a two-way table Wednesday, June 22, 2016.
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
Goodness-of-Fit and Contingency Tables Chapter 11.
Chi Square Procedures Chapter 14. Chi-Square Goodness-of-Fit Tests Section 14.1.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8… Where we are going… Significance Tests!! –Ch 9 Tests about a population proportion –Ch 9Tests.
 Check the Random, Large Sample Size and Independent conditions before performing a chi-square test  Use a chi-square test for homogeneity to determine.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Warm Up Check your understanding on p You do NOT need to calculate ALL the expected values by hand but you need to do at least 2. You do NOT need.
CHAPTER 11 Inference for Distributions of Categorical Data
Chi-square test or c2 test
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chapter 11: Inference for Distributions of Categorical Data
Inference for Relationships
Chi-Square Hypothesis Testing PART 3:
Inference for Two Way Tables
Chi Square Test of Homogeneity
Presentation transcript:

Inference for Distributions of Categorical Variables (C26 BVD)

* Chi-Squared distributions are appropriate to model sampling distributions for counted data (categorical variable(s)). * If you have only 1 count being compared to 1 expected, you can use a 1 proportion z-interval or test if conditions are met * Conditions to check are: random sampling/assignment, counted data (not percents, etc.), expected cell counts all >5 (not observed, expected!) * If expected cell count is violated, may be able to “fix” by collapsing table

* If you have one set of counts for one variable being compared to an expected distribution, that is Goodness of Fit (One- Way Table) * If you have multiple distributions for a single variable (Think: 1 question asked in survey) = Test of Homogeneity * If you have two variables (Think: 2 survey questions) = Test of Independence

* (observed – expected) 2 / expected for each cell * Sum all those and you have your statistic * Degrees of freedom for GOF = categories – 1 * Degrees of freedom for Homogeneity and Independence = (rows – 1) * (columns – 1) - does not include table margins

* Old calculators: Put observed in L1, expected in L2, (o-e) 2 / e in L3, then sum L3. That is Chi-squared. Then used Xcdf(statistic, big number, df) to find p-value * Newer calculators: Put observed in L1, expected in L2, then run GOF test under Stat-Test menu. * Always report your test statistic, df, and p- value, then make conclusion

* Put table in matrix A. Do not include margins. * Run chi-squared test under Stat-Test menu. * Don’t forget to check Matrix B for expected count violations. * Always report Chi-squared statistic, df, and p-value then make conclusion

* GOF: Ho: The distribution for _ is as expected (may need to be more specific). Ha: The distribution for _ is NOT as expected. * Homogeneity: Ho: There is no difference in distribution of __ for the populations/treatments __ Ha: There is a difference in distribution…. * Independence: Ho: There is no association between _ and _. Ha: There IS an association between _ and _.

* If reject the null, you should look at each of the components in the sum for the chi-squared statistic (i.e. each (o-e) 2 /e) and see which are the largest. * You should comment about which one or two are largest and thus were the largest contributors to rejecting Ho, i.e. were the most different from expected.

* If you need to find an expected cell count “by hand”: * GOF: Find out what proportion of the total count that category is supposed to be (like 30% of M&M’s are supposed to be yellow) and then take that percent of the total to find expected count. Do not round to whole numbers. * Homogeneity/Independence: Find totals/margins for table. Then, find what percent of total table is in the category of interest for whole table. Then, take that percent of the column of interest and that is the expected cell count.