Chapter 26: Comparing Counts

Slides:



Advertisements
Similar presentations
Chapter 26 Comparing Counts
Advertisements

CHAPTER 23: Two Categorical Variables: The Chi-Square Test
Chapter 11 Inference for Distributions of Categorical Data
Chi Square Procedures Chapter 11.
Chapter 11 Inference for Distributions of Categorical Data
Chapter 13: Inference for Distributions of Categorical Data
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
CHAPTER 11 Inference for Distributions of Categorical Data
11-2 Goodness-of-Fit In this section, we consider sample data consisting of observed frequency counts arranged in a single row or column (called a one-way.
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
Chapter 13: Inference for Tables – Chi-Square Procedures
Testing Distributions Section Starter Elite distance runners are thinner than the rest of us. Skinfold thickness, which indirectly measures.
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 15 Inference for Counts:
AP STATISTICS LESSON 13 – 1 (DAY 1) CHI-SQUARE PROCEDURES TEST FOR GOODNESS OF FIT.
 Involves testing a hypothesis.  There is no single parameter to estimate.  Considers all categories to give an overall idea of whether the observed.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
Chapter 26: Comparing Counts AP Statistics. Comparing Counts In this chapter, we will be performing hypothesis tests on categorical data In previous chapters,
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
1 Desipramine is an antidepressant affecting the brain chemicals that may become unbalanced and cause depression. It was tested for recovery from cocaine.
CHAPTER 26: COMPARING COUNTS OF CATEGORICAL DATA To test claims and make inferences about counts for categorical variables Objective:
Chapter 26 Chi-Square Testing
Chapter 11 Inference for Tables: Chi-Square Procedures 11.1 Target Goal:I can compute expected counts, conditional distributions, and contributions to.
Chapter 26: Comparing counts of categorical data
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 26 Comparing Counts.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Chapter 11 The Chi-Square Test of Association/Independence Target Goal: I can perform a chi-square test for association/independence to determine whether.
+ Chi Square Test Homogeneity or Independence( Association)
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Essential Statistics Chapter 161 Review Part III_A_Chi Z-procedure Vs t-procedure.
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
Warm up On slide.
Copyright © 2010 Pearson Education, Inc. Slide
Comparing Counts.  A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Lesson Inference for Two-Way Tables. Knowledge Objectives Explain what is mean by a two-way table. Define the chi-square (χ 2 ) statistic. Identify.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Statistics 300: Elementary Statistics Section 11-3.
Statistics 26 Comparing Counts. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
Chi Square Procedures Chapter 14. Chi-Square Goodness-of-Fit Tests Section 14.1.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8… Where we are going… Significance Tests!! –Ch 9 Tests about a population proportion –Ch 9Tests.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a goodness-of-fit.
Comparing Counts Chi Square Tests Independence.
CHAPTER 26 Comparing Counts.
Chapter 25 Comparing Counts.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chapter 11: Inference for Distributions of Categorical Data
Paired Samples and Blocks
Analyzing the Association Between Categorical Variables
Chapter 26 Comparing Counts.
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Inference for Two Way Tables
Chapter 26 Comparing Counts.
Chapter 14.1 Goodness of Fit Test.
Presentation transcript:

Chapter 26: Comparing Counts AP Statistics Chapter 26: Comparing Counts

Test for Goodness of Fit: To analyze categorical data, we construct two-way tables and examine the counts or percents of the explanatory and response variables. Hershey Activity: This table was changed for Hershey’s Miniatures distribution, data courtesy of Joan Thomas after she saw an episode on Oprah (I think). Kind Milk Chocolate Mr. Goodbar Krackel Special Dark Claimed % 40% 21% 18%

Proportion of Hershey Miniatures KIND Observed Count, O Expected Count, E Milk Chocolate Mr. Goodbar Krackel Special Dark

We want to compare the observed counts to the expected counts. The null hypothesis is that there is no difference between the observed and expected counts. The alternative hypothesis is that there is a difference between the observed and expected counts.

is called the chi-square test statistic is called the chi-square test statistic . It measures how well the observed counts fit the expected counts, assuming that the null hypothesis is true.

The distribution of the chi-square statistic is called the chi-square distribution, 2. This distribution is a density curve. The total area under the curve is 1 . The curve begins at zero on the horizontal axis and is skewed right . As the degrees of freedom increase, the shape of the curve becomes more symmetric .

Using the Hershey’s Miniatures chi-square statistic, find the probability of obtaining a 2 value at least this extreme assuming the null hypothesis is true. This is known as the “Goodness of Fit Test.” Graph the chi-square distribution with (k – 1) = 3 degrees of freedom, where k = # of categories:

TI-83+: 2pdf (X, 5) We would expect to obtain a 2 value at least this extreme in about _____ out of every _____ samples, assuming the null hypothesis is true.

CONDITIONS: The Goodness of Fit Test may be used when: SRS we are working with counts . all counts are at least 1 and most (at least 80%) of the counts are at least 5 .   *Following the Goodness of Fit Test, check to see which component made the greatest contribution to the chi-square statistic to see where the biggest changes occurred.

Inference for Two Way Tables: To compare two proportions, we use a 2-Proportion Z Test. If we want to compare three or more proportions, we need a new procedure. The first step in the overall test for comparing several proportions is to arrange the data in a 2-way table. Think of the counts as elements of a matrix with r rows and c columns. This is called an r x c table with (r)(c) cells. Our null hypothesis is that there is no difference among the proportions. The alternative hypothesis is that there are some difference among the proportions.

We will use the chi-square test to measure how far the observed values are from the expected values. To calculate the expected counts, multiply the row total by the column total, and divide by the table total :

The chi-square statistic is the sum over all r x c cells in the table: The degrees of freedom is (r – 1)(c – 1) . The P-value is the area to the right of the statistic under the chi-square density curve.

In Summary: Use a chi-square goodness of fit test when you want to show that the distribution of counts in one categorical variable matches the distribution predicted by a model. ( one categorical variable, one -way table) H0: The distribution of Hershey Miniatures is the same as the advertised distribution.

Use a chi-square test for homogeneity when you want to show that the distribution of counts for two or more groups is the same. “Are the groups homogeneous?”; more than one population; split, then sample ( one categorical variable, two -way table) H0: The distribution of X is the same as the distribution of Y.

H0: X and Y are independent. Use a chi-square test for independence when you want to show that two categorical variables are independent for one group of individuals. “Are the variables independent?”; one population; sample, then split ( two categorical variables, two -way table) H0: X and Y are independent.

What Can Go Wrong? Don’t use chi-square methods unless you have counts. Just because numbers are in a two-way table doesn’t make them suitable for chi-square analysis. Beware large samples. With a sufficiently large sample size, a chi-square test can always reject the null hypothesis. Don’t say that one variable “depends” on the other just because they’re not independent. Association is not causation.

What have we learned? We’ve learned how to test hypotheses about categorical variables. All three methods we examined look at counts of data in categories and rely on chi-square models. Goodness-of-fit tests compare the observed distribution of a single categorical variable to an expected distribution based on theory or model. Tests of homogeneity compare the distribution of several groups for the same categorical variable. Tests of independence examine counts from a single group for evidence of an association between two categorical variables.

What have we learned? (cont.) Mechanically, these tests are almost identical. While the tests appear to be one-sided, conceptually they are many-sided, because there are many ways that the data can deviate significantly from what we hypothesize. When we reject the null hypothesis, we know to examine standardized residuals to better understand the patterns in the data.