Chi-squared tests Goodness of fit: Does the actual frequency distribution of some data agree with an assumption? Test of Independence: Are two characteristics.

Slides:



Advertisements
Similar presentations
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Advertisements

CHAPTER 23: Two Categorical Variables: The Chi-Square Test
Hypothesis Testing IV Chi Square.
Chapter 11 Inference for Distributions of Categorical Data
Chapter 10 Chi-Square Tests and the F- Distribution 1 Larson/Farber 4th ed.
Please turn in your signed syllabus. We will be going to get textbooks shortly after class starts. Homework: Reading Guide – Chapter 2: The Chemical Context.
CHAPTER 11 Inference for Distributions of Categorical Data
Chi-Square Test.
Copyright © Cengage Learning. All rights reserved. 11 Applications of Chi-Square.
13.1 Goodness of Fit Test AP Statistics. Chi-Square Distributions The chi-square distributions are a family of distributions that take on only positive.
Section 10.1 Goodness of Fit. Section 10.1 Objectives Use the chi-square distribution to test whether a frequency distribution fits a claimed distribution.
10.1: Multinomial Experiments Multinomial experiment A probability experiment consisting of a fixed number of trials in which there are more than two possible.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Chapter 11: Inference for Distributions of Categorical Data Section 11.1 Chi-Square Goodness-of-Fit Tests.
Chi-Square Test.
Chapter 10 Chi-Square Tests and the F-Distribution
GOODNESS OF FIT Larson/Farber 4th ed 1 Section 10.1.
Warm up On slide.
Chi-Square Test James A. Pershing, Ph.D. Indiana University.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Seven Steps for Doing  2 1) State the hypothesis 2) Create data table 3) Find  2 critical 4) Calculate the expected frequencies 5) Calculate  2 6)
Science Practice 2: The student can use mathematics appropriately. Science Practice 5: The student can perform data analysis and evaluation of evidence.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
Goodness-of-Fit and Contingency Tables Chapter 11.
Chi Square Analysis. What is the chi-square statistic? The chi-square (chi, the Greek letter pronounced "kye”) statistic is a nonparametric statistical.
Section 10.1 Goodness of Fit © 2012 Pearson Education, Inc. All rights reserved. 1 of 91.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Check your understanding: p. 684
CHAPTER 11 Inference for Distributions of Categorical Data
Warm up On slide.
CHAPTER 11 Inference for Distributions of Categorical Data
11.1 Chi-Square Tests for Goodness of Fit
Chi-Square Test.
Other Chi-Square Tests
M & M Statistics: A Chi Square Analysis
Chapter 13 Test for Goodness of Fit
Test for Goodness of Fit
Chi-Square X2.
Section 10-1 – Goodness of Fit
Elementary Statistics: Picturing The World
Chi-Square Test.
The Analysis of Categorical Data and Chi-Square Procedures
Lecture Slides Elementary Statistics Tenth Edition
Chi Square Two-way Tables
Chi-Square Test.
Chi-Square - Goodness of Fit
Is a persons’ size related to if they were bullied
Goodness of Fit Test - Chi-Squared Distribution
CHAPTER 11 Inference for Distributions of Categorical Data
Contingency Tables: Independence and Homogeneity
Overview and Chi-Square
The Analysis of Categorical Data and Goodness of Fit Tests
Chi-Square Test.
Day 66 Agenda: Quiz Ch 12 & minutes.
Chi2 (A.K.A X2).
The Analysis of Categorical Data and Goodness of Fit Tests
What Color is it?.
Chi-square = 2.85 Chi-square crit = 5.99 Achievement is unrelated to whether or not a child attended preschool.
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
The Analysis of Categorical Data and Goodness of Fit Tests
Section 11-1 Review and Preview
The Analysis of Categorical Data and Goodness of Fit Tests
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 26 Part 2 Comparing Counts.
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 14.1 Goodness of Fit Test.
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Inference for Distributions of Categorical Data
Presentation transcript:

Chi-squared tests Goodness of fit: Does the actual frequency distribution of some data agree with an assumption? Test of Independence: Are two characteristics of a set of data dependent or not? Test of homogeneity: Do different populations have the same characteristic in the same proportion? page 574 of text

Example: The distribution of colors in a bag of M&M’s is the same Goodness-of-Fit Test Test the hypothesis that an observed frequency distribution fits some claimed distribution Example: The distribution of colors in a bag of M&M’s is the same Another example: The distribution of colors in a bag of M&M’s is the following: 30% Brown 10% Green 20% Red 10% Blue 20% Yellow 10% Orange The name for this test comes from the idea that we are testing how well an observed frequency distribution fits some specified theoretical distribution.

Categories have equal frequencies Hypothesis Testing Categories have equal frequencies H0: p1 = p2 = p3 = . . . = pk H1: at least one of the probabilities is different Categories have unequal frequencies H0 : p1 , p2, p3, . . . , pk are as claimed H1 : at least one of the above proportions is not the claimed value Examples on page 578 - 579 of text.

Claim: The distribution of colors is the same: Expected Frequencies Brown Yellow Red Orange Green Blue Observed frequency (O) 33 26 21 8 7 5 Expected frequency (E) Claim: The distribution of colors is the same: First, we collect our sample (Observed frequency) Next, we calculate our Expected frequency: If all expected frequencies are equal: If all expected frequencies are not all equal: Each category’s expected frequency is the product the total and the category’s probability page 577 of text

Test Statistics and Critical Values Brown Yellow Red Orange Green Blue Observed frequency (O) 33 26 21 8 7 5 Expected frequency (E) (O-E)2/E Test Statistic: This formula is a measure of the discrepancies between the O and E values. The chi-distribution was first presented in Chapter 6, Section 6-6, page 343.

Critical Values in Chi Square table Fail to Reject Reject Discuss with students that the test does not identify where the significant discrepancies are. However, the figure on the following slide helps to visualize where they might be. Heading: Alpha Goodness-of-fit hypothesis tests are always right-tailed. Degrees of freedom: number of categories minus 1

Uneven Distribution Example Mars claims its M&M candies are distributed with the color percentages of 30% brown, 20% yellow, 20% red, 10% orange, 10% green, and 10% blue. At the 0.05 significance level, test the claim that the color distribution is as claimed by Mars, Inc. H0 (and claim): p1 = 0.30, p2 = 0.20, p3 = 0.20, p4 = 0.10, p5 = 0.10, p6 = 0.10 H1: At least one of the proportions is different from the claimed value.

Uneven Distribution Test statistic Critical Value Brown Yellow Red Orange Green Blue Observed frequency(O) 33 26 21 8 7 5 Expected frequency (O-E)2/E Test statistic Critical Value

Locating the test statistic  = 0.05 Fail to Reject Reject Discuss with students that the test does not identify where the significant discrepancies are. However, the figure on the following slide helps to visualize where they might be.

For example Even distribution: Uneven distribution: The population of South students is evenly distributed among the four classes Uneven distribution: The population of South students is distributed as follows: 15% Lincroft 15% River Plaza 20% Nut Swamp 20% Navesink 20% Leonard 5% Village 5% Private

Live example Leonardo 20 Frosh 29 Lincroft 18 Sophomore 24 Navesink 19 Junior Nut Swamp 21 Senior 23 River Plaza 10 Private 7 Village 5

Your Turn Month Frequency 1 2 3 5 4 6 7 9 8 10 11 12 Claim: students’ favorite month is equally distributed throughout the year

Your Turn, again Claim: students’ favorite colors distribution are are 25% green, 25% blue, and 10% the remaining colors red 8 green 10 blue 13 orange 2 purple 7 black yellow 4