Lecture 18 Section 8.3 Objectives: Chi-squared distributions

Slides:



Advertisements
Similar presentations
Categorical Data Analysis
Advertisements

Chapter 11 Other Chi-Squared Tests
Chi-square test Chi-square test or  2 test. Chi-square test countsUsed to test the counts of categorical data ThreeThree types –Goodness of fit (univariate)
AP Statistics Tuesday, 15 April 2014 OBJECTIVE TSW (1) identify the conditions to use a chi-square test; (2) examine the chi-square test for independence;
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Goodness-of-Fit Tests.
The Analysis of Categorical Data and Goodness of Fit Tests
Chapter 11 Inference for Distributions of Categorical Data
11-2 Goodness-of-Fit In this section, we consider sample data consisting of observed frequency counts arranged in a single row or column (called a one-way.
Chi-square Goodness of Fit Test
Presentation 12 Chi-Square test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
Chi-square test or c2 test
Slide Copyright © 2008 Pearson Education, Inc. Chapter 12 Chi-Square Procedures.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Chapter 12: The Analysis of Categorical Data and Goodness- of-Fit Test.
Chi-Squared Significance Tests Chapters 26/27 Objectives: Chi-Squared Distribution Chi-Squared Test Statistic Chi-Squared Goodness of Fit Test Chi-Squared.
+ Chi Square Test Homogeneity or Independence( Association)
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Comparing Counts.  A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a.
11.2 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chapter 12 The Analysis of Categorical Data and Goodness of Fit Tests.
Lecture 11. The chi-square test for goodness of fit.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
Chi-Square Goodness of Fit Test. In general, the chi-square test statistic is of the form If the computed test statistic is large, then the observed and.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Copyright © Cengage Learning. All rights reserved. 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Test of Goodness of Fit Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007.
Check your understanding: p. 684
Comparing Counts Chi Square Tests Independence.
Warm Up Check your understanding on p You do NOT need to calculate ALL the expected values by hand but you need to do at least 2. You do NOT need.
Presentation 12 Chi-Square test.
CHAPTER 11 Inference for Distributions of Categorical Data
Chi-square test or c2 test
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Chi-squared test or c2 test
CHAPTER 11 CHI-SQUARE TESTS
Chapter 11 Chi-Square Tests.
Test for Goodness of Fit
Chapter 12 Tests with Qualitative Data
Chapter 12: Inference about a Population Lecture 6b
Data Analysis for Two-Way Tables
Chapter 11 Goodness-of-Fit and Contingency Tables
Elementary Statistics: Picturing The World
The Analysis of Categorical Data and Chi-Square Procedures
Analysis of count data 1.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chapter 11: Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
Lecture 36 Section 14.1 – 14.3 Mon, Nov 27, 2006
Chi-square test or c2 test
Chapter 11 Chi-Square Tests.
Inference on Categorical Data
The Analysis of Categorical Data and Goodness of Fit Tests
Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007
Analyzing the Association Between Categorical Variables
CHAPTER 11 CHI-SQUARE TESTS
The Analysis of Categorical Data and Goodness of Fit Tests
The Analysis of Categorical Data and Goodness of Fit Tests
Inference for Two Way Tables
The Analysis of Categorical Data and Goodness of Fit Tests
Chapter 11 Chi-Square Tests.
Lecture 43 Section 14.1 – 14.3 Mon, Nov 28, 2005
Presentation transcript:

Lecture 18 Section 8.3 Objectives: Chi-squared distributions Testing concerning hypotheses about a categorical population Chi-squared distributions Tests based on univariate categorical data Testing for homogeneity of several categorical variables

Goodness-of-fit Test A factory produces marbles in the sizes small, medium, and large. A third of the marbles are supposed to be small, half of them medium, and a sixth are supposed to be large. Denote the observed value for the small size by O1, the observed value for medium by O2, and the observed value for large by O3. A simple random sample of 120 marbles from the factory contains O1=25 , O2=72 , and O3=23 . Is the observed distribution in the sample consistent with the theoretical distribution?

Goodness-of-fit Test The appropriate test for evaluating this claim is Ha: not all equalities hold in H0 where π1 denotes the proportion of small marbles, π2 the proportion of medium marbles, and π3 the proportion of large marbles produced by the factory. Denote the expected value for the small size by E1, the expected value for medium by E2, and the expected value for large by E3. From a sample of 120 marbles, what would be the expected number of small, medium and large marbles?

Goodness-of-fit Test small medium large Observed 25 72 23 Expected 40 60 20 How close are the observed values to the expected values? In our example X2 = 8.475. A large value of X2 is therefore considered evidence that the null hypothesis is not true. Is 8.475 a large value? How likely it is to obtain a value of X2 that is 8.475 or larger when the null hypothesis is in fact true.

Chi-Squared Distribution Test statistic X2 has approximately the chi-squared distribution with df = k−1 degrees of freedom, where k is the number of categories. The p-value of this test: p-value = Table VII gives the area under the χ2 curve to the right of the calculated X2 value. Find the p-value for the marbles example.

Example The U.S. Federal Bureau of Investigation (FBI) compiles data on crimes and crime rates and publishes the information in Crime in the United States. A violent crime is classified as by the FBI as murder, forcible rape, robbery, or aggravated assault. The following table provides a relative frequency distribution for the reported violent crimes in 1995. A random sample of 500 violent-crime reports from last year yielded the frequency distribution shown in the following table. Do the data provide sufficient evidence to conclude that last year's distribution of violent crimes has changed from the 1995 distribution?

Testing for Homogeneity of Several Categorical Variables Suppose that an investigator is interested in several different categorical populations or processes, each one consisting of the same categories. The investigator wishes to test whether the populations are homogeneous – the proportion in the first category is the same for all populations, the proportion in the second category is the same for all populations, and so on. The Chi-squared test for homogeneity of several categorical populations Denote the number of population by r and the number of categories for each population by k (the same k categories for all r populations). 1. State the Hypotheses: H0: the r populations are homogeneous with respect to the categories. Ha: the populations are not homogeneous.

Testing for Homogeneity of Several Categorical Variables 2. Calculate the test statistic The test statistic is, The chi-square statistic compares the observed cell counts with the expected cell counts given by Expected Cell Count Note1: If the expected counts and the observed counts are very different, a large value of X2 will result. Large values of X2 provide evidence against the null hypothesis. Note2: When all expected cell counts are at least 5, approx Approximately under H0.

Example A company packages a particular product in cans of three different sizes, each one using a different production line. Most cans conform to specifications, but a quality control engineer has identified the following reasons for non-conformance: 1. Blemish on can 2. Crack in can 3. Improper pull tab location 4. Pull tab missing 5. Other A sample of nonconforming units is selected from each of the three lines, and each unit is categorized according to reason for nonconformity, resulting in the following table. Does the data suggest that the proportions falling in the various conformance categories are not the same for the three lines?