Slide 1 Copyright © 2004 Pearson Education, Inc.

Presentation transcript:

Slide 1 Copyright © 2004 Pearson Education, Inc.

Slide 2 Copyright © 2004 Pearson Education, Inc. Chapter 10 Multinomial Experiments and Contingency Tables 10-1 Overview 10-2 Multinomial Experiments: Goodness-of-fit 10-3 Contingency Tables: Independence and Homogeneity

Slide 3 Copyright © 2004 Pearson Education, Inc. Created by Erin Hodgess, Houston, Texas Section 10-1 & 10-2 Overview and Multinomial Experiments: Goodness of Fit

Slide 4 Copyright © 2004 Pearson Education, Inc. Overview
• We focus on analysis of categorical (qualitative or attribute) data that can be separated into different categories (often called cells).
• Use the $\chi^2$ (chi-square) test statistic (Table A-4).
• The goodness-of-fit test uses a one-way frequency table (single row or column).
• The contingency table uses a two-way frequency table (two or more rows and columns).

Slide 5 Copyright © 2004 Pearson Education, Inc. Definition: Multinomial Experiment. This is an experiment that meets the following conditions: 1. The number of trials is fixed. 2. The trials are independent. 3. All outcomes of each trial must be classified into exactly one of several different categories. 4. The probabilities for the different categories remain constant for each trial.

Slide 6 Copyright © 2004 Pearson Education, Inc. Example: Last Digit Analysis In 2001, Barry Bonds hit 73 home runs. Table 10-2 summarizes the last digit of those home run distances. Verify that the four conditions of a multinomial experiment are satisfied.

Slide 7 Copyright © 2004 Pearson Education, Inc. Example: Last Digit Analysis In 2001, Barry Bonds hit 73 home runs. Table 10-2 summarizes the last digit of those home run distances. Verify that the four conditions of a multinomial experiment are satisfied. 1. The number of trials (last digits) is the fixed number 73. 2. The trials are independent, because the last digit of the length of a home run does not affect the last digit of the length of any other home run. 3. Each outcome (last digit) is classified into exactly 1 of 10 different categories. The categories are 0, 1, …, 9. 4. Finally, if we assume that the home run distances are measured, the last digits should be equally likely, so that each possible digit has a probability of 1/10.

Slide 8 Copyright © 2004 Pearson Education, Inc. Definition: Goodness-of-fit test. A goodness-of-fit test is used to test the hypothesis that an observed frequency distribution fits (or conforms to) some claimed distribution.

Slide 9 Copyright © 2004 Pearson Education, Inc. Goodness-of-Fit Test Notation: O represents the observed frequency of an outcome. E represents the expected frequency of an outcome. k represents the number of different categories or outcomes. n represents the total number of trials.

Slide 10 Copyright © 2004 Pearson Education, Inc. Expected Frequencies. If all expected frequencies are equal, each expected frequency is the sum of all observed frequencies divided by the number of categories: $E = \frac{n}{k}$

Slide 11 Copyright © 2004 Pearson Education, Inc. Expected Frequencies. If the expected frequencies are not all equal, each expected frequency is found by multiplying the sum of all observed frequencies by the probability for the category: $E = np$
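The two rules for expected frequencies can be sketched in a few lines of Python. This is only an illustration, not code from the slides: the function name `expected_frequencies` and the sample numbers (60 trials, the listed probabilities) are assumptions chosen for the example.

```python
def expected_frequencies(n, probabilities=None, k=None):
    """Return the expected count for each category.

    n             -- total number of trials (sum of all observed frequencies)
    probabilities -- claimed probability of each category, or None
    k             -- number of categories, used when all are equally likely
    """
    if probabilities is None:
        # Equal expected frequencies: E = n / k
        return [n / k] * k
    # Unequal expected frequencies: E = n * p for each category
    return [n * p for p in probabilities]


# Hypothetical data: 60 trials over 6 equally likely categories
equal = expected_frequencies(60, k=6)

# Hypothetical data: 60 trials with claimed probabilities 0.5, 0.3, 0.2
unequal = expected_frequencies(60, probabilities=[0.5, 0.3, 0.2])

print(equal)
print(unequal)
```

In both cases the expected frequencies add up to n, which is a quick sanity check before computing a test statistic.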

Slide 12 Copyright © 2004 Pearson Education, Inc. Goodness-of-fit Test in Multinomial Experiments. Test Statistic: $\chi^2 = \sum \frac{(O - E)^2}{E}$. Critical Values: 1. Found in Table A-4 using k – 1 degrees of freedom, where k = number of categories. 2. Goodness-of-fit hypothesis tests are always right-tailed.
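As a minimal sketch of the test statistic, the sum of (O – E)²/E can be computed directly. The helper name `chi_square_statistic` and the observed counts below are hypothetical and are not taken from Table 10-2.

```python
def chi_square_statistic(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical observed counts for k = 4 equally likely categories
observed = [18, 22, 25, 15]
n = sum(observed)            # total number of trials
k = len(observed)            # number of categories
expected = [n / k] * k       # E = n / k for every category

statistic = chi_square_statistic(observed, expected)
degrees_of_freedom = k - 1   # used to look up the critical value

print(statistic, degrees_of_freedom)
```

The statistic would then be compared with the right-tail critical value for k – 1 degrees of freedom.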

Slide 13 Copyright © 2004 Pearson Education, Inc.
• A close agreement between observed and expected values will lead to a small value of $\chi^2$ and a large P-value.
• A large disagreement between observed and expected values will lead to a large value of $\chi^2$ and a small P-value.
• A significantly large value of $\chi^2$ will cause a rejection of the null hypothesis of no difference between the observed and the expected.
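The contrast between close and poor agreement can be seen with two made-up samples. Everything below is an assumption for illustration: the counts are hypothetical, and 7.815 is the standard chi-square table value for a 0.05 right tail with 3 degrees of freedom.

```python
def chi_square_statistic(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


expected = [25, 25, 25, 25]     # hypothetical: 100 trials, 4 equally likely categories
close_fit = [24, 26, 25, 25]    # hypothetical counts near the expected values
poor_fit = [10, 40, 30, 20]     # hypothetical counts far from the expected values

# Chi-square table value for alpha = 0.05 and k - 1 = 3 degrees of freedom
critical_value = 7.815

for label, observed in (("close fit", close_fit), ("poor fit", poor_fit)):
    statistic = chi_square_statistic(observed, expected)
    reject = statistic > critical_value   # right-tailed test
    print(f"{label}: chi-square = {statistic:.2f}, reject H0: {reject}")
```

Only the sample that strays far from the expected counts produces a statistic in the right tail.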

Slide 14 Copyright © 2004 Pearson Education, Inc. Figure 10-3: Relationships Among Components in Goodness-of-Fit Hypothesis Test

Slide 15 Copyright © 2004 Pearson Education, Inc. Example: Last Digit Analysis In 2001, Barry Bonds hit 73 home runs. Table 10-2 summarizes the last digit of those home run distances. Test the claim that the digits do not occur with the same frequency. $H_0$: $p_0 = p_1 = \cdots = p_9$. $H_1$: At least one of the probabilities is different from the others. $\alpha = 0.05$, $k - 1 = 9$, $\chi^2_{0.05,\,9} = 16.919$.

Slide 16 Copyright © 2004 Pearson Education, Inc. Example: Last Digit Analysis In 2001, Barry Bonds hit 73 home runs. Table 10-2 summarizes the last digit of those home run distances. Test the claim that the digits do not occur with the same frequency.

Slide 17 Copyright © 2004 Pearson Education, Inc. Example: Last Digit Analysis In 2001, Barry Bonds hit 73 home runs. Table 10-2 summarizes the last digit of those home run distances. Test the claim that the digits do not occur with the same frequency. The test statistic is $\chi^2 =$ [value missing]. Since it exceeds the critical value of 16.919, we reject the null hypothesis. There is sufficient evidence to support the claim that the last digits do not occur with the same relative frequency.

Slide 19 Copyright © 2004 Pearson Education, Inc. Example: Detecting Fraud In the Chapter Problem, it was noted that statistics can be used to detect fraud. Table 10-1 lists the percentages for leading digits. Test the claim that there is a significant discrepancy between the leading digits expected from Benford’s Law and the leading digits from the 784 checks. $H_0$: $p_1 = 0.301$, $p_2 = 0.176$, $p_3 = 0.125$, $p_4 = 0.097$, $p_5 = 0.079$, $p_6 = 0.067$, $p_7 = 0.058$, $p_8 = 0.051$, and $p_9 = 0.046$. $H_1$: At least one of the proportions is different from the claimed values. $\alpha = 0.01$, $k - 1 = 8$, $\chi^2_{0.01,\,8} = 20.090$.
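The claimed proportions in this example follow Benford's Law, which assigns leading digit d the probability log10(1 + 1/d). The sketch below recomputes those probabilities and the expected frequencies E = np for the 784 checks. It uses unrounded probabilities, so the expected counts may differ slightly from ones computed with the three-decimal values; the variable names are assumptions for the example.

```python
import math

n_checks = 784  # number of checks in the example

# Benford's Law: P(leading digit = d) = log10(1 + 1/d) for d = 1, ..., 9
benford = {d: math.log10(1 + 1 / d) for d in range(1, 10)}

# Expected frequency for each leading digit: E = n * p
expected = {d: n_checks * p for d, p in benford.items()}

for d in range(1, 10):
    print(f"digit {d}: p = {benford[d]:.3f}, E = {expected[d]:.2f}")
```

Because the nine probabilities sum to 1, the expected frequencies sum to 784, the number of checks.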

Slide 20 Copyright © 2004 Pearson Education, Inc. Example: Detecting Fraud In the Chapter Problem, it was noted that statistics can be used to detect fraud. Table 10-1 lists the percentages for leading digits. Test the claim that there is a significant discrepancy between the leading digits expected from Benford’s Law and the leading digits from the 784 checks.

Slide 21 Copyright © 2004 Pearson Education, Inc. Example: Detecting Fraud In the Chapter Problem, it was noted that statistics can be used to detect fraud. Table 10-1 lists the percentages for leading digits. Test the claim that there is a significant discrepancy between the leading digits expected from Benford’s Law and the leading digits from the 784 checks. The test statistic is $\chi^2 =$ [value missing]. Since it exceeds the critical value of 20.090, we reject the null hypothesis. There is sufficient evidence to support the claim of a significant discrepancy between the leading digits expected from Benford’s Law and the leading digits from the 784 checks.

Slide 24 Copyright © 2004 Pearson Education, Inc. Created by Erin Hodgess, Houston, Texas Section 10-3 Contingency Tables: Independence and Homogeneity

Slide 25 Copyright © 2004 Pearson Education, Inc. Definition: Contingency Table (or two-way frequency table). A contingency table is a table in which frequencies correspond to two variables. (One variable is used to categorize rows, and a second variable is used to categorize columns.) Contingency tables have at least two rows and at least two columns.

Slide 26 Copyright © 2004 Pearson Education, Inc.

Slide 27 Copyright © 2004 Pearson Education, Inc. Definition: Test of Independence. This method tests the null hypothesis that the row variable and column variable in a contingency table are not related. (The null hypothesis is the statement that the row and column variables are independent.)

Slide 28 Copyright © 2004 Pearson Education, Inc. Assumptions 1. The sample data are randomly selected. 2. The null hypothesis $H_0$ is the statement that the row and column variables are independent; the alternative hypothesis $H_1$ is the statement that the row and column variables are dependent. 3. For every cell in the contingency table, the expected frequency E is at least 5. (There is no requirement that every observed frequency must be at least 5.)

Slide 29 Copyright © 2004 Pearson Education, Inc. Test of Independence. Test Statistic: $\chi^2 = \sum \frac{(O - E)^2}{E}$. Critical Values: 1. Found in Table A-4 using degrees of freedom = (r – 1)(c – 1), where r is the number of rows and c is the number of columns. 2. Tests of independence are always right-tailed.

Slide 30 Copyright © 2004 Pearson Education, Inc. $E = \frac{(\text{row total})(\text{column total})}{(\text{grand total})}$, where the grand total is the total number of all observed frequencies in the table.

Slide 31 Copyright © 2004 Pearson Education, Inc. Tests of Independence. $H_0$: The row variable is independent of the column variable. $H_1$: The row variable is dependent on (related to) the column variable. This procedure cannot be used to establish a direct cause-and-effect link between the variables in question. Dependence means only that there is a relationship between the two variables.

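Slide 32 Copyright © 2004 Pearson Education, Inc. Expected Frequency for Contingency Tables. $E = (\text{grand total}) \cdot \frac{\text{row total}}{\text{grand total}} \cdot \frac{\text{column total}}{\text{grand total}}$, which has the form $E = n \cdot p$ (n is the grand total and p is the probability of a cell) and simplifies to $E = \frac{(\text{row total})(\text{column total})}{(\text{grand total})}$.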
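The expected-frequency rule can be applied to every cell of a table at once. The following is a minimal sketch under stated assumptions: the function name `expected_table` and the 2 x 3 table of counts are hypothetical. It also checks the requirement that every expected frequency is at least 5.

```python
def expected_table(observed):
    """Expected frequencies under independence for a two-way table.

    Each cell is (row total) * (column total) / (grand total).
    """
    row_totals = [sum(row) for row in observed]
    column_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in column_totals] for r in row_totals]


# Hypothetical 2 x 3 table of observed counts
observed = [
    [10, 20, 30],
    [20, 20, 20],
]

expected = expected_table(observed)

# Requirement: every expected frequency must be at least 5
all_at_least_5 = all(e >= 5 for row in expected for e in row)

print(expected)
print(all_at_least_5)
```

Note that the check is applied to the expected frequencies, not the observed ones.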

Slide 33 Copyright © 2004 Pearson Education, Inc. Observed and Expected Frequencies. We will use the mortality table from the Titanic to find expected frequencies.

            Men    Women   Boys   Girls   Total
Survived    332    318     29     27      706
Died        1360   104     35     18      1517
Total       1692   422     64     45      2223

For the upper left-hand cell, we find: $E = \frac{(706)(1692)}{2223} = 537.360$

Slide 34 Copyright © 2004 Pearson Education, Inc. Observed and Expected Frequencies. Find the expected frequency for the lower left-hand cell, assuming independence between the row variable and the column variable: $E = \frac{(1517)(1692)}{2223} = 1154.640$

Slide 35 Copyright © 2004 Pearson Education, Inc. Observed and Expected Frequencies. To interpret this result for the lower left-hand cell, we can say that although 1360 men actually died, we would have expected 1154.64 men to die if survivability is independent of whether the person is a man, woman, boy, or girl.

Slide 36 Copyright © 2004 Pearson Education, Inc. Example: Using a 0.05 significance level, test the claim that when the Titanic sank, whether someone survived or died is independent of whether that person is a man, woman, boy, or girl. H 0 : Whether a person survived is independent of whether the person is a man, woman, boy, or girl. H 1 : Surviving the Titanic and being a man, woman, boy, or girl are dependent.

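Slide 37 Copyright © 2004 Pearson Education, Inc. Example: Using a 0.05 significance level, test the claim that when the Titanic sank, whether someone survived or died is independent of whether that person is a man, woman, boy, or girl. $\chi^2 = \frac{(332 - 537.360)^2}{537.360} + \frac{(318 - 134.022)^2}{134.022} + \frac{(29 - 20.326)^2}{20.326} + \frac{(27 - 14.291)^2}{14.291} + \frac{(1360 - 1154.640)^2}{1154.640} + \frac{(104 - 287.978)^2}{287.978} + \frac{(35 - 43.674)^2}{43.674} + \frac{(18 - 30.709)^2}{30.709}$, so $\chi^2 = 78.481 + 252.555 + 3.702 + 11.302 + 36.525 + 117.536 + 1.723 + 5.260 = 507.084$.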

Slide 38 Copyright © 2004 Pearson Education, Inc. Example: Using a 0.05 significance level, test the claim that when the Titanic sank, whether someone survived or died is independent of whether that person is a man, woman, boy, or girl. The number of degrees of freedom is (r – 1)(c – 1) = (2 – 1)(4 – 1) = 3. $\chi^2_{0.05,\,3} = 7.815$. We reject the null hypothesis. Survival and whether the person is a man, woman, boy, or girl are dependent.

Slide 39 Copyright © 2004 Pearson Education, Inc. Test Statistic: $\chi^2 = 507.084$ with $\alpha = 0.05$ and (r – 1)(c – 1) = (2 – 1)(4 – 1) = 3 degrees of freedom. Critical Value: $\chi^2 = 7.815$ (from Table A-4).
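The whole Titanic calculation can be reproduced as a check. This sketch takes the observed counts from the chi-square sum in the example and hard-codes 7.815, the standard table value for a 0.05 right tail with 3 degrees of freedom. It keeps the expected frequencies unrounded, so the statistic comes out near 507.08 and may differ in the last digits from a hand calculation that rounds each expected frequency first.

```python
# Observed counts; columns are men, women, boys, girls
observed = [
    [332, 318, 29, 27],     # survived
    [1360, 104, 35, 18],    # died
]

row_totals = [sum(row) for row in observed]
column_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(row_totals)

# E = (row total)(column total) / (grand total) for every cell
expected = [[r * c / grand_total for c in column_totals] for r in row_totals]

# Chi-square statistic: sum of (O - E)^2 / E over all cells
statistic = sum(
    (o - e) ** 2 / e
    for observed_row, expected_row in zip(observed, expected)
    for o, e in zip(observed_row, expected_row)
)

# Degrees of freedom: (r - 1)(c - 1)
degrees_of_freedom = (len(observed) - 1) * (len(observed[0]) - 1)

critical_value = 7.815                      # table value, alpha = 0.05, df = 3
reject_null = statistic > critical_value    # right-tailed test

print(f"E (survived, men) = {expected[0][0]:.3f}")
print(f"chi-square = {statistic:.2f}, df = {degrees_of_freedom}, reject H0: {reject_null}")
```

The statistic is so far beyond the critical value that the rounding difference has no effect on the decision.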

Slide 40 Copyright © 2004 Pearson Education, Inc. Figure 10-8: Relationships Among Components in $\chi^2$ Test of Independence

Slide 41 Copyright © 2004 Pearson Education, Inc. Definition: Test of Homogeneity. In a test of homogeneity, we test the claim that different populations have the same proportions of some characteristics.

Slide 42 Copyright © 2004 Pearson Education, Inc. How to distinguish between a test of homogeneity and a test for independence: Were predetermined sample sizes used for different populations (test of homogeneity), or was one big sample drawn so both row and column totals were determined randomly (test of independence)?

Slide 43 Copyright © 2004 Pearson Education, Inc. Example: Using Table 10-7 as seen below, with a 0.05 significance level, test the effect of pollster gender on survey responses by men.

Slide 44 Copyright © 2004 Pearson Education, Inc. Example: Using Table 10-7 as seen below, with a 0.05 significance level, test the effect of pollster gender on survey responses by men. H 0 : The proportions of agree/disagree responses are the same for the subjects interviewed by men and the subjects interviewed by women. H 1 : The proportions are different.

Slide 46 Copyright © 2004 Pearson Education, Inc. Example: Using Table 10-7 as seen below, with a 0.05 significance level, test the effect of pollster gender on survey responses by men. The Minitab display includes the test statistic of $\chi^2 =$ [value missing] and a P-value of [value missing]. Using the P-value approach, we reject the null hypothesis of equal (homogeneous) proportions because the P-value is less than 0.05. There is sufficient evidence to reject the claim of equal proportions.