MATH 4030 – 11A ANALYSIS OF R X C TABLES (GOODNESS-OF-FIT TEST) 1.

Slides:



Advertisements
Similar presentations
CHI-SQUARE(X2) DISTRIBUTION
Advertisements

Chi square.  Non-parametric test that’s useful when your sample violates the assumptions about normality required by other tests ◦ All other tests we’ve.
Parametric/Nonparametric Tests. Chi-Square Test It is a technique through the use of which it is possible for all researchers to:  test the goodness.
Chi-Squared Hypothesis Testing Using One-Way and Two-Way Frequency Tables of Categorical Variables.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Goodness-of-Fit Tests.
Statistical Inference for Frequency Data Chapter 16.
INTRODUCTION TO NON-PARAMETRIC ANALYSES CHI SQUARE ANALYSIS.
Chi Square Analyses: Comparing Frequency Distributions.
CHI-SQUARE TEST OF INDEPENDENCE
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Chapter Goals After completing this chapter, you should be able to:
Horng-Chyi HorngStatistics II127 Summary Table of Influence Procedures for a Single Sample (I) &4-8 (&8-6)
CHI-SQUARE GOODNESS OF FIT TEST u A nonparametric statistic u Nonparametric: u does not test a hypothesis about a population value (parameter) u requires.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
CHI-SQUARE GOODNESS OF FIT TEST What Are Nonparametric Statistics? What is the Purpose of the Chi-Square GOF? What Are the Assumptions? How Does it Work?
Presentation 12 Chi-Square test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Chi-Square Test Dr Kishor Bhanushali. Chi-Square Test Chi-square, symbolically written as χ2 (Pronounced as Ki-square), is a statistical measure used.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
Chapter 26: Comparing Counts AP Statistics. Comparing Counts In this chapter, we will be performing hypothesis tests on categorical data In previous chapters,
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Chapter 11 Goodness of Fit Test (section 11.2)
Chi-square test or c2 test
Chapter 16 – Categorical Data Analysis Math 22 Introductory Statistics.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
1 Pertemuan 11 Uji kebaikan Suai dan Uji Independen Mata kuliah : A Statistik Ekonomi Tahun: 2010.
ENGINEERING STATISTICS I
Non-Parametric Statistics Part I: Chi-Square .
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
The Practice of Statistics Third Edition Chapter (13.1) 14.1: Chi-square Test for Goodness of Fit Copyright © 2008 by W. H. Freeman & Company Daniel S.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter Outline Goodness of Fit test Test of Independence.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Dan Piett STAT West Virginia University Lecture 12.
11.2 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Inferences Concerning Variances
1 Chi-square Test Dr. T. T. Kachwala. Using the Chi-Square Test 2 The following are the two Applications: 1. Chi square as a test of Independence 2.Chi.
1 Math 4030 – 10b Inferences Concerning Proportions.
381 Goodness of Fit Tests QSCI 381 – Lecture 40 (Larson and Farber, Sect 10.1)
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Module 25: Confidence Intervals and Hypothesis Tests for Variances for One Sample This module discusses confidence intervals and hypothesis tests.
Chapter 18 Chi-Square Tests.  2 Distribution Let x 1, x 2,.. x n be a random sample from a normal distribution with  and  2, and let s 2 be the sample.
Chapter 11 Chi Square Distribution and Its applications.
1 Section 8.4 Testing a claim about a mean (σ known) Objective For a population with mean µ (with σ known), use a sample (with a sample mean) to test a.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 11 Multinomial Experiments and Contingency Tables 11-1 Overview 11-2 Multinomial Experiments:
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Goodness-of-Fit and Contingency Tables Chapter 11.
Test of independence: Contingency Table
Presentation 12 Chi-Square test.
Chi-square test or c2 test
Hypothesis Testing II: The Two-sample Case
Confidence Intervals and Hypothesis Tests for Variances for One Sample
Math 4030 – 10a Tests for Population Mean(s)
Chi-Square Test Dr Kishor Bhanushali.
Contingency Tables: Independence and Homogeneity
Overview and Chi-Square
Presentation transcript:

MATH 4030 – 11A ANALYSIS OF R X C TABLES (GOODNESS-OF-FIT TEST) 1

Example 1: 3 shops are used to repair electric motors. 100 motors are send to each shop. 2 We need to compare the works done by 3 shops. - How many independent samples? - What should we compare? Null hypothesis? - How do we test? Sample statistic design? - Distribution of statistic(s)? Shop 1Shop 2Shop 3Total Repair Complete Adjustment Needed Repair Incomplete Total

Analysis of r  c Table (Sec. 10.4): 3 C1C2…CcR Total R1X 11 X 12 …X 1c R2X 21 X 22 …X 2c …………… RrX r1 X r2 …X rc C Total N Sample sizes Total count for each outcome Grand Total Shop 1Shop 2Shop 3Total Repair Complete Adjustment Needed Repair Incomplete Total

Shop 1Shop 2Shop 3Total Repair Complete Adjustment Needed Repair Incomplete Total If there is no difference between shops, the proportion of “complete”, “adjustment”, and “incomplete” are the same for all shops. 188/300 76/300 36/ /300 X 100 = /300 X 100 = 12

c independent samples from c populations. Column totals are fixed (sample sizes). 1. Test for Homogeneity 5 C1C2…Cc R Total R1X 11 X 12 …X 1c R2X 21 X 22 …X 2c …………… RrX r1 X r2 …X rc C Total N Test whether the outcome distributions are the same for all populations.

Example 2: To determine whether there really is a relationship (dependency) between a student’s lecture attendance and final exam mark, data are collected from 400 students. 6 - How many independent sample(s)? - What should we compare? Null hypothesis? - How do we test? Sample statistic design? - Distribution of statistic(s)?

7 C1C2…CcR Total R1X 11 X 12 …X 1c R2X 21 X 22 …X 2c …………… RrX r1 X r2 …X rc C Total N Total count for each outcome of lecture attendance Total count for each outcome (of final exam marks) Sample size

8 If there is no relationship (dependence) between “lecture attendance” and “final exam performance”, 112/ / /400 23/60 = 112/400 ? Or 23/400 = 112/400 X 60/400? 60/ / /400 63/152 = 121/400 ? Or 63/400 = 121/400 X 152/400?

One sample, but two (categorical) measures(values) for each unit in the sample. Only the grand total is fixed (sample sizes). 2. Test for Independency 9 C1C2…Cc R Total R1X 11 X 12 …X 1c R2X 21 X 22 …X 2c …………… RrX r1 X r2 …X rc C Total N Test whether the two (categorical) variables/factors are independent.

Chi-Square (Goodness of Fit) Test; 10 Expected cell frequencies: Observed cell frequencies: Test statistic: Assumption: Large sample(s) so that all expected frequencies are at least 5. Otherwise, may combine groups.

Example 1: 3 shops are used to repair electric motors. 100 motors are send to each shop. 11 OijShop 1Shop 2Shop 3Total Repair Complete Adjustment Needed Repair Incomplete Total EijShop 1Shop 2Shop 3Total Repair Complete Adjustment Needed Repair Incomplete Total

Hypothesis Testing: 12 OijShop 1Shop 2Shop 3Total Repair Complete Adjustment Needed Repair Incomplete Total Level of significance: (Right-tailed) With df = (3 – 1)(3 – 1) = 4, the critical chi-squared value is The chi-squared value from the sample is Conclusion: Since the chi-square value from the sample data is greater than 9.488, we reject the null hypothesis. At least for one level of repair status, the proportions from the 3 shops are not all the same.

Example 2: To determine whether there really is a relationship (dependency) between a student’s lecture attendance and final exam mark. 13 Oij Less than 25% attendance Between 25% - 60% attendence Over 60% attendence Total Final Exam < 50% Final Exam 51-70% Final Exam > 70% Total Eij Less than 25% attendance Between 25% - 60% attendence Over 60% attendence Total Final Exam < 50% Final Exam 51-70% Final Exam > 70% Total

Oij Less than 25% attendance Between 25% - 60% attendence Over 60% attendence Total Final Exam < 50% Final Exam % Final Exam > 70% Total Hypothesis Testing: 14 Level of significance: (Right-tailed) With df = (3 – 1)(3 – 1) = 4, the critical chi-squared value is The chi-squared value from the sample is Conclusion: Since the chi-square value from the sample data is greater than 9.488, we reject the null hypothesis. Attendance of lectures and final exam mark are not independent.

Goodness of Fit Test (for distributions); 15 To compare the observed frequencies (from sample(s)) and a theoretical distribution. The grouped frequency table with all frequencies at least 5. Combine groups if needed; Calculate the expected frequencies using theoretical probabilities. Same chi-square statistic can be used; The null hypothesis: the population where the sample is drawn from has the assumed distribution (normal, Poisson, etc.)

Example 3: 16 Number of radio messages received by an air traffic controller during a time period of 5 minutes is assumed to have Poison distribution with parameter 4.6. To verify this assumption, data from 400 five-minutes intervals are collected, with the frequency table: Number of radio messages Observed frequencies

17 Number of radio messages Observed frequencies Poisson Probability Expected frequencies vs