Goodness of Fit.

Slides:



Advertisements
Similar presentations
CHI-SQUARE(X2) DISTRIBUTION
Advertisements

1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
11-3 Contingency Tables In this section we consider contingency tables (or two-way frequency tables), which include frequency counts for categorical data.
Inferences About Process Quality
Chi-Square Tests and the F-Distribution
Chapter 5 Hypothesis Testing Introduction Definition Hypothesis testing is a formal approach for determining if data from a sample support.
1 Chapter 11 Analysis of Variance Introduction 11.2 One-Factor Analysis of Variance 11.3 Two-Factor Analysis of Variance: Introduction and Parameter.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
For testing significance of patterns in qualitative data Test statistic is based on counts that represent the number of items that fall in each category.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Slide Slide 1 Section 8-6 Testing a Claim About a Standard Deviation or Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
1 Math 10 M Geraghty Part 8 Chi-square and ANOVA tests © Maurice Geraghty 2015.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chi Square Test for Goodness of Fit Determining if our sample fits the way it should be.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Chapter 10 Chi-Square Tests and the F-Distribution
Comparing Counts Chi Square Tests Independence.
Test of independence: Contingency Table
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chi-Square hypothesis testing
CHAPTER 11 Inference for Distributions of Categorical Data
Chi-square test or c2 test
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Lecture Slides Elementary Statistics Twelfth Edition
Test for Goodness of Fit
i) Two way ANOVA without replication
Applied Business Statistics, 7th ed. by Ken Black
Testing a Claim About a Mean:  Not Known
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
CHAPTER 11 Inference for Distributions of Categorical Data
Part Three. Data Analysis
CHAPTER 12 ANALYSIS OF VARIANCE
Chapter 10 Chi-Square Tests and the F-Distribution
Chapter 5 Hypothesis Testing
Chapter 11 Goodness-of-Fit and Contingency Tables
Elementary Statistics
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Econ 3790: Business and Economics Statistics
CHAPTER 11 Inference for Distributions of Categorical Data
Contingency Tables: Independence and Homogeneity
Inference on Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHI SQUARE TEST OF INDEPENDENCE
Analyzing the Association Between Categorical Variables
Chapter 13: Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Inference for Two Way Tables
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 10 – Part II Analysis of Variance
CHAPTER 11 Inference for Distributions of Categorical Data
Testing a Claim About a Standard Deviation or Variance
Presentation transcript:

Goodness of Fit

5.7 – Goodness-of-fit Test Chi-square Goodness-of-fit Test Purpose: To test the claim that a random variable X has some particular distribution. Divide the range of X into k categories Observe values of X and record frequencies of the categories, 𝑂 𝑖 Calculate expected frequencies of the categories, 𝐸 𝑖 Calculate the test statistic

Chi-square Goodness-of-fit Test Critical value: 𝜒 𝛼 2 (𝑘−1) P-value = Area to the right of c under the 𝜒 2 (𝑘−1) density curve If 𝑐 >𝜒 𝛼 2 (𝑘−1) or P-value <𝛼, then reject the claim that X has the claimed distribution Requirements The data have been randomly chosen Each expected frequency is at least 5

Example 5.7.3 A student simulated dandelions in a lawn by randomly placing 300 dots on a piece of paper with an area of 100 in2. He then randomly chose 75 different 1 in2 sections of paper and counted the number of dots in each section.

Example 5.7.3 Let X = number of dots in a 1 in2 section Claim: X has a Poisson distribution with 𝜆=3 Let 𝑝𝑖=𝑃(𝑋=𝑖), 𝑖=0,…,6 If the claim were true, then, for instance Denote this number 𝜋 1 𝐸 1 =75 0.149 =11.20

Example 5.7.3 Critical value: Final conclusion: 𝜒 0.05 2 (7−1) =12.59 Do not reject H0 Final conclusion: It is reasonable to assume that X has Poisson distribution with 𝜆=3

5.8 – Test of Independence Two students want to determine if their university men’s basketball team benefits from home-court advantage. They randomly select 205 games played by the team and record if each one was played at home or away and if the team won or lost (data collected by Emily Hudgins and Courtney Santistevan, 2009). “Contingency Table”

Chi-square Test of Independence Purpose: To test if the row events of a contingency table are independent of the column events. Let 𝑛= total number of observations 𝑎= number of rows 𝑏= number of columns 𝑅𝑖 = sum of the i th row 𝐶𝑗 = sum of the j th column 𝑂𝑖𝑗= frequency in the i th row and j th column H0: The rows are independent of the columns

Test of Independence Test statistic: Critical value: 𝜒 𝛼 2 [ 𝑎−1 𝑏−1 ] P-value = area to the right of 𝑐 Reject H0 if 𝑐> c.v. Requirements The data in the table represent frequency counts and are randomly selected All expected frequencies are at least 5

Example Final Conclusion Critical value: 𝜒 0.05 2 2−1 2−1 =3.841 Reject H0 Final Conclusion The result is not independent of the location

5.9 – One-way ANOVA A seed company plants four types of new corn seed on several plots of land and records the yield (in bushels/acre) of each plot as shown below. Test the claim that the four types of seed produce the same mean yield.

One-way ANOVA Purpose: To test for equality of two or more populations means Null hypothesis: H0: 𝜇 1 = ⋯= 𝜇 𝑘 𝑁= total number of data values Test statistic

One-way ANOVA Critical region: [ 𝑓 𝛼 𝑘−1, 𝑁−𝑘 , ∞) P-value: area to the right of F under the F-distribution density curve with k − 1 and N − k degrees of freedom Requirements (“loose” requirements) The populations are normally distributed The populations have the same variance The samples are random and independent Definition: Treatment (or factor) A characteristic that distinguishes the different populations (or groups) from each other

H0: 𝜇 1 = 𝜇 2 = 𝜇 3 = 𝜇 4 H1: At least one mean is different Example 5.9.3 Let 𝜇 1 = mean yield of Type A, etc H0: 𝜇 1 = 𝜇 2 = 𝜇 3 = 𝜇 4 H1: At least one mean is different Critical region 𝑓 0.05 4−1, 14−4 , ∞ =[3.71, ∞) - Reject H0 Final conclusion: The data do not support the claim of equal means

5.10 – Two-way ANOVA Randomized Block Design A statistics professor is comparing four different delivery methods for her introduction to statistics class: face-to-face, online, hybrid, and video (called the treatments). She divides the population of students into three groups according to their overall GPA: high, middle, and low. (called blocks) She randomly chooses four students from each block and randomly assigns each one to a class using one of the delivery methods At the end of the semester she records each student’s overall grade

Two-way ANOVA Two questions: Do the four treatments have the same population mean? Do the blocks have any affect on the scores?

Two-way ANOVA Parameters Null hypotheses 𝜇 𝑖 = ith treatment mean 𝛽 𝑖 = jth “block effect” (a measure of the effect that block i has on the score) Null hypotheses

ANOVA Table P-value = 0.271 – Do not reject H0 There is not a statistically significant difference between the treatment means P-value = 0.008 – Reject HB There is a statistically significant difference in the block effects

Factorial Experiment The statistics professors randomly chooses 16 students from each GPA level, randomly assigns four to each delivery method, and records their scores at the end of the semester. Three questions: Is there any difference between the population mean scores of the delivery methods? Does the GPA level affect the scores? Does the interaction of the delivery method and GPA level affect the scores?

Factorial Experiment 3×4 factorial experiment with 4 replications per treatment Factors GPA (factor A) Delivery Method (factor B) Levels 3 levels of GPA 4 levels of Delivery Method

Factorial Experiment

Factorial Experiment Parameters Null hypotheses 𝜇 𝑗 = population mean of the jth delivery method 𝛼 𝑖 = effect of the ith GPA level on the score 𝛾 𝑖𝑗 =“interaction effect” of the ith GPA level on the jth delivery method Null hypotheses

ANOVA Table P-value = 0.176 – Do not reject HB There is not a statistically significant difference between the means of the delivery methods P-value = 0.000 – Reject HA There is a statistically significant difference in the effects of the GPA levels

ANOVA Table P-value = 0.004 – Reject HAB Overall There is a statistically significant difference in the interaction effects Overall There is not a “best” method Consider certain delivery methods to certain GPA levels