381 Goodness of Fit Tests QSCI 381 – Lecture 40 (Larson and Farber, Sect 10.1)

Slides:



Advertisements
Similar presentations
Parametric/Nonparametric Tests. Chi-Square Test It is a technique through the use of which it is possible for all researchers to:  test the goodness.
Advertisements

Chi-Squared Hypothesis Testing Using One-Way and Two-Way Frequency Tables of Categorical Variables.
Chapter 10 Chi-Square Tests and the F- Distribution 1 Larson/Farber 4th ed.
© 2010 Pearson Prentice Hall. All rights reserved The Chi-Square Goodness-of-Fit Test.
Chapter 11 Chi-Square Procedures 11.1 Chi-Square Goodness of Fit.
CJ 526 Statistical Analysis in Criminal Justice
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
11-2 Goodness-of-Fit In this section, we consider sample data consisting of observed frequency counts arranged in a single row or column (called a one-way.
Chapter 9 Hypothesis Testing.
Chi-Square Tests and the F-Distribution
Chi-square Goodness of Fit Test
Goodness of Fit Test for Proportions of Multinomial Population Chi-square distribution Hypotheses test/Goodness of fit test.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
Hypothesis Testing with Two Samples
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Claims about a Population Mean when σ is Known Objective: test a claim.
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
Chapter 26: Comparing Counts AP Statistics. Comparing Counts In this chapter, we will be performing hypothesis tests on categorical data In previous chapters,
CJ 526 Statistical Analysis in Criminal Justice
Section 10.1 Goodness of Fit. Section 10.1 Objectives Use the chi-square distribution to test whether a frequency distribution fits a claimed distribution.
Chapter 11 Chi-Square Procedures 11.3 Chi-Square Test for Independence; Homogeneity of Proportions.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Chapter 11 Goodness of Fit Test (section 11.2)
381 Hypothesis Testing (Testing with Two Samples-III) QSCI 381 – Lecture 32 (Larson and Farber, Sects 8.3 – 8.4)
Copyright © 2010, 2007, 2004 Pearson Education, Inc. 1.. Section 11-2 Goodness of Fit.
Chapter 16 – Categorical Data Analysis Math 22 Introductory Statistics.
10.1: Multinomial Experiments Multinomial experiment A probability experiment consisting of a fixed number of trials in which there are more than two possible.
Chapter 10 Chi-Square Tests and the F-Distribution
Hypothesis Testing with One Sample Chapter 7. § 7.2 Hypothesis Testing for the Mean (Large Samples)
Chapter Chi-Square Tests and the F-Distribution 1 of © 2012 Pearson Education, Inc. All rights reserved.
381 Hypothesis Testing (Decisions on Means, Proportions and Variances) QSCI 381 – Lecture 28 (Larson and Farber, Sect )
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Other Chi-Square Tests
Chapter 10 Chi-Square Tests and the F-Distribution
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.
Testing Differences in Population Variances
GOODNESS OF FIT Larson/Farber 4th ed 1 Section 10.1.
Other Chi-Square Tests
Chi-Square Test James A. Pershing, Ph.D. Indiana University.
AP Statistics Section 14.. The main objective of Chapter 14 is to test claims about qualitative data consisting of frequency counts for different categories.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter Outline Goodness of Fit test Test of Independence.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
11.2 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using.
© Copyright McGraw-Hill 2004
381 Hypothesis Testing (Introduction-II) QSCI 381 – Lecture 26 (Larson and Farber, Sect 7.1)
Statistics 300: Elementary Statistics Section 11-2.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chapter 11 Chi-Square Procedures 11.1 Chi-Square Goodness of Fit.
Chapter 18 Chi-Square Tests.  2 Distribution Let x 1, x 2,.. x n be a random sample from a normal distribution with  and  2, and let s 2 be the sample.
381 Hypothesis Testing (Testing with Two Samples-II) QSCI 381 – Lecture 31 (Larson and Farber, Sect 8.2)
The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Goodness-of-Fit and Contingency Tables Chapter 11.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
Section 10.1 Goodness of Fit © 2012 Pearson Education, Inc. All rights reserved. 1 of 91.
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 8 Hypothesis Testing with Two Samples.
CONCEPTS OF HYPOTHESIS TESTING
Elementary Statistics: Picturing The World
Testing for Independence
Econ 3790: Business and Economics Statistics
Confidence Intervals for Proportions and Variances
Chapter Outline Goodness of Fit test Test of Independence.
Presentation transcript:

381 Goodness of Fit Tests QSCI 381 – Lecture 40 (Larson and Farber, Sect 10.1)

381 Multinomial Experiments A is a probability experiment consisting of a fixed number of trials in which there are more than two possible outcomes for each independent trial. The probability for each outcome is fixed and each outcome is classified into. Examples of multinomial experiments include: You sample 100 animals from a population. The categories could be age, length, maturity state. You sample 1000 poppies in a field. The categories could be colour. You sample 20 animals and calculate the frequency that each has a particular genetic haplotype.

381 Goodness-of-fit Tests A is used to test whether an observed frequency distribution fits an expected distribution. We need to specify a null and an alternative hypothesis. Generally the null hypothesis is that the observed frequency distribution (the data) fits the expected distribution. The alternative hypothesis is that this is not the case.

381 Example-I We expect that a “healthy” marine mammal population should consist of an equal number of males and females, and that 60% of the population should be mature. We sample 150 animals and assess the fraction in each of four categories to be: Mature Female Mature Male Immature Female Immature Male

381 Observed and Expected Frequencies The of a category is the frequency for the category observed in the data. The of a category is the calculated frequency for the category. Expected frequencies are obtained by assuming the specified (or hypothesized) distribution is correct. The expected frequency for the i th category is: Where n is the number of trials, and p i is the assumed probability for the i th category.

381 Observed and Expected Frequencies (Example) Mature Female Mature Male Immature Female Immature Male Observed frequency Assumed probability Expected frequency 45 (150 x 0.3) 45 (150 x 0.3) 30 (150 x 0.2) 30 (150 x 0.2)

381 The Chi-square goodness-of-fit Test-I IF: 1. the observed frequencies are obtained from a random sample, and 2. the expected frequencies are greater than or equal to 5 (pool categories if this is not the case). then the sampling distribution for the goodness-of-fit test is a chi-square distribution with k-1 degrees of freedom where k is the number of categories. The test statistic is:

381 The Chi-square goodness-of-fit Test-II 1. Identify the claim and state the null and alternative hypotheses. 2. Specify the level of significance, . 3. Determine the degrees of freedom, d.f=k Find the critical value of the chi-square distribution and hence define the rejection region for the test. 5. Calculate the test statistic. 6. Check whether or not the value of the test statistic is in the rejection region.

381 Example (Test using  =0.01) H 0 : the distribution of animals between sex and maturity classes equals that expected for a healthy population. The degrees of freedom=k-1=3. The critical value of the chi-square distribution is (CHIINV(0.01,3))

381 Example (Test using  =0.01) Mature Female Mature Male Immature Female Immature Male Observed frequency Expected frequency We reject the null hypothesis at the 1% level of significance.

381 Example-A-1 (  =0.05) The probability of a particular bird species utilizing each of five habitats is known. We collect data for a different species (n=137) and wish to assess whether the two species differ in their habitat requirements. Habitat type Expected p Observed

381 Example-A-2 (  =0.05) Habitat type Observed frequency Expected frequency The critical value is 9.49 – we fail to reject the null hypothesis

381 Testing for Normality We can use the chi-square test in some cases to assess whether a variable is normally distributed. The null and alternative hypotheses are that: The variable has a normal distribution. The variable does not have a normal distribution.

381 Example Class boundaries Frequency Can we assume that these data are normal (assume  =0.05)?

381 Calculating the Test Statistic Class boundaries Observed frequency O Cumulative normalExpected p Expected Frequency E LowerUpperDifference  x i is the mid-point of each class E i =p i x 149