Goodness of Fit Multinomials. Multinomial Proportions Thus far we have discussed proportions for situations where the result for the qualitative variable.

Slides:



Advertisements
Similar presentations
Contingency Tables For Tests of Independence. Multinomials Over Various Categories Thus far the situation where there are multiple outcomes for the qualitative.
Advertisements

Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
Chapter 12 Goodness-of-Fit Tests and Contingency Analysis
Chi square.  Non-parametric test that’s useful when your sample violates the assumptions about normality required by other tests ◦ All other tests we’ve.
Inference about the Difference Between the
Discrete (Categorical) Data Analysis
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
Hypothesis Tests for Proportions. The Random Variable Recall –Normal distribution –Mean  = p –Standard Deviation =
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chapter 12 Chi-Square Tests and Nonparametric Tests
Ch. 28 Chi-square test Used when the data are frequencies (counts) or proportions for 2 or more groups. Example 1.
Chapter Goals After completing this chapter, you should be able to:
Chapter 16 Chi Squared Tests.
Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Goodness of Fit Test for Proportions of Multinomial Population Chi-square distribution Hypotheses test/Goodness of fit test.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
Presentation 12 Chi-Square test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
Chapter 13: Inference in Regression
GOODNESS OF FIT TEST & CONTINGENCY TABLE
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics, A First Course 4 th Edition.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
Comparing Two Proportions
Economics 173 Business Statistics Lecture 6 Fall, 2001 Professor J. Petry
Ch9. Inferences Concerning Proportions. Outline Estimation of Proportions Hypothesis concerning one Proportion Hypothesis concerning several proportions.
Copyright © 2009 Cengage Learning 15.1 Chapter 16 Chi-Squared Tests.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
CHAPTER 17: Tests of Significance: The Basics
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide Chapter 11 Comparisons Involving Proportions n Inference about the Difference Between the Proportions of Two Populations Proportions of Two Populations.
Testing Hypothesis That Data Fit a Given Probability Distribution Problem: We have a sample of size n. Determine if the data fits a probability distribution.
FPP 28 Chi-square test. More types of inference for nominal variables Nominal data is categorical with more than two categories Compare observed frequencies.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.
+ Chi Square Test Homogeneity or Independence( Association)
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
Fitting probability models to frequency data. Review - proportions Data: discrete nominal variable with two states (“success” and “failure”) You can do.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
Copyright © 2010 Pearson Education, Inc. Slide
Comparing Counts.  A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a.
Chap 11-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 11 Chi-Square Tests Business Statistics: A First Course 6 th Edition.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Inference for Distributions of Categorical Variables (C26 BVD)
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Chapter Outline Goodness of Fit test Test of Independence.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
AP Statistics Section 11.1 B More on Significance Tests.
Dan Piett STAT West Virginia University Lecture 12.
© Copyright McGraw-Hill 2004
Statistics 300: Elementary Statistics Section 11-2.
Week 6 Dr. Jenne Meyer.  Article review  Rules of variance  Keep unaccounted variance small (you want to be able to explain why the variance occurs)
Lecture 11. The chi-square test for goodness of fit.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chi-Två Test Kapitel 6. Introduction Two statistical techniques are presented, to analyze nominal data. –A goodness-of-fit test for the multinomial experiment.
Chi Square 11.1 Chi Square. All the tests we’ve learned so far assume that our data is normally distributed z-test t-test We test hypotheses about parameters.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
Chapter 11 Chi-Square Tests.
Chapter 13 Goodness-of-Fit Tests and Contingency Analysis
Chapter 11 Chi-Square Tests.
Chapter 13 Goodness-of-Fit Tests and Contingency Analysis
Chapter 11 Chi-Square Tests.
Presentation transcript:

Goodness of Fit Multinomials

Multinomial Proportions Thus far we have discussed proportions for situations where the result for the qualitative variable could be only “success” or “failure” Now we discuss the situation where there are multiple outcomes for the qualitative variable

EXAMPLE Suppose the 1000 people had 5 choices COLA OBSERVED FREQUENCY (1) Coke f 1 = 410 (2) Pepsif 2 = 350 (3) RCf 3 = 80 (4) Shastaf 4 = 50 (5) Joltf 5 = 110

QUESTIONS (1) Can we conclude that there are differences in cola preference? (2) Last year 40% favored Coke, 35% Pepsi and 25% all other brands. Can we conclude these preferences have changed? (3) Give a 95% confidence interval for those who favor Coke.

(1) CAN IT BE CONCLUDED THAT THERE ARE DIFFERENCES IN COLA PREFERENCES? The answer is NO unless we can conclusively show otherwise. H 0 : (NO) p 1 = p 2 = p 3 = p 4 = p 5 =.2 H A : (YES) At least one p j .2  =.05 THIS IS A  2 (Chi-squared) TEST!

THE  2 (Chi-squared) STATISTIC The  2 (Chi-squared) statistic is defined as the cumulative mean square differences between the observed values (f i ) and the expected values if H 0 were true (e i )

RULE OF 5  2 (Chi-squared) is actually only an approximate distribution for the test statistic To be a “valid” approximation: ALL e i ’s should be  5 If the rule of 5 is violated, combine some categories so that the condition is met

THE  2 (Chi-squared) TEST Reject H 0 if  2 >  2.05,DF DF = k-1, where k = # categories (=5, here)  2.05,4 = a critical value found in a  2 table  2.05,4 = If H 0 were true, p 1 = p 2 = p 3 = p 4 = p 5 =.2 We would expect to find: e 1 =.2(1000) = 200; and e 2 =.2(1000) = 200; e 3 =.2(1000) = 200; e 4 =.2(1000) = 200; e 5 =.2(1000) = 200 ALL e i ’s ARE  5

CALCULATION OF  2 THE MULTINOMIAL TABLE Cola ObservedExpected Difference Mean Sq. Dif. i f i e i (f i - e i ) (f i - e i ) 2 /e i SUM = =  2

RESULTS  2 = >  2.05,4 = There is strong evidence that differences exist in cola preferences

(2) CAN IT BE CONCLUDED COLA PREFERENCES HAVE CHANGED SINCE LAST YEAR? H 0 : (NO) p 1 =.40; p 2 =.35; p OTHER =.25 H A : (YES) At least one p j  its hypothesized value  =.05 There are now k = 3 categories. Reject H 0 if  2 >  2.05,2 =

CALCULATION OF  2 Cola ObservedExpected Difference Mean Sq. Dif. i f i e i (f i - e i ) (f i - e i ) 2 /e i Other SUM =.65 =  2  2 =.65 < Cannot conclude preferences have changed All e i ’s > 5

(3) CONFIDENCE INTERVAL FOR PROPORTION WHO FAVOR COKE This is now binomial –Coke and everything else

Excel The Excel function CHITEST returns the p-value for the hypothesis test. Its form is CHITEST (Range of observed values,Range of estimated values)

=C3*$B$8 Drag down to D4:D7 =CHITEST(B3:B7,D3:D7 ) VERY LOW p-value =C12*$B$15 Drag down to D13:D14 VERY HIGH p-value =CHITEST(B12:B14,D12:D14 )

Review Multinomial problems exist when there are more than two possible outcomes for a qualitative variable Excel Approach -- compare observed values to expected values by using CHITEST to give the p-value Hand approach -- Compare the  2 statistic to  2 ,DF where DF = # categories - 1 = k-1 The value of the  2 statistic is: