1 Power 14 Goodness of Fit & Contingency Tables. 2 Outline u I. Parting Shots On the Linear Probability Model u II. Goodness of Fit & Chi Square u III.Contingency.

Slides:



Advertisements
Similar presentations
1 2 Test for Independence 2 Test for Independence.
Advertisements

McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chi-Square Tests Chapter 12.
Categorical Data Analysis
Copyright ©2006 Brooks/Cole A division of Thomson Learning, Inc. Introduction to Probability and Statistics Twelfth Edition Robert J. Beaver Barbara M.
CHI-SQUARE(X2) DISTRIBUTION
1 The Vision Thing Power Thirteen Bivariate Normal Distribution.
Hypothesis Testing IV Chi Square.
Discrete (Categorical) Data Analysis
1 Power 14 Goodness of Fit & Contingency Tables. 2 II. Goodness of Fit & Chi Square u Rolling a Fair Die u The Multinomial Distribution u Experiment:
1 Midterm Review Econ 240A. 2 The Big Picture The Classical Statistical Trail Descriptive Statistics Inferential Statistics Probability Discrete Random.
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
1 Power 14 Goodness of Fit & Contingency Tables. 2 Outline u I. Projects u II. Goodness of Fit & Chi Square u III.Contingency Tables.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
1 Chapter 16 Linear regression is a procedure that identifies relationship between independent variables and a dependent variable.Linear regression is.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.
1 The Vision Thing Power Thirteen Bivariate Normal Distribution.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
For testing significance of patterns in qualitative data Test statistic is based on counts that represent the number of items that fall in each category.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 16 – Categorical Data Analysis Math 22 Introductory Statistics.
1 Pertemuan 11 Uji kebaikan Suai dan Uji Independen Mata kuliah : A Statistik Ekonomi Tahun: 2010.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Introduction Many experiments result in measurements that are qualitative or categorical rather than quantitative. Humans classified by ethnic origin Hair.
AP STATS EXAM REVIEW Chapter 8 Chapter 13 and 14 Chapter 11 and 12 Chapter 9 and Chapter 10 Chapter 7.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
© 2000 Prentice-Hall, Inc. Statistics The Chi-Square Test & The Analysis of Contingency Tables Chapter 13.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Introduction to Probability and Statistics Thirteenth Edition Chapter 13 Analysis of Categorical Data.
Chapter Outline Goodness of Fit test Test of Independence.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Dan Piett STAT West Virginia University Lecture 12.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Statistics 300: Elementary Statistics Section 11-2.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chi Square Test for Goodness of Fit Determining if our sample fits the way it should be.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Chi Square Chi square is employed to test the difference between an actual sample and another hypothetical or previously established distribution such.
The Chi-square Statistic
Distributions of Nominal Variables
Test of independence: Contingency Table
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
Chapter 9: Non-parametric Tests
Chapter 11 Chi-Square Tests.
Chapter 4. Inference about Process Quality
Distributions of Nominal Variables
The binomial applied: absolute and relative risks, chi-square
Chapter 12 Tests with Qualitative Data
Active Learning Lecture Slides
Part Three. Data Analysis
Data Analysis for Two-Way Tables
Prepared by: Francis Joseph Campeña
Econ 3790: Business and Economics Statistics
Chapter 13: Categorical Data Analysis
Statistical Process Control
Chapter 11 Chi-Square Tests.
Chapter 13 – Applications of the Chi-Square Statistic
Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007
CHI SQUARE TEST OF INDEPENDENCE
Inference for Two Way Tables
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11 Chi-Square Tests.
Math 10, Spring 2019 Introductory Statistics
Chi - square.
MATH 2311 Section 8.5.
Presentation transcript:

1 Power 14 Goodness of Fit & Contingency Tables

2 Outline u I. Parting Shots On the Linear Probability Model u II. Goodness of Fit & Chi Square u III.Contingency Tables

3 The Vision Thing u Discriminating BetweenTwo Populations u Decision Theory and the Regression Line

4 income education  x = a,  x 2 >  y 2  y = b  x, y > 0 mean income non Mean educ. non Mean Educ Players Mean income Players Players Non-players Discriminating line

5 Expected Costs of Misclassification u E C MC = C(n/p)*P(n/p)*P(p) + C(p/n)*P(n/p)*P(p)  where P(n) = 23/100  u Suppose C(n/p) = C(p/n) u then E C MC = C*P(n/p)*3/4 + C*P(p/n)*1/4 u And the two costs of misclassification will be balanced if P(p/n) =3/4 = Bern

6 The Regression Line- Discriminant Function u Bern = 3/4 u Bern = c + b 1 *educ + b 2 *income u Bern = 3/4 = *educ * income, or u *educ = *income u Educ = *income, u the regression line

7 Lottery: Players and Non-Players Vs. Education & Income Income ($000) Education (Years) Discriminant Function or Decision Rule: Bern = ¾ = 1.39 – *education – *income Legend: Non-Players Players Mean- Nonplayers Mean-Players

8 II. Goodness of Fit & Chi Square u Rolling a Fair Die u The Multinomial Distribution u Experiment: 600 Tosses

9 The Expected Frequencies

10 The Expected Frequencies & Empirical Frequencies Empirical Frequency

11 Hypothesis Test u Null H 0 : Distribution is Multinomial u Statistic: (O i - E i ) 2 /E i, : observed minus expected squared divided by expected u Set Type I 5% for example u Distribution of Statistic is Chi Square P(n 1 =1, n 2 n 3 =0, n 4 =0, n 5 =0, n 6 =0) = n!/ P(n 1 =1, n 2 =0, n 3 =0, n 4 =0, n 5 =0, n 6 =0) = n!/ P(n 1 =1, n 2 n 3 =0, n 4 =0, n 5 =0, n 6 =0)= 1!/1!0!0!0!0!0!(1/6) 1 (1/6) 0 P(n 1 =1, n 2 =0, n 3 =0, n 4 =0, n 5 =0, n 6 =0)= 1!/1!0!0!0!0!0!(1/6) 1 (1/6) 0 (1/6) 0 (1/6) 0 (1/6) 0 (1/6) 0 One Throw, side one comes up: multinomial distribution

12

13 Chi Square: x 2 =  (O i - E i ) 2 = 6.15

Chi Square Density for 5 degrees of freedom %

15 Contingency Table Analysis u Tests for Association Vs. Independence For Qualitative Variables

16 Does Consumer Knowledge Affect Purchases? Frost Free Refrigerators Use More Electricity

17 Marginal Counts

18 Marginal Distributions, f(x) & f(y)

19 Joint Disribution Under Independence f(x,y) = f(x)*f(y)

20 Expected Cell Frequencies Under Independence

21 Observed Cell Counts

22 Contribution to Chi Square: (observed-Expected) 2 /Expected Chi Sqare = = 3.09 (m-1)*(n-1) = 1*1=1 degrees of freedom Upper Left Cell: ( ) 2 /324 = 100/324 =0.31

5% 5.02

24 Using Goodness of Fit to Choose Between Competing Proabaility Models u Men on base when a home run is hit

25 Men on base when a home run is hit

26 Conjecture u Distribution is binomial

27 Average # of men on base Sum of products = n*p = = 0.63

28 Using the binomial k=men on base, n=# of trials u P(k=0) = [3!/0!3!] (0.21) 0 (0.79) 3 = u P(k=1) = [3!/1!2!] (0.21) 1 (0.79) 2 = u P(k=2) = [3!/2!1!] (0.21) 2 (0.79) 1 = u P(k=3) = [3!/3!0!] (0.21) 3 (0.79) 0 = 0.009

29 Goodness of Fit

Chi Square, 3 degrees of freedom 5% 7.81

31 Conjecture: Poisson where  np = 0.63 u P(k=3) = 1- P(k=2)-P(k=1)-P(k=0)  P(k=0) = e -   k /k! = e (0.63) 0 /0! =  P(k=1) = e -   k /k! = e (0.63) 1 /1! =  P(k=2) = e -   k /k! = e (0.63) 2 /2! =

32 Goodness of Fit