Statistical Inference

Slides:



Advertisements
Similar presentations
Contingency Table Analysis Mary Whiteside, Ph.D..
Advertisements

What is Chi-Square? Used to examine differences in the distributions of nominal data A mathematical comparison between expected frequencies and observed.
CHI-SQUARE(X2) DISTRIBUTION
Inference about the Difference Between the
Statistical Inference for Frequency Data Chapter 16.
© 2010 Pearson Prentice Hall. All rights reserved The Chi-Square Test of Independence.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Hypothesis Testing.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
Chapter 11: Applications of Chi-Square. Chapter Goals Investigate two tests: multinomial experiment, and the contingency table. Compare experimental results.
Chapter 11 Chi-Square Procedures 11.3 Chi-Square Test for Independence; Homogeneity of Proportions.
1 Chi-Square Heibatollah Baghi, and Mastee Badii.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
1 Chapter 17 Statistical Inference For Frequency Data IThree Applications of Pearson’s  2 Testing goodness of fit Testing independence Testing equality.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Other Chi-Square Tests
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter Outline Goodness of Fit test Test of Independence.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 12. The Chi-Square Test.
11.2 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
1 Chi-square Test Dr. T. T. Kachwala. Using the Chi-Square Test 2 The following are the two Applications: 1. Chi square as a test of Independence 2.Chi.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chi Square Tests Chapter 17. Assumptions for Parametrics >Normal distributions >DV is at least scale >Random selection Sometimes other stuff: homogeneity,
Chi Square Test for Goodness of Fit Determining if our sample fits the way it should be.
Statistics 300: Elementary Statistics Section 11-3.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Chi Square Chi square is employed to test the difference between an actual sample and another hypothetical or previously established distribution such.
Basic Statistics The Chi Square Test of Independence.
Test of independence: Contingency Table
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
St. Edward’s University
Chapter 12 Chi-Square Tests and Nonparametric Tests
CHAPTER 11 Inference for Distributions of Categorical Data
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Statistical Analysis Professor Lynne Stokes
Hypothesis Testing Review
Chapter 12 Tests with Qualitative Data
Active Learning Lecture Slides
John Loucks St. Edward’s University . SLIDES . BY.
Qualitative data – tests of association
Statistics for Business and Economics (13e)
The Chi-Square Distribution and Test for Independence
Different Scales, Different Measures of Association
Chi Square Two-way Tables
Econ 3790: Business and Economics Statistics
Chapter 11: Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
Contingency Tables: Independence and Homogeneity
STAT 312 Introduction Z-Tests and Confidence Intervals for a
Chapter 13 – Applications of the Chi-Square Statistic
Inference on Categorical Data
CHI SQUARE TEST OF INDEPENDENCE
Analyzing the Association Between Categorical Variables
The Analysis of Categorical Data and Goodness of Fit Tests
The Analysis of Categorical Data and Goodness of Fit Tests
Inference for Two Way Tables
Chapter Outline Goodness of Fit test Test of Independence.
Presentation transcript:

Statistical Inference Chapter 17 Statistical Inference For Frequency Data I Three Applications of Pearson’s 2  Testing goodness of fit  Testing independence  Testing equality of proportions

A. Testing Goodness of Fit 1. Statistical hypotheses H0: OPop 1 = EPop 1, . . . , OPop k = EPop k H1: OPop j ≠ EPop j for some j and j 2. Randomization Plan  One random sample of n elements  Each element is classified in terms of membership in one of k mutually exclusive categories

B. Testing Independence 1. Statistical hypotheses H0: p(A and B) = p(A)p(B) H1: p(A and B) ≠ p(A)p(B) 2. Randomization Plan  One random sample of n elements  Each element is classified in terms of two variables, denoted by A and B, where each variable has two or more categories.

C. Testing Equality of Proportions 1. Statistical hypotheses H0: p1 = p2 = . . . = pc H1: pj ≠ pj for some j and j 2. Randomization Plan  c random samples, where c ≥ 2  For each sample, elements are classified in terms of membership in one of r = 2 mutually exclusive categories

II Testing Goodness of Fit A. Chi-Square Distribution

B. Pearson’s chi-square statistic 1. Oj and Ej denote, respectively, observed and expected frequencies. k denotes the number of categories. 2. Critical value of chi square is with  = k – 1 degrees of freedom.

1. Is the distribution of grades for summer-school C. Grade-Distribution Example 1. Is the distribution of grades for summer-school students in a statistics class different from that for the fall and spring semesters? Fall and Spring Summer Grade Proportion Obs. frequency A .12 15 B .23 21 C .47 30 D .13 6 F .05 0 1.00 24

2. The statistical hypotheses are H0: OPop 1 = EPop 1, . . . , OPop 5 = EPop 5 H1: OPop j ≠ EPop j for some j and j 3. Pearson’s chi-square statistic is 4. Critical value of chi square for  = .05, k = 5 categories, and  = 5 – 1 = 4 degrees of freedom is

Table 1. Computation of Pearson’s Chi-Square for n = 72 Table 1. Computation of Pearson’s Chi-Square for n = 72 Summer-School Students (1) (2) (3) (4) (5) (6) Grade Oj pj npj = Ej Oj – Ej A 15 .12 72(.12) = 8.6 6.4 4.763 B 21 .23 72(.23) =16.6 4.4 1.166 C 30 .47 72(.47) = 33.8 –3.8 0.427 D 6 .13 72(.13) = 9.4 –-3.4 1.230 F 0 .05 72(.05) = 3.6 –3.6 3.600 72 1.00 72.0 0 2 = 11.186* *p < .025

5. Degrees of freedom when e parameters of a theoretical distribution must be estimated is k – 1 – e. D. Practical Significance 1. Cohen’s w observed and and expected proportions in the jth category.

2. Simpler equivalent formula for Cohen’s 3. Cohen’s guidelines for interpreting w  0.1 is a small effect  0.3 is a medium effect  0.5 is a large effect

1. When  = 1, Yates’ correction can be applied to E. Yates’ Correction 1. When  = 1, Yates’ correction can be applied to make the sampling distribution of the test statistic for Oj – Ej , which is discrete, better approximate the chi-square distribution.

F. Assumptions of the Goodness-of-Fit Test 1. Every observation is assigned to one and only one category. 2. The observations are independent 3. If  = 1, every expected frequency should be at least 10. If  > 1, every expected frequency should be at least 5.

A. Statistical Hypotheses III Testing Independence A. Statistical Hypotheses H0: p(A and B) = p(A)p(B) H1: p(A and B) ≠ p(A)p(B) B. Chi-Square Statistic for an r  c Contingency Table with i = 1, . . . , r Rows and j = 1, . . . , c Columns

C. Computational Example: Is Success on an C. Computational Example: Is Success on an Employment-Test Item Independent of Gender? Observed Expected b1 b2 b1 b2 Fail Pass Fail Pass a1 Man 84 18 102 88.9 13.1 a2 Women 93 8 101 88.1 12.9 177 26 203

p(ai and bj) = p(ai)p(bj) D. Computation of expected frequencies 1. A and B are statistically independent if p(ai and bj) = p(ai)p(bj) 2. Expected frequency, for the cell in row i and column j

Observed Expected b1 b2 b1 b2 a1 84 18 102 88.9 13.1 a2 93 8 101 88.1 12.9 177 26 203    

E. Degrees of Freedom for an r  c Contingency Table df = k – 1 – e = rc – 1 – [(r – 1) + (c – 1)] = rc – 1 – r + 1 – c + 1 = rc – r – c + 1 = (r – 1)(c – 1) = (2 – 1)(2 – 1) = 1

F. Strength of Association and Practical Significance where s is the smaller of the number of rows and columns.

3. For a contingency table, an alternative formula for is

1. Motivation and education of conscientious G. Three-By-Three Contingency Table 1. Motivation and education of conscientious objectors during WWII High Grade College School School Total Coward 12 25 35 72 Partly Coward 19 23 30 72 Not Coward 71 56 24 151 Total 102 104 89 295

2. Strength of Association, Cramér’s 3. Practical significance

H. Assumptions of the Independence Test 1. Every observation is assigned to one and only one cell of the contingency table. 2. The observations are independent 3. If  = 1, every expected frequency should be at least 10. If  > 1, every expected frequency should be at least 5.

IV Testing Equality of c ≥ 2 Proportions A. Statistical Hypotheses H0: p1 = p2 = . . . = pc H1: pj ≠ pj for some j and j 1. Computational example: three samples of n = 100 residents of nursing homes were surveyed. Variable A was age heterogeneity in the home; variable B was resident satisfaction.

Table 2. Nursing Home Data Age Heterogeneity Low b1 Medium b2 High b3 Satisfied a1 O = 56 O = 58 O = 38 E = 50.67 E = 50.67 E = 50.67 Not Satisfied a2 O = 44 O = 42 O = 52 E = 49.33 E = 49.33 E = 49.33

B. Assumptions of the Equality of Proportions Test 1. Every observation is assigned to one and only one cell of the contingency table.

C. Test of Homogeneity of Proportions 2. The observations are independent 3. If  = 1, every expected frequency should be at least 10. If  > 1, every expected frequency should be at least 5. C. Test of Homogeneity of Proportions 1. Extension of the test of equality of proportions when variable A has r > 2 rows

2. Statistical hypotheses for columns j and j'