Lecture 43 Sections 14.4 – 14.5 Mon, Nov 26, 2007

Slides:



Advertisements
Similar presentations
Chapter 11 Other Chi-Squared Tests
Advertisements

General Genetic Bio 221 Lab 6. Law of Independent Assortment (The "Second Law") The Law of Independent Assortment, also known as "Inheritance Law", states.
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.7.
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Chapter 26 Chi-Square Testing
Education 793 Class Notes Presentation 10 Chi-Square Tests and One-Way ANOVA.
Chapter 14: Chi-Square Procedures – Test for Goodness of Fit.
Test of Homogeneity Lecture 45 Section 14.4 Wed, Apr 19, 2006.
Chi square analysis Just when you thought statistics was over!!
Test of Goodness of Fit Lecture 43 Section 14.1 – 14.3 Fri, Apr 8, 2005.
Test of Independence Lecture 43 Section 14.5 Mon, Apr 23, 2007.
11.2 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Testing Hypotheses about a Population Proportion Lecture 31 Sections 9.1 – 9.3 Wed, Mar 22, 2006.
Test of Homogeneity Lecture 45 Section 14.4 Tue, Apr 12, 2005.
Statistics 300: Elementary Statistics Section 11-3.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
The  2 (chi-squared) test for independence. One way of finding out is to perform a  2 (chi-squared) test for independence. We might want to find out.
Independent Samples: Comparing Means Lecture 39 Section 11.4 Fri, Apr 1, 2005.
Test of Goodness of Fit Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007.
Test of independence: Contingency Table
Testing Hypotheses about a Population Proportion
Testing a Claim About a Mean:  Not Known
Chapter 12 Tests with Qualitative Data
Community &family medicine
Patterns of inheritance
1) A bicycle safety organization claims that fatal bicycle accidents are uniformly distributed throughout the week. The table shows the day of the week.
Data Analysis for Two-Way Tables
The Analysis of Categorical Data and Chi-Square Procedures
Analysis of count data 1.
Consider this table: The Χ2 Test of Independence
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Day 67 Agenda: Submit THQ #6 Answers.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
X 2 (Chi-Square) ·. one of the most versatile. statistics there is ·
Are you planning on doing a project in Math 124
Chapter 10 Analyzing the Association Between Categorical Variables
Independent Samples: Comparing Means
Contingency Tables: Independence and Homogeneity
Lecture 36 Section 14.1 – 14.3 Mon, Nov 27, 2006
Chi-square test or c2 test
The Analysis of Categorical Data and Goodness of Fit Tests
Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007
Making Decisions about a Population Mean with Confidence
Lecture 42 Section 14.4 Wed, Apr 17, 2007
Lecture 37 Section 14.4 Wed, Nov 29, 2006
Lecture 38 Section 14.5 Mon, Dec 4, 2006
The 2 (chi-squared) test for independence
The Analysis of Categorical Data and Goodness of Fit Tests
Testing Hypotheses about a Population Proportion
Chapter 13: Inference for Distributions of Categorical Data
The Analysis of Categorical Data and Goodness of Fit Tests
Chapter 26 Comparing Counts.
Independent Samples: Comparing Proportions
Inference for Two Way Tables
The Analysis of Categorical Data and Goodness of Fit Tests
Testing Hypotheses about a Population Proportion
SENIORS: Final transcript request must be made by Friday.
Testing Hypotheses about a Population Proportion
Independent Samples: Comparing Means
Lecture 42 Section 14.3 Mon, Nov 19, 2007
Inference for Distributions of Categorical Data
Chi Square Test of Homogeneity
Testing Hypotheses about a Population Proportion
Lecture 46 Section 14.5 Wed, Apr 13, 2005
Lecture 43 Section 14.1 – 14.3 Mon, Nov 28, 2005
Presentation transcript:

Lecture 43 Sections 14.4 – 14.5 Mon, Nov 26, 2007 Test of Homogeneity Lecture 43 Sections 14.4 – 14.5 Mon, Nov 26, 2007

Homogeneous Populations Two distributions are called homogeneous if they exhibit the same proportions within the same categories.

Examples If two colleges’ student bodies are each 55% female and 45% male, then the distributions are homogeneous. If a teacher teaches two sections of Statistics using two different teaching methods and the two sections have the same grade distribution, then they are homogeneous.

Example Suppose a teacher teaches two sections of Statistics and uses two different teaching methods. At the end of the semester, he gives both sections the same final exam and he compares the grade distributions. He wants to know if the differences that he observes are significant.

Example Does there appear to be a difference? Or are the two populations homogeneous? A B C D F Method I 5 7 36 17 Method II 11 18

The Test of Homogeneity The null hypothesis is that the populations are homogeneous. The alternative hypothesis is that the populations are not homogeneous. H0: The populations are homogeneous. H1: The populations are not homogeneous.

Independence Two variables are independent if the value of one has no bearing on the value of the other.

Mendel’s Experiments In Mendel’s experiments, Mendel observed 75% yellow seeds, 25% green seeds. 75% smooth seeds, 25% wrinkled seeds. Because color and texture were independent, he also observed 9/16 yellow and smooth 3/16 yellow and wrinkled 3/16 green and smooth 1/16 green and wrinkled

Mendel’s Experiments That is, he observed the same ratios within categories that he observed for the totals. Smooth Wrinkled Yellow 9 3 Green 1

Mendel’s Experiments That is, he observed the same ratios within categories that he observed for the totals. Smooth Wrinkled Yellow 9 3 Green 1 3 : 1 Ratio

Mendel’s Experiments That is, he observed the same ratios within categories that he observed for the totals. Smooth Wrinkled Yellow 9 3 Green 1 3 : 1 Ratio

Mendel’s Experiments That is, he observed the same ratios within categories that he observed for the totals. Smooth Wrinkled Yellow 9 3 Green 1 3 : 1 Ratio

Mendel’s Experiments That is, he observed the same ratios within categories that he observed for the totals. Smooth Wrinkled Yellow 9 3 Green 1 3 : 1 Ratio

Mendel’s Experiments Had the traits not been independent, he might have observed something different. Smooth Wrinkled Yellow 11 1 Green 3

Example Suppose a university researcher suspects that a student’s SAT-M score is related to his performance in Statistics. At the end of the semester, he compares each student’s grade to his SAT-M score for all Statistics classes at that university. He wants to know whether the student’s with the higher SAT-M scores got the higher grades.

Example Does there appear to be a difference between the rows? Or are the rows independent? Grade A B C D F 400 – 500 7 8 16 20 21 500 – 600 13 28 32 22 600 – 700 23 10 9 700 – 800 14 5 SAT-M

The Test of Independence The null hypothesis is that the variables are independent. The alternative hypothesis is that the variables are not independent. H0: The variables are independent. H1: The variables are not independent.

An Example of Homogeneity We will work the homogeneity example of the two teaching methods. A B C D F Method I 5 7 36 17 Method II 11 18

The Test Statistic The test statistic is the chi-square statistic, computed as The question now is, how do we compute the expected counts?

Expected Counts Under the assumption of homogeneity or independence (H0), the rows should exhibit the same proportions. We can get the best estimate of those proportions by pooling the rows. That is, find the column totals then compute the column proportions from them.

Row and Column Proportions B C D F Method I 5 7 36 17 Method II 11 18

Row and Column Proportions B C D F Method I 5 7 36 17 Method II 11 18 Col Total 12 54 24

Row and Column Proportions B C D F Method I 5 7 36 17 Method II 11 18 Col Total 12 54 24 10% 15% 45% 20%

Expected Counts Similarly, the columns should exhibit the same proportions, so we can get the best estimate by pooling the columns. That is, find the row totals and then compute the row proportions from them.

Row and Column Proportions B C D F Method I 5 7 36 17 Method II 11 18 Col Total 12 54 24 10% 15% 45% 20%

Row and Column Proportions B C D F Row Total Method I 5 7 36 17 72 Method II 11 18 48 Col Total 12 54 24 10% 15% 45% 20%

Row and Column Proportions B C D F Row Total Method I 5 7 36 17 72 60% Method II 11 18 48 40% Col Total 12 54 24 10% 15% 45% 20%

Row and Column Proportions B C D F Row Total Method I 5 7 36 17 72 60% Method II 11 18 48 40% Col Total 12 54 24 120 10% 15% 45% 20% Grand Total

Row and Column Totals A B C D F Total Method I 5 7 36 17 Method II 11 18 12 54 24 120

Expected Counts Now apply the appropriate row and column proportions to each cell to get the expected count. In the upper-left cell, according to the row and column proportions, it should contain 60% of 10% of 120.

Expected Counts That is, the expected count is 0.60 0.10  120 = 7.2 This is

Expected Counts Therefore, the quick formula is

Expected Counts Apply that formula to each cell to find the expected counts and add them to the table. A B C D F Method I 5 (7.2) 7 (10.8) 36 (32.4) 17 (14.4) Method II (4.8) 11 18 (21.6) (9.6)

The Test Statistic Now compute 2 in the usual way.

df = (no. of rows – 1)  (no. of cols – 1). Degrees of Freedom The number of degrees of freedom is df = (no. of rows – 1)  (no. of cols – 1). In our example, df = (2 – 1)  (5 – 1) = 4. To find the p-value, calculate 2cdf(7.2106, E99, 4) = 0.1252. At the 5% level of significance, the differences are not statistically significant.

TI-83 – Test of Homogeneity or Independence The tables in these examples are not lists, so we can’t use the lists in the TI-83. Instead, the tables are matrices. The TI-83 can handle matrices.

TI-83 – Test of Homogeneity or Independence Enter the observed counts into a matrix. Press MATRIX. Select EDIT. Use the arrow keys to select the matrix to edit, say [A]. Press ENTER to edit that matrix. Enter the number of rows and columns. (Press ENTER to advance.) Enter the observed counts in the cells. Press 2nd Quit to exit the matrix editor.

TI-83 – Test of Homogeneity or Independence Perform the test of homogeneity. Select STATS > TESTS > 2-Test… Press ENTER. Enter the name of the matrix of observed counts. Enter the name (e.g., [E]) of a matrix for the expected counts. These will be computed for you by the TI-83. Select Calculate.

TI-83 – Test of Homogeneity or Independence The window displays The title “2-Test”. The value of 2. The p-value. The number of degrees of freedom. See the matrix of expected counts. Press MATRIX. Select matrix [E]. Press ENTER.

TI-83 – Example Work the previous example, using the TI-83.

Example Is the Math SAT score independent from the grade in Statistics? Grade A B C D F 400 - 500 7 8 16 20 21 500 – 600 13 28 32 22 600 – 700 23 10 9 700 - 800 14 5 SAT-M

Expected Counts A B C D F 400 - 500 7 (8.64) 8 (17.28) 16 (20.16) 20 (14.40) 21 (11.52) 500 – 600 13 (12.96) 28 (25.92) 32 (30.24) 22 (21.60) 600 – 700 23 10 9 700 - 800 (5.76) 14 (13.44) (9.60) 5 (7.68)

The Test Statistic The value of 2 is 23.7603.

df = (no. of rows – 1)  (no. of cols – 1). Degrees of Freedom The degrees of freedom are the same as before df = (no. of rows – 1)  (no. of cols – 1). In our example, df = (4 – 1)  (5 – 1) = 12.

The p-value To find the p-value, calculate 2cdf(23.7603, E99, 12) = 0.0219. The results are significant at the 5% level.

TI-83 – Test of Independence The test for independence on the TI-83 is identical to the test for homogeneity.