Lecture 38 Section 14.5 Mon, Dec 4, 2006

Slides:



Advertisements
Similar presentations
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Advertisements

Chapter 11 Inference for Distributions of Categorical Data
CJ 526 Statistical Analysis in Criminal Justice
Chi-Square Test A fundamental problem in genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Copyright © Cengage Learning. All rights reserved. 11 Applications of Chi-Square.
11.4 Hardy-Wineberg Equilibrium. Equation - used to predict genotype frequencies in a population Predicted genotype frequencies are compared with Actual.
Chi-Squared Test.
CJ 526 Statistical Analysis in Criminal Justice
Test of Independence. The chi squared test statistic and test procedure can also be used to investigate association between 2 categorical variables in.
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Education 793 Class Notes Presentation 10 Chi-Square Tests and One-Way ANOVA.
+ Chi Square Test Homogeneity or Independence( Association)
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Test of Homogeneity Lecture 45 Section 14.4 Wed, Apr 19, 2006.
Chi square analysis Just when you thought statistics was over!!
Test of Goodness of Fit Lecture 43 Section 14.1 – 14.3 Fri, Apr 8, 2005.
Test of Independence Lecture 43 Section 14.5 Mon, Apr 23, 2007.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Test of Homogeneity Lecture 45 Section 14.4 Tue, Apr 12, 2005.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Independent Samples: Comparing Means Lecture 39 Section 11.4 Fri, Apr 1, 2005.
Test of Goodness of Fit Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007.
Chi Square Test Dr. Asif Rehman.
Check your understanding: p. 684
The Chi Square Test A statistical method used to determine goodness of fit Chi-square requires no assumptions about the shape of the population distribution.
Political Science 30: Political Inquiry
Chi-Square hypothesis testing
Chapter 12 Analysis of count data.
Presentation 12 Chi-Square test.
Statistical Analysis Chi Square (X2).
Hypothesis Testing Review
Chapter 12 Tests with Qualitative Data
Patterns of inheritance
Data Analysis for Two-Way Tables
Chi-Square Test.
The Analysis of Categorical Data and Chi-Square Procedures
Analysis of count data 1.
Is a persons’ size related to if they were bullied
Consider this table: The Χ2 Test of Independence
Testing for Independence
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chi-Square Test.
Chapter 10 Analyzing the Association Between Categorical Variables
Independent Samples: Comparing Means
Contingency Tables: Independence and Homogeneity
Lecture 36 Section 14.1 – 14.3 Mon, Nov 27, 2006
Chi-square test or c2 test
Chi-Square Test.
Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007
Lecture 42 Section 14.4 Wed, Apr 17, 2007
Lecture 37 Section 14.4 Wed, Nov 29, 2006
Analysis of Frequencies
The 2 (chi-squared) test for independence
Analyzing the Association Between Categorical Variables
Lecture 43 Sections 14.4 – 14.5 Mon, Nov 26, 2007
Testing Hypotheses about a Population Proportion
Assistant prof. Dr. Mayasah A. Sadiq FICMS-FM
Chi-square = 2.85 Chi-square crit = 5.99 Achievement is unrelated to whether or not a child attended preschool.
Chapter 26 Comparing Counts.
Inference for Two Way Tables
Chapter 14.1 Goodness of Fit Test.
Testing Hypotheses about a Population Proportion
Lecture 42 Section 14.3 Mon, Nov 19, 2007
Hypothesis Testing - Chi Square
Inference for Distributions of Categorical Data
Testing Hypotheses about a Population Proportion
Lecture 46 Section 14.5 Wed, Apr 13, 2005
Lecture 43 Section 14.1 – 14.3 Mon, Nov 28, 2005
CHI SQUARE (χ2) Dangerous Curves Ahead!.
What is Chi-Square and its used in Hypothesis? Kinza malik 1.
Presentation transcript:

Lecture 38 Section 14.5 Mon, Dec 4, 2006 Test of Independence Lecture 38 Section 14.5 Mon, Dec 4, 2006

Independence Only one sample is taken. For each subject in the sample, two observations are made (i.e., two variables are measured). We wish to determine whether there is a relationship between the two variables. The two variables are independent if there is no relationship between them.

Mendel’s Experiments In Mendel’s experiments, Mendel observed 75% yellow seeds, 25% green seeds. 75% smooth seeds, 25% wrinkled seeds. Because color and texture were independent, he also observed 9/16 yellow and smooth 3/16 yellow and wrinkled 3/16 green and smooth 1/16 green and wrinkled

Mendel’s Experiments That is, he observed the same ratios within categories that he observed for the totals. Smooth Wrinkled Yellow 9 3 Green 1

Mendel’s Experiments That is, he observed the same ratios within categories that he observed for the totals. Smooth Wrinkled Yellow 9 3 Green 1 3 : 1 Ratio

Mendel’s Experiments That is, he observed the same ratios within categories that he observed for the totals. Smooth Wrinkled Yellow 9 3 Green 1 3 : 1 Ratio

Mendel’s Experiments That is, he observed the same ratios within categories that he observed for the totals. Smooth Wrinkled Yellow 9 3 Green 1 3 : 1 Ratio

Mendel’s Experiments That is, he observed the same ratios within categories that he observed for the totals. Smooth Wrinkled Yellow 9 3 Green 1 3 : 1 Ratio

Mendel’s Experiments Had the traits not been independent, he might have observed something different. Smooth Wrinkled Yellow 10 2 Green

Example Suppose a university researcher suspects that a student’s SAT-M score is related to his performance in Statistics. At the end of the semester, he compares each student’s grade to his SAT-M score for all Statistics classes at that university. He wants to know whether the student’s with the higher SAT-M scores got the higher grades.

Example Does there appear to be a difference between the rows? Or are the rows independent? Grade A B C D F 400 - 500 7 8 16 20 21 500 – 600 13 28 32 22 600 – 700 23 10 9 700 - 800 14 5 SAT-M

The Test of Independence The null hypothesis is that the variables are independent. The alternative hypothesis is that the variables are not independent. H0: The variables are independent. H1: The variables are not independent. Let  = 0.05.

The Test Statistic The test statistic is the chi-square statistic, computed as The question now is, how do we compute the expected counts?

Expected Counts Under the assumption of independence (H0), the rows should exhibit the same proportions. This is the same as when testing for homogeneity. Therefore, we may calculate the expected counts in the same way.

Expected Counts A B C D F 400 - 500 7 (8.64) 8 (17.28) 16 (20.16) 20 (14.40) 21 (11.52) 500 – 600 13 (12.96) 28 (25.92) 32 (30.24) 22 (21.60) 600 – 700 23 10 9 700 - 800 (5.76) 14 (13.44) (9.60) 5 (7.68)

The Test Statistic The value of 2 is 23.7603.

df = (no. of rows – 1)  (no. of cols – 1). Degrees of Freedom The degrees of freedom are the same as before df = (no. of rows – 1)  (no. of cols – 1). In our example, df = (4 – 1)  (5 – 1) = 12.

The p-value To find the p-value, calculate 2cdf(23.7603, E99, 12) = 0.0219. The results are significant at the 5% level.

TI-83 – Test of Independence The test for independence on the TI-83 is identical to the test for homogeneity.

Example Admissions figures for the School of Arts and Sciences. Acceptance Status Accepted Not Accepted Race Female 50 150 Male 500 1000

Example Admissions figures for the Business School. Acceptance Status Accepted Not Accepted Race Female 850 1500 Male 150 200

Example Admissions figures for the two schools combined. Acceptance Status Accepted Not Accepted Race Female 900 1650 Male 650 1200

Practice This is called Simpson’s paradox. It occurs whenever the aggregate population shows a different relationship than in the subpopulations.