The mystery of the CHI SQUARE Is it CHEE square Or CHAI Square?!

Slides:

Advertisements

Similar presentations

Chapter 16 Goodness-of-Fit Tests and Contingency Tables

Advertisements

Chi-Square and Analysis of Variance (ANOVA)

Lesson Test for Goodness of Fit One-Way Tables.

Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.

1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Goodness-of-Fit Tests.

CHAPTER 23: Two Categorical Variables: The Chi-Square Test

Chi Square Procedures Chapter 11.

Chapter 11 Inference for Distributions of Categorical Data

Chapter 10 Chi-Square Tests and the F- Distribution 1 Larson/Farber 4th ed.

Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.

Chapter 13: Inference for Tables – Chi-Square Procedures

Analysis of Count Data Chapter 26

13.1 Goodness of Fit Test AP Statistics. Chi-Square Distributions The chi-square distributions are a family of distributions that take on only positive.

Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.

Chapter 11: Inference for Distributions of Categorical Data.

Chapter 11: Inference for Distributions of Categorical Data

Chapter 11 Inference for Tables: Chi-Square Procedures 11.1 Target Goal:I can compute expected counts, conditional distributions, and contributions to.

Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.

Chapter 12: The Analysis of Categorical Data and Goodness- of-Fit Test.

GOODNESS OF FIT Larson/Farber 4th ed 1 Section 10.1.

+ Chi Square Test Homogeneity or Independence( Association)

The Practice of Statistics Third Edition Chapter (13.1) 14.1: Chi-square Test for Goodness of Fit Copyright © 2008 by W. H. Freeman & Company Daniel S.

Chapter 13 Inference for Tables: Chi-Square Procedures AP Statistics 13 – Chi-Square Tests.

Non-parametric tests (chi-square test) Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.

+ Chapter 11 Inference for Distributions of Categorical Data 11.1Chi-Square Goodness-of-Fit Tests 11.2Inference for Relationships.

Chi-Square Test (χ 2 ) χ – greek symbol “chi”. Chi-Square Test (χ 2 ) When is the Chi-Square Test used? The chi-square test is used to determine whether.

The Practice of Statistics Third Edition Chapter 14: Inference for Distributions of Categorical Variables: Chi-Square Procedures Copyright © 2008 by W.

+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.

Chapter 14 Inference for Distribution of Categorical Variables: Chi-Squared Procedures.

The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.

Inference for Tables: Chi-Squares procedures (2 more chapters to go!)

Check your understanding: p. 684

The Chi Square Test A statistical method used to determine goodness of fit Chi-square requires no assumptions about the shape of the population distribution.

Chi-square test or c2 test

Chapter 11: Inference for Distributions of Categorical Data

Chapter 12 Tests with Qualitative Data

Chapter 11: Inference for Distributions of Categorical Data

Elementary Statistics: Picturing The World

The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.

The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.

AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…

Chapter 11: Inference for Distributions of Categorical Data

Chapter 11: Inference for Distributions of Categorical Data

Chapter 10 Analyzing the Association Between Categorical Variables

Inference for Relationships

The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.

Chapter 13 Inference for Tables: Chi-Square Procedures

Analyzing the Association Between Categorical Variables

Chapter 11: Inference for Distributions of Categorical Data

Chapter 13: Inference for Distributions of Categorical Data

Chapter 11: Inference for Distributions of Categorical Data

Chapter 11: Inference for Distributions of Categorical Data

Chapter 11: Inference for Distributions of Categorical Data

Chapter 11: Inference for Distributions of Categorical Data

Chapter 11: Inference for Distributions of Categorical Data

Chapter 11: Inference for Distributions of Categorical Data

Chapter 11: Inference for Distributions of Categorical Data

UNIT V CHISQUARE DISTRIBUTION

Chapter 11: Inference for Distributions of Categorical Data

S.M.JOSHI COLLEGE, HADAPSAR

Inference for Distributions of Categorical Data

Chapter 14.1 Goodness of Fit Test.

Chapter 11: Inference for Distributions of Categorical Data

Chapter 11: Inference for Distributions of Categorical Data

Chapter 13: Chi-Square Procedures

Chapter 11: Inference for Distributions of Categorical Data

Inference for Distributions of Categorical Data

Chapter 11: Inference for Distributions of Categorical Data

Presentation transcript:

The mystery of the CHI SQUARE Is it CHEE square Or CHAI Square?!

Chi Square X 2 goodness of fit There is a single test that can be applied to see if the observed sample distribution is significantly different in some way from the hypothesized population distribution

Accidents on Cellphones Are you more likely to have a motor vehicle collision when using a cell phone? A study of 699 drivers who were using a cell phone when they were involved in a collision examined this question. These drivers made 26,798 cell phone calls during a 14-month study period. Each of the 699 collisions was classified in various ways. Here are the counts for each day of the week:

Hypotheses: H 0 : Motor vehicle accidents involving cell phone use are equally likely to occur on each of the seven days of the week. H a : The probabilities of a motor vehicle accident involving cell phone use vary from day to day (that is, they are not all the same).

Chi square procedure: In general, the expected count for any categorical variable is obtained by multiplying the proportion of the distribution for each category by the sample size.

Chi-square test statistics For Sunday: For Monday:

Finding the p-value Degrees of freedom: n-1 df: 7-1 = 6 Calculator syntax: 2nd - VARS - 8 (enter) X 2 cdf( test statistic, 1E99, df ) X 2 cdf( , 1E99, 6 ) p= 2.48 x

Conclusion Since the p value is extremely small ( p= 2.48 x ), there is sufficient evidence to reject H 0 and conclude that these types of accidents are not equally likely to occur on each of the seven days of the week. H 0 : Motor vehicle accidents involving cell phone use are equally likely to occur on each of the seven days of the week. H a : The probabilities of a motor vehicle accident involving cell phone use vary from day to day (that is, they are not all the same).

Red Eye Fruit Fly Any offspring receiving an R gene will have red eyes, and any offspring receiving a C gene will have straight wings. So based on this Punnett square, the biologists predict a ratio of 9 red-eyed, straight-winged (x) : 3 red- eyed, curly-winged (y) : 3 white-eyed, straight-winged (z) : 1 white-eyed, curly-winged (w) offspring. To test their hypothesis about the distribution of offspring, the biologists mate the fruit flies. Of 200 offspring, 99 had red eyes and straight wings, 42 had red eyes and curly wings, 49 had white eyes and straight wings, and 10 had white eyes and curly wings. Do these data differ significantly from what the biologists have predicted?

Given Distribution parents proportio n offspring s Red-eyed, straight-winged Red-eyed, curly-winged White-eyed, straight-winged White-eyed, curly-winged: total H o : these proportions is correct for the the offspring of 2 parents H a : at least one of these proportions is incorrect

Conditions and calculations: We can use a chi-square goodness of fit test to measure the strength of the evidence against the hypothesized distribution, provided that the expected cell counts are large enough. SampleproportionObservedExpected Red-eyed, straight-winged (200)(0.5625) = Red-eyed, curly-winged (200)(0.1875) = 37.5 White-eyed, straight-winged (200)(0.1875) = 37.5 White-eyed, curly-winged: (200)(0.0625) = 12.5 total X 2 cdf(6.187, 1E99, 3 ) p=0.1029

Interpretations The P-value of indicates that the probability of obtaining a sample of 200 fruit fly offspring in which the proportions differ from the hypothesized values by at least as much as the ones in our sample is over 10%, assuming that the null hypothesis is true. This is not sufficient evidence to reject the biologists' predicted distribution.

Your Turn Course grades Most students in a large college statistics course are taught by teaching assistants (TAs). One section is taught by the course supervisor, a full- time professor. The distribution of grades for the hundreds of students taught by TAs this semester was The grades assigned by the professor to the 91 students in his section were (a) What percents of students in the professor's section earned A, B, C, and D/F? In what ways does this distribution of grades differ from the TA distribution? (b) Because the TA distribution is based on hundreds of students, we are willing to regard it as a fixed probability distribution. If the professor's grading follows this distribution, what are the expected counts of each grade in his section? (c) Does the chi-square test for goodness of fit give good evidence that the professor's grade distribution differs from the TA distributions? Use the Inference Toolbox.

Answers: (a) A : 24.2%, B : 41.8%, C : 22.0%, D/F : 12.1%. Fewer A s and more D/F s than the TA sections. (b) A : 29.12, B : 37.31, C : 18.20, D/F : (c) H 0 : p 1 = 0.32, p 1 = 0.41, p 1 = 0.20, p 1 = 0.07 vs. H a : at least one of these proportions is different. All the expected counts are greater than 5, so the condition for X 2 is satisfied. X 2 = (df = 3), so the P–value = ; there is not enough evidence to conclude that the professor s grade distribution was different from the TA grade distribution.

Chi-Sq. Practice (with probability model) Thai, the manager of a car dealership, did not want to stock cars that were bought less frequently because of their unpopular color. The five colors that he ordered were red, yellow, green, blue, and white. According to Thai,the expected frequencies or number of customers choosing each color should follow the percentages of last year. She felt 20% would choose yellow, 30% would choose red, 10% would choose green, 10% would choose blue, and 30% would choose white. She now took a random sample of 150 customers and asked them their color preferences.

Hypotheses: Ho: there is no significant difference between the proportion of the costumers car color preferences. Ho: p1 = p2 = p3 = p4 = p5 Ha: there is a significant difference between the proportion of the costumers car color preferences. Ha: p1 p2 p3 p4 p5

Chi-square procedure: X 2 = P-value = 2.03x10 -5

Conclusion Since our p-value is small, we have sufficient reason to reject the null hypothesis making our test significant. Therefore, there is a significant difference between the proportion of the costumers car color preferences.