Chi-Squared Hypothesis Testing Using One-Way and Two-Way Frequency Tables of Categorical Variables.

Slides:



Advertisements
Similar presentations
Chi-square test Chi-square test or  2 test. Chi-square test countsUsed to test the counts of categorical data ThreeThree types –Goodness of fit (univariate)
Advertisements

 2 test for independence Used with categorical, bivariate data from ONE sample Used to see if the two categorical variables are associated (dependent)
Chapter 12 Goodness-of-Fit Tests and Contingency Analysis
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
CHAPTER 23: Two Categorical Variables: The Chi-Square Test
Chapter 13: Inference for Distributions of Categorical Data
Copyright ©2011 Brooks/Cole, Cengage Learning More about Inference for Categorical Variables Chapter 15 1.
© 2010 Pearson Prentice Hall. All rights reserved The Chi-Square Test of Independence.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
CHAPTER 11 Inference for Distributions of Categorical Data
11-2 Goodness-of-Fit In this section, we consider sample data consisting of observed frequency counts arranged in a single row or column (called a one-way.
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
Presentation 12 Chi-Square test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Test of Independence.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics, A First Course 4 th Edition.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
Chapter 11 Chi-Square Procedures 11.3 Chi-Square Test for Independence; Homogeneity of Proportions.
Chi-square test or c2 test
Chi-square test Chi-square test or  2 test Notes: Page Goodness of Fit 2.Independence 3.Homogeneity.
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Chapter 26 Chi-Square Testing
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Chi-square test. Data Statistics Continuous measurement: Interval Mean Categorical measurement: NominalFrequency, count, proportion One-way chi-square.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Other Chi-Square Tests
Chapter 12: The Analysis of Categorical Data and Goodness- of-Fit Test.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
FPP 28 Chi-square test. More types of inference for nominal variables Nominal data is categorical with more than two categories Compare observed frequencies.
13.2 Chi-Square Test for Homogeneity & Independence AP Statistics.
+ Chi Square Test Homogeneity or Independence( Association)
Chapter 14: Chi-Square Procedures – Test for Goodness of Fit.
Chi-Square Test James A. Pershing, Ph.D. Indiana University.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
AP Statistics Section 14.. The main objective of Chapter 14 is to test claims about qualitative data consisting of frequency counts for different categories.
Copyright © 2010 Pearson Education, Inc. Slide
Comparing Counts.  A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter Outline Goodness of Fit test Test of Independence.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
11.2 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 12 The Analysis of Categorical Data and Goodness of Fit Tests.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Lesson 12 - R Chapter 12 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Chi-square test Chi-square test or  2 test. Chi-square test countsUsed to test the counts of categorical data ThreeThree types –Goodness of fit (univariate)
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
Chi-Squared Test of Homogeneity Are different populations the same across some characteristic?
11.1 Chi-Square Tests for Goodness of Fit Objectives SWBAT: STATE appropriate hypotheses and COMPUTE expected counts for a chi- square test for goodness.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
Goodness-of-Fit and Contingency Tables Chapter 11.
Chi Square Procedures Chapter 14. Chi-Square Goodness-of-Fit Tests Section 14.1.
Chapter 12 Lesson 12.2b Comparing Two Populations or Treatments 12.2: Test for Homogeneity and Independence in a Two-way Table.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Test of Goodness of Fit Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007.
Presentation 12 Chi-Square test.
Chapter 11 Chi-Square Tests.
Qualitative data – tests of association
Chapter 10 Analyzing the Association Between Categorical Variables
Chi-square test or c2 test
Chapter 11 Chi-Square Tests.
Lesson 11 - R Chapter 11 Review:
Chapter 11 Chi-Square Tests.
Presentation transcript:

Chi-Squared Hypothesis Testing Using One-Way and Two-Way Frequency Tables of Categorical Variables

 2 Hypothesis Test Goodness-of-Fit Independence Homogeneity

Analyzing an Exam Question How does a teacher determine if students were “clueless” on an exam question vs. students were unprepared for that particular exam question?

Goodness-of-Fit Test If you need to test whether populations are distributed evenly (or “preset” proportions), then use Goodness-of-Fit test. 1.This requires a one-way frequency (count) table. 2.Random sample is required for counts. 3.Expected cell counts greater than 5. What’s an expected cell count?

Expected Cell Count? Suppose 300 students answered a multiple choice question with the following distribution. Did the students randomly select answers (I.e. are the answers equally distributed)? The expected cell count for A is 300(1/5) = 60. As the same is true for B thru E. If we assume the answers are equally distributed (null hypothesis), then we “share” the 300 responses equally. ABCDE

Observed vs. Expected The observed values are the actual sampled counts (occurrences). The expected values are the hypothesized outcomes based on the null hypothesis. In this example, we are assuming the each answer was equally selected by students. ABCDE Observed Expected60

 2 Statistic The computer (or calculator) will calculate the chi-squared statistic for you, and determine the degrees of freedom and p-value. What is degrees of freedom?

Chi-Squared Statistic and p-value  2 = 6.5, df = 4, P(  2 > 6.5) =.16479

 2 Statistic H o :  A =  B =  C =  D =  E H a : at least one  is different  2 = 12.7, df = 4, P(  2 > 12.7) =.0128 ABCDE Observed Expected60

Goodness-of-Fit Test What if the hypothesized proportions were not all the same? Example: Does the color of your car influence the chance it will be stolen? Suppose it is known that all cars in the world consist of 15% white, 30% black, 35% red, 15% blue, and 5% other colors.

Color of Stolen Car H o :  W =.15,  B =.30,  R =.35,  U =.30,  E =.05 H a : at least one  is different WhiteBlackRedBlueOther Obsv Expect  2 = 66.33, df = 4, P(  2 > 66.33) = 1.3x10 -13

Two-Way Tables Homogeneity—tests for equal category proportions for all populations (because separate random samples were used to collect information). Independence—tests for an independence (no association) between 2 categorical variables. Don’t worry; same test!

College Students’ Drinking Levels The data on drinking behavior for independently chosen random samples of male and female students was collected. Does there appear to be a gender difference with respect to drinking behavior?

Homogeneity Test Gender DrinkingMenWomen None (158.6)(167.4) Low (554.0)(585.0) Moderate (230.1)(242.9) High6316 (38.4)(40.6)

College Students’ Drinking Levels H o : True proportions for the 4 drinking levels are the same for males and females. H a : At least one true proportion is different.  2 = 96.53, df = (4 – 1)(2 – 1) = 3 P(  2 > 96.53) = 8.68 x Reject H o ; data indicates that males and females differ with respect to drinking levels.

Sexual Risk-Taking Factors Among Adolescents Each person in a random sample of sexually active teens was classified according to gender and contraceptive use. Is there a relationship between gender and contraceptive use by sexually active teens?

Independent (No Association) Test Gender Contraceptive Use FemaleMale Rarely/Never (224)(336) Sometimes/ Most Times (204)(306) Always (372)(558)

Sexual Risk-Taking Factors Among Adolescents H o : Gender and contraceptive use have no association (independent). H a : Gender and contraceptive use have an association (dependent).  2 = 6.572, df = (3 – 1)(2 – 1) = 2 P(  2 > 6.572) =.035 Reject H o and conclude there is an association between gender and contraceptive use.

Expected (Cell) Count for Two-Way Tables

Conditions (Requirements) for  2 Test with 2-Way Tables 1)Random Sample 2)At least 80% of Expected Cell Counts are greater than 5. 3)All Expected Cell Counts and Observed values are greater than or equal to 1.

Titanic Moviemakers of Titanic imply that lower- class passengers were treated unfairly. Was that accurate?

Likelihood of Survival on Titanic? H o :  C = 109/1318,  W = 402/1318,  M = 807/1318 H a : at least one  is different  2 = , df = 2, P(  2 > ) = Reject H o and conclude at least one proportion is different. ChildrenWomenMen Observed Expected