Download presentation
Presentation is loading. Please wait.
1
Associations between Categorical Variables
Module 31 & Module 32: Chi-Square Procedures
2
Three tests... Two procedures...
Chi-Square Test of Goodness of Fit one-way tables, 1 variable Chi-Square Test for Homogeneity Two-way table, 2 categorical variables; 2 populations Chi-Square Test of Association/Independence Two-way table, 2 categorical variables, 1 population; same exact procedure as Homogeneity
3
M&M Mars Company claims...
that you will receive, in each bag of milk chocolate candies M&M’s ... 13% red, 13% brown, 20% orange, 24% blue, 16% green, 14% yellow What is one way we can we test this claim? Discuss for 1 minute...
4
Claim: Customers will receive 13% red, 13% brown, 20% orange, 24% blue, 16% green, 14% yellow
We could conduct a one-proportion z procedure/ hypothesis test for each of the following: Ho: pBrown = Ha: pBrown ≠ 0.13 Ho: pRed = Ha: pRed ≠ 0.13 Ho: pOrange = Ha: pOrange ≠ 0.20 Ho: pBlue = Ha: pBlue ≠ 0.24 Ho: pGreen = Ha: pGreen ≠ 0.14 Ho: pYellow = Ha: pYellow ≠ 0.14 Lots of work, time-consuming...
5
Claim: Customers will receive 13% red, 13% brown, 20% orange, 24% blue, 16% green, 14% yellow
We could conduct a one-proportion z procedure/ hypothesis test six times... Lots of work, time- consuming... But more importantly, this process would not tell us how likely it is that six sample proportions differ from the claim made by M&M’s Mars. So, we have a single test to determine if our observed sample distribution is significantly different in some way from the hypothesized population distribution of all M&M colors
6
Chi-Square (X2) Test for Goodness of Fit
Are you more likely to have a motor vehicle collision when using a cell phone on a given day of the week? A study of 699 randomly-selected drivers who were using a cell phone when they were involved in a collision examined this question. These drivers made 26,798 cell phone calls during a 14-month study period. Each of the 699 collisions was classified in various ways. Here are the counts for each day of the week: Our first of 3 tests; goodness of fit.
7
Are accidents equally likely to occur on any day of the week?
Observation: In this sample, #’s are not equal... But is this just due to sampling variability OR are all accidents like these truly not equally likely to occur on any day of the week? To answer this question, we must conduct an hypothesis test; Chi-Square Good of Fit test
8
SIDE NOTE... In this situation, the question is “Are all accidents like these equally likely to occur on any day of the week?” Versus In the M&M situation, the question was “Will we receive, in each/all bags of milk chocolate candies M&M’s, 13% red, 13% brown, 20% orange, 24% blue, 16% green, 14% yellow ? Look at claim or question for what we are testing
9
OR Psunday = pmonday = ptuesday = ... = psaturday = 1/7
Ho: Motor vehicle accidents involving cell phone use are equally likely to occur on each of 7 days of the week OR Psunday = pmonday = ptuesday = ... = psaturday = 1/7 Ha: The probability of a motor vehicle accident involving cell phone use vary from day-to-day (not all the same; at least one day is different from another). OR At least one of the proportions differs from the stated value(s) Look at claim or question for what we are testing to get your null hypothesis. Remember null is always =, status quo, neutral
10
Ho: Psunday = pmonday = ptuesday = ... = psaturday = 1/7
Ha: At least one of the proportions differs from the stated value(s) GENERAL IDEA: Goodness of Fit Test compares observed (our sample) versus what we expect (our null hypothesis, Ho) Does our sample support our null hypothesis or does it support our alternative hypothesis? Look at claim or question for what we are testing
11
Conditions: Random sample. Independent. “Large Sample.”
Ho: Psunday = pmonday = ptuesday = ... = psaturday = 1/7 Ha: At least one of the proportions differs from the stated value(s) Conditions: Random sample. Independent. “Large Sample.” Randomly selected – stated in problem Independent – population is at least 10 times sample size; one individual does not influence any other “Large Sample” – this is all about expected counts, not the actual sample counts; all individual expected counts must be at least 5 Conditions for chi square procedures are a bit different from other procedures we have done (what else have we done? Review)
12
699 total accidents that given week
Ho: Psunday = pmonday = ptuesday = ... = psaturday = 1/7 Ha: At least one of the proportions differs from the stated value(s) We have observed count data (our sample data above), now we need our expected count data 699 total accidents that given week Expected accidents each day (if Ho is true) = 699 ÷ 7 = each day (Ho; null hypothesis; 1/7 of total each day) Note; observed will always be in whole #s; never fractional; however expected MAY be fractional; the expected is key to checking our conditions. That’s why we are focussing on it.
13
Ho: Psunday = pmonday = ptuesday = ... = psaturday = 1/7
Ha: At least one of the proportions differs from the stated value(s) Expected accidents each day = 699 ÷ 7 = each day (Ho; null hypothesis; 1/7 of total each day) NOTE: Observed is count data; no such thing as 3-1/2 accidents on a given day; either 3 or 4; BUT the EXPECTED accidents CAN be a decimal/fraction.
14
Ho: Psunday = pmonday = ptuesday = ... = psaturday = 1/7
Ha: At least one of the proportions differs from the stated value(s) Expected = each day; therefore, all individual expected counts are at least 5; ✔
15
degrees of freedom = k – 1 (where k is # of categories)
Ho: Psunday = pmonday = ptuesday = ... = psaturday = 1/7 Ha: At least one of the proportions differs from the stated value(s) Chi-Square Goodness of Fit procedure has degrees of freedom (like t procedures) degrees of freedom = k – 1 (where k is # of categories) Df will be automatically calculated in Minitab
16
A little X2 detour... properties of X2
Distribution is skewed to the right; area under density curve still = 1 Degrees of freedom = k – 1 As df increase (as k increases), density curve becomes less right skewed How likely/unlikely is it that this statistic (or more extreme a statistic) would occur just by chance IF our null hypothesis is really true? i.e., p-value p-value calculation: use StatCrunch “Rejection zone” still same idea Default α still 5% (but could be 1% or 10%)
17
Ho: Psunday = pmonday = ptuesday = ... = psaturday = 1/7
Ha: At least one of the proportions differs from the stated value(s) Remember, expected accidents each day = 699 ÷ 7 = each day (Ho; null hypothesis; 1/7 of total each day); we expect about accidents per day IF the null hypothesis is true. Need a method to measure how well the observed counts (“O”) fit or match the expected counts (“E”); i.e., are the differences between observed and expected just due to sampling variability or is Ho incorrect?
18
Ho: Psunday = pmonday = ptuesday = ... = psaturday = 1/7
Ha: At least one of the proportions differs from the stated value(s) Need a method to measure how well the observed counts (“O”) fit or match the expected counts (“E”); i.e., are the differences between observed and expected just due to sampling variability or is Ho incorrect? X2 = Chi-Square = Expected accidents each day = 699 ÷ 7 = each day (Ho; null hypothesis; 1/7 of total each day); Minitab does this for us. Tables, goodness of fit,
19
Ho:. Psunday = pmonday = ptuesday =. = psaturday = 1/7 Ha:
Ho: Psunday = pmonday = ptuesday = ... = psaturday = 1/7 Ha: At least one of the proportions differs from the stated value(s) Calculations: StatCrunch, stat, goodness of fit, chi-square test X2 = df = k – 1 = 7 – 1 = 6 p-value ≈ 0
20
Ho: Psunday = pmonday = ptuesday = ... = psaturday = 1/7
Ha: At least one of the proportions differs from the stated value(s) Interpretation: Reject Ho. As p-value ≈ 0, which is less than any reasonable α, we conclude that all accidents like these are not equally likely to occur on each of the seven days of the week (at least one of the proportions of all accidents differs from the stated values). Notice we still did all 4 steps like we have been doing all along with our inference procedures.
21
Saving birds from windows...
Many birds are injured or killed by flying into windows. It appears that birds see windows as open space. Can tilting windows down so that they reflect earth rather than sky reduce bird strikes? Suppose we randomly place three windows at the edge of a woods: one vertical, one tilted 20 degrees, and one tilted 40 degrees. During a randomly-selected four- month period, there are 53 bird strikes: 31 on the vertical window, 14 on the 20-degree window, and 8 on the 40- degree window. If tilting had no effect, we would expect strikes on all three windows to have equal probability. Test this null hypothesis. What do you conclude?
22
53 bird strikes total: 31 on the vertical window, 14 on the 20 degree window, 8 on the 40 degree window We want to test Ho: pv = pt20 = pt40 = 1/3 Ha: At least one of these proportions of bird hits is different
23
53 bird strikes; 31 on the vertical window, 14 on the 20 degree window, 8 on the 40 degree window
Conditions: 53 birds total in our sample; so expected counts are for each 53 ÷ 3 = 17.67; so each expected count is > 5 ✔ Random – stated in problem ✔ Independent – reasonable to assume population is at least (10)(53) ✔ Use Minitab to do calculations... See next slide for #’s
24
53 bird strikes; 31 on the vertical window, 14 on the 20 degree window, 8 on the 40 degree window
Calculations: stat, goodness of it, chi square test X2 = p-value ≈ 0
25
53 bird strikes; 31 on the vertical window, 14 on the 20 degree window, 8 on the 40 degree window
Interpretation Reject null hypothesis. Since the p-value ≈ 0 (less than α = 5%), there is evidence that at least one proportion of all bird hits like these is different.
26
The University of Chicago's General Social Survey (GSS) is the nation's most important social science sample survey. The GSS regularly asks randomly-selected subjects their astrological sign. Above are the counts of responses in the most recent year this question was asked. If births were spread uniformly across the year, we would expect all 12 signs to be equally likely. Are they?
27
We want to test: Chi Square Goodness of Fit Ho: pAries = pTaurus = pGemini = ... = pPisces = 1/12 Ha: At least one of the proportions differs from 1/12 Check conditions; do calculations. Name test. Df = 11; chi square 14.39; p-val = 0.21
28
Ho: pAries = pTaurus = pGemini =
Ho: pAries = pTaurus = pGemini = ... = pPisces = 1/12 Ha: At least one of the proportions differs from 1/12 Conditions; Random, Expected Counts > 5, Independent Calculations; StatCrunch Interpretation; Fail to reject the null hypothesis. With a p- value of 0.21, which is larger than any reasonable α, there is not enough evidence to conclude that all births are not uniformly spread throughout the year. Remember errors...
29
Housing... According to the Census Bureau, the distribution by ethnic background of the New York City population in a recent year was: Hispanic: 28% Black: 24% White: 35% Asian: 12% Other: 1% The manager of a large housing complex in the city wonders whether the distribution by race of the complex’s residents is consistent with the population distribution. To find out, she records data from a random sample of 800 residents. The following table displays the sample data.
30
Census bureau says: 28% Hispanic; 24% Black; 35% White; 12% Asian; 1% Other
Are these data significantly different from the city’s distribution by race? Carry out an appropriate test to support your answer. Race Hispanic Black White Asian Other Total Count 212 202 270 94 22 800
31
Census bureau says: 28% Hispanic; 24% Black; 35% White; 12% Asian; 1% Other
Race Hispanic Black White Asian Other Total Count 212 202 270 94 22 800 Ho: pHispanic = 28% pBlack =24% pWhite = 35% pAsian = 12% pother = 1% Ha: At least one population proportion is different Conditions: Random, Expected Counts, Independent
32
Census bureau says: 28% Hispanic; 24% Black; 35% White; 12% Asian; 1% Other
Race Hispanic Black White Asian Other Total Count 212 202 270 94 22 800 Random – stated in problem Expected Counts – all expected counts must be > 5 Hispanic: (800)(.28) = 224 > 5 ✔ Black: (800)(.24) = 192 > 5 ✔ White: (800)(.35) = 280 > 5 ✔ Asian: (800)(.12) = 96 > 5 ✔ Other: (800)(.01) = 8 > 5 ✔ Independent – Reasonable to assume population at least (10)(800)
33
Census bureau says: 28% Hispanic; 24% Black; 35% White; 12% Asian; 1% Other
Race Hispanic Black White Asian Other Total Count 212 202 270 94 22 800 Calculations: StatCrunch, stat, goodness of fit, chi square test (expected, select column) X2 = p-value < df = 4 Reject Ho. With a p-value of about zero, and an alpha level of 5%, we have sufficient evidence to support that at least one proportion is different (or all residents like these in a housing complex do not follow the ethnic background distribution of all of New York City as a whole).
34
Aw, Nuts! A company claims that each batch of its deluxe mixed nuts contains 52% cashews, 27% almonds, 13% macadamia nuts, and 8% brazil nuts. To test this claim, a quality control inspector takes a random sample of 150 nuts from the latest batch. The following table below displays the sample data: Nut Cashew Almond Mac Brazil Actual Sample Count 83 29 20 18
35
Aw, nuts! Nut Cashew Almond Mac Brazil Actual Sample Count 83 29 20 18 Claim: 52% cashews, 27% almonds, 13% macadamia nuts, and 8% brazil nuts. Test the following null and alternative hypotheses: Ho: : pcashews = 52% palmonds = 27% pmac = 13% pbrazil = 8% Ha: At least one proportion is different Conditions: Random – stated in the problem
36
Aw, nuts! Test the following null and alternative hypotheses:
Cashew Almond Mac Brazil Actual Count 83 29 20 18 Test the following null and alternative hypotheses: Ho: : pcashews = 52% palmonds = 27% pmac = 13% pbrazil = 8% Ha: At least one proportion is different Conditions: Expected Counts must all be > 5 Cashews: (150)(.52) Almond: (150)(.27) Mac: (150)(.13) Brazil: (150)(.08)
37
Aw, nuts! Test the following null and alternative hypotheses:
Cashew Almond Mac Brazil Actual Count 83 29 20 18 Test the following null and alternative hypotheses: Ho: : pcashews = 52% palmonds = 27% pmac = 13% pbrazil = 8% Ha: At least one proportion is different Conditions: Assume population is at least (10)(150).
38
Aw, nuts! Test the following null and alternative hypotheses:
Cashew Almond Mac Brazil Actual Count 83 29 20 18 Test the following null and alternative hypotheses: Ho: : pcashews = 52% palmonds = 27% pmac = 13% pbrazil = 8% Ha: At least one proportion is different Calculations: StatCrunch, stat, goodness of fit, Chi square test; X2 = 6.598, p-value = , df = 3
39
Aw, nuts! Test the following null and alternative hypotheses:
Cashew Almond Mac Brazil Actual Sample Count 83 29 20 18 Test the following null and alternative hypotheses: Ho: pcashews = 52% palmonds = 27% pmac = 13% pbrazil = 8% Ha: At least one proportion is different Fail to reject Ho. With a p-value of about 9% and an alpha level of 5%, we do not have sufficient evidence to conclude that the proportions of all nuts are different from the claim.
40
M&M’s Activity... Mars claims: 13% red, 13% brown, 20% orange, 24% blue, 16% green, 14% yellow in each bag of milk chocolate M&Ms Based on our sample data, do we have reason to doubt the color distribution claim made by M&M/Mars Company? Give appropriate statistical evidence to support your conclusion. Let’s randomly form groups and turn in one paper, start to finish, with all of your group’s names on the paper. What do we do with broken m&ms?
41
Mars claims: 13% red, 13% brown, 20% orange, 24% blue, 16% green, 14% yellow in each bag of milk chocolate M&Ms Ho: pred = 13% pbrown = 13% porange = 20% pblue = 24% pgreen = 16% pyellow = 14% Ha: At least one of these proportions is different
42
Mars claims: 13% red, 13% brown, 20% orange, 24% blue, 16% green, 14% yellow in each bag of milk chocolate M&Ms Conditions: All individual expected counts are at least 5 (0.13)(Total) (0.13)(Total) (0.20)(Total) (0.24)(Total) (0.16)(Total) (0.14)(Total)
43
Mars claims: 13% red, 13% brown, 20% orange, 24% blue, 16% green, 14% yellow in each bag of milk chocolate M&Ms Interpretation: Reject/Fail to reject; reference α and p-value, in context of problem
44
Inference for 2-way tables...(categorical data)
Two tests possible with 2-way tables (for categorical data) Test of Homogeneity – independent SRS’s from each of ‘c’ populations (2, 3, 4, etc. populations, treatments, etc.) OR Test of Independence/Association – one SRS, one population
45
Inference for 2-way tables...
Same test (Chi-Square Test) used for either homogeneity or independence/association Test of Homogeneity – independent SRS’s from each of ‘c’ populations (2, 3, 4, etc. populations, treatments, etc.) OR Test of Independence/Association – one SRS, one population
46
Test of Homogeneity of Populations... two-way tables...
2 independent random samples 3 randomly selected groups High social comparison, high mastery, etc.; whether we are looking at homogeneity or independence, we are interested in knowing whether the two categorical variables are related/associated or unrelated.
47
Chi-Square Test of Homogeneity of Populations... (2 or more populations)
rows & columns r x c always; (never c x r) don’t count the ‘totals’ as a row or a column categories in rows and other categories in columns separate and independent SRS’s from each population
48
Chi-Square Test of Homogeneity of Populations... (2 or more populations)
Ho (null hypothesis): xxx (context) is independent of xxx (context) OR there is no association between xxx and xxx Ha (alternative hypothesis): xxx (context) is dependent of xxx (context) or there is an association between xxx and xxx Still use: Idea is still how far is observed from expected counts?
49
Chi-Square Test of Homogeneity of Populations... (2 or more populations)
Calculations for expected cell count, for EACH cell: So, in this table, you would have to calculate 15 expected values... lots of time and work... If doing by hand; we will never do by hand... Always Minitab. Note: original 2-way table is always in whole counts; but expected counts may not be whole counts (quite often are decimals). And as always, Minitab will calculate p-val, chi sq statistic, df, etc.
50
Chi-Square Test of Homogeneity of Populations... (2 or more populations)
degrees of freedom... up to now, general rule k – 1 now, 2-way tables, so df = (r – 1)(c – 1) p-value is area to the right, under the X2 density curve Again, this is just fyi. Minitab will do this for us automatically.
51
Chi-Square Test of Homogeneity of Populations... (2 or more populations)
Conditions.... Random Samples Independent Measurements – each measurement on an individual is independent of all other measurements; each population is at least (10) times each sample size Large Sample - All individual expected counts must be ≥ 5; it’s all about expected
52
Chi-Square Test of Homogeneity of Populations... (2 or more populations)
Market researchers know that background music can influence the mood and purchasing behavior of customers. One study in a supermarket in Northern Ireland compared three treatments: no music, French accordion music, and Italian string music. Under each condition, the researchers recorded the numbers of bottles of French, Italian, and other wine. We may assume each a SRS.
53
Chi-Square Test of Homogeneity of Populations... (2 or more populations)
Are the distributions of wines selected different in all three populations of music types?
54
We want to use X2 test of homogeneity to compare the distribution of types of wine selected for each type of music. Our hypotheses are: H0: The distributions of wine selected are the same (are not associated; are independent) in all three populations of music types. Ha: The distributions of wine selected are not all the same (are dependent, are associated).
55
Conditions To use the chi-square test for homogeneity of populations:
Random – stated in problem. Independence – Must assume that one wine purchaser did not influence any other; & that each population is at least (10) times each sample size. Large Samples - All expected cell counts are ≥ 5 (how can we check this?) We must actually perform the test to check conditions; kind of backwards, but is faster than calculating the expected’s by hand. So let’s go ahead and enter this data into StatCrunch; remember the large sample is all about expected, so no worries on the ‘1’ for french music and italian wine. That MAY be ok. NOTE: don’t put in totals; not part of data.
56
Calculations (& checking expected counts)
Input data; then StatCrunch, stat, tables, contingency, with summary X2 = p = df = 4
57
Chi-Square Test of Homogeneity of Populations... (2 or more populations)
H0: The distributions of wine selected are the same (no association; independent) in all three populations of music types. Ha: The distributions of wine selected are not all the same (dependent; associated). Interpretation Reject null hypothesis. There is evidence to reject Ho (p- value ≈ 0 < any reasonable α) and conclude that the type of music being played has an effect on all wine sales like these (distribution of wine sales are not the same, are associated, are dependent).
58
How are schools doing? The nonprofit group Public Agenda conducted telephone interviews with 3 independent, randomly selected groups of parents of high school children. There were 202 black parents, 202 Hispanic parents, and 201 white parents. One question asked was “Are the high schools in your state doing an excellent, good, fair, or poor job, or don't you know enough to say?” Here are the survey results:
59
How are schools doing? Conduct a test of significance to determine if the distributions for these three populations are the same. Homogeneity; do with a partner; 10 mins remember no totals
60
How are schools doing? Ho: The distributions of responses to this question are the same for each group (independent, not associated). Ha: The distributions of responses to this question are not the same (dependent, associated).
61
How are schools doing? Conditions: Data is from independent random samples ✔ All expected cell counts must be ≥ 5
62
How are schools doing? Conditions continued... all expected cell counts must be ≥
63
How are schools doing? Calculations... X2 = p = df = 8
64
How are schools doing? X2 = p = df = 8 Interpretation: With a p-value of which is less than any reasonable α level, reject Ho and conclude that all groups like these have different opinions about performance of high schools in their state (dependent, associated).
65
One more X2 Test.... When do we use Chi-Square tests? Why haven’t we used them before now? Chi-Square Goodness of Fit Test Chi-Square Test of Homogeneity Now, Chi-Square Test of Association/Independence
66
Chi-Square Test of Association/Independence...
Two-way table (like X2 test of homogeneity) BUT this test involves only a single population/sample; single SRS, with each individual classified according to both of two categorical variables Bonus (or not....), we use the exact same procedure in the calculator for BOTH of these tests; conditions are the same as well
67
Chi-Square Test of Association/Independence...
Ho: No association between categorical variables OR Categorical variables are independent Ha: There is an association (or they are not independent) Caution: Do not use ‘cause;’ the most we can say is that there is/is not a relation or they are/are not associated or they are/are not independent
68
Franchises that succeed...
Many popular businesses, like McDonalds, are franchises. The owner of a local franchise benefits from the brand recognition, national advertising, and detailed guidelines provided by the franchise chain. In return, he or she pays fees to the franchise firm and agrees to follow its policies. The relationship between the local entrepreneur and the franchise firm is spelled out in a detailed contract. One clause that the contract may contain is the entrepreneur's right to an exclusive territory. This means that the new outlet will be the only representative of the franchise in a specified territory and will not have to compete with other outlets of the same chain. How does the presence of an exclusive-territory clause in the contract relate to the survival of the business? A study designed to address this question collected data from a random sample of 170 new franchise firms. Do our own test on movies and pizza
69
Franchises that succeed...
How does the presence of an exclusive territory clause in a contract relate to the survival of a business? i.e., are these two qualities related/associated? Are they independent?
70
Franchises that succeed...
Ho: Success and exclusive-territory are independent (not associated). Ha: Success and exclusive-territory are dependent (associated)
71
Ho:. Success and exclusive-territory are independent (not associated)
Ho: Success and exclusive-territory are independent (not associated). Ha: Success and exclusive-territory are dependent (associated) Conditions: SRS – stated in problem Independence – Must assume that one respondent does not influence any other; and (10)(170) Expected cell counts – all must be ≥ 5
72
Ho:. Success and exclusive-territory are independent (not associated)
Ho: Success and exclusive-territory are independent (not associated). Ha: Success and exclusive-territory are dependent (associated) Conditions continued... Expected cell counts – all must be ≥ 5 Expected cell counts:
73
Ho:. Success and exclusive-territory are independent (not associated)
Ho: Success and exclusive-territory are independent (not associated). Ha: Success and exclusive-territory are dependent (associated) Calculations X2 = 5.91 p = df = 1
74
Ho:. Success and exclusive-territory are independent (not associated)
Ho: Success and exclusive-territory are independent (not associated). Ha: Success and exclusive-territory are dependent (associated) X2 = 5.91 p = df = 1 Interpretation: Reject Ho. There is sufficient evidence against Ho (p = 0.015) at α = 0.05 level. Conclude success and exclusive territory clause are dependent for all franchise firms like these (there is an association between the two qualities) Caution: This procedure does not show ‘cause’
75
Extracurricular Activities & Grades...
North Carolina State University studied student performance in a course required by its chemical engineering major. One question of interest is the relationship between time spent in extracurricular activities and whether a student earned a C or better in the course. Here are the data from a SRS of 119 students who answered a question about extracurricular activities:
76
Extracurricular Activities & Grades...
Is there statistically significant evidence of an association between the amount of time spent on extracurricular activities and grades earned in the course? 10 mins with partner.
77
Extracurricular Activities & Grades...
Ho: There is no association (they are independent) between amount of time spent on extracurricular activities and grades earned in the course Ha: There is an association (they are not independent)
78
Extracurricular Activities & Grades...
Conditions: SRS: stated in the problem Independence: stated in problem Each expected cell count ≥ 5 Expected cell counts
79
Calculations: X2 = 6. 93 p = 0. 031 df = 2 Interpretation: Reject Ho
Calculations: X2 = 6.93 p = df = 2 Interpretation: Reject Ho. With a p-value of and an α = 0.05, there is not sufficient evidence to support Ho. Conclude that there is an association between extracurricular activities and grades earned by all students like these. What if our alpha would have been 1%? Possible errors to make?
80
Let’s collect data... Pizza Nachos Horror Action/Adventure Comedy
Gather data; then in pairs (randomly assigned) let’s do a chi square test to see if there is a relationship between fav movie snack and fav movie. Turn in. Start to finish.
81
Let’s use Math 140 data... Assignment: Choose two categorical sets of data that we believe are related/associated Cut and paste into StatCrunch; clean up dirty data (justify any data that you ‘clean’) Complete a full X2 Test of Independence with data (full write up, including null and alternative hypothesis, conditions, screen shot of test results, interpretation); include what type of error you could have committed; include the largest contributor and the smallest contributor to your Chi-square test statistic; also what would happen if you changed your alpha level? Would your decision change? Why or why not? Let’s do a practice run through using political party and favorite social media
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.