Warm Up The number of motor vehicles registered (in millions) in the U.S. has grown as charted in the table. 1)Plot the number of vehicles against time. What kind of growth does this exhibit? 2)Use logs to transform into a linear association and plot the transformed data. 3)Clearly, the point (1945, 1.49) is an outlier. Can you suggest an explanation? 4)Delete the outlier and use the remaining points to find the LSRL equation. Write the equation and draw this line on a second graph. 5)Determine the model for the original data set. YearVehicles
4.2: Relationships between Categorical Variables Two-Way Tables To analyze categorical data, we use the counts or percents of individuals that fall into various categories; finding percentages is the key to “describing the relationships” among categorical variables. Round-off Error = okay To calculate percentages = favorable/possible Two types of distributions: Marginal distributions (has to do with what goes into the margins – the counts for each row and column) – calculated from marginal frequencies - label as percentages. Each marginal distribution from a two-way table is a distribution for a single categorical variable which can then be displayed as a bar graph or pie chart. Conditional distributions (has to do with calculating the distribution of percents for one variable across some condition on the other variable); reveals the nature of an association
Census Bureau data describing the age and sex of college students 1) Row Variable/Column Variable/Marginal Distributions/Roundoff Error 2) % of college students that are women? 3) % of college students who are ? 4) Marginal distribution of age group in percents (Graph next slide). 5) % of college students that are women in the age group? % of college students that are men in the age group? (Conditional dist.) 6) % of college students that are women in the 35+ age group? % of college students that are men in the 35+ age group? (Conditional dist.) 7) % of female college students in four age groups? Age Group FemaleMaleTOTAL TOTAL
Left Graph: Bar graph of the distribution of age for college students (marginal dist.) Right graph: Comparing the percent of female college students in four age groups. * Common to display categories on x-axis and frequencies (or %s) on y-axis*
Reading Software
In a survey at a public high school, a random sample from each class was asked whether they considered themselves to be a Democrat or a Republican. Here is the data 1)What are the row and column variables? 2)Calculate the Marginal Frequencies by Grade and by Voting Party. 3)Calculate the Marginal distributions by Grade and by Voting Party. 4)Compute the conditional distribution of Freshmen who are Republicans. 5)Compute the conditional distribution of Seniors who are Democrats. 6)Draw a bar graph depicting the marginal distribution of the sample across grade. 7)Draw a bar graph comparing the percent of democrats across the four grades. Grade DemocratRepublican Total Total
Accident victims are sometimes taken by helicopter from an accident to a hospital. Helicopters save time. Do they also save lives? HeliRoad Died64260 Survived Total
SeriousLess Serious HeliRoad Died4860 Surviv ed 5240 Total100 HeliRoad Died16200 Surviv ed Total