Auto Accidents: Whats responsible? Math 70: Group Project Group 8 Janelle Chang Helena Jeanty Rhiana Quail
DISCLAIMER!!! Weather conditions Drivers mental health Drivers physical health Time of day Time of year
Sorting the Data... National vs. Regional –National: all 50 states (not including DC) –Regional: Region 1 ~ North East Region 2 ~ South East Region 3 ~ South MidWest Region 4 ~ North MidWest Region 5 ~ South West Region 6 ~ North West Reasoning… –Allows one to view any type of national behavior –Allows for comparisons to be made within the United States
Normalizing Data Reason: –Every entry needs to be expressed in a standard proportion so that the data can be evaluated equally. State populations differs Number of states per region differ –Basic assumption: more people = more cars = higher number of automobile fatalities.
Testing #1: Does alcohol affect the number of drivers killed in car accidents? assumption –Alcohol affects the number of people killed in car accidents BUT is not the only contributing factor. –Younger people probably drink more irresponsibly so more likely to be involved and be responsible for fatal car accidents. #2: Does a combination of age and alcohol affect the number of people (including drivers) killed in car accidents? #3: Do individual regions mimic national data?
t-Test For each region: H 0 : tot. drivers killed = drunk drivers killed H 1 : tot. drivers killed drunk drivers killed t-Test: » = 0.05, 95% confidence »2-sided test » df = (# obs) - 1
t-Test (#1) Rejecting H 0 Source | SS df MS Number of obs = F( 1, 15) = Model | e e-12 Prob > F = Residual | e e-14 R-squared = Adj R-squared = Total | e e-14 Root MSE = 1.7e reg1normki~d | Coef. Std. Err. t P>|t| [95% Conf. Interval] reg1normdr~k | _cons | 1.37e e e e Reject H 0 : | t| > t 15 ie > 2.131
Regression: driverskilled = 1.37e * drunkdriverskilled
t-Test Accepting H 0 Source | SS df MS Number of obs = F( 1, 2) = Model | e e-14 Prob > F = Residual | e e-15 R-squared = Adj R-squared = Total | e e-14 Root MSE = 8.1e reg5normki~d | Coef. Std. Err. t P>|t| [95% Conf. Interval] reg5normdr~k | _cons | 1.85e e e e Accept H 0 : | t| < t 2 ie < 3.18 < 4.303
Regression: driverskilled = e-07 *drunkdriverskilled
Testing #1: Does alcohol affect the number of drivers killed in car accidents? assumption –Alcohol affects the number of people killed in car accidents BUT is not the only contributing factor. –Younger people probably drink more irresponsibly so more likely to be involved and be responsible for fatal car accidents. #2: Does a combination of age and alcohol affect the number of people (including drivers) killed in car accidents? #3: Do individual regions mimic national data?
F-Test For each region: H 0 : 1 = 2 = 0 H 1 : 1 2 (at least one i 0) F-Test: = 0.05, 95% confidence 1-sided test
F-Test (#2) Rejecting H 0 Source | SS df MS Number of obs = F( 2, 8) = 8.22 Model | e e-13 Prob > F = Residual | e e-14 R-squared = Adj R-squared = T Total | e e-13 Root MSE = 2.1e reg1normki~d | Coef. Std. Err. t P>|t| [95% Conf. Interval] reg1normdr~k | personskil~d | _cons | 2.69e e e e Reject H 0 : F 0.025, 2, 8 > 4.46 ie > 4.46 peoplekilled = 2.69e * drunkdrivers * agekilled
F-Test Accepting H 0 Source | SS df MS Number of obs = F( 2, 5) = 3.64 Model | e e-14 Prob > F = Residual| e e-15 R-squared = Adj R-squared = Total | e e-15 Root MSE = 6.7e reg3normki~d | Coef. Std. Err. t P>|t| [95% Conf. Interval] reg3normdr~k | agekilled | _cons | 1.38e e e e Accept H 0 :F 0.025, 2, 5 < 5.79 ie < 5.79 driverskilled = 1.38e * drunkdrivers * agekilled
Testing #1: Does alcohol affect the number of drivers killed in car accidents? assumption –Alcohol affects the number of people killed in car accidents BUT is not the only contributing factor. –Younger people probably drink more irresponsibly so more likely to be involved and be responsible for fatal car accidents. #2: Does a combination of age and alcohol affect the number of people (including drivers) killed in car accidents? #3: Do individual regions mimic national data?
Confidence Intervals (#3) Confidence Interval of the mean for the National Data ( E E-08, E E-08) Confidence Interval ( E-07, E-07) National Mean of drivers killed: E-07
Region Results with Confidence Intervals RegionMean Lies within National CI
Graph of National Data
ANOVA Test H 0 : national = reg 1 = reg 2 =.….. = reg 6 The number of divers killed in car accidents is independent of the region in which they occur. Reject H 0 if F > F 0.95, 3, 2 = 19.2 F = < 19.2 so accept H 0
Conclusions Nationally, 4 out of the 6 regions rejected the F- test null hypothesis => there is a correlation between age, BAC, and the number of drivers killed. Regionally, 4 out 6 supported the national data trend. The regressions carried out confirm that the number of people killed depends on the number of drunk drivers. Regions do not reflect the national trend for the average number of drivers killed. The number of drivers killed does not depend on the region in which they occur.