Download presentation
Presentation is loading. Please wait.
Published byEthelbert Daniels Modified over 8 years ago
1
Bellwork (Why do we want scattered residual plots?): 10/2/15 I feel like I didn’t explain this well, so this is residual plots redux! Copy down these two scatterplots
2
3.2B&C LSRL—stdev of the Residuals & r 2
3
If she loves you more each and every day, by linear regression she hated you before you met.
4
Objectives: CALCULATE residuals CONSTRUCT and INTERPRET residual plots DETERMINE how well a line fits observed data
5
A scatterplot of home price in thousands of dollars vs. home size in thousands of square feet shows a relatively linear, positive association with r = 0.85. There are no unusual points and the residual plot shows random scatter.
6
1.If a home size is one stdev above the mean home size, how many stdev above the mean would you expect the sale price to be?
7
So, I would expect it’s sale price to be 0.85 stdev above the mean.
8
2. What would you predict about the sale price of a home 2 SD below the average size?
9
So, I would expect its sale price to be 1.70 stdev below the mean.
10
Consider the linear regression: 3.What are the units of the slope?
11
Consider the linear regression: 4.Interpret the slope.
12
Consider the linear regression: 5.By how much would the value of my home increase after the addition of 500 sq ft?
13
Consider the linear regression: 6.How much would you expect to pay for a 3000 sq ft home?
14
Homework Comments If form isn’t linear, what is it? Correlation vs Association Form, strength (r), direction, outliers Only use correlation/scatter plots for quantitative data r has no units
15
Let’s go back to the Handspan vs. Height activity
16
r2r2
17
Computer Output
18
BAC
19
Computer Output
20
Scatterplot and Residual Plot
21
Questions
22
Definition: If we use a least-squares regression line to predict the values of a response variable y from an explanatory variable x, the standard deviation of the residuals (AKA s) is given by: (Whaaat?!)
23
The standard deviation of the residuals gives us a numerical estimate of the typical size of our prediction errors (AKA residuals.) There is another numerical quantity that tells us how well the least-squares regression line predicts values of the response y.
24
Definition: The coefficient of determination r 2 is the fraction of the variation in the values of y that is accounted for by the least- squares regression line of y on x. We can calculate r 2 using the following formula: Where and (Whaaat?!) 2
25
TeamGames WonRuns Scored Atlanta104767 Chic Cubs84738 Cincinnati73722 Colorado67758 Florida64581 Houston85716 LA81675 Montreal94732 NY Mets59672 Philly97877 Pittsburg75707 San Diego61679 San Fran103808 St. Louis87758 1993 NL Statistics for MLB
26
Miles Driven Price 2200017998 2900016450 3500014998 3900013998 4500014599 4900014988 5500013599 5600014599 6900011998 7000014450 8600010998 Miles driven and the price of a used Honda CR-V
27
= data value y Error w.r.t. mean model Proportion of error eliminated by new model for this data point = x 10 8 mean model = 0.8 Error w.r.t. mean model10 8 ? ? ? ? ? ? ? ? ? ? Conceptually, if we computed a proportion in the same way for each data point and combined them sensibly, we would end up with r 2. Est. This Call it 10 units! r 2 is proportion of error (variability) in the response variable (y) accounted for by the given model (w.r.t the mean model).
28
Bellwork: 10/5/15 Describe what each of these residual plots is telling us about their linear regression
29
Homework Comments Define your variables. #37: “The slope is 1.109 which means that the typical highway gas mileage increases on average by 1.109 mpg for each 1 mpg increase in city mileage.” “An increase in city mileage of 1 mpg is associated with a predicted increase of 1.109 mpg on average in highway mileage” Reference residual plot CONTEXT!! Meaning!
30
FRQ Practice Sewing machines Study time
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.