Bellwork (Why do we want scattered residual plots?): 10/2/15 I feel like I didn’t explain this well, so this is residual plots redux! Copy down these two.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

CHAPTER 3 Describing Relationships
LSRL Least Squares Regression Line
CHAPTER 3 Describing Relationships
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
3.2 Least Squares Regression Line. Regression Line Describes how a response variable changes as an explanatory variable changes Formula sheet: Calculator.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
3.2 - Least- Squares Regression. Where else have we seen “residuals?” Sx = data point - mean (observed - predicted) z-scores = observed - expected * note.
Correlation and Regression. Section 9.1  Correlation is a relationship between 2 variables.  Data is often represented by ordered pairs (x, y) and.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
 Find the Least Squares Regression Line and interpret its slope, y-intercept, and the coefficients of correlation and determination  Justify the regression.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
SWBAT: Calculate and interpret the residual plot for a line of regression Do Now: Do heavier cars really use more gasoline? In the following data set,
CHAPTER 3 Describing Relationships
LEAST-SQUARES REGRESSION 3.2 Role of s and r 2 in Regression.
Does adding a fuel additive help gasoline mileage in automobiles? Use Linear Regression to analyze the following data: Amount of STP fuel additive added.
The coefficient of determination, r 2, is The fraction of the variation in the value of y that is explained by the regression line and the explanatory.
3.2 - Residuals and Least Squares Regression Line.
The following data represents the amount of Profit (in thousands of $) made by a trucking company dependent on gas prices. Gas $
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Does adding a fuel additive help gasoline mileage in automobiles?
3.2A Least Squares Regression
LSRL Least Squares Regression Line
The following data represents the amount of Profit (in thousands of $) made by a trucking company dependent on gas prices. Gas $
The Least-Squares Regression Line
Correlation and Regression
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Least-Squares Regression
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
Chapter 3 Describing Relationships Section 3.2
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
Chapter 3: Describing Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Warmup A study was done comparing the number of registered automatic weapons (in thousands) along with the murder rate (in murders per 100,000) for 8.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

Bellwork (Why do we want scattered residual plots?): 10/2/15 I feel like I didn’t explain this well, so this is residual plots redux! Copy down these two scatterplots

3.2B&C LSRL—stdev of the Residuals & r 2

If she loves you more each and every day, by linear regression she hated you before you met.

Objectives:  CALCULATE residuals  CONSTRUCT and INTERPRET residual plots  DETERMINE how well a line fits observed data

A scatterplot of home price in thousands of dollars vs. home size in thousands of square feet shows a relatively linear, positive association with r = There are no unusual points and the residual plot shows random scatter.

1.If a home size is one stdev above the mean home size, how many stdev above the mean would you expect the sale price to be?

So, I would expect it’s sale price to be 0.85 stdev above the mean.

2. What would you predict about the sale price of a home 2 SD below the average size?

So, I would expect its sale price to be 1.70 stdev below the mean.

Consider the linear regression: 3.What are the units of the slope?

Consider the linear regression: 4.Interpret the slope.

Consider the linear regression: 5.By how much would the value of my home increase after the addition of 500 sq ft?

Consider the linear regression: 6.How much would you expect to pay for a 3000 sq ft home?

Homework Comments If form isn’t linear, what is it? Correlation vs Association Form, strength (r), direction, outliers Only use correlation/scatter plots for quantitative data r has no units

Let’s go back to the Handspan vs. Height activity

r2r2

Computer Output

BAC

Computer Output

Scatterplot and Residual Plot

Questions

Definition: If we use a least-squares regression line to predict the values of a response variable y from an explanatory variable x, the standard deviation of the residuals (AKA s) is given by: (Whaaat?!)

The standard deviation of the residuals gives us a numerical estimate of the typical size of our prediction errors (AKA residuals.) There is another numerical quantity that tells us how well the least-squares regression line predicts values of the response y.

Definition: The coefficient of determination r 2 is the fraction of the variation in the values of y that is accounted for by the least- squares regression line of y on x. We can calculate r 2 using the following formula: Where and (Whaaat?!) 2

TeamGames WonRuns Scored Atlanta Chic Cubs84738 Cincinnati73722 Colorado67758 Florida64581 Houston85716 LA81675 Montreal94732 NY Mets59672 Philly97877 Pittsburg75707 San Diego61679 San Fran St. Louis NL Statistics for MLB

Miles Driven Price Miles driven and the price of a used Honda CR-V

= data value y Error w.r.t. mean model Proportion of error eliminated by new model for this data point = x 10 8 mean model = 0.8 Error w.r.t. mean model10 8 ? ? ? ? ? ? ? ? ? ? Conceptually, if we computed a proportion in the same way for each data point and combined them sensibly, we would end up with r 2. Est. This Call it 10 units! r 2 is proportion of error (variability) in the response variable (y) accounted for by the given model (w.r.t the mean model).

Bellwork: 10/5/15 Describe what each of these residual plots is telling us about their linear regression

Homework Comments Define your variables. #37: “The slope is which means that the typical highway gas mileage increases on average by mpg for each 1 mpg increase in city mileage.” “An increase in city mileage of 1 mpg is associated with a predicted increase of mpg on average in highway mileage” Reference residual plot CONTEXT!! Meaning!

FRQ Practice Sewing machines Study time