Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

AP Statistics Section 3.2 C Coefficient of Determination
AP Statistics Section 3.2 B Residuals
The Role of r2 in Regression Target Goal: I can use r2 to explain the variation of y that is explained by the LSRL. D4: 3.2b Hw: pg 191 – 43, 46, 48,
2nd Day: Bear Example Length (in) Weight (lb)
CHAPTER 3 Describing Relationships
Regression, Residuals, and Coefficient of Determination Section 3.2.
C HAPTER 3: E XAMINING R ELATIONSHIPS. S ECTION 3.3: L EAST -S QUARES R EGRESSION Correlation measures the strength and direction of the linear relationship.
Objectives (BPS chapter 5)
Ch 3 – Examining Relationships YMS – 3.1
Section 3.2 Least-Squares Regression
Lesson Least-Squares Regression. Knowledge Objectives Explain what is meant by a regression line. Explain what is meant by extrapolation. Explain.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.2 Least-Squares Regression.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
Linear Regression Least Squares Method: the Meaning of r 2.
Section 2.2 Correlation A numerical measure to supplement the graph. Will give us an indication of “how closely” the data points fit a particular line.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
WARM-UP Do the work on the slip of paper (handout)
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
CHAPTER 3 Describing Relationships
3.2 - Residuals and Least Squares Regression Line.
AP STATISTICS LESSON 3 – 3 (DAY 2) The role of r 2 in regression.
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
Describing Relationships
Statistics 101 Chapter 3 Section 3.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
AP STATISTICS LESSON 3 – 3 (DAY 2)
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
GET OUT p.161 HW!.
Least Squares Regression
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
Least-Squares Regression
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Section 3.2: Least Squares Regressions
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
9/27/ A Least-Squares Regression.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances represent “left- over” variation in the response after fitting the regression line, these distances are called residuals.

Or in other words, the residuals are the distances from the points to the LSRL.

Calculating a Residual‏ One subject's NEA rose by 135 calories and he gained 2.7 kg of fat. The predicted gain for 135 calories from the regression equation is: The residual for this subject is therefore: observed - predicted

Fat Gain & NEA (yet again!)‏ Here are the residuals for all 16 data values from the NEA experiment: Although residuals can be calculated from any model that is fitted to the data, the residuals from the least- squares line have a special property: the sum of the least-squares residuals is always zero. (Try adding the numbers above- - they add up to zero!)‏

The line y=0 corresponds with the regression line, and also marks the mean of our residuals. The residuals plot magnifies the deviations from the line to make patterns easier to see.

Residual Plots What to look for when examining a residual plot: 1. Residual plots should have no pattern.

Residual Plots What to look for when examining a residual plot: A curved pattern shows that the relationships may not be linear. Increasing spread about the line as x increases indicates the prediction will be less accurate for larger x values. Similarly, decreasing spread indicates the prediction will be less accurate for smaller x values.

Residual Plots What to look for when examining a residual plot: 1. The residual plot should show no pattern. 2.The residuals should be relatively small in size.

The role of r 2 in regression A residual plot is a graphical tool for evaluating how well a linear model fits the data. Look at the residual plot first to see if a linear model is a good fit. If the linear model is a good fit, then there is also a numerical quantity that tells us how well the LSRL does at predicting values of the response variable y. It is r 2, the coefficient of determination.

The role of r 2 in regression r 2 is actually the correlation squared, but there's more to the story... The idea of r 2 is this: how much better is the least- squares line at predicting responses y than if we just used our mean?

The role of r 2 in regression Is the LSRL better at predicting the data values than the mean? r 2 tells us how much better. Here's the line that represents the y mean of our data. Here's our LSRL

Note: Remember we defined the variance back when we talked about standard deviation. r 2 compares the variance from the mean (the SST part of the equation) with the residuals (the SSE part of the equation). Here's the formula:

For example, if r 2 =0.606 (as it does in the NEA example), then about 61% of the variation in fat gain among the individual subjects is due to the straight-line relationship between fat gain and NEA. The other 39% is individual variation among subjects that is not explained by the linear relationship.

When you report a regression, give r 2 as a measure of how successful the regression was in explaining the response. When you see a correlation, square it to get a better feel for the strength of the linear relationship.

Review Facts About Least-Square Regression  The distinction between explanatory and response variables is essential in regression. In the regression setting you must know clearly which variable is explanatory!

Review Facts About Least-Square Regression There is a close connection between correlation and the slope of the LSRL. The slope is This equation says that along the regression line, a change of one standard deviation in x corresponds to a change of r standard deviations in y.

The least-squares regression line of y on x always passes through the point (mean of x values, mean of y values) Review Facts About Least-Square Regression

The correlation r describes the strength of a straight-line relationship. The square of the correlation, r 2, is the fraction of the variation in the values of y that is explained by the least-squares regression of y on x.