Linear Regression Chapter 8.

Slides:



Advertisements
Similar presentations
Least-Squares Regression Section 3.3. Correlation measures the strength and direction of a linear relationship between two variables. How do we summarize.
Advertisements

Linear Regression (C7-9 BVD). * Explanatory variable goes on x-axis * Response variable goes on y-axis * Don’t forget labels and scale * Statplot 1 st.
AP Statistics Chapter 8 Linear Regression.
Chapter 3 Bivariate Data
2nd Day: Bear Example Length (in) Weight (lb)
Warm up Use calculator to find r,, a, b. Chapter 8 LSRL-Least Squares Regression Line.
Scatter Diagrams and Linear Correlation
Regression Wisdom.
Getting to Know Your Scatterplot and Residuals
LSRL Least Squares Regression Line
Chapter 9: Regression Alexander Swan & Rafey Alvi.
Least Squares Regression Line (LSRL)
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
Linear Regression.
ASSOCIATION: CONTINGENCY, CORRELATION, AND REGRESSION Chapter 3.
Regression Wisdom.  Linear regression only works for linear models. (That sounds obvious, but when you fit a regression, you can’t take it for granted.)
Chapter 5 Residuals, Residual Plots, & Influential points.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. DosageHeart rate
Chapter 3.3 Cautions about Correlations and Regression Wisdom.
Copyright © 2010 Pearson Education, Inc. Chapter 9 Regression Wisdom.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
AP STATISTICS Section 3.2 Least Squares Regression.
POD 09/19/ B #5P a)Describe the relationship between speed and pulse as shown in the scatterplot to the right. b)The correlation coefficient, r,
Linear Regression Day 1 – (pg )
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
Least Squares Regression Lines Text: Chapter 3.3 Unit 4: Notes page 58.
Unit 4 Lesson 3 (5.3) Summarizing Bivariate Data 5.3: LSRL.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 7 Linear Regression. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable.
Regression Wisdom. Getting the “Bends”  Linear regression only works for linear models. (That sounds obvious, but when you fit a regression, you can’t.
Residuals, Influential Points, and Outliers
Chapter 5 Lesson 5.2 Summarizing Bivariate Data 5.2: LSRL.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 9 Regression Wisdom.
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Unit 3 – Association: Contingency, Correlation, and Regression Lesson 3-3 Linear Regression, Residuals, and Variation.
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Least Squares Regression Textbook section 3.2. Regression LIne A regression line describes how the response variable (y) changes as an explanatory variable.
Chapter 3 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Chapter 5 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
CHAPTER 3 Describing Relationships
Unit 4 LSRL.
LSRL.
Least Squares Regression Line.
Statistics 101 Chapter 3 Section 3.
CHAPTER 3 Describing Relationships
Chapter 5 LSRL.
LSRL Least Squares Regression Line
Chapter 3.2 LSRL.
Outliers… Leverage… Influential points….
Least-Squares Regression
Least Squares Regression Line LSRL Chapter 7-continued
CHAPTER 3 Describing Relationships
Chapter 5 LSRL.
Chapter 5 LSRL.
Chapter 5 LSRL.
Least-Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Inference for Regression Slope
CHAPTER 3 Describing Relationships
Chapter 3 Vocabulary Linear Regression.
Chapter 3.2 Regression Wisdom.
Chapter 9 Regression Wisdom.
Chapter 9 Regression Wisdom.
Honors Statistics Review Chapters 7 & 8
CHAPTER 3 Describing Relationships
Presentation transcript:

Linear Regression Chapter 8

AP Statistics – Chapter 8 Linear Regression AP Statistics – Chapter 8 We use actual values for “x”… so no hat here. 𝑦 = 𝑏 0 + 𝑏 1 𝑥 We are predicting the y-values, thus the “hat” over the “y”. slope y-intercept

Is the scatterplot fairly linear? Is a linear model appropriate? Check 2 things: Is the scatterplot fairly linear? Is there a pattern in the plot of the residuals?

Residuals Residual: Observed y – Predicted y e=𝑦− 𝑦 (difference between observed value and predicted value) Believe it or not, our “best fit line” will actually MISS most of the points. Residual: Observed y – Predicted y e=𝑦− 𝑦

Every point has a residual... and if we plot them all, we have a residual plot. We do NOT want a pattern in the residual plot! This residual plot has no distinct pattern… so it looks like a linear model is appropriate.

OOPS!!! Does a linear model seem appropriate? Although the scatterplot is fairly linear… the residual plot has a clear curved pattern. A linear model is NOT appropriate here.

Linear Not linear Is a linear model appropriate? A residual plot that has no distinct pattern is an indication that a linear model might be appropriate. Linear Not linear

Note about residual plots residuals vs. 𝒙 and residuals vs. 𝒚 will look the same but don’t plot residuals vs. 𝒚 (that will look different)

Consider the following 4 points: (1, 3) (3, 5) (5, 3) (7, 7) Least Squares Regression Line Consider the following 4 points: (1, 3) (3, 5) (5, 3) (7, 7) How do we find the best fit line?

is the line (model) which minimizes the sum of the squared residuals. Least Squares Regression Line is the line (model) which minimizes the sum of the squared residuals.

Facts about LSRL sum of all residuals is zero (some are positive, some negative) sum of all squared residuals is the lowest possible value (but not 0). (since we square them, they are all positive) goes through the point ( 𝑥 , 𝑦 )

𝑥 𝑦 least squares line slope= 𝑟 𝑠 𝑦 𝑠 𝑥 Regression line always contains (x-bar, y-bar) 𝑥 least squares line 𝑦 slope= 𝑟 𝑠 𝑦 𝑠 𝑥

Regression Wisdom Chapter 9

Another look at height vs. age: (this is cm vs months!) ℎ𝑒𝑖𝑔ℎ𝑡 =64.93+0.635∗𝑎𝑔𝑒 What does the model predict about the height of a 180-month (15-year) old person? ℎ𝑒𝑖𝑔ℎ𝑡 =64.93+0.635(180) ℎ𝑒𝑖𝑔ℎ𝑡 =179.23 cm… or about 70.56 inches! (that’s 6 feet, 8 inches!) THAT’S A TALL 15-YEAR OLD!!!

(that’s 12 feet, 1.56 inches!) …what about a 40-year old human… ℎ𝑒𝑖𝑔ℎ𝑡 =64.93+0.635∗𝑎𝑔𝑒 ℎ𝑒𝑖𝑔ℎ𝑡 =64.93+0.635(480) ℎ𝑒𝑖𝑔ℎ𝑡 =369.73 cm… or 145.56 inches! (that’s 12 feet, 1.56 inches!)

Extrapolation (going beyond the useful ends of our mathematical model) Whenever we go beyond the ends of our data (specifically the x-values), we are extrapolating. Extrapolation leads us to results that may be unreliable.

Outliers… Leverage… Influential points…

Outliers, leverage, and influence If a point’s x-value is far from the mean of the x-values, it is said to have high leverage. (it has the potential to change the regression line significantly) A point is considered influential if omitting it gives a very different model.

(model does not change drastically) Outlier or Influential point? (or neither?) Outlier: - Low leverage - Weakens “r” WITHOUT “outlier” WITH “outlier” (model does not change drastically)

(slope changes drastically!) Outlier or Influential point? (or neither?) Influential Point: - HIGH leverage - Weakens “r” WITHOUT “outlier” WITH “outlier” (slope changes drastically!)

Outlier or Influential point? (or neither?) - HIGH leverage - STRENGTHENS “r” Linear model WITH and WITHOUT “outlier”

fin~