LSRL Least Squares Regression Line

Slides:



Advertisements
Similar presentations
Linear Regression (LSRL)
Advertisements

Examining Relationships Chapter 3. Least Squares Regression Line If the data in a scatterplot appears to be linear, we often like to model the data by.
Chapter 3 Bivariate Data
Warm up Use calculator to find r,, a, b. Chapter 8 LSRL-Least Squares Regression Line.
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Least Squares Regression
Haroon Alam, Mitchell Sanders, Chuck McAllister- Ashley, and Arjun Patel.
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Linear Regression.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Chapter 20 Linear Regression. What if… We believe that an important relation between two measures exists? For example, we ask 5 people about their salary.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
3.2 Least Squares Regression Line. Regression Line Describes how a response variable changes as an explanatory variable changes Formula sheet: Calculator.
Chapters 8 & 9 Linear Regression & Regression Wisdom.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. DosageHeart rate
Correlation and Regression. Section 9.1  Correlation is a relationship between 2 variables.  Data is often represented by ordered pairs (x, y) and.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
AP STATISTICS Section 3.2 Least Squares Regression.
Least Squares Regression.   If we have two variables X and Y, we often would like to model the relation as a line  Draw a line through the scatter.
 Chapter 3! 1. UNIT 7 VOCABULARY – CHAPTERS 3 & 14 2.
POD 09/19/ B #5P a)Describe the relationship between speed and pulse as shown in the scatterplot to the right. b)The correlation coefficient, r,
Linear Regression Day 1 – (pg )
Least Squares Regression Lines Text: Chapter 3.3 Unit 4: Notes page 58.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Bellwork (Why do we want scattered residual plots?): 10/2/15 I feel like I didn’t explain this well, so this is residual plots redux! Copy down these two.
Unit 4 Lesson 3 (5.3) Summarizing Bivariate Data 5.3: LSRL.
Chapter 7 Linear Regression. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable.
Chapter 5 Summarizing Bivariate Data Correlation.
Chapter 5 Lesson 5.2 Summarizing Bivariate Data 5.2: LSRL.
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Unit 3 Correlation. Homework Assignment For the A: 1, 5, 7,11, 13, , 21, , 35, 37, 39, 41, 43, 45, 47 – 51, 55, 58, 59, 61, 63, 65, 69,
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Least Squares Regression Textbook section 3.2. Regression LIne A regression line describes how the response variable (y) changes as an explanatory variable.
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
Chapter 3 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Chapter 5 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
CHAPTER 3 Describing Relationships
Unit 4 LSRL.
LSRL.
Chapter 4.2 Notes LSRL.
Least Squares Regression Line.
Chapter 5 LSRL.
LSRL Least Squares Regression Line
Chapter 4 Correlation.
Chapter 3.2 LSRL.
The Least-Squares Regression Line
Correlation and Regression
Least Squares Regression Line LSRL Chapter 7-continued
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 5 LSRL.
Chapter 5 LSRL.
Chapter 5 LSRL.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Honors Statistics Review Chapters 7 & 8
CHAPTER 3 Describing Relationships
Presentation transcript:

LSRL Least Squares Regression Line Chapter 5 LSRL Least Squares Regression Line

Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict y

Be sure to put the hat on the y - (y-hat) means the predicted y b – is the slope it is the approximate amount by which y increases when x increases by 1 unit a – is the y-intercept it is the approximate height of the line when x = 0 in some situations, the y-intercept has no meaning Be sure to put the hat on the y

Least Squares Regression Line LSRL The line that gives the best fit to the data set The line that minimizes the sum of the squares of the deviations from the line

(0,0) (3,10) (6,2) y =.5(6) + 4 = 7 2 – 7 = -5 4.5 y =.5(0) + 4 = 4 0 – 4 = -4 -5 y =.5(3) + 4 = 5.5 10 – 5.5 = 4.5 -4 (0,0) Sum of the squares = 61.25

What is the sum of the deviations from the line? Will it always be zero? Use a calculator to find the line of best fit (0,0) (3,10) (6,2) 6 Find y - y -3 The line that minimizes the sum of the squares of the deviations from the line is the LSRL. -3 Sum of the squares = 54

Interpretations Slope: For each unit increase in x, there is an approximate increase/decrease of b in y. Correlation coefficient: There is a direction, strength, linear of association between x and y.

The ages (in months) and heights (in inches) of seven children are given. x 16 24 42 60 75 102 120 y 24 30 35 40 48 56 60 Find the LSRL. Interpret the slope and correlation coefficient in the context of the problem.

Correlation coefficient: Predicted heights = 20.4 + .342(age in months) Correlation coefficient: There is a strong, positive, linear association between the age and height of children. Slope: For an increase in age of one month, there is an approximate increase of .34 inches in heights of children.

Predict the height of a child who is 4.5 years old. The ages (in months) and heights (in inches) of seven children are given. x 16 24 42 60 75 102 120 y 24 30 35 40 48 56 60 Predict the height of a child who is 4.5 years old. Predict the height of someone who is 20 years old. 38.5 inches Graph, find lsrl, also examine mean of x & y 102.48 inches or 8.5 feet?

Interpolation (good): Using a regression line for estimating predicted values between known values.

Extrapolation (bad) The LSRL should not be used to predict y for values of x outside the data set. It is unknown whether the pattern observed in the scatterplot continues outside this range.

For these data, this is the best equation to predict y from x. The ages (in months) and heights (in inches) of seven children are given. The LSRL is Can this equation be used to estimate the age of a child who is 50 inches tall? Calculate: LinReg L2,L1 For these data, this is the best equation to predict y from x. Do you get the same LSRL? However, statisticians will always use this equation to predict x from y

Will this point always be on the LSRL? The ages (in months) and heights (in inches) of seven children are given. x 16 24 42 60 75 102 120 y 24 30 35 40 48 56 60 Calculate x & y and sx and sy : Plot the point (x, y) on the scatterplot. Graph, find lsrl, also examine mean of x & y Will this point always be on the LSRL?

The correlation coefficient and the LSRL are both non-resistant measures.

Slope = correlation coefficient (st. dev. of y / st.dev. of x) Formulas – on chart Predicted y = y-intercept + slope(x) Slope formula Y-intercept = mean of y – slope (mean of x) Slope = correlation coefficient (st. dev. of y / st.dev. of x)

The following statistics are found for the variables posted speed limit and the average number of accidents. Find the LSRL & predict the number of accidents for a posted speed limit of 50 mph.

b0 = 18 - .7228(40) = -10.92 For LSRL need slope and y-intercept: Predicted # of accidents = .723(posted speed limit) – 10.92

Describe the relationship between the two variables: Sleep Score 8 25 9 28 7 21 10 26 8.5 18 6.5 16 5.5 11 29 19 6 7.5 23 24 27 20 Describe the relationship between the two variables: (Score = a + b (Hours of Sleep)) What is the equation for the LSRL? What is the slope of this line? Interpret the slope in the context of the problem. What are the units? If a student got 10 hours of sleep the night before the exam, use the linear equation to approximate her score on the ACT.

Example: The average annual cost per person due to traffic delays for 70 US cities in 2000 was $298.96 with a standard deviation of $180.83. The peak period average freeway speed is 54.34 mph with a standard deviation of 4.494 mph. The correlation between cost per person and freeway speed is -0.90. Write a regression model to estimate costs per person associated with traffic delays. = = = = r =

c) The regression model is Price = 9.564 + 122.74 size Example: A scatterplot of house prices (in thousands of dollars) vs. house size (in thousands of square feet) shows a relationship that is straight, with only moderate scatter and no outliers. The correlation between house price and house size is 0.85. If a house is 1 SD above the mean in size (making it about 2170 sq ft), how many SDs above the mean would you predict its sale price to be? b) What would you predict about the sale price of a house that’s 2 SDs below average in size? c) The regression model is Price = 9.564 + 122.74 size What does the slope of 122.74 mean? d) What are the units? e) How much can a homeowner expect the value of his house to increase if he builds on an additional 2000 sq ft? f) How much would you expect to pay for a house of 3000 sq ft?