A student wonders if tall women tend to date taller men than do short women. She measures herself, her dormitory roommate, and the women in the adjoining.

AP Statistics Mrs Johnson
Presentation transcript:

A student wonders if tall women tend to date taller men than do short women. She measures herself, her dormitory roommate, and the women in the adjoining rooms. Then she measures the next man each woman dates. Draw & discuss the scatterplot and calculate the correlation coefficient.
Women (x) Men (y)
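The correlation calculation asked for above can be sketched in Python as the average product of z-scores (dividing by n - 1). This is a minimal sketch: the function name `correlation` and the height values are assumptions made up for illustration, since the transcript does not include the measured data.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Sample correlation r: sum of z-score products divided by n - 1."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 points")
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)  # sample standard deviations (n - 1)
    z_products = [((x - x_bar) / s_x) * ((y - y_bar) / s_y)
                  for x, y in zip(xs, ys)]
    return sum(z_products) / (len(xs) - 1)

# Hypothetical heights in inches, invented for illustration only.
women = [62, 64, 65, 67, 70]
men = [68, 67, 70, 72, 73]

r = correlation(women, men)
print(round(r, 3))
```

A value of r near +1 would support the student's hunch; a value near 0 would not.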

Linear Regression

Guess the correlation coefficient

Can we make a Line of Best Fit?

Regression Line This is a line that describes how a response variable (y) changes as an explanatory variable (x) changes. It’s used to predict the value of (y) for a given value of (x). Unlike correlation, regression requires that we have an explanatory variable.

Let’s try some!

Regression Line

The following data show the number of miles driven and the advertised price for 11 used Honda CR-Vs from the model years (prices found at The scatterplot below shows a strong, negative linear association between number of miles and advertised cost. The correlation is The line on the plot is the regression line for predicting advertised price based on number of miles.
Thousand Miles Driven Cost (dollars)

The regression line is shown below… Use it to answer the following. Slope: Y-intercept:

Predict the price for a Honda with 50,000 miles.

Extrapolation This refers to using a regression line for prediction far outside the interval of values of the explanatory variable x used to obtain the line. Such predictions are often not accurate.

Slope: Y-int: Predict weight after 16 weeks: Predict weight at 2 years:
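The extrapolation warning can be sketched as a simple range check wrapped around a prediction. This is only an illustration: the helper `predict`, the coefficients, and the 1-to-16-week data range are all invented here, because the transcript does not give the fitted line.

```python
def predict(a, b, x, x_min, x_max):
    """Predict y_hat = a + b * x and flag whether x lies outside the observed range."""
    y_hat = a + b * x
    is_extrapolation = not (x_min <= x <= x_max)
    return y_hat, is_extrapolation

# Hypothetical line: weight (g) = 100 + 40 * age (weeks),
# assumed to be fitted on ages from 1 to 16 weeks.
A, B = 100.0, 40.0
WEEKS_MIN, WEEKS_MAX = 1, 16

at_16_weeks = predict(A, B, 16, WEEKS_MIN, WEEKS_MAX)
at_2_years = predict(A, B, 104, WEEKS_MIN, WEEKS_MAX)  # 2 years = 104 weeks

print(at_16_weeks)
print(at_2_years)
```

The second prediction is flagged because 104 weeks is far beyond the ages used to fit the line, so the straight-line pattern may no longer hold there.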

Residual

The equation of the least-squares regression line for the sprint time and long-jump distance data is predicted long-jump distance = – 27.3 (sprint time). Find and interpret the residual for the student who had a sprint time of 8.09 seconds.

Regression Let’s see how a regression line is calculated.

Fat vs Calories in Burgers
Fat (g) Calories

Let’s standardize the variables
Fat Cal z-x's z-y's
The line must contain the point $(\bar{x}, \bar{y})$. In z-scores both means are 0, so the line must pass through the origin.

Let’s clarify a little. (Just watch & listen) The equation for a line that passes through the origin can be written with just a slope & no intercept: y = mx. But we’re using z-scores, so our equation should reflect this, and thus it’s $\hat{z}_y = m z_x$. Many lines with different slopes pass through the origin. Which one fits our data the best? That is, which slope gives the line that minimizes the sum of the squared residuals?

Line of Best Fit –Least Squares Regression Line It’s the line for which the sum of the squared residuals is smallest. We want to find the mean squared residual. Focus on the vertical deviations from the line. Residual = Observed - Predicted
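As a rough sketch of "Residual = Observed - Predicted" and of comparing lines by their sum of squared residuals, the snippet below scores two candidate lines on the same points. The function names and the four data points are hypothetical, chosen only to keep the arithmetic easy to follow.

```python
def residuals(xs, ys, a, b):
    """Vertical deviations: observed y minus predicted y_hat = a + b * x."""
    return [y - (a + b * x) for x, y in zip(xs, ys)]

def sum_squared_residuals(xs, ys, a, b):
    """Total of the squared residuals for the line y_hat = a + b * x."""
    return sum(e ** 2 for e in residuals(xs, ys, a, b))

# Hypothetical points, invented for illustration only.
xs = [1, 2, 3, 4]
ys = [2, 4, 5, 8]

# Two candidate lines through the origin with slightly different slopes.
ssr_slope_2 = sum_squared_residuals(xs, ys, 0.0, 2.0)
ssr_slope_1_9 = sum_squared_residuals(xs, ys, 0.0, 1.9)

print(residuals(xs, ys, 0.0, 2.0))
print(ssr_slope_2, ssr_slope_1_9)
```

The candidate with the smaller sum fits these points better in the least-squares sense; the least-squares regression line is the one that makes this sum as small as possible.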

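Let’s find it. (just watch & soak it in) The mean squared residual is $\frac{\sum (z_y - m z_x)^2}{n-1} = \frac{\sum z_y^2}{n-1} - 2m \frac{\sum z_x z_y}{n-1} + m^2 \frac{\sum z_x^2}{n-1}$. St. Dev of z-scores is 1, so variance is 1 also; that makes the first and last fractions equal to 1. The middle fraction $\frac{\sum z_x z_y}{n-1}$ is r! So the mean squared residual is $1 - 2mr + m^2$.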

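Continue… Since the mean squared residual $1 - 2mr + m^2$ is a parabola in $m$, it reaches its minimum at $m = -\frac{b}{2a}$, where $a = 1$ and $b = -2r$ are the coefficients of $m^2$ and $m$. This gives us $m = \frac{2r}{2} = r$. Hence the slope of the best fit line for z-scores is the correlation coefficient → r.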
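The result that the best slope for z-scores equals r can be checked numerically. The sketch below tries a grid of slopes on standardized data and keeps the one with the smallest mean squared residual. The fat and calorie values are hypothetical stand-ins (the burger data are not in the transcript), and the helper names are assumptions.

```python
from statistics import mean, stdev

def z_scores(values):
    """Standardize values using the sample mean and sample standard deviation."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]

def mean_squared_residual(zx, zy, m):
    """Mean squared residual (dividing by n - 1) for the line z_y_hat = m * z_x."""
    return sum((y - m * x) ** 2 for x, y in zip(zx, zy)) / (len(zx) - 1)

# Hypothetical burger data, invented for illustration only.
fat = [10, 20, 25, 30, 40]
calories = [300, 450, 480, 560, 700]

zx, zy = z_scores(fat), z_scores(calories)
r = sum(x * y for x, y in zip(zx, zy)) / (len(zx) - 1)

# Try slopes from -1 to 1 in steps of 0.001 and keep the best one.
slopes = [i / 1000 for i in range(-1000, 1001)]
best_m = min(slopes, key=lambda m: mean_squared_residual(zx, zy, m))

print(round(r, 3), best_m)
```

Up to the grid spacing, the best slope found matches r, and the mean squared residual at that slope equals 1 - r².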

Slope – rise over run A slope of r for z-scores means that for every increase of 1 standard deviation in x, there is an increase of r standard deviations in the predicted y. “Over 1 and up r.” Translate back to x & y values – “over one standard deviation in x, up r standard deviations in y.” Slope of the regression line is: $b = r \frac{s_y}{s_x}$

Why is correlation “r”? Because it was calculated from the regression of y on x after standardizing the variables – just like we have just done – thus r was used to stand for (standardized) regression.

The number of miles (in thousands) for the 11 used Hondas have a mean of 50.5 and a standard deviation of The asking prices had a mean of $14,425 and a standard deviation of $1,899. The correlation for these variables is r = Find the equation of the least-squares regression line and explain what change in price we would expect for each additional 19.3 thousand miles.
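One way to approach this exercise is to build the line from summary statistics, assuming the standard formulas b = r·s_y/s_x for the slope and a = ȳ − b·x̄ for the intercept. In the sketch below the means and the price standard deviation come from the slide. The mileage standard deviation of 19.3 is an assumption inferred from the wording of the question, and r = -0.87 is a placeholder, because the transcript omits both values.

```python
def lsrl_from_summary(x_bar, s_x, y_bar, s_y, r):
    """Return (intercept a, slope b) of y_hat = a + b * x from summary statistics."""
    b = r * s_y / s_x        # slope: r standard deviations of y per s.d. of x
    a = y_bar - b * x_bar    # the line passes through (x_bar, y_bar)
    return a, b

X_BAR = 50.5      # mean miles driven, in thousands (from the slide)
Y_BAR = 14425.0   # mean asking price in dollars (from the slide)
S_Y = 1899.0      # s.d. of asking price in dollars (from the slide)
S_X = 19.3        # ASSUMPTION: s.d. of miles, inferred from the question
R = -0.87         # ASSUMPTION: placeholder; the slide's r is missing

a, b = lsrl_from_summary(X_BAR, S_X, Y_BAR, S_Y, R)
change_per_sd_of_miles = b * S_X  # expected price change per extra s.d. of miles

print(f"predicted price = {a:.2f} + ({b:.2f}) * (thousand miles)")
print(f"change per {S_X} thousand miles: {change_per_sd_of_miles:.2f}")
```

Under these assumptions, an increase of one standard deviation in miles changes the predicted price by r standard deviations of price, which is the "over one standard deviation in x, up r standard deviations in y" idea.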

So let’s write the equation!
Fat (g) Calories
Slope: Explain the slope:

Homework Page 191 (27-32, 35, 37, 39, 41)