Examining Relationships Chapter 3. Least Squares Regression Line If the data in a scatterplot appears to be linear, we often like to model the data by.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

2nd Day: Bear Example Length (in) Weight (lb)
Warm up Use calculator to find r,, a, b. Chapter 8 LSRL-Least Squares Regression Line.
LSRL Least Squares Regression Line
Simple Linear Regression
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Linear Regression and Correlation Analysis
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Correlation and Linear Regression Chapter 13 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Correlation and Linear Regression
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Linear Regression and Correlation
Linear Regression.
Linear Regression and Correlation
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Simple Linear Regression. Deterministic Relationship If the value of y (dependent) is completely determined by the value of x (Independent variable) (Like.
Chapter 11 Correlation and Simple Linear Regression Statistics for Business (Econ) 1.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
SWBAT: Calculate and interpret the residual plot for a line of regression Do Now: Do heavier cars really use more gasoline? In the following data set,
Linear Regression Day 1 – (pg )
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
Least Squares Regression Lines Text: Chapter 3.3 Unit 4: Notes page 58.
Unit 4 Lesson 3 (5.3) Summarizing Bivariate Data 5.3: LSRL.
The coefficient of determination, r 2, is The fraction of the variation in the value of y that is explained by the regression line and the explanatory.
Chapter 7 Linear Regression. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable.
Chapter 5 Lesson 5.2 Summarizing Bivariate Data 5.2: LSRL.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Unit 3 Correlation. Homework Assignment For the A: 1, 5, 7,11, 13, , 21, , 35, 37, 39, 41, 43, 45, 47 – 51, 55, 58, 59, 61, 63, 65, 69,
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
We will use the 2012 AP Grade Conversion Chart for Saturday’s Mock Exam.
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
Chapter 3 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Chapter 5 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
CHAPTER 3 Describing Relationships
The Least Square Regression Line (LSRL) is:
Unit 4 LSRL.
LSRL.
Least Squares Regression Line.
Regression.
Examining Relationships
Examining Relationships
Chapter 5 LSRL.
LSRL Least Squares Regression Line
The scatterplot shows the advertised prices (in thousands of dollars) plotted against ages (in years) for a random sample of Plymouth Voyagers on several.
Chapter 4 Correlation.
Chapter 3.2 LSRL.
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
Chapter 12 Regression.
Least Squares Regression Line LSRL Chapter 7-continued
Regression.
The Weather Turbulence
Regression.
Regression Chapter 8.
Chapter 5 LSRL.
Chapter 5 LSRL.
Chapter 5 LSRL.
Least-Squares Regression
CHAPTER 3 Describing Relationships
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Presentation transcript:

Examining Relationships Chapter 3

Least Squares Regression Line If the data in a scatterplot appears to be linear, we often like to model the data by a line. Least-squares regression is a method for writing an equation passing through the centroid for a line that models linear data. A least squares regression line is a straight line that predicts how a response variable, y, changes as an explanatory variable, x, changes.

Following are the mean heights of Kalama children: Age (months) Height (cm) a) Sketch a scatter plot. Age (months) Height (cm)

b) What is the correlation coefficient? Interpret in terms of the problem. c) Calculate and interpret the slope. d) Calculate and interpret the y-intercept. e) Write the equation of the regression line. f) Predict the height of a 32 month old child. b) d)a = cm At zero months, the estimated mean height for the Kalama children is cm. There is a strong positive linear relationship between height and age. c) b=.635 For every month in age, there is an increase of about.635 cm in height. f)

Good runners take more steps per second as they speed up. Here are there average numbers of steps per second for a group of top female runners at different speeds. The speeds are in feet per second. Speed (ft/s) Steps per second b) c) There is a strong, positive linear relationship between speed and steps per second. For every 1 foot per second increase in speed, the steps increase typically by.0803 steps per second.

Good runners take more steps per second as they speed up. Here are there average numbers of steps per second for a group of top female runners at different speeds. The speeds are in feet per second. Speed (ft/s) Steps per second d) e) f) If a runners speed was 0, the steps per second is about 1.77 steps.

According to the article “First-Year Academic Success...”(1999) there is a mild correlation (r =.55) between high school GPA and college GPA. The high school GPA’s have a mean of 3.7 and standard deviation of The college GPA’s have a mean of 2.86 with standard deviation of a) What is the explanatory variable? b) What is the slope of the LSRL of college GPA on high school GPA? Intercept? Interpret these in context of the problem. c) Billy Bob’s high school GPA is 3.2, what could we expect of him in college? a) High school GPA b) c)

Car dealers across North America use the “Red Book” to help them determine the value of used cars that their customers trade in when purchasing new cars. The book lists on a monthly basis the amount paid at recent used-car auctions and indicates the values according to condition and optional features, but does not inform the dealers as to how odometer readings affect the trade-in value. In an experiment to determine whether the odometer reading should be included, ten 3-year-old cars are randomly selected of the same make, condition, and options. The trade-in value and mileage are shown below. a) b) For every 1000 miles on the odometer, there is a decrease of about $26.68 in trade in value. c) There is a strong negative linear relationship between a car’s odometer reading and the trade-in value.

Coefficient of determination Specifically, the value is the percentage of the variation of the dependent variable that is explained by the regression line based on the independent variable. In other words, in a bivariate data set, the y- values vary a certain amount. How much of that variation can be accounted for if we use a line to model the data.

r 2 example 1 Jimmy works at a restaurant and gets paid $8 an hour. He tracks how much total money he has earned each hour during his first shift.

r 2 example 1 What is my correlation coefficient? r = 1 What is my coefficient of determination? r 2 = 1

r 2 example 1 Look at the variation of y-values about the mean. What explains why the money varies as much as it does?

r 2 example 1 If I draw a line, what percent of the changes in the values of money can be explained by the regression line based on hours?

r 2 example 1 We know 100% of the variation in money can be determined by the linear relationship based on hours.

r 2 example 2 Will be able to explain the relationship between hours and total money for a member of the wait staff with the same precision?

Car dealers across North America use the “Red Book” to help them determine the value of used cars that their customers trade in when purchasing new cars. The book lists on a monthly basis the amount paid at recent used-car auctions and indicates the values according to condition and optional features, but does not inform the dealers as to how odometer readings affect the trade-in value. In an experiment to determine whether the odometer reading should be included, ten 3-year-old cars are randomly selected of the same make, condition, and options. The trade-in value and mileage are shown below. e) f) d) We know 79.82% of the variation in trade-in values can be determined by the linear relationship between odometer reading and trade-in value. In other words, about 80% of the variation can be explained by the odometer while the remaining 20% of the variation relies on other variables.

In one of the Boston city parks there has been a problem with muggings in the summer months. A police cadet took a random sample of 10 days (out of the 90-day summer) and compiled the following data. For each day, x represents the number of police officers on duty in the park and y represents the number of reported muggings on that day. a) For every additional police officer on duty, there is a decrease of approximately.4932 muggings in the park. c) d) We know 93.91% of the variation in muggings can be predicted by the linear relationship between number of police officers on duty and number of muggings. e) b) There is a strong negative linear relationship between the number of police officers and the number of muggings.