Created by Erin Hodgess, Houston, Texas

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Section 10-3 Regression.
Kin 304 Regression Linear Regression Least Sum of Squares
Chapter 12 Simple Linear Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Correlation and Regression
Chapter 4 The Relation between Two Variables
Definition  Regression Model  Regression Equation Y i =  0 +  1 X i ^ Given a collection of paired data, the regression equation algebraically describes.
Lecture 5 Curve fitting by iterative approaches MARINE QB III MARINE QB III Modelling Aquatic Rates In Natural Ecosystems BIOL471 © 2001 School of Biological.
Simple Linear Regression and Correlation
Simple Linear Regression Analysis
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Correlation & Regression
Linear Regression.
Introduction to Linear Regression and Correlation Analysis
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Relationship of two variables
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Chapter 10 Correlation and Regression
Correlation & Regression
Chapter 10 Lecture 2 Section: We analyzed paired data with the goal of determining whether there is a linear correlation between two variables.
Inference for Regression Section Starter The Goodwill second-hand stores did a survey of their customers in Walnut Creek and Oakland. Among.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Slide Slide 1 Warm Up Page 536; #16 and #18 For each number, answer the question in the book but also: 1)Prove whether or not there is a linear correlation.
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
Created by Erin Hodgess, Houston, Texas
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
College Prep Stats. x is the independent variable (predictor variable) ^ y = b 0 + b 1 x ^ y = mx + b b 0 = y - intercept b 1 = slope y is the dependent.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Overview Overview 10-2 Correlation 10-3 Regression-3 Regression.
1 Objective Given two linearly correlated variables (x and y), find the linear function (equation) that best describes the trend. Section 10.3 Regression.
Lecture Slides Elementary Statistics Twelfth Edition
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Inference for Least Squares Lines
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
AP Statistics Chapter 14 Section 1.
10.2 Regression If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is.
Correlation and Simple Linear Regression
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Kin 304 Regression Linear Regression Least Sum of Squares
CHS 221 Biostatistics Dr. wajed Hatamleh
CHAPTER 12 More About Regression
Correlation and Regression
BPK 304W Regression Linear Regression Least Sum of Squares
The Least-Squares Regression Line
CHAPTER 10 Correlation and Regression (Objectives)
Correlation and Simple Linear Regression
Lecture Slides Elementary Statistics Thirteenth Edition
Correlation and Regression
Chapter 10 Correlation and Regression
Lecture Notes The Relation between Two Variables Q Q
Correlation and Simple Linear Regression
Lecture Slides Elementary Statistics Eleventh Edition
Lecture Slides Elementary Statistics Tenth Edition
Introduction to Regression
CHAPTER 12 More About Regression
Correlation and Regression
CHAPTER 3 Describing Relationships
Lecture Slides Elementary Statistics Eleventh Edition
CHAPTER 3 Describing Relationships
CHAPTER 12 More About Regression
CHAPTER 3 Describing Relationships
Algebra Review The equation of a straight line y = mx + b
Chapter 9 Correlation and Regression
Correlation and Simple Linear Regression
Presentation transcript:

Created by Erin Hodgess, Houston, Texas Section 9-3 Regression Created by Erin Hodgess, Houston, Texas

Regression Definition Regression Equation ^ The regression equation expresses a relationship between x (called the independent variable, predictor variable or explanatory variable, and y (called the dependent variable or response variable. The typical equation of a straight line y = mx + b is expressed in the form y = b0 + b1x, where b0 is the y-intercept and b1 is the slope. ^

Assumptions 1. We are investigating only linear relationships. 2. For each x-value, y is a random variable having a normal (bell-shaped) distribution. All of these y distributions have the same variance. Also, for a given value of x, the distribution of y-values has a mean that lies on the regression line. (Results are not seriously affected if departures from normal distributions and equal variances are not too extreme.)

Regression Definition Regression Equation y = b0 + b1x Regression Line Given a collection of paired data, the regression equation y = b0 + b1x ^ algebraically describes the relationship between the two variables Regression Line The graph of the regression equation is called the regression line (or line of best fit, or least squares line).

Notation for Regression Equation Population Parameter Sample Statistic ^ y-intercept of regression equation 0 b0 Slope of regression equation 1 b1 Equation of the regression line y = 0 + 1 x y = b0 + b1 x

calculators or computers can compute these values Formula for b0 and b1 Formula 9-2 n(xy) – (x) (y) b1 = (slope) n(x2) – (x)2 b0 = y – b1 x (y-intercept) Formula 9-3 calculators or computers can compute these values

b0 = y - b1x If you find b1 first, then Formula 9-4 Can be used for Formula 9-2, where y is the mean of the y-values and x is the mean of the x values

The regression line fits the sample points best.

Rounding the y-intercept b0 and the slope b1 Round to three significant digits. If you use the formulas 9-2 and 9-3, try not to round intermediate values. page 527 of text

Calculating the Regression Equation 1 2 8 3 6 5 4 Data x y In Section 9-2, we used these values to find that the linear correlation coefficient of r = –0.135. Use this sample to find the regression equation. This is exercise #7 on page 521.

Calculating the Regression Equation 1 2 8 3 6 5 4 Data x y n(xy) – (x) (y) n(x2) –(x)2 b1 = 4(48) – (10) (20) 4(36) – (10)2 –8 44 = –0.181818 n = 4 x = 10 y = 20 x2 = 36 y2 = 120 xy = 48 This is exercise #7 on page 521.

Calculating the Regression Equation 1 2 8 3 6 5 4 Data x y n = 4 x = 10 y = 20 x2 = 36 y2 = 120 xy = 48 b0 = y – b1 x 5 – (–0.181818)(2.5) = 5.45 This is exercise #7 on page 521.

Calculating the Regression Equation 1 2 8 3 6 5 4 Data x y n = 4 x = 10 y = 20 x2 = 36 y2 = 120 xy = 48 The estimated equation of the regression line is: y = 5.45 – 0.182x ^ This is exercise #7 on page 521.

Example: Boats and Manatees Given the sample data in Table 9-1, find the regression equation. Using the same procedure as in the previous example, we find that b1 = 2.27 and b0 = –113. Hence, the estimated regression equation is: This is exercise #7 on page 521. y = –113 + 2.27x ^

Example: Boats and Manatees Given the sample data in Table 9-1, find the regression equation. This is exercise #7 on page 521.

Predictions In predicting a value of y based on some given value of x ... 1. If there is not a significant linear correlation, the best predicted y-value is y. 2. If there is a significant linear correlation, the best predicted y-value is found by substituting the x-value into the regression equation.

Figure 9-8 Predicting the Value of a Variable page 529 of text Figure 9-8 Predicting the Value of a Variable

Example: Boats and Manatees Given the sample data in Table 9-1, we found that the regression equation is y = –113 + 2.27x. Assume that in 2001 there were 850,000 registered boats. Because Table 9-1 lists the numbers of registered boats in tens of thousands, this means that for 2001 we have x = 85. Given that x = 85, find the best predicted value of y, the number of manatee deaths from boats. ^ This is exercise #7 on page 521.

Example: Boats and Manatees Given the sample data in Table 9-1, we found that the regression equation is y = –113 + 2.27x. Given that x = 85, find the best predicted value of y, the number of manatee deaths from boats. ^ We must consider whether there is a linear correlation that justifies the use of that equation. We do have a significant linear correlation (with r = 0.922). This is exercise #7 on page 521.

Example: Boats and Manatees Given the sample data in Table 9-1, we found that the regression equation is y = –113 + 2.27x. Given that x = 85, find the best predicted value of y, the number of manatee deaths from boats. ^ y = –113 + 2.27x –113 + 2.27(85) = 80.0 ^ The predicted number of manatee deaths is 80.0. The actual number of manatee deaths in 2001 was 82, so the predicted value of 80.0 is quite close. This is exercise #7 on page 521.

Guidelines for Using The Regression Equation 1. If there is no significant linear correlation, don’t use the regression equation to make predictions. 2. When using the regression equation for predictions, stay within the scope of the available sample data. 3. A regression equation based on old data is not necessarily valid now. 4. Don’t make predictions about a population that is different from the population from which the sample data was drawn. page 530 of text

Definitions Marginal Change: The marginal change is the amount that a variable changes when the other variable changes by exactly one unit. Outlier: An outlier is a point lying far away from the other data points. Influential Points: An influential point strongly affects the graph of the regression line. page 531 of text The slope b1 in the regression equation represents the marginal change in y that occurs when x changes by one unit.

Residuals and the Least-Squares Property Definitions Residual for a sample of paired (x, y) data, the difference (y - y) between an observed sample y-value and the value of y, which is the value of y that is predicted by using the regression equation. ^ ^ page 533 of text

Residuals and the Least-Squares Property Definitions Residual for a sample of paired (x, y) data, the difference (y - y) between an observed sample y-value and the value of y, which is the value of y that is predicted by using the regression equation. Least-Squares Property A straight line satisfies this property if the sum of the squares of the residuals is the smallest sum possible. ^ ^ page 533 of text

Residuals and the Least-Squares Property Figure 9-9 x 1 2 4 5 y 4 24 8 32 y = 5 + 4x ^ Data is found in margin on page 532.