Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 1 of 20 Chapter 4 Section 2 Least-Squares Regression.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Regresi Linear Sederhana Pertemuan 01 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Regression Greg C Elvers.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Chapter 4 The Relation between Two Variables
Cal State Northridge  320 Andrew Ainsworth PhD Regression.
Chapter 4 Describing the Relation Between Two Variables
LINEAR REGRESSION: What it Is and How it Works Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r.
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r.
Describing the Relation Between Two Variables
REGRESSION What is Regression? What is the Regression Equation? What is the Least-Squares Solution? How is Regression Based on Correlation? What are the.
Correlation and Regression Analysis
Math 227 Elementary Statistics Math 227 Elementary Statistics Sullivan, 4 th ed.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Describing the Relation between Two Variables 4.
Business Statistics - QBM117 Least squares regression.
CHAPTER 3 Describing Relationships
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Least Squares Regression
Simple Linear Regression Analysis
Chapter 2 – Simple Linear Regression - How. Here is a perfect scenario of what we want reality to look like for simple linear regression. Our two variables.
Chapter 6 (cont.) Regression Estimation. Simple Linear Regression: review of least squares procedure 2.
Regression, Residuals, and Coefficient of Determination Section 3.2.
1 1 Slide Simple Linear Regression Chapter 14 BA 303 – Spring 2011.
Linear Regression Analysis
LSRL Chapter 4.
Descriptive Methods in Regression and Correlation
Linear Regression.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Introduction to Linear Regression and Correlation Analysis
Chapter 11 Simple Regression
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Chapter 6 & 7 Linear Regression & Correlation
Least-Squares Regression Section 3.3. Why Create a Model? There are two reasons to create a mathematical model for a set of bivariate data. To predict.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
Section 4.2 Least Squares Regression. Finding Linear Equation that Relates x and y values together Based on Two Points (Algebra) 1.Pick two data points.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Statistical Methods Statistical Methods Descriptive Inferential
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Scatter Diagrams Objective: Draw and interpret scatter diagrams. Distinguish between linear and nonlinear relations. Use a graphing utility to find the.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
STA291 Statistical Methods Lecture LINEar Association o r measures “closeness” of data to the “best” line. What line is that? And best in what terms.
Chapter 6 (cont.) Difference Estimation. Recall the Regression Estimation Procedure 2.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
The correlation coefficient, r, tells us about strength (scatter) and direction of the linear relationship between two quantitative variables. In addition,
9.2 Linear Regression Key Concepts: –Residuals –Least Squares Criterion –Regression Line –Using a Regression Equation to Make Predictions.
Least Squares Regression.   If we have two variables X and Y, we often would like to model the relation as a line  Draw a line through the scatter.
POD 09/19/ B #5P a)Describe the relationship between speed and pulse as shown in the scatterplot to the right. b)The correlation coefficient, r,
Chapter 8: Simple Linear Regression Yang Zhenlin.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
June 30, 2008Stat Lecture 16 - Regression1 Inference for relationships between variables Statistics Lecture 16.
© 2001 Prentice-Hall, Inc.Chap 13-1 BA 201 Lecture 18 Introduction to Simple Linear Regression (Data)Data.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Chapter 8 Linear Regression. Fat Versus Protein: An Example 30 items on the Burger King menu:
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Describing the Relation between Two Variables 4.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Chapter 12 Simple Regression Statistika.  Analisis regresi adalah analisis hubungan linear antar 2 variabel random yang mempunyai hub linear,  Variabel.
1 Objective Given two linearly correlated variables (x and y), find the linear function (equation) that best describes the trend. Section 10.3 Regression.
The simple linear regression model and parameter estimation
Lecture Notes The Relation between Two Variables Q Q
3 4 Chapter Describing the Relation between Two Variables
Ch 4.1 & 4.2 Two dimensions concept
Regression and Categorical Predictors
Presentation transcript:

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 1 of 20 Chapter 4 Section 2 Least-Squares Regression

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 2 of 20 Chapter 4 – Section 2 ●Learning objectives  Find the least-squares regression line and use the line to make predictions  Interpret the slope and the y-intercept of the least squares regression line  Compute the sum of squared residuals 1 2 3

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 3 of 20 Chapter 4 – Section 2 ●Learning objectives  Find the least-squares regression line and use the line to make predictions  Interpret the slope and the y-intercept of the least squares regression line  Compute the sum of squared residuals 1 2 3

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 4 of 20 Chapter 4 – Section 2 ●If we have two variables X and Y, we often would like to model the relation as a line ●Draw a line through the scatter diagram ●We want to find the line that “best” describes the linear relationship … the regression line

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 5 of 20 Chapter 4 – Section 2 ●We want to use a linear model ●Linear models can be written in several different (equivalent) ways  y = m x + b  y – y 1 = m (x – x 1 )  y = b 1 x + b 0 ●We want to use a linear model ●Linear models can be written in several different (equivalent) ways  y = m x + b  y – y 1 = m (x – x 1 )  y = b 1 x + b 0 ●Because the slope and the intercept both are important to analyze, we will use y = b 1 x + b 0

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 6 of 20 Chapter 4 – Section 2 ●One difference between math and stat is that statistics assumes that the measurements are not exact, that there is an error or residual ●The formula for the residual is always Residual = Observed – Predicted ●One difference between math and stat is that statistics assumes that the measurements are not exact, that there is an error or residual ●The formula for the residual is always Residual = Observed – Predicted ●This relationship is not just for this chapter … it is the general way of defining error in statistics

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 7 of 20 Chapter 4 – Section 2 ●For example, say that we want to predict a value of y for a specific value of x  Assume that we are using y = 10 x + 25 as our model ●For example, say that we want to predict a value of y for a specific value of x  Assume that we are using y = 10 x + 25 as our model  To predict the value of y when x = 3, the model gives us y = 10  = 55, or a predicted value of 55 ●For example, say that we want to predict a value of y for a specific value of x  Assume that we are using y = 10 x + 25 as our model  To predict the value of y when x = 3, the model gives us y = 10  = 55, or a predicted value of 55  Assume the actual value of y for x = 3 is equal to 50 ●For example, say that we want to predict a value of y for a specific value of x  Assume that we are using y = 10 x + 25 as our model  To predict the value of y when x = 3, the model gives us y = 10  = 55, or a predicted value of 55  Assume the actual value of y for x = 3 is equal to 50  The actual value is 50, the predicted value is 55, so the residual (or error) is 50 – 55 = –5

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 8 of 20 Chapter 4 – Section 2 ●What the residual is on the scatter diagram The model line The x value of interest The observed value y The residual The predicted value y

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 9 of 20 Chapter 4 – Section 2 ●We want to minimize the residuals, but we need to define what this means ●We use the method of least-squares  We consider a possible linear mode  We calculate the residual for each point  We add up the squares of the residuals ●We want to minimize the residuals, but we need to define what this means ●We use the method of least-squares  We consider a possible linear mode  We calculate the residual for each point  We add up the squares of the residuals ●The line that has the smallest is called the least-squares regression line

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 10 of 20 Chapter 4 – Section 2 ●The equation for the least-squares regression line is given by y = b 1 x + b 0  b 1 is the slope of the least-squares regression line  b 0 is the y-intercept of the least-squares regression line

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 11 of 20 Chapter 4 – Section 2 ●Finding the values of b 1 and b 0, by hand, is a very tedious process ●You should use software for this ●Finding the values of b 1 and b 0, by hand, is a very tedious process ●You should use software for this ●Finding the coefficients b 1 and b 0 is only the first step of a regression analysis  We need to interpret the slope b 1  We need to interpret the y-intercept b 0  We need to do quite a bit more statistical analysis … this is covered in Section 4.3 and also in Chapter 14

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 12 of 20 Chapter 4 – Section 2 ●Learning objectives  Find the least-squares regression line and use the line to make predictions  Interpret the slope and the y-intercept of the least squares regression line  Compute the sum of squared residuals 1 2 3

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 13 of 20 Chapter 4 – Section 2 ●Interpreting the slope b 1  The slope is sometimes defined as as  The slope is also sometimes defined as as ●The slope relates changes in y to changes in x

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 14 of 20 Chapter 4 – Section 2 ●For example, if b 1 = 4  If x increases by 1, then y will increase by 4  If x decreases by 1, then y will decrease by 4  A positive linear relationship ●For example, if b 1 = 4  If x increases by 1, then y will increase by 4  If x decreases by 1, then y will decrease by 4  A positive linear relationship ●For example, if b 1 = –7  If x increases by 1, then y will decrease by 7  If x decreases by 1, then y will increase by 7  A negative linear relationship

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 15 of 20 Chapter 4 – Section 2 ●For example, say that a researcher studies the population in a town (the y or response variable) in each year (the x or predictor variable)  To simplify the calculations, years are measured from 1900 (i.e. x = 55 is the year 1955) ●For example, say that a researcher studies the population in a town (the y or response variable) in each year (the x or predictor variable)  To simplify the calculations, years are measured from 1900 (i.e. x = 55 is the year 1955) ●The model used is y = 300 x + 12,000 ●For example, say that a researcher studies the population in a town (the y or response variable) in each year (the x or predictor variable)  To simplify the calculations, years are measured from 1900 (i.e. x = 55 is the year 1955) ●The model used is y = 300 x + 12,000 ●A slope of 300 means that the model predicts that, on the average, the population increases by 300 per year

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 16 of 20 Chapter 4 – Section 2 ●Interpreting the y-intercept b 0 ●Sometimes b 0 has an interpretation, and sometimes not  If 0 is a reasonable value for x, then b 0 can be interpreted as the value of y when x is 0  If 0 is not a reasonable value for x, then b 0 does not have an interpretation ●Interpreting the y-intercept b 0 ●Sometimes b 0 has an interpretation, and sometimes not  If 0 is a reasonable value for x, then b 0 can be interpreted as the value of y when x is 0  If 0 is not a reasonable value for x, then b 0 does not have an interpretation ●In general, we should not use the model for values of x that are much larger or much smaller than the observed values

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 17 of 20 Chapter 4 – Section 2 ●For example, say that a researcher studies the population in a town (the y or response variable) in each year (the x or predictor variable)  To simplify the calculations, years are measured from 1900 (i.e. x = 55 is the year 1955) ●For example, say that a researcher studies the population in a town (the y or response variable) in each year (the x or predictor variable)  To simplify the calculations, years are measured from 1900 (i.e. x = 55 is the year 1955) ●The model used is y = 300 x + 12,000 ●For example, say that a researcher studies the population in a town (the y or response variable) in each year (the x or predictor variable)  To simplify the calculations, years are measured from 1900 (i.e. x = 55 is the year 1955) ●The model used is y = 300 x + 12,000 ●An intercept of 12,000 means that the model predicts that the town had a population of 12,000 in the year 1900 (i.e. when x = 0)

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 18 of 20 Chapter 4 – Section 2 ●Learning objectives  Find the least-squares regression line and use the line to make predictions  Interpret the slope and the y-intercept of the least squares regression line  Compute the sum of squared residuals 1 2 3

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 19 of 20 Chapter 4 – Section 2 ●After finding the slope b 1 and the intercept b 0, it is very useful to compute the residuals, particularly ●Again, this is a tedious computation ●All the least-squares regression software would compute this quantity ●We will use it in future sections

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 20 of 20 Summary: Chapter 4 – Section 2 ●We can find the least-squares regression line that is the “best” linear model for a set of data ●The slope can be interpreted as the change in y for every change of 1 in x ●The intercept can be interpreted as the value of y when x is 0, as long as a value of 0 for x is reasonable