Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Chapter 12 Inference for Linear Regression
Kin 304 Regression Linear Regression Least Sum of Squares
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Inference for Regression
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
Linear regression models
Objectives (BPS chapter 24)
Simple Linear Regression
The Simple Linear Regression Model: Specification and Estimation
Chapter 10 Simple Regression.
9. SIMPLE LINEAR REGESSION AND CORRELATION
Chapter 12 Simple Regression
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
The Simple Regression Model
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
1 Pertemuan 13 Uji Koefisien Korelasi dan Regresi Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 13-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Chapter Topics Types of Regression Models
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Simple Linear Regression Analysis
Introduction to Probability and Statistics Linear Regression and Correlation.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
SIMPLE LINEAR REGRESSION
Korelasi dalam Regresi Linear Sederhana Pertemuan 03 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Simple Linear Regression and Correlation
Chapter 7 Forecasting with Simple Regression
Simple Linear Regression Analysis
Correlation & Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Simple Linear Regression Models
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
A.P. STATISTICS LESSON 14 – 2 ( DAY 2) PREDICTIONS AND CONDITIONS.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Lecture 10: Correlation and Regression Model.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Stats of Engineers, Lecture 8. 1.If the sample mean was larger 2.If you increased your confidence level 3.If you increased your sample size 4.If the population.
Statistics for Managers Using Microsoft® Excel 5th Edition
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
Chapter 12: Correlation and Linear Regression 1.
Stats Methods at IC Lecture 3: Regression.
The simple linear regression model and parameter estimation
Chapter 20 Linear and Multiple Regression
Regression Analysis AGEC 784.
Basic Estimation Techniques
Least-Squares Regression
Stat 112 Notes 4 Today: Review of p-values for one-sided tests
CHAPTER 29: Multiple Regression*
Unit 3 – Linear regression
Least-Squares Regression
Simple Linear Regression
Simple Linear Regression
Presentation transcript:

Simple Linear Regression

Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship strong?  How much variation can be explained by a linear relationship with the independent or explanatory variable?

Beers and BAC

Variance “Candy Bar” ExplainedUnexplained The R-sq value: estimates the percentage of variation explained by a linear relationship with the independent or explanatory variable. Unless this estimate is 100% (or very near), it is not sufficient on its own. The amounts of explained and unexplained information due to the model are measured by Sums of Squares

Decomposition of information into explained and unexplained parts

Residuals residual A residual is the difference between an observed value of the dependent variable and the value predicted by the regression line. Residual = (observed y) - (predicted y)= y – ŷ They help us assess the fit of a regression line.

Variance “Candy Bar” ExplainedUnexplained SS explained by model SS Total SS Error Systematic SS + Random SS = Total SS

Model Assumptions about the residuals (ε) The distribution is NORMAL The mean is ZERO The variance is CONSTANT for all values of x (σ 2 ) Errors associated with any two observations are independent

Assessing the utility of the model: model variance Variance is variability of the random error (σ 2 ) The higher the variability of the random error, the greater the error of prediction σ 2 is estimated with s 2 (often called the mean square for error, MSE) Variance: s 2 = SSE/degrees of freedom (n-2) Standard error:  This is like standard deviation; with standard error, we are looking at deviation from the line  Approximately 95% of observed y values will lie within 2s of their respective predicted values

Assessing the utility of the model: Slope Does y change as x changes? Does x contribute information for the prediction of y? Test this with the t-statistic or p-value (p<.05); these values are included in software output

Assessing the utility of the model: Correlation Coefficient r Measure of the strength and direction of the linear relationship between x and y Always between -1 and +1 High correlation does not imply causality

Assessing the utility of the model: Coefficient of Determination (r 2) The R squared value is the % of the variation in y explained by the model. For linear regression, the higher the value, the better the model.

Using the model for estimation and prediction: Confidence interval for mean response For any specific value of x: A confidence interval for adds to this estimate a margin of error based on the standard error. Confidence intervals widen as the value of x is further from its mean.

Confidence interval for mean response

Prediction interval for a future observation Similar to confidence interval for mean response Standard error used in prediction interval includes  Variability due to the fact that the least- squares line is not exactly equal to the true regression line  Variability of the future response variable y around the subpopulation mean.

Prediction interval for a future observation

In the MINITAB regression window, you might want to… Set confidence levels in Options Enter a value for prediction in Options Store Residuals and Fits in Storage Display full table of fits and residuals in Results (select last bullet)

Beware of Extrapolation Extrapolation is the use of a regression line for prediction far outside the range of values of the independent variable x that you used to obtain the line. Such predictions are not accurate.

Example from book: p. 138 How can we tell if it is reasonable to fit a linear regression model? Let’s run the analysis and interpret the results