Least Square Regression

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Lesson 10: Linear Regression and Correlation
Objectives 10.1 Simple linear regression
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Inference for Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
Linear regression models
Chapter 3 Bivariate Data
Objectives (BPS chapter 24)
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation Analysis
Simple Linear Regression Analysis
SIMPLE LINEAR REGRESSION
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Simple Linear Regression and Correlation
Simple Linear Regression Analysis
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
STA291 Statistical Methods Lecture 27. Inference for Regression.
M23- Residuals & Minitab 1  Department of ISM, University of Alabama, ResidualsResiduals A continuation of regression analysis.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Chap 13-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 12.
Copyright ©2011 Nelson Education Limited Linear Regression and Correlation CHAPTER 12.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Lecture 10 Chapter 23. Inference for regression. Objectives (PSLS Chapter 23) Inference for regression (NHST Regression Inference Award)[B level award]
[1] Simple Linear Regression. The general equation of a line is Y = c + mX or Y =  +  X.  > 0  > 0  > 0  = 0  = 0  < 0  > 0  < 0.
Regression. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other words, there is a distribution.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 12 Simple Linear Regression.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
Chapter 26: Inference for Slope. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Chapter 20 Linear and Multiple Regression
Regression and Correlation
Inference for Least Squares Lines
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Regression.
Least Square Regression
Inference for Regression
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
Regression.
Simple Linear Regression
Chapter 13 Simple Linear Regression
Lecture 12 More Examples for SLR More Examples for MLR 9/19/2018
The Practice of Statistics in the Life Sciences Fourth Edition
Inference for Regression Lines
Solutions for Tutorial 3
Chapter 12 Regression.
6-1 Introduction To Empirical Models
Regression.
Regression.
Regression.
Regression Chapter 8.
Regression.
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Simple Linear Regression
Chapter Fourteen McGraw-Hill/Irwin
Chapter Thirteen McGraw-Hill/Irwin
Chapter 13 Simple Linear Regression
Presentation transcript:

Least Square Regression More info (as usual) and Computer Output with Interpretations

Where did it get it’s name? The sum of all the errors squared is called the total sum of squared errors (SSE). Calculate the error (residual) and square it.

Population vs. Sample

Four Key Properties of LSR The LSR passes through the point The LSR sum of residuals is zero. The LSR sum of residuals squared is an absolute minimum. The histogram of the residuals for any value of x has a normal distribution (as does the histogram of all the residuals in the LSR)—normally distributed. Look at residual plot to check for constant variance.

Has the number of flights increased over the past years? Searching the Internet to find information on air travel in the United States, data was found on a number of commercial aircraft flying the United States during the years 1990-1998. The dates were recorded as years since 1990. Thus the year 1990 was recorded as year 0.

Has the number of flights increased?

Flights r = .99885 r2 = .99771

Flights—Computer Output Predictor Coef Stdev t-ratio p Constant 2939.93 20.55 143.09 0.000 Years 233.517 4.316 54.11 0.000 s = 33.43 Write the LSR equation from the computer output? Flights = 233.517(Years) + 2939.93

How tall is that building if you know how many stories are in it? 46 “tall” buildings were selected from all over the US. Height = 11.36·stories - 85

How tall is it? 198.6 ft 45 Is this a decent model? Why or why not? If a building had 10 stories, what would its height be? If a building stood 600 ft tall, how many stories would it have? 198.6 ft 45

Is there a relationship between body weight and height? 91 students’ body weight and corresponding height are in this sample.

Height to Weight?

Height to Weight? LSR Equation y-intcp slope P-value The regression equation is WEIGHT = - 205 + 5.09 HEIGHT Predictor Coef Stdev t-ratio p Constant -204.74 29.16 -7.02 0.000 HEIGHT 5.0918 0.4237 12.02 0.000 s = 14.79 R-sq = 61.6% R-sq(adj) = 61.2% Analysis of Variance SOURCE DF SS MS F p Regression 1 31592 31592 144.38 0.000 Error 90 19692 219 Total 91 51284 Unusual Observations Obs. HEIGHT WEIGHT Fit Stdev.Fit Residual 9 72.0 195.00 161.87 2.08 33.13 25 61.0 140.00 105.86 3.62 34.14 40 72.0 215.00 161.87 2.08 53.13 84 68.0 110.00 141.50 1.57 -31.50 slope P-value

Check for bell-shaped (normal) histogram of residuals Height to Weight? Check residual plot of residuals vs. height Check for bell-shaped (normal) histogram of residuals

Height to Weight?

Four Key Properties of LSR The LSR passes through the point HEIGHT Mean 68.717 WEIGHT Mean 145.15 The LSR sum of residuals is zero. SUM Residuals = -0.00029755 The LSR sum of residuals squared is an absolute minimum. SUM Squared Residuals = 19692 The histogram of the residuals for any value of x has a normal distribution (as does the histogram of all the residuals in the LSR)—normally distributed. Look at residual plot for constant variance.

Height to Weight? Hypothesis Testing of Slope, 1: Ho: 1= 0 Ha: 1 0 Predictor Coef Stdev t-ratio p Constant -204.74 29.16 -7.02 0.000 HEIGHT 5.0918 0.4237 12.02 0.000 Since the p-value is less than .05, we reject the null hypothesis and conclude that the slope does not equal zero.

Limitations to Predictions Interpolation Extrapolation Confidence Intervals estimate the mean value of the response variable for a particular value of x. Prediction Intervals (like confidence intervals) are used to describe the variation among individuals with a particular value of x.

Confidence Interval for Slope Formula: b1  t*(s.e. b1) The best estimate of slope is (-----, -----) at a 95% confidence interval.

Confidence Interval for Slope 0.0173098  0.00388426  ( .01342554, .02119406) Since the CI for slope does not contain zero, we can conclude the slope is not equal to zero.