Decomposition of Sum of Squares

Slides:



Advertisements
Similar presentations
Multiple Regression and Model Building
Advertisements

Topic 12: Multiple Linear Regression
Chapter 12 Simple Linear Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
Linear regression models
EPI 809/Spring Probability Distribution of Random Error.
1 Chapter 2 Simple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Chapter 12 Simple Linear Regression
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 10 Simple Regression.
The Simple Regression Model
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
Chapter Topics Types of Regression Models
Chapter 11 Multiple Regression.
BCOR 1020 Business Statistics
1 732G21/732A35/732G28. Formal statement  Y i is i th response value  β 0 β 1 model parameters, regression parameters (intercept, slope)  X i is i.
Simple Linear Regression and Correlation
Simple Linear Regression Analysis
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Introduction to Linear Regression and Correlation Analysis
Chapter 11 Simple Regression
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Simple Linear Regression Models
Statistics for Business and Economics Dr. TANG Yu Department of Mathematics Soochow University May 28, 2007.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
CHAPTER 15 Simple Linear Regression and Correlation
Introduction to Linear Regression
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Lesson 14 - R Chapter 14 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Applied Regression Analysis BUSI 6220
Multiple Regression.
The simple linear regression model and parameter estimation
Simple Linear Regression & Correlation
Lecture #26 Thursday, November 17, 2016 Textbook: 14.1 and 14.3
Decomposition of Sum of Squares
Analysis of Variance in Matrix form
Simple Linear Regression
Multiple Regression and Model Building
Chapter 11 Simple Regression
Relationship with one independent variable
CHAPTER 29: Multiple Regression*
Multiple Regression.
Regression Models - Introduction
Simple Linear Regression
Chapter 14 Inference for Regression
Relationship with one independent variable
Simple Linear Regression
Simple Linear Regression
Linear Regression and Correlation
Simple Linear Regression
Linear Regression and Correlation
Inference about the Slope and Intercept
Regression Models - Introduction
Pearson Correlation and R2
Presentation transcript:

Decomposition of Sum of Squares The total sum of squares (SS) in the response variable is The total SS can be decompose into two main sources; error SS and regression SS… The error SS is The regression SS is It is the amount of variation in Y’s that is explained by the linear relationship of Y with X. STA302/1001 - week 5

Claims First, SSTO = SSR +SSE, that is Proof:…. Alternative decomposition is Proof: Exercises. STA302/1001 - week 5

Analysis of Variance Table The decomposition of SS discussed above is usually summarized in analysis of variance table (ANOVA) as follow: Note that the MSE is s2 our estimate of σ2. STA302/1001 - week 5

Coefficient of Determination The coefficient of determination is It must satisfy 0 ≤ R2 ≤ 1. R2 gives the percentage of variation in Y’s that is explained by the regression line. STA302/1001 - week 5

Claim R2 = r2, that is the coefficient of determination is the correlation coefficient square. Proof:… STA302/1001 - week 5

Important Comments about R2 It is a useful measure but… There is no absolute rule about how big it should be. It is not resistant to outliers. It is not meaningful for models with no intercepts. It is not useful for comparing models unless same Y and one set of predictors is a subset of the other. STA302/1001 - week 5

ANOVE F Test The ANOVA table gives us another test of H0: β1 = 0. The test statistics is Derivations … STA302/1001 - week 5

Prediction of Mean Response Very often, we would want to use the estimated regression line to make prediction about the mean of the response for a particular X value (assumed to be fixed). We know that the least square line is an estimate of Now, we can pick a point, X = x* (in the range in the regression line) then, is an estimate of Claim: Proof: This is the variance of the estimate of E(Y | X=x*). STA302/1001 - week 5

Confidence Interval for E(Y | X = x*) For a given x, x* , a 100(1-α)% CI for the mean value of Y is where STA302/1001 - week 5

Example Consider the smoking and cancer data. Suppose we wish to predict the mean mortality index when the smoking index is 101, that is, when x* = 101…. STA302/1001 - week 5

Prediction of New Observation Suppose we want to predict a particular value of Y* when X = x*. The predicted value of a new point measured when X = x* is Note, the above predicted value is the same as the estimate of E(Y | X = x*). The predicted value has two sources of variability. One is due to the regression line being estimated by b0+b1X. The second one is due to ε* i.e., points don’t fall exactly on line. To calculated the variance in error of prediction we look at the difference STA302/1001 - week 5

Prediction Interval for New Observation 100(1-α)% prediction interval for when X = x* is This is not a confidence interval; CI’s are for parameters and we are estimating a value of a random variable. Prediction interval is wider than CI for E(Y | X = x*). STA302/1001 - week 5

Dummy Variable Regression Dummy or indicator variable takes two values: 0 or 1. It indicates which category an observation is in. Example… Interpretation of regression coefficient in a dummy variable regression… STA302/1001 - week 5