6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Chapter 12 Simple Linear Regression
Experimental design and analyses of experimental data Lesson 2 Fitting a model to data and estimating its parameters.
Probability & Statistical Inference Lecture 9
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
EPI 809/Spring Probability Distribution of Random Error.
Simple Linear Regression and Correlation
Objectives (BPS chapter 24)
1 Chapter 2 Simple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Chapter 12 Simple Linear Regression
Creating Graphs on Saturn GOPTIONS DEVICE = png HTITLE=2 HTEXT=1.5 GSFMODE = replace; PROC REG DATA=agebp; MODEL sbp = age; PLOT sbp*age; RUN; This will.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Multiple regression analysis
Chapter 10 Simple Regression.
Chapter 12 Simple Regression
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
SIMPLE LINEAR REGRESSION
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
1 Regression Analysis Regression used to estimate relationship between dependent variable (Y) and one or more independent variables (X). Consider the variable.
This Week Continue with linear regression Begin multiple regression –Le 8.2 –C & S 9:A-E Handout: Class examples and assignment 3.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Simple Linear Regression and Correlation
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Lecture 5 Correlation and Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
Regression Examples. Gas Mileage 1993 SOURCES: Consumer Reports: The 1993 Cars - Annual Auto Issue (April 1993), Yonkers, NY: Consumers Union. PACE New.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
5-5 Inference on the Ratio of Variances of Two Normal Populations The F Distribution We wish to test the hypotheses: The development of a test procedure.
Chapter 11 Linear Regression Straight Lines, Least-Squares and More Chapter 11A Can you pick out the straight lines and find the least-square?
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
VI. Regression Analysis A. Simple Linear Regression 1. Scatter Plots Regression analysis is best taught via an example. Pencil lead is a ceramic material.
Simple Linear Regression. Data available : (X,Y) Goal : To predict the response Y. (i.e. to obtain the fitted response function f(X)) Least Squares Fitting.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
1 Experimental Statistics - week 12 Chapter 11: Linear Regression and Correlation Chapter 12: Multiple Regression.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
Chapter Outline EMPIRICAL MODELS 11-2 SIMPLE LINEAR REGRESSION 11-3 PROPERTIES OF THE LEAST SQUARES ESTIMATORS 11-4 SOME COMMENTS ON USES OF REGRESSION.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
6-1 Introduction To Empirical Models
Simple Linear Regression
St. Edward’s University
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

6-1 Introduction To Empirical Models

Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is related to x by the following straight-line relationship: where the slope and intercept of the line are called regression coefficients. The simple linear regression model is given by where  is the random error term.

6-1 Introduction To Empirical Models

We think of the regression model as an empirical model. Suppose that the mean and variance of  are 0 and  2, respectively, then The variance of Y given x is 6-1 Introduction To Empirical Models

The true regression model is a line of mean values: where  1 can be interpreted as the change in the mean of Y for a unit change in x. Also, the variability of Y at a particular value of x is determined by the error variance,  2. This implies there is a distribution of Y-values at each x and that the variance of this distribution is the same at each x. 6-1 Introduction To Empirical Models

A Multiple Linear Regression Model:

6-2 Simple Linear Regression Least Squares Estimation The case of simple linear regression considers a single regressor or predictor x and a dependent or response variable Y. The expected value of Y at each level of x is a random variable: We assume that each observation, Y, can be described by the model

6-2 Simple Linear Regression Least Squares Estimation Suppose that we have n pairs of observations (x 1, y 1 ), (x 2, y 2 ), …, (x n, y n ). The method of least squares is used to estimate the parameters,  0 and  1 by minimizing the sum of the squares of the vertical deviations in Figure 6-6.

6-2 Simple Linear Regression Least Squares Estimation Using Equation 6-8, t he n observations in the sample can be expressed as The sum of the squares of the deviations of the observations from the true regression line is

6-2 Simple Linear Regression Least Squares Estimation

6-2 Simple Linear Regression Least Squares Estimation

6-2 Simple Linear Regression Least Squares Estimation

6-2 Simple Linear Regression Least Squares Estimation

6-2 Simple Linear Regression Least Squares Estimation

6-2 Simple Linear Regression Least Squares Estimation

6-2 Simple Linear Regression Least Squares Estimation

6-2 Simple Linear Regression Least Squares Estimation

6-2 Simple Linear Regression Sums of Squares and Cross-products Matrix The Sums of squares and cross-products matrix is a convenient way to summarize the quantities needed to do the hand calculations in regression. It also plays a key role in the internal calculations of the computer. It is outputed from PROC REG and PROC GLM if the XPX option is included on the model statement. The elements are X’X InterceptXY n X Y

6-2 Simple Linear Regression Least Squares Estimation

6-2 Simple Linear Regression Regression Assumptions and Model Properties

6-2 Simple Linear Regression Regression Assumptions and Model Properties

6-2 Simple Linear Regression Regression and Analysis of Variance

6-2 Simple Linear Regression Regression and Analysis of Variance

6-2 Simple Linear Regression Testing Hypothesis in Simple Linear Regression Use of t-Tests Suppose we wish to test An appropriate test statistic would be

6-2 Simple Linear Regression Testing Hypothesis in Simple Linear Regression Use of t-Tests We would reject the null hypothesis if

6-2 Simple Linear Regression Testing Hypothesis in Simple Linear Regression Use of t-Tests Suppose we wish to test An appropriate test statistic would be

6-2 Simple Linear Regression Testing Hypothesis in Simple Linear Regression Use of t-Tests We would reject the null hypothesis if

6-2 Simple Linear Regression Testing Hypothesis in Simple Linear Regression Use of t-Tests An important special case of the hypotheses of Equation 6-23 is These hypotheses relate to the significance of regression. Failure to reject H 0 is equivalent to concluding that there is no linear relationship between x and Y.

6-2 Simple Linear Regression Testing Hypothesis in Simple Linear Regression Use of t-Tests

6-2 Simple Linear Regression Testing Hypothesis in Simple Linear Regression Use of t-Tests

6-2 Simple Linear Regression

The Analysis of Variance Approach

6-2 Simple Linear Regression Testing Hypothesis in Simple Linear Regression The Analysis of Variance Approach

6-2 Simple Linear Regression Confidence Intervals in Simple Linear Regression

6-2 Simple Linear Regression Confidence Intervals in Simple Linear Regression

6-2 Simple Linear Regression

6-2.4 Prediction of Future Observations

6-2 Simple Linear Regression

6-2.4 Prediction of Future Observations

6-2 Simple Linear Regression Checking Model Adequacy Fitting a regression model requires several assumptions. 1.Errors are uncorrelated random variables with mean zero; 2.Errors have constant variance; and, 3.Errors be normally distributed. The analyst should always consider the validity of these assumptions to be doubtful and conduct analyses to examine the adequacy of the model

6-2 Simple Linear Regression Checking Model Adequacy The residuals from a regression model are e i = y i - ŷ i, where y i is an actual observation and ŷ i is the corresponding fitted value from the regression model. Analysis of the residuals is frequently helpful in checking the assumption that the errors are approximately normally distributed with constant variance, and in determining whether additional terms in the model would be useful.

6-2 Simple Linear Regression Checking Model Adequacy

6-2 Simple Linear Regression Checking Model Adequacy

6-2 Simple Linear Regression Checking Model Adequacy

6-2 Simple Linear Regression Checking Model Adequacy

6-2 Simple Linear Regression Checking Model Adequacy

6-2 Simple Linear Regression Example 6-1 OPTIONS NOOVP NODATE NONUMBER LS=140; DATA ex61; INPUT salt area LABEL salt='Salt Conc' area='Roadway area'; CARDS; ods graphics on; PROC REG DATA=EX61; MODEL SALT=AREA/XPX R; DATA EX61N; AREA=1.25; OUTPUT; DATA EX61N1; SET EX61 EX61N; ods graphics off; PROC REG DATA=EX61N1; MODEL SALT=AREA/CLM CLI; /* CLM for (100-??) % confidence limits for the expected value of the dependent variable, CLI for (100-??) % confidence limits for an individual predicted value */ TITLE 'CIs FOR MEAN RESPONSE AND FUTURE OBSERVATION'; RUN; QUIT;

6-2 Simple Linear Regression Linear Regression of SALT vs AREA The REG Procedure Model: MODEL1 Dependent Variable: salt Salt Conc Number of Observations Read 20 Number of Observations Used Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable Label DF Estimate Error t Value Pr > |t| Intercept Intercept area Roadway area <.0001 Linear Regression of SALT vs AREA The REG Procedure Model: MODEL1 Model Crossproducts X'X X'Y Y'Y Variable Label Intercept area salt Intercept Intercept area Roadway area salt Salt Conc

6-2 Simple Linear Regression SAS 시스템 The REG Procedure Model: MODEL1 Dependent Variable: salt Salt Conc Output Statistics Dependent Predicted Std Error Std Error Student Cook's Obs Variable Value Mean Predict Residual Residual Residual D | **| | | | | | |* | | | | | | | | | | | |* | | |* | | | | | ****| | | | | | |**** | | **| | | |** | | | | | | | | | | | *| | | **| | | |** | Sum of Residuals 0 Sum of Squared Residuals Predicted Residual SS (PRESS)

6-2 Simple Linear Regression

CIs FOR MEAN RESPONSE AND FUTURE OBSERVATION The REG Procedure Model: MODEL1 Dependent Variable: salt Salt Conc Number of Observations Read 21 Number of Observations Used 20 Number of Observations with Missing Values 1 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable Label DF Estimate Error t Value Pr > |t| Intercept Intercept area Roadway area <.0001

6-2 Simple Linear Regression CIs FOR MEAN RESPONSE AND FUTURE OBSERVATION The REG Procedure Model: MODEL1 Dependent Variable: salt Salt Conc Output Statistics Dependent Predicted Std Error Obs Variable Value Mean Predict 95% CL Mean 95% CL Predict Residual Sum of Residuals 0 Sum of Squared Residuals Predicted Residual SS (PRESS)

6-2 Simple Linear Regression Correlation and Regression The sample correlation coefficient between X and Y is

6-2 Simple Linear Regression Correlation and Regression The sample correlation coefficient is also closely related to the slope in a linear regression model

6-2 Simple Linear Regression

6-2.6 Correlation and Regression It is often useful to test the hypotheses The appropriate test statistic for these hypotheses is Reject H 0 if |t 0 | > t  /2,n-2.

6-2 Simple Linear Regression OPTIONS NOOVP NODATE NONUMBER LS=80; DATA ex61; INPUT salt area LABEL salt='Salt Conc' area='Roadway area'; CARDS; PROC CORR DATA=EX61; VAR SALT AREA; TITLE 'Correlation between SALT and AREA'; PROC REG; MODEL salt=area/XPX; TITLE 'Linear Regression of SALT vs AREA'; RUN; QUIT; Example 6-1 (Continued)

6-2 Simple Linear Regression Correlation between SALT and AREA CORR 프로시저 2 개의 변수 : salt area 단순 통계량 변수 N 평균 표준편차 합 최솟값 최댓값 salt area 단순 통계량 변수 레이블 salt Salt Conc area Roadway area 피어슨 상관 계수, N = 20 H0: Rho=0 가정하에서 Prob > |r| salt area salt Salt Conc <.0001 area Roadway area <.0001

6-2 Simple Linear Regression Linear Regression of SALT vs AREA The REG Procedure Model: MODEL1 Model Crossproducts X'X X'Y Y'Y Variable Label Intercept area salt Intercept Intercept area Roadway area salt Salt Conc The REG Procedure Model: MODEL1 Dependent Variable: salt Salt Conc Number of Observations Read 20 Number of Observations Used 20 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable Label DF Estimate Error t Value Pr > |t| Intercept Intercept area Roadway area <.0001