6-3 Multiple Regression 6-3.1 Estimation of Parameters in Multiple Regression.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Introduction to Predictive Modeling with Examples Nationwide Insurance Company, November 2 D. A. Dickey.
Experimental design and analyses of experimental data Lesson 2 Fitting a model to data and estimating its parameters.
12-1 Multiple Linear Regression Models Introduction Many applications of regression analysis involve situations in which there are more than.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
12 Multiple Linear Regression CHAPTER OUTLINE
Analysis of Variance Compares means to determine if the population distributions are not similar Uses means and confidence intervals much like a t-test.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
EPI 809/Spring Probability Distribution of Random Error.
Multiple Linear Regression
Topic 3: Simple Linear Regression. Outline Simple linear regression model –Model parameters –Distribution of error terms Estimation of regression parameters.
Creating Graphs on Saturn GOPTIONS DEVICE = png HTITLE=2 HTEXT=1.5 GSFMODE = replace; PROC REG DATA=agebp; MODEL sbp = age; PLOT sbp*age; RUN; This will.
Partial Regression Plots. Life Insurance Example: (nknw364.sas) Y = the amount of life insurance for the 18 managers (in $1000) X 1 = average annual income.
Multiple regression analysis
Multiple Regression models Estimation Goodness of fit tests
Chapter 11: Inferential methods in Regression and Correlation
Reading – Linear Regression Le (Chapter 8 through 8.1.6) C &S (Chapter 5:F,G,H)
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
EPI809/Spring Testing Individual Coefficients.
This Week Continue with linear regression Begin multiple regression –Le 8.2 –C & S 9:A-E Handout: Class examples and assignment 3.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
Regression and Correlation Methods Judy Zhong Ph.D.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
1 MULTI VARIATE VARIABLE n-th OBJECT m-th VARIABLE.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
4-5 Inference on the Mean of a Population, Variance Unknown Hypothesis Testing on the Mean.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
12a - 1 © 2000 Prentice-Hall, Inc. Statistics Multiple Regression and Model Building Chapter 12 part I.
Topic 14: Inference in Multiple Regression. Outline Review multiple linear regression Inference of regression coefficients –Application to book example.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
Regression Examples. Gas Mileage 1993 SOURCES: Consumer Reports: The 1993 Cars - Annual Auto Issue (April 1993), Yonkers, NY: Consumers Union. PACE New.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
Introduction to Linear Regression
7-1 The Strategy of Experimentation Every experiment involves a sequence of activities: 1.Conjecture – the original hypothesis that motivates the.
Copyright © Cengage Learning. All rights reserved. 12 Simple Linear Regression and Correlation
5-5 Inference on the Ratio of Variances of Two Normal Populations The F Distribution We wish to test the hypotheses: The development of a test procedure.
Why Design? (why not just observe and model?) CopyrightCopyright © Time and Date AS / Steffen Thorsen All rights reserved. About us | Disclaimer.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
Anaregweek11 Regression diagnostics. Regression Diagnostics Partial regression plots Studentized deleted residuals Hat matrix diagonals Dffits, Cook’s.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
The Completely Randomized Design (§8.3)
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
1 Experimental Statistics - week 14 Multiple Regression – miscellaneous topics.
Simple Linear Regression (SLR)
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
Xuhua Xia Correlation and Regression Introduction to linear correlation and regression Numerical illustrations SAS and linear correlation/regression –CORR.
Simple Linear Regression. Data available : (X,Y) Goal : To predict the response Y. (i.e. to obtain the fitted response function f(X)) Least Squares Fitting.
1 Experimental Statistics - week 12 Chapter 12: Multiple Regression Chapter 13: Variable Selection Model Checking.
7-2 Factorial Experiments A significant interaction mask the significance of main effects. Consequently, when interaction is present, the main effects.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
1 Chapter 5 : Volatility Models Similar to linear regression analysis, many time series exhibit a non-constant variance (heteroscedasticity). In a regression.
1 Experimental Statistics - week 13 Multiple Regression Miscellaneous Topics.
Experimental Statistics - week 9
1 Experimental Statistics - week 12 Chapter 11: Linear Regression and Correlation Chapter 12: Multiple Regression.
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Regression Diagnostics
5-5 Inference on the Ratio of Variances of Two Normal Populations
بحث في التحليل الاحصائي SPSS بعنوان :
6-4 Other Aspects of Regression
6-1 Introduction To Empirical Models
Multiple Linear Regression
4-1 Statistical Inference
Presentation transcript:

6-3 Multiple Regression Estimation of Parameters in Multiple Regression

6-3 Multiple Regression Estimation of Parameters in Multiple Regression The least squares function is given by The least squares estimates must satisfy

6-3 Multiple Regression Estimation of Parameters in Multiple Regression The least squares normal equations are The solution to the normal equations are the least squares estimators of the regression coefficients.

6-3 Multiple Regression X’X in Multiple Regression

6-3 Multiple Regression

6-3.1 Estimation of Parameters in Multiple Regression

6-3 Multiple Regression

6-3.2 Inferences in Multiple Regression Test for Significance of Regression

6-3 Multiple Regression Inferences in Multiple Regression Inference on Individual Regression Coefficients

6-3 Multiple Regression Inferences in Multiple Regression Confidence Intervals on the Mean Response and Prediction Intervals

6-3 Multiple Regression Confidence Intervals on the Mean Response and Prediction Intervals The response at the point of interest is and the corresponding predicted value is

6-3 Multiple Regression Inferences in Multiple Regression Confidence Intervals on the Mean Response and Prediction Intervals

6-3 Multiple Regression Checking Model Adequacy Residual Analysis

6-3 Multiple Regression Checking Model Adequacy Residual Analysis

6-3 Multiple Regression Checking Model Adequacy Residual Analysis

6-3 Multiple Regression Checking Model Adequacy Residual Analysis

6-3 Multiple Regression Checking Model Adequacy Residual Analysis

6-3 Multiple Regression Checking Model Adequacy Influential Observations The disposition of points in the x-space is important in determining the properties of the model in R 2, the regression coefficients, and the magnitude of the error mean squares. A large value of D i implies that the ith points is influential. A value of D i >1 would indicate that the point is influential.

6-3 Multiple Regression Checking Model Adequacy

OPTIONS NOOVP NODATE NONUMBER LS=140; DATA ex67; INPUT strength length height label strength='Pull Strength' length='Wire length' height='Die Height'; CARDS; PROC SGSCATTER data=ex67; MATRIX STRENGTH LENGTH HEIGHT; TITLE 'Scatter Plot Matrix for Wire Bond Data'; PROC REG data=ex67; MODEL strength=length height/xpx r CLB CLM CLI; TITLE 'Multiple Regression'; DATA EX67N; INPUT LENGTH HEIGHT DATALINES; DATA EX67N1; SET EX67 EX67N; PROC REG DATA=EX67N1; MODEL STRENGTH=LENGTH HEIGHT/CLM CLI; TITLE 'CIs FOR MEAN RESPONSE AND FUTURE OBSERVATION'; RUN; QUIT; Example Multiple Regression PLOT npp.*Residual.; /* Normal Probability Plot */ PLOT RESIDual.*Pred.; /* Residual Plot */ PLOT Residual.*length; PLOT Residual.*height;

6-3 Multiple Regression

The REG Procedure Model: MODEL1 Model Crossproducts X'X X'Y Y'Y Variable Label Intercept length height strength Intercept Intercept length Wire length height Die Height strength The REG Procedure Model: MODEL1 Dependent Variable: strength Number of Observations Read 25 Number of Observations Used 25 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable Label DF Estimate Error t Value Pr > |t| 95% Confidence Limits Intercept Intercept length Wire length < height Die Height Multiple Regression

Multiple Regression The REG Procedure Model: MODEL1 Dependent Variable: strength Output Statistics Dependent Predicted Std Error Std Error Student Cook's Obs Variable Value Mean Predict 95% CL Mean 95% CL Predict Residual Residual Residual D | |* | | *| | | **| | | *| | | **| | | |* | | |* | | |* | | ***| | | | | | *| | | | | | | | | | | | |***** | | | | | |**** | | **| | | *| | | | | | **| | | |* | | | | | |* | | | | Sum of Residuals 0 Sum of Squared Residuals Predicted Residual SS (PRESS) Multiple Regression

CIs FOR MEAN RESPONSE AND FUTURE OBSERVATION The REG Procedure Model: MODEL1 Dependent Variable: strength Pull Strength Number of Observations Read 27 Number of Observations Used 25 Number of Observations with Missing Values 2 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable Label DF Estimate Error t Value Pr > |t| Intercept Intercept length Wire length <.0001 height Die Height Multiple Regression

CIs FOR MEAN RESPONSE AND FUTURE OBSERVATION The REG Procedure Model: MODEL1 Dependent Variable: strength Pull Strength Output Statistics Dependent Predicted Std Error Obs Variable Value Mean Predict 95% CL Mean 95% CL Predict Residual Sum of Residuals 0 Sum of Squared Residuals Predicted Residual SS (PRESS) Multiple Regression

6-3.3 Checking Model Adequacy

6-3 Multiple Regression Checking Model Adequacy

6-3 Multiple Regression Checking Model Adequacy Multicollinearity

6-3 Multiple Regression Consider the Full Model

OPTIONS NOOVP NODATE NONUMBER LS=100; DATA appraise; INPUT price units age size parking area cond$ CARDS; F G G E G G G G G G G F E G G G F E G F G E F E PROC CORR DATA=APPRAISE; VAR PRICE UNITS AGE SIZE PARKING AREA; TITLE 'CORRELATIONS OF VARIABLES IN MODEL'; PROC REG DATA=APPRAISE; MODEL PRICE=UNITS AGE SIZE PARKING AREA/R VIF; TITLE 'ALL VARIABLES IN MODEL'; PROC REG DATA=APPRAISE; MODEL PRICE=UNITS AGE AREA/R INFLUENCE; TITLE 'REDUCED MODEL'; RUN; QUIT; Example 6-3 Multiple Regression

CORRELATIONS OF VARIABLES IN MODEL CORR 프로시저 6 Variables: price units age size parking area 단순 통계량 변수 N 평균 표준편차 합 최소값 최대값 price units age size parking area 피어슨 상관 계수, N = 24 H0: Rho=0 가정하에서 Prob > |r| price units age size parking area price < < <.0001 units < < <.0001 age size <.0001 < parking area <.0001 < Multiple Regression

ALL VARIABLES IN MODEL The REG Procedure Model: MODEL1 Dependent Variable: price Number of Observations Read 24 Number of Observations Used 24 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model E E <.0001 Error Corrected Total E12 Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variance Variable DF Estimate Error t Value Pr > |t| Inflation Intercept units age size parking area < Multiple Regression

ALL VARIABLES IN MODEL The REG Procedure Model: MODEL1 Dependent Variable: price Output Statistics Dependent Predicted Std Error Std Error Student Cook's Obs Variable Value Mean Predict Residual Residual Residual D | *| | | *| | | | | | **| | | | | | *| | | | | | |** | | | | | |*** | | ***| | | | | | |** | | *| | | |** | | | | | | | | | | | **| | | | | | | | | |**** | | *| | | |**** | Sum of Residuals 0 Sum of Squared Residuals Predicted Residual SS (PRESS) Multiple Regression

A Test for the Significance of a Group of Regressors (Partial F-Test) Suppose that the full model has k regressors, and we are interested in testing whether the last k-r of them can be deleted from the model. This smaller model is called the reduced model. That is, the full model is Then, to test the hypotheses

6-3 Multiple Regression A Test for the Significance of a Group of Regressors (Partial F-Test)

6-3 Multiple Regression

Notes on the Reduced Model

6-3 Multiple Regression Examining the Final Model

6-3 Multiple Regression One More Diagnostic

REDUCED MODEL (NO SIZE AND PARKING) The REG Procedure Model: MODEL1 Dependent Variable: price Number of Observations Read 24 Number of Observations Used 24 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model E E <.0001 Error Corrected Total E12 Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > |t| Intercept <.0001 units age area < Multiple Regression

REDUCED MODEL The REG Procedure Model: MODEL1 Dependent Variable: price Output Statistics Dependent Predicted Std Error Std Error Student Cook's Obs Variable Value Mean Predict Residual Residual Residual D | *| | | **| | | | | | | | | | | | | | | | | | | | | |** | | |* | | ***| | | |* | | |** | | *| | | |** | | | | | | | | | | | ***| | | | | | *| | | |*** | | *| | | |**** | Multiple Regression

REDUCED MODEL The REG Procedure Output Statistics Hat Diag Cov DFBETAS Obs RStudent H Ratio DFFITS Intercept units age area Multiple Regression Sum of Residuals 0 Sum of Squared Residuals Predicted Residual SS (PRESS)

6-3 Multiple Regression