EPI 809/Spring 2008 1 Probability Distribution of Random Error.

Slides:



Advertisements
Similar presentations
Regression and correlation methods
Advertisements

Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Forecasting Using the Simple Linear Regression Model and Correlation
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
Simple Linear Regression and Correlation
Simple Linear Regression
Chapter 12 Simple Linear Regression
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 12 Simple Regression
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Statistics for Business and Economics
Chapter 13 Introduction to Linear Regression and Correlation Analysis
The Simple Regression Model
1 Pertemuan 13 Uji Koefisien Korelasi dan Regresi Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
SIMPLE LINEAR REGRESSION
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter Topics Types of Regression Models
Chapter 11 Multiple Regression.
Statistics for Business and Economics Chapter 11 Multiple Regression and Model Building.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
This Week Continue with linear regression Begin multiple regression –Le 8.2 –C & S 9:A-E Handout: Class examples and assignment 3.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Lecture 19 Simple linear regression (Review, 18.5, 18.8)
Chapter 7 Forecasting with Simple Regression
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Linear Regression/Correlation
© 2011 Pearson Education, Inc. Statistics for Business and Economics Chapter 10 Simple Linear Regression.
Regression and Correlation Methods Judy Zhong Ph.D.
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
Statistics for Business and Economics Chapter 10 Simple Linear Regression.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
12a - 1 © 2000 Prentice-Hall, Inc. Statistics Multiple Regression and Model Building Chapter 12 part I.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
Lecture 10: Correlation and Regression Model.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
1 Experimental Statistics - week 12 Chapter 11: Linear Regression and Correlation Chapter 12: Multiple Regression.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
1 Linear Regression Model. 2 Types of Regression Models.
© 2011 Pearson Education, Inc Statistics for Business and Economics Chapter 10 Simple Linear Regression.
Chapter 13 Simple Linear Regression
Lecture 11: Simple Linear Regression
Chapter 20 Linear and Multiple Regression
Statistics for Business and Economics
Statistics for Managers using Microsoft Excel 3rd Edition
Chapter 11: Simple Linear Regression
PENGOLAHAN DAN PENYAJIAN
Simple Linear Regression
Statistics for Business and Economics
Introduction to Regression
Presentation transcript:

EPI 809/Spring Probability Distribution of Random Error

EPI 809/Spring Regression Modeling Steps  1.Hypothesize Deterministic Component  2.Estimate Unknown Model Parameters  3.Specify Probability Distribution of Random Error Term Estimate Standard Deviation of Error Estimate Standard Deviation of Error  4.Evaluate Model  5.Use Model for Prediction & Estimation

EPI 809/Spring Linear Regression Assumptions Assumptions of errors    n Assumptions of errors    n - Gauss-Markov condition - Gauss-Markov condition 1. Independent errors 2. Mean of probability distribution of errors is 0 3. Errors have constant variance σ 2, for which an estimator is S 2 4. Probability distribution of error is normal 5. Potential violation of G-M condition.

EPI 809/Spring Error Probability Distribution

EPI 809/Spring Random Error Variation

EPI 809/Spring Random Error Variation  1.Variation of Actual Y from Predicted Y

EPI 809/Spring Random Error Variation  1.Variation of Actual Y from Predicted Y  2.Measured by Standard Error of Regression Model Sample Standard Deviation of , s Sample Standard Deviation of , s ^

EPI 809/Spring Random Error Variation  1.Variation of Actual Y from Predicted Y  2.Measured by Standard Error of Regression Model Sample Standard Deviation of , s Sample Standard Deviation of , s  3. Affects Several Factors Parameter Significance Parameter Significance Prediction Accuracy Prediction Accuracy ^

EPI 809/Spring Evaluating the Model Testing for Significance

EPI 809/Spring Regression Modeling Steps  1. Hypothesize Deterministic Component  2.Estimate Unknown Model Parameters  3.Specify Probability Distribution of Random Error Term Estimate Standard Deviation of Error Estimate Standard Deviation of Error  4.Evaluate Model  5.Use Model for Prediction & Estimation

EPI 809/Spring Test of Slope Coefficient  1. Shows If There Is a Linear Relationship Between X & Y  2.Involves Population Slope  1  3.Hypotheses H 0 :  1 = 0 (No Linear Relationship) H 0 :  1 = 0 (No Linear Relationship) H a :  1  0 (Linear Relationship) H a :  1  0 (Linear Relationship)  4.Theoretical basis of the test statistic is the sampling distribution of slope

EPI 809/Spring Sampling Distribution of Sample Slopes

EPI 809/Spring Sampling Distribution of Sample Slopes

EPI 809/Spring Sampling Distribution of Sample Slopes  All Possible Sample Slopes  Sampl e 1:2.5  Sampl e 2:1.6  Sampl e 3:1.8  Sampl e 4:2.1 : : Very large number of sample slopes

EPI 809/Spring Sampling Distribution of Sample Slopes  All Possible Sample Slopes  Samp le 1:2.5  Samp le 2:1.6  Samp le 3:1.8  Samp le 4:2.1 : : large number of sample slopes Sampling Distribution 1111 1111 S ^ ^

EPI 809/Spring Slope Coefficient Test Statistic

EPI 809/Spring Test of Slope Coefficient Rejection Rule  Reject H 0 in favor of H a if t falls in colored area  Reject H 0 for H a if P-value = P(T>|t|) |t|) < α T=t (n-2) 0 t 1-α/2, (n-2) Reject H 0 0 α/2 -t 1-α/2, (n-2) α/2

EPI 809/Spring Test of Slope Coefficient Example  Reconsider the Obstetrics example with the following data: Estriol (mg/24h) B.w. (g/1000)  Is the Linear Relationship between Estriol & Birthweight significant at.05 level?

EPI 809/Spring Solution Table For β’s

EPI 809/Spring Solution Table for SSE Birth weight =y Estriol =x Predicted =y=β 0 + β 1 x (Obs-pred) 2 =( y - y) SSE=1.1 ^^^^

EPI 809/Spring Test of Slope Parameter Solution  H 0 :  1 = 0  H a :  1  0   .05  df  = 3  Critical Value(s): Test Statistic:

EPI 809/Spring Test Statistic Solution From Table

EPI 809/Spring Test of Slope Parameter  H 0 :  1 = 0  H a :  1  0   .05  df  = 3  Critical Value(s): Test Statistic: Decision:Conclusion: Reject at  =.05 There is evidence of a linear relationship

EPI 809/Spring Test of Slope Parameter Computer Output  Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > |t| Intercept Estriol  t =  k / S  P-Value SS kk k k ^ ^ ^ ^

EPI 809/Spring Measures of Variation in Regression  1.Total Sum of Squares (SS yy ) Measures Variation of Observed Y i Around the Mean  Y Measures Variation of Observed Y i Around the Mean  Y  2.Explained Variation (SSR) Variation Due to Relationship Between X & Y Variation Due to Relationship Between X & Y  3.Unexplained Variation (SSE) Variation Due to Other Factors Variation Due to Other Factors

EPI 809/Spring Variation Measures Total sum of squares (Y i -  Y) 2 Unexplained sum of squares (Y i -  Y i ) 2 ^ Explained sum of squares (Y i -  Y) 2 ^ YiYiYiYi

EPI 809/Spring  1.Proportion of Variation ‘Explained’ by Relationship Between X & Y Coefficient of Determination 0  r 2  1

EPI 809/Spring Coefficient of Determination Examples r 2 = 1 r 2 =.8r 2 = 0

EPI 809/Spring Coefficient of Determination Example  Reconsider the Obstetrics example. Interpret a coefficient of Determination of  Answer: About 82% of the total variation of birthweight Is explained by the mother’s Estriol level.

EPI 809/Spring r 2 Computer Output Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var r 2 adjusted for number of explanatory variables & sample size S r2r2

EPI 809/Spring Using the Model for Prediction & Estimation

EPI 809/Spring Regression Modeling Steps  1.Hypothesize Deterministic Component  2.Estimate Unknown Model Parameters  3.Specify Probability Distribution of Random Error Term-Estimate Standard Deviation of Error  4.Evaluate Model  5.Use Model for Prediction & Estimation

EPI 809/Spring Prediction With Regression Models What Is Predicted? Population Mean Response E(Y) for Given X Population Mean Response E(Y) for Given X Point on Population Regression LinePoint on Population Regression Line Individual Response (Y i ) for Given X Individual Response (Y i ) for Given X

EPI 809/Spring What Is Predicted?

EPI 809/Spring Confidence Interval Estimate of Mean Y

EPI 809/Spring Factors Affecting Interval Width  1.Level of Confidence (1 -  ) Width Increases as Confidence Increases Width Increases as Confidence Increases  2.Data Dispersion (s) Width Increases as Variation Increases Width Increases as Variation Increases  3.Sample Size Width Decreases as Sample Size Increases Width Decreases as Sample Size Increases  4.Distance of X p from Mean  X Width Increases as Distance Increases Width Increases as Distance Increases

EPI 809/Spring Why Distance from Mean? Greater dispersion than X 1 XXXX

EPI 809/Spring Confidence Interval Estimate Example  Reconsider the Obstetrics example with the following data: Estriol (mg/24h) B.w. (g/1000)  Estimate the mean BW and a subject’s BW response when the Estriol level is 4 at.05 level.

EPI 809/Spring Solution Table

EPI 809/Spring Confidence Interval Estimate Solution - Mean BW X to be predicted

EPI 809/Spring Prediction Interval of Individual Response Note!

EPI 809/Spring Why the Extra ‘S’?

EPI 809/Spring SAS codes for computing mean and prediction intervals  Data BW; /*Reading data in SAS*/  input estriol birthw;  cards;  11  21  32  42  54  ;  run;  PROC REG data=BW; /*Fitting a linear regression model*/  model birthw=estriol/CLI CLM alpha=.05;  run;

EPI 809/Spring Interval Estimate from SAS- Output The REG Procedure Dependent Variable: y Output Statistics Dep Var Predicted Std Error Obs y Value Mean Predict 95% CL Mean 95% CL Predict Residual Predicted Y when X = 3 Confidence Interval SYSYSYSY^ Prediction Interval

EPI 809/Spring Hyperbolic Interval Bands

EPI 809/Spring Correlation Models

EPI 809/Spring Types of Probabilistic Models

EPI 809/Spring  Both variables are treated the same in correlation; in regression there is a predictor and a response  In regression the x variable is assumed non- random or measured without error  Correlation is used in looking for relationships, regression for prediction Correlation vs. regression

EPI 809/Spring Correlation Models  1.Answer ‘How Strong Is the Linear Relationship Between 2 Variables?’  2.Coefficient of Correlation Used Population Correlation Coefficient Denoted  (Rho) Population Correlation Coefficient Denoted  (Rho) Values Range from -1 to +1 Values Range from -1 to +1 Measures Degree of Association Measures Degree of Association  3.Used Mainly for Understanding

EPI 809/Spring  1.Pearson Product Moment Coefficient of Correlation between x and y: Sample Coefficient of Correlation

EPI 809/Spring Coefficient of Correlation Values

EPI 809/Spring Coefficient of Correlation Values No Correlation

EPI 809/Spring Coefficient of Correlation Values Increasing degree of negative correlation No Correlation

EPI 809/Spring Coefficient of Correlation Values Perfect Negative Correlation No Correlation

EPI 809/Spring Coefficient of Correlation Values Perfect Negative Correlation No Correlation Increasing degree of positive correlation

EPI 809/Spring Coefficient of Correlation Values Perfect Positive Correlation Perfect Negative Correlation No Correlation

EPI 809/Spring Coefficient of Correlation Examples r = 1r = -1 r =.89r = 0

EPI 809/Spring Test of Coefficient of Correlation  1.Shows If There Is a Linear Relationship Between 2 Numerical Variables  2.Same Conclusion as Testing Population Slope  1  3.Hypotheses H 0 :  = 0 (No Correlation) H 0 :  = 0 (No Correlation) H a :   0 (Correlation) H a :   0 (Correlation)

EPI 809/Spring Sample t-Test on Correlation Coefficient  Hypotheses H 0 :  = 0 (No Correlation) H 0 :  = 0 (No Correlation) H a :   0 (Correlation) H a :   0 (Correlation)  test statistic: under H 0 t = r (n-2) 1/2 / (1-r 2 ) 1/2 ~ t (n-2) t = r (n-2) 1/2 / (1-r 2 ) 1/2 ~ t (n-2) Reject H 0 if |t| > t α/2, n-2 Reject H 0 if |t| > t α/2, n-2

EPI 809/Spring Sample Z-Test on Correlation Coefficient  Hypotheses (Fisher) H 0 :  =  0 H 0 :  =  0 H a :    0 H a :    0  test statistic: under H 0 : Reject H 0 if |z| > z 1- α/2 Reject H 0 if |z| > z 1- α/2

EPI 809/Spring Conclusion 1. Describe the Linear Regression Model 2. State the Regression Modeling Steps 3. Explain Ordinary Least Squares 4. Compute Regression Coefficients 5. Understand and check model assumptions 6. Predict Response Variable 7. Comments of SAS Output

EPI 809/Spring Conclusion … 8. Correlation Models 9. Test of coefficient of Correlation