Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,

Slides:



Advertisements
Similar presentations
Properties of Least Squares Regression Coefficients
Advertisements

Multiple Regression Analysis
The Simple Regression Model
3.2 OLS Fitted Values and Residuals -after obtaining OLS estimates, we can then obtain fitted or predicted values for y: -given our actual and predicted.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Lecture 9 Today: Ch. 3: Multiple Regression Analysis Example with two independent variables Frisch-Waugh-Lovell theorem.
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
Part 1 Cross Sectional Data
The Simple Linear Regression Model: Specification and Estimation
The Simple Regression Model
Chapter 10 Simple Regression.
2.5 Variances of the OLS Estimators
Simple Linear Regression
Econ Prof. Buckles1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
FIN357 Li1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
FIN357 Li1 The Simple Regression Model y =  0 +  1 x + u.
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
FIN357 Li1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
The Simple Regression Model
Lecture 1 (Ch1, Ch2) Simple linear regression
BCOR 1020 Business Statistics
FIN357 Li1 The Simple Regression Model y =  0 +  1 x + u.
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
Ordinary Least Squares
Lecture 5 Correlation and Regression
3. Multiple Regression Analysis: Estimation -Although bivariate linear regressions are sometimes useful, they are often unrealistic -SLR.4, that all factors.
Regression and Correlation Methods Judy Zhong Ph.D.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Introduction to Linear Regression and Correlation Analysis
Chapter 11 Simple Regression
Hypothesis Testing in Linear Regression Analysis
Chapter 4-5: Analytical Solutions to OLS
7.1 Multiple Regression More than one explanatory/independent variable This makes a slight change to the interpretation of the coefficients This changes.
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
Ordinary Least Squares Estimation: A Primer Projectseminar Migration and the Labour Market, Meeting May 24, 2012 The linear regression model 1. A brief.
1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u.
2.4 Units of Measurement and Functional Form -Two important econometric issues are: 1) Changing measurement -When does scaling variables have an effect.
Copyright © 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin The Two-Variable Model: Hypothesis Testing chapter seven.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
1 B IVARIATE AND MULTIPLE REGRESSION Estratto dal Cap. 8 di: “Statistics for Marketing and Consumer Research”, M. Mazzocchi, ed. SAGE, LEZIONI IN.
1 Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
11 Chapter 5 The Research Process – Hypothesis Development – (Stage 4 in Research Process) © 2009 John Wiley & Sons Ltd.
1 AAEC 4302 ADVANCED STATISTICAL METHODS IN AGRICULTURAL RESEARCH Part II: Theory and Estimation of Regression Models Chapter 5: Simple Regression Theory.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis: Inference
Multiple Regression Analysis: Estimation
Announcements Reminder: Exam 1 on Wed.
The simple linear regression model and parameter estimation
Linear Regression with One Regression
REGRESSION G&W p
Ch. 2: The Simple Regression Model
Multiple Regression Analysis: Estimation
The Simple Linear Regression Model: Specification and Estimation
Chapter 5: The Simple Regression Model
More on Specification and Data Issues
The Simple Regression Model
Multiple Regression Analysis
More on Specification and Data Issues
Chapter 15 Linear Regression
Ch. 2: The Simple Regression Model
Chapter 6: MULTIPLE REGRESSION ANALYSIS
Simple Linear Regression
Seminar in Economics Econ. 470
The Simple Regression Model
The Simple Regression Model
Linear Regression Summer School IFPRI
More on Specification and Data Issues
Introduction to Regression
Presentation transcript:

Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance, and 3-4 of you have told me you have a class ending at 4:15)

The Simple Regression Model Properties of OLS on any sample of data Algebraic properties of OLS regression: always going to be the case on the best-fit line through our data. (They come from the first order conditions used to define the estimators.) Deviations from regression line sum up to zero Sample covariance between residuals and regressors is zero Sample averages of y and x lie on regression line

The Simple Regression Model Goodness-of-Fit Measures of Variation “How well does the explanatory variable explain the dependent variable?“ Total sum of squares, represents total variation in dependent variable Explained sum of squares, represents variation explained by regression Residual sum of squares, represents variation not explained by regression

The Simple Regression Model Decomposition of total variation Goodness-of-fit measure (R-squared) Total variation Explained part Unexplained part R-squared measures the fraction of the total variation that is explained by the regression

The Simple Regression Model CEO Salary and return on equity Caution: A high R-squared does not necessarily mean that your use of regression is successful, just like a low R-squared does not mean that your finding is unimportant. The regression explains only 1.3 % of the total variation in salaries

Example in Stata output (using dataset wage1) What percent of the variation in hourly wage is explained by years of education? Which number do you think is the Total Sum of Squares? Which number do you think is the Explained Sum of Squares?

The Simple Regression Model Variances of the OLS estimators Depending on the sample, the estimates will be nearer or farther away from the true population values How far can we expect our estimates to be away from the true population values on average (= sampling variability)? Sampling variability is measured by the estimator‘s variances Assumption SLR.5 (Homoskedasticity) The value of the explanatory variable must contain no information about the variability of the unobserved factors

The Simple Regression Model Graphical illustration of homoskedasticity The variability of the unobserved influences does not dependent on the value of the explanatory variable

The Simple Regression Model An example for heteroskedasticity: Wage and education The variance of the unobserved determinants of wages changes with the level of education. Does it increase or decrease with education?

The Simple Regression Model Theorem 2.2 (Variances of OLS estimators) Conclusion: The sampling variability of the estimated regression coefficients will be higher, the larger the variability of the unobserved factors, and it will be lower, the higher the variation in the explanatory variable. How to structure an experiment testing effect of hours of SAT instruction on score? Under assumptions SLR.1 – SLR.5:

The Simple Regression Model Estimating the error variance The variance of u does not depend on x, i.e. is equal to the unconditional variance One could estimate the variance of the errors by calculating the variance of the residuals in the sample; unfortunately this estimate would be biased An unbiased estimate of the error variance can be obtained by substracting the number of estimated regression coefficients from the number of observations Theorem 2.3 proves unbiasedness of this estimator under SLR1-SLR5.

The Simple Regression Model Once we have estimated we can proceed to estimate the sampling variance of our regression coefficients. Calculation of standard errors (estimator for standard deviation) of regression coefficients Plug in for the unknown The estimated standard deviations of the regression coefficients are called “standard errors“. They measure how precisely the regression coefficients are estimated. This will be very important for us to be able to do hypothesis testing using our estimates.

Multiple Regression Analysis: Estimation Chapter 3 Wooldridge: Introductory Econometrics: A Modern Approach, 5e

Multiple Regression Analysis: Estimation Definition of the multiple linear regression model „Explains variable in terms of variables “ Intercept Slope parameters Dependent variable, explained variable, response variable,… Error term, disturbance, unobservables,… Independent variables, explanatory variables, regressors,…

Multiple Regression Analysis: Estimation Motivation for multiple regression Incorporate more explanatory factors into the model Explicitly hold fixed other factors that otherwise would be in Allow for more flexible functional forms Example: Wage equation Now measures effect of education explicitly holding experience fixed All other factors… Hourly wage Years of education Labor market experience

Multiple Regression Analysis: Estimation Example: Average test scores and per student spending Per student spending is likely to be correlated with average family income at a given high school because of school financing Omitting average family income in regression would lead to biased estimate of the effect of spending on average test scores In a simple regression model, effect of per student spending would partly include the effect of family income on test scores Other factors Average standardized test score of school Per student spending at this school Average family income of students at this school

Multiple Regression Analysis: Estimation Example: Family income and family consumption Model has two explanatory variables: inome and income squared Consumption is explained as a quadratic function of income One has to be very careful when interpreting the coefficients: Other factors Family consumption Family income Family income squared Depends on how much income is already there By how much does consumption increase if income is increased by one unit?

Multiple Regression Analysis: Estimation OLS Estimation of the multiple regression model Random sample Regression residuals Minimize sum of squared residuals Minimization will be carried out by computer

Multiple Regression Analysis: Estimation Interpretation of the multiple regression model The multiple linear regression model manages to hold the values of other explanatory variables fixed even if, in reality, they are correlated with the explanatory variable under consideration “Ceteris paribus“-interpretation It has still to be assumed that unobserved factors do not change if the explanatory variables are changed By how much does the dependent variable change if the j-th independent variable is increased by one unit, holding all other independent variables and the error term constant

Multiple Regression Analysis: Estimation Example: Determinants of college GPA Interpretation Holding ACT fixed, another point on high school grade point average is associated with another .453 points college grade point average Or: If we compare two students with the same ACT, but the hsGPA of student A is one point higher, we predict student A to have a colGPA that is .453 higher than that of student B Holding high school grade point average fixed, another 10 points on ACT are associated with less than one point on college GPA Grade point average at college High school grade point average Achievement test score