Ordinary Least Squares Estimation: A Primer Projectseminar Migration and the Labour Market, Meeting May 24, 2012 The linear regression model 1. A brief.

Slides:



Advertisements
Similar presentations
Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Advertisements

6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Bivariate Regression Analysis
Multiple regression analysis
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Chapter 10 Simple Regression.
CHAPTER 3 ECONOMETRICS x x x x x Chapter 2: Estimating the parameters of a linear regression model. Y i = b 1 + b 2 X i + e i Using OLS Chapter 3: Testing.
Chapter 12 Simple Regression
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Chapter 11 Multiple Regression.
1 MF-852 Financial Econometrics Lecture 6 Linear Regression I Roy J. Epstein Fall 2003.
Business Statistics - QBM117 Interval estimation for the slope and y-intercept Hypothesis tests for regression.
Dr. Mario MazzocchiResearch Methods & Data Analysis1 Correlation and regression analysis Week 8 Research Methods & Data Analysis.
Correlation and Regression Analysis
Simple Linear Regression Analysis
Relationships Among Variables
Ordinary Least Squares
Lecture 5 Correlation and Regression
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Chapter 11 Simple Regression
5.1 Basic Estimation Techniques  The relationships we theoretically develop in the text can be estimated statistically using regression analysis,  Regression.
Understanding Multivariate Research Berry & Sanders.
Chapter 14 Simple Regression
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Bivariate Regression Assumptions and Testing of the Model Economics 224, Notes for November 17, 2008.
Introduction to Linear Regression
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Examining Relationships in Quantitative Research
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
Lecture 10: Correlation and Regression Model.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Chapter 2 Ordinary Least Squares Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
1 B IVARIATE AND MULTIPLE REGRESSION Estratto dal Cap. 8 di: “Statistics for Marketing and Consumer Research”, M. Mazzocchi, ed. SAGE, LEZIONI IN.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
June 30, 2008Stat Lecture 16 - Regression1 Inference for relationships between variables Statistics Lecture 16.
11 Chapter 5 The Research Process – Hypothesis Development – (Stage 4 in Research Process) © 2009 John Wiley & Sons Ltd.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
1 Regression Review Population Vs. Sample Regression Line Residual and Standard Error of Regression Interpretation of intercept & slope T-test, F-test.
Multiple Regression Analysis: Estimation. Multiple Regression Model y = ß 0 + ß 1 x 1 + ß 2 x 2 + …+ ß k x k + u -ß 0 is still the intercept -ß 1 to ß.
Multiple Regression Analysis Regression analysis with two or more independent variables. Leads to an improvement.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Regression Analysis: A statistical procedure used to find relations among a set of variables B. Klinkenberg G
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Lecturer: Ing. Martina Hanová, PhD. Business Modeling.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
The simple linear regression model and parameter estimation
Chapter 20 Linear and Multiple Regression
Chapter 11: Simple Linear Regression
Correlation and Simple Linear Regression
I271B Quantitative Methods
Prepared by Lee Revere and John Large
Migration and the Labour Market
Correlation and Simple Linear Regression
Simple Linear Regression and Correlation
Linear Regression Summer School IFPRI
Introduction to Regression
Presentation transcript:

Ordinary Least Squares Estimation: A Primer Projectseminar Migration and the Labour Market, Meeting May 24, 2012 The linear regression model 1. A brief introduction into linear regression 2. How to do a regression 3. How to interpret the output in STATA

linear regression 1.A brief introduction into linear regression models In order to find out the relationship between one aspect (variable1) and another aspect (variable2), one may run a regression model. e.g. what is the impact of work experience on the level of wages A regression measures whether and by which extent an exogenous (independent) variable affects an endogenous (dependent) variable.

linear regression 1.A brief introduction into linear regression A regression indicates how much and in which way a independent variable influences a dependent variable. One can distinguish between: A positive and negative correlation A high and low correlation A significant or insignificant impact Definition of significance: You test at which significance level (e.g. 0, 5%, 10%-level) you can reject the hypothesis that the variable has zero impact (so-called “Null- Hypothesis”) or H0.

linear regression 1.A brief introduction into linear regression the general multivariate model (with many explantory variables): γ i indicates the dependent/endogenous variable x 1i,ki exogenous variable, explaining/independent variable β 0 constant, y- axis intercept (if x = 0) β 1,2,k regression coefficient, parameters of regression ε i residual, disturbance term (should be normally distributed, expected value of 0, constant variance)

linear regression 1.short introduction into linear regression In a simple linear regression model, there is beside the constant only one regression coefficient: γ i indicates the dependent/endogenous variable x 1i exogenous variable, explaining/independent variable β 0 constant, y- axis intercept (if x = 0) β 1 regression coefficient, parameter of regression ε i residual, disturbance term

linear regression 1.short introduction into linear regression In a simple linear regression model, there is beside the constant only one regression coefficient: Thus, in this simple linear regression model γ i is explained by the variable x 1i. Moreover there is a constant variable β 0 and x 1i is weighted by β 1. β 1 can be interpreted e.g. as the effect of an e.g. increase of x 1i by one unit on the output (or wage) γ i.

linear regression 1.short introduction into linear regression In a simple linear regression model, there is beside the constant only one regression coefficient: ε i is the disturbance variable and indicates the difference between the result of our estimation done by the regression model and “reality”, the true observed value. The regression is done with the of Ordinary Least Squares (OLS) estimator, which minimizes the squared value of the residual. Nevertheless there is still deviation between the true and the estimated values since we have a stochastic and not a (purely) deterministic relationship.

linear regression 1.short introduction into linear regression β0β0 εiεi β 1i γiγi x 1i

linear regression 1.short introduction into linear regression Models with fixed effects/dummy variables. You consider beyond one constant more constants (intercept) terms for each group. Where α are the constants and dummy variables, and i = 1, 2, … N is the group (e.g. education or experience) index. Thus, you consider N-1 dummy variables which creates a different intercept (constant) for each group. The slope parameter remains however uniform for all groups.

linear regression 2. How to do a regression with STATA > regress depvar [indepvars] [if] [, options ] after the command you first set the dependent variable (=endogenous variable, the variable you want to explain), after that you put the independent variables (exogenous variables). example:

linear regression 3. How to interpret the output of a regression fitting of the model analysis of the coefficients analysis of the variance of the model β0β0 β1β1

linear regression 3. How to interpret the output of a regression analysis of the coefficients β 0 indicates the output if there is no x 1i, i.e. if there is no income the outcome variable (sqm) would be equal to the value of coefficient β 0. β 1 describes how much the output changes if there would be an increase of hhinc by one. Can be positive or negative (-> positive or negative correlation) β0β0 β1β1

linear regression 3. How to interpret the output of a regression analysis of the coefficients Moreover the output gives us the standard error, the t- value and the p-value. The standard error is a measure for the precision of the parameter estimate. The t-value is the coefficient divided by the standard error. As a rule of the thumb, a t-value of 2.0 indicated that the coefficient is significantly different from zero at the 5% level, a t-value of 2.64 that it is differently from zero at the 1% level. β0β0 β1β1

linear regression 3. How to interpret the output of a regression analysis of the coefficients The p-value provides the accurate significance level for the rejection of the Null Hypothesis, i.e. that the estimated parameter is different from zero. A p-value < 0.05 indicates that the significance level is 5%. Reporting: Your report the coefficient and either the standard error or the t-statistics, and indicate the significance levels by stars behind the coefficient. E.g. *** suggest a significance level of 1%, ** of 5%, * of 10%. β0β0 β1β1

linear regression 3. How to interpret the output of a regression fit of the model A good value about the fit of our regression is R-squared, R 2 indicates how good our model can explain the “real” values of y. R 2 = 1 -> perfect fit, our model can explain every single value R 2 = 0 -> no fit, our model is rather useless

linear regression 3. How to interpret the output of a regression fit of the model The adjusted R squared corrects the R squared for the number of variables considered. It’s a slightly better measure than the R squared. Report either the R squared or the adjusted R squared. The other measures are usually not reported.

linear regression 3. How to interpret the output of a regression Analysis of the variance of the model This part of the output indicates the variation of the model and of the residual.