Migration and the Labour Market

Slides:



Advertisements
Similar presentations
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Ch11 Curve Fitting Dr. Deshi Ye
Objectives (BPS chapter 24)
Chapter 10 Simple Regression.
BA 555 Practical Business Analysis
Linear Regression with One Regression
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter Topics Types of Regression Models
Chapter 11 Multiple Regression.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Pertemua 19 Regresi Linier
Simple Linear Regression Analysis
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
Chapter 11 Simple Regression
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
STATA: An Introduction Into the Basics Prof. Dr. Herbert Brücker University of Bamberg Seminar “Migration and the Labour Market” May 31, 2012.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Ordinary Least Squares Estimation: A Primer Projectseminar Migration and the Labour Market, Meeting May 24, 2012 The linear regression model 1. A brief.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Class 5 Multiple Regression CERAM February-March-April 2008 Lionel Nesta Observatoire Français des Conjonctures Economiques
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
Chapter 4: Basic Estimation Techniques
Chapter 4 Basic Estimation Techniques
Linear Regression with One Regression
Regression Analysis AGEC 784.
Inference for Least Squares Lines
Vera Tabakova, East Carolina University
Statistics for Managers using Microsoft Excel 3rd Edition
Basic Estimation Techniques
Kakhramon Yusupov June 15th, :30pm – 3:00pm Session 3
THE LINEAR REGRESSION MODEL: AN OVERVIEW
PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
Fundamentals of regression analysis
Chapter 11 Simple Regression
Correlation and Simple Linear Regression
I271B Quantitative Methods
CHAPTER 29: Multiple Regression*
CHAPTER 26: Inference for Regression
Chapter 6: MULTIPLE REGRESSION ANALYSIS
6-1 Introduction To Empirical Models
Multiple Regression Models
PENGOLAHAN DAN PENYAJIAN
Correlation and Simple Linear Regression
Simple Linear Regression
Simple Linear Regression and Correlation
Seminar in Economics Econ. 470
BEC 30325: MANAGERIAL ECONOMICS
The Simple Regression Model
Linear Panel Data Models
3.2. SIMPLE LINEAR REGRESSION
BEC 30325: MANAGERIAL ECONOMICS
Introduction to Regression
St. Edward’s University
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Migration and the Labour Market STATA II: An Introduction into Panel Regression Models Dr. Ehsan Vallizadeh Bamberg, June 6, 2018

Last meeting Generating dummy variables Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Last meeting Generating dummy variables Gen command, replace command Advanced generation (forvalues 1/3 {….}) Organise your work with globals glo name varlist $name Summary statistics sum varlist 2

Last meeting II 4 Making Graphs One Y-axis: Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Last meeting II 4 Making Graphs One Y-axis: graph twoway line var1 var2 if ed == 1 & ex ==1 Two Y-axes: graph twoway line (var1 var2) (var3 var2, yaxis(2)) if… Scatter plots: graph scatter var1 var2 if … Scatter plots with regression line: graph scatter var1 var2 || lfit var1 var2 if … 3

Discussing your ‘homework’ Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Today’s Meeting Discussing your ‘homework’ Generating the data sets and dummy variable Descriptive statistics Graphs Problems? Introduction into regressions commands in STATA Discussing the next steps 4

What is a linear regression model? Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models What is a linear regression model? The general econometric model: yi indicates the dependent (or: endogenous) variable x1i,ki exogenous variable, explaining the independent variable β0 constant or the y-axis intercept (if x = 0) β1,2,k regression coefficient or parameter of regression εi residual, disturbance term.

Migration and the Labor Market Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models The OLS estimator The ordinary least square (OLS) estimator minimizes the squared deviations from the linear regression line Under the assumption that (i) the error term is normally distributed and has an expected mean value of zero, and (ii) the variance of the error term is constant and limited, the OLS estimator delivers unbiased results. Depending on the sample size, you can draw inference on the total population For the example, you can use the standard errors or t-statistics of the coefficients of the explanatory variables to test the null-hypothesis whether the estimated coefficient is zero

The OLS estimator: problems Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models The OLS estimator: problems Endogeneity (simultaneous equation bias): if your explanatory variables are not truly exogenous but correlated with the explanatory variable, the mean value of the error term isn’t zero and you obtain biased estimates. Solutions: Instrumental Variable (IV) estimation, natural experiments Omitted variable bias: If you have omitted relevant explanatory variables (i) the estimates of the remaining coefficients might be biased and (ii) the error term might be not zero.

The OLS estimator: problems (cont.) Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models The OLS estimator: problems (cont.) Multicollinearity: If your explanatory variables are correlated, you may have problems to identify the true correlation coefficient for the individual regressors, in particular when your sample is small. Is less serious. Heteroscedasticity: If the variance is not constant across groups in your sample (e.g. education groups) the standard errors are not properly estimated and the coefficients might be biased as well. Solutions: Robust Standard Error estimates, Generalized Least Square (GLS) estimation, and others.

The OLS estimator: problems (cont.) Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models The OLS estimator: problems (cont.) Contemporary correlation of the error term: If your error term across the panels is contemporaneously correlated, e.g. due to simultaneous time shocks, you obtain biased estimates Solutions: GLS-, Generalized Methods of Moments estimators And many, many others ….

Regression Analysis with STATA Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Regression Analysis with STATA

Running regressions: General Syntax Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Running regressions: General Syntax The standard OLS regression syntax in STATA is: regress depvar [list of indepvar ] [if], [options] Example: Regress log wage on migration share, controlling for education, working experience and time regress ln_wqjt m_qjt $D_i $D_j $D_t

Recall the Borjas (2003)-Modell Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Recall the Borjas (2003)-Modell This model in STATA Syntax: regress ln_wqjt m_qjt $D_i $D_j $D_t $D_ed_ex $D_ed_t $D_ex_t where ln_wqjt: dependent variable (log wage) mqjt: migration share in educatipn-experience cell $D_i: global for education dummies $D_j: global for experience dummies $D_t: global for time dummies $D_ed_ex: global for interaction education-experience dummies $D_ed_t : global for education-time interaction dummies $D_ex_t: global for experience-time interaction dummies

Running a regression model Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Running a regression model Globals ! Regression command Dependent variable Independent variables

Running a Regression: Output Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Running a Regression: Output

How to interpret the output of a regression Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models How to interpret the output of a regression variance of model degrees of freedom 1. Observations 2. fit of the model 3. F-Test 4. R-squared 5. adjusted R-squared 6. Root Mean Standard Error β1 95% confidence interval analysis of significance levels β0

Instumental Variable (IV) models Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Instumental Variable (IV) models Instrumental variables (IVs) are variables which are correlated with the (potentially) endogenous explanatory variable, but not the dependent variable and, hence, not the error term Syntax of a IV model in stata: ivregress depvar indepvar (endogenous variable = iv) Example: ivregress ln_wqjt $D_i $D_j … (mqjt = iv1 iv2 …)

Migration and the Labor Market Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Panel Models Very often you use panel models, i.e. models which have a group and time series dimension There exist special estimators for this, e.g. fixed or random effects models A fixed effects model is a model where you have a fixed (constant) effect for each individual/group. This is equivalent to a dummy variable for each group A random effects model is a model where you have a random effect for each individual group, which is based on assumptions on the distribution of individual effects

Preparation for Panel Models: Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Panel Models Preparation for Panel Models: For running panel models STATA needs to identify the group(individual) and time series dimension Therefore you need an index for each group and an index for each time period Then use the tsset command to organize you dataset as a panel data set Syntax: tsset index year where index is the group/individual index and year the time index

Preparation: Running the tsset command Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Preparation: Running the tsset command

Running Regressions: Panel Models Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Running Regressions: Panel Models Then you can use panel estimators, e.g. the xtreg estimator Syntax xtreg depvar [list of indepvar ] [if], [options] xtreg ln_wqjt m_qjt, fe i.e. in the example we run a simple fixed effects panel regression model which is equivalent to include a dummy variable for each group (in this case education-experience group), but: no time or interaction dummies

Running a Panel Regression: command Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Running a Panel Regression: command

Running a Panel Regression: Output Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Running a Panel Regression: Output

Running Regressions: Panel Models Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Running Regressions: Panel Models There are other features of panel estimators which are helpful Heteroscedasticity: Heteroscedasticity: the variance is not constant, but varies across groups xtpcse , p(h) corrects for heteroscedastic standard errors xtgls , p(h) corrects coefficient and standard errors for panel heteroscedasticity, but may produce biased results depending on the group and time dimension of the panel Note: p(h) after the comma is a so-called option in the STATA syntax

Running Regressions: Panel Models Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Running Regressions: Panel Models Contemporary correlation across cross-sections Contemporary correlation: the error terms are contemporarily correlated across cross-sections, e.g. due to macroeconomic disturbances xtgls , p(c) corrects for contemporary correlation and panel heteroscedasticity, but may produce biased results depending on the group and time dimension of the panel.

Plan for the presentation Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Plan for the presentation Introduction into your country and outline of research question [Very brief sketch of state of research] Theoretical hypothesis/hypotheses Description of data Data sources Descriptive Statistics Graphical presentation of descriptive evidence Econometric findings Regression model Regression results Interpretation Conclusions

Outline for next meeting: Descriptive data analysis Migration and the Labor Market Session 4: Introduction into Panel-Regression-Models Next Meeting on June 21, 2018 Outline for next meeting: Descriptive data analysis 26