Pooling Cross Sections across Time: Simple Panel Data Methods

Slides:



Advertisements
Similar presentations
Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Advertisements

PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
Economics 20 - Prof. Anderson1 Panel Data Methods y it = x it k x itk + u it.
Pooled Cross Sections and Panel Data I
Economics 20 - Prof. Anderson
Data organization.
Random Assignment Experiments
Lecture 8 (Ch14) Advanced Panel Data Method
Lecture 4 Econ 488. Ordinary Least Squares (OLS) Objective of OLS  Minimize the sum of squared residuals: where Remember that OLS is not the only possible.
Econ Prof. Buckles1 Welcome to Econometrics What is Econometrics?
Economics 20 - Prof. Anderson
Chapter 10 Simple Regression.
Pooled Cross Sections and Panel Data II
1 Ka-fu Wong University of Hong Kong Forecasting with Regression Models.
Violations of Assumptions In Least Squares Regression.
6.4 Prediction -We have already seen how to make predictions about our dependent variable using our OLS estimates and values for our independent variables.
1Prof. Dr. Rainer Stachuletz Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
1Prof. Dr. Rainer Stachuletz Panel Data Methods y it =  0 +  1 x it  k x itk + u it.
ECON 6012 Cost Benefit Analysis Memorial University of Newfoundland
Introduction 1 Panel Data Analysis. And now for… Panel Data! Panel data has both a time series and cross- section component Observe same (eg) people over.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Analysis of Cross Section and Panel Data Yan Zhang School of Economics, Fudan University CCER, Fudan University.
10. Basic Regressions with Times Series Data 10.1 The Nature of Time Series Data 10.2 Examples of Time Series Regression Models 10.3 Finite Sample Properties.
Public Policy Analysis ECON 3386 Anant Nyshadham.
Simultaneous Equations Models A simultaneous equations model is one in which there are endogenous variables which are determined jointly. e.g. the demand-supply.
Economics 20 - Prof. Anderson1 Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
1 The Training Benefits Program – A Methodological Exposition To: The Research Coordination Committee By: Jonathan Adam Lind Date: 04/01/16.
Heteroscedasticity Heteroscedasticity is present if the variance of the error term is not a constant. This is most commonly a problem when dealing with.
Ch5 Relaxing the Assumptions of the Classical Model
GS/PPAL Section N Research Methods and Information Systems
Chapter 15 Panel Data Models.
Ch. 2: The Simple Regression Model
Vera Tabakova, East Carolina University
The Nature of Econometrics and Economic Data
The Nature of Econometrics and Economic Data
The Nature of Econometrics and Economic Data
The Nature of Econometrics and Economic Data
General belief that roads are good for development & living standards
Panel Data Models By Mai Thanh, Jin Lulu.
Pooling Cross Sections across Time: Simple Panel Data Methods
Econometrics ITFD Week 8.
The Nature of Econometrics and Economic Data
Difference-in-Differences
PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
More on Specification and Data Issues
Jennifer Ward-Batts March 21, 2017
Chapter 15 Panel Data Analysis.
More on Specification and Data Issues
Instrumental Variables and Two Stage Least Squares
Advanced Panel Data Methods
Multiple Regression Analysis with Qualitative Information
Chapter 6: MULTIPLE REGRESSION ANALYSIS
Economics 20 - Prof. Anderson
Instrumental Variables and Two Stage Least Squares
Migration and the Labour Market
Economics 20 - Prof. Anderson
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Instrumental Variables and Two Stage Least Squares
Evaluating Impacts: An Overview of Quantitative Methods
Econometrics Analysis
The Simple Regression Model
The Productivity Effects of Privatization Longitudinal Estimates using Comprehensive Manufacturing Firm Data from Hungary, Romania, Russia, and Ukraine.
Multiple Regression Analysis with Qualitative Information
Multiple Regression Analysis: OLS Asymptotics
Multiple Regression Analysis: OLS Asymptotics
Instrumental Variables Estimation and Two Stage Least Squares
More on Specification and Data Issues
Violations of Assumptions In Least Squares Regression
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Advanced Panel Data Methods
Presentation transcript:

Pooling Cross Sections across Time: Simple Panel Data Methods

Pooling Cross Sections across Time: Simple Panel Data Methods Policy analysis with pooled cross sections Two or more independently sampled cross sections can be used to evaluate the impact of a certain event or policy change Example: Effect of new garbage incinerator’s location on housing prices Examine the effect of the location of a house on its price before and after the garbage incinerator was built: After incinerator was built Before incinerator was built

Pooling Cross Sections across Time: Simple Panel Data Methods Example: Garbage incinerator and housing prices (cont.) It would be wrong to conclude from the regression after the incinerator is there that being near the incinerator depresses prices so strongly One has to compare with the situation before the incinerator was built: In the given case, this is equivalent to This is the so called difference-in-differences estimator (DiD) Incinerator depresses prices but location was one with lower prices anyway

Pooling Cross Sections across Time: Simple Panel Data Methods Difference-in-differences in a regression framework In this way standard errors for the DiD-effect can be obtained If houses sold before and after the incinerator was built were sys- tematically different, further explanatory variables should be included This will also reduce the error variance and thus standard errors Before/After comparisons in “natural experiments” DiD can be used to evaluate policy changes or other exogenous events Differential effect of being in the location and after the incinerator was built

Pooling Cross Sections across Time: Simple Panel Data Methods Policy evaluation using difference-in-differences Compare outcomes of the two groups before and after the policy change Compare the difference in outcomes of the units that are affected by the policy change (= treatment group) and those who are not affected (= control group) before and after the policy was enacted. For example, the level of unemployment benefits is cut but only for group A (= treatment group). Group A normally has longer unemployment durations than group B (= control group). If the diffe-rence in unemployment durations between group A and group B becomes smaller after the reform, reducing unemployment benefits reduces unemployment duration for those affected. Caution: Difference-in-differences only works if the difference in outcomes between the two groups is not changed by other factors than the policy change (e.g. there must be no differential trends).

Pooling Cross Sections across Time: Simple Panel Data Methods Two-period panel data analysis Example: Effect of unemployment on city crime rate Assume that no other explanatory variables are available. Will it be possible to estimate the causal effect of unemployment on crime? Yes, if cities are observed for at least two periods and other factors affecting crime stay approximately constant over those periods: Time dummy for the second period Unobserved time-constant factors (= fixed effect) Other unobserved factors (= idiosyncratic error)

Pooling Cross Sections across Time: Simple Panel Data Methods Example: Effect of unemployment on city crime rate (cont.) Estimate differenced equation by OLS: Subtract: Fixed effect drops out + 1 percentage point unemploy-ment rate leads to 2.22 more crimes per 1,000 people Secular increase in crime

Pooling Cross Sections across Time: Simple Panel Data Methods Discussion of first-differenced panel estimator Further explanatory variables may be included in original equation Note that there may be arbitrary correlation between the unobserved time-invariant characteristics and the included explanatory variables OLS in the original equation would therefore be inconsistent The first-differenced panel estimator is thus a way to consistently estimate causal effects in the presence of time-invariant endogeneity For consistency, strict exogeneity has to hold in the original equation First-differenced estimates will be imprecise if explanatory variables vary only little over time (no estimate possible if time-invariant)