Public Policy Analysis ECON 3386 Anant Nyshadham.

Slides:



Advertisements
Similar presentations
Economics 20 - Prof. Anderson1 Panel Data Methods y it = x it k x itk + u it.
Advertisements

Economics 20 - Prof. Anderson
Multiple Regression Analysis
Review of Identifying Causal Effects Methods of Economic Investigation Lecture 13.
Random Assignment Experiments
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Time Varying Coefficient Models; A Proposal for selecting the Coefficient Driver Sets Stephen G. Hall, P. A. V. B. Swamy and George S. Tavlas,
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Omitted Variable Bias Methods of Economic Investigation Lecture 7 1.
Differences-in-Differences
Objectives (BPS chapter 24)
PHSSR IG CyberSeminar Introductory Remarks Bryan Dowd Division of Health Policy and Management School of Public Health University of Minnesota.
Assumption MLR.3 Notes (No Perfect Collinearity)
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Multiple Linear Regression Model
Chapter 10 Simple Regression.
Pooled Cross Sections and Panel Data II
Econ Prof. Buckles1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Regression with a Binary Dependent Variable. Introduction What determines whether a teenager takes up smoking? What determines if a job applicant is successful.
Chapter 4 Multiple Regression.
Chapter 51 Experiments, Good and Bad. Chapter 52 Experimentation u An experiment is the process of subjecting experimental units to treatments and observing.
Clustered or Multilevel Data
Empirical methods take real-world data estimate size of relationship between variables two types  regression analysis  natural experiments take real-world.
Economics 20 - Prof. Anderson
Topic 3: Regression.
1Prof. Dr. Rainer Stachuletz Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
1Prof. Dr. Rainer Stachuletz Panel Data Methods y it =  0 +  1 x it  k x itk + u it.
PEPA is based at the IFS and CEMMAP © Institute for Fiscal Studies Identifying social effects from policy experiments Arun Advani (UCL & IFS) and Bansi.
Objectives of Multiple Regression
Estimation of Demand Prof. Ravikesh Srivastava Lecture-8.
Error Component Models Methods of Economic Investigation Lecture 8 1.
Factorial Design of Experiments Kevin Leyton-Brown.
Chapter 12 Multiple Linear Regression Doing it with more variables! More is better. Chapter 12A.
Instrumental Variables: Problems Methods of Economic Investigation Lecture 16.
Introduction to Multilevel Modeling Stephen R. Porter Associate Professor Dept. of Educational Leadership and Policy Studies Iowa State University Lagomarcino.
Matching Estimators Methods of Economic Investigation Lecture 11.
Session III Regression discontinuity (RD) Christel Vermeersch LCSHD November 2006.
Public Policy Analysis ECON 3386 Anant Nyshadham.
Managerial Economics Demand Estimation & Forecasting.
1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
Lecture 7: What is Regression Analysis? BUEC 333 Summer 2009 Simon Woodcock.
Introduction section of article
Applying impact evaluation tools A hypothetical fertilizer project.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
1 Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
Randomized Assignment Difference-in-Differences
Economics 20 - Prof. Anderson1 Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Regression Analysis: A statistical procedure used to find relations among a set of variables B. Klinkenberg G
Alexander Spermann University of Freiburg, SS 2008 Matching and DiD 1 Overview of non- experimental approaches: Matching and Difference in Difference Estimators.
Experimental Evaluations Methods of Economic Investigation Lecture 4.
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
The simple linear regression model and parameter estimation
Pooling Cross Sections across Time: Simple Panel Data Methods
Multiple Regression Analysis: Estimation
Pooling Cross Sections across Time: Simple Panel Data Methods
Methods of Economic Investigation Lecture 12
Economics 20 - Prof. Anderson
Matching Methods & Propensity Scores
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Applying Impact Evaluation Tools: Hypothetical Fertilizer Project
Linear Regression Summer School IFPRI
3.2. SIMPLE LINEAR REGRESSION
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Presentation transcript:

Public Policy Analysis ECON 3386 Anant Nyshadham

Endogeneity

True Model Suppose true model is: –Y = a + bX + cV + e a, b, and c are parameters to be estimated; e is error term WHAT DOES b REPRESENT IN TERMS OF POLICY? Why do we care to estimate it? Do not observe V Can only estimate: –Y = a + bX + e What happens to estimate of b, b?

Example Y = a + bX + cV + e –Y is agricultural yield (total production / area) –X is use of HYV (=1 if used HYV in last season, 0 otherwise) –V is a vector of soil characteristics (esp. suitability for planting HYV) Uninteresting case: c=0 Two important cases –V is not known by farmer –V is known by farmer (and affects X)

Irrelevant V Suppose that c=0 Soil characteristics have no effects on yields (not really believable!) Profit-maximizing farmer would thus not base his choice of X on V Estimates of a and b will be unaffected by omission of soil quality (V) Y = a + bX + cV + e, c=0 Y = a + bX + e (estimated model is the true model) Thus linear regression will give us a = a, b = b

Exogenous X Farmer does not know soil quality (V) –Thus V does not affect farmer’s choice of X Suppose HYV adoption makes yield (Y) very large if V=1, but very small if V=0 –Y will depend on both X and V –Farmer cannot act on relationship between X and V; therefore, X will not depend on V! Estimate of b is unaffected: – b = b; a = [E(Y – bX)] = a + c V (V is average soil quality in sample)

Endogenous X Farmer knows soil quality (V) and takes it into account when choosing to adopt HYV (X) –Farmer wants to maximize yield –Suppose soil quality can be of two types good for HYV (V=1) bad for HYV (V=0) –Extreme Case: Farmer chooses to adopt (X=1) only when soil is good for HYV (i.e. V=1); and thus X=0 if V=0 Estimate of b will be biased in this case: – b = b + c; a = a

Endogenous V (cont.) A less extreme, more believable case: Suppose farmer more likely to use HYV (X=1) if his soil is suitable for it (V=1) Bias will then depend on degree of dependency between X and V b <= b <= b + c ; a <= a <= a + c V If we observe soil quality (V), or know exact relationship between V and X, can still get estimate of true b! But this is not common…in general, we don’t know V or its exact relationship to X What can we do?

Overcoming Endogeneity Induce variation in X which is void of relationship with V (randomization) Remove effects of static unobserved V by comparing groups over time (differencing) Use other observed characteristics to fully predict portion of X which depends on unobserved V (matching, control function) Exploit “excludable” determinant of X, independent of V (instrument) or specific functional form of X assignment (discontinuity)

Differencing If we can see treated and untreated groups before and after, we can compare the CHANGES in Y for treated before and after treatment to coincident changes for the untreated and –High and low goiter states before and after iodization –Factories with and without LED during high and low temperatures in the same day, month, year, etc Assume changes in everything else are common to both treated and untreated groups

Instrumental Variables If we know of some factor Z that at least partially determines treatment X without directly impacting outcome Y, we can use Z as a predictor (instrument) of treatment X that can bypass any confounders. –Ease of accessing health care predicts health care utilization but not incidence and severity of sickness 2 key requirements –Z must adequately predict X (testable) –Z must not impact Y except through X (assumed)

Regression Discontinuity If we know the exact assignment rule, we can use this rule to construct instrument Z for treatment X. –Merit-based tuition subsidies given based on GPA and SAT/ACT cutoffs –Subsidized health care provided to those below wealth cutoff Compare those just above cutoff to those just below cutoff Assume at tiny increments of eligibility all else is equivalent across treated and untreated

Matching Match each treated participant to one or more untreated participant based on observable characteristics. Assumes no selection on unobservables Condense all observables into one “propensity score,” match on that score.

Matching After matching treated to most similar untreated, subtract the means, calculate average difference

Projections & Regressions

Linear Projections If we have many potentially related (jointly distributed) variables –Outcome of interest Y –Explanatory variable of interest X –Additional potential confounders A, B, C We are interested in how much of Y is explained incrementally by X accounting for any confounding co- variation with A, B, C A projection is a decomposition of the variation in variable into the independent (orthogonal) planes or spaces of other variables. Each of the independent sources of variation in the full set of variables Y,X,A,B,C is given a plane that is separated by right angles from each of the other planes.

Linear Projections Projections are analytical/theoretical representations and are true by construction We can always represent one variable as a projection on other variables Y = ρ + λ x X + λ a A + λ b B + λ c C + ψ If A contributes nothing to Y, λ a =0 If Y,X,A,B,C are jointly normally distributed ψ is independent from X,A,B,C and the linear projection fully explains the relationship between Y and X,A,B,C

Regression Regressions are the data (empirical) analogue to projections A regression of Y on X,A,B,C separates the observed variation in Y into the orthogonal planes of observed variation in X,A,B,C Y = α + ϒ x X + ϒ a A + ϒ b B + ϒ c C + ε ϒ ’s measure the observed covariance between Y and regressor (X,A,B,C) divided by the variance of the regressor

Partition Regression A regression of Y on X,A,B,C will yield the same ϒ x as a regression of Y on X Y = α + ϒ x X + ϒ a A + ϒ b B + ϒ c C + ε Y = δ + κ a A + κ b B + κ c C + Y  Y = Y – (δ + κ a A + κ b B + κ c C) X = τ + η a A + η b B + η c C + X  X = X – (τ + η a A + η b B + η c C) Y = ϒ x X + ε

Fixed (Group) Effects Operation: include as controls a set of dummy variables that spans a dimension of variation –Omit one dummy if general constant is estimated 1 dummy for gender 11 dummies for month Concept: Assigns varying intercept (constant) to individual groups or time periods –Effectively demeans variables within cells of variation (e.g., by month or gender) –Ensures that coefficient of interest does not reflect these course differences across groups or time

Example Papers Impacts of –salt iodization on education and labor outcomes –temperature and lighting on worker productivity –health care on health outcomes and household enterprise activity –scholarships on college outcomes –health insurance on criminal activity –soft skills training on worker productivity and retention