EMPIRICAL STUDY AND FORECASTING (II)

Slides:



Advertisements
Similar presentations
Regression Discontinuity. Basic Idea Sometimes whether something happens to you or not depends on your ‘score’ on a particular variable e.g –You get a.
Advertisements

Random Assignment Experiments
Lecture #11: Introduction to the New Empirical Industrial Organization (NEIO) - What is the old empirical IO? The old empirical IO refers to studies that.
Presented by Malte Lierl (Yale University).  How do we measure program impact when random assignment is not possible ?  e.g. universal take-up  non-excludable.
Econ 140 Lecture 241 Simultaneous Equations II Lecture 24.
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 7: Demand Estimation and Forecasting.
Chapter 10 Simple Regression.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
Inference for regression - Simple linear regression
Chapter 7: Demand Estimation and Forecasting
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Global Workshop on Development Impact Evaluation in Finance and Private Sector Rio de Janeiro, June 6-10, 2011 Making the Most out of Discontinuities Florence.
?v=cqj5Qvxd5MO Linear and Quadratic Functions and Modeling.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Session III Regression discontinuity (RD) Christel Vermeersch LCSHD November 2006.
ECON 3039 Labor Economics By Elliott Fan Economics, NTU Elliott Fan: Labor 2015 Fall Lecture 21.
Managerial Economics Demand Estimation & Forecasting.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Public Policy Analysis ECON 3386 Anant Nyshadham.
GG 313 Geological Data Analysis Lecture 13 Solution of Simultaneous Equations October 4, 2005.
Lecture 7: What is Regression Analysis? BUEC 333 Summer 2009 Simon Woodcock.
Chapter 7: Demand Estimation and Forecasting McGraw-Hill/Irwin Copyright © 2011 by the McGraw-Hill Companies, Inc. All rights reserved.
Applying impact evaluation tools A hypothetical fertilizer project.
Overview of Regression Analysis. Conditional Mean We all know what a mean or average is. E.g. The mean annual earnings for year old working males.
ECON 3039 Labor Economics By Elliott Fan Economics, NTU Elliott Fan: Labor 2015 Fall Lecture 41.
AADAPT Workshop Latin America Brasilia, November 16-20, 2009 Laura Chioda.
Example x y We wish to check for a non zero correlation.
ECON 3039 Labor Economics By Elliott Fan Economics, NTU Elliott Fan: Labor 2015 Fall Lecture 61.
Public Finance and Public Policy Jonathan Gruber Third Edition Copyright © 2010 Worth Publishers 1 of 24 Copyright © 2010 Worth Publishers.
ECON 3039 Labor Economics By Elliott Fan Economics, NTU Elliott Fan: Labor 2015 Fall Lecture 71.
Managerial Economics & Decision Sciences Department tyler realty  old faithful  business analytics II Developed for © 2016 kellogg school of management.
Impact Evaluation Methods Regression Discontinuity Design and Difference in Differences Slides by Paul J. Gertler & Sebastian Martinez.
Stats Methods at IC Lecture 3: Regression.
Demand Estimation & Forecasting
Chapter 7 Demand Estimation and Forecasting
business analytics II ▌assignment four - solutions mba for yourself 
The simple linear regression model and parameter estimation
ECON 4009 Labor Economics 2017 Fall By Elliott Fan Economics, NTU
Lezione di approfondimento su RDD (in inglese)
Applied Biostatistics: Lecture 2
An introduction to Impact Evaluation
Comparing Three or More Means
Chapter 5 STATISTICS (PART 4).
The Regression Discontinuity Design
12 Inferential Analysis.
Making the Most out of Discontinuities
CHAPTER 29: Multiple Regression*
CHAPTER 26: Inference for Regression
Empirical Tools of Public Finance
Graphs, Linear Equations, and Functions
1/18/2019 1:17:10 AM1/18/2019 1:17:10 AM Discussion of “Strategies for Studying Educational Effectiveness” Mark Dynarski Society for Research on Educational.
12 Inferential Analysis.
Least-Squares Regression
Impact Evaluation Methods: Difference in difference & Matching
6.1 Introduction to Chi-Square Space
Explanation of slide: Logos, to show while the audience arrive.
Product moment correlation
Regression Forecasting and Model Building
Applying Impact Evaluation Tools: Hypothetical Fertilizer Project
Chapter 7: Demand Estimation and Forecasting
Chapter 7: Demand Estimation and Forecasting
Positive analysis in public finance
Using Big Data to Solve Economic and Social Problems
Chapter 7 Demand Estimation & Forecasting
Demand Estimation & Forecasting
Linear Functions and Slope-Intercept Form Lesson 2-3
Presentation transcript:

EMPIRICAL STUDY AND FORECASTING (II) 2018 Spring By Elliott Fan Economics, NTU Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 A little jumpy here Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 A little jumpy here Mortality risk shoots up on and immediately following a twenty-first birthday This spike adds about 100 deaths to a baseline level of about 150 per day. The age-21 spike doesn’t seem to be a generic party-hardy birthday effect. If this spike reflects birthday partying alone, we should expect to see deaths shoot up after the twentieth and twenty-second birthdays as well, but that doesn’t happen. There’s something special about the twenty-first birthday. Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 A little jumpy here This figure plots death rates from all causes against age in months. The lines in the figure show fitted values from a regression of death rates on an over-21 dummy and age in months (the vertical dashed line indicates the minimum legal drinking age (MLDA) cutoff). Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Define the terms Forcing variable (or running variable): age Cutoff point: the 21st birthday Treatment variable: legal privilege for drinking alcohol Outcome variable: mortality rate Elliott Fan: ESAF 2018 Spring Lecture 1

Jumpy here and there -- examples I combine a regression discontinuity design with rich data on academic and labor market outcomes for a large sample of Florida students to estimate the returns to college admission for academically marginal students. Students with grades just above a threshold for admissions eligibility at a large public university in Florida are much more likely to attend any university than below-threshold students. The marginal admission yields earnings gains of 22% between 8 and 14 years after high school completion. These gains outstrip the costs of college attendance, and they are largest for male students and free-lunch recipients. Elliott Fan: ESAF 2018 Spring Lecture 1

Jumpy here and there -- examples Elliott Fan: ESAF 2018 Spring Lecture 1

Jumpy here and there -- examples Elliott Fan: ESAF 2018 Spring Lecture 1

What can we do to use the jump? Recall that removing selection bias demands random assignment of the treatment Observations just on the right of the cutoff point and those just on the left of the cutoff point are very similar except for the likelihood of treatment taking up So the key requirement is that people cannot precisely control the exact position along the forcing variable around the cutoff point Note that there are different degree of “control” Elliott Fan: ESAF 2018 Spring Lecture 1

Methodologies – the RD design The RD design utilizes the randomness from incomplete manipulation of a characteristic around the threshold point for the treatment. From Lee and Lemieux (2009): Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 What is RDD? A regression discontinuity design (RDD) is a quasi-experimental design that elicits the causal effects of interventions by assigning a cutoff or threshold above or below which an intervention is assigned. By comparing observations lying closely on either side of the threshold, it is possible to estimate the average treatment effect in environments in which a randomized trial was unfeasible. Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Why RDD? One major challenge of empirical studies is to pin down causality, which demands sources of exogenous variations in the “treatment” variables. Such sources are rare. Some variables exhibit a discontinuity, forming a potential source for such exogenous variations in the variable of interest. Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Identification Consider the following setup: Forcing variable: X Discontinuity: X=c We can now define something we might like to know: Treatment Effect at Discontinuity-TAD Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Identification In general, we cannot estimate TAD in a statistically reliable way. What can we estimate in the data? Let’s try an RD estimator. As before, the game will be to figure out what assumptions are necessary for the RD estimator (which we can see) to deliver the TAD (which we cannot see). Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Identification We will start with what we can estimate, then add and subtract three terms. After manipulation, we can clarify the assumptions necessary to get the TAD. Now add and subtract three things: Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Identification Rearrange the equation, we obtain: The four terms refer to: TAD: this is our object of interest. Selection bias: Captures any difference in Y0 between treatment and control. If assignment at discontinuity is as good as random, can assume this is zero. Limit gap from above: Captures any jump in Y1 at c. Can assume zero if Y1 continuous at c. Limit gap from below: Captures any jump in Y0 at c. Can assume zero if Y0 continuous at c. Elliott Fan: ESAF 2018 Spring Lecture 1

Two types of designs -- Sharp and Fuzzy Sharp design: When treatment taking is deterministically relied on eligibility. That is, everybody complies. Fuzzy design: When treatment taking is NOT deterministically relied on eligibility. That is, there are non-compliers Elliott Fan: ESAF 2018 Spring Lecture 1

Sharp and Fuzzy Designs Imbens and Lemieux (2008) Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 An example of sharp RDD Go back to the case of legal access to alcohol on death rates. The treatment variable in this case can be written Da , where Da = 1 indicates legal drinking and is 0 otherwise. Da is a function of age a : the MLDA transforms 21-year-olds from underage minors to legal alcohol consumers. We capture this transformation in mathematical notation by writing: In sharp RD designs, treatment switches cleanly off or on as the running variable passes a cutoff. Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Running a regression Ma is the death rate in month a (again, month is defined as a 30-day interval counting from the twenty-first birthday). Da, the treatment dummy, is defined as in the previous slide a is a linear control for age in months. Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Linear or non-linear? Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Linear or non-linear? Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Linear or non-linear? Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Linear or non-linear? Elliott Fan: ESAF 2018 Spring Lecture 1

Running a regression – normalized form Ma is the death rate in month a (again, month is defined as a 30-day interval counting from the twenty-first birthday). Da, the treatment dummy, is defined as in the previous slide a is a linear control for age in months a0 refers to the cutoff point Elliott Fan: ESAF 2018 Spring Lecture 1

Running a regression – making it more flexible Now, adding 𝛿[(𝑎− 𝑎 0 ) 𝐷 𝑎 ] allows the slope to be different before and after the cutoff point You should be careful interpreting the main coefficients Elliott Fan: ESAF 2018 Spring Lecture 1

Elliott Fan: ESAF 2018 Spring Lecture 1 Compare the results Elliott Fan: ESAF 2018 Spring Lecture 1

Which function form is best? There are no general rules here, and no substitute for a thoughtful look at the data. When different forms provide varied estimates, caution is needed. Elliott Fan: ESAF 2018 Spring Lecture 1

The next question: how do you know it is MLDA? Elliott Fan: ESAF 2018 Spring Lecture 1

The next question: how do you know it is MLDA? Elliott Fan: ESAF 2018 Spring Lecture 1