LT5: Review Sam Marden 1. Working with summary data.

Slides:



Advertisements
Similar presentations
Using HMOs To Serve The Medicaid Population: What Are The Effects On Healthcare Utilization And Does The Type Of HMO Matter? Bradley Herring and E. Kathleen.
Advertisements

Designing an impact evaluation: Randomization, statistical power, and some more fun…
PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
Economics 20 - Prof. Anderson1 Panel Data Methods y it = x it k x itk + u it.
REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
1 Examples of Fixed-Effect Models. 2 Almond et al. Babies born w/ low birth weight(< 2500 grams) are more prone to –Die early in life –Have health problems.
1 Almond et al. Babies born w/ low birth weight(< 2500 grams) are more prone to – Die early in life – Have health problems later in life – Educational.
Review of Identifying Causal Effects Methods of Economic Investigation Lecture 13.
LT8: Matching Sam Marden Introduction Describe the intuition behind matching estimators. Be concise. Suppose you have a sample of.
Omitted Variable Bias Methods of Economic Investigation Lecture 7 1.
Lecture 12 (Ch16) Simultaneous Equations Models (SEMs)
Econ 140 Lecture 241 Simultaneous Equations II Lecture 24.
Chapter 5 Introduction to Inferential Statistics.
Pooled Cross Sections and Panel Data II
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Simple Linear Regression
The Fundamental Problem of Causal Inference Alexander Tabarrok January 2007.
Clustered or Multilevel Data
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
1 Research Method Lecture 11-1 (Ch15) Instrumental Variables Estimation and Two Stage Least Square ©
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
Correlation and Regression Analysis
The Role of Consumer Knowledge on the Demand for Preventive Health Care Among the Elderly Stephen T. Parente, Ph.D., Project HOPE Center for Health Affairs.
LT6: IV2 Sam Marden Question 1 & 2 We estimate the following demand equation ln(packpc) = b 0 + b 1 ln(avgprs) +u What do we require.
Assessing Studies Based on Multiple Regression
Research Methods in Human Sexuality
Modeling errors in physical activity data Sarah Nusser Department of Statistics and Center for Survey Statistics and Methodology Iowa State University.
Review Lecture histograms, five number summary, one-sample confidence intervals, nonresponse, observational study.
Error Component Models Methods of Economic Investigation Lecture 8 1.
Intergenerational Poverty and Mobility. Intergenerational Mobility Leblanc’s Random Family How does this excerpt relate to what we have been talking about?
Shawn Cole Harvard Business School Threats and Analysis.
Has Public Health Insurance for Older Children Reduced Disparities in Access to Care and Health Outcomes? Janet Currie, Sandra Decker, and Wanchuan Lin.
Chapter 2 AP Psychology Outline
Instrumental Variables: Problems Methods of Economic Investigation Lecture 16.
Welcome to Econ 420 Applied Regression Analysis Study Guide Week Six.
PROBLEM SET 1 SOLUTION. QUESTION 1 Error Term Omitted variables Measurement error Specification error Random or unpredictable occurrences Some of the.
Application 3: Estimating the Effect of Education on Earnings Methods of Economic Investigation Lecture 9 1.
Application 2: Minnesota Domestic Violence Experiment Methods of Economic Investigation Lecture 6.
Christine Pal Chee October 9, 2013 Research Design.
A discussion of Comparing register and survey wealth data ( F. Johansson and A. Klevmarken) & The Impact of Methodological Decisions around Imputation.
Accounting for the Effect of Health on Economic Growth David N. Weil Proponent/Presenter Section.
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
Instrumental Variables: Introduction Methods of Economic Investigation Lecture 14.
Lost but not forgotten : attrition in the Étude longitudinale du développement des enfants du Québec (ÉLDEQ), Julien BÉRARD-CHAGNON and Simona.
Correlation. Up Until Now T Tests, Anova: Categories Predicting a Continuous Dependent Variable Correlation: Very different way of thinking about variables.
Christel M. J. Vermeersch November 2006 Session V Instrumental Variables.
Chapter 8: Simple Linear Regression Yang Zhenlin.
CORRELATIONS: PART II. Overview  Interpreting Correlations: p-values  Challenges in Observational Research  Correlations reduced by poor psychometrics.
Analysis of Experiments
Randomized Assignment Difference-in-Differences
CHECKING THE CONSISTENCY OF POVERTY IN POLAND: EVIDENCE by Adam Szulc Warsaw School of Economics, Poland.
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
By Randall Munroe, xkcd.com Econometrics: The Search for Causal Relationships.
The Evaluation Problem Alexander Spermann, University of Freiburg 1 The Fundamental Evaluation Problem and its Solution SS 2009.
INSTRUMENTAL VARIABLES Eva Hromádková, Applied Econometrics JEM007, IES Lecture 5.
Experimental Evaluations Methods of Economic Investigation Lecture 4.
Multiple Regression Analysis: Further Issues
Difference-in-Differences
Stats Questions We Are Often Asked
PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
More on Specification and Data Issues
More on Specification and Data Issues
Impact evaluation: The quantitative methods with applications
Is there such a thing as Migration of Poverty in Albania?
Matching Methods & Propensity Scores
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Chapter 17 Measurement Key Concept: If you want to estimate the demand curve, you need to find cases where the supply curve shifts.
Evaluating Impacts: An Overview of Quantitative Methods
More on Specification and Data Issues
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Presentation transcript:

LT5: Review Sam Marden

1. Working with summary data

2. More Stats Refresher (b)

3. Panel Data (a) We are trying to learn whether the Aid to Families With Dependant Children program (which provided block grants to states to support programs targeted at low income women with children) effected birth weights. You run the OLS regression: LowBirthWeight = a + b*AFDCPct + u Where AFDCPct is the share of the states population on AFDC supported welfare programs and LowBirthWeight is the percentage of children born with low birth weight i.What do you expect b_hat to be? Why? ii.Do you think this is likely to be the causal effect of the welfare program?

3. Panel Data (a) We are trying to learn whether the Aid to Families With Dependant Children program (which provided block grants to states to support programs targeted at low income women with children) effected birth weights. You run the OLS regression: LowBirthWeight = a + b*AFDCPct + u Where AFDCPct is the share of the states population on AFDC supported welfare programs and LowBirthWeight is the percentage of children born with low birth weight i.What do you expect b_hat to be? Why? i.Causal effect – maybe weakly negative. But OVB, in particular correlates with poverty and cov(poverty, lowbirthweight)>0 and cov(poverty, AFDCPct)>0 so will be biased upwards. Bias probably stronger than causal effect. ii.Do you think this is likely to be the causal effect of the welfare program?

3. Panel Data (b) Like a boss, you add some controls for doctors per capita, hospital beds per capita and income. i.What is the (likely) causal effect of each of these variables? ii.How is your estimate of b_hat likely to change when you control for each of these factors iii.What would need to be true for the new estimates of b_hat to be a consistent estimator of the programs effect? iv.Suppose you add state fixed effects. What problem do they help solve? What would you expect to happen to b_hat when you include state FE?

3. Panel Data (b) Like a boss, you add some controls for doctors per capita, hospital beds per capita and income. i.What is the (likely) causal effect of each of these variables? ii.How is your estimate of b_hat likely to change when you control for each of these factors iii.What would need to be true for the new estimates of b_hat to be a consistent estimator of the programs effect? iv.Suppose you add state fixed effects. What problem do they help solve? What would you expect to happen to b_hat when you include state FE? Parts I common sense. Part ii think about OVB. Part iii cov(x,e)=0 (what does this mean. Part 4, takes care of all time invariant differences between states  identify only off ‘within’ variation. Not clear what the direction of the change should be.

Question 4: The Wald Estimator (a) What is the meaning of: E[y i c |T] E: y i c : T:

Question 4: The Wald Estimator (a) What is the meaning of: E[y i c |T] E: the expectation of – the ‘population’ mean y i c : test scores for school i if it were treated T: conditional on being part of the treated group So, E[y i c |T] is the expected average test score of an school in the treated group, had it not got the treatment.

Question 4: The Wald Estimator (b) What does Ḕ[y i c |C] mean? What is the value of Ḕ[y i c |C] ?

Question 4: The Wald Estimator (b) What does Ḕ[y i c |C] mean? It’s the sample analogue of, “the expected test score of an individual in the control group, had they not got the treatment.” What is the value of Ḕ[y i c |C] ? 60

Question 4: The Wald Estimator (c)

Question 4: The Wald Estimator (d) With random assignment of schoolbooks within the treatment and the control group we obtain the ATE (think about why this is true). How would our estimates of ATE 1.Be biased if only the control schools with books were a non- random sample (within the control group)? 2.Be biased if only the ‘treated’ schools without books were a non- random sample (within the treatment group)? 3.What is the overall bias..

Question 4: The Wald Estimator (d)

Question 5 We run a regression and use it to predict house prices. It turns out that our predictions are too low for the most expensive houses and too high for the cheapest. What gives?

Question 6 (a) We obtain the following regression results: DaysIll i = *FluShot i 1.What is the interpretation of the coefficients? 2.What is the biggest problem with interpreting things causally?

Question 6 (b) and (c) 4b. Is HMO membership a good instrument for getting a flu shot? 4c. Is being visited by a health worker who talks about flu and flu shots a good intrument for getting a flu shot? i

Question 6 (b) and (c) Take 3. 4b. Is HMO membership a good instrument for getting a flu shot? 4c. Is being visited by a health worker who talks about flu and flu shots a good intrument for getting a flu shot? Conditions of a good instrument (1) relevance, (2) exogeneity, Both probably satisfy relevance. We can check this anyway. Neither probably satisfy exogeneity e.g. b)There is selection into HMO’s – people may be poor sicker whatever. Also, HMO’s focus on preventative care which may affect days sick other than through the flu shot. c)The health worker talks about the risk of flu. People may be more careful e.g. washing their hands. This will also effect the number of days sick