Panel Data. Assembling the Data insheet using marriage-data.csv, c d u "background-data", clear d u "experience-data", clear u "wage-data", clear d reshape.

Slides:



Advertisements
Similar presentations
Dummy Variables and Interactions. Dummy Variables What is the the relationship between the % of non-Swiss residents (IV) and discretionary social spending.
Advertisements

Sociology 601 Class 24: November 19, 2009 (partial) Review –regression results for spurious & intervening effects –care with sample sizes for comparing.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.16 Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 Results from hsb_subset.do. 2 Example of Kloeck problem Two-stage sample of high school sophomores 1 st school is selected, then students are picked,
From Anova to Regression: analyzing the effect on consumption of no. of persons in family Family consumption data family.dta E/Albert/Courses/cdas/appstat00/From.
1 FE Panel Data assumptions. 2 Assumption #1: E(u it |X i1,…,X iT,  i ) = 0.
More on Regression Spring The Linear Relationship between African American Population & Black Legislators.
Lecture 4 (Chapter 4). Linear Models for Correlated Data We aim to develop a general linear model framework for longitudinal data, in which the inference.
HETEROSCEDASTICITY-CONSISTENT STANDARD ERRORS 1 Heteroscedasticity causes OLS standard errors to be biased is finite samples. However it can be demonstrated.
1 Nonlinear Regression Functions (SW Chapter 8). 2 The TestScore – STR relation looks linear (maybe)…
Advanced Panel Data Techniques
EC220 - Introduction to econometrics (chapter 7)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: exercise 3.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Lecture 6: Repeated Measures Analyses Elizabeth Garrett Child Psychiatry Research Methods Lecture Series.
Adaptive expectations and partial adjustment Presented by: Monika Tarsalewska Piotrek Jeżak Justyna Koper Magdalena Prędota.
Valuation 4: Econometrics Why econometrics? What are the tasks? Specification and estimation Hypotheses testing Example study.
Shall we take Solow seriously?? Empirics of growth Ania Nicińska Agnieszka Postępska Paweł Zaboklicki.
SPH 247 Statistical Analysis of Laboratory Data 1April 23, 2010SPH 247 Statistical Analysis of Laboratory Data.
Sociology 601 Class 25: November 24, 2009 Homework 9 Review –dummy variable example from ASR (finish) –regression results for dummy variables Quadratic.
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
Multilevel Models 2 Sociology 8811, Class 24
Multilevel Models 2 Sociology 229A, Class 18
1 Multiple Regression EPP 245/298 Statistical Analysis of Laboratory Data.
Multilevel Models 1 Sociology 229A Copyright © 2008 by Evan Schofer Do not copy or distribute without permission.
Regression Example Using Pop Quiz Data. Second Pop Quiz At my former school (Irvine), I gave a “pop quiz” to my econometrics students. The quiz consisted.
Introduction to Regression Analysis Straight lines, fitted values, residual values, sums of squares, relation to the analysis of variance.
Addressing Alternative Explanations: Multiple Regression Spring 2007.
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
1 Michigan.do. 2. * construct new variables;. gen mi=state==26;. * michigan dummy;. gen hike=month>=33;. * treatment period dummy;. gen treatment=hike*mi;
Sociology 601 Class 23: November 17, 2009 Homework #8 Review –spurious, intervening, & interactions effects –stata regression commands & output F-tests.
A trial of incentives to attend adult literacy classes Carole Torgerson, Greg Brooks, Jeremy Miles, David Torgerson Classes randomised to incentive or.
1 Zinc Data EPP 245 Statistical Analysis of Laboratory Data.
Sociology 601 Class 26: December 1, 2009 (partial) Review –curvilinear regression results –cubic polynomial Interaction effects –example: earnings on married.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: variable misspecification iii: consequences for diagnostics Original.
1 INTERACTIVE EXPLANATORY VARIABLES The model shown above is linear in parameters and it may be fitted using straightforward OLS, provided that the regression.
1 TWO SETS OF DUMMY VARIABLES The explanatory variables in a regression model may include multiple sets of dummy variables. This sequence provides an example.
Confidence intervals were treated at length in the Review chapter and their application to regression analysis presents no problems. We will not repeat.
EXERCISE 5.5 The Stata output shows the result of a semilogarithmic regression of earnings on highest educational qualification obtained, work experience,
Returning to Consumption
Country Gini IndexCountryGini IndexCountryGini IndexCountryGini Index Albania28.2Georgia40.4Mozambique39.6Turkey38 Algeria35.3Germany28.3Nepal47.2Turkmenistan40.8.
1 Estimation of constant-CV regression models Alan H. Feiveson NASA – Johnson Space Center Houston, TX SNASUG 2008 Chicago, IL.
How Does Digitization Affect Scholarship? Mark McCabe University of Michigan Roger Schonfeld Ithaka Christopher Snyder Dartmouth College December 11, 2007.
MultiCollinearity. The Nature of the Problem OLS requires that the explanatory variables are independent of error term But they may not always be independent.
EDUC 200C Section 3 October 12, Goals Review correlation prediction formula Calculate z y ’ = r xy z x for a new data set Use formula to predict.
Wiener Institut für Internationale Wirtschaftsvergleiche The Vienna Institute for International Economic Studies Structural change, productivity.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
. reg LGEARN S WEIGHT85 Source | SS df MS Number of obs = F( 2, 537) = Model |
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: exercise 5.2 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Special topics. Importance of a variable Death penalty example. sum death bd- yv Variable | Obs Mean Std. Dev. Min Max
Lecture 5. Linear Models for Correlated Data: Inference.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: exercise 6.13 Original citation: Dougherty, C. (2012) EC220 - Introduction.
STAT E100 Section Week 12- Regression. Course Review - Project due Dec 17 th, your TA. - Exam 2 make-up is Dec 5 th, practice tests have been updated.
RAMSEY’S RESET TEST OF FUNCTIONAL MISSPECIFICATION 1 Ramsey’s RESET test of functional misspecification is intended to provide a simple indicator of evidence.
1 CHANGES IN THE UNITS OF MEASUREMENT Suppose that the units of measurement of Y or X are changed. How will this affect the regression results? Intuitively,
Modeling Multiple Source Risk Factor Data and Health Outcomes in Twins Andy Bogart, MS Jack Goldberg, PhD.
GRAPHING A RELATIONSHIP IN A MULTIPLE REGRESSION MODEL The output above shows the result of regressing EARNINGS, hourly earnings in dollars, on S, years.
1 BINARY CHOICE MODELS: LINEAR PROBABILITY MODEL Economists are often interested in the factors behind the decision-making of individuals or enterprises,
1 In the Monte Carlo experiment in the previous sequence we used the rate of unemployment, U, as an instrument for w in the price inflation equation. SIMULTANEOUS.
WHITE TEST FOR HETEROSCEDASTICITY 1 The White test for heteroscedasticity looks for evidence of an association between the variance of the disturbance.
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE In this sequence we will investigate the consequences of including an irrelevant variable.
Diff-inDiff Are exports from i to j, the same as imports in i from j? Should they be?. gen test=xij-mji (14 missing values generated). sum test,
The slope, explained variance, residuals
QM222 Your regressions and the test
QM222 Class 15 Section D1 Review for test Multicollinearity
Covariance x – x > 0 x (x,y) y – y > 0 y x and y axes.
Table 4. Panel Regression with Fixed Effects
Eva Ørnbøl + Morten Frydenberg
Financial Econometrics Fin. 505
EPP 245 Statistical Analysis of Laboratory Data
Presentation transcript:

Panel Data

Assembling the Data insheet using marriage-data.csv, c d u "background-data", clear d u "experience-data", clear u "wage-data", clear d reshape long lwage, i(nr) j(year) sort nr year merge 1:1 nr year using "marriage-data" drop _merge merge 1:1 nr year using "experience-data" drop _merge merge n:1 nr using "background-data" drop _merge d sum save "data-exercise-11-nls", replace

(2) Is the data balanced? xtset nr year panel variable: nr (strongly balanced) time variable: year, 1980 to 1987 delta: 1 unit What does being balanced mean?

(3) First Step. reg lwage married Source | SS df MS Number of obs = F( 1, 4358) = Model | Prob > F = Residual | R-squared = Adj R-squared = Total | Root MSE = lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval] married | _cons |

(4) Controls. reg lwage married exper union educ black hisp Source | SS df MS Number of obs = F( 6, 4353) = Model | Prob > F = Residual | R-squared = Adj R-squared = Total | Root MSE = lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval] married | exper | union | educ | black | hisp | _cons |

(5) Panel Data. xtreg lwage married exper union educ black hisp, fe note: educ omitted because of collinearity note: black omitted because of collinearity note: hisp omitted because of collinearity Fixed-effects (within) regression Number of obs = 4360 Group variable: nr Number of groups = 545 R-sq: within = Obs per group: min = 8 between = avg = 8.0 overall = max = 8 F(3,3812) = corr(u_i, Xb) = Prob > F = lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval] married | exper | union | educ | 0 (omitted) black | 0 (omitted) hisp | 0 (omitted) _cons | sigma_u | sigma_e | rho | (fraction of variance due to u_i) F test that all u_i=0: F(544, 3812) = Prob > F = Why have ‘black’, ‘educ’ and ‘hisp’ been dropped from the regression? What variation are we working off when we include fixed effects?

Collinearity IDYearID1_FEID2_FEID3_FEIncomeMarriedBlackNational GDP 1995_FE etc 

(6) Clustering xtreg lwage married exper union, fe cluster(nr) Fixed-effects (within) regression Number of obs = 4360 Group variable: nr Number of groups = 545 R-sq: within = Obs per group: min = 8 between = avg = 8.0 overall = max = 8 F(3,544) = corr(u_i, Xb) = Prob > F = (Std. Err. adjusted for 545 clusters in nr) | Robust lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval] married | exper | union | _cons | sigma_u | sigma_e | rho | (fraction of variance due to u_i)

(7) Are dummies equivalent to FE?. reg lwage married exper union i.nr, cluster(nr) Linear regression Number of obs = 4360 F( 2, 544) =. Prob > F =. R-squared = Root MSE = (Std. Err. adjusted for 545 clusters in nr) | Robust lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval] married | exper | union |

(7) Time FE? Why not include Experience?. xtreg lwage married union i.year, fe cluster(nr) Fixed-effects (within) regression Number of obs = 4360 Group variable: nr Number of groups = 545 R-sq: within = Obs per group: min = 8 between = avg = 8.0 overall = max = 8 F(9,544) = corr(u_i, Xb) = Prob > F = (Std. Err. adjusted for 545 clusters in nr) | Robust lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval] married | union | | year | 1981 | | | | | | | | _cons | sigma_u | sigma_e | rho | (fraction of variance due to u_i)

(8) Driven By Divorce?. xtreg lwage married union i.year if everdivorce == 0, fe cluster(nr) Fixed-effects (within) regression Number of obs = 3792 Group variable: nr Number of groups = 474 R-sq: within = Obs per group: min = 8 between = avg = 8.0 overall = max = 8 F(9,473) = corr(u_i, Xb) = Prob > F = (Std. Err. adjusted for 474 clusters in nr) | Robust lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval] married | union | | year | 1981 | | | | | | | | _cons | sigma_u | sigma_e | rho | (fraction of variance due to u_i)

(8) Driven by Divorce 2?. xtreg lwage married union i.year if everdivorce == 1, fe cluster(nr) Fixed-effects (within) regression Number of obs = 568 Group variable: nr Number of groups = 71 R-sq: within = Obs per group: min = 8 between = avg = 8.0 overall = max = 8 F(9,70) = 6.21 corr(u_i, Xb) = Prob > F = (Std. Err. adjusted for 71 clusters in nr) | Robust lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval] married | union | | year | 1981 | | | | | | | | _cons | sigma_u | sigma_e | rho | (fraction of variance due to u_i)