Selection Bias, Comparative Advantage and Heterogeneous Returns to Education: Evidence from China James J. Heckman (University of Chicago) Xuesong Li (Institute.

Slides:



Advertisements
Similar presentations
Impact analysis and counterfactuals in practise: the case of Structural Funds support for enterprise Gerhard Untiedt GEFRA-Münster,Germany Conference:
Advertisements

1 The Social Survey ICBS Nurit Dobrin December 2010.
REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
THE ECONOMIC RETURN ON INVESTMENTS IN HIGHER EDUCATION: Understanding The Internal Rate of Return Presentation OISE, HEQCO, MTCU Research Symposium Defining.
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Unbalanced Panel Data … and Stata Kuan-Pin Lin Portland State University and WISE, Xiamen University.
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Rising Earnings Inequality in Urban China during Li Shi School of Economics and Business Administration, BNU Song Jin School of Economics and.
“A Unified Framework for Measuring Preferences for Schools and Neighborhoods” Bayer, Ferreira, McMillian.
Multiple Linear Regression Model
Chapter 10 Simple Regression.
Pooled Cross Sections and Panel Data II
QUALITATIVE AND LIMITED DEPENDENT VARIABLE MODELS.
BACKGROUND RESEARCH QUESTIONS  Does the time parents spend with children differ according to parents’ occupation?  Do occupational differences remain.
The incidence of Mandated Maternity Benefits
Regression with a Binary Dependent Variable. Introduction What determines whether a teenager takes up smoking? What determines if a job applicant is successful.
Social Interaction and Stock Market Participation: Evidence from British Panel Data Sarah Brown and Karl Taylor Department of Economics University of Sheffield.
1 The Effect of Benefits on Single Motherhood in Europe Libertad González Universitat Pompeu Fabra May 2006.
Single and Multiple Spell Discrete Time Hazards Models with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David K. Guilkey.
Economics of Gender Chapter 5 Assist.Prof.Dr.Meltem INCE YENILMEZ.
Assessing Studies Based on Multiple Regression
Matching Methods. Matching: Overview  The ideal comparison group is selected such that matches the treatment group using either a comprehensive baseline.
Off-farm labour participation of farmers and spouses Alessandro Corsi University of Turin.
NUFE 1 General Education, Vocational Education and Individual Income in Rural China HUANG Bin Center for Public Finance Research Faculty of Public Finance.
Do multinational enterprises provide better pay and working conditions than their domestic counterparts? A comparative analysis Alexander Hijzen (OECD.
Welfare Reform and Lone Parents Employment in the UK Paul Gregg and Susan Harkness.
Has Public Health Insurance for Older Children Reduced Disparities in Access to Care and Health Outcomes? Janet Currie, Sandra Decker, and Wanchuan Lin.
Parents’ basic skills and children’s test scores Augustin De Coulon, Elena Meschi and Anna Vignoles.
Evaluating Job Training Programs: What have we learned? Haeil Jung and Maureen Pirog School of Public and Environmental Affairs Indiana University Bloomington.
Do Intermarried Individuals Perform Better in the Labour Market? Raya Muttarak Supervisor: Prof. Anthony Heath Department of Sociology, University of Oxford.
LABOUR FORCE PARTICIPATION, EARNINGS AND INEQUALITY IN NIGERIA
HAOMING LIU JINLI ZENG KENAN ERTUNC GENETIC ABILITY AND INTERGENERATIONAL EARNINGS MOBILITY 1.
Beyond surveys: the research frontier moves to the use of administrative data to evaluate R&D grants Oliver Herrmann Ministry of Business, Innovation.
Managerial Economics Demand Estimation & Forecasting.
Maximum Likelihood Estimation Methods of Economic Investigation Lecture 17.
Application 3: Estimating the Effect of Education on Earnings Methods of Economic Investigation Lecture 9 1.
1 REGRESSION ANALYSIS WITH PANEL DATA: INTRODUCTION A panel data set, or longitudinal data set, is one where there are repeated observations on the same.
The Choice Between Fixed and Random Effects Models: Some Considerations For Educational Research Clarke, Crawford, Steele and Vignoles and funding from.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
A discussion of Comparing register and survey wealth data ( F. Johansson and A. Klevmarken) & The Impact of Methodological Decisions around Imputation.
Instrumental Variables: Introduction Methods of Economic Investigation Lecture 14.
Employment Discrimination and Gender Wage Differentials
Lorraine Dearden Director of ADMIN Node Institute of Education
Using microsimulation model to get things right: a wage equation for Poland Leszek Morawski, University of Warsaw Michał Myck, DIW - Berlin Anna Nicińska,
Randomized Assignment Difference-in-Differences
Household Members’ Time Allocation to Daily Activities and Decision to Hire Domestic Helpers Donggen WANG and Jiukun LI Department of Geography Hong Kong.
Effects of migration and remittances on poverty and inequality A comparison between Burkina Faso, Kenya, Nigeria, Senegal, South Africa, and Uganda Y.
1 The Training Benefits Program – A Methodological Exposition To: The Research Coordination Committee By: Jonathan Adam Lind Date: 04/01/16.
Children’s Emotional and Behavioral Problems and Their Parents’ Labor Supply Patrick Richard, Ph.D., M.A. Nicholas C. Petris Center on Health Markets and.
Empirical Studies of Marriage and Divorce. Korenman and Neumark, 1991 Does Marriage Really Make Men More Productive? How do we explain the male marriage.
The Evaluation Problem Alexander Spermann, University of Freiburg 1 The Fundamental Evaluation Problem and its Solution SS 2009.
4. Tobit-Model University of Freiburg WS 2007/2008 Alexander Spermann 1 Tobit-Model.
INSTRUMENTAL VARIABLES Eva Hromádková, Applied Econometrics JEM007, IES Lecture 5.
Experimental Evaluations Methods of Economic Investigation Lecture 4.
Assessing the Impact of Informality on Wages in Tanzania: Is There a Penalty for Women? Pablo Suárez Robles (University Paris-Est Créteil) 1.
1 An Empirical Analysis of Divorce or Separation among Cross-Border Marriages in Taiwan By Wen-Shai Hung Department of Business Administration Providence.
The Evaluation Problem Alexander Spermann, University of Freiburg, 2007/ The Fundamental Evaluation Problem and its Solution.
Residential Mobility, Heterogeneous Neighborhood effects and Educational Attainment of Blacks and Whites Li Gan Texas A&M University and NBER Yingning.
More on Specification and Data Issues
Instrumental Variables and Two Stage Least Squares
Impact evaluation: The quantitative methods with applications
Matching Methods & Propensity Scores
Matching Methods & Propensity Scores
Advanced Panel Data Methods
Methods of Economic Investigation Lecture 12
Instrumental Variables and Two Stage Least Squares
Matching Methods & Propensity Scores
Instrumental Variables and Two Stage Least Squares
Impact Evaluation Methods: Difference in difference & Matching
Presentation transcript:

Selection Bias, Comparative Advantage and Heterogeneous Returns to Education: Evidence from China James J. Heckman (University of Chicago) Xuesong Li (Institute of Quantitative & Technical Economics, Chinese Academy of Social Sciences)

2 1. Introduction 2. Models with and without Heterogeneity 3. Selection Bias and The Marginal Treatment Effect 4. Data Set and Empirical Results 5. Concluding Remarks

3 1.Introduction Heterogeneity and missing counterfactual states are central features of micro data. This paper uses China’s micro data, to estimate the return to education for China considering both heterogeneity and selection bias.

4 Our work builds on previous research by Heckman and Vytlacil (1999, 2001), and Carneiro (2002), which develops a semi parametric framework.

5 2. Models with and without Heterogeneity A conventional model of the return to education without heterogeneity in returns: (1) I for individuals (i=1, 2,...,n), lnYi is log income, Si is schooling level or years of schooling, Xi is a vector of variables βis the rate of return to education, γis a vector of coefficients.

6 OLS problem: omitted ability Ai, Three strategies: (1) IV. But It is also very hard to find satisfactory instruments. In fact, most commonly used instruments in the schooling literature are invalid because they are correlated with the omitted ability. (2) Fixed effect method: find a paired comparison such as a genetic twin or sibling with similar or identical ability. It needs enough information

7. (3) Proxy variables for ability Many empirical analyses reveal that better family background and better family resources are usually associated with better environments which raise ability. In our empirical work we use parental income as a proxy for ability.

8 A model with heterogeneous returns to education (in random coefficient form) (2) βi is the heterogeneous rate of return to education, which varies among individuals. Xi is a vector of variables including the proxy for ability. We focus on two schooling choices: (1) high school Si=0 (2) college Si=1

9 The two potential selection outcomes

10 Observed log earnings are: where (5) is the heterogeneous return to education for individual i. βi varies in the population, and the return to schooling is a random variable with a distribution.

11 The mean of βi given X is: (6) Decision rule: (7) Si* is a latent variable denoting the net benefit of going to school Zi is an observed vector of variables.

12 Pi = Pi (Zi) is the propensity score or probability of receiving treatment (going to college). P(Z) can be estimated by a logit or probit model. Usi is the unobserved heterogeneity for individual i in the treatment selection equation. Without loss of generality, we may assume that Usi ~ Unif [0,1]. The decision of whether to go to college (or not) for individual i is determined completely by the comparison of the observed heterogeneity Pi(Zi) with the unobserved heterogeneity Usi. The smaller the Usi, the more likely it is that the person goes to college.

13 3. Selection Bias and The Marginal Treatment Effect (8) ATE is the average treatment effect (the effect of randomly assigning a person to schooling) (9)

14 (10) Selection bias is the mean difference in the no-schooling (S = 0) unobservables between those who go to school and those who do not.

15 TT (treatment on the treated), the effect of treatment on those who receive it (e.g. go to college) compared with what they would experience without treatment (i.e. do not go to college), defined as (11) Sorting effect is the mean gain of the unobservables for people who choose ‘1’.

16 IV is not a consistent estimator In the presence of heterogeneity and selection bias. (12)

17 Neither OLS nor IV is a consistent estimator of the mean return to education in the presence of heterogeneity and selection. Under certain assumptions, it is possible to identify the heterogeneous return to education with marginal treatment effect (MTE) via the method of local instrument variables (LIV), where MTE is:

18 The MTE is the average willingness to pay (WTP) for lnY1i (compared to lnY0i ) given characteristics Xi and unobserved heterogeneity Usi. MTE can be estimated from the following relationship, where LIV can be estimated by semi parametric methods for derivatives (Heckman, 2001):

19 All the other treatment variables can be unified using MTE:

20 Where the weights are:

21 Treatment on the untreated (TUT) is the effect of treatment on those who do not receive it (i.e. do not go to college) compared with what they would experience with the treatment (i.e. go to college)

22 4. Data Set and Empirical Results Data Source: China Urban Household Income and Expenditure Survey (CUHIES) 2000 Conducted by the Urban Socio-Economic Survey Organization of the National Bureau of Statistics. Six provinces: Guangdong Liaoning Sichuan Shaanxi Zhejiang Beijing.

23 Sample size: 4250 households. For each household, there is rich information on all household members, including head, spouse, children and parents. Age, sex, education level, employment status and enterprise ownership, occupation, years of work experience and total annual income are available for each household member. There are seven education levels in the sample: university, college, special technical school, senior high school, junior high school, primary school, and other.

24 The used sample consists of 587 individuals, including 273 people with four-year college (or university) certificates and 314 people with only senior high school certificates.

25 Table 2. Summary Statistics Variable All (n=587)Treated (n=273)Untreated (n=314) MeanStd. ErrMeanStd. ErrMeanStd. Err Log Wage Age Years of work experience Year college attendance Male Lived in Guangdong Province (GD) Lived in Liaoning Province (LN) Lived in Shaanxi Province (SX) Lived in Sichuan Province (SC) Lived in Beijing (BJ) Lived in Zhejiang Province (ZJ) Worked in state owned enterprises (SOEs) Worked in collective-owned firms Worked in joint-venture or foreign owned firms Worked in private owned firms Worked in IND_CON sector* Worked in TRA_COM sector* Worked in HOU_RES sector* Worked in SPO_SOC sector* Worked in CUL_SCI sector* Worked in FIN_INS sector* Worked in GOVERN sector* Worked in OTHER sector* Years of father’s education Years of mother’s education Parental income (in 1000 yuan)

26 Variable OLSIV CoefficientStandard ErrorCoefficientStandard Error Intercept Year’s college attendance Years of work experience Experience squared Parental income in 1000 yuan Male Lived in Guangdong Province Lived in Liaoning Province Lived in Sichuan Province Lived in Beijing Lived in Zhejiang Province Worked in state owned enterprises Worked in collective-owned firms Worked in private owned firms Worked in IND_CON sector* Worked in TRA_COM sector* Worked in SPO_SOC sector* Worked in FIN_INS sector* Table 3. Estimated Mincer Model

27 VariableCoefficientStandard Error Mean Marginal Effect Intercept Years of father’s education Years of mother’s education Parental income in 1000 yuan Born before Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Table 4. Estimated Logit Model For Schooling

28 Variable High SchoolCollege Std. Err. Years of work experience Experience squared Parental income in 1000 yuan Male Lived in Guangdong Province Lived in Liaoning Province Lived in Sichuan Province Lived in Beijing Lived in Zhejiang Province Worked in state owned enterprises Worked in collective-owned firms Worked in private owned firms Worked in IND_CON sector* Worked in TRA_COM sector* Worked in SPO_SOC sector* Worked in FIN_INS sector* Table 5. Estimated Coefficients from Local Linear Regression Guassian Kernel, bandwidth = 0.4

29 ParametersEstimation OLS IV* ATE TT TUT Bias Selection Bias Sorting Gain * Using propensity score as instrument Table 6. Comparison of Different Parameters

30

31

32

33

34 A: with firms’ ownership dummies but not sectoral dummies B: with sectoral dummies but not ownership dummies C: no sectoral and ownership dummies

35

36 5. Concluding Remarks Neglecting heterogeneity and selection bias leads to biased and inconsistent estimates, such as those obtained using conventional OLS and IV parameters. We demonstrate the importance of proxying for ability in the wage equation to identify returns to education. Excluding the proxy leads to implausibly high estimates of the return to schooling.

37 In 2000 the average return to four-year college attendance is 43% (on average, 11% annually) for young people in the urban areas of the six provinces. The results imply that, after more than twenty years of economic reform with market orientation, the average return to education in China has increased markedly compared with that of the 1980s and early 1990s.