Selection Bias, Comparative Advantage and Heterogeneous Returns to Education: Evidence from China James J. Heckman (University of Chicago) Xuesong Li (Institute of Quantitative & Technical Economics, Chinese Academy of Social Sciences)
2 1. Introduction 2. Models with and without Heterogeneity 3. Selection Bias and The Marginal Treatment Effect 4. Data Set and Empirical Results 5. Concluding Remarks
3 1.Introduction Heterogeneity and missing counterfactual states are central features of micro data. This paper uses China’s micro data, to estimate the return to education for China considering both heterogeneity and selection bias.
4 Our work builds on previous research by Heckman and Vytlacil (1999, 2001), and Carneiro (2002), which develops a semi parametric framework.
5 2. Models with and without Heterogeneity A conventional model of the return to education without heterogeneity in returns: (1) I for individuals (i=1, 2,...,n), lnYi is log income, Si is schooling level or years of schooling, Xi is a vector of variables βis the rate of return to education, γis a vector of coefficients.
6 OLS problem: omitted ability Ai, Three strategies: (1) IV. But It is also very hard to find satisfactory instruments. In fact, most commonly used instruments in the schooling literature are invalid because they are correlated with the omitted ability. (2) Fixed effect method: find a paired comparison such as a genetic twin or sibling with similar or identical ability. It needs enough information
7. (3) Proxy variables for ability Many empirical analyses reveal that better family background and better family resources are usually associated with better environments which raise ability. In our empirical work we use parental income as a proxy for ability.
8 A model with heterogeneous returns to education (in random coefficient form) (2) βi is the heterogeneous rate of return to education, which varies among individuals. Xi is a vector of variables including the proxy for ability. We focus on two schooling choices: (1) high school Si=0 (2) college Si=1
9 The two potential selection outcomes
10 Observed log earnings are: where (5) is the heterogeneous return to education for individual i. βi varies in the population, and the return to schooling is a random variable with a distribution.
11 The mean of βi given X is: (6) Decision rule: (7) Si* is a latent variable denoting the net benefit of going to school Zi is an observed vector of variables.
12 Pi = Pi (Zi) is the propensity score or probability of receiving treatment (going to college). P(Z) can be estimated by a logit or probit model. Usi is the unobserved heterogeneity for individual i in the treatment selection equation. Without loss of generality, we may assume that Usi ~ Unif [0,1]. The decision of whether to go to college (or not) for individual i is determined completely by the comparison of the observed heterogeneity Pi(Zi) with the unobserved heterogeneity Usi. The smaller the Usi, the more likely it is that the person goes to college.
13 3. Selection Bias and The Marginal Treatment Effect (8) ATE is the average treatment effect (the effect of randomly assigning a person to schooling) (9)
14 (10) Selection bias is the mean difference in the no-schooling (S = 0) unobservables between those who go to school and those who do not.
15 TT (treatment on the treated), the effect of treatment on those who receive it (e.g. go to college) compared with what they would experience without treatment (i.e. do not go to college), defined as (11) Sorting effect is the mean gain of the unobservables for people who choose ‘1’.
16 IV is not a consistent estimator In the presence of heterogeneity and selection bias. (12)
17 Neither OLS nor IV is a consistent estimator of the mean return to education in the presence of heterogeneity and selection. Under certain assumptions, it is possible to identify the heterogeneous return to education with marginal treatment effect (MTE) via the method of local instrument variables (LIV), where MTE is:
18 The MTE is the average willingness to pay (WTP) for lnY1i (compared to lnY0i ) given characteristics Xi and unobserved heterogeneity Usi. MTE can be estimated from the following relationship, where LIV can be estimated by semi parametric methods for derivatives (Heckman, 2001):
19 All the other treatment variables can be unified using MTE:
20 Where the weights are:
21 Treatment on the untreated (TUT) is the effect of treatment on those who do not receive it (i.e. do not go to college) compared with what they would experience with the treatment (i.e. go to college)
22 4. Data Set and Empirical Results Data Source: China Urban Household Income and Expenditure Survey (CUHIES) 2000 Conducted by the Urban Socio-Economic Survey Organization of the National Bureau of Statistics. Six provinces: Guangdong Liaoning Sichuan Shaanxi Zhejiang Beijing.
23 Sample size: 4250 households. For each household, there is rich information on all household members, including head, spouse, children and parents. Age, sex, education level, employment status and enterprise ownership, occupation, years of work experience and total annual income are available for each household member. There are seven education levels in the sample: university, college, special technical school, senior high school, junior high school, primary school, and other.
24 The used sample consists of 587 individuals, including 273 people with four-year college (or university) certificates and 314 people with only senior high school certificates.
25 Table 2. Summary Statistics Variable All (n=587)Treated (n=273)Untreated (n=314) MeanStd. ErrMeanStd. ErrMeanStd. Err Log Wage Age Years of work experience Year college attendance Male Lived in Guangdong Province (GD) Lived in Liaoning Province (LN) Lived in Shaanxi Province (SX) Lived in Sichuan Province (SC) Lived in Beijing (BJ) Lived in Zhejiang Province (ZJ) Worked in state owned enterprises (SOEs) Worked in collective-owned firms Worked in joint-venture or foreign owned firms Worked in private owned firms Worked in IND_CON sector* Worked in TRA_COM sector* Worked in HOU_RES sector* Worked in SPO_SOC sector* Worked in CUL_SCI sector* Worked in FIN_INS sector* Worked in GOVERN sector* Worked in OTHER sector* Years of father’s education Years of mother’s education Parental income (in 1000 yuan)
26 Variable OLSIV CoefficientStandard ErrorCoefficientStandard Error Intercept Year’s college attendance Years of work experience Experience squared Parental income in 1000 yuan Male Lived in Guangdong Province Lived in Liaoning Province Lived in Sichuan Province Lived in Beijing Lived in Zhejiang Province Worked in state owned enterprises Worked in collective-owned firms Worked in private owned firms Worked in IND_CON sector* Worked in TRA_COM sector* Worked in SPO_SOC sector* Worked in FIN_INS sector* Table 3. Estimated Mincer Model
27 VariableCoefficientStandard Error Mean Marginal Effect Intercept Years of father’s education Years of mother’s education Parental income in 1000 yuan Born before Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Born in Table 4. Estimated Logit Model For Schooling
28 Variable High SchoolCollege Std. Err. Years of work experience Experience squared Parental income in 1000 yuan Male Lived in Guangdong Province Lived in Liaoning Province Lived in Sichuan Province Lived in Beijing Lived in Zhejiang Province Worked in state owned enterprises Worked in collective-owned firms Worked in private owned firms Worked in IND_CON sector* Worked in TRA_COM sector* Worked in SPO_SOC sector* Worked in FIN_INS sector* Table 5. Estimated Coefficients from Local Linear Regression Guassian Kernel, bandwidth = 0.4
29 ParametersEstimation OLS IV* ATE TT TUT Bias Selection Bias Sorting Gain * Using propensity score as instrument Table 6. Comparison of Different Parameters
30
31
32
33
34 A: with firms’ ownership dummies but not sectoral dummies B: with sectoral dummies but not ownership dummies C: no sectoral and ownership dummies
35
36 5. Concluding Remarks Neglecting heterogeneity and selection bias leads to biased and inconsistent estimates, such as those obtained using conventional OLS and IV parameters. We demonstrate the importance of proxying for ability in the wage equation to identify returns to education. Excluding the proxy leads to implausibly high estimates of the return to schooling.
37 In 2000 the average return to four-year college attendance is 43% (on average, 11% annually) for young people in the urban areas of the six provinces. The results imply that, after more than twenty years of economic reform with market orientation, the average return to education in China has increased markedly compared with that of the 1980s and early 1990s.