Presentation is loading. Please wait.

Presentation is loading. Please wait.

Implementation of a double-hurdle model Bruno Garcia The Stata Journal (2013), 13, Number 4, pp. 776-794 Presented by Gulzat.

Similar presentations


Presentation on theme: "Implementation of a double-hurdle model Bruno Garcia The Stata Journal (2013), 13, Number 4, pp. 776-794 Presented by Gulzat."โ€” Presentation transcript:

1 Implementation of a double-hurdle model Bruno Garcia The Stata Journal (2013), 13, Number 4, pp Presented by Gulzat

2 The paper is about A double hurdle model (DHM) (Cragg, 1971 Econometrica 39: ) What is new: Stata command dblhurdle (and predict after dblhurdle )

3 Censored dependent variable models
E.g. Consumer or not if a consumer the value of the expenditure is known Tobit: assumes that the factors explaining of becoming a consumer and how much to spend have the same effect on these two decisions DHM: allows these effects to differ

4 Tobit Model ๐‘Œ ๐‘– = ๐‘Œ ๐‘– โˆ— ๐‘–๐‘“ ๐‘Œ ๐‘– โˆ— >0 ๐‘Œ ๐‘– =0 ๐‘–๐‘“ ๐‘Œ ๐‘– โˆ— โ‰ค0
๐‘Œ ๐‘– = ๐‘Œ ๐‘– โˆ— ๐‘–๐‘“ ๐‘Œ ๐‘– โˆ— >0 ๐‘Œ ๐‘– = ๐‘–๐‘“ ๐‘Œ ๐‘– โˆ— โ‰ค0 ๐‘Œ ๐‘– โˆ— = ๐‘‹ ๐‘– ๐›ฝ+ ๐œ€ ๐‘– and ๐œ€ ๐‘– โ‰ˆ๐‘(0, ๐œŽ 2 ) Two variables and one model to explain these two variables

5 Double Hurdle Model 1. Potential consumer or not, D is not observed
๐ท ๐‘– =1 ๐‘–๐‘“ ๐‘ ๐‘– ๐›ฟ+ ๐‘ข ๐‘– >0 ๐ท ๐‘– =0 ๐‘–๐‘“ ๐‘ ๐‘– ๐›ฟ+ ๐‘ข ๐‘– โ‰ค0 2. ๐‘Œ ๐‘– โˆ— = ๐‘‹ ๐‘– ๐›ฝ+ ๐œ€ ๐‘– ๐‘Œ ๐‘– = ๐‘Œ ๐‘– โˆ— ๐‘–๐‘“ ๐ท ๐‘– =1 ๐‘Ž๐‘›๐‘‘ ๐‘Œ ๐‘– โˆ— >0 ๐‘Œ ๐‘– = ๐‘œ๐‘กโ„Ž๐‘’๐‘Ÿ๐‘ค๐‘–๐‘ ๐‘’ (or ๐ท ๐‘– =0 or ( ๐‘Œ ๐‘– โˆ— โ‰ค0 & ๐ท ๐‘– =1) ) ๐‘ข ๐‘– โ‰ˆ๐‘ 0,1 ๐œ€ ๐‘– โ‰ˆ๐‘(0, ๐œŽ 2 ) ๐‘๐‘œ๐‘Ÿ๐‘Ÿ( ๐‘ข ๐‘– , ๐œ€ ๐‘– )=๐œŒ unobserved elements effecting consumers/nonconsumers may affect amount of expenditure Individuals make decisions in two steps

6 Double Hurdle Model (following the paper.....)
Decision 1: participation Decision 2: quantity (maybe zero) ๐‘ฆ ๐‘– =the observed consumption of an individual, dependent variable continous over positive values, but ๐‘ƒ ๐‘ฆ=0 >0 ๐‘Ž๐‘›๐‘‘ ๐‘ƒ ๐‘ฆ<0 =0 ๐‘ฆ ๐‘– = ๐‘ฅ ๐‘– ๐›ฝ+ ๐œ– ๐‘– ๐‘–๐‘“ min ๐‘ฅ ๐‘– ๐›ฝ+ ๐œ– ๐‘– , ๐‘ง ๐‘– ๐›พ+ ๐‘ข ๐‘– > ๐‘œ๐‘กโ„Ž๐‘’๐‘Ÿ๐‘ค๐‘–๐‘ ๐‘’ ๐œ– ๐‘– ๐‘ข ๐‘– ~๐‘ 0,ฮฃ , ฮฃ= ๐œŽ 12 ๐œŽ 12 ๐œŽ ฮจ ๐‘ฅ,๐‘ฆ,๐œŒ =CDF of a bivariate normal with correlation ๐œŒ

7 Double Hurdle Model The log liklihood function for the DHM (ฮฆโˆ’๐ถ๐ท๐น, ๐œ™โˆ’๐ท๐น): log ๐ฟ = ๐‘ฆ ๐‘– =0 ๐‘™๐‘œ๐‘” 1โˆ’ฮฆ ๐‘ง ๐‘– ๐›พ, ๐‘ฅ ๐‘– ๐›ฝ ๐œŽ ,๐œŒ + ๐‘ฆ ๐‘– >0 ๐‘™๐‘œ๐‘” ฮฆ ๐‘ง ๐‘– ๐›พ+ ๐œŒ ๐œŽ ( ๐‘ฆ ๐‘– โˆ’ ๐‘ฅ ๐‘– ๐›ฝ) 1โˆ’ ๐œŒ โˆ’๐‘™๐‘œ๐‘” ๐œŽ +๐‘™๐‘œ๐‘” ๐œ™ ๐‘ฆ ๐‘– โˆ’ ๐‘ฅ ๐‘– ๐›ฝ ๐œŽ ๐‘ฆ ๐‘– >0 ๐‘™๐‘œ๐‘” ฮฆ ๐‘ง ๐‘– ๐›พ+ ๐œŒ ๐œŽ ( ๐‘ฆ ๐‘– โˆ’ ๐‘ฅ ๐‘– ๐›ฝ) 1โˆ’ ๐œŒ โˆ’๐‘™๐‘œ๐‘” ๐œŽ +๐‘™๐‘œ๐‘” ๐œ™ ๐‘ฆ ๐‘– โˆ’ ๐‘ฅ ๐‘– ๐›ฝ ๐œŽ

8 Double Hurdle Model ๐‘ฅ ๐‘– ๐›ฝ+ ๐œ– ๐‘– models the quantity equation
๐‘ง ๐‘– ๐›พ+ ๐‘ข ๐‘– models the participation equation The command estimates ๐›ฝ,๐›พ,๐œŒ, ๐‘Ž๐‘›๐‘‘ ๐œŽ where ๐œŽ=๐‘‰๐‘Ž๐‘Ÿ(๐œ–) Restriction: ๐‘‰๐‘Ž๐‘Ÿ ๐‘ข =1 the model to be identified

9 Double Hurdle Model: Stata

10

11 Double Hurdle Model

12

13 Example: The use of the dblhurdle command using smoke
Example: The use of the dblhurdle command using smoke.dta from Wooldridge (2010).

14

15

16 Marginal effects The number of years of schooling (educ) on:
1. The probability of smoking 2. The expected number of cigarettes smoked given that you smoke 3. The expected number of cigarettes smoked

17 Prediction ppar - the probability of being away from the corner conditional on the covariates: ycond - expectation: yexpected - expected value of y conditional on x and z:

18 Marginal effects

19 Marginal effects

20 Marginal effects

21 Monte Carlo simulation: Finite sample properties of the estimator
Three measures of performance: The mean of the estimated parameters should be close to their true values. The mean standard error of the estimated parameters over the repetitions should be close to the standard deviation of the point estimates. The rejection rate of hypothesis tests should be close to the nominal size of the test.

22 Monte Carlo simulation
The data-generating process can be summarized as follows:

23 Monte Carlo simulation
A dataset of 2,000 observations was created. The xโ€™s were drawn from a standard normal distribution, and the dโ€™s were drawn from a Bernoulli with p = 1/2. Refer to this dataset as โ€œbaseโ€. Iteration of the simulation: 1. Use โ€œbaseโ€. 2. For each observation, draw (gen) ๐œ– from a standard normal. 3. For each observation, draw (gen) u from a standard normal. 4. For each observation, compute y according to the data-generating process presented above. 5. Fit the model, and save the values of interest with post.

24 Monte Carlo simulation

25 Monte Carlo simulation
A less intuitive issue: The set of regressors in the participation equation=the set of regressors of the quantity equation. The model is weakly identified. The data-generating process:

26 Monte Carlo simulation

27 Conclusion Researchers may consider dblhurdle when using tobit model
Its flexibility allows the researcher to break down the modeled quantity along two useful dimensions, the โ€œquantityโ€ dimension and the โ€œparticipationโ€ dimension The command presented in this article only allows for a single corner in the data One desirable feature to add is the capability to handle dependent variables with two corners


Download ppt "Implementation of a double-hurdle model Bruno Garcia The Stata Journal (2013), 13, Number 4, pp. 776-794 Presented by Gulzat."

Similar presentations


Ads by Google