Nested Logit Model
by Asif Khan
PhD Graduate Seminar in Advanced Statistics
Institute of Rural Development (IRE)
Georg-August University Goettingen
July 24, 2006
Contents
Independence of Irrelevant Alternatives
Nested Logit Model
Random Utility Model
GEV distribution
Separable Utility
Separable Probabilities
Inclusive value
Estimation
Shortcoming of Nested Logit Model
Independence of Irrelevant Alternatives (IIA)
The multinomial logit and conditional logit models are based on IIA.
The odds between two outcomes do not depend on which other outcomes are available, so the other outcomes are "irrelevant."
This means that adding or deleting outcomes does not affect the odds among the remaining outcomes.
IIA assumes that the unobservable (latent) attributes of all alternatives are perceived as equally similar.
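Example: IIA
Travel choices of city dwellers.
Shares of the 3 alternatives: car 60%, bus 20%, and a third alternative with the remaining 20%.
The ratio between bus and car = 1 : 3
IIA assumption: when the third alternative is removed, its 20% share is split in proportion to the remaining shares:
car: 60% + 15% = 75%
bus: 20% + 5% = 25%
The ratio between bus and car must stay at 1 : 3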
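The arithmetic of this example can be reproduced with a standard logit. The sketch below is only illustrative: the utilities are hypothetical, chosen as log shares so that the three alternatives start at 60%, 20% and 20%, and the label "other" for the third alternative is an assumption.

```python
import math

def logit_shares(utilities):
    """Standard logit shares: exp(V_j) divided by the sum over available alternatives."""
    denom = sum(math.exp(v) for v in utilities.values())
    return {alt: math.exp(v) / denom for alt, v in utilities.items()}

# Hypothetical utilities (log shares), giving shares of 60%, 20% and 20%.
utilities = {"car": math.log(0.60), "bus": math.log(0.20), "other": math.log(0.20)}
before = logit_shares(utilities)

# Remove the third alternative and recompute over the remaining two.
remaining = {alt: v for alt, v in utilities.items() if alt != "other"}
after = logit_shares(remaining)

# Under IIA the bus/car odds are the same before and after the removal.
ratio_before = before["bus"] / before["car"]
ratio_after = after["bus"] / after["car"]
print(after, ratio_before, ratio_after)
```

The removed 20% is split 15 : 5 between car and bus, which is exactly the proportional reallocation that keeps the ratio at 1 : 3.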
Problem with IIA
The IIA property is convenient for estimation but fails to describe consumer behavior.
Why is the assumption unrealistic? Because people will travel by white bus if the grey bus is not available, without switching to the taxi, which may be expensive.
Real-world situation: a more realistic outcome may be White bus = 40%, Taxi = 60%.
IIA is the biggest drawback of the multinomial logit model (MNLM).
Tests for the validity of IIA:
Hausman & McFadden test (1984)
Small & Hsiao test of IIA
Nested Logit Model
If the MNLM fails, the alternatives are:
Multinomial probit: computational problems
Nested logit: partial relaxation of IIA
The nested logit is also called structured logit or sequential logit; it belongs to the GEV family of models.
It is useful when some alternatives are similar to other alternatives in unobserved factors.
Developed by Ben-Akiva (1973) & McFadden (1978).
Widely used in transportation, housing, energy, etc.
Nested Logit Model
Travel choices available to a worker commuting to the workplace:

  Auto:    Car, Carpool
  Transit: Bus, Train

Probability with alternatives removed (percentage change from the original share in parentheses):

  Alternative  Original  Car removed   Carpool removed  Bus removed  Train removed
  Car          .40       -             .45 (+12.5%)     .52 (+30%)   .48 (+20%)
  Carpool      .10       .20 (+100%)   -                .13 (+30%)   .12 (+20%)
  Bus          .30       .48 (+60%)    .33 (+10%)       -            .40 (+33%)
  Train        .20       .32 (+60%)    .22 (+10%)       .35 (+70%)   -

IIA holds within a nest: proportional substitution within a nest.
IIA does not hold across nests: no proportional substitution across nests.
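The substitution pattern can be checked directly from the figures printed on the slide. The following sketch assumes each percentage is the change relative to the original share, and inverts that relation to recover the implied original shares; the helper name `original_share` is hypothetical.

```python
def original_share(new_share, pct_change):
    """Invert new_share = original * (1 + pct_change / 100)."""
    return new_share / (1 + pct_change / 100)

# (share after removal, percentage change), as printed on the slide.
bus_removed = {"car": (0.52, 30), "carpool": (0.13, 30), "train": (0.35, 70)}
carpool_removed = {"bus": (0.33, 10), "train": (0.22, 10)}

# Original shares implied by the surviving figures.
implied = {
    "car": original_share(*bus_removed["car"]),
    "carpool": original_share(*bus_removed["carpool"]),
    "bus": original_share(*carpool_removed["bus"]),
    "train": original_share(*carpool_removed["train"]),
}
rounded = {alt: round(share, 2) for alt, share in implied.items()}

# With bus removed, car and carpool (same nest) grow by the same percentage,
# while train (the other alternative in the bus nest) grows by much more.
same_within_auto = bus_removed["car"][1] == bus_removed["carpool"][1]
differs_across_nests = bus_removed["train"][1] != bus_removed["car"][1]
print(rounded, same_within_auto, differs_across_nests)
```

Equal percentage changes for car and carpool mean their ratio is unchanged, which is the within-nest IIA property; the larger change for train shows that substitution across nests is not proportional.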
Random Utility Model
The NLM is a discrete choice (DC) model.
In a DC situation, a decision maker is assumed to associate a value (utility) with each available alternative.
Utility of an alternative = f(alternative characteristics, decision maker characteristics)
Decision maker $n$ chooses the alternative $j$ with the highest utility: $U_{nj} > U_{nm}$ for all $m \neq j$
Since we cannot observe all of the utility, the unobserved part is modelled as a random variable:
$U_{nj} = V_{nj} + \varepsilon_{nj}$
Total utility = representative (observed) utility + unknown utility
The unknown terms are treated as random with a cumulative distribution and collected into a vector over the alternatives at hand: $\varepsilon_n = (\varepsilon_{n1}, \ldots, \varepsilon_{nJ})$
Based on the distribution of $\varepsilon_n$ we can make probability statements about what the choice will be.
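To see how a distribution for the unobserved terms turns into choice probabilities, the sketch below simulates the rule "choose the alternative with the highest utility". It assumes the simplest case, independent standard Gumbel (type I extreme value) errors, which yields the standard logit rather than the nested logit; the representative utilities in `V` are hypothetical.

```python
import math
import random

def gumbel_draw(rng):
    """Inverse-CDF draw from the standard Gumbel distribution."""
    u = rng.random()
    while u == 0.0:  # avoid log(0)
        u = rng.random()
    return -math.log(-math.log(u))

def simulate_choice_shares(V, n_draws, seed=0):
    """Share of draws in which each alternative has the highest U = V + eps."""
    rng = random.Random(seed)
    counts = {alt: 0 for alt in V}
    for _ in range(n_draws):
        utilities = {alt: v + gumbel_draw(rng) for alt, v in V.items()}
        chosen = max(utilities, key=utilities.get)
        counts[chosen] += 1
    return {alt: c / n_draws for alt, c in counts.items()}

# Hypothetical representative utilities V_nj for one decision maker.
V = {"car": 1.0, "bus": 0.5, "train": 0.0}
simulated = simulate_choice_shares(V, n_draws=20000)

# Closed-form logit probabilities implied by independent Gumbel errors.
denom = sum(math.exp(v) for v in V.values())
logit = {alt: math.exp(v) / denom for alt, v in V.items()}
print(simulated)
print(logit)
```

The simulated shares should be close to the closed-form logit probabilities; replacing the independent errors with errors correlated within nests is what leads to the nested logit.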
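GEV distribution
In the NLM we assume that the unobserved utility has a GEV distribution, with cumulative distribution
$$\exp\left(-\sum_{k=1}^{K}\left(\sum_{j \in B_k} e^{-\varepsilon_{nj}/\lambda_k}\right)^{\lambda_k}\right)$$
where $B_k$ denotes the set of alternatives in nest $k$ and $K$ is the number of nests.
It generalizes the univariate extreme value distribution used in the logit model.
The unobserved utility is correlated within nests; $\varepsilon_{nj}$ is uncorrelated across nests.
The parameter $\lambda_k$ measures the degree of independence among the $\varepsilon_{nj}$ in nest $k$.
A higher $\lambda_k$ means less correlation and more independence, and vice versa.
McFadden (1978) used $1 - \lambda_k$ as an indication of correlation.
$\lambda_k = 1$ means complete independence (no correlation) within nest $k$.
If $\lambda_k = 1$ for all nests, the nested logit model reduces to the standard logit model.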
Separable Utility
Observed utility ($U_{nj}$) is: $U_{nj} = U_T + U_{C|T}$
$U_T$ = utility from travel mode, e.g., auto or transit
$U_{C|T}$ = utility from travel choice, e.g., car, bus, etc.
In words: utility = marginal utility + conditional utility

  Auto:    Car, Carpool
  Transit: Bus, Train

$U_T$ (marginal utility): constant for all alternatives within a nest. It depends on variables that describe a nest; these variables differ across nests but not across alternatives within a nest.
$U_{C|T}$ (conditional utility): varies over alternatives within a nest. It depends on variables that describe an alternative; these variables vary over alternatives within each nest.
Separable Probabilities
The probability in the nested logit is a product of two simple logits:
P_i = Prob(nest containing i) x Prob(i, given nest containing i)
e.g., P_i = Prob(auto) x Prob(car, given auto)
$P_i = P_n \cdot P_{i|n}$, i.e. marginal probability x conditional probability
Upper model: $P_n = \dfrac{e^{Z_n + \lambda I_n}}{\sum_m e^{Z_m + \lambda I_m}}$
Lower model: $P_{i|n} = \dfrac{e^{Y_i/\lambda}}{\sum_j e^{Y_j/\lambda}}$
with $I_n = \ln \sum_j e^{Y_j/\lambda}$, where the sums over $j$ run over the alternatives in nest $n$ and the sum over $m$ runs over all nests.
$Y_i$ are variables that vary over alternatives within the nest.
$Z_n$ are variables that vary over nests but not over alternatives within each nest.
$I_n$ is the inclusive value of nest $n$, and $\lambda$ is the parameter of $I_n$.
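The two-level formula can be evaluated in a few lines. This is a minimal sketch rather than an estimation routine: it assumes a single log-sum coefficient shared by all nests, as in the formulas on this slide, and the values of Y, Z and lam below are hypothetical.

```python
import math

def nested_logit_probs(Y, Z, lam):
    """P_i = P_n * P_{i|n}.

    Y:   {nest: {alternative: Y_i}}, the part varying within a nest
    Z:   {nest: Z_n}, the part varying only across nests
    lam: log-sum coefficient (0 < lam <= 1)
    """
    # Inclusive value I_n = ln sum_j exp(Y_j / lam) for each nest.
    inclusive = {
        n: math.log(sum(math.exp(y / lam) for y in alts.values()))
        for n, alts in Y.items()
    }
    upper_denom = sum(math.exp(Z[m] + lam * inclusive[m]) for m in Y)
    probs = {}
    for n, alts in Y.items():
        p_nest = math.exp(Z[n] + lam * inclusive[n]) / upper_denom  # upper model
        lower_denom = math.exp(inclusive[n])
        for alt, y in alts.items():
            p_cond = math.exp(y / lam) / lower_denom  # lower model
            probs[alt] = p_nest * p_cond
    return probs

# Hypothetical inputs for the auto/transit example.
Y = {"auto": {"car": 1.0, "carpool": 0.0}, "transit": {"bus": 0.5, "train": 0.2}}
Z = {"auto": 0.0, "transit": 0.0}
lam = 0.5
probs = nested_logit_probs(Y, Z, lam)
print(probs)
```

Within a nest the ratio of two probabilities depends only on the Y values of those two alternatives, since the nest probability and the lower-model denominator cancel.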
Inclusive value
$I_n = \ln \sum_j e^{Y_j/\lambda}$
Also called the log-sum for nest $n$, or inclusive utility.
It is the expected maximum utility that a decision maker receives from the choice among the alternatives in a nest (up to the scale factor $\lambda$ and an additive constant): $I_n = E(\max U_n) = E(\max_j (V_j + \varepsilon_j))$
Ben-Akiva (1973) considered it a link between the lower and upper models. It brings information from the conditional probability (lower model) to the marginal probability (upper model), as it is the log of the denominator of the lower model.
$\lambda$ is the log-sum coefficient, showing the degree of independence in the unobserved part of utility for alternatives in a nest.
A lower $\lambda$ means less independence and more correlation (remember that $1 - \lambda$ is a measure of correlation).
$\lambda = 1$: no correlation, so a standard logit
$\lambda = 0$: perfect correlation
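The two limiting cases of the log-sum coefficient can be checked numerically. The sketch below uses hypothetical Y values and no nest-level variables (Z = 0). It computes the inclusive value with the usual max-shift so that small values of lam do not overflow, then checks that lam = 1 reproduces the standard logit nest share and that lam * I_n approaches the largest Y in the nest as lam approaches 0.

```python
import math

def inclusive_value(Y, lam):
    """I_n = ln sum_j exp(Y_j / lam), computed with a max-shift for stability."""
    scaled = [y / lam for y in Y]
    shift = max(scaled)
    return shift + math.log(sum(math.exp(s - shift) for s in scaled))

# Hypothetical within-nest utilities for two nests.
auto = [1.0, 0.0]      # car, carpool
transit = [0.5, 0.2]   # bus, train

def prob_auto(lam):
    """Upper-model probability of the auto nest with Z = 0 for both nests."""
    a = math.exp(lam * inclusive_value(auto, lam))
    t = math.exp(lam * inclusive_value(transit, lam))
    return a / (a + t)

# lam = 1: the nest probability equals the sum of standard logit probabilities.
logit_denom = sum(math.exp(y) for y in auto + transit)
logit_auto = sum(math.exp(y) for y in auto) / logit_denom

# lam near 0: lam * I_n approaches the best alternative's utility in the nest.
near_zero = 0.001 * inclusive_value(auto, 0.001)
print(prob_auto(1.0), logit_auto, near_zero)
```

As lam shrinks, the nest is valued almost entirely by its best alternative, which matches the reading of lam = 0 as perfect correlation among the alternatives in a nest.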
Estimation
Shortcoming of Nested Logit Model
For some choices there is a natural tree structure, and for others there is not.
A natural tree structure follows from the separable utility function argument (e.g., first choose between flying and ground transport, then choose between bus, car and train). In that case the behavioral property of separability translates into an estimation approach, so the nesting procedure aligns behavioral and estimation considerations.
The partitioning of some choices is ad hoc, which raises the troubling possibility that the results depend on the branches so defined: different specifications of the tree structure give different results.
There is no test for discriminating among tree structures, a problematic aspect of these models (Greene, 2003).
References
Greene, William H. Econometric Analysis. 5th ed. Prentice Hall, USA.
Wooldridge, Jeffrey M. Econometric Analysis of Cross Section and Panel Data. The MIT Press, USA.
The Nested Logit Regression Model.
Train, Kenneth. Discrete Choice Methods with Simulation. Cambridge University Press, USA.
Discrete Dependent Variable Models. http://onlinepubs.trb.org/onlinepubs/nchrp/cd-22/v2chapter5.html
Maddala, G. S. Limited-Dependent and Qualitative Variables in Econometrics. Cambridge University Press, USA.
McFadden, D. L. Disaggregate Travel Demand's RUM Side: A 30-Year Retrospective. Manuscript, Department of Economics, University of California, Berkeley.
Thank you The end