Title: Applied Microeconometrics Chapter 3 Multinomial and ordered models
1Applied MicroeconometricsChapter 3Multinomial
and ordered models
2- The Multinomial Logit Model (MNL)
- Estimation
- The IIA Assumption
- Applications
- Extensions to MNL
- Ordered Probit
Train, K. (2003), Discrete Choice Methods with
Simulation (downloadable from http//elsa.berkeley
.edu/books/choice2.html) Wooldridge, J.M. (2002),
Econometric Analysis of Cross Section and Panel
Data, Ch. 15
3Making the right decision between alternative
models
yes
mlogit
IIA valid ?
unordered
no
mprobit nested logit
ordered logit/ probit
ordered
IIAindependence of irrelevant alternatives
(assumption)
4Multinomial models
Multiple alternatives without obvious ordering
? Choice of a single alternative out of a number
of distinct alternatives
e.g. which means of transportation do you use to
get to work?
bus, car, bicycle etc.
? Example for ordered structure how do you feel
today very well, fairly well, not too well,
miserably
5Derivation of the Multinomial Logit model
- Choice between M alternatives
- Decision is determined by the utility level Uij,
an individual i derives from choosing alternative
j - Let
-
-
- where i1,,N individuals j0,,J alternatives
- The alternative providing the highest level of
utility will be chosen.
(1)
6Derivation of the Multinomial Logit model
- Probability that alternative j will be chosen
- In order to calculate this probability, the
maximum of a number of random variables has to
be determined. - In general, this requires solving
multidimensional integrals ? analytical
solutions do not exist
7Derivation of the Multinomial Logit model
- Exception If the error terms eij in (1) are
assumed to be independently identically
standard extreme value distributed, then an
analytical solution exists. - In this case, similar to binary logit, it can
be shown that the choice probabilities are
8Derivation of the Multinomial Logit model
- The special case where J1 yields the binary
Logit model.
9Independent variables
- Different kinds of independent variables
- Characteristics that do not vary over
alternatives (e.g., socio-demographic
characteristics, time effects) - Characteristics that vary over alternatives
(e.g., prices, travel distances etc.)
In the latter case, the multinomial logit is
often called conditional logit (CLOGIT in
Stata) It requires a different arrangement of the
data (one line per alternative for each i)
10Estimation of the MNL
- Estimation is by Maximum Likelihood
- The log likelihood function
- is globally concave and easy to maximize
(McFadden, 1974) ? big computational advantage
over multinomial probit or nested logit
11Interpretation of coefficients
- The coefficients themselves cannot be
interpreted easily but the exponentiated
coefficients have an interpretation as the
relative risk ratios (RRR)
risk ratio
(for simplicity, only one regressor considered)
12Interpretation of coefficients
- The relative risk ratio tells us how the
probability of choosing j relative to 0 changes
if we increase x by one unit
such that
relative risk ratio RRR
Note some people also use the term odds ratio
for the relative risk
13Interpretation of coefficients
Interpretation
Variable x increases (decreases) the probability
that alternative j is chosen instead of the
baseline alternative if RRR gt (lt) 1.
14Marginal effects in the MNL
- relative change of pij if x increases by 1 per
cent
15Independence of Irrelevant Alternatives (IIA)
Important assumption of the multinomial
Logit-Model ? it implies that the decision
between two alternatives is independent from the
existence of more alternatives
16Independence of Irrelevant Alternatives (IIA)
- Ratio of the choice probabilities between two
alternatives j and k is independent from any
other alternative
17Independence of Irrelevant Alternatives (IIA)
- Problem This assumption is invalid in many
situations.
Example red bus - blue bus - problem
- initial situation only red buses
- an individual chooses to walk with probability
2/3 - - probability of taking a red bus is 1/3
- probability ratio 21
18Independence of Irrelevant Alternatives (IIA)
Introduction of blue buses
- It is rational to believe that that the
probability of walking will not change. - If the number of red buses number of blue
buses Person walks with P4/6 - Person takes a red bus with P1/6
- Person takes a blue bus with P1/6
New probability ratio for walking vs. red bus
41
Not possible according to IIA!
19Independence of Irrelevant Alternatives (IIA)
- The following probabilities result from the
IIA- assumption - P(by foot)2/4
- P(red bus)1/4
- P(blue bus)1/4, such that
-
- Problem probability of walking decreases from
2/3 to 2/4 due to the introduction of blue buses
? not plausible!
20Independence of Irrelevant Alternatives (IIA)
- Reason of IIA property assumption that error
termns are independently distributed over all
alternatives. - The IIA property causes no problems if all
alternatives considered differ in almost the
same way.
e.g., probability of taking a red bus is highly
correlated with the probability of taking a blue
bus substitution patterns
21Hausman Test for validity of IIA
- H0 IIA is valid (odds ratios are independent
of additional alternatives) - Procedure omit a category
? Do the estimated coefficients change
significantly? - If they do reject H0
- cannot apply multinomial logit
- choose nested logit or multinomial probit
instead -
22Cramer-Ridder Test
- Often one would like to know whether certain
alternatives can be merged into one - e.g., do employment states such as
unemployment and nonemployment need to be
distinguished? - The Cramer-Ridder tests the null hypothesis
that the alternatives can be merged. It has the
form of a LR test - 2(logLU-logLR)?²
-
23Cramer-Ridder Test
- Derive the log likelihood value of the
restricted model where two alternatives (here, A
and N) have been merged -
where log
is the log likelihood of the
restricted model, log
is the log likelihood
of the pooled model, and nA and nN are the
number of times A and N have been chosen
24Application
Data
616 observations of choice of a particular health
insurance
3 alternatives
- indemnity plan deductible has to be paid
before the benefits of the policy can apply - prepaid plan prepayment and unlimited usage
of benefits - uninsured no health insurance
25Application
Observation group nonwhite
0 white
1 black
1 black
Is the choice of health care insurance determined
by the variable nonwhite?
26Application
Stata estimation output for the MNL
27Application
- If one does not choose a category as baseline,
Stata uses the alternative with the highest
frequency.
here indemnity is used as the baseline category
used for comparison
customized choice of basic category in
Stata mlogit depvar indepvars, base ()
28Interpreting the output
- The estimated coefficients are difficult to
interpret quantitatively
- The coefficient indicates how the logarithmized
probability of choosing the alternative
prepaid instead of indemnity changes if
nonwhite changes from 0 to 1. More intuitive
to exponentiate coeffs and form RRRs
29Calculation of RRR
30Calculation of RRR
- Probability of choosing
- prepaid over indemnity is 1.9 times higher
for black individuals - uninsure over indemnity is 1.5 times
higher for black individuals
31Marginal effects
- Stata computes the marginal effect of
nonwhite for each alternative separately.
(AKA margeff)
32Marginal effects
- Interpretation If the variable nonwhite
changes from 0 to 1 - the probability of choosing alternative
indemnity decreases by 15.2 per cent. - the probability of choosing alternative
prepaid increases by 15.0 per cent. - the probability of choosing alternative
uninsure rises by 0.2 per cent - (However, none of the coefficients is
significant)