Bayesian Econometrics Software

7.2 Ordered Logit model

Mathematical representation

\begin{matrix} \begin{aligned} y_{i}^{*} & = x_{i}^{'} β + 𝜀_{i}, 𝜀_{i} \sim Logistic (0, 1) \\ y_{i} & = \{\begin{matrix} 1 & if & γ_{0} < y_{i}^{*} \leq γ_{1} \\ 2 & if & γ_{1} < y_{i}^{*} \leq γ_{2} \\ ⋮ & ⋮ & ⋮ \\ M & if & γ_{M - 1} < y_{i}^{*} \leq γ_{M} \end{matrix} \end{aligned} \end{matrix}

(7.2)

the model is estimated using $N$ observations
$y_{i}$ is the value of the dependent variable for observation $i$ and it can assume integer values in the range $1, \dots, M$
$x_{i}$ is a $K \times 1$ vector that stores the values of the $K$ independent variables for observation $i$
$β$ is a $K \times 1$ vector of parameters
the $γ$ s are parameters that represent the cutoﬀ points between categories and they satisfy the relation $γ_{0} < γ_{1} < \dots < γ_{M}$ , with $γ_{0} = - \infty$ , $γ_{M} = \infty$ and, for identiﬁcation purposes, $γ_{1} = 0$ ; there are $M - 2$ $γ$ s to be estimated
following Albert & Chib (2001), to impose the inequality constraints on the $γ$ s the problem is re-parameterized using the 1-1 mapping: $\begin{matrix} δ_{2} & = & log (γ_{2} - γ_{1}) \\ δ_{3} & = & log (γ_{3} - γ_{2}) \\ ⋮ & = & ⋮ \\ δ_{M - 1} & = & log (γ_{M - 1} - γ_{M - 2}) \end{matrix}$
there are $M - 2$ $δ$ s to be estimated and they are collected in an $(M - 2) \times 1$ vector $δ$

Priors


Parameter	Probability density function	Default hyperparameters

$β$	$p (β) = \frac{\| P_{β} \|^{1 ∕ 2}}{{(2 π)}^{K ∕ 2}} exp \{- \frac{1}{2} {(β - m_{β})}^{'} P_{β} (β - m_{β})\}$	$m_{β} = 0_{K}$ , $P_{β} = 0.001 \cdot I_{K}$
$δ$	$p (δ) = \frac{\| P_{δ} \|^{1 ∕ 2}}{{(2 π)}^{\frac{M - 2}{2}}} exp \{- \frac{1}{2} {(δ - m_{δ})}^{'} P_{δ} (δ - m_{δ})\}$	$m_{δ} = 0_{M - 2}$ , $P_{δ} = 0.001 \cdot I_{M - 2}$

Syntax

[

<model name> =

]

ologit( y ~ x1 x2 … xK

[

, <options>

]

);

where:

y is the dependent variable name, as it appears in the dataset used for estimation
x1 x2 $\dots$ xK is a list of the $K$ independent variable names, as they appear in the dataset used for estimation; when a constant term is to be included in the model, this must be requested explicitly

The dependent variable, y, in the dataset used for estimation must contain only consecutive integer values, with the numbering starting at 1. Observations with missing values in y are dropped during estimation, but if a non-integer numerical value is encountered or if the integer values are not consecutive (for example there are no observations for which

y_{i} = 2

), then an error is produced.

The optional arguments for the ordered Logit model are:²

Gibbs parameters

"chains"	number of chains to run in parallel (positive integer); the default value is 1
"burnin"	number of burn-in draws per chain (positive integer); the default value is 10000
"draws"	number of retained draws per chain (positive integer); the default value is 20000
"thin"	value of the thinning parameter (positive integer); the default value is 1
"seed"	value of the seed for the random-number generator (positive integer); the default value is 42
Hyperparameters

"m_beta"	mean vector of the prior for $β$ ( $K \times 1$ vector); the default value is $0_{K}$
"P_beta"	precision matrix of the prior for $β$ ( $K \times K$ symmetric and positive-deﬁnite matrix); the default value is $0.001 \cdot I_{K}$
"m_delta"	mean vector of the prior for $δ$ ( $(M - 2) \times 1$ vector); the default value is $0_{M - 2}$
"P_delta"	precision matrix of the prior for $δ$ ( $(M - 2) \times (M - 2)$ symmetric and positive-deﬁnite matrix); the default value is $0.001 \cdot I_{M - 2}$
Dataset and log-marginal likelihood

"dataset"	the id value of the dataset that will be used for estimation; the default value is the ﬁrst dataset in memory (in alphabetical order)
"logML_CJ"	boolean indicating whether the Chib (1995)/Chib & Jeliazkov (2001) approximation to the log-marginal likelihood should be calculated (true $\|$ false); the default value is false

Reported Parameters


$β$	variable_name	vector of parameters associated with the independent variables
$γ$	gamma_m	vector of cutoﬀ points ( $M - 2$ )

Stored values and post-estimation analysis
If a left-hand-side id value is provided when an ordered Logit model is created, then the following results are saved in the model item and are accessible via the ‘.’ operator:

Samples	a matrix containing the draws from the posterior of $β$ and $γ$
x1, $\dots$ ,xK	vectors containing the draws from the posterior of the parameters associated with variables x1, $\dots$ ,xK (the names of these vectors are the names of the variables that were included in the right-hand side of the model)
gamma_2, $\dots$ , gamma_{M-1}	vectors containing the draws from the posterior of the cutoﬀ parameters, for $m = 2, \dots, M - 1$
logML	the Lewis & Raftery (1997) approximation of the log-marginal likelihood
logML_CJ	the Chib (1995)/Chib & Jeliazkov (2001) approximation to the log-marginal likelihood; this is available only if the model was estimated with the "logML_CJ"=true option
nchains	the number of chains that were used to estimate the model
nburnin	the number of burn-in draws per chain that were used when estimating the model
ndraws	the total number of retained draws from the posterior ( $=$ chains $\cdot$ draws)
nthin	value of the thinning parameter that was used when estimating the model
nseed	value of the seed for the random-number generator that was used when estimating the model

Additionally, the following functions are available for post-estimation analysis (see section B.14):

diagnostics()
test()
pmp()
mfx()

The ordered Logit model uses the mfx() function to calculate and report the marginal eﬀects of the independent variables on the probability of the response variable being in each one of the $M$ categories: $Prob (y = m | x)$ , for $m = 1, 2, \dots, M$ . Because the model calculates only one type of marginal eﬀects, the only valid value for the "type" option is 1. The generic syntax for a statement involving the mfx() function after estimation of an ordered Logit model is:

mfx(

[

"type"=1

]

[

, "point"=<point of calculation>

]

[

, "model"=<model name>

]

);

See the general documentation of the mfx() function (section B.14) for details on the other optional arguments.

Examples

Example 1

myData = import("$BayESHOME/Datasets/dataset10.csv");
myData.constant = ones(rows(myData), 1);

ologit( y ~ constant x1 x2 x3 x4 );

Example 2

myData = import("$BayESHOME/Datasets/dataset10.csv");
myData.constant = ones(rows(myData), 1);

myModel = ologit( y ~ constant x1 x2 x3 x4,
    "m_beta"=zeros(5,1), "P_beta" = 0.01*eye(5,5),
    "m_delta"=zeros(3,1), "P_delta" = 0.1*eye(3,3),
    "burnin"=10000, "draws"=40000, "thin"=4, "chains"=2,
    "logML_CJ" = true, "dataset"=myData);

diagnostics("model"=myModel);

mfx("point"="mean","model"=myModel);
mfx("point"="median","model"=myModel);

²Optional arguments are always given in option-value pairs (eg. "chains"=3).

[next] [prev] [prev-tail] [front] [up]