Bayesian Econometrics Software

6.3 Random-eﬀects binary Probit

Mathematical representation

Prob (y_{i t} = 1) = Φ (α_{i} + x_{i t}^{'} β), α_{i} \sim N (0, \frac{1}{ω})

(6.5)

the model is estimated using observations from $N$ groups, each group observed for $T_{i}$ periods (balanced or unbalanced panels); the total number of observations is $\sum_{i = 1}^{N} T_{i}$
$y_{i t}$ is the value of the dependent variable for group $i$ , observed in period $t$ and it can take two values: 0 and 1
$x_{i t}$ is a $K \times 1$ vector that stores the values of the $K$ independent variables for group $i$ , observed in period $t$
$β$ is a $K \times 1$ vector of parameters
$α_{i}$ is the group-speciﬁc error term for group $i$
$ω$ is the precision of the group-speciﬁc error term: $σ_{α}^{2} = \frac{1}{ω}$
$Φ (\cdot)$ is the standard-Normal cdf

An equivalent representation uses the latent variable $y_{i t}^{*}$ :

\begin{matrix} \begin{aligned} y_{i t}^{*} & = α_{i} + x_{i t}^{'} β + 𝜀_{i t}, 𝜀_{i t} \sim N (0, 1), α_{i} \sim N (0, \frac{1}{ω}), \\ y_{i t} & = \{\begin{matrix} 1 & if & y_{i t}^{*} > 0 \\ 0 & if & y_{i t}^{*} \leq 0 \end{matrix} \end{aligned} \end{matrix}

(6.6)

The mean of the distribution of the

α_{i}

s is restricted to zero and, therefore, these are simply group-speciﬁc errors terms. However, including a constant term in the set of independent variables is valid and leads to a speciﬁcation equivalent to one where the group eﬀects are draws from a normal distribution with mean equal to the parameter associated with the constant term and precision

ω

Priors


Parameter	Probability density function	Default hyperparameters

$β$	$p (β) = \frac{\| P \|^{1 ∕ 2}}{{(2 π)}^{K ∕ 2}} exp \{- \frac{1}{2} {(β - m)}^{'} P (β - m)\}$	$m = 0_{K}$ , $P = 0.001 \cdot I_{K}$
$ω$	$p (ω) = \frac{b_{ω}^{a_{ω}}}{Γ (a_{ω})} ω^{a_{ω} - 1} e^{- ω b_{ω}}$	$a_{ω} = 0.01$ , $b_{ω} = 0.001$

Syntax

[

<model name> =

]

probit_re( y ~ x1 x2 … xK

[

, <options>

]

);

where:

y is the dependent variable name, as it appears in the dataset used for estimation
x1 x2 $\dots$ xK is a list of the $K$ independent variable names, as they appear in the dataset used for estimation; when a constant term is to be included in the model, this must be requested explicitly

The dependent variable, y, in the dataset used for estimation must contain only two values: 0 and 1 (with 1 indicating “success"). Observations with missing values in y are dropped during estimation, but if a numerical value other than 0 and 1 is encountered, then an error is produced.

BayES automatically drops from the sample used for estimation groups which are observed only once. This is because for these groups the group eﬀect (

α_{i}

) cannot be distinguished from the error term (

𝜀_{i t}

The optional arguments for the random-eﬀects binary Probit model are:³

Gibbs parameters

"chains"	number of chains to run in parallel (positive integer); the default value is 1
"burnin"	number of burn-in draws per chain (positive integer); the default value is 10000
"draws"	number of retained draws per chain (positive integer); the default value is 20000
"thin"	value of the thinning parameter (positive integer); the default value is 1
"seed"	value of the seed for the random-number generator (positive integer); the default value is 42
Hyperparameters

"m"	mean vector of the prior for $β$ ( $K \times 1$ vector); the default value is $0_{K}$
"P"	precision matrix of the prior for $β$ ( $K \times K$ symmetric and positive-deﬁnite matrix); the default value is $0.001 \cdot I_{K}$
"a_omega"	shape parameter of the prior for $ω$ (positive number); the default value is $0.01$
"b_omega"	rate parameter of the prior for $ω$ (positive number); the default value is $0.001$
Dataset and log-marginal likelihood

"dataset"	the id value of the dataset that will be used for estimation; the default value is the ﬁrst dataset in memory (in alphabetical order)
"logML_CJ"	boolean indicating whether the Chib (1995)/Chib & Jeliazkov (2001) approximation to the log-marginal likelihood should be calculated (true $\|$ false); the default value is false

Reported Parameters


$β$	variable_name	vector of parameters associated with the independent variables

$ω$	omega	precision parameter of the group-speciﬁc error term, $α_{i}$

$σ_{α}$	sigma_alpha	standard deviation of the group-speciﬁc error term: $σ_{α} = 1 ∕ ω^{1 ∕ 2}$

Stored values and post-estimation analysis
If a left-hand-side id value is provided when a random-eﬀects binary Probit model is created, then the following results are saved in the model item and are accessible via the ‘.’ operator:

Samples	a matrix containing the draws from the posterior of $β$ and $ω$
x1, $\dots$ ,xK	vectors containing the draws from the posterior of the parameters associated with variables x1, $\dots$ ,xK (the names of these vectors are the names of the variables that were included in the right-hand side of the model)
omega	vector containing the draws from the posterior of $ω$
logML	the Lewis & Raftery (1997) approximation of the log-marginal likelihood
logML_CJ	the Chib (1995)/Chib & Jeliazkov (2001) approximation to the log-marginal likelihood; this is available only if the model was estimated with the "logML_CJ"=true option
alpha_i	$N \times 1$ vector that stores the group-speciﬁc errors; the values in this vector are not guaranteed to be in the same order as the order in which the groups appear in the dataset used for estimation; use the store() function to associate the values in alpha_i with the observations in the dataset
nchains	the number of chains that were used to estimate the model
nburnin	the number of burn-in draws per chain that were used when estimating the model
ndraws	the total number of retained draws from the posterior ( $=$ chains $\cdot$ draws)
nthin	value of the thinning parameter that was used when estimating the model
nseed	value of the seed for the random-number generator that was used when estimating the model

Additionally, the following functions are available for post-estimation analysis (see section B.14):

diagnostics()
test()
pmp()
store()
mfx()
predict()

The random-eﬀects binary Probit model uses the store() function to associate the group eﬀects (alpha_i) with speciﬁc observations and store their values in the dataset used for estimation. The generic syntax for a statement involving the store() function after estimation of a random-eﬀects binary Probit model is:

store( alpha_i, <new variable name>,

[

"model"=<model name>

]

);

The random-eﬀects binary Probit model uses the mfx() function to calculate and report the marginal eﬀects of the independent variables on the probability of success. There are two types of marginal eﬀects which can be requested by setting the "type" argument of the mfx() function equal to 1 or 2:

when "type"=1 the marginal eﬀects are calculated marginally with respect to the group eﬀects.
when "type"=2 the marginal eﬀects are calculated conditionally on the group-eﬀects being equal to zero (the expected value of the group eﬀects, when treated as group-speciﬁc errors).

The generic syntax for a statement involving the mfx() function after estimation of a random-eﬀects binary Probit model is:

mfx(

[

"type"=1

]

[

, "point"=<point of calculation>

]

[

, "model"=<model name>

]

);

and:

mfx( "type"=2

[

, "point"=<point of calculation>

]

[

, "model"=<model name>

]

);

for calculating these two types of marginal eﬀects. The default value of the "type" option is 1. See the general documentation of the mfx() function (section B.14) for details on the other optional arguments.

The random-eﬀects binary Probit model uses the predict() function to generate predictions of the probability of success. There are two types of predictions which can be requested by setting the "type" argument of the mfx() function equal to 1 or 2:

when "type"=1 the predictions are generated marginally with respect to the group eﬀects.
when "type"=2 the predictions are generated conditionally on the group-eﬀects being equal to zero (the expected value of the group eﬀects, when treated as group-speciﬁc errors).

The generic syntax for a statement involving the predict() function after estimation of a random-eﬀects binary Probit model is:

[

]

= predict(

[

"type"=1

]

[

, "point"=<point of calculation>

]

[

,"model"=<model name>

]

[

, "stats"=true|false

]

[

, "prefix"=<prefix for new variable name>

]

);

and:

[

]

= predict( "type"=2

[

, "point"=<point of calculation>

]

[

,"model"=<model name>

]

[

, "stats"=true|false

]

[

, "prefix"=<prefix for new variable name>

]

);

for generating these two types of predictions eﬀects. The default value of the "type" option is 1. See the general documentation of the predict() function (section B.14) for details on the other optional arguments.

Examples

Example 1

myData = import("$BayESHOME/Datasets/dataset4.csv");
myData.constant = ones(rows(myData), 1);
set_pd( year, id, "dataset" = myData);

probit_re( y ~ constant x1 x2 x3 x4 );

Example 2

myData = import("$BayESHOME/Datasets/dataset4.csv");
myData.constant = ones(rows(myData), 1);
set_pd( year, id, "dataset" = myData);

myModel = probit_re( y ~ constant x1 x2 x3 x4,
    "m"=ones(5,1), "P"=0.1*eye(5,5), "a_omega"=0.1, "b_omega"=0.01,
    "burnin"=10000, "draws"=40000, "thin"=4, "chains"=2,
    "logML_CJ" = true );

diagnostics("model"=myModel);

kden(myModel.x3, "title" = "∖beta3 from the Probit model");

margeff_mean = mfx("point"="mean","model"=myModel,"type"=1);
margeff_mean = mfx("point"="mean","model"=myModel,"type"=2);

predict("type"=1, "prefix"=marg_);
predict("type"=2, "prefix"=cond_);

³Optional arguments are always given in option-value pairs (eg. "chains"=3).

[next] [prev] [prev-tail] [front] [up]