binom_bf_informed {multibridge} | R Documentation |
Evaluates Informed Hypotheses on Multiple Binomial Parameters
Description
Evaluates informed hypotheses on multiple binomial parameters.
These hypotheses can contain (a mixture of) inequality constraints, equality constraints, and free parameters.
Informed hypothesis H_r
states that binomial proportions obey a particular constraint.
H_r
can be tested against the encompassing hypothesis H_e
or the null hypothesis H_0
.
Encompassing hypothesis H_e
states that binomial proportions are free to vary.
Null hypothesis H_0
states that category proportions are exactly equal.
Usage
binom_bf_informed(
x,
n = NULL,
Hr,
a,
b,
factor_levels = NULL,
cred_level = 0.95,
niter = 5000,
bf_type = "LogBFer",
seed = NULL,
maxiter = 1000,
nburnin = niter * 0.05
)
Arguments
x |
a vector of counts of successes, or a two-dimensional table (or matrix) with 2 columns, giving the counts of successes and failures, respectively |
n |
numeric. Vector of counts of trials. Must be the same length as |
Hr |
string or character. Encodes the user specified informed hypothesis. Use either specified |
a |
numeric. Vector with alpha parameters. Must be the same length as |
b |
numeric. Vector with beta parameters. Must be the same length as |
factor_levels |
character. Vector with category names. Must be the same length as |
cred_level |
numeric. Credible interval for the posterior point estimates. Must be a single number between 0 and 1 |
niter |
numeric. Vector with number of samples to be drawn from truncated distribution |
bf_type |
character. The Bayes factor type. When the informed hypothesis is compared to the encompassing hypothesis,
the Bayes factor type can be |
seed |
numeric. Sets the seed for reproducible pseudo-random number generation |
maxiter |
numeric. Maximum number of iterations for the iterative updating scheme used in the bridge sampling routine. Default is 1,000 to avoid infinite loops |
nburnin |
numeric. A single value specifying the number of burn-in samples when drawing from the truncated distribution. Minimum number of burn-in samples is 10. Default is 5% of the number of samples. Burn-in samples are removed automatically after the sampling. |
Details
The model assumes that the data in x
(i.e., x_1, ..., x_K
) are the observations of K
independent
binomial experiments, based on n_1, ..., n_K
observations. Hence, the underlying likelihood is the product of the
k = 1, ..., K
individual binomial functions:
(x_1, ... x_K) ~ \prod Binomial(N_k, \theta_k)
Furthermore, the model assigns a beta distribution as prior to each model parameter (i.e., underlying binomial proportions). That is:
\theta_k ~ Beta(\alpha_k, \beta_k)
Value
List consisting of the following elements
$bf_list
gives an overview of the Bayes factor analysis:
-
bf_type
: string. Contains Bayes factor type as specified by the user -
bf
: data.frame. Contains Bayes factors for all Bayes factor types -
error_measures
: data.frame. Contains for the overall Bayes factor the approximate relative mean-squared errorre2
, the approximate coefficient of variationcv
, and the approximate percentage errorpercentage
-
logBFe_equalities
: data.frame. Lists the log Bayes factors for all independent equality constrained hypotheses -
logBFe_inequalities
: data.frame. Lists the log Bayes factor for all independent inequality constrained hypotheses
-
$cred_level
numeric. User specified credible interval
$restrictions
list that encodes informed hypothesis for each independent restriction:
-
full_model
: list containing the hypothesis, parameter names, data and prior specifications for the full model. -
equality_constraints
: list containing the hypothesis, parameter names, data and prior specifications for each equality constrained hypothesis. -
inequality_constraints
: list containing the hypothesis, parameter names, data and prior specifications for each inequality constrained hypothesis. In addition, innr_mult_equal
andnr_mult_free
encodes which and how many parameters are equality constraint or free, inboundaries
includes the boundaries of each parameter, innineq_per_hyp
states the number of inequality constraint parameters per independent inequality constrained hypothesis, and indirection
states the direction of the inequality constraint.
-
$bridge_output
list containing output from bridge sampling function:
-
eval
: list containing the log prior or posterior evaluations (q11
) and the log proposal evaluations (q12
) for the prior or posterior samples, as well as the log prior or posterior evaluations (q21
) and the log proposal evaluations (q22
) for the samples from the proposal distribution -
niter
: number of iterations of the iterative updating scheme -
logml
: estimate of log marginal likelihood -
hyp
: evaluated inequality constrained hypothesis -
error_measures
: list containing inre2
the approximate relative mean-squared error for the marginal likelihood estimate, incv
the approximate coefficient of variation for the marginal likelihood estimate (assumes that bridge estimate is unbiased), and inpercentage
the approximate percentage error of the marginal likelihood estimate
-
$samples
list containing a list for prior samples and a list of posterior samples from truncated distributions which were used to evaluate inequality constraints. Prior and posterior samples of independent inequality constraints are again saved in separate lists. Samples are stored as matrix of dimension
nsamples x nparams
.
Note
The following signs can be used to encode restricted hypotheses: "<"
and ">"
for inequality constraints, "="
for equality constraints,
","
for free parameters, and "&"
for independent hypotheses. The restricted hypothesis can either be a string or a character vector.
For instance, the hypothesis c("theta1 < theta2, theta3")
means
-
theta1
is smaller than boththeta2
andtheta3
The parameters
theta2
andtheta3
both havetheta1
as lower bound, but are not influenced by each other.
The hypothesis c("theta1 < theta2 = theta3 & theta4 > theta5")
means that
Two independent hypotheses are stipulated:
"theta1 < theta2 = theta3"
and"theta4 > theta5"
The restrictions on the parameters
theta1
,theta2
, andtheta3
do not influence the restrictions on the parameterstheta4
andtheta5
.-
theta1
is smaller thantheta2
andtheta3
-
theta2
andtheta3
are assumed to be equal -
theta4
is larger thantheta5
References
Damien P, Walker SG (2001). “Sampling truncated normal, beta, and gamma densities.” Journal of Computational and Graphical Statistics, 10, 206–215.
Gronau QF, Sarafoglou A, Matzke D, Ly A, Boehm U, Marsman M, Leslie DS, Forster JJ, Wagenmakers E, Steingroever H (2017). “A tutorial on bridge sampling.” Journal of Mathematical Psychology, 81, 80–97.
Frühwirth-Schnatter S (2004). “Estimating marginal likelihoods for mixture and Markov switching models using bridge sampling techniques.” The Econometrics Journal, 7, 143–167.
Sarafoglou A, Haaf JM, Ly A, Gronau QF, Wagenmakers EJ, Marsman M (2021). “Evaluating Multinomial Order Restrictions with Bridge Sampling.” Psychological Methods.
See Also
Other functions to evaluate informed hypotheses:
binom_bf_equality()
,
binom_bf_inequality()
,
mult_bf_equality()
,
mult_bf_inequality()
,
mult_bf_informed()
Examples
# data
x <- c(3, 4, 10, 11)
n <- c(15, 12, 12, 12)
# priors
a <- c(1, 1, 1, 1)
b <- c(1, 1, 1, 1)
# informed hypothesis
factor_levels <- c('binom1', 'binom2', 'binom3', 'binom4')
Hr <- c('binom1', '<', 'binom2', '<', 'binom3', '<', 'binom4')
output_total <- binom_bf_informed(x, n, Hr, a, b, niter=2e3, factor_levels, seed=2020)