| brmsmargins {brmsmargins} | R Documentation |
Calculate Marginal Effects from 'brms' Models
Description
This function is designed to help calculate marginal effects
including average marginal effects (AMEs) from brms models.
Arguments are labeled as required when it is required that the
user directly specify the argument. Arguments are labeled as
optional when either the argument is optional or there are
sensible default values so that users do not typically need to specify
the argument.
Usage
brmsmargins(
object,
at = NULL,
wat = NULL,
add = NULL,
newdata = model.frame(object),
CI = 0.99,
CIType = "HDI",
contrasts = NULL,
ROPE = NULL,
MID = NULL,
subset = NULL,
dpar = NULL,
seed,
verbose = FALSE,
...
)
Arguments
object |
A required argument specifying a fitted |
at |
An optional argument (but note, either |
wat |
An optional list with named elements including one element named,
“ID” with a single character string, the name of the variable
in the model frame that is the ID variable. Additionally,
there should be one or more named elements, named after variables
in the model (and specified in the |
add |
An optional argument (but note, either |
newdata |
An optional argument specifying an object inheriting
from data frame indicating the baseline values to use for predictions and AMEs.
It uses a sensible default: the model frame from the |
CI |
An optional argument with a numeric value specifying the width
of the credible interval. Defaults to |
CIType |
An optional argument, a character string specifying the
type of credible interval (e.g., highest density interval). It is passed down to
|
contrasts |
An optional argument specifying a contrast matrix.
The posterior predictions matrix
is post multiplied by the contrast matrix, so they must be conformable.
The posterior predictions matrix has a separate column for each row in the
|
ROPE |
An optional argument, that can either be left as |
MID |
An optional argument, that can either left as |
subset |
An optional argument, a character string that is a
valid |
dpar |
An optional argument giving the parameter passed on to the |
seed |
An optional argument that controls whether (and if so what) random seed
to use. This does not matter when using fixed effects only. However,
when using Monte Carlo integration to integrate out random effects from
mixed effects models, it is critical if you are looking at a continuous
marginal effect with some small offset value as otherwise the
Monte Carlo error from one set of predictions to another may exceed
the true predicted difference.
If |
verbose |
An optional argument, a logical value whether to print
more verbose messages. Defaults to |
... |
An optional argument, additional arguments passed on to
|
Details
The main parts required for the function are a fitted model object,
(via the object argument) a dataset to be used for prediction,
(via the newdata argument which defaults to the model frame),
and a dataset passed to either at or add.
The steps are as follows:
Check that the function inputs (model object, data, etc.) are valid.
Take the dataset from the
newdataargument and either add the values from the first row ofaddor replace the values using the first row ofat. Only variables specified inatoraddare modified. Other variables are left as is.Use the
fitted()function to generate predictions based on this modified dataset. Ifeffectsis set to “fixedonly” (meaning only generate predictions using fixed effects) or to “includeRE” (meaning generate predictions using fixed and random effects), then predictions are generated entirely using thefitted()function and are, typically back transformed to the response scale. For mixed effects models with fixed and random effects whereeffectsis set to “integrateoutRE”, thenfitted()is only used to generate predictions using the fixed effects on the linear scale. For each prediction generated, the random effects are integrated out by drawingkrandom samples from the model assumed random effect(s) distribution. These are added to the fixed effects predictions, back transformed, and then averaged over allkrandom samples to perform numerical Monte Carlo integration.All the predictions for each posterior draw, after any back transformation has been applied, are averaged, resulting in one, marginal value for each posterior draw. These are marginal predictions. They are average marginal predictions if averaging over the sample dataset, or may be marginal predictions at the means, if the initial input dataset used mean values, etc.
Steps two to four are repeated for each row of
atoradd. Results are combined into a matrix where the columns are different rows fromatoraddand the rows are different posterior draws.If contrasts were specified, using a contrast matrix, the marginal prediction matrix is post multiplied by the contrast matrix. Depending on the choice(s) of
addoratand the values in the contrast matrix, these can then be average marginal effects (AMEs) by using numerical integration (addwith 0 and a very close to 0 value) or discrete difference (atwith say 0 and 1 as values) for a given predictor(s).The marginal predictions and the contrasts, if specified are summarized.
Although brmsmargins() is focused on helping to calculate
marginal effects, it can also be used to generate marginal predictions,
and indeed these marginal predictions are the foundation of any
marginal effect estimates. Through manipulating the input data,
at or add and the contrast matrix, other types of estimates
averaged or weighting results in specific ways are also possible.
Value
A list with four elements.
PosteriorPosterior distribution of all predictions. These predictions default to fixed effects only, but by specifying options toprediction()they can include random effects or be predictions integrating out random effects.SummaryA summary of the predictions.ContrastsPosterior distribution of all contrasts, if a contrast matrix was specified.ContrastSummaryA summary of the posterior distribution of all contrasts, if specified
References
Pavlou, M., Ambler, G., Seaman, S., & Omar, R. Z. (2015) doi:10.1186/s12874-015-0046-6 “A note on obtaining correct marginal predictions from a random intercepts model for binary outcomes” and Skrondal, A., & Rabe-Hesketh, S. (2009) doi:10.1111/j.1467-985X.2009.00587.x “Prediction in multilevel generalized linear models” and Norton EC, Dowd BE, Maciejewski ML. (2019) doi:10.1001/jama.2019.1954 “Marginal Effects—Quantifying the Effect of Changes in Risk Factors in Logistic Regression Models”
Examples
## Not run:
#### Testing ####
## sample data and logistic model with brms
set.seed(1234)
Tx <- rep(0:1, each = 50)
ybin <- c(rep(0:1, c(40,10)), rep(0:1, c(10,40)))
logitd <- data.frame(Tx = Tx, ybin = ybin)
logitd$x <- rnorm(100, mean = logitd$ybin, sd = 2)
mbin <- brms::brm(ybin ~ Tx + x, data = logitd, family = brms::bernoulli())
summary(mbin)
## now check AME for Tx
tmp <- brmsmargins(
object = mbin,
at = data.table::data.table(Tx = 0:1),
contrasts = matrix(c(-1, 1), nrow = 2),
ROPE = c(-.05, +.05),
MID = c(-.10, +.10))
tmp$Summary
tmp$ContrastSummary ## Tx AME
## now check AME for Tx with bootstrapping the AME population
tmpalt <- brmsmargins(
object = mbin,
at = data.table::data.table(Tx = 0:1),
contrasts = matrix(c(-1, 1), nrow = 2),
ROPE = c(-.05, +.05),
MID = c(-.10, +.10),
resample = 100L)
tmpalt$Summary
tmpalt$ContrastSummary ## Tx AME
## now check AME for continuous predictor, x
## use .01 as an approximation for first derivative
## 1 / .01 in the contrast matrix to get back to a one unit change metric
tmp2 <- brmsmargins(
object = mbin,
add = data.table::data.table(x = c(0, .01)),
contrasts = matrix(c(-1/.01, 1/.01), nrow = 2),
ROPE = c(-.05, +.05),
MID = c(-.10, +.10))
tmp2$ContrastSummary ## x AME
if (FALSE) {
library(lme4)
data(sleepstudy)
fit <- brms::brm(Reaction ~ 1 + Days + (1 + Days | Subject),
data = sleepstudy,
cores = 4)
summary(fit, prob = 0.99)
tmp <- brmsmargins(
object = fit,
at = data.table::data.table(Days = 0:1),
contrasts = matrix(c(-1, 1), nrow = 2),
ROPE = c(-.05, +.05),
MID = c(-.10, +.10), CIType = "ETI", effects = "integrateoutRE", k = 5L)
tmp$Summary
tmp$ContrastSummary
}
## End(Not run)