jomo {jomo} | R Documentation |
Joint Modelling Imputation
Description
A wrapper function linking all the functions for JM imputation. The matrix of responses Y, must be a data.frame where continuous variables are numeric and binary/categorical variables are factors.
Usage
jomo(Y, Y2=NULL, X=NULL, X2=NULL, Z=NULL, clus=NULL, beta.start=NULL,
l2.beta.start=NULL, u.start=NULL, l1cov.start=NULL, l2cov.start=NULL,
l1cov.prior=NULL, l2cov.prior=NULL, nburn=1000, nbetween=1000, nimp=5,
a=NULL, a.prior=NULL, meth="common", output=1, out.iter=10)
Arguments
Y |
A data.frame containing the (level-1) outcomes of the imputation model. Columns related to continuous variables have to be numeric and columns related to binary/categorical variables have to be factors. |
Y2 |
A data.frame containing the level-2 outcomes of the imputation model. Columns related to continuous variables have to be numeric and columns related to binary/categorical variables have to be factors. |
X |
A data frame, or matrix, with covariates of the joint imputation model. Rows correspond to different observations, while columns are different variables. Missing values are not allowed in these variables. In case we want an intercept, a column of 1 is needed. The default is a column of 1. |
X2 |
A data frame, or matrix, with level-2 covariates of the joint imputation model. Rows correspond to different level-1 observations, while columns are different variables. Missing values are not allowed in these variables. In case we want an intercept, a column of 1 is needed. The default is a column of 1. |
Z |
A data frame, or matrix, for covariates associated to random effects in the joint imputation model. Rows correspond to different observations, while columns are different variables. Missing values are not allowed in these variables. In case we want an intercept, a column of 1 is needed. The default is a column of 1. |
clus |
A data frame, or matrix, containing the cluster indicator for each observation. If missing, functions for single level imputation are automatically used. |
beta.start |
Starting value for beta, the vector(s) of level-1 fixed effects. Rows index different covariates and columns index different outcomes. For each n-category variable we have a fixed effect parameter for each of the n-1 latent normals. The default is a matrix of zeros. |
l2.beta.start |
Starting value for beta2, the vector(s) of level-2 fixed effects. Rows index different covariates and columns index different level-2 outcomes. For each n-category variable we have a fixed effect parameter for each of the n-1 latent normals. The default is a matrix of zeros. |
u.start |
A matrix where different rows are the starting values within each cluster for the random effects estimates u. The default is a matrix of zeros. |
l1cov.start |
Starting value for the covariance matrix. Dimension of this square matrix is equal to the number of outcomes (continuous plus latent normals) in the imputation model. The default is the identity matrix. Functions for imputation with random cluster-specific covariance matrices are an exception, because we need to pass the starting values for all of the matrices stacked one above the other. |
l2cov.start |
Starting value for the level 2 covariance matrix. Dimension of this square matrix is equal to the number of outcomes (continuous plus latent normals) in the imputation model times the number of random effects plus the number of level-2 outcomes. The default is an identity matrix. |
l1cov.prior |
Scale matrix for the inverse-Wishart prior for the covariance matrix. The default is the identity matrix. |
l2cov.prior |
Scale matrix for the inverse-Wishart prior for the level 2 covariance matrix. The default is the identity matrix. |
nburn |
Number of burn in iterations. Default is 1000. |
nbetween |
Number of iterations between two successive imputations. Default is 1000. |
nimp |
Number of Imputations. Default is 5. |
a |
Starting value for the degrees of freedom of the inverse Wishart distribution of the cluster-specific covariance matrices. Default is 50+D, with D being the dimension of the covariance matrices. This is used only with clustered data and when option meth is set to "random". |
a.prior |
Hyperparameter (Degrees of freedom) of the chi square prior distribution for the degrees of freedom of the inverse Wishart distribution for the cluster-specific covariance matrices. Default is D, with D being the dimension of the covariance matrices. |
meth |
Method used to deal with level 1 covariance matrix. When set to "common", a common matrix across clusters is used (functions jomo1rancon, jomo1rancat and jomo1ranmix). When set to "fixed", fixed study-specific matrices are considered (jomo1ranconhr, jomo1rancathr and jomo1ranmixhr with coption meth="fixed"). Finally, when set to "random", random study-specific matrices are considered (jomo1ranconhr, jomo1rancathr and jomo1ranmixhr with coption meth="random") |
output |
When set to any value different from 1 (default), no output is shown on screen at the end of the process. |
out.iter |
When set to K, every K iterations a dot is printed on screen. Default is 10. |
Details
This is just a wrapper function to link all the functions in the package. Format of the columns of Y is crucial in order for the function to be using the right sub-function.
Value
On screen, the posterior mean of the fixed and random effects estimates and of the covariance matrices are shown. The only argument returned is the imputed dataset in long format. Column "Imputation" indexes the imputations. Imputation number 0 are the original data.
References
Carpenter J.R., Kenward M.G., (2013), Multiple Imputation and its Application. Wiley, ISBN: 978-0-470-74052-1.
Examples
# define all the inputs:
Y<-cldata[,c("measure","age")]
clus<-cldata[,c("city")]
nburn=as.integer(200);
nbetween=as.integer(200);
nimp=as.integer(5);
#And finally we run the imputation function:
imp<-jomo(Y,clus=clus,nburn=nburn,nbetween=nbetween,nimp=nimp)
# Finally we show how to fit the model and combine estimate with Rubin's rules
# Here we use mitml, other options are available in mice, mitools, etc etc
#if (requireNamespace("mitml", quietly = TRUE)&requireNamespace("lme4", quietly = TRUE)) {
#imp.mitml<-jomo2mitml.list(imp)
#fit.i<-with(imp.mitml, lmer(measure~age+(1|clus)))
#fit.MI<-testEstimates(fit.i, var.comp=T)
# }
#we could even run imputation with fixed or random cluster-specific covariance matrices:
#imp<-jomo(Y,clus=clus,nburn=nburn,nbetween=nbetween,nimp=nimp, meth="fixed")
#or:
#imp<-jomo(Y,clus=clus,nburn=nburn,nbetween=nbetween,nimp=nimp, meth="random")
#if we do not add clus as imput, functions for single level imputation are used:
#imp<-jomo(Y)