R: Estimating Mixed Graphical Models

mgm {mgm}

R Documentation

Estimating Mixed Graphical Models

Description

Function to estimate k-degree Mixed Graphical Models via nodewise regression.

Usage

mgm(data, type, level, lambdaSeq, lambdaSel, lambdaFolds,
    lambdaGam, alphaSeq, alphaSel, alphaFolds, alphaGam, 
    k, moderators, ruleReg, weights, threshold, method, 
    binarySign, scale, verbatim, pbar, warnings, saveModels, 
    saveData, overparameterize, thresholdCat, signInfo, ...)

Arguments

`data`	n x p data matrix
`type`	p vector indicating the type of variable for each column in `data`. "g" for Gaussian, "p" for Poisson, "c" for categorical.
`level`	p vector indicating the number of categories of each variable. For continuous variables set to 1.
`lambdaSeq`	A sequence of lambdas that should be searched (see also `lambdaSel`). Defaults to `NULL`, which uses the `glmnet` default to select a lambda candidate sequence (recommended). See `?glmnet` for details.
`lambdaSel`	Specifies the procedure for selecting the tuning parameter controlling the Lq-penalization. The two options are cross validation "CV" and the Extended Bayesian Information Criterion (EBIC) "EBIC". The EBIC performs well in selecting sparse graphs (see Barber and Drton, 2010 and Foygel and Drton, 2014). Note that when also searching the alpha parameter in the elastic net penalty, cross validation should be preferred, as the parameter vector will not necessarily be sparse anymore. The EBIC tends to be a bit more conservative than CV (see Haslbeck and Waldorp, 2016). CV can sometimes not be performed with categorical variables, because `glmnet` requires at least 2 events of each category of each categorical variable in each training-fold. Defaults to `lambdaSel = "CV"`.
`lambdaFolds`	Number of folds in cross validation if `lambdaSel = "CV"`.
`lambdaGam`	Hyperparameter gamma in the EBIC if `lambdaSel = "EBIC"`. Defaults to `lambdaGam = .25`.
`alphaSeq`	A sequence of alpha parameters for the elastic net penality in [0,1] that should be searched (see also `alphaSel`). Defaults to `alphaSeq = 1`, which means that the lasso is being used. `alphaSeq = 0` corresponds to an L2-penalty (Ridge regression). For details see Friedman, Hastie and Tibshirani (2010).
`alphaSel`	Specifies the procedure for selecting the alpha parameter in the elastic net penalty. The two options are cross validation "CV" and the Extended Bayesian Information Criterion (EBIC) "EBIC". The EBIC performs well in selecting sparse graphs (see Barber and Drton, 2010 and Foygel and Drton, 2014). Note that when also searching the alpha parameter in the elastic net penalty, cross validation should be preferred, as the parameter vector will not necessarily be sparse anymore. The EBIC tends to be a bit more conservative than CV (see Haslbeck and Waldorp, 2016). CV can sometimes not be performed with categorical variables, because `glmnet` requires at least 2 events of each category of each categorical variable in each training-fold. Defaults to `alphaSel = "CV"`.
`alphaFolds`	Number of folds in cross validation if `alphaSel = "CV"`.
`alphaGam`	Hyperparameter gamma in the EBIC if `alphaSel = "EBIC"`. Defaults to `alphaGam = .25`.
`k`	Order up until including which interactions are included in the model. `k = 2` means that all pairwise interactions are included, `k = 3` means that all pairwise and all three-way interactions are included, etc. In previous versions of mgm the order of interactions was specified by the parameter d, the largest size or a neighborhood. Note that k = d + 1.
`moderators`	Integer vector with elements in `1:p`, specifying moderation effects to be included in the model. For instance, `moderators = c(4)` includes all linear moderation effects of variable 4. This is equivalent to including all 3-way interactions that include variable 4. Note that `moderators = 1:p` gives the same model as setting `k = 3` (see previous argument). Alternatively, a specific set of moderators can be specified via a M x 3 matrix, where M is the number of moderation effects. For example, `moderators = matrix(1:3, nrow=1)` adds (only) the 3-way interaction 1-2-3 to the model.
`ruleReg`	Rule used to combine estimates from neighborhood regression. E.g. for pairwise interactions, two estimates (one from regressing A on B and one from B on A) have to combined in one edge parameter. `ruleReg = "AND"` requires all estimates to be nonzero in order to set the edge to be present. `ruleReg = "OR"` requires at least one estiamte to be nonzero in order to set the edge to be present. For higher order interactions, `k` estimate have to be combined with this rule.
`weights`	A n vector with weights for observations.
`threshold`	A threshold below which edge-weights are put to zero. This is done in order to guarantee a lower bound on the false-positive rate. `threshold = "LW"` refers to the threshold in Loh and Wainwright (2013), which was used in all previous versions of mgm. `threshold = "HW"` refers to the threshold in Haslbeck and Waldorp (2016). If `threshold = "none"` no thresholding is applied. Defaults to `threshold = "LW"`.
`method`	Estimation method, currently only `method = "glm"`.
`binarySign`	If `binarySign = TRUE`, a sign for the interaction within binary nodes and between binary and continuous nodes is provided in the output. Note that in this case the two categories of the binary variables have to be coded in 0,1. This is to ensure that the interpretation of the sign is unambigous: a positive sign of a parameter means that increasing the associated predictor results in a higher probability for category 1.
`scale`	If `scale = TRUE`, all Gaussian nodes (specified by `"g"` in `type`) are centered and divided by their standard deviation. Scaling is recommended, because otherwise the penalization of a parameter depends on the variance of the associated predictor. Defaults to `scale = TRUE`.
`verbatim`	If `verbatim = TRUE`, no warnings and no progress bar is shown. Defaults to `verbatim = FALSE`.
`pbar`	If `pbar = TRUE`, a progress bar is shown. Defaults to `pbar = TRUE`.
`warnings`	If `warnings = TRUE`, no warnigns are returned. Defaults to `warnings = FALSE`.
`saveModels`	If `saveModels = FALSE`, only information about the weighted adjacency matrix, and if k > 2 about the factor graph is provided in the output list. If `saveModels = TRUE`, all fitted parameters are additionally returned.
`saveData`	If `saveData = TRUE`, the data is saved in the output list. Defaults to `saveData = FALSE`.
`overparameterize`	If `overparameterize = TRUE`, `mgm()` uses over-parameterized design-matrices for each neighborhood regression; this means that an interaction between two categorical variables with m and s categories is parameterized by ms parameters. If `overparameterize = FALSE` the standard parameterization (in `glmnet`) with m(s-1) parameters is used, where the first category of the predicting variable serves as reference category. If all variables are continuous both parameterizations are the same. Note that the default is set to `overparameterize = FALSE`, to be consistent with the previous mgm versions. However when the goal is to seperate pairwise interactions from 3-way (or higher) interactions, then the overparameterized version is advantageous. See the examples below for an illustration. Note that we can estimate the model despite the colinear columns in the design matrix because we use penalized regression.
`thresholdCat`	If `thresholdCat = FALSE`, the thresholds of categorical variables are set to zero. Defaults to `thresholdCat = TRUE` for which the thresholds are esimated.
`signInfo`	If `signInfo = TRUE`, a message is shown in the console, indicating that the sign of estimates is stored separately. Defaults to `signInfo = TRUE`.
`...`	Additional arguments.

Details

mgm() estimates an exponential mixed graphical model as introduced in Yang and colleagies (2014). Note that MGMs are not normalizable for all parameter values. See Chen, Witten & Shojaie (2015) for an overview of when pairwise MGMs are normalizable. To our best knowledge, for MGMs with interactions of order > 2 that include non-categorical variables, the conditions for normalizablity are unknown.

Value

The function returns a list with the following entries:

`call`	Contains all provided input arguments. If `saveData = TRUE`, it also contains the data
`pairwise`	Contains a list with all information about estimated pairwise interactions. `wadj` contains the p x p weighted adjacency matrix, if p is the number of variables in the network. `signs` has the same dimensions as `wadj` and contains the signs for the entries of `wadj`: 1 indicates a positive sign, -1 a negative sign and 0 an undefined sign. A sign is undefined if an edge is a function of more than one parameter. This is the case for interactions involving a categorical variable with more than 2 categories. `edgecolor` also has the same dimensions as `wadj` contains a color for each edge, depending on `signs`. It is provided for more convenient plotting. If only pairwise interactions are modeled (`d = 1`), `wadj` contains all conditional independence relations. The matrices `edgecolor_cb` contain a color blind friendly color scheme. `edge_lty` contains a matrix with 1s for positive/undefined signs and 2s for negative signes, which can be used as input to the `lty` argument in `qgraph()` in order to plot edges with negative sign as dashed lines.
`interactions`	A list with three entries that relate each interaction in the model to all its parameters. This is different to the output provided in `factorgraph`, where one value is assigned to each interaction. `indicator` contains a list with k-1 entries, one for each order of modeled interaction, which contain the estimated (nonzero) interactions. `weightsAgg` contains a list with k-1 entries, which in turn contain R lists, where R is the number of interactions (and rows in the corresponding list entry in`indicator`) that were estimated (nonzero) in the given entry. Each of these entries contains the mean of the absolute values of all parameters involved in this interaction. `weights` has the same structure as `weightsAgg`, but does contain all parameters involved in the interaction instead of the mean of their absolute values. `signs` has the same structure as `weightsAgg`/`weights` and provides the sign of the interaction, if defined.
`intercepts`	A list with p entries, which contain the intercept/thresholds for each node in the network. In case a given node is categorical with m categories, there are m thresholds for this variable.
`nodemodels`	A list with p `glmnet()` models, from which all above output is computed. Also contains the coefficients `models` for the selected lambda and the applied tau threshold `tau`.

Author(s)

Jonas Haslbeck <jonashaslbeck@gmail.com>

References

Barber, R. F., & Drton, M. (2015). High-dimensional Ising model selection with Bayesian information criteria. Electronic Journal of Statistics, 9(1), 567-607.

Chen S, Witten DM & Shojaie (2015). Selection and estimation for mixed graphical models. Biometrika, 102(1), 47.

Foygel, R., & Drton, M. (2010). Extended Bayesian information criteria for Gaussian graphical models. In Advances in neural information processing systems (pp. 604-612).

Friedman, J., Hastie, T., & Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of statistical software, 33(1), 1.

Haslbeck, J. M. B., & Waldorp, L. J. (2020). mgm: Estimating time-varying Mixed Graphical Models in high-dimensional Data. Journal of Statistical Software, 93(8), pp. 1-46. DOI: 10.18637/jss.v093.i08

Loh, P. L., & Wainwright, M. J. (2012, December). Structure estimation for discrete graphical models: Generalized covariance matrices and their inverses. In NIPS (pp. 2096-2104).

Yang, E., Baker, Y., Ravikumar, P., Allen, G. I., & Liu, Z. (2014, April). Mixed Graphical Models via Exponential Families. In AISTATS (Vol. 2012, pp. 1042-1050).

Examples


## Not run: 


## We fit a pairwise and 3-order MGM to the mixed Autism dataset (?autism_data)

# 1) Fit Pairwise MGM

# Call mgm()
fit_k2 <- mgm(data = autism_data$data,
              type = autism_data$type,
              level = autism_data$lev,
              k = 2) # ad most pairwise interacitons

# Weighted adjacency matrix
fit_k2$pairwise$wadj

# Visualize using qgraph()
library(qgraph)
qgraph(fit_k2$pairwise$wadj,
       edge.color = fit_k2$pairwise$edgecolor,
       layout = "spring",
       labels =  autism_data$colnames)


# 2) Fit MGM with pairwise & three-way interactions
fit_k3 <- mgm(data = autism_data$data,
              type = autism_data$type,
              level = autism_data$lev,
              k = 3) # include all interactions up to including order 3

# List of estimated interactions
fit_k3$interactions$indicator

# 3) Plot Factor Graph 
FactorGraph(object = fit_k3, 
            PairwiseAsEdge = FALSE, 
            labels = autism_data$colnames)

# 4) Predict values
pred_obj <- predict(fit_k3, autism_data$data)

head(pred_obj$predicted) # first six rows of predicted values
pred_obj$errors # Nodewise errors


## Here we illustrate why we need to overparameterize the design matrix to 
## recover higher order interactions including categorical variables

# 1) Define Graph (one 3-way interaction between 3 binary variables)

# a) General Graph Info
type = c("c", "c", "c")
level = c(2, 2, 2)
# b) Define Interaction
factors <- list()
factors[[1]] <- NULL # no pairwise interactions
factors[[2]] <- matrix(c(1,2,3), ncol=3, byrow = T) # one 3-way interaction
interactions <- list()
interactions[[1]] <- NULL
interactions[[2]] <- vector("list", length = 1)
# threeway interaction no1
interactions[[2]][[1]] <- array(0, dim = c(level[1], level[2], level[3]))
theta <- .7
interactions[[2]][[1]][1, 1, 1] <- theta  #weight theta for conf (1,1,1), weight 0 for all others
# c) Define Thresholds
thresholds <- list()
thresholds[[1]] <- c(0, 0)
thresholds[[2]] <- c(0, 0)
thresholds[[3]] <- c(0, 0)


# 2) Sample from Graph
iter <- 1
set.seed(iter)
N <- 2000
d_iter <- mgmsampler(factors = factors,
                     interactions = interactions,
                     thresholds = thresholds,
                     type = type,
                     level = level,
                     N = N,
                     nIter = 50,
                     pbar = TRUE)


# 3.1) Estimate order 3 MGM using standard parameterization
d_est_stand <- mgm(data = d_iter$data,
                   type = type,
                   level = level,
                   k = 3,
                   lambdaSel = "CV",
                   ruleReg = "AND",
                   pbar = TRUE, 
                   overparameterize = FALSE, 
                   signInfo = FALSE)

# We look at the nodewise regression for node 1 (same for all)
coefs_stand <- d_est_stand$nodemodels[[1]]$model
coefs_stand 
# We see that nonzero-zero pattern of parameter vector does not allow us to infer whether 
# interactions are present or not


# 3.2) Estimate order 3 MGM using overparameterization
d_est_over <- mgm(data = d_iter$data,
                  type = type,
                  level = level,
                  k = 3,
                  lambdaSel = "CV",
                  ruleReg = "AND",
                  pbar = TRUE, 
                  overparameterize = TRUE, 
                  signInfo = FALSE)

# We look at the nodewise regression for node 1 (same for all)
coefs_over <- d_est_over$nodemodels[[1]]$model
coefs_over # recovers exactly the 3-way interaction


# For more examples see https://github.com/jmbh/mgmDocumentation




## End(Not run)

[Package mgm version 1.2-14 Index]