R: Component Selection

model_selection_component {IMIX}

R Documentation

Component Selection

Description

Model selection for components based on AIC and BIC values for models in IMIX

Usage

model_selection_component(
  data_input,
  data_type = c("p", "z"),
  tol = 1e-06,
  maxiter = 1000,
  seed = 10,
  verbose = FALSE
)

Arguments

`data_input`	An n x d data frame or matrix of the summary statistics z score or p value, n is the nubmer of genes, d is the number of data types. Each row is a gene, each column is a data type.
`data_type`	Whether the input data is the p values or z scores, default is p value
`tol`	The convergence criterion. Convergence is declared when the change in the observed data log-likelihood increases by less than epsilon.
`maxiter`	The maximum number of iteration, default is 1000
`seed`	set.seed, default is 10
`verbose`	Whether to print the full log-likelihood for each iteration, default is FALSE

Value

Selected number of components based on AIC and BIC

`Component_Selected_AIC`	Selected number of components by AIC with the smallest AIC value among all components and models
`Component_Selected_BIC`	Selected number of components by BIC with the smallest BIC value among all components and models
`AIC/BIC`	The AIC and BIC values for all components for IMIX_ind_unrestrict, IMIX_cor_twostep, and IMIX_cor
`IMIX_ind_unrestrict`	A list of the IMIX_ind_unrestrict for all components 1,2,...2^d, this step was fitted using R package "Mclust", more details of the output can be found there
`IMIX_cor_twostep`	A list of the IMIX_cor_twostep for all components 1,2,...2^d, here, the mean is the estimated value of IMIX_ind_unrestrict
`IMIX_cor`	A list of the IMIX_cor_twostep for all components 1,2,...2^d

References

Ziqiao Wang and Peng Wei. 2020. “IMIX: a multivariate mixture model approach to association analysis through multi-omics data integration.” Bioinformatics. <doi:10.1093/bioinformatics/btaa1001>. Luca Scrucca, Michael Fop, T. Brendan Murphy, and Adrian E. Raftery. 2016. “mclust 5: Clustering, Classification and Density Estimation Using Gaussian Finite Mixture Models.” The R Journal 8 (1): 289–317. <doi:10.32614/RJ-2016-021>.

Examples

# A toy example
data("data_p")
set.seed(10)
data <- data_p[sample(1:1000,20,replace = FALSE),]
select_comp0 <- model_selection_component(data, data_type = "p", seed = 20)


# First load the data
data("data_p")

# Perform model selections on the data
select_comp1 = model_selection_component(data_p, data_type = "p", seed = 20)

[Package IMIX version 1.1.5 Index]