decorrelate.train {FADA} | R Documentation |
Factor Adjusted Discriminant Analysis 1: Decorrelation of the training data
Description
This function decorrelates the training dataset by adjusting data for the effects of latent factors of dependence.
Usage
decorrelate.train(data.train, nbf = NULL, maxnbfactors=12, diagnostic.plot = FALSE,
min.err = 0.001, verbose = TRUE,EM = TRUE, maxiter = 15,...)
Arguments
data.train |
A list containing the training dataset with the following components: |
nbf |
Number of factors. If |
maxnbfactors |
The maximum number of factors. Default is |
diagnostic.plot |
If |
min.err |
Threshold of convergence of the algorithm criterion. Default is min.err=0.001. |
verbose |
Print out number of factors and values of the objective criterion along the iterations. Default is |
EM |
The method used to estimate the parameters of the factor model. If |
maxiter |
Maximum number of iterations for estimation of the factor model. |
... |
Other arguments that can be passed in the |
Value
Returns a list with the following elements:
meanclass |
Group means estimated after iterative decorrelation |
fa.training |
Decorrelated training data |
Psi |
Estimation of the factor model parameters: specific variance |
B |
Estimation of the factor model parameters: loadings |
factors.training |
Scores of the trainings individuals on the factors |
groups |
Recall of group variable of training data |
proba.training |
Internal value (estimation of individual probabilities for the training dataset) |
Author(s)
Emeline Perthame, Chloe Friguet and David Causeur
References
Friedman, J., Hastie, T. and Tibshirani, R. (2010), Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33, 1-22.
Friguet, C., Kloareg, M. and Causeur, D. (2009), A factor model approach to multiple testing under dependence. Journal of the American Statistical Association, 104:488, 1406-1415.
Perthame, E., Friguet, C. and Causeur, D. (2015), Stability of feature selection in classification issues for high-dimensional correlated data, Statistics and Computing.
See Also
FADA-package
FADA
glmnet-package
factanal
Examples
data(data.train)
res0 = decorrelate.train(data.train,nbf=3) # when the number of factors is forced
res1 = decorrelate.train(data.train) # when the optimal number of factors is unknown