FBED variable selection method using the correlation {Rfast} | R Documentation |
FBED variable selection method using the correlation
Description
FBED variable selection method using the correlation.
Usage
cor.fbed(y, x, ystand = TRUE, xstand = TRUE, alpha = 0.05, K = 0)
Arguments
y |
The response variable, a numeric vector. |
x |
A matrix with the data, where the rows denote the samples and the columns are the variables. |
ystand |
If this is TRUE the response variable is centered. The mean is subtracted from every value. |
xstand |
If this is TRUE the independent variables are standardised. |
alpha |
The significance level, set to 0.05 by default. |
K |
The number of times to repeat the process. The default value is 0. |
Details
FBED stands for Forward Backward with Earcly Dropping. It is a variation of the classical forward selection, where at each step, only the statistically significant variables carry on. The rest are dropped. The process stops when no other variables can be selected. If K = 1, the process is repeated testing sequentially again all those that have not been selected. If K > 1, then this is repeated.
In the end, the backward selection is performed to remove any falsely included variables. This backward phase has not been implemented yet.
Value
A list including:
runtime |
The duration of the process. |
res |
A matrix with the index of the selected variable, their test statistic value and the associated p-value. |
info |
A matrix with two columns. The cumulative number of variables selected and the number of tests for each value of K. |
Author(s)
Michail Tsagris
R implementation and documentation: Michail Tsagris <mtsagris@uoc.gr>
References
Borboudakis G. and Tsamardinos I. (2019). Forward-backward selection with early dropping. Journal of Machine Learning Research, 20(8): 1-39.
See Also
cor.fsreg, ompr, correls, fs.reg
Examples
x <- matrnorm(100, 100)
y <- rnorm(100)
a <- cor.fbed(y, x)
a
x <- NULL