| orsf_vs {aorsf} | R Documentation |
Variable selection
Description
Variable selection
Usage
orsf_vs(object, n_predictor_min = 3, verbose_progress = NULL)
Arguments
object |
(ObliqueForest) a trained oblique random forest object (see orsf). |
n_predictor_min |
(integer) the minimum number of predictors allowed |
verbose_progress |
(logical) not implemented yet. Should progress be printed to the console? |
Details
The difference between variables_included and predictors_included is
referent coding. The variable would be the name of a factor variable
in the training data, while the predictor would be the name of that
same factor with the levels of the factor appended. For example, if
the variable is diabetes with levels = c("no", "yes"), then the
variable name is diabetes and the predictor name is diabetes_yes.
tree_seeds should be specified in object so that each successive run
of orsf will be evaluated in the same out-of-bag samples as the initial
run.
Value
a data.table with four columns:
-
n_predictors: the number of predictors used
-
stat_value: the out-of-bag statistic
-
variables_included: the names of the variables included
-
predictors_included: the names of the predictors included
-
predictor_dropped: the predictor selected to be dropped
Examples
object <- orsf(formula = time + status ~ .,
data = pbc_orsf,
n_tree = 25,
importance = 'anova')
orsf_vs(object, n_predictor_min = 15)