important_variables {randomForestExplainer} | R Documentation |
Extract k most important variables in a random forest
Description
Get the names of k variables with highest sum of rankings based on the specified importance measures
Usage
important_variables(
importance_frame,
k = 15,
measures = names(importance_frame)[2:min(5, ncol(importance_frame))],
ties_action = "all"
)
Arguments
importance_frame |
A result of using the function measure_importance() to a random forest or a randomForest object |
k |
The number of variables to extract |
measures |
A character vector specifying the measures of importance to be used |
ties_action |
One of three: c("none", "all", "draw"); specifies which variables to pick when ties occur. When set to "none" we may get less than k variables, when "all" we may get more and "draw" makes us get exactly k. |
Value
A character vector with names of k variables with highest sum of rankings
Examples
forest <- randomForest::randomForest(Species ~ ., data = iris, localImp = TRUE, ntree = 300)
important_variables(measure_importance(forest), k = 2)
[Package randomForestExplainer version 0.10.1 Index]