RF_predict {DMTL} | R Documentation |
Predictive Modeling using Random Forest Regression
Description
This function trains a Random Forest regressor using the training data
provided and predict response for the test features. This implementation
depends on the randomForest
package.
Usage
RF_predict(
x_train,
y_train,
x_test,
lims,
optimize = FALSE,
n_tree = 300,
m_try = 0.3333,
seed = NULL,
verbose = FALSE,
parallel = FALSE
)
Arguments
x_train |
Training features for designing the RF regressor. |
y_train |
Training response for designing the RF regressor. |
x_test |
Test features for which response values are to be predicted.
If |
lims |
Vector providing the range of the response values for modeling. If missing, these values are estimated from the training response. |
optimize |
Flag for model tuning. If |
n_tree |
Number of decision trees to be built in the forest. Defaults
to |
m_try |
Fraction of the features to be used for building each tree.
Defaults to |
seed |
Seed for random number generator (for reproducible outcomes).
Defaults to |
verbose |
Flag for printing the tuning progress when |
parallel |
Flag for allowing parallel processing when performing grid
search i.e., |
Value
If x_test
is missing, the trained RF regressor.
If x_test
is provided, the predicted values using the model.
Note
The response values are filtered to be bound by range in lims
.
Examples
set.seed(86420)
x <- matrix(rnorm(3000, 0.2, 1.2), ncol = 3); colnames(x) <- paste0("x", 1:3)
y <- 0.3*x[, 1] + 0.1*x[, 2] - x[, 3] + rnorm(1000, 0, 0.05)
## Get the model only...
model <- RF_predict(x_train = x[1:800, ], y_train = y[1:800], n_tree = 300)
## Get predictive performance...
y_pred <- RF_predict(x_train = x[1:800, ], y_train = y[1:800], x_test = x[801:1000, ])
y_test <- y[801:1000]
print(performance(y_test, y_pred, measures = "RSQ"))