enpls.ad {enpls} | R Documentation |
Ensemble Partial Least Squares for Model Applicability Domain Evaluation
Description
Model applicability domain evaluation with ensemble partial least squares.
Usage
enpls.ad(x, y, xtest, ytest, maxcomp = NULL, cvfolds = 5L,
space = c("sample", "variable"), method = c("mc", "boot"),
reptimes = 500L, ratio = 0.8, parallel = 1L)
Arguments
x |
Predictor matrix of the training set. |
y |
Response vector of the training set. |
xtest |
List, with the i-th component being the i-th test set's predictor matrix (see example code below). |
ytest |
List, with the i-th component being the i-th test set's response vector (see example code below). |
maxcomp |
Maximum number of components included within each model. If not specified, will use the maximum number possible (considering cross-validation and special cases where n is smaller than p). |
cvfolds |
Number of cross-validation folds used in each model
for automatic parameter selection, default is |
space |
Space in which to apply the resampling method.
Can be the sample space ( |
method |
Resampling method. |
reptimes |
Number of models to build with Monte-Carlo resampling or bootstrapping. |
ratio |
Sampling ratio used when |
parallel |
Integer. Number of CPU cores to use.
Default is |
Value
A list containing:
-
tr.error.mean
- absolute mean prediction error for training set -
tr.error.median
- absolute median prediction error for training set -
tr.error.sd
- prediction error sd for training set -
tr.error.matrix
- raw prediction error matrix for training set -
te.error.mean
- list of absolute mean prediction error for test set(s) -
te.error.median
- list of absolute median prediction error for test set(s) -
te.error.sd
- list of prediction error sd for test set(s) -
te.error.matrix
- list of raw prediction error matrix for test set(s)
Note
Note that for space = "variable"
, method
could
only be "mc"
, since bootstrapping in the variable space
will create duplicated variables, and that could cause problems.
Author(s)
Nan Xiao <https://nanx.me>
Examples
data("alkanes")
x <- alkanes$x
y <- alkanes$y
# training set
x.tr <- x[1:100, ]
y.tr <- y[1:100]
# two test sets
x.te <- list(
"test.1" = x[101:150, ],
"test.2" = x[151:207, ]
)
y.te <- list(
"test.1" = y[101:150],
"test.2" = y[151:207]
)
set.seed(42)
ad <- enpls.ad(
x.tr, y.tr, x.te, y.te,
space = "variable", method = "mc",
ratio = 0.9, reptimes = 50
)
print(ad)
plot(ad)
# the interactive plot requires a HTML viewer
## Not run:
plot(ad, type = "interactive")
## End(Not run)