cv.sprinter {sprintr} | R Documentation |
Running sprinter with cross-validation
Description
The main cross-validation function to select the best sprinter fit for a path of tuning parameters.
Usage
cv.sprinter(x, y, num_keep = NULL, square = FALSE, lambda = NULL,
nlam = 100, lam_min_ratio = ifelse(nrow(x) < ncol(x), 0.01, 1e-04),
nfold = 5, foldid = NULL)
Arguments
x |
An |
y |
A response vector of size |
num_keep |
Number of candidate interactions to keep in Step 2. If |
square |
Indicator of whether squared effects should be fitted in Step 1. Default to be FALSE. |
lambda |
A user specified list of tuning parameter. Default to be NULL, and the program will compute its own |
nlam |
The number of |
lam_min_ratio |
The ratio of the smallest and the largest values in |
nfold |
Number of folds in cross-validation. Default value is 5. If each fold gets too view observation, a warning is thrown and the minimal |
foldid |
A vector of length |
Value
An object of S3 class "sprinter
".
n
The sample size.
p
The number of main effects.
a0
estimate of intercept corresponding to the CV-selected model.
compact
A compact representation of the selected variables.
compact
has three columns, with the first two columns representing the indices of a selected variable (main effects with first index = 0), and the last column representing the estimate of coefficients.fit
The whole
glmnet
fit object in Step 3.fitted
fitted value of response corresponding to the CV-selected model.
lambda
The sequence of
lambda
values used.cvm
The averaged estimated prediction error on the test sets over K folds.
cvsd
The standard error of the estimated prediction error on the test sets over K folds.
foldid
Fold assignment. A vector of length
n
.ibest
The index in
lambda
that is chosen by CV.call
Function call.
See Also
Examples
n <- 100
p <- 200
x <- matrix(rnorm(n * p), n, p)
y <- x[, 1] - 2 * x[, 2] + 3 * x[, 1] * x[, 3] - 4 * x[, 4] * x[, 5] + rnorm(n)
mod <- cv.sprinter(x = x, y = y)