| cv.sprinter {sprintr} | R Documentation |
Running sprinter with cross-validation
Description
The main cross-validation function to select the best sprinter fit for a path of tuning parameters.
Usage
cv.sprinter(x, y, num_keep = NULL, square = FALSE, lambda = NULL,
nlam = 100, lam_min_ratio = ifelse(nrow(x) < ncol(x), 0.01, 1e-04),
nfold = 5, foldid = NULL)
Arguments
x |
An |
y |
A response vector of size |
num_keep |
Number of candidate interactions to keep in Step 2. If |
square |
Indicator of whether squared effects should be fitted in Step 1. Default to be FALSE. |
lambda |
A user specified list of tuning parameter. Default to be NULL, and the program will compute its own |
nlam |
The number of |
lam_min_ratio |
The ratio of the smallest and the largest values in |
nfold |
Number of folds in cross-validation. Default value is 5. If each fold gets too view observation, a warning is thrown and the minimal |
foldid |
A vector of length |
Value
An object of S3 class "sprinter".
nThe sample size.
pThe number of main effects.
a0estimate of intercept corresponding to the CV-selected model.
compactA compact representation of the selected variables.
compacthas three columns, with the first two columns representing the indices of a selected variable (main effects with first index = 0), and the last column representing the estimate of coefficients.fitThe whole
glmnetfit object in Step 3.fittedfitted value of response corresponding to the CV-selected model.
lambdaThe sequence of
lambdavalues used.cvmThe averaged estimated prediction error on the test sets over K folds.
cvsdThe standard error of the estimated prediction error on the test sets over K folds.
foldidFold assignment. A vector of length
n.ibestThe index in
lambdathat is chosen by CV.callFunction call.
See Also
Examples
n <- 100
p <- 200
x <- matrix(rnorm(n * p), n, p)
y <- x[, 1] - 2 * x[, 2] + 3 * x[, 1] * x[, 3] - 4 * x[, 4] * x[, 5] + rnorm(n)
mod <- cv.sprinter(x = x, y = y)