cv.DMRnet {DMRnet} | R Documentation |
cross-validation for DMRnet
Description
Executes k-fold cross-validation for DMR and returns a value for df.
Usage
cv.DMRnet(
X,
y,
family = "gaussian",
o = 5,
nlambda = 100,
lam = 10^(-7),
interc = TRUE,
maxp = ifelse(family == "gaussian", ceiling(length(y)/2), ceiling(length(y)/4)),
nfolds = 10,
indexation.mode = "GIC",
algorithm = "DMRnet",
clust.method = ifelse(algorithm == "glamer", "single", "complete")
)
Arguments
X |
Input data frame, of dimension n x p; each row is an observation vector. Columns can be numerical or integer for continuous predictors or factors for categorical predictors. |
y |
Response variable. Numerical for |
family |
Response type; one of: |
o |
Parameter of the group lasso screening step, described in |
nlambda |
Parameter of the group lasso screening step, described in |
lam |
The amount of penalization in ridge regression (used for logistic regression in order to allow for parameter estimation in linearly separable setups) or the amount of matrix regularization in case of linear regression. Used only for numerical reasons. The default value is 1e-7. |
interc |
Should intercept(s) be fitted (the default, |
maxp |
Maximal number of parameters of the model, smaller values result in quicker computation. |
nfolds |
Number of folds in cross-validation. The default value is 10. |
indexation.mode |
How the cross validation algorithm should index the models for internal quality comparisons; one of: |
algorithm |
The algorithm to be used; for partition selection (merging levels) use one of: |
clust.method |
Clustering method used for partitioning levels of factors; see function hclust in package stats for details. |
Details
cv.DMRnet algorithm does nfold
-fold cross-validation for DMRnet. The df for the minimal estimated prediction error is returned.
Value
An object with S3 class "cv.DMR" is returned, which is a list with the ingredients of the cross-validation fit.
- df.min
df (number of parameters) of the model with minimal cross-validated error.
- df.1se
df (number of parameters) of the smallest model falling under the upper curve of a prediction error plus one standard deviation.
- dmr.fit
Fitted
DMR
object for the full data.- cvm
The mean cross-validated error for the entire sequence of models.
- foldid
The fold assignments used.
See Also
plot.cv.DMR
for plotting, coef.cv.DMR
for extracting coefficients and predict.cv.DMR
for prediction.
Examples
## cv.DMRnet for linear regression
set.seed(13)
data(miete)
ytr <- miete$rent[1:1500]
Xtr <- miete$area[1:1500]
Xte <- miete$area[1501:2053]
cv <- cv.DMRnet(Xtr, ytr)
print(cv)
plot(cv)
coef(cv)
ypr <- predict(cv, newx = Xte)