R: Split data for k-fold cross-validation

datasplit {spm2}

R Documentation

Split data for k-fold cross-validation

Description

This function is a data splitting function for k-fold cross- validation and uses a stratified random sampling technique. It resamples the training data based on sample quantiles.

Usage

datasplit(trainy, k.fold = 10)

Arguments

`trainy`	a vector of response, must have a length equal to sample size.
`k.fold`	integer; number of folds in the cross-validation. if > 1, then apply k-fold cross validation; the default is 10, i.e., 10-fold cross validation that is recommended.

Value

A list of samples each with an index of k-fold number.

Note

This function is largely based on rfcv in randomForest.

Author(s)

Jin Li

References

A. Liaw and M. Wiener (2002). Classification and Regression by randomForest. R News 2(3), 18-22.

Examples

library(spm)
data(petrel)
idx1 <- datasplit(petrel[, 3], k.fold = 10)
table(idx1)

[Package spm2 version 1.1.3 Index]