Folds {sharp} | R Documentation |
Splitting observations into folds
Description
Generates a list of n_folds
non-overlapping sets of observation IDs
(folds).
Usage
Folds(data, family = NULL, n_folds = 5)
Arguments
data |
vector or matrix of data. In regression, this should be the outcome data. |
family |
type of regression model. This argument is defined as in
|
n_folds |
number of folds. |
Details
For categorical outcomes (i.e. family
argument is set to
"binomial"
, "multinomial"
or "cox"
), the split is done
such that the proportion of observations from each of the categories in
each of the folds is representative of that of the full sample.
Value
A list of length n_folds
with sets of non-overlapping
observation IDs.
Examples
# Splitting into 5 folds
simul <- SimulateRegression()
ids <- Folds(data = simul$ydata)
lapply(ids, length)
# Balanced folds with respect to a binary variable
simul <- SimulateRegression(family = "binomial")
ids <- Folds(data = simul$ydata, family = "binomial")
lapply(ids, FUN = function(x) {
table(simul$ydata[x, ])
})
[Package sharp version 1.4.6 Index]