R: Random sampling of data into train and test

sampling {scorecardModelUtils}

R Documentation

Random sampling of data into train and test

Description

The function randomly samples the data and splits it into train and test datasets. It takes the training base percentage and an optional seed value as arguments. If no seed value is specified, a random seed is generated, so the split differs from one call to the next.

Usage

sampling(base, train_perc = 0.7, seed = NA, replace = F)

Arguments

`base`	input dataframe
`train_perc`	(optional) share of the total base to keep as the training sample, given as a decimal fraction (default is 0.7)
`seed`	(optional) seed value (if not given, a random seed is generated)
`replace`	(optional) whether sampling is done with or without replacement (default is FALSE, i.e. without replacement)
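
The argument semantics above can be illustrated with a minimal base R sketch. This is not the package's source code: the helper name `split_train_test`, the rounding of the training size, the way a missing seed is drawn, and the choice to put every row not drawn for training into the test sample are all assumptions made for illustration.

```r
# Hypothetical helper, not part of scorecardModelUtils: a base R sketch of a
# seeded train/test split that reuses the argument names of sampling().
split_train_test <- function(base, train_perc = 0.7, seed = NA, replace = FALSE) {
  stopifnot(is.data.frame(base), train_perc > 0, train_perc < 1)

  # Assumption: when no seed is supplied, draw one at random and return it
  # so the same split can be reproduced later.
  if (is.na(seed)) {
    seed <- sample.int(10000L, 1)
  }
  set.seed(seed)

  # Assumption: the training size is the rounded share of rows.
  n_train <- round(train_perc * nrow(base))
  train_idx <- sample.int(nrow(base), size = n_train, replace = replace)

  # Assumption: rows never drawn for training form the test sample.
  test_idx <- setdiff(seq_len(nrow(base)), train_idx)

  list(
    train_sample = base[train_idx, , drop = FALSE],
    test_sample  = base[test_idx, , drop = FALSE],
    seed         = seed
  )
}

parts <- split_train_test(iris, train_perc = 0.7, seed = 1234)

# Without replacement, the two samples partition the input rows.
nrow(parts$train_sample) + nrow(parts$test_sample) == nrow(iris)
```

Returning the seed alongside the two samples, as the Value section describes for `sampling`, lets a split that was drawn without an explicit seed be repeated later by passing that seed back in.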

Value

An object of class "sampling" is a list containing the following components:

`train_sample`	training sample as a dataframe
`test_sample`	test sample as a dataframe
`seed`	seed used

Author(s)

Arya Poddar <aryapoddar290990@gmail.com>

Examples

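library(scorecardModelUtils)

# Split the iris data 70/30 with a fixed seed
data <- iris
sampling_list <- sampling(base = data, train_perc = 0.7, seed = 1234)
sampling_list$train_sample  # training sample
sampling_list$test_sample   # test sample
sampling_list$seed          # seed used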
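
One way to check the seed behaviour described in this page is to call `sampling` twice with the same seed and compare the results. The sketch below assumes the package is installed and that the returned components are named as listed under Value. It skips the comparison when the package is not available, and it makes no claim about what the comparison prints.

```r
# Sketch: check whether a fixed seed gives a repeatable split.
# Assumes scorecardModelUtils is installed; the block is skipped otherwise.
if (requireNamespace("scorecardModelUtils", quietly = TRUE)) {
  first  <- scorecardModelUtils::sampling(base = iris, train_perc = 0.7, seed = 1234)
  second <- scorecardModelUtils::sampling(base = iris, train_perc = 0.7, seed = 1234)

  # Compare which rows landed in the training sample on each call.
  same_rows <- identical(rownames(first$train_sample),
                         rownames(second$train_sample))
  print(same_rows)

  # Row counts of each part of the split.
  print(c(train = nrow(first$train_sample), test = nrow(first$test_sample)))
}
```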

[Package scorecardModelUtils version 0.0.1.0 Index]