smoteNew {sambia}    R Documentation

smoteNew is a necessary function that modifies the SMOTE algorithm.

Description

smoteNew is a necessary function that modifies the SMOTE algorithm in two ways: (1) it corrects a bug in the original smotefamily::SMOTE() function and (2) it lets the user specify which class is to be oversampled.

Usage

smoteNew(data.x, data.y, K, dup_size = 0, class.to.oversample)

Arguments

data.x

A data frame or matrix containing the numeric attributes of the dataset

data.y

A vector of target class labels, one for each row of data.x

K

The number of nearest neighbors used during the sampling process

dup_size

A number or vector giving the desired number of synthetic minority instances as a multiple of the original number of majority instances

class.to.oversample

The class label (a value occurring in data.y) to be oversampled

Author(s)

Norbert Krautenbacher, Kevin Strauss, Maximilian Mandl, Christiane Fuchs

Examples

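library(smotefamily)
library(sambia)
# Simulate a two-class dataset; the third column holds the class labels
data.example <- sample_generator(10000, ratio = 0.80)
# Oversample class 'p' using the 5 nearest neighbors
genData <- sambia:::smoteNew(data.example[, -3], data.example[, 3], K = 5, class.to.oversample = 'p')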
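
For readers unfamiliar with SMOTE, the following base R sketch illustrates the general idea of oversampling one user-chosen class by interpolating between a point and one of its K nearest neighbors within that class. It is not the sambia or smotefamily implementation: the function smote_sketch and its arguments n_new_per_point and class_to_oversample are hypothetical names introduced only for this illustration, and the toy data are simulated.

```r
# Hypothetical helper for illustration only; not part of sambia or smotefamily.
smote_sketch <- function(x, y, K = 5, n_new_per_point = 1, class_to_oversample) {
  x <- as.matrix(x)
  # Keep only the rows belonging to the class chosen for oversampling
  target <- x[y == class_to_oversample, , drop = FALSE]
  n <- nrow(target)
  stopifnot(n > K)

  # Pairwise Euclidean distances within the chosen class
  d <- as.matrix(dist(target))
  diag(d) <- Inf  # a point must not be selected as its own neighbor

  syn <- matrix(NA_real_, nrow = n * n_new_per_point, ncol = ncol(target),
                dimnames = list(NULL, colnames(target)))
  row <- 1
  for (i in seq_len(n)) {
    nn <- order(d[i, ])[seq_len(K)]  # indices of the K nearest neighbors
    for (j in seq_len(n_new_per_point)) {
      nb <- nn[sample.int(K, 1)]     # pick one neighbor at random
      gap <- runif(1)                # random position on the connecting segment
      syn[row, ] <- target[i, ] + gap * (target[nb, ] - target[i, ])
      row <- row + 1
    }
  }
  syn
}

# Toy data: 50 points of class "n" and 10 points of class "p"
set.seed(1)
x <- cbind(a = rnorm(60), b = rnorm(60))
y <- rep(c("n", "p"), times = c(50, 10))

syn <- smote_sketch(x, y, K = 3, n_new_per_point = 2, class_to_oversample = "p")
dim(syn)  # 20 rows (10 class-"p" points x 2 synthetic points each), 2 columns
```

Because each synthetic point lies on the line segment between two existing points of the chosen class, the new points never fall outside the range of that class in any attribute.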

[Package sambia version 0.1.0 Index]