| smote {SmartMeterAnalytics} | R Documentation | 
Synthetic minority oversampling (SMOTE)
Description
Performs oversampling by creating new instances.
Usage
smote(
  Variables,
  Classes,
  subset_use = NULL,
  k = 5,
  use_nearest = TRUE,
  proportions = 0.9,
  equalise_with_undersampling = FALSE,
  safe = FALSE
)
Arguments
| Variables | the data.frame of independent variables that should be used to create new instances | 
| Classes | the class labels in the prediction problem | 
| subset_use | a specific subset only is used for the oversampling. If NULL, everything is used. | 
| k | the number of neigbours for generation | 
| use_nearest | should only the nearest neighbours be used? (very slow) | 
| proportions | to which proportion (of the biggest class) should the classes be equalized | 
| equalise_with_undersampling | should additional undersampling be performed? | 
| safe | should a safe version of SMOTE be used? | 
Details
SMOTE is used to generate synthetic datapoints of a smaller class, for example to overcome the problem of imbalanced classes in classification.
Value
a list containing new independent variables data.frame and new class labels
Author(s)
Ilya Kozlovskiy, Konstantin Hopf konstantin.hopf@uni-bamberg.de