Resampling Algorithms for Multi-Label Datasets


[Up] [Top]

Documentation for package ‘mldr.resampling’ version 0.2.3

Help Pages

adjustedHammingDist Auxiliary function used by MLeNN. Computes the Hamming Distance between two instances
calculateDistances Auxiliary function used to calculate the distances between an instance and the ones with a specific active label. Euclidean distance is calculated for numeric attributes, and VDM for non numeric ones.
calculateTableVDM Auxiliary function used to calculate an auxiliary table to make VDM calculation faster
executeAlgorithm Auxiliary function used by resample. It executes an algorithm, given as a string, and stores the resulting MLD in a arff file
generateInstanceMLSOL Auxiliary function used by MLSOL. Creates a synthetic sample based on two other samples, taking into account their types
getAllNeighbors Auxiliary function used by MLSOL and MLUL. Computes the kNN of every instance in a dataset
getAllNeighbors2 Auxiliary function used by MLeNN and MLTL. Gets the kNN of every instance in a dataset, when compared to some of the rest
getAllReverseNeighbors Auxiliary function used by MLUL. For each instance in the dataset, given the neighbors structure, we compute its reverse nearest neighbors
getC Auxiliary function used by MLSOL and MLUL. For each instance in the dataset, we compute, for each label, the proportion of neighbors having an opposite class with respect to the proper instance
getNN Auxiliary function used to compute the neighbors of an instance
getNumCores Get the number of cores available for parallel computing
getS Auxiliary function used by MLSOL and MLUL. For non outlier instances, it aggregates the values of C, taking into account the global class imbalance
getU Auxiliary function used by MLUL. It computes the influence of each instance with respect to its reverse neighbors
getV Auxiliary function used by MLUL. It calculates, for each instance, how important it is in the dataset
getW Auxiliary function used by MLSOL and MLUL. For non outlier instances, it aggregates the values of S for each label
initTypes Auxiliary function used by MLSOL. Categorizes each pair instance-label of the dataset with a type
LPROS Randomly clones instances with minoritary labelsets
LPRUS Randomly deletes instances with majoritary labelsets
MLeNN Multilabel edited Nearest Neighbor (MLeNN)
MLRkNNOS Reverse-nearest neighborhood based oversampling for imbalanced, multi-label datasets
MLROS Randomly clones instances with minoritary labels
MLRUS Randomly deletes instances with majoritary labels
MLSMOTE Synthetic oversampling of multilabel instances (MLSMOTE)
MLSOL Multi-label oversampling based on local label imbalance (MLSOL)
MLTL Multilabel approach for the Tomek Link undersampling algorithm (MLTL)
MLUL Multi-label undersampling based on local label imbalance (MLUL)
newSample Auxiliary function used by MLSMOTE. Creates a synthetic sample based on values of attributes and labels of its neighbors
REMEDIAL Decouples highly imbalanced labels
resample Interface function of the package. It executes one or several algorithms, given as strings, and stores the resulting MLDs in arff files
setNumCores Set the number of cores available for parallel computing
setParallel Enable/Disable parallel computing
vdm Auxiliary function used to calculate the Value Difference Metric (VDM) between two instances considering their non numeric attributes