| utiml {utiml} | R Documentation |
utiml: Utilities for Multi-Label Learning
Description
The utiml package is a framework for the application of classification algorithms to multi-label data. Like the well known MULAN used with Weka, it provides a set of multi-label procedures such as sampling methods, transformation strategies, threshold functions, pre-processing techniques and evaluation metrics. The package was designed to allow users to easily perform complete multi-label classification experiments in the R environment.
Details
Currently, the main methods supported are:
-
Classification methods:
ML Baselines,Binary Relevance (BR),BR+,Classifier Chains,Calibrated Label Ranking (CLR),Dependent Binary Relevance (DBR),Ensemble of Binary Relevance (EBR),Ensemble of Classifier Chains (ECC),Ensemble of Pruned Set (EPS),Hierarchy Of Multilabel classifiER (HOMER),Label specIfic FeaTures (LIFT),Label Powerset (LP),Meta-Binary Relevance (MBR or 2BR),Multi-label KNN (ML-KNN),Nested Stacking (NS),Pruned Problem Transformation (PPT),Pruned and Confident Stacking Approach (Prudent),Pruned Set (PS),Random k-labelsets (RAkEL),Recursive Dependent Binary Relevance (RDBR),Ranking by Pairwise Comparison (RPC) -
Evaluation methods:
Performing a cross-validation procedure,Confusion Matrix,Evaluate,Supported measures -
Pre-process utilities:
Fill sparse data,Normalize data,Remove attributes,Remove labels,Remove skewness labels,Remove unique attributes,Remove unlabeled instances,Replace nominal attributes -
Sampling methods:
Create holdout partitions,Create k-fold partitions,Create random subset,Create subset,Partition fold -
Threshold methods:
Fixed threshold,Cardinality threshold,MCUT,PCUT,RCUT,SCUT,Subset correction
However, there are other utilities methods not previously cited as
as.bipartition, as.mlresult,
as.ranking, multilabel_prediction, etc. More
details and examples are available on
utiml repository.
Notes
We use the mldr package, to manipulate multi-label data.
See its documentation to more information about handle multi-label dataset.
Cite as
@article{RJ-2018-041,
author = {Adriano Rivolli and Andre C. P. L. F. de Carvalho},
title = {{The utiml Package: Multi-label Classification in R}},
year = {2018},
journal = {{The R Journal}},
doi = {10.32614/RJ-2018-041},
url = {https://doi.org/10.32614/RJ-2018-041},
pages = {24--37},
volume = {10},
number = {2}
}
Author(s)
Adriano Rivolli <rivolli@utfpr.edu.br>
This package is a result of my PhD at Institute of Mathematics and Computer Sciences (ICMC) at the University of Sao Paulo, Brazil.
PhD advisor: Andre C. P. L. F. de Carvalho