R: Toy multi-label dataset.

toyml {utiml}

R Documentation

Toy multi-label dataset.

Description

A toy multi-label dataset is a synthetic dataset generated by the tool http://sites.labic.icmc.usp.br/mldatagen/ using the Hyperspheres strategy. Its purpose is to be used for small tests and examples.

Usage

toyml

Format

A mldr object with 100 instances, 10 features and 5 labels:

att1: Relevant numeric attribute between (-1 and 1)
att2: Relevant numeric attribute between (-1 and 1)
att3: Relevant numeric attribute between (-1 and 1)
att4: Relevant numeric attribute between (-1 and 1)
att5: Relevant numeric attribute between (-1 and 1)
att6: Relevant numeric attribute between (-1 and 1)
att7: Relevant numeric attribute between (-1 and 1)
iatt8: Irrelevant numeric attribute between (-1 and 1)
iatt9: Irrelevant numeric attribute between (-1 and 1)
ratt10: Redundant numeric attribute between (-1 and 1)
y1: Label 'y1' - Frequency: 0.17
y2: Label 'y2' - Frequency: 0.78
y3: Label 'y3' - Frequency: 0.19
y4: Label 'y4' - Frequency: 0.69
y5: Label 'y5' - Frequency: 0.17

Details

General Information

Cardinality: 2
Density: 0.4
Distinct multi-labels: 18
Number of single labelsets: 5
Max frequency: 23

Source

Generated by http://sites.labic.icmc.usp.br/mldatagen/ Configuration:

Strategy: Hyperspheres
Relevant Features: 7
Irrelevant Features: 2
Redundant Features: 1
Number of Labels (q): 5
Number of Instances: 100
Noise (from 0 to 1): 0.05
Maximum Radius/Half-Edge of the Hyperspheres/Hypercubes: 0.8
Minimum Radius/Half-Edge of the Hyperspheres/Hypercubes: ((q/10)+1)/q

[Package utiml version 0.1.7 Index]