toyml {utiml} | R Documentation |
Toy multi-label dataset.
Description
A toy multi-label dataset is a synthetic dataset generated by the tool http://sites.labic.icmc.usp.br/mldatagen/ using the Hyperspheres strategy. Its purpose is to be used for small tests and examples.
Usage
toyml
Format
A mldr object with 100 instances, 10 features and 5 labels:
- att1
Relevant numeric attribute between (-1 and 1)
- att2
Relevant numeric attribute between (-1 and 1)
- att3
Relevant numeric attribute between (-1 and 1)
- att4
Relevant numeric attribute between (-1 and 1)
- att5
Relevant numeric attribute between (-1 and 1)
- att6
Relevant numeric attribute between (-1 and 1)
- att7
Relevant numeric attribute between (-1 and 1)
- iatt8
Irrelevant numeric attribute between (-1 and 1)
- iatt9
Irrelevant numeric attribute between (-1 and 1)
- ratt10
Redundant numeric attribute between (-1 and 1)
- y1
Label 'y1' - Frequency: 0.17
- y2
Label 'y2' - Frequency: 0.78
- y3
Label 'y3' - Frequency: 0.19
- y4
Label 'y4' - Frequency: 0.69
- y5
Label 'y5' - Frequency: 0.17
Details
General Information
Cardinality: 2
Density: 0.4
Distinct multi-labels: 18
Number of single labelsets: 5
Max frequency: 23
Source
Generated by http://sites.labic.icmc.usp.br/mldatagen/ Configuration:
Strategy: Hyperspheres
Relevant Features: 7
Irrelevant Features: 2
Redundant Features: 1
Number of Labels (q): 5
Number of Instances: 100
Noise (from 0 to 1): 0.05
Maximum Radius/Half-Edge of the Hyperspheres/Hypercubes: 0.8
Minimum Radius/Half-Edge of the Hyperspheres/Hypercubes: ((q/10)+1)/q