madelon {MDFS} | R Documentation |
An artificial dataset called MADELON
Description
An artificial dataset containing data points grouped in 32 clusters placed on the vertices of a five dimensional hypercube and randomly labeled 0/1.
Usage
madelon
Format
A list of two elements:
- data
2000 by 500 matrix of 2000 objects with 500 features
- decision
vector of 2000 decisions (labels 0/1)
- IG.2D
example 2D IG computed using
ComputeMaxInfoGains
Details
The five dimensions constitute 5 informative features. 15 linear combinations of those features are added to form a set of 20 (redundant) informative features. There are 480 distractor features called 'probes' having no predictive power.
Included is the original training set with label -1 changed to 0.
Source
https://archive.ics.uci.edu/ml/datasets/Madelon
[Package MDFS version 1.5.3 Index]