ImbR {UBL} | R Documentation |
Synthetic Regression Data Set
Description
Simulated data set for imbalanced domain on regression. The rare cases corresponden to the higher extreme values and are described by a circle with white noise. The normal cases have a normal distribution with the same center of the circunference with elliptical contours.
Usage
data(ImbR)
Format
The data set has 2 continuous features (X1
and X2
) and a continuous target variable (denoted as Tgt
). The rare examples, i.e, cases with higher values of the target variable occur in 5% of the data. Data set ImbR has 1000 examples.
ImbR data has been simulated as follows:
- -
lower
Tgt
values:(X1, X2)
and Tgt
- -
higher
Tgt
values:(X1, X2)
, where
and
Tgt
Author(s)
Paula Branco paobranco@gmail.com, Rita Ribeiro rpribeiro@dcc.fc.up.pt and Luis Torgo ltorgo@dcc.fc.up.pt
Examples
data(ImbR)
summary(ImbR)
boxplot(ImbR$Tgt)
[Package UBL version 0.0.9 Index]