noisy_data {ibawds} | R Documentation |
Noisy Data From a Tenth Order Polynomial
Description
Training and test data created from a tenth order polynomial with added noise. The polynomial is given by
The noise follows a standard normal distribution. The data can be used to demonstrate overfitting. It is inspired by section II.B of "A high-bias, low-variance introduction to Machine Learning for physicists" by P. Mehta et al.
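One possible way to use the data for an overfitting demonstration is to fit polynomials of increasing degree to the training data and compare the mean squared error on the training and test sets. The following is only a sketch: the helper mse_for_degree, the chosen degrees, and the column names x and y assigned inside the code are illustrative assumptions, as is the assumption that the first column of each tibble holds the independent variable.

```r
library(ibawds)

# Assumption: in each tibble the first column is the independent variable and
# the second the dependent one. They are renamed here so that the code does
# not rely on the actual column names.
train <- setNames(as.data.frame(noisy_data$train), c("x", "y"))
test  <- setNames(as.data.frame(noisy_data$test),  c("x", "y"))

# Hypothetical helper: fit a polynomial of the given degree to the training
# data and return the mean squared error on both data sets.
mse_for_degree <- function(degree) {
  fit <- lm(y ~ poly(x, degree), data = train)
  c(
    degree = degree,
    train  = mean((train$y - predict(fit, newdata = train))^2),
    test   = mean((test$y  - predict(fit, newdata = test))^2)
  )
}

# Degrees chosen for illustration only
degrees <- c(1, 3, 10, 15)
errors <- as.data.frame(t(sapply(degrees, mse_for_degree)))
print(errors)
```

A training error that keeps shrinking while the test error stops improving or grows would be the usual sign of overfitting. Whether and at which degree this shows up depends on the data.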
Usage
noisy_data
Format
A list of two tibbles with two columns each: one column holds the independent variable, the other the dependent variable. The training data (noisy_data$train) contains 1000 rows, the test data (noisy_data$test) 20 rows.
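The structure described above can be checked directly after loading the package. This is a minimal sketch that assumes ibawds is installed and uses only base R functions. The column names are queried rather than assumed.

```r
library(ibawds)

# noisy_data is a list; its elements are accessed with `$`
names(noisy_data)          # element names of the list
dim(noisy_data$train)      # rows and columns of the training data
dim(noisy_data$test)       # rows and columns of the test data
names(noisy_data$train)    # column names of the training tibble
```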
References
P. Mehta et al., A high-bias, low-variance introduction to Machine Learning for physicists, Phys. Rep. 810 (2019), 1-124. arXiv:1803.08823, doi:10.1016/j.physrep.2019.03.001
[Package ibawds version 0.6.0 Index]