BT_Simulated_Data {BT} | R Documentation |
A simulated database used for examples and vignettes. The variables are related to a motor insurance pricing context.
BT_Simulated_Data
A simulated data frame with 50,000 rows and 7 columns, containing simulation of different policyholders:
Gender, varying between male and female.
Age, varying from 18 to 65years old.
Noisy variable, not used to simulate the response variable. It allows to assess how the algorithm handle these features.
Car type, varying between yes (sport car) or no.
Yearly exposure-to-risk, varying between 0 and 1.
Yearly claim number, simulated thanks to Poisson distribution.
Yearly claim frequency, corresponding to the ratio between Y and ExpoR.