BT_Simulated_Data {BT}    R Documentation

Simulated Database.

Description

A simulated database used for examples and vignettes. The variables are related to a motor insurance pricing context.

Usage

BT_Simulated_Data

Format

A simulated data frame with 50,000 rows and 7 columns, describing simulated policyholders:

Gender

Gender of the policyholder, either male or female.

Age

Age, varying from 18 to 65 years old.

Split

Noise variable, not used to simulate the response variable. It makes it possible to assess how the algorithm handles such uninformative features.

Sport

Car type, either yes (sports car) or no.

ExpoR

Yearly exposure-to-risk, varying between 0 and 1.

Y

Yearly number of claims, simulated from a Poisson distribution.

Y_normalized

Yearly claim frequency, defined as the ratio Y / ExpoR.
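
Examples

The relationship between Y, ExpoR and Y_normalized can be illustrated with a minimal sketch. It uses a small hand-made data frame named toy, which is a hypothetical stand-in: its values are illustrative assumptions and are not rows of BT_Simulated_Data. The sketch also checks that the exposure-weighted mean of Y_normalized equals total claims divided by total exposure, which is why ExpoR is the natural weight when working with the normalized response.

```r
# Toy data only: these values are assumptions for illustration,
# not rows taken from BT_Simulated_Data.
toy <- data.frame(
  ExpoR = c(1.0, 0.5, 0.25, 0.8, 0.4),  # yearly exposure-to-risk, in (0, 1]
  Y     = c(0L, 1L, 0L, 2L, 0L)         # yearly number of claims
)

# Claim frequency: claims per unit of exposure.
toy$Y_normalized <- toy$Y / toy$ExpoR

# Portfolio-level frequency computed in two equivalent ways.
portfolio_freq <- sum(toy$Y) / sum(toy$ExpoR)
weighted_freq  <- weighted.mean(toy$Y_normalized, w = toy$ExpoR)

# The two quantities agree up to floating-point tolerance.
stopifnot(isTRUE(all.equal(portfolio_freq, weighted_freq)))
```

The same two lines computing portfolio_freq and weighted_freq should apply unchanged to any data frame that has columns Y, ExpoR and Y_normalized as described above.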


[Package BT version 0.4 Index]