R: BTFit

BTFit {BT}

R Documentation

BTFit

Description

These are objects representing fitted boosting trees.

Details

Boosting Tree Model Object.

Value

`BTInit`	an object of class `BTInit` containing the initial fitted value `initFit`, the initial `training.error` and the initial `validation.error` if any.
`BTErrors`	an object of class `BTErrors` containing the vectors of errors for each iteration performed (excl. the initialization). More precisely, it contains the `training.error`, `validation.error` if `train.fraction`<1 and the `oob.improvement` if `bag.fraction` < 1. Moreover, if a cross-validation approach was performed, a vector of cross-validation errors `cv.error` as a function of boosting iteration is also stored in this object.
`BTIndivFits`	an object of class `BTIndivFits` containing the list of each individual tree fitted at each boosting iteration.
`distribution`	the Tweedie power (and so the distribution) that has been used to perform the algorithm. It will currently always output 1.
`var.names`	a vector containing the names of the explanatory variables.
`response`	the name of the target/response variable.
`w`	a vector containing the weights used.
`seed`	the used seed, if any.
`BTData`	if `keep.data=TRUE`, an object of class `BTData` containing the `training.set` and `validation.set` (can be NULL if not used). These data frames are reduced to the used variables, that are the response and explanatory variables. Note that in case of cross-validation, even if `keep.data=TRUE` the folds will not be kept. In fact, only the data frames related to the original fit (i.e. on the whole training set) will be saved.
`BTParams`	an object of class `BTParams` containing all the (Adaptive) boosting tree parameters. More precisely, it contains the `ABT`, `train.fraction`, `shrinkage`, `interaction.depth`, `bag.fraction`, `n.iter`, `colsample.bytree` and `tree.control` parameter values.
`keep.data`	the `keep.data` parameter value.
`is.verbose`	the `is.verbose` parameter value.
`fitted.values`	the training set fitted values on the score scale using all the `n.iter` (and initialization) iterations.
`cv.folds`	the number of cross-validation folds. Set to 1 if no cross-validation performed.
`call`	the original call to the `BT` algorithm.
`Terms`	the `model.frame` terms argument.
`folds`	a vector of values identifying to which fold each observation is in. This argument is not present if there is no cross-validation. On the other hand, it corresponds to `folds.id` if it was initially defined by the user.
`cv.fitted`	a vector containing the cross-validation fitted values, if a cross-validation was performed. More precisely, for a given observation, the prediction will be furnished by the cv-model for which this specific observation was out-of-fold. See `predict.BTCVFit` for more details.

Structure

The following components must be included in a legitimate BTFit object.

Author(s)

Gireg Willame gireg.willame@gmail.com

This package is inspired by the gbm3 package. For more details, see https://github.com/gbm-developers/gbm3/.

References

M. Denuit, D. Hainaut and J. Trufin (2019). Effective Statistical Learning Methods for Actuaries |: GLMs and Extensions, Springer Actuarial.

M. Denuit, D. Hainaut and J. Trufin (2019). Effective Statistical Learning Methods for Actuaries ||: Tree-Based Methods and Extensions, Springer Actuarial.

M. Denuit, D. Hainaut and J. Trufin (2019). Effective Statistical Learning Methods for Actuaries |||: Neural Networks and Extensions, Springer Actuarial.

M. Denuit, D. Hainaut and J. Trufin (2022). Response versus gradient boosting trees, GLMs and neural networks under Tweedie loss and log-link. Accepted for publication in Scandinavian Actuarial Journal.

M. Denuit, J. Huyghe and J. Trufin (2022). Boosting cost-complexity pruned trees on Tweedie responses: The ABT machine for insurance ratemaking. Paper submitted for publication.

M. Denuit, J. Trufin and T. Verdebout (2022). Boosting on the responses with Tweedie loss functions. Paper submitted for publication.