| woeBinning {woeBinning} | R Documentation |
Package for Supervised Weight of Evidence Binning
Description
This package generates, visualizes, tabulates and deploys a supervised weight of evidence (WOE) binning of variables.
Details
The package woeBinning automates the process of binning of numeric
variables and factors with respect to a dichotomous target variable.
Additionally, it visualizes the realized binning solution, tabulates it and
deploys it to (new) data. All functions can be used with single variables
or an entire data frame.
Binning Functions
-
woe.binninggenerates a supervised fine and coarse classing of numeric variables and factors. -
woe.tree.binninggenerates a supervised tree-like segmentation of numeric variables and factors. -
woe.binning.plotvisualizes the binning solution generated and saved viawoe.binningorwoe.tree.binning. -
woe.binning.tabletabulates the binning solution generated and saved viawoe.binningorwoe.tree.binning. -
woe.binning.deploydeploys the binning solution generated and saved viawoe.binningorwoe.tree.binningto (new) data.
References
Siddiqi, N. 2006: Credit Risk Scorecards: Developing and Implementing Intelligent Credit Scoring. Hoboken, New Jersey: John Wiley & Sons.
Anderson, R. 2007: The Credit Scoring Toolkit: Theory and Practice for Retail Credit Risk Management and Decision Automation. Oxford / New York: Oxford University Press.
Examples
# Load German credit data and create subset
data(germancredit)
df <- germancredit[, c('creditability', 'credit.amount', 'duration.in.month',
'savings.account.and.bonds', 'purpose')]
# Bin all variables of the data frame (apart from the target variable)
# with default parameter settings
binning <- woe.binning(df, 'creditability', df)
# Plot the binned variables
woe.binning.plot(binning)
# Tabulate the binned variables
tabulate.binning <- woe.binning.table(binning)
tabulate.binning
# Deploy the binning solution to the data frame
# (i.e. add binned variables and corresponding WOE variables)
df.with.binned.vars.added <- woe.binning.deploy(df, binning,
add.woe.or.dum.var='woe')