nzv {PDtoolkit}R Documentation

Near-zero variance

Description

nzv procedure aims to identify risk factors with low variability (almost constants). Usually these risk factors are expertly investigated and decision is made if they should be excluded from further modeling process.
nzv output report includes the following metrics:

Usage

nzv(db, sc = c(NA, NaN, Inf, -Inf))

Arguments

db

Data frame of risk factors supplied for near-zero variance analysis.

sc

Numeric or character vector with special case elements. Default values are c(NA, NaN, Inf, -Inf).

Value

The command nzv returns the data frame with different matrices needed for identification of near-zero variables. For details see Description section.

Examples

suppressMessages(library(PDtoolkit))
data(loans)
#artificially add some special values
loans$"Account Balance"[1:10] <- NA
rf.s <- nzv(db = loans, sc = c(NA, NaN, Inf, -Inf))
rf.s

[Package PDtoolkit version 1.2.0 Index]