auto_cut {flashlight} | R Documentation |
Discretizes a Vector
Description
This function takes a vector x
and returns a list with information on
disretized version of x
. The construction of level names can be controlled
by passing ...
arguments to formatC()
.
Usage
auto_cut(
x,
breaks = NULL,
n_bins = 27L,
cut_type = c("equal", "quantile"),
x_name = "value",
level_name = "level",
...
)
Arguments
x |
A vector. |
breaks |
An optional vector of breaks. Only relevant for numeric |
n_bins |
If |
cut_type |
For the default type "equal", bins of equal width are created
by |
x_name |
Column name with the values of |
level_name |
Column name with the bin labels of |
... |
Further arguments passed to |
Value
A list with the following elements:
-
data
: Adata.frame
with columsx_name
andlevel_name
each with the same length asx
. The columnx_name
has values in outputbin_means
while the columnlevel_name
has values inbin_labels
. -
breaks
: A vector of increasing and unique breaks used to cut a numericx
with too many distinct levels.NULL
otherwise. -
bin_means
: The midpoints of subsequent breaks, or if there are nobreaks
in the output, factor levels or distinct values ofx
. -
bin_labels
: Break labels of the form "(low, high]" if there arebreaks
in the output, otherwise the same asbin_means
. Same order asbin_means
.
Examples
auto_cut(1:10, n_bins = 3)
auto_cut(c(NA, 1:10), n_bins = 3)
auto_cut(1:10, breaks = 3:4, n_bins = 3)
auto_cut(1:10, n_bins = 3, cut_type = "quantile")
auto_cut(LETTERS[4:1], n_bins = 2)
auto_cut(factor(LETTERS[1:4], LETTERS[4:1]), n_bins = 2)
auto_cut(990:1100, n_bins = 3, big.mark = "'", format = "fg")
auto_cut(c(0.0001, 0.0002, 0.0003, 0.005), n_bins = 3, format = "fg")