| auto_cut {flashlight} | R Documentation |
Discretizes a Vector
Description
This function takes a vector x and returns a list with information about a
discretized version of x. The construction of level names can be controlled
by passing ... arguments to formatC().
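As a rough illustration of what the ... arguments do, the sketch below calls base R's formatC() directly on a few made-up break values. It only shows the formatting step. How auto_cut() assembles its final level names from such strings is not shown, and the variable names are hypothetical.

```r
# Illustration only: format hypothetical break values with base R's formatC().
breaks_large <- c(990, 1100)
breaks_small <- c(0.0001, 0.005)

# big.mark inserts a thousands separator; format = "fg" requests fixed
# notation in which digits are counted as significant digits.
labels_large <- formatC(breaks_large, big.mark = "'", format = "fg")
labels_small <- formatC(breaks_small, format = "fg")

print(labels_large)
print(labels_small)
```

Both results are character vectors, which is why such arguments affect only the labels and not the numeric breaks.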
Usage
auto_cut(
  x,
  breaks = NULL,
  n_bins = 27L,
  cut_type = c("equal", "quantile"),
  x_name = "value",
  level_name = "level",
  ...
)
Arguments
x |
A vector. |
breaks |
An optional vector of breaks. Only relevant for numeric x. |
n_bins |
If |
cut_type |
For the default type "equal", bins of equal width are created
by |
x_name |
Column name with the values of x. |
level_name |
Column name with the bin labels of x. |
... |
Further arguments passed to |
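To make the difference between the two cut_type options concrete, here is a minimal base R sketch of equal-width versus quantile breaks. It is not the flashlight implementation: the use of pretty() and quantile() is an assumption made for illustration, and the data and variable names are made up.

```r
# Base R sketch only; not the flashlight implementation.
x <- c(1, 2, 3, 4, 5, 6, 7, 8, 9, 100)
n_bins <- 3

# "equal"-style: round, evenly spaced breaks covering the range of x
# (assumption: pretty() stands in for whatever the package uses).
equal_breaks <- pretty(x, n = n_bins)

# "quantile"-style: breaks at evenly spaced probabilities, duplicates dropped.
probs <- seq(0, 1, length.out = n_bins + 1)
quantile_breaks <- unique(quantile(x, probs = probs, names = FALSE))

# Count observations per bin; include.lowest keeps the smallest value.
print(table(cut(x, breaks = equal_breaks, include.lowest = TRUE)))
print(table(cut(x, breaks = quantile_breaks, include.lowest = TRUE)))
```

With a skewed vector like this one, equal-width bins leave most observations in a single bin, while quantile bins spread them more evenly.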
Value
A list with the following elements:
- data: A data.frame with columns x_name and level_name, each with the same length as x. The column x_name has values in output bin_means, while the column level_name has values in bin_labels.
- breaks: A vector of increasing and unique breaks used to cut a numeric x with too many distinct levels. NULL otherwise.
- bin_means: The midpoints of subsequent breaks, or, if there are no breaks in the output, the factor levels or distinct values of x.
- bin_labels: Break labels of the form "(low, high]" if there are breaks in the output, otherwise the same as bin_means. Same order as bin_means.
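The relationship between breaks, bin_means, bin_labels, and data described above can be sketched in base R as follows. This is a hypothetical reconstruction from the description, not the package's code. The breaks are chosen by hand, and the exact label formatting produced by auto_cut() may differ.

```r
# Hypothetical illustration in base R; names mirror the documented output.
breaks <- c(0, 5, 10)
lower <- breaks[-length(breaks)]
upper <- breaks[-1]

# Midpoints of subsequent breaks
bin_means <- (lower + upper) / 2

# Labels of the form "(low, high]"
bin_labels <- paste0("(", lower, ", ", upper, "]")

# Integer bin index for each value of x, using right-closed intervals
x <- 1:10
bin <- cut(x, breaks = breaks, labels = FALSE)

# One row per element of x: the bin midpoint and the bin label
data <- data.frame(value = bin_means[bin], level = bin_labels[bin])
print(data)
```

Each row of data corresponds to one element of x, so the data frame has the same length as the input, as stated for the data element.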
Examples
auto_cut(1:10, n_bins = 3)
auto_cut(c(NA, 1:10), n_bins = 3)
auto_cut(1:10, breaks = 3:4, n_bins = 3)
auto_cut(1:10, n_bins = 3, cut_type = "quantile")
auto_cut(LETTERS[4:1], n_bins = 2)
auto_cut(factor(LETTERS[1:4], LETTERS[4:1]), n_bins = 2)
auto_cut(990:1100, n_bins = 3, big.mark = "'", format = "fg")
auto_cut(c(0.0001, 0.0002, 0.0003, 0.005), n_bins = 3, format = "fg")