fast_handle_na {dataPreparation}R Documentation

Handle NA values

Description

Handle NAs values depending on the class of the column.

Usage

fast_handle_na(
  data_set,
  set_num = 0,
  set_logical = FALSE,
  set_char = "",
  verbose = TRUE
)

Arguments

data_set

Matrix, data.frame or data.table

set_num

NAs replacement for numeric column, (numeric or function, default to 0)

set_logical

NAs replacement for logical column, (logical or function, default to FALSE)

set_char

NAs replacement for character column, (character or function, default to "")

verbose

Should the algorithm talk (logical, default to TRUE)

Details

To preserve RAM this function edits data_set by reference. To keep object unchanged, please use copy.
If you provide a function, it will be applied to the full column. So this function should handle NAs.
For factor columns, it will add NA to list of values.

Value

data_set as a data.table with NAs replaced.

Examples

# Build a useful data_set set for example
require(data.table)
data_set <- data.table(numCol = c(1, 2, 3, NA),
                   charCol = c("", "a", NA, "c"),
                   booleanCol = c(TRUE, NA, FALSE, NA))

# To set NAs to 0, FALSE and "" (respectively for numeric, logical, character)
fast_handle_na(copy(data_set))

# In a numeric column to set NAs as "missing"
fast_handle_na(copy(data_set), set_char = "missing")

# In a numeric column, to set NAs to the minimum value of the column#'
fast_handle_na(copy(data_set), set_num = min) # Won't work because min(c(1, NA)) = NA so put back NA
fast_handle_na(copy(data_set), set_num = function(x)min(x,na.rm = TRUE)) # Now we handle NAs

# In a numeric column, to set NAs to the share of NAs values
rateNA <- function(x) {
  sum(is.na(x)) / length(x)
}
fast_handle_na(copy(data_set), set_num = rateNA)


[Package dataPreparation version 1.1.1 Index]