preprocess_one_hot_encoding {mlpack}R Documentation

One Hot Encoding

Description

A utility to do one-hot encoding on features of dataset.

Usage

preprocess_one_hot_encoding(
  input,
  dimensions = NA,
  verbose = getOption("mlpack.verbose", FALSE)
)

Arguments

input

Matrix containing data (numeric matrix/data.frame with info).

dimensions

Index of dimensions that need to be one-hot encoded (if unspecified, all categorical dimensions are one-hot encoded) (integer vector).

verbose

Display informational messages and the full list of parameters and timers at the end of execution. Default value "getOption("mlpack.verbose", FALSE)" (logical).

Details

This utility takes a dataset and a vector of indices and does one-hot encoding of the respective features at those indices. Indices represent the IDs of the dimensions to be one-hot encoded.

If no dimensions are specified with "dimensions", then all categorical-type dimensions will be one-hot encoded. Otherwise, only the dimensions given in "dimensions" will be one-hot encoded.

The output matrix with encoded features may be saved with the "output" parameters.

Value

A list with several components:

output

Matrix to save one-hot encoded features data to (numeric matrix).

Author(s)

mlpack developers

Examples

# So, a simple example where we want to encode 1st and 3rd feature from
# dataset "X" into "X_output" would be

## Not run: 
output <- preprocess_one_hot_encoding(input=X, dimensions=1, dimensions=3)
X_ouput <- output$output

## End(Not run)

[Package mlpack version 4.4.0 Index]