Preprocessing and Feature Engineering Steps for Modeling


[Up] [Top]

Documentation for package ‘recipes’ version 1.0.10

Help Pages

A B C D F G H I J M N P R S T U V misc

-- A --

add_check Add a New Operation to the Current Recipe
add_role Manually alter roles
add_step Add a New Operation to the Current Recipe
all_date Role Selection
all_datetime Role Selection
all_datetime_predictors Role Selection
all_date_predictors Role Selection
all_double Role Selection
all_double_predictors Role Selection
all_factor Role Selection
all_factor_predictors Role Selection
all_integer Role Selection
all_integer_predictors Role Selection
all_logical Role Selection
all_logical_predictors Role Selection
all_nominal Role Selection
all_nominal_predictors Role Selection
all_numeric Role Selection
all_numeric_predictors Role Selection
all_ordered Role Selection
all_ordered_predictors Role Selection
all_outcomes Role Selection
all_predictors Role Selection
all_string Role Selection
all_string_predictors Role Selection
all_unordered Role Selection
all_unordered_predictors Role Selection
are_weights_used Helpers for steps with case weights
averages Helpers for steps with case weights

-- B --

bake Apply a trained preprocessing recipe
bake.recipe Apply a trained preprocessing recipe

-- C --

case-weight-helpers Helpers for steps with case weights
case_weights Using case weights with recipes
check_class Check variable class
check_cols Check if all columns are present
check_missing Check for missing values
check_new_values Check for new values
check_range Check range consistency
correlations Helpers for steps with case weights
covariances Helpers for steps with case weights
current_info Role Selection

-- D --

denom_vars Ratio variable creation
detect_step Detect if a particular step or check is used in a recipe
developer_functions Developer functions for creating recipes steps
discretize Discretize Numeric Variables
discretize.default Discretize Numeric Variables
discretize.numeric Discretize Numeric Variables
dummy_extract_names Naming Tools
dummy_names Naming Tools

-- F --

formula.recipe Create a formula from a prepared recipe
fully_trained Check to see if a recipe is trained/prepared

-- G --

get_case_weights Helpers for steps with case weights

-- H --

has_role Role Selection
has_type Role Selection

-- I --

imp_vars Impute via bagged trees

-- J --

juice Extract transformed training set

-- M --

medians Helpers for steps with case weights

-- N --

names0 Naming Tools

-- P --

pca_wts Helpers for steps with case weights
predict.discretize Discretize Numeric Variables
prep Estimate a preprocessing recipe
prep.recipe Estimate a preprocessing recipe
prepper Wrapper function for preparing recipes within resampling
print.recipe Print a Recipe

-- R --

recipe Create a recipe for preprocessing data
recipe.data.frame Create a recipe for preprocessing data
recipe.default Create a recipe for preprocessing data
recipe.formula Create a recipe for preprocessing data
recipe.matrix Create a recipe for preprocessing data
recipes_eval_select Evaluate a selection with tidyselect semantics specific to recipes
recipes_extension_check Checks that steps have all S3 methods
remove_role Manually alter roles
roles Manually alter roles

-- S --

selection Methods for selecting variables in step functions
selections Methods for selecting variables in step functions
step_arrange Sort rows using dplyr
step_bagimpute Impute via bagged trees
step_bin2factor Create a factors from A dummy variable
step_BoxCox Box-Cox transformation for non-negative data
step_bs B-spline basis functions
step_center Centering numeric data
step_classdist Distances to class centroids
step_classdist_shrunken Compute shrunken centroid distances for classification models
step_corr High correlation filter
step_count Create counts of patterns using regular expressions
step_cut Cut a numeric variable into a factor
step_date Date feature generator
step_depth Data depths
step_discretize Discretize Numeric Variables
step_dummy Create traditional dummy variables
step_dummy_extract Extract patterns from nominal data
step_dummy_multi_choice Handle levels in multiple predictors together
step_factor2string Convert factors to strings
step_filter Filter rows using dplyr
step_filter_missing Missing value column filter
step_geodist Distance between two locations
step_harmonic Add sin and cos terms for harmonic analysis
step_holiday Holiday feature generator
step_hyperbolic Hyperbolic transformations
step_ica ICA signal extraction
step_impute_bag Impute via bagged trees
step_impute_knn Impute via k-nearest neighbors
step_impute_linear Impute numeric variables via a linear model
step_impute_lower Impute numeric data below the threshold of measurement
step_impute_mean Impute numeric data using the mean
step_impute_median Impute numeric data using the median
step_impute_mode Impute nominal data using the most common value
step_impute_roll Impute numeric data using a rolling window statistic
step_indicate_na Create missing data column indicators
step_integer Convert values to predefined integers
step_interact Create interaction variables
step_intercept Add intercept (or constant) column
step_inverse Inverse transformation
step_invlogit Inverse logit transformation
step_isomap Isomap embedding
step_knnimpute Impute via k-nearest neighbors
step_kpca Kernel PCA signal extraction
step_kpca_poly Polynomial kernel PCA signal extraction
step_kpca_rbf Radial basis function kernel PCA signal extraction
step_lag Create a lagged predictor
step_lincomb Linear combination filter
step_log Logarithmic transformation
step_logit Logit transformation
step_lowerimpute Impute numeric data below the threshold of measurement
step_meanimpute Impute numeric data using the mean
step_medianimpute Impute numeric data using the median
step_modeimpute Impute nominal data using the most common value
step_mutate Add new variables using dplyr
step_mutate_at Mutate multiple columns using dplyr
step_naomit Remove observations with missing values
step_nnmf Non-negative matrix factorization signal extraction
step_nnmf_sparse Non-negative matrix factorization signal extraction with lasso penalization
step_normalize Center and scale numeric data
step_novel Simple value assignments for novel factor levels
step_ns Natural spline basis functions
step_num2factor Convert numbers to factors
step_nzv Near-zero variance filter
step_ordinalscore Convert ordinal factors to numeric scores
step_other Collapse infrequent categorical levels
step_pca PCA signal extraction
step_percentile Percentile transformation
step_pls Partial least squares feature extraction
step_poly Orthogonal polynomial basis functions
step_poly_bernstein Generalized bernstein polynomial basis
step_profile Create a profiling version of a data set
step_range Scaling numeric data to a specific range
step_ratio Ratio variable creation
step_regex Detect a regular expression
step_relevel Relevel factors to a desired level
step_relu Apply (smoothed) rectified linear transformation
step_rename Rename variables by name using dplyr
step_rename_at Rename multiple columns using dplyr
step_rm General variable filter
step_rollimpute Impute numeric data using a rolling window statistic
step_sample Sample rows using dplyr
step_scale Scaling mumeric data
step_select Select variables using dplyr
step_shuffle Shuffle variables
step_slice Filter rows by position using dplyr
step_spatialsign Spatial sign preprocessing
step_spline_b Basis splines
step_spline_convex Convex splines
step_spline_monotone Monotone splines
step_spline_natural Natural splines
step_spline_nonnegative Non-negative splines
step_sqrt Square root transformation
step_string2factor Convert strings to factors
step_time Time feature generator
step_unknown Assign missing categories to "unknown"
step_unorder Convert ordered factors to unordered factors
step_window Moving window functions
step_YeoJohnson Yeo-Johnson transformation
step_zv Zero variance filter
summary.recipe Summarize a recipe

-- T --

tidy.check Tidy the result of a recipe
tidy.check_class Tidy the result of a recipe
tidy.check_cols Tidy the result of a recipe
tidy.check_missing Tidy the result of a recipe
tidy.check_new_values Tidy the result of a recipe
tidy.check_range Tidy the result of a recipe
tidy.recipe Tidy the result of a recipe
tidy.step Tidy the result of a recipe
tidy.step_arrange Tidy the result of a recipe
tidy.step_bin2factor Tidy the result of a recipe
tidy.step_BoxCox Tidy the result of a recipe
tidy.step_bs Tidy the result of a recipe
tidy.step_center Tidy the result of a recipe
tidy.step_classdist Tidy the result of a recipe
tidy.step_classdist_shrunken Tidy the result of a recipe
tidy.step_corr Tidy the result of a recipe
tidy.step_count Tidy the result of a recipe
tidy.step_cut Tidy the result of a recipe
tidy.step_date Tidy the result of a recipe
tidy.step_depth Tidy the result of a recipe
tidy.step_discretize Tidy the result of a recipe
tidy.step_dummy Tidy the result of a recipe
tidy.step_dummy_extract Tidy the result of a recipe
tidy.step_dummy_multi_choice Tidy the result of a recipe
tidy.step_factor2string Tidy the result of a recipe
tidy.step_filter Tidy the result of a recipe
tidy.step_filter_missing Tidy the result of a recipe
tidy.step_geodist Tidy the result of a recipe
tidy.step_harmonic Tidy the result of a recipe
tidy.step_holiday Tidy the result of a recipe
tidy.step_hyperbolic Tidy the result of a recipe
tidy.step_ica Tidy the result of a recipe
tidy.step_impute_bag Tidy the result of a recipe
tidy.step_impute_knn Tidy the result of a recipe
tidy.step_impute_linear Tidy the result of a recipe
tidy.step_impute_lower Tidy the result of a recipe
tidy.step_impute_mean Tidy the result of a recipe
tidy.step_impute_median Tidy the result of a recipe
tidy.step_impute_mode Tidy the result of a recipe
tidy.step_impute_roll Tidy the result of a recipe
tidy.step_indicate_na Tidy the result of a recipe
tidy.step_integer Tidy the result of a recipe
tidy.step_interact Tidy the result of a recipe
tidy.step_intercept Tidy the result of a recipe
tidy.step_inverse Tidy the result of a recipe
tidy.step_invlogit Tidy the result of a recipe
tidy.step_isomap Tidy the result of a recipe
tidy.step_kpca Tidy the result of a recipe
tidy.step_kpca_poly Tidy the result of a recipe
tidy.step_kpca_rbf Tidy the result of a recipe
tidy.step_lag Tidy the result of a recipe
tidy.step_lincomb Tidy the result of a recipe
tidy.step_log Tidy the result of a recipe
tidy.step_logit Tidy the result of a recipe
tidy.step_mutate Tidy the result of a recipe
tidy.step_mutate_at Tidy the result of a recipe
tidy.step_naomit Tidy the result of a recipe
tidy.step_nnmf Tidy the result of a recipe
tidy.step_nnmf_sparse Tidy the result of a recipe
tidy.step_normalize Tidy the result of a recipe
tidy.step_novel Tidy the result of a recipe
tidy.step_ns Tidy the result of a recipe
tidy.step_num2factor Tidy the result of a recipe
tidy.step_nzv Tidy the result of a recipe
tidy.step_ordinalscore Tidy the result of a recipe
tidy.step_other Tidy the result of a recipe
tidy.step_pca Tidy the result of a recipe
tidy.step_percentile Tidy the result of a recipe
tidy.step_pls Tidy the result of a recipe
tidy.step_poly Tidy the result of a recipe
tidy.step_poly_bernstein Tidy the result of a recipe
tidy.step_profile Tidy the result of a recipe
tidy.step_range Tidy the result of a recipe
tidy.step_ratio Tidy the result of a recipe
tidy.step_regex Tidy the result of a recipe
tidy.step_relevel Tidy the result of a recipe
tidy.step_relu Tidy the result of a recipe
tidy.step_rename Tidy the result of a recipe
tidy.step_rename_at Tidy the result of a recipe
tidy.step_rm Tidy the result of a recipe
tidy.step_sample Tidy the result of a recipe
tidy.step_scale Tidy the result of a recipe
tidy.step_select Tidy the result of a recipe
tidy.step_shuffle Tidy the result of a recipe
tidy.step_slice Tidy the result of a recipe
tidy.step_spatialsign Tidy the result of a recipe
tidy.step_spline_b Tidy the result of a recipe
tidy.step_spline_convex Tidy the result of a recipe
tidy.step_spline_monotone Tidy the result of a recipe
tidy.step_spline_natural Tidy the result of a recipe
tidy.step_spline_nonnegative Tidy the result of a recipe
tidy.step_sqrt Tidy the result of a recipe
tidy.step_string2factor Tidy the result of a recipe
tidy.step_time Tidy the result of a recipe
tidy.step_unknown Tidy the result of a recipe
tidy.step_unorder Tidy the result of a recipe
tidy.step_window Tidy the result of a recipe
tidy.step_YeoJohnson Tidy the result of a recipe
tidy.step_zv Tidy the result of a recipe

-- U --

update.step Update a recipe step
update_role Manually alter roles
update_role_requirements Update role specific requirements

-- V --

variances Helpers for steps with case weights

-- misc --

.get_data_types Get types for use in recipes
.get_data_types.character Get types for use in recipes
.get_data_types.Date Get types for use in recipes
.get_data_types.default Get types for use in recipes
.get_data_types.double Get types for use in recipes
.get_data_types.factor Get types for use in recipes
.get_data_types.hardhat_case_weights Get types for use in recipes
.get_data_types.integer Get types for use in recipes
.get_data_types.list Get types for use in recipes
.get_data_types.logical Get types for use in recipes
.get_data_types.numeric Get types for use in recipes
.get_data_types.ordered Get types for use in recipes
.get_data_types.POSIXct Get types for use in recipes
.get_data_types.Surv Get types for use in recipes
.get_data_types.textrecipes_tokenlist Get types for use in recipes