cpoFixFactors {mlrCPO} | R Documentation |
Clean Up Factorial Features
Description
This is a CPOConstructor
to be used to create a
CPO
. It is called like any R function and returns
the created CPO
.
Prevent common pitfalls when using factorial data, by making factorial data have the same levels in training and prediction, and by dropping factor levels that do not occur in training data.
Usage
cpoFixFactors(
drop.unused.levels = TRUE,
fix.factors.prediction = TRUE,
id,
export = "export.default",
affect.type = NULL,
affect.index = integer(0),
affect.names = character(0),
affect.pattern = NULL,
affect.invert = FALSE,
affect.pattern.ignore.case = FALSE,
affect.pattern.perl = FALSE,
affect.pattern.fixed = FALSE
)
Arguments
drop.unused.levels |
Factor levels of data that have no instances in the data are dropped. If
“fix.factors.prediction” is false, this can lead to training data having
different factor levels than prediction data. Default is |
fix.factors.prediction |
Factor levels are kept the same in training and prediction. This is
recommended. Default is |
id |
[ |
export |
[ |
affect.type |
[ |
affect.index |
[ |
affect.names |
[ |
affect.pattern |
[ |
affect.invert |
[ |
affect.pattern.ignore.case |
[ |
affect.pattern.perl |
[ |
affect.pattern.fixed |
[ |
Value
[CPO
].
General CPO info
This function creates a CPO object, which can be applied to
Task
s, data.frame
s, link{Learner}
s
and other CPO objects using the %>>%
operator.
The parameters of this object can be changed after creation
using the function setHyperPars
. The other
hyper-parameter manipulating functins, getHyperPars
and getParamSet
similarly work as one expects.
If the “id” parameter is given, the hyperparameters will have this id as aprefix; this will, however, not change the parameters of the creator function.
Calling a CPOConstructor
CPO constructor functions are called with optional values of parameters, and additional “special” optional values.
The special optional values are the id
parameter, and the affect.*
parameters. The affect.*
parameters
enable the user to control which subset of a given dataset is affected. If no affect.*
parameters are given, all
data features are affected by default.
See Also
Other CPOs:
cpoApplyFunRegrTarget()
,
cpoApplyFun()
,
cpoAsNumeric()
,
cpoCache()
,
cpoCbind()
,
cpoCollapseFact()
,
cpoDropConstants()
,
cpoDropMostlyConstants()
,
cpoDummyEncode()
,
cpoFilterAnova()
,
cpoFilterCarscore()
,
cpoFilterChiSquared()
,
cpoFilterFeatures()
,
cpoFilterGainRatio()
,
cpoFilterInformationGain()
,
cpoFilterKruskal()
,
cpoFilterLinearCorrelation()
,
cpoFilterMrmr()
,
cpoFilterOneR()
,
cpoFilterPermutationImportance()
,
cpoFilterRankCorrelation()
,
cpoFilterRelief()
,
cpoFilterRfCImportance()
,
cpoFilterRfImportance()
,
cpoFilterRfSRCImportance()
,
cpoFilterRfSRCMinDepth()
,
cpoFilterSymmetricalUncertainty()
,
cpoFilterUnivariate()
,
cpoFilterVariance()
,
cpoIca()
,
cpoImpactEncodeClassif()
,
cpoImpactEncodeRegr()
,
cpoImputeConstant()
,
cpoImputeHist()
,
cpoImputeLearner()
,
cpoImputeMax()
,
cpoImputeMean()
,
cpoImputeMedian()
,
cpoImputeMin()
,
cpoImputeMode()
,
cpoImputeNormal()
,
cpoImputeUniform()
,
cpoImpute()
,
cpoLogTrafoRegr()
,
cpoMakeCols()
,
cpoMissingIndicators()
,
cpoModelMatrix()
,
cpoOversample()
,
cpoPca()
,
cpoProbEncode()
,
cpoQuantileBinNumerics()
,
cpoRegrResiduals()
,
cpoResponseFromSE()
,
cpoSample()
,
cpoScaleMaxAbs()
,
cpoScaleRange()
,
cpoScale()
,
cpoSelect()
,
cpoSmote()
,
cpoSpatialSign()
,
cpoTransformParams()
,
cpoWrap()
,
makeCPOCase()
,
makeCPOMultiplex()