identifyLoners {dataReporter} | R Documentation |
A checkFunction for identifying sparsely represented values (loners)
Description
A checkFunction
to be called from check
that identifies values that
only occur less than 6 times in factor, (haven_)labelled, or character variables (that is, loners).
Usage
identifyLoners(v, nMax = 10)
Arguments
v |
A character, (haven_)labelled, or factor variable to check. |
nMax |
The maximum number of problematic values to report.
Default is |
Details
For character, (haven_)labelled, and factor variables, identify values that only have a very low number of observations, as these categories might be problematic when conducting an analysis. Unused factor levels are not considered "loners". "Loners" are defined as values with 5 or less observations, reflecting the commonly use rule of thumb for performing chi squared tests.
Value
A checkResult
with three entires:
$problem
(a logical indicating whether case issues where found),
$message
(a message describing which values in v
were loners) and
$problemValues
(the problematic values in their original format).
Note that Only unique problematic values
are listed and they are presented in alphabetical order.
See Also
check
, allCheckFunctions
,
checkFunction
, checkResult
Examples
identifyLoners(c(rep(c("a", "b", "c"), 10), "d", "d"))