grepNonStandardCharacters {Ecfun} | R Documentation |
grep for nonstandard characters
Description
Return the indices of elements of x containing characters that are not in standardCharacters.
Usage
grepNonStandardCharacters(x, value=FALSE,
standardCharacters=c(letters, LETTERS, ' ',
'.', ',', 0:9, '\"', "\'", '-', '_', '(',
')', '[', ']', '\n'),
... )
Arguments
x |
character vector in which to identify elements containing characters not in standardCharacters.
value |
logical: if TRUE, return the matching elements of x themselves; if FALSE (the default), return their indices.
standardCharacters |
Characters to overlook in x.
... |
optional arguments for sapply (see Details).
Details
1. x. <- strsplit(x, ''): convert the input character vector to a list of character vectors, one per element of x, in which every entry is a single character (nchar == 1).
2. sapply(x., ...): identify all elements i for which any character in x.[[i]] is not in standardCharacters.
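The two steps above can be sketched in base R roughly as follows. This is a minimal illustration, not the Ecfun source: the function name sketchGrepNonStandard is hypothetical, the default character set is copied from Usage, and vapply stands in for sapply so that the result type is fixed.

```r
# Hypothetical re-implementation for illustration only; not the Ecfun source.
sketchGrepNonStandard <- function(x, value = FALSE,
    standardCharacters = c(letters, LETTERS, ' ', '.', ',', 0:9,
                           '"', "'", '-', '_', '(', ')', '[', ']', '\n')) {
  # Step 1: split each string into a vector of single characters
  x. <- strsplit(x, '')
  # Step 2: flag elements with at least one character outside the set
  bad <- vapply(x., function(chars) any(!(chars %in% standardCharacters)),
                logical(1))
  idx <- which(bad)
  # Return the elements themselves or their indices, depending on value
  if (value) x[idx] else idx
}

Names <- c('Raul', 'Ra`l', 'Torres,Raul', 'Torres, Raul')
sketchGrepNonStandard(Names)
sketchGrepNonStandard(Names, value = TRUE)
```

Splitting into single characters and testing membership with %in% avoids having to escape characters such as '[', ']' and '-' inside a regular-expression character class.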
Value
If value is FALSE (the default), an integer vector identifying all elements of x containing a character not in standardCharacters. If value is TRUE, a character vector of those elements.
Author(s)
Spencer Graves
See Also
stringi-package, grep, regexpr, subNonStandardCharacters, showNonASCII
Examples
Names <- c('Raul', 'Ra`l', 'Torres,Raul', 'Torres, Raul')
# confusion in character sets can create
# names like Names[2]
chk <- grepNonStandardCharacters(Names)
all.equal(chk, 2)
chkv <- grepNonStandardCharacters(Names, TRUE)
all.equal(chkv, Names[2])