| grepNonStandardCharacters {Ecfun} | R Documentation |
grep for nonstandard characters
Description
Return the indices of elements of x
containing characters that are not in
standardCharacters.
Usage
grepNonStandardCharacters(x, value=FALSE,
standardCharacters=c(letters, LETTERS, ' ',
'.', ',', 0:9, '\"', "\'", '-', '_', '(',
')', '[', ']', '\n'),
... )
Arguments
x |
character vector in which it is desired to
identify elements containing characters not
in |
value |
logical:
|
standardCharacters |
Characters to overlook in |
... |
optional arguments for
|
Details
1. x. <- strsplit(x, ''): convert
the input character vector to a list of
vectors of character vectors with
nchar(x.[i]) == 1 for i in
1:length(x).
2. sapply(x., ...) to identify all
elements for which any element of x[[i]] is
not in standardCharacters.
Value
an integer vector identifying all elements
of x containing a character not in
standardCharacters.
Author(s)
Spencer Graves
See Also
stringi-package
grep,
regexpr,
subNonStandardCharacters,
showNonASCII
Examples
Names <- c('Raul', 'Ra`l', 'Torres,Raul', 'Torres, Raul')
# confusion in character sets can create
# names like Names[2]
chk <- grepNonStandardCharacters(Names)
all.equal(chk, 2)
chkv <- grepNonStandardCharacters(Names, TRUE)
all.equal(chkv, Names[2])