stri_enc_toascii {stringi} | R Documentation |
Convert To ASCII
Description
This function converts input strings to ASCII, i.e., to character strings consisting of bytes not greater than 127.
Usage
stri_enc_toascii(str)
Arguments
str |
a character vector to be converted |
Details
All code points greater than 127 are replaced with the ASCII SUBSTITUTE
CHARACTER (0x1A).
R encoding declarations are always used to determine
which encoding is assumed for each input, see stri_enc_mark
.
If ill-formed byte sequences are found in UTF-8 byte
streams, a warning is generated.
A bytes
-marked string is assumed to be in an 8-bit encoding
extending the ASCII map (a common assumption in R itself).
Note that the SUBSTITUTE CHARACTER (\x1a == \032
) may be interpreted
as the ASCII missing value for single characters.
Value
Returns a character vector.
Author(s)
Marek Gagolewski and other contributors
See Also
The official online manual of stringi at https://stringi.gagolewski.com/
Gagolewski M., stringi: Fast and portable character string processing in R, Journal of Statistical Software 103(2), 2022, 1-59, doi:10.18637/jss.v103.i02
Other encoding_conversion:
about_encoding
,
stri_enc_fromutf32()
,
stri_enc_tonative()
,
stri_enc_toutf32()
,
stri_enc_toutf8()
,
stri_encode()