| encodings {polmineR} | R Documentation |
Conversion between corpus and native encoding.
Description
Utility functions to convert the encoding of a character vector
between the native encoding and the encoding of the corpus.
Usage
as.utf8(x, from)
as.nativeEnc(x, from)
as.corpusEnc(x, from = encoding(), corpusEnc)
Arguments
x |
A |
from |
A |
corpusEnc |
A |
Details
The encoding of a corpus and the encoding of the terminal (the native
encoding) may differ, provoking strange or wrong results if no conversion is
carried out between the potentially differing encodings. The functions
as.nativeEnc() and as.corpusEnc are auxiliary functions to
assist the conversion. The functions as.nativeEnc and as.utf8
deliberately remove the explicit statement of the encoding, to avoid warnings
that may occur with character vector columns in a data.table object.