encodings {polmineR} | R Documentation |
Conversion between corpus and native encoding.
Description
Utility functions to convert the encoding of a character
vector
between the native encoding and the encoding of the corpus.
Usage
as.utf8(x, from)
as.nativeEnc(x, from)
as.corpusEnc(x, from = encoding(), corpusEnc)
Arguments
x |
A |
from |
A |
corpusEnc |
A |
Details
The encoding of a corpus and the encoding of the terminal (the native
encoding) may differ, provoking strange or wrong results if no conversion is
carried out between the potentially differing encodings. The functions
as.nativeEnc()
and as.corpusEnc
are auxiliary functions to
assist the conversion. The functions as.nativeEnc
and as.utf8
deliberately remove the explicit statement of the encoding, to avoid warnings
that may occur with character vector columns in a data.table
object.