fixEncoding | Adapt the (Declared) Encoding of a Character Vector
format.textcnt | Term or Pattern Counting of Text Documents
is.ascii | Adapt the (Declared) Encoding of a Character Vector
is.locale | Adapt the (Declared) Encoding of a Character Vector
is.utf8 | Adapt the (Declared) Encoding of a Character Vector
readBytes | Read Byte or Character Strings
readChars | Read Byte or Character Strings
remove_stopwords | Preprocessing of Text Documents
textcnt | Term or Pattern Counting of Text Documents
tokenize | Preprocessing of Text Documents
translate | Adapt the (Declared) Encoding of a Character Vector
translate_Unicode_latin_ligatures | Translate Unicode Latin Ligatures