Text Analysis Utilities


Documentation for package ‘tau’ version 0.0-25

Help Pages

fixEncoding                        Adapt the (Declared) Encoding of a Character Vector
format.textcnt                     Term or Pattern Counting of Text Documents
is.ascii                           Adapt the (Declared) Encoding of a Character Vector
is.locale                          Adapt the (Declared) Encoding of a Character Vector
is.utf8                            Adapt the (Declared) Encoding of a Character Vector
readBytes                          Read Byte or Character Strings
readChars                          Read Byte or Character Strings
remove_stopwords                   Preprocessing of Text Documents
textcnt                            Term or Pattern Counting of Text Documents
tokenize                           Preprocessing of Text Documents
translate                          Adapt the (Declared) Encoding of a Character Vector
translate_Unicode_latin_ligatures  Translate Unicode Latin Ligatures