postProcTxt {inpdfr} | R Documentation |
Prossess vectors containing words into a data.frame of word occurrences.
Description
Prossess vectors containing words into a data.frame of word occurrences.
Usage
postProcTxt(txt, minword = 1, maxword = 20, minFreqWord = 1)
Arguments
txt |
A vector containing text. |
minword |
An integer specifying the minimum number of letters per word into the returned data.frame. |
maxword |
An integer to specifying the maximum number of letters per word into the returned data.frame. |
minFreqWord |
An integer specifying the minimum word frequency into the returned data.frame. |
Value
A data.frame (freq = occurrences, stem = stem words, word = words), sorted by word occurrences.
[Package inpdfr version 0.1.12 Index]