rslp_doc {rslp} | R Documentation
RSLP Document
Description
Apply the Stemming Algorithm for the Portuguese Language to a vector of documents. It extracts words using the regex "\b[:alpha:]\b".
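The word-extraction step described above can be sketched in base R. This is only an illustration: `extract_words` is a hypothetical helper that is not part of rslp, and the POSIX pattern "[[:alpha:]]+" is an assumed base-R stand-in for the documented regex, so the package's actual matching may differ.

```r
# Hypothetical helper (not part of rslp): pull the alphabetic words out of
# each document, returning one character vector per document.
extract_words <- function(docs) {
  # "[[:alpha:]]+" matches runs of letters; it is an assumed approximation
  # of the pattern quoted in the Description.
  regmatches(docs, gregexpr("[[:alpha:]]+", docs))
}

docs <- c("coma frutas pois elas fazem bem para.")
words <- extract_words(docs)
words[[1]]
```

The trailing period is dropped because it is not an alphabetic character, so only the seven words remain as candidates for stemming.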
Usage
rslp_doc(
  docs,
  steprules = readRDS(system.file("steprules.rds", package = "rslp"))
)
Arguments
docs | Character vector of documents.
steprules | Step rules, as obtained from the function extract_rules. Only set this argument if you are certain about it. The default is the parsed version of the rules installed with the package.
References
V. Orengo, C. Huyck, "A Stemming Algorithm for the Portuguese Language", String Processing and Information Retrieval, International Symposium on (SPIRE), 2001, pp. 0186, doi:10.1109/SPIRE.2001.10024
Examples
docs <- c("coma frutas pois elas fazem bem para.")
rslp_doc(docs)
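A further sketch, assuming the rslp package is installed: the rules can be loaded once and passed explicitly through `steprules`. The `readRDS(system.file(...))` expression is the default shown under Usage, so this call should be equivalent to `rslp_doc(docs)`. The variable name `stemmed` is only illustrative.

```r
# Assumes the rslp package is installed.
library(rslp)

# Load the parsed rules shipped with the package; this is the same
# expression used as the default value of `steprules`.
steprules <- readRDS(system.file("steprules.rds", package = "rslp"))

docs <- c("coma frutas pois elas fazem bem para.")

# Pass the rules explicitly instead of relying on the default.
stemmed <- rslp_doc(docs, steprules = steprules)
stemmed
```

Loading the rules once and reusing the object may be convenient when `rslp_doc` is called many times, since each call with the default reads the file again.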
[Package rslp version 0.2.0 Index]