fst_rm_stop_punct {finnsurveytext} | R Documentation |
Remove Finnish stopwords and punctuation from CoNLL-U dataframe
Description
Removes stopwords and punctuation from a dataframe containing Finnish survey text data which is already in CoNLL-U format.
Usage
fst_rm_stop_punct(data, stopword_list = "nltk")
Arguments
data |
A dataframe of Finnish text in CoNLL-U format. |
stopword_list |
A valid Finnish stopword list, default is '"nltk"'. |
Value
A dataframe of Finnish text in CoNLL-U format without stopwords and punctuation.
Examples
fst_rm_stop_punct(conllu_dev_q11_3)
fst_rm_stop_punct(conllu_dev_q11_1, stopword_list <- "snowball")
fst_rm_stop_punct(conllu_cb_bullying, "stopwords-iso")
[Package finnsurveytext version 1.0.0 Index]