fst_rm_stop_punct {finnsurveytext}R Documentation

Remove Finnish stopwords and punctuation from CoNLL-U dataframe

Description

Removes stopwords and punctuation from a dataframe containing Finnish survey text data which is already in CoNLL-U format.

Usage

fst_rm_stop_punct(data, stopword_list = "nltk")

Arguments

data

A dataframe of Finnish text in CoNLL-U format.

stopword_list

A valid Finnish stopword list, default is '"nltk"'.

Value

A dataframe of Finnish text in CoNLL-U format without stopwords and punctuation.

Examples

fst_rm_stop_punct(conllu_dev_q11_3)
fst_rm_stop_punct(conllu_dev_q11_1, stopword_list <- "snowball")
fst_rm_stop_punct(conllu_cb_bullying, "stopwords-iso")

[Package finnsurveytext version 1.0.0 Index]