fst_get_top_ngrams2 {finnsurveytext} | R Documentation |
Make Top N-grams Table 2
Description
Creates a table of the most frequently occurring n-grams within the data. Equivalent to 'fst_get_top_ngrams()' but does not print a message.
Usage
fst_get_top_ngrams2(
  data,
  number = 10,
  ngrams = 1,
  norm = "number_words",
  pos_filter = NULL,
  strict = TRUE
)
Arguments
data | A data frame of text in CoNLL-U format. |
number | The number of n-grams to return, default is '10'. |
ngrams | The size of the n-grams to return (for example, '1' for single words or '2' for bigrams), default is '1'. |
norm | The method for normalising the data. Valid settings are 'number_words' (the number of words in the responses, default), 'number_resp' (the number of responses), or 'NULL' (raw count returned). |
pos_filter | List of UPOS tags for inclusion, default is 'NULL', which means all word types are included. |
strict | Whether to cut off strictly at 'number' (ties are ordered alphabetically), default is 'TRUE'. |
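The effect of 'strict' on tied counts can be illustrated with a small base-R sketch. This is not the package's implementation: 'top_counts()' is a hypothetical helper, the word counts are made up, and the sketch assumes that a non-strict cut-off keeps every entry tied with the one at position 'number'.

```r
# Hypothetical helper, not part of finnsurveytext: keep the top `number`
# entries of a named count vector, with or without a strict cut-off.
top_counts <- function(counts, number = 10, strict = TRUE) {
  # Sort by descending count, then alphabetically by name to order ties.
  ordered <- counts[order(-counts, names(counts))]
  if (length(ordered) <= number) {
    return(ordered)
  }
  if (strict) {
    # Strict: return exactly `number` entries.
    return(ordered[seq_len(number)])
  }
  # Non-strict (assumed behaviour): keep every entry whose count ties
  # with the entry at position `number`.
  threshold <- ordered[[number]]
  ordered[ordered >= threshold]
}

# Made-up counts for illustration only.
counts <- c(koulu = 4, apu = 2, ruoka = 2, koti = 2, vesi = 1)

strict_top <- top_counts(counts, number = 2, strict = TRUE)
loose_top <- top_counts(counts, number = 2, strict = FALSE)
print(strict_top)
print(loose_top)
```

Under these assumptions the strict call returns exactly two entries, while the non-strict call also returns the other words that share the second-place count.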
Value
A table of the most frequently occurring n-grams in the data.
Examples
fst_get_top_ngrams2(conllu_dev_q11_1_nltk)
fst_get_top_ngrams2(conllu_dev_q11_1_nltk, number = 10, ngrams = 1)
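The non-default arguments can be combined in the same way. The calls below are a sketch that uses only the arguments documented above; the chosen values (15 bigrams, the 'NOUN' and 'VERB' tags) and the variable names are illustrative, and passing the UPOS tags as a character vector is an assumption. The calls are guarded so that nothing runs when the package is not installed.

```r
# Guarded so the snippet does nothing when finnsurveytext is not installed.
if (requireNamespace("finnsurveytext", quietly = TRUE)) {
  dev_data <- finnsurveytext::conllu_dev_q11_1_nltk

  # Top 15 bigrams as raw counts (norm = NULL switches normalisation off).
  bigrams <- finnsurveytext::fst_get_top_ngrams2(
    dev_data,
    number = 15,
    ngrams = 2,
    norm = NULL
  )
  print(bigrams)

  # Top 10 single words restricted to nouns and verbs, normalised by the
  # number of responses and without a strict cut-off at 10.
  nouns_verbs <- finnsurveytext::fst_get_top_ngrams2(
    dev_data,
    norm = "number_resp",
    pos_filter = c("NOUN", "VERB"),
    strict = FALSE
  )
  print(nouns_verbs)
}
```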
[Package finnsurveytext version 1.0.0 Index]