fst_get_top_ngrams2 {finnsurveytext}    R Documentation

Make Top N-grams Table 2

Description

Creates a table of the most frequently occurring n-grams within the data. Equivalent to 'fst_get_top_ngrams()' but does not print a message.

Usage

fst_get_top_ngrams2(
  data,
  number = 10,
  ngrams = 1,
  norm = "number_words",
  pos_filter = NULL,
  strict = TRUE
)

Arguments

data

A dataframe of text in CoNLL-U format.

number

The number of n-grams to return, default is '10'.

ngrams

The size of the n-grams to return (for example, '1' for single words or '2' for bigrams), default is '1'.

norm

The method for normalising the data. Valid settings are '"number_words"' (the number of words in the responses, default), '"number_resp"' (the number of responses), or 'NULL' (raw count returned).

pos_filter

List of UPOS tags for inclusion, default is 'NULL', which means all word types are included.

strict

Whether to cut off strictly at 'number' (ties are ordered alphabetically), default is 'TRUE'.
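
The effect of 'norm' and 'strict' can be illustrated with a small base R sketch. It does not call the package: the toy counts, the totals, the column names 'words' and 'occurrence', and the assumption that normalisation is a plain division of the raw count are all illustrative, and the package's internal handling (for example, rounding or the exact tie rule when 'strict = FALSE') may differ.

```r
# Toy counts (hypothetical data, not taken from the package's datasets).
counts <- data.frame(
  words = c("koulu", "auto", "talo", "koti", "lapsi"),
  occurrence = c(4, 3, 3, 3, 1),
  stringsAsFactors = FALSE
)
total_words <- 20  # assumed number of words across all responses
total_resp <- 5    # assumed number of responses

# Normalisation, assumed here to be a simple division of the raw count.
counts$per_word <- counts$occurrence / total_words  # like norm = "number_words"
counts$per_resp <- counts$occurrence / total_resp   # like norm = "number_resp"

# Order by descending count; ties are ordered alphabetically.
ordered <- counts[order(-counts$occurrence, counts$words), ]

number <- 2

# strict = TRUE: keep exactly `number` rows.
strict_top <- head(ordered, number)

# strict = FALSE (assumed behaviour): also keep every row tied
# with the last retained count.
cutoff <- ordered$occurrence[number]
loose_top <- ordered[ordered$occurrence >= cutoff, ]

print(strict_top)
print(loose_top)
```

In this sketch the strict cut keeps two rows, while the non-strict cut keeps four, because three words share the count at the cut-off position.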

Value

A table of the most frequently occurring n-grams in the data.
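
As a rough illustration of how such a table could be built, the base R sketch below counts bigrams in a toy CoNLL-U-style data frame. It is not the package's implementation: the toy data, the helper 'make_ngrams()', the output column names, and the choice to filter by UPOS tag before forming n-grams from lemmas are all assumptions made for this example.

```r
# Toy CoNLL-U-style data (hypothetical; real CoNLL-U frames have more columns).
conllu_toy <- data.frame(
  doc_id = c("r1", "r1", "r1", "r2", "r2", "r2"),
  lemma  = c("uusi", "koulu", "olla", "uusi", "koulu", "auttaa"),
  upos   = c("ADJ", "NOUN", "AUX", "ADJ", "NOUN", "VERB"),
  stringsAsFactors = FALSE
)

# Keep only the requested word types (analogous to pos_filter).
pos_filter <- c("ADJ", "NOUN", "VERB")
kept <- conllu_toy[conllu_toy$upos %in% pos_filter, ]

# Hypothetical helper: all runs of n consecutive items in x.
make_ngrams <- function(x, n) {
  if (length(x) < n) return(character(0))
  starts <- seq_len(length(x) - n + 1)
  vapply(
    starts,
    function(i) paste(x[i:(i + n - 1)], collapse = " "),
    character(1)
  )
}

# Build bigrams within each response so they never span two responses.
bigrams <- unlist(
  lapply(split(kept$lemma, kept$doc_id), make_ngrams, n = 2),
  use.names = FALSE
)

# Count and sort: most frequent first, ties alphabetically.
tab <- as.data.frame(table(bigrams), stringsAsFactors = FALSE)
names(tab) <- c("words", "occurrence")
tab <- tab[order(-tab$occurrence, tab$words), ]
print(tab)
```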

Examples

fst_get_top_ngrams2(conllu_dev_q11_1_nltk)
fst_get_top_ngrams2(conllu_dev_q11_1_nltk, number = 10, ngrams = 1)

[Package finnsurveytext version 1.0.0 Index]