vocabulary_size {dials} | R Documentation |
Number of tokens in vocabulary
Description
Used in textrecipes::step_tokenize_sentencepiece()
and
textrecipes::step_tokenize_bpe()
.
Usage
vocabulary_size(range = c(1000L, 32000L), trans = NULL)
Arguments
range |
A two-element vector holding the defaults for the smallest and largest possible values, respectively. If a transformation is specified, these values should be in the transformed units. |
trans |
A |
Examples
vocabulary_size()
[Package dials version 1.3.0 Index]