| quanteda_options {quanteda} | R Documentation |
Get or set package options for quanteda
Description
Get or set global options affecting functions across quanteda.
Usage
quanteda_options(..., reset = FALSE, initialize = FALSE)
Arguments
... | options to be set, as key-value pairs, same as options()
reset | logical; if TRUE, reset all options to their default values
initialize | logical; if TRUE, set only those options that are not already defined
Details
Currently available options are:
verbose
  logical; if TRUE then use this as the default for all functions with a verbose argument
threads
  integer; specifies the number of threads to use in parallelized functions; defaults to the maximum number of threads
print_dfm_max_ndoc, print_corpus_max_ndoc, print_tokens_max_ndoc
  integer; specify the number of documents to display when using the defaults for printing a dfm, corpus, or tokens object
print_dfm_max_nfeat, print_corpus_max_nchar, print_tokens_max_ntoken
  integer; specifies the number of features to display when printing a dfm, the number of characters to display when printing corpus documents, or the number of tokens to display when printing tokens objects
print_dfm_summary
  integer; specifies the number of documents to display when using the defaults for printing a dfm
print_dictionary_max_nkey, print_dictionary_max_nval
  the number of keys or values (respectively) to display when printing a dictionary
print_kwic_max_nrow
  the number of rows to display when printing a kwic object
base_docname
  character; stem name for documents that are unnamed when a corpus, tokens, or dfm are created or when a dfm is converted from another object
base_featname
  character; stem name for features that are unnamed when they are added, for whatever reason, to a dfm through an operation that adds features
base_compname
  character; stem name for components that are created by matrix factorization
language_stemmer
  character; language option for char_wordstem(), tokens_wordstem(), and dfm_wordstem()
pattern_hashtag, pattern_username
  character; regex patterns for (social media) hashtags and usernames respectively, used to avoid segmenting these in the default internal "word" tokenizer
tokens_block_size
  integer; specifies the number of documents to be tokenized at a time in blocked tokenization. When the number is large, tokenization becomes faster but also memory-intensive.
tokens_locale
  character; specify locale in stringi boundary detection in tokenization and corpus reshaping. See stringi::stri_opts_brkiter().
tokens_tokenizer_word
  character; the current word tokenizer version used as a default for what = "word" in tokens(), one of "word1", "word2", "word3" (same as "word2"), or "word4".
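As an illustration of how the printing options above might be used, the following sketch lowers the dfm print limits, prints a small dfm, and then restores the saved options. The texts and the names old and toks are made up for this example, and the exact printed output depends on the installed quanteda version.

```r
library(quanteda)

# Save the current options so they can be restored afterwards
old <- quanteda_options()

# Show at most 3 documents and 5 features when printing a dfm
quanteda_options(print_dfm_max_ndoc = 3L, print_dfm_max_nfeat = 5L)

# Small made-up corpus (hypothetical data, for illustration only)
toks <- tokens(c(d1 = "a b c d e f g", d2 = "a b c",
                 d3 = "d e f", d4 = "g h i"))
print(dfm(toks))

# Read back one of the values that was just set
ndoc_limit <- quanteda_options("print_dfm_max_ndoc")

# Restore the saved options by passing the list back
quanteda_options(old)
```

Passing the saved list back follows the same pattern as the last line of the Examples section.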
Value
When called using a key = value pair (where key can be
a label or quoted character name), the option is set and TRUE is
returned invisibly.
When called with no arguments, a named list of the package options is returned.
When called with reset = TRUE as an argument, all
options are reset to their default values, and TRUE is returned
invisibly.
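A minimal sketch of inspecting what the getter forms return, assuming quanteda is installed. The variable names opts and verbose_now are illustrative, and no printed output is shown because the values depend on the local configuration.

```r
library(quanteda)

# No arguments: a named list of the package options
opts <- quanteda_options()
is.list(opts)
head(names(opts))

# A single unnamed option name returns that option's current value,
# as in the quanteda_options("threads") call in the Examples
verbose_now <- quanteda_options("verbose")
identical(verbose_now, opts$verbose)
```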
Examples
(opt <- quanteda_options())
quanteda_options(verbose = TRUE)
quanteda_options("verbose" = FALSE)
quanteda_options("threads")
quanteda_options(print_dfm_max_ndoc = 50L)
# reset to defaults
quanteda_options(reset = TRUE)
# reset to saved options
quanteda_options(opt)