bootstrap_dfm {quanteda} | R Documentation |
Bootstrap a dfm
Description
Create an array of resampled dfms.
Usage
bootstrap_dfm(x, n = 10, ..., verbose = quanteda_options("verbose"))
Arguments
x |
a dfm object |
n |
number of resamples |
... |
additional arguments passed to |
verbose |
if |
Details
Function produces multiple, resampled dfm objects, based on resampling sentences (with replacement) from each document, recombining these into new "documents" and computing a dfm for each. Resampling of sentences is done strictly within document, so that every resampled document will contain at least some of its original tokens.
Value
A named list of dfm objects, where the first, dfm_0
, is
the dfm from the original texts, and subsequent elements are the
sentence-resampled dfms.
Author(s)
Kenneth Benoit
Examples
# bootstrapping from the original text
set.seed(10)
txt <- c(textone = "This is a sentence. Another sentence. Yet another.",
texttwo = "Premiere phrase. Deuxieme phrase.")
dfmat <- dfm(tokens(txt))
bootstrap_dfm(dfmat, n = 3, verbose = TRUE)
[Package quanteda version 4.0.2 Index]