corpus_subset {quanteda} | R Documentation |
Extract a subset of a corpus
Description
Returns subsets of a corpus that meet certain conditions, including direct
logical operations on docvars (document-level variables). corpus_subset
functions identically to subset.data.frame(), using non-standard
evaluation to evaluate conditions based on the docvars in the corpus.
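Since the behaviour is described by analogy with subset.data.frame(), a small base R sketch may help show what non-standard evaluation means here. The data frame `dv` and its values are hypothetical stand-ins for a corpus's docvars; nothing below is taken from quanteda itself.

```r
# Toy data frame standing in for document-level variables (hypothetical values).
dv <- data.frame(
  doc = c("d1", "d2", "d3", "d4"),
  Year = c(1975, 1985, NA, 1993),
  stringsAsFactors = FALSE
)

# The condition is evaluated inside the data frame, so `Year` can be
# written bare, without a `dv$` prefix.
kept <- subset(dv, Year > 1980)

# Rows where the condition evaluates to NA are dropped, not kept.
kept$doc
```

Here d1 fails the condition and d3 has a missing Year, so only d2 and d4 remain.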
Usage
corpus_subset(x, subset, drop_docid = TRUE, ...)
Arguments
x |
corpus object to be subsetted. |
subset |
logical expression indicating the documents to keep: missing values are taken as false. |
drop_docid |
if |
... |
not used |
Value
corpus object, with a subset of documents (and docvars) selected according to arguments
See Also
subset.data.frame()
Examples
summary(corpus_subset(data_corpus_inaugural, Year > 1980))
summary(corpus_subset(data_corpus_inaugural, Year > 1930 & President == "Roosevelt"))
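The examples above rely on the bundled inaugural corpus. As a further sketch, the same call can be tried on a small hand-built corpus, which makes the handling of missing docvar values easier to see. The object `toy`, its texts, and the docvars `year` and `lang` are made up for illustration, and the `cutoff` variable assumes that conditions can also refer to objects in the calling environment, as they can with subset.data.frame().

```r
library(quanteda)

# A three-document toy corpus with one missing docvar value (hypothetical data).
toy <- corpus(
  c(d1 = "First text.", d2 = "Second text.", d3 = "Third text."),
  docvars = data.frame(year = c(1990, NA, 2010),
                       lang = c("en", "en", "fr"))
)

# d2 has a missing year, so the condition is NA for it and it is not kept.
recent <- corpus_subset(toy, year > 2000)
docnames(recent)

# Docvars can be combined with a variable from the calling environment.
cutoff <- 1980
english <- corpus_subset(toy, year > cutoff & lang == "en")
docnames(english)
```

The first call should keep only d3, and the second only d1, since d2 is excluded by its missing year in both cases.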
[Package quanteda version 4.0.2 Index]