| as.VCorpus {polmineR} | R Documentation |
Get VCorpus.
Description
Retrieve full text for the subcorpora orpartition objects in a
subcorpus_bundle or partition_bundle and generate a
VCorpus-class object from the tm-package.
Usage
## S4 method for signature 'partition_bundle'
as.VCorpus(x)
Arguments
x |
A |
Details
The VCorpus class of the tm-package offers an interface to
access the functionality of the tm-package. Note however that
generating a VCorpus to get a DocumentTermMatrix, or a
TermDocumentMatrix is a highly inefficient detour. Applying the
as.DocumentTermMatrix or as.TermDocumentMatrix methods on a
partition_bundle is the recommended approach.
If the tm-package has been loaded, the as.VCorpus-method
included in the polmineR-package may become inaccessible. To deal with
this (propable) scenario, it is possible to use a coerce-method
(as(YOUROBJECT, "VCorpus")), see examples.
Examples
use(pkg = "RcppCWB", corpus = "REUTERS")
pb <- corpus("REUTERS") %>%
partition_bundle(s_attribute = "id")
vc <- as.VCorpus(pb) # works only, if tm-package has not yet been loaded
vc <- as(pb, "VCorpus") # will work if tm-package has been loaded, too
vc <- corpus("REUTERS") %>%
split(s_attribute = "id") %>%
as("VCorpus")