Extractor {boilerpipeR} | R Documentation |
Generic extraction function which calls boilerpipe extractors
Description
This is the workhorse function that calls the boilerpipe Java library directly. It is typically called through
the functions listed for the parameter exname.
Usage
Extractor(exname, content, asText = TRUE, ...)
Arguments
exname |
character specifying the extractor to be used. It can take one of the following values:
|
content |
Text content or URL as character |
asText |
Should the specified content be treated as the actual text to extract from, or as a URL (from which the HTML document is first downloaded and then extracted)? Defaults to TRUE |
... |
additional parameters |
Value
extracted text as character
Author(s)
Mario Annau
References
https://github.com/kohlschutter/boilerpipe
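Examples
The following is a minimal sketch of a direct call, not an official example from the package. It assumes that boilerpipeR and a working Java/rJava setup are installed, and that "ArticleExtractor" is one of the accepted exname values (the list of values is not reproduced on this page). The HTML snippet and the URL are made-up placeholders.

```r
# Hypothetical HTML input, used only for illustration
html <- paste0(
  "<html><head><title>Demo</title></head><body>",
  "<div id='nav'>Home | About | Contact</div>",
  "<p>Boilerplate removal strips navigation, ads and footers ",
  "so that only the main article text remains.</p>",
  "</body></html>"
)

# Only run the extraction if the package (and its Java backend) can be loaded
if (requireNamespace("boilerpipeR", quietly = TRUE)) {
  # asText = TRUE (the default): content is the HTML text itself.
  # "ArticleExtractor" is an assumed exname value.
  txt <- boilerpipeR::Extractor("ArticleExtractor", content = html, asText = TRUE)
  print(txt)  # extracted text as character

  # asText = FALSE: content is treated as a URL that is downloaded first.
  # Placeholder URL; needs network access, so it is left commented out.
  # txt_url <- boilerpipeR::Extractor("ArticleExtractor",
  #                                   content = "https://example.com/article.html",
  #                                   asText = FALSE)
}
```

In everyday use, the wrapper functions named after the individual extractors are usually more convenient than calling Extractor directly.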