| Extractor {boilerpipeR} | R Documentation |
Generic extraction function which calls boilerpipe extractors
Description
This is the workhorse function that calls the boilerpipe Java library directly. It is typically called through
the wrapper functions listed under the parameter exname.
Usage
Extractor(exname, content, asText = TRUE, ...)
Arguments
exname |
character specifying the extractor to be used. It can take one of the following values:
|
content |
Text content or URL as character |
asText |
logical; if TRUE, content is treated as the text to extract from; if FALSE, content is treated as a URL from which the HTML document is first downloaded and then extracted. Defaults to TRUE |
... |
additional parameters |
Value
extracted text as character
Author(s)
Mario Annau
References
https://github.com/kohlschutter/boilerpipe
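Examples
The following is a minimal sketch of a direct call, not an official example from the package. It assumes that boilerpipeR and a working Java/rJava setup are installed, and that "ArticleExtractor" (an extractor class of the boilerpipe Java library) is an accepted value of exname; the HTML string and the URL in the comment are made-up placeholders.

```r
# Sketch only: requires the boilerpipeR package and a working Java/rJava setup.

# A small, made-up HTML document with navigation boilerplate and one paragraph.
html <- paste0(
  "<html><head><title>Example page</title></head><body>",
  "<div id='nav'><a href='/'>Home</a> <a href='/about'>About</a></div>",
  "<p>Boilerplate removal separates the main text of a web page from ",
  "navigation, advertisements and other template content. This paragraph ",
  "stands in for the article body that an extractor is expected to keep.</p>",
  "</body></html>"
)

if (requireNamespace("boilerpipeR", quietly = TRUE)) {
  # Assumption: "ArticleExtractor" is one of the values accepted by exname.
  # asText = TRUE (the default) means html is treated as the text itself.
  text <- boilerpipeR::Extractor("ArticleExtractor", html, asText = TRUE)
  print(text)

  # With asText = FALSE, content is treated as a URL to download first.
  # Placeholder URL, shown but not run:
  # text <- boilerpipeR::Extractor("ArticleExtractor",
  #                                "https://example.com/article.html",
  #                                asText = FALSE)
}
```

The call is wrapped in requireNamespace() so that the snippet does nothing, rather than failing, on a machine where the package or Java is unavailable.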