Extractor {boilerpipeR} | R Documentation |
It is the actual workhorse which directly calls the boilerpipe Java library. Typically called through
functions as listed for parameter exname
.
Extractor(exname, content, asText = TRUE, ...)
exname |
character specifying the extractor to be used. It can take one of the following values:
|
content |
Text content or URL as character |
asText |
should content specifed be treated as actual text to be extracted or url (from which HTML document is first downloaded and extracted afterwards), defaults to TRUE |
... |
additional parameters |
extracted text as character
Mario Annau
https://github.com/kohlschutter/boilerpipe