LargestContentExtractor {boilerpipeR} | R Documentation |
A full-text extractor which extracts the largest text component of a page.
Description
For news articles, it may perform better than the DefaultExtractor
,
but usually worse than ArticleExtractor
.
Usage
LargestContentExtractor(content, ...)
Arguments
content |
Text content as character |
... |
additional parameters |
Value
extracted text as character
Author(s)
Mario Annau
See Also
Examples
data(content)
extract <- LargestContentExtractor(content)
[Package boilerpipeR version 1.3.2 Index]