LargestContentExtractor {boilerpipeR}R Documentation

A full-text extractor which extracts the largest text component of a page.

Description

For news articles, it may perform better than the DefaultExtractor, but usually worse than ArticleExtractor.

Usage

LargestContentExtractor(content, ...)

Arguments

content

Text content as character

...

additional parameters

Value

extracted text as character

Author(s)

Mario Annau

See Also

Extractor

Examples

data(content)
extract <- LargestContentExtractor(content)

[Package boilerpipeR version 1.3.2 Index]