gettxt {htm2txt}R Documentation

Extract simple plain texts from a web page at a certain URL

Description

Extract simple plain texts from a web page at a certain URL

Usage

gettxt(URL, encoding = "UTF-8", ...)

Arguments

URL

A character indicating the URL of a web page.

encoding

Encoding method (e.g., "UTF-8", "latin1", "bytes", "unknown", etc.).

...

Other htm2txt arguments.

Value

A character containing plain texts converted from the htm document at the URL.

Examples

text = gettxt("https://www.wikipedia.org/")

[Package htm2txt version 2.2.2 Index]