letter.convert {JATSdecoder}R Documentation

letter.convert

Description

Converts and unifies most hexadecimal and some HTML coded letters to Unicode characters. Performs CERMINE specific error correction (inserting operators, where these got lost while conversion).

Usage

letter.convert(x, cermine = FALSE, greek2text = FALSE, warning = TRUE)

Arguments

x

text string to process.

cermine

Logical. If TRUE CERMINE specific error handling and letter conversion will be applied.

greek2text

Logical. If TRUE some greek letters and special characters will be unified to textual representation (important to extract stats).

warning

Logical. If TRUE prints warning massage if CERMINE specific letter conversion was performed.

Value

Character. Text with unified and corrected letter representation.

Examples

x<-c("five &#x0003c; ten","five &lt; ten")
letter.convert(x)

[Package JATSdecoder version 1.2.0 Index]