| u_char_properties {Unicode} | R Documentation |
Unicode Character Properties
Description
Get the properties of Unicode characters.
Usage
u_char_info(x)
u_char_properties(x, which)
u_char_property(x, which)
Arguments
x |
an R object which can be coerced to a |
which |
a character vector or string (for
|
Value
For u_char_info, a data frame with variables giving the Code
(Code) and the ‘basic’ Unicode variables Name, General
Category, Canonical Combining Class, Bidi Class, Decomposition,
Numeric Value Decimal Digit, Numeric Value Digit, Numeric Value,
Bidi Mirrored, Unicode 1 Name, ISO Comment, Simple Uppercase Mapping,
Simple Lowercase Mapping, and Simple Titlecase Mapping, with names
obtained by replacing white spaces by underscores (e.g.,
Bidi_Class.)
For u_char_properties, a data frame with the values of the
specified properties, or, if no arguments were given, a character
vector with the names of all currently available Unicode character
properties.
For u_char_property, the values of the specified property.
Note
Currently, only the property values of a subset of all Unicode character properties can be obtained.
References
Unicode Character Database (https://www.unicode.org/ucd/)
Examples
## When was the Euro sign added to Unicode?
x <- u_char_from_name("EURO SIGN")
u_char_property(x, "Age")
## List the currently available Unicode character properties.
u_char_properties()