qlcData-package {qlcData} | R Documentation |
Processing data for quantitative language comparison (QLC)
Description
The package offers various functions to read, transcode and process data. There are many different functions to read in data, and a general framework for recoding nominal data is included. In addition, there is a general approach to describing orthographic systems through so-called Orthography Profiles: the package offers functions to write such profiles based on actual written text, and to test and correct profiles given concrete data. The main end use is to produce texts tokenized into so-called tailored grapheme clusters.
Details
Package: | qlcData |
Type: | Package |
Version: | 0.3 |
Date: | 2024-06-07 |
License: | GPL-3 |
Various functions to read specific data formats of QLC are documented in read_align, read.profile, read.recoding.
The recode function allows for an easy and transparent way to specify a recoding of an existing nominal dataset. The specification of the recoding decisions is preferably saved in an easily accessible YAML file. The utility functions write.recoding and read.recoding are included for writing and reading such files.
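The idea of keeping recoding decisions as plain, inspectable data can be sketched in base R. This is only an illustration of the concept, not the implementation of recode: the helper apply_recoding, the example data, and the field names attribute, source and values are all hypothetical, and the actual structure of a qlcData recoding file may differ.

```r
# Hypothetical sketch in base R; NOT the qlcData recode() implementation.
# Toy nominal dataset: one word-order value per language.
data <- data.frame(
  language   = c("A", "B", "C", "D"),
  word_order = c("SOV", "SVO", "VSO", "OSV"),
  stringsAsFactors = FALSE
)

# Recoding decisions stored as plain data, in the spirit of a YAML file.
# The field names (attribute, source, values) are assumptions of this sketch.
recoding <- list(
  attribute = "verb_position",
  source    = "word_order",
  values    = list(
    final   = c("SOV", "OSV"),
    medial  = "SVO",
    initial = "VSO"
  )
)

# Hypothetical helper: add a new attribute by merging the old values
# according to the specification; unlisted values stay NA.
apply_recoding <- function(data, spec) {
  old <- data[[spec$source]]
  new <- rep(NA_character_, length(old))
  for (label in names(spec$values)) {
    new[old %in% spec$values[[label]]] <- label
  }
  data[[spec$attribute]] <- new
  data
}

recoded <- apply_recoding(data, recoding)
```

Because the decisions live in a separate list rather than in the transformation code, they can be reviewed, edited or stored in a file without touching the function that applies them.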
For processing strings using orthography profiles, the central function is tokenize. A basic skeleton for an orthography profile can be produced with write.profile.
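To give a rough sense of what tokenizing into tailored grapheme clusters means, the following base R sketch splits a string by greedy longest match against a small list of graphemes. It is a simplified stand-in and not the algorithm used by tokenize: the function tokenize_sketch and the toy profile are hypothetical, and a real orthography profile can carry more information than a bare grapheme list.

```r
# Hypothetical sketch in base R; NOT the qlcData tokenize() implementation.
# Toy profile: multi-character entries act as tailored grapheme clusters.
profile <- c("sh", "th", "oo", "a", "m", "p", "s", "h", "o", "t")

tokenize_sketch <- function(string, graphemes, sep = " ") {
  # Try longer graphemes first, so "sh" wins over "s" followed by "h".
  graphemes <- graphemes[order(nchar(graphemes), decreasing = TRUE)]
  tokens <- character(0)
  while (nchar(string) > 0) {
    hit <- graphemes[startsWith(string, graphemes)]
    # Assumption of this sketch: fall back to a single character
    # when nothing in the profile matches.
    piece <- if (length(hit) > 0) hit[1] else substr(string, 1, 1)
    tokens <- c(tokens, piece)
    string <- substr(string, nchar(piece) + 1, nchar(string))
  }
  paste(tokens, collapse = sep)
}

tokenized <- tokenize_sketch("shampoo", profile)
print(tokenized)  # "sh a m p oo"
```

The example shows why a profile is needed at all: without the entries "sh" and "oo", the same string would be split into seven single characters.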
Author(s)
Michael Cysouw <cysouw@mac.com>