| qlcData-package {qlcData} | R Documentation |
Processing data for quantitative language comparison (QLC)
Description
The package offers various functions to read, transcode and process data. There are many different function to read in data. Also a general framework to recode nominal data is included. Further, there is a general approach to describe orthographic systems through so-called Orthography Profiles. It offers functions to write such profiles based on some actual written text, and to test and correct profiles given concrete data. The main end-use is to produce tokenized texts in so-called tailored grapheme clusters.
Details
| Package: | qlcData |
| Type: | Package |
| Version: | 0.3 |
| Date: | 2024-06-07 |
| License: | GPL-3 |
Various functions to read specific data formats of QLC are documented in read_align, read.profile, read.recoding.
The recode function allows for an easy and transparent way to specify a recoding of an existing nominal dataset. The specification of the recoding-decisions is preferably saved in an easily accessible YAML-file. There are utility function write.profile for writing and reading such files included.
For processing of strings using orthography profiles, the central function is tokenize. A basic sceleton for an orthography profile can be produced with write.profile
Author(s)
Michael Cysouw <cysouw@mac.com>