| durationsGe {languageR} | R Documentation |
Durational measurements on the Dutch prefix ge-
Description
Durational measurements on the Dutch prefix ge- in the Spoken Dutch Corpus.
Usage
data(durationsGe)
Format
A data frame with 428 observations on the following 8 variables.
Worda factor with the words as levels.
Frequencya numeric vector with the word's absolute frequency in the Spoken Dutch Corpus.
Speakera factor with the speakers as levels.
Sexa factor with levels
femaleandmale, this information is missing for one speaker.YearOfBirtha numeric vector with years of birth.
DurationOfPrefixa numeric vector with the duration of the prefix -ont in seconds.
SpeechRatea numeric vector coding speech rate in number of syllables per second.
NumberSegmentsOnseta numeric vector for the number of segments in the onset of the stem.
References
Pluymaekers, M., Ernestus, M. and Baayen, R. H. (2005) Frequency and acoustic length: the case of derivational affixes in Dutch, Journal of the Acoustical Society of America, 118, 2561-2569.
Examples
## Not run:
data(durationsGe)
durationsGe$Frequency = log(durationsGe$Frequency + 1)
durationsGe$YearOfBirth = durationsGe$YearOfBirth - 1900
durationsGe.lm = lm(DurationOfPrefix ~ Frequency+SpeechRate, data = durationsGe)
summary(durationsGe.lm)
# ---- model criticism
plot(durationsGe.lm)
outliers = c(271, 392, 256, 413, 118, 256)
durationsGe.lm = lm(DurationOfPrefix ~ Frequency + SpeechRate,
data = durationsGe[-outliers, ])
summary(durationsGe.lm)
## End(Not run)