durationsGe {languageR} R Documentation
Durational measurements on the Dutch prefix ge-
Description
Durational measurements on the Dutch prefix ge- in the Spoken Dutch Corpus.
Usage
data(durationsGe)
Format
A data frame with 428 observations on the following 8 variables.
Word
a factor with the words as levels.
Frequency
a numeric vector with the word's absolute frequency in the Spoken Dutch Corpus.
Speaker
a factor with the speakers as levels.
Sex
a factor with levels female and male; this information is missing for one speaker.
YearOfBirth
a numeric vector with years of birth.
DurationOfPrefix
a numeric vector with the duration of the prefix ge- in seconds.
SpeechRate
a numeric vector coding speech rate in number of syllables per second.
NumberSegmentsOnset
a numeric vector for the number of segments in the onset of the stem.
References
Pluymaekers, M., Ernestus, M. and Baayen, R. H. (2005) Frequency and acoustic length: the case of derivational affixes in Dutch, Journal of the Acoustical Society of America, 118, 2561-2569.
Examples
## Not run:
data(durationsGe)
durationsGe$Frequency = log(durationsGe$Frequency + 1)
durationsGe$YearOfBirth = durationsGe$YearOfBirth - 1900
durationsGe.lm = lm(DurationOfPrefix ~ Frequency + SpeechRate, data = durationsGe)
summary(durationsGe.lm)
# ---- model criticism
plot(durationsGe.lm)
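# ---- a hedged sketch, not part of the original example:
# one possible way to list candidate outliers numerically instead of
# reading them off the diagnostic plots. The cutoff of 2.5 on the
# standardized residuals is an assumption made for illustration; it is
# not necessarily how the indices below were chosen.
candidates = which(abs(rstandard(durationsGe.lm)) > 2.5)
candidates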
outliers = c(271, 392, 256, 413, 118)
durationsGe.lm = lm(DurationOfPrefix ~ Frequency + SpeechRate,
  data = durationsGe[-outliers, ])
summary(durationsGe.lm)
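# ---- a hedged sketch, not part of the original example:
# YearOfBirth is centered above but does not enter the model, and Sex is
# missing for one speaker. Assuming the missing value is coded as NA,
# both models are fitted to the same complete cases so that anova() can
# compare them. The names vars, complete, lm0 and lm1 are illustrative.
vars = c("DurationOfPrefix", "Frequency", "SpeechRate", "YearOfBirth", "Sex")
complete = durationsGe[complete.cases(durationsGe[, vars]), ]
lm0 = lm(DurationOfPrefix ~ Frequency + SpeechRate, data = complete)
lm1 = lm(DurationOfPrefix ~ Frequency + SpeechRate + YearOfBirth + Sex,
  data = complete)
anova(lm0, lm1)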
## End(Not run)