talkers {hqmisc} | R Documentation |
Data set of talkers of Dutch from the Netherlands.
Description
This data set gives metadata (id, sex, age, region of origin) and speech characteristics (average syllable duration, average phrase length) for a stratified sample of 80 talkers of Dutch from the Netherlands.
Usage
data(talkers)
Format
A data frame with 80 observations on the following 6 variables.
id
identifier code (from data source, see Source)
sex
sex (0=female, 1=male)
age
age (in years)
region
region of origin (a factor with levels
M
=Mid,N
=North,S
=South, orW
=West)syldur
average duration of syllables, or seconds per syllable (in seconds, excluding pause time, 1/(articulation rate) )
nsyl
average number of syllables per phrase, or average phrase length in syllables
Details
Talkers grew up in their region of origin, and have lived and worked there as teachers of Dutch Language and Literature in secondary education. Talkers with ages between 41 and 45 were not included in this study. The sample is stratified by sex, region, and (age>41) (see Examples).
Speech data were collected from (and averaged over) a recorded interview lasting about 15 minutes. The talker and the interviewer only spoke Standard Dutch during the interview.
One talker (id
117) spoke remarkably slower than all others, yielding a very high syldur.
The West region is commonly regarded as the linguistic center of the Netherlands. Each of the four regions has a distinct variety of Dutch. The variety of the West region is closest to the Standard Dutch spoken in the Netherlands.
Speech recordings and metadata were collected in 1999.
Source
http://tla.mpi.nl/resources/data-archive/, Corpus of Spoken Dutch
References
Oostdijk, N. (2000). The Spoken Dutch Corpus: Overview and first evaluation. In M. Gravilidou, G. Carayannis, S. Markantonatou, S. Piperidis & G. Stainhaouer (Eds.), Proceedings of the Second International Conference on Language Resources and Evaluation (Vol. 2, pp. 887-894).
Adank, P., van Hout, R., & van de Velde, H. (2007). An acoustic description of the vowels of northern and southern Standard Dutch II: Regional varieties. Journal of the Acoustical Society of America, 121(2), 1130-1141.
Quené, H. (2008). Multilevel modeling of between-speaker and within-speaker variation in spontaneous speech tempo. Journal of the Acoustical Society of America, 123(2), 1104-1113.
Examples
data(talkers)
str(talkers)
pairs( talkers[,2:6] )
with( talkers, table( sex, region, I(age>41) ) )