| writtenVariationLijk {languageR} | R Documentation |
Variation in written Dutch in the use of the suffix -lijk
Description
This dataset documents variation in the use of the 80 most frequent words ending in the suffix -lijk in written Dutch.
Usage
data(writtenVariationLijk)
Format
A data frame with 560 observations on the following 5 variables.
Corpusa factor with as levels the sampled newspapers:
belang(Het Belang van Limburg),gazet(De Gazet van Antwerpen),laatnieu(Het Laatste Nieuws),limburg(De Limburger),nrc(NRC Handelsblad),stand(De Standaard), andtele(De Telegraaf).Worda factor with the 80 most frequent words ending in -lijk.
Counta numeric vector with token counts in the CONDIV corpus.
Countrya factor with levels
FlandersandNetherlands.Registera factor with levels
National,QualityandRegionalcoding the type of newspaper.
References
Keune, K., Ernestus, M., Van Hout, R. and Baayen, R.H. (2005) Social, geographical, and register variation in Dutch: From written 'mogelijk' to spoken 'mok', Corpus Linguistics and Linguistic Theory, 1, 183-223.
Examples
## Not run:
data(writtenVariationLijk)
require(lme4)
require(lmerTest)
require(lme4)
writtenVariationLijk.lmer = glmer(Count ~ Country * Register + (1|Word),
control=glmerControl(optimizer="optimx",optCtrl=list(method="nlminb")),
data = writtenVariationLijk, family = "poisson")
writtenVariationLijk.lmerA = glmer(Count ~ Country * Register + (Country|Word),
control=glmerControl(optimizer="optimx",optCtrl=list(method="nlminb")),
data = writtenVariationLijk, family = "poisson")
anova(writtenVariationLijk.lmer, writtenVariationLijk.lmerA)
summary(writtenVariationLijk.lmerA)
## End(Not run)