Statistics and Data Sets for Corpus Frequency Data


Documentation for package ‘corpora’ version 0.6

Help Pages

corpora-package corpora: Statistical Inference from Corpus Frequency Data
alpha.col Colour palettes for linguistic visualization (corpora)
binom.pval P-values of the binomial test for frequency counts (corpora)
BNCbiber Biber's (1988) register features for the British National Corpus
BNCcomparison Comparison of written and spoken noun frequencies in the British National Corpus
BNCdomains Distribution of domains in the British National Corpus (BNC)
BNCInChargeOf Collocations of the phrase "in charge of" (BNC)
BNCmeta Metadata for the British National Corpus (XML edition)
BNCqueries Per-text frequency counts for a selection of BNCweb corpus queries
BrownBigrams Bigrams of adjacent words from the Brown corpus
BrownLOBPassives Frequency counts of passive verb phrases in the Brown and LOB corpora
BrownPassives Frequency counts of passive verb phrases in the Brown corpus
BrownStats Basic statistics of texts in the Brown corpus
chisq Pearson's chi-squared statistic for frequency comparisons (corpora)
chisq.pval P-values of Pearson's chi-squared test for frequency comparisons (corpora)
colVector Propagate vector to single-row or single-column matrix (corpora)
cont.table Build contingency tables for frequency comparison (corpora)
corpora corpora: Statistical Inference from Corpus Frequency Data
corpora.palette Colour palettes for linguistic visualization (corpora)
DistFeatBrownFam Latent dimension scores from a distributional analysis of the Brown Family corpora
FakeCensus Simulated census data for examples and illustrations (corpora)
fisher.pval P-values of Fisher's exact test for frequency comparisons (corpora)
keyness Compute best-practice keyness measures (corpora)
KrennPPV German PP-Verb collocation candidates annotated by Brigitte Krenn (2000)
LanguageCourse Simulated study on effectiveness of language course (corpora)
LOBPassives Frequency counts of passive verb phrases in the LOB corpus
LOBStats Basic statistics of texts in the LOB corpus
PassiveBrownFam By-text frequencies of passive verb phrases in the Brown Family corpora
prop.cint Confidence interval for proportion based on frequency counts (corpora)
qw Split string into words, similar to qw() in Perl (corpora)
rowVector Propagate vector to single-row or single-column matrix (corpora)
sample.df Random samples from data frames (corpora)
simulated.census Simulated census data for examples and illustrations (corpora)
simulated.language.course Simulated study on effectiveness of language course (corpora)
simulated.wikipedia Simulated type and token counts for Wikipedia articles (corpora)
stars.pval Show p-values as significance stars (corpora)
VSS A small corpus of very short stories with linguistic annotations
WackypediaStats Simulated type and token counts for Wikipedia articles (corpora)
z.score The z-score statistic for frequency counts (corpora)
z.score.pval P-values of the z-score test for frequency counts (corpora)