Dickens {zipfR} | R Documentation |
Dickens' Frequency Data (zipfR)
Description
Objects of classes spc
and vgc
that
contain frequency data for a collection of Dickens's works from
Project Gutenberg, and for 3 novels (Oliver Twist, Great
Expectations and Our Mutual Friends).
Details
Dickens.spc
has a frequency spectrum derived from a
collection of Dickens' works downloaded from the Gutenberg archive
(A Christmas Carol, David Copperfield, Dombey and Son, Great
Expectations, Hard Times, Master Humphrey's Clock, Nicholas
Nickleby, Oliver Twist, Our Mutual Friend, Sketches by BOZ, A Tale
of Two Cities, The Old Curiosity Shop, The Pickwick Papers, Three
Ghost Stories). Dickens.emp.vgc
contains the corresponding
observed vocabulary growth (V
and V(1)
).
DickensOliverTwist.spc
and DickensOliverTwist.emp.vgc
contain spectrum and observed growth curve (V
and V(1)
of the early novel Oliver Twist (1837-1839).
DickensGreatExpectations.spc
and
DickensGreatExpectations.emp.vgc
contain spectrum and
observed growth curve (V
and V(1)
) of the late novel
Great Expectations (1860-1861).
DickensOurMutualFriend.spc
and
DickensOurMutualFriend.emp.vgc
contain spectrum and observed
growth curve (V
and V(1)
) of Our Mutual Friend, the
last novel completed by Dickens (1864-1865).
Notice that we removed numbers and other forms of non-linguistic material before collecting the frequency data.
References
Project Gutenberg: https://www.gutenberg.org/
Charles Dickens on Wikipedia: https://en.wikipedia.org/wiki/Charles_Dickens
Examples
data(Dickens.spc)
summary(Dickens.spc)
data(Dickens.emp.vgc)
summary(Dickens.emp.vgc)
data(DickensOliverTwist.spc)
summary(DickensOliverTwist.spc)
data(DickensOliverTwist.emp.vgc)
summary(DickensOliverTwist.emp.vgc)