read.corp.celex {koRpus} | R Documentation |
Import Celex data
Description
Read data from Celex[1] formatted corpora.
Usage
read.corp.celex(
celex.path,
running.words,
fileEncoding = "ISO_8859-1",
n = -1,
caseSens = TRUE
)
Arguments
celex.path |
A character string, path to a frequency file in Celex format to read. |
running.words |
An integer value, number of running words in the Celex data corpus to be read. |
fileEncoding |
A character string naming the encoding of the Celex files. |
n |
An integer value defining how many lines of data should be read if |
caseSens |
Logical,
if |
Value
An object of class kRp.corp.freq
.
References
See Also
Examples
## Not run:
my.Celex.data <- read.corp.celex(
file.path("~","mydata","Celex","GERMAN","GFW","GFW.CD"),
running.words=5952000
)
freq.analysis(
tokenized.obj,
corp.freq=my.Celex.data
)
## End(Not run)
[Package koRpus version 0.13-8 Index]