ESSLLI08_Nouns {wordspace} | R Documentation |
Noun Clustering Task from ESSLLI 2008 (wordspace)
Description
A set of 44 nouns denoting basic-level concepts from 6 semantic classes, used as a gold standard in the ESSLLI 2008 shared task on noun clustering.
Usage
ESSLLI08_Nouns
Format
A data frame with 44 rows and the following 5 columns:
word
a character vector specifying the 44 nouns in CWB/Penn format (see
convert.lemma
)class
a factor vector specifying the semantic class of each noun (bird, fruitTree, green, groundAnimal, tool, vehicle)
class2
a factor vector specifying a coarser 3-class categorization (animal, vegetable, artifact)
class3
a factor vector specifying a coarser 2-class categorization (natural, artifact)
freq.bnc
a numeric vector specifying the frequency of each noun in the British National Corpus
Source
http://wordspace.collocations.de/doku.php/data:esslli2008:concrete_nouns_categorization
Examples
print(ESSLLI08_Nouns)
[Package wordspace version 0.2-8 Index]