| hash_grady_pos {lexicon} | R Documentation |
Grady Ward's Moby Parts of Speech
Description
A dataset containing a hash lookup of Grady Ward's parts of speech from the Moby project. Words with non-ASCII characters have been removed.
grady_pos_feature - A function that augments hash_grady_pos
with 3 additional columns: (1) n_pos - the number of parts of speech
a word has, (2) space - logical; whether the word contains a space,
and (3) primary - logical; whether this row is the most likely part of
speech for the word.
Usage
data(hash_grady_pos)
grady_pos_feature(data)
Arguments
data |
This should be |
Format
A data frame with 246,691 rows and 3 variables
Details
word. The word.
pos. The part of speech; one of:
Adjective, Adverb, Conjunction, Definite Article, Interjection, Noun, Noun Phrase, Plural, Preposition, Pronoun, Verb (intransitive), Verb (transitive), or Verb (usu participle). Note that the first part of speech for a word is its primary use; all other uses are secondary.
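The three columns added by grady_pos_feature can be illustrated on a toy table. The following is a minimal base R sketch, not the package's implementation: the data frame toy and its rows are made up for illustration, and primary is derived by assuming that the first row for each word is its primary use, as noted above.

```r
# Toy stand-in for hash_grady_pos (hypothetical rows, not taken from the dataset)
toy <- data.frame(
  word = c("dog", "dog", "hot dog", "run", "run", "run"),
  pos  = c("Noun", "Verb (transitive)", "Noun",
           "Verb (intransitive)", "Verb (transitive)", "Noun"),
  stringsAsFactors = FALSE
)

# n_pos: number of part-of-speech rows recorded for each word
toy$n_pos <- as.integer(ave(seq_along(toy$word), toy$word, FUN = length))

# space: TRUE when the word contains a space
toy$space <- grepl(" ", toy$word, fixed = TRUE)

# primary: TRUE for the first row of each word
# (assumes the rows keep the order in which the parts of speech were listed)
toy$primary <- !duplicated(toy$word)

toy
```

On the real data, calling grady_pos_feature(hash_grady_pos) is the intended way to obtain these columns; the sketch only shows what each column represents.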
Source
Originally downloaded from: http://icon.shef.ac.uk/Moby
Examples
## Not run:
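library(data.table)
# Add the n_pos, space, and primary columns
hash_grady_pos <- grady_pos_feature(hash_grady_pos)
# All rows for 'dog' (character subsetting assumes the table is keyed on word)
hash_grady_pos['dog']
# Keep only each word's primary part of speech
hash_grady_pos[primary == TRUE, ]
# Primary parts of speech for single-token words (no spaces)
hash_grady_pos[primary == TRUE & space == FALSE, ]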
## End(Not run)