ngrams {NLP} | R Documentation
Compute N-Grams
Description
Compute the n-grams (contiguous sub-sequences of length n) of a given sequence.
Arguments
x | a sequence (vector).
n | a positive integer giving the length of the contiguous sub-sequences to be computed.
Value
A list with the computed sub-sequences, each a vector of n contiguous elements of x.
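To make the shape of the result concrete, the following is a minimal base-R sketch of contiguous n-gram extraction. The function name `ngrams_sketch` is hypothetical and is not part of the NLP package; the handling of an invalid `n` and of sequences shorter than `n` is an assumption made for illustration, and the package's own implementation may differ.

```r
## Hypothetical helper, NOT part of the NLP package: a base-R sketch of
## how contiguous sub-sequences of length n can be collected into a list.
ngrams_sketch <- function(x, n) {
    N <- length(x)
    n <- as.integer(n)
    ## Assumption: reject anything that is not a positive integer.
    if (is.na(n) || n < 1L) stop("'n' must be a positive integer")
    ## Assumption: a sequence shorter than n yields an empty list.
    if (N < n) return(list())
    ## One n-gram per start position 1, ..., N - n + 1.
    lapply(seq_len(N - n + 1L), function(i) x[i:(i + n - 1L)])
}

w <- c("The", "quick", "brown", "fox")
bigrams <- ngrams_sketch(w, 2L)
length(bigrams)   # 3
bigrams[[1L]]     # "The" "quick"
```

A sequence of length N therefore gives N - n + 1 sub-sequences when N is at least n, each retaining the type of x.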
Examples
s <- "The quick brown fox jumps over the lazy dog"
## Split into words:
w <- strsplit(s, " ", fixed = TRUE)[[1L]]
## Word tri-grams:
ngrams(w, 3L)
## Word tri-grams pasted together:
vapply(ngrams(w, 3L), paste, "", collapse = " ")
[Package NLP version 0.2-1 Index]