ngram {EnvNJ}    R Documentation

Compute n-Gram Frequencies Vector

Description

Computes the n-gram frequency vector of a given protein, that is, how many times each word of length k occurs in its sequence.

Usage

ngram(prot, k = 4)

Arguments

prot

a character string corresponding to the primary structure of the protein.

k

a positive integer between 1 and 5 giving the length of the words (k-mers) to be counted.

Details

The sequence must be written in the one-letter amino acid code, in capital letters.

Value

A dataframe with two columns: the first gives the peptides and the second gives the corresponding absolute frequencies.

References

Stuart et al. Bioinformatics 2002; 18:100-108.

See Also

ngraMatrix(), ffp(), svdgram()

Examples

ngram(bovids$Bos_taurus[1], k = 3)
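To illustrate what counting words of length k means, here is a minimal base R sketch. The function count_kmers() and the toy sequence are hypothetical and are not part of EnvNJ. The sketch assumes overlapping words and reports only words that actually occur, which may differ from how ngram() builds its output.

```r
# Hypothetical helper (not part of EnvNJ): count overlapping k-mers in base R.
count_kmers <- function(prot, k = 4) {
  stopifnot(is.character(prot), length(prot) == 1, k >= 1, nchar(prot) >= k)
  # Start position of every overlapping window of width k
  starts <- seq_len(nchar(prot) - k + 1)
  # Extract all words of length k
  words <- substring(prot, starts, starts + k - 1)
  # Tabulate absolute frequencies of the observed words
  counts <- table(words)
  data.frame(peptide = names(counts),
             frequency = as.integer(counts),
             stringsAsFactors = FALSE)
}

# Toy sequence (assumed for illustration only)
res <- count_kmers("MKVLAMKV", k = 3)
res
```

A sequence of length n contains n - k + 1 overlapping words, so the frequency column sums to 6 for this toy input.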

[Package EnvNJ version 0.1.3 Index]