human {canprot}R Documentation

Amino acid compositions of human proteins

Description

Amino acid compositions of human proteins derived from UniProt.

Format

human.aa is a data frame with 25 columns in the format used for amino acid compositions in CHNOSZ (see thermo):

protein character Identification of protein
organism character Identification of organism
ref character Reference key for source of sequence data
abbrv character Abbreviation or other ID for protein (e.g. gene name)
chains numeric Number of polypeptide chains in the protein
Ala...Tyr numeric Number of each amino acid in the protein

The protein column contains UniProt IDs in the format database|accession-isoform, where database is most often ‘⁠sp⁠’ (Swiss-Prot) or ‘⁠tr⁠’ (TrEMBL), and isoform is an optional suffix indicating the isoform of the protein (particularly in the human.additional file).

Details

The amino acid compositions of human proteins are stored in three files under extdata/protein.

On loading the package, the individual data files are read and combined, and the result is assigned to the human.aa object in the canprot environment.

See Also

human_aa gets amino acid compositions for human proteins specified by their UniProt IDs.

Examples

# The number of proteins
nrow(get("human.aa", canprot))

[Package canprot version 2.0.0 Index]