Digest {OrgMassSpecR} | R Documentation |
Predict Peptides Resulting from Enzymatic Digest
Description
Cleave an amino acid sequence (a protein or peptide) according to enzyme specific rules and calculate the precursor ion m/z values.
Usage
Digest(sequence, enzyme = "trypsin", missed = 0, IAA = TRUE,
N15 = FALSE, custom = list())
Arguments
sequence |
a character string representing the amino acid sequence to be cleaved by the enzyme. |
enzyme |
a character string specifying the rules for cleavage. Options are |
missed |
the maximum number of missed cleavages. Must be an integer of 0 (default) or greater. An error will result if the specified number of missed cleavages is greater than the maximum possible number of missed cleavages. |
IAA |
logical. |
N15 |
logical indicating if the nitrogen-15 isotope should be used in place of the default nitrogen-14 isotope. |
custom |
a list specifying user defined residues as |
Details
The amino acid residues must be specified by one letter codes. The predefined residues are:
A = alanine | L = leucine |
R = arginine | K = lysine |
N = asparagine | M = methionine |
D = aspartic acid | F = phenylalanine |
C = cysteine | P = proline |
E = glutamic acid | S = serine |
Q = glutamine | T = threonine |
G = glycine | W = tryptophan |
H = histidine | Y = tyrosine |
I = isoleucine | V = valine |
If "trypsin"
is specified, the sequence is cleaved on the c-terminal side of K and R residues, except if K or R is followed by P. If "trypsin.strict"
is specified, the sequence is cleaved on the c-terminal side of K and R residues. If "pepsin"
is specified, the sequence is cleaved on the c-terminal side of F, L, W, Y, A, E, and Q residues. This rule is specific to pepsin at pH >
2, as used in hydrogen-deuterium exchange experiments.
When "trypsin"
is specified, KP and RP are not considered missed cleavages when missed
is >
0.
The argument IAA
specifies treatment of the protein with iodoacetamide. This treatment produces iodoacetylated cysteine residues (elemental formula C5H8N2O2S).
If TRUE
, the argument N15
specifies 100% nitrogen-15 incorporation. It is intended for proteins grown with a nitrogen-15 labeled food source. (Although the experiment itself may grow a protein with less than 100% nitrogen-15 incorporation). Setting N15 = TRUE
does not modify the mass of a custom residue, or the mass of the nitrogen(s) added if IAA = TRUE
.
If a custom residue code is identical to a predefined residue code, the custom residue mass will be used in place of the predefined mass.
The error message “object "mass" not found
” indicates the input sequence contains an undefined residue(s).
Value
A data frame with the following column names.
peptide |
resulting peptides. |
start |
beginning residue positions in the the original sequence. |
end |
ending residue positions in the the original sequence. |
mc |
number of missed cleavages. |
mz1 |
monoisotopic m/z values for the |
mz2 |
monoisotopic m/z values for the |
mz3 |
monoisotopic m/z values for the |
Author(s)
Nathan G. Dodder
References
The relative atomic masses of the isotopes are from the NIST Physical Reference Data Website http://physics.nist.gov/PhysRefData/Compositions/. The molar mass of a proton (H+) is from the NIST CODATA Website http://physics.nist.gov/cuu/Constants/index.html.
See Also
MonoisotopicMass
, FragmentPeptide
Examples
## digest human serum albumin with 0 and 1 missed cleavages
Digest(example.sequence, missed = 1)
## digest human serum albumin with a phosphoserine at position 58
## and all methionines oxidized
modifiedHsaSequence <- strsplit(example.sequence, split = "")[[1]]
modifiedHsaSequence[58] <- "s" # insert code for phosphoserine
modifiedHsaSequence <- paste(modifiedHsaSequence, collapse = "")
Digest(modifiedHsaSequence, custom = list(code = c("s","M"),
mass = c(MonoisotopicMass(list(C=3, H=6, N=1, O=5, P=1)),
MonoisotopicMass(list(C=5, H=9, N=1, O=2, S=1)))))
## digest human serum albumin with strict rules
Digest(example.sequence, enzyme = "trypsin.strict")