| removeGaps {protr} | R Documentation | 
Remove or replace gaps from protein sequences.
Description
Remove/replace gaps or any irregular characters from protein sequences, to make them suitable for feature extraction or sequence alignment based similarity computation.
Usage
removeGaps(x, pattern = "-", replacement = "", ...)
Arguments
| x | character vector, containing the input protein sequence(s). | 
| pattern | character string contains the gap (or other irregular)
character to be removed or replaced. Default is  | 
| replacement | a replacement for matched characters.
Default is  | 
| ... | addtional parameters for  | 
Value
a vector of protein sequence(s) with gaps or irregular characters removed/replaced.
Author(s)
Nan Xiao <https://nanx.me>
Examples
# amino acid sequences that contain gaps ("-")
aaseq <- list(
  "MHGDTPTLHEYMLDLQPETTDLYCYEQLSDSSE-EEDEIDGPAGQAEPDRAHYNIVTFCCKCDSTLRLCVQS",
  "MHGDTPTLHEYMLDLQPETTDLYCYEQLNDSSE-EEDEIDGPAGQAEPDRAHYNIVTFCCKCDSTLRLCVQS"
)
## Not run: 
#' # gaps create issues for alignment
parSeqSim(aaseq)
# remove the gaps
nogapseq <- removeGaps(aaseq)
parSeqSim(nogapseq)
## End(Not run)
[Package protr version 1.7-2 Index]