padSeqEnds {alakazam} | R Documentation |
Pads ragged ends of aligned DNA sequences
Description
padSeqEnds
takes a vector of DNA sequences, as character strings,
and appends the ends of each sequence with an appropriate number of "N"
characters to create a sequence vector with uniform lengths.
Usage
padSeqEnds(seq, len = NULL, start = FALSE, pad_char = "N", mod3 = TRUE)
Arguments
seq |
character vector of DNA sequence strings. |
len |
length to pad to. Only applies if longer than the maximum length of
the data in |
start |
if |
pad_char |
character to use for padding. |
mod3 |
if |
Value
A modified seq
vector with padded sequences.
See Also
See maskSeqEnds for creating uniform masking from existing masking.
Examples
# Default behavior uniformly pads ragged ends
seq <- c("CCCCTGGG", "ACCCTG", "CCCC")
padSeqEnds(seq)
# Pad to fixed length
padSeqEnds(seq, len=15)
# Add padding to the beginning of the sequences instead of the ends
padSeqEnds(seq, start=TRUE)
padSeqEnds(seq, len=15, start=TRUE)
[Package alakazam version 1.3.0 Index]