slideWindowSeq {shazam} | R Documentation |
Sliding window approach towards filtering a single sequence
Description
slideWindowSeq
determines whether an input sequence contains equal to or more than
a given number of mutations in a given length of consecutive nucleotides (a "window")
when compared to a germline sequence.
Usage
slideWindowSeq(inputSeq, germlineSeq, mutThresh, windowSize)
Arguments
inputSeq |
input sequence. |
germlineSeq |
germline sequence. |
mutThresh |
threshold on the number of mutations in |
windowSize |
length of consecutive nucleotides. Must be at least 2. |
Value
TRUE
if there are equal to or more than mutThresh
number of mutations
in any window of windowSize
consecutive nucleotides (i.e. the sequence should
be filtered); FALSE
if otherwise.
See Also
calcObservedMutations is called by slideWindowSeq
to identify observed
mutations. See slideWindowDb for applying the sliding window approach on a
data.frame
. See slideWindowTune for parameter tuning for mutThresh
and windowSize
.
Examples
# Use an entry in the example data for input and germline sequence
data(ExampleDb, package="alakazam")
in_seq <- ExampleDb[["sequence_alignment"]][100]
germ_seq <- ExampleDb[["germline_alignment_d_mask"]][100]
# Determine if in_seq has 6 or more mutations in 10 consecutive nucleotides
slideWindowSeq(inputSeq=in_seq, germlineSeq=germ_seq, mutThresh=6, windowSize=10)
slideWindowSeq(inputSeq="TCGTCGAAAA", germlineSeq="AAAAAAAAAA", mutThresh=6, windowSize=10)