mat.dif {bios2mds} | R Documentation |
Matrices of difference scores between sequences
Description
Computes a matrix of distances, based on difference scores, between the sequences of two multiple sequence alignments.
Usage
mat.dif(align1, align2, gap = FALSE, aa.strict = FALSE, sqrt = FALSE)
Arguments
align1 |
a list of character vectors representing a first multiple sequence alignment. |
align2 |
a list of character vectors representing a second multiple sequence alignment. |
gap |
a logical value indicating whether the gap character should be treated as an additional symbol (TRUE) or not (FALSE). Default is FALSE. |
aa.strict |
a logical value indicating whether only strict amino acids should be taken into account (TRUE) or not (FALSE). To be used only for amino acid sequences. Default is FALSE. |
sqrt |
a logical value indicating whether the distance should be equal to the square root of the difference score (TRUE) or to the difference score itself (FALSE). Default is FALSE. |
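The effect of gap and sqrt can be illustrated with a minimal base R sketch. It assumes that the difference score is the fraction of compared positions at which two aligned sequences differ, and that positions with a gap in either sequence are skipped when gap = FALSE. The helper dif_score is hypothetical and is not part of bios2mds; the package's own computation may differ in its details.

```r
# Hypothetical helper, not part of bios2mds: difference score between two
# aligned sequences given as character vectors of equal length.
dif_score <- function(seq1, seq2, gap = FALSE, sqrt = FALSE) {
  stopifnot(length(seq1) == length(seq2))
  keep <- rep(TRUE, length(seq1))
  if (!gap) {
    # Assumption: positions with a gap in either sequence are ignored.
    keep <- seq1 != "-" & seq2 != "-"
  }
  # Fraction of retained positions at which the two sequences differ.
  score <- sum(seq1[keep] != seq2[keep]) / sum(keep)
  # base::sqrt is called explicitly because the argument is also named sqrt.
  if (sqrt) base::sqrt(score) else score
}

s1 <- c("A", "C", "-", "G", "T")
s2 <- c("A", "G", "T", "G", "-")

dif_score(s1, s2)               # gaps skipped: 1 difference over 3 positions
dif_score(s1, s2, gap = TRUE)   # gaps counted: 3 differences over 5 positions
dif_score(s1, s2, sqrt = TRUE)  # square root of the gap-free score
```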
Details
If align1 and align2 are identical, mat.dif computes the symmetric matrix of distances between all pairs of sequences in the alignment.
Before using mat.dif, users must check that the sequences are correctly aligned, both within align1 and align2 and between align1 and align2.
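One necessary condition for this check is that every sequence in both alignments spans the same number of columns. The sketch below tests only that condition; it cannot tell whether the columns are biologically equivalent, which still has to be verified by the user. The function same_alignment_length is a hypothetical helper, not part of bios2mds.

```r
# Hypothetical check, not part of bios2mds: TRUE when every sequence in
# both alignments has the same number of columns.
same_alignment_length <- function(align1, align2) {
  lens <- c(lengths(align1), lengths(align2))
  length(unique(lens)) == 1
}

a1 <- list(s1 = c("A", "C", "-", "G"), s2 = c("A", "G", "T", "G"))
a2 <- list(t1 = c("A", "C", "T", "G"))
a3 <- list(u1 = c("A", "C", "T"))

same_alignment_length(a1, a2)  # all sequences have 4 columns
same_alignment_length(a1, a3)  # u1 has only 3 columns
```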
Value
A named numeric matrix providing the difference-based distances between each pair of sequences from align1 and align2. The numbers of rows and columns equal the numbers of sequences in align1 and align2, respectively.
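The shape and naming of the result can be sketched in base R with a simplified stand-in. The function dif_matrix below is hypothetical and is not the package's implementation: it uses the plain fraction of differing positions as its score and treats gaps as ordinary symbols. It is meant only to show how rows map to align1 and columns to align2, and why comparing an alignment with itself gives a symmetric matrix with a zero diagonal.

```r
# Hypothetical stand-in for mat.dif, using a simplified score
# (fraction of differing positions, gaps treated as ordinary symbols).
dif_matrix <- function(align1, align2) {
  score <- function(x, y) mean(x != y)
  # Outer sapply runs over align2 (columns), inner over align1 (rows).
  m <- sapply(align2, function(y) sapply(align1, function(x) score(x, y)))
  matrix(m, nrow = length(align1), ncol = length(align2),
         dimnames = list(names(align1), names(align2)))
}

a1 <- list(h1 = c("A", "C", "G", "T"),
           h2 = c("A", "C", "G", "A"),
           h3 = c("T", "C", "G", "A"))
a2 <- list(d1 = c("A", "C", "G", "T"),
           d2 = c("T", "T", "G", "A"))

m <- dif_matrix(a1, a2)
dim(m)        # 3 rows (align1) by 2 columns (align2)
dimnames(m)   # row names from align1, column names from align2

# Comparing an alignment with itself yields a symmetric matrix.
m_self <- dif_matrix(a1, a1)
isSymmetric(m_self)
```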
Author(s)
Julien Pele and Jean-Michel Becu
See Also
identity function from the bio3d package.
Examples
# compute the matrix of distances based on the difference scores
# between GPCR samples from H. sapiens and D. melanogaster:
aln_human <- import.fasta(system.file("msa/human_gpcr.fa", package = "bios2mds"))
aln_drome <- import.fasta(system.file("msa/drome_gpcr.fa", package = "bios2mds"))
mat.dif1 <- mat.dif(aln_human[1:5], aln_drome[1:5])
mat.dif1