skbio.alignment.local_pairwise_align

skbio.alignment.local_pairwise_align(seq1, seq2, gap_open_penalty, gap_extend_penalty, substitution_matrix)[source]

Locally align exactly two seqs with Smith-Waterman

Parameters:

seq1 : str or BiologicalSequence

The first unaligned sequence.

seq2 : str or BiologicalSequence

The second unaligned sequence.

gap_open_penalty : int or float

Penalty for opening a gap (this is substracted from previous best alignment score, so is typically positive).

gap_extend_penalty : int or float

Penalty for extending a gap (this is substracted from previous best alignment score, so is typically positive).

substitution_matrix: 2D dict (or similar)

Lookup for substitution scores (these values are added to the previous best alignment score).

Returns:

skbio.Alignment

Alignment object containing the aligned sequences as well as

details about the alignment.

Notes

This algorithm was originally described in [R110]. The scikit-bio implementation was validated against the EMBOSS water web server [R111].

References

[R110](1, 2) Identification of common molecular subsequences. Smith TF, Waterman MS. J Mol Biol. 1981 Mar 25;147(1):195-7.
[R111](1, 2) http://www.ebi.ac.uk/Tools/psa/emboss_water/