Abstract
The MSA program, written and distributed in 1989, is one of the few existing programs that attempts to find optimal alignments of multiple protein or DNA sequences. MSA implements a branch-and-bound technique on a variant of Dijkstra's shortest paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage of MSA. On some runs, we achieve an order of magnitude reduction in space usage and a significant multiplicative factor speedup in running time. To explain these improvements, we give a much more detailed description of MSA than has been previously available.
Some of the work of this author was carried out at the Department of Computer Science of the University of California at Davis
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
S. F. Altschul. Gap costs for multiple sequence alignment. J. Theor. Biol., 138:297–309, 1989.
S. F. Altschul, Raymond J. Carroll, and David J. Lipman. Weights for data related by a tree. J. Molecular Biology, 207:647–653, 1989.
G. J. Barton and M. J. E. Sternberg. Evaluation and improvements in the automatic alignment of protein sequences. J. Mol. Biol., 198:327–337, 1987.
G. J. Barton and M. J. E. Sternberg. A strategy for the rapid multiple alignment of protein sequences. Protein Engineering, 1:89–94, 1987.
H. Bodlaender, R. G. Downey, M. R. Fellows, and H. T. Wareham. The parameterized complexity of sequence alignment and consensus. In Proc. of the 5th Symp. on Combinatorial Pattern Matching, Lecture Notes Comp. Sci. 807, pages 15–30, 1994.
H. Carrillo and D. Lipman. The multiple sequence alignment problem in biology. SLAM J. Appl. Math., 48:1073–1082, 1988.
S. C. Chan, A. K. C. Wong, and D. K. Y. Chiu. A survey of multiple sequence comparison methods. Bulletin of Mathematical Biology, 54:563–598, 1992.
E. W. Dijkstra. A note on two problems in connexion with graphs. Numerische Mathematik, 1:269–271, 1959.
D. Feng and R. Doolittle. Progressive sequence alignment as a prerequisite to correct phylogenetic trees. J. Molecular Evol., 25:351–360, 1987.
D. G. Higgins, A. J. Bleasby, and R. Fuchs. Clustal v: improved software for multiple sequence alignment. CABIOS, 8:189–191, 1992.
J. Kececioglu. Notes on an approach of Carrillo and Lipman to minimum sum of pairs multiple sequence alignment. Unpublished notes, 1989.
J. Kececioglu. The maximum weight trace problem in multiple sequence alignment. In Proc. of the 4th Symp. on Combinatorial Pattern Matching, Springer-Verlag Lecture Notes in Comp. Sci. 684, pages 106–119, 1993.
D. J. Lipman, S. F. Altschul, and J. D. Kececioglu. A tool for multiple sequence alignment. Proc. Natl. Acad. Sci. USA., 86:4412–4415, 1989.
D. Maier. The complexity of some problems on subsequences and supersequences. J. ACM, 25:322–336, 1978.
M. A. McClure, T. K. Vasi, and W. M. Fitch. Comparative analysis of multiple protein-sequence alignment methods. Mol. Biol. Evol., 11:571–592, 1994.
S. Subbiah and S. C. Harrison. A method for multiple sequence alignment with gaps. J. Mol. Biol., 209:539–548, 1989.
W. R. Taylor. Multiple sequence alignment by a pairwise algorithm. CABIOS, 3:81–87, 1987.
W. R. Taylor. A flexible method to align large numbers of biological sequences. Journal of Molecular Evolution, 28:161–169, 1988.
L. Wang and T. Jiang. On the complexity of multiple sequence alignment. J. Computational Biology, 1:337–348, 1994.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gupta, S.K., Kececioglu, J.D., Schäffer, A.A. (1995). Making the shortest-paths approach to sum-of-pairs multiple sequence alignment more space efficient in practice. In: Galil, Z., Ukkonen, E. (eds) Combinatorial Pattern Matching. CPM 1995. Lecture Notes in Computer Science, vol 937. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60044-2_39
Download citation
DOI: https://doi.org/10.1007/3-540-60044-2_39
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60044-2
Online ISBN: 978-3-540-49412-6
eBook Packages: Springer Book Archive