Abstract
Sequencing genomes is a fundamental aspect of biological research. Shotgun sequencing, since introduced by Sanger et al [2], has remained the mainstay in the research field of genome sequence assembly. This method randomly obtains sequence reads (e.g. a subsequence including about 500 characters) from a genome and then assemblies them into contigs based on significant overlap among them. The whole-genome shotgun (WGS) approach, generates sequence reads directly from a whole-genome library and uses computational techniques to reassemble them. A variety of assembly programs have been previously proposed and implemented, including PHRAP [3] (Green 1994), CAP3 [4] (1999), Celera [5] (2000) etc. Because of great computational complexity and increasingly large size, they incur great time and space overhead. PHRAP [3], for instance, which can only run in a stand-alone way, requires many times memory (usually greater than 10) as the size of original sequence data. In realistic applications, sequencing process might come to become unacceptably slow for insufficient memory even with a mainframe with huge RAM.
This research is supported in part by the Key Program of National Natural Science Foundation of China (No. 69933010 and 60303008), and China National 863 High-Tech Projects (No. 2002AA4Z3430 and 2002AA231041)
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic Local Alignment Search Tool. J. Mol. Biol. 215, 403–410 (1990)
Sanger, F., Nicklen, S., Coulsen, A.R.: DNA Sequencing with Chain Terminating Inhibitors. Proc. Natl. Acad. Sci 74, 5463–5467 (1977)
Green, P.: PHRAP Documentation (1994), http://www.phrap.org
Huang, X., Madan, A.: CAP3: A DNA Sequence Assembly Program. Genome Res. 9, 868–877 (1999)
Myers, E.W., Sutton, G.G., Delcher, A.L., Dew, I.M., et al.: A Whole-genome Assembly of Drosophila. Science 287, 2196–2204 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tang, J., Huang, D., Wang, C., Wang, W., Shi, B. (2004). GiSA: A Grid System for Genome Sequences Assembly. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, TW. (eds) Conceptual Modeling – ER 2004. ER 2004. Lecture Notes in Computer Science, vol 3288. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30464-7_63
Download citation
DOI: https://doi.org/10.1007/978-3-540-30464-7_63
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23723-5
Online ISBN: 978-3-540-30464-7
eBook Packages: Springer Book Archive