Abstract
Motivation: High-throughput sequencing data is rapidly accumulating in public repositories. Making this resource accessible for interactive analysis at scale requires efficient approaches for its storage and indexing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Almodaresi, F., Pandey, P., Ferdman, M., Johnson, R., Patro, R.: An efficient, scalable, and exact representation of high-dimensional color information enabled using de Bruijn graph search. J. Comput. Biol. 27(4), 485–499 (2020)
Bradley, P., Den Bakker, H.C., Rocha, E.P., McVean, G., Iqbal, Z.: Ultrafast search of all deposited bacterial and viral genomic data. Nat. Biotechnol. 37(2), 152–159 (2019)
Danciu, D., Karasikov, M., Mustafa, H., Kahles, A., Rätsch, G.: Topology-based sparsification of graph annotations. Bioinformatics 37(Suppl._1), i169–i176 (2021). https://doi.org/10.1093/bioinformatics/btab330
Karasikov, M., et al.: Metagraph: indexing and analysing nucleotide archives at petabase-scale. bioRxiv (2020). https://doi.org/10.1101/2020.10.01.322164. https://www.biorxiv.org/content/early/2020/11/03/2020.10.01.322164
Marchet, C., Iqbal, Z., Gautheret, D., Salson, M., Chikhi, R.: REINDEER: efficient indexing of k-mer presence and abundance in sequencing datasets. Bioinformatics 36(Suppl._1), i177–i185 (2020). https://doi.org/10.1093/bioinformatics/btaa487
Pandey, P., Almodaresi, F., Bender, M.A., Ferdman, M., Johnson, R., Patro, R.: Mantis: a fast, small, and exact large-scale sequence-search index. Cell Syst. 7(2), 201–207.e4 (2018)
Acknowledgments
M. K. and H. M. are funded as part of Swiss National Research Programme (NRP) 75 “Big Data” by the SNSF grant #407540_167331. M. K., H. M., and A. K. are also partially funded by ETH core funding (to G. R.).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Karasikov, M., Mustafa, H., Rätsch, G., Kahles, A. (2022). Lossless Indexing with Counting de Bruijn Graphs. In: Pe'er, I. (eds) Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science(), vol 13278. Springer, Cham. https://doi.org/10.1007/978-3-031-04749-7_34
Download citation
DOI: https://doi.org/10.1007/978-3-031-04749-7_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-04748-0
Online ISBN: 978-3-031-04749-7
eBook Packages: Computer ScienceComputer Science (R0)