{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,2]],"date-time":"2024-06-02T04:28:19Z","timestamp":1717302499331},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,5,5]],"date-time":"2023-05-05T00:00:00Z","timestamp":1683244800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,5,5]],"date-time":"2023-05-05T00:00:00Z","timestamp":1683244800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000051","name":"National Human Genome Research Institute","doi-asserted-by":"publisher","award":["R01HG011392"],"id":[{"id":"10.13039\/100000051","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000057","name":"National Institute of General Medical Sciences","doi-asserted-by":"publisher","award":["R35GM139602"],"id":[{"id":"10.13039\/100000057","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000076","name":"Directorate for Biological Sciences","doi-asserted-by":"crossref","award":["2029552"],"id":[{"id":"10.13039\/100000076","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Algorithms Mol Biol"],"abstract":"Abstract<\/jats:title>We present a new method and software tool called that applies a pangenome index to the problem of inferring genotypes from short-read sequencing data. The method uses a novel indexing structure called the marker array. Using the marker array, we can genotype variants with respect from large panels like the 1000 Genomes Project while reducing the reference bias that results when aligning to a single linear reference. can infer accurate genotypes in less time and memory compared to existing graph-based methods. The method is implemented in the open source software tool available at https:\/\/github.com\/alshai\/rowbowt<\/jats:ext-link>.<\/jats:p>","DOI":"10.1186\/s13015-023-00225-3","type":"journal-article","created":{"date-parts":[[2023,5,5]],"date-time":"2023-05-05T10:02:17Z","timestamp":1683280937000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Pangenomic genotyping with the marker array"],"prefix":"10.1186","volume":"18","author":[{"given":"Taher","family":"Mun","sequence":"first","affiliation":[]},{"given":"Naga Sai Kavya","family":"Vaddadi","sequence":"additional","affiliation":[]},{"given":"Ben","family":"Langmead","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,5,5]]},"reference":[{"issue":"7","key":"225_CR1","doi-asserted-by":"publisher","first-page":"1104","DOI":"10.1038\/s41588-021-00877-0","volume":"53","author":"RW Davies","year":"2021","unstructured":"Davies RW, Kucka M, Su D, Shi S, Flanagan M, Cunniff CM, Chan YF, Myers S. Rapid genotype imputation from sequence with reference panels. Nat Genet. 2021;53(7):1104\u201311.","journal-title":"Nat Genet"},{"key":"225_CR2","doi-asserted-by":"publisher","first-page":"14","DOI":"10.1016\/j.plantsci.2015.04.016","volume":"242","author":"C Kim","year":"2016","unstructured":"Kim C, Guo H, Kong W, Chandnani R, Shuang LS, Paterson AH. Application of genotyping by sequencing technology to a variety of crop breeding programs. Plant Sci. 2016;242:14\u201322.","journal-title":"Plant Sci"},{"issue":"7571","key":"225_CR3","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1038\/nature15393","volume":"526","author":"A Auton","year":"2015","unstructured":"Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, Abecasis GR, et al. A global reference for human genetic variation. Nature. 2015;526(7571):68\u201374.","journal-title":"Nature"},{"issue":"5","key":"225_CR4","doi-asserted-by":"publisher","first-page":"849","DOI":"10.1101\/gr.213611.116","volume":"27","author":"VA Schneider","year":"2017","unstructured":"Schneider VA, Graves-Lindsay T, Howe K, Bouk N, Chen HC, Kitts PA, Murphy TD, Pruitt KD, Thibaud-Nissen F, Albracht D, et al. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly. Genome Res. 2017;27(5):849\u201364.","journal-title":"Genome Res"},{"issue":"7","key":"225_CR5","doi-asserted-by":"publisher","first-page":"1008302","DOI":"10.1371\/journal.pgen.1008302","volume":"15","author":"T G\u00fcnther","year":"2019","unstructured":"G\u00fcnther T, Nettelblad C. The presence and impact of reference bias on population genomic studies of prehistoric human populations. PLoS Genet. 2019;15(7):1008302.","journal-title":"PLoS Genet"},{"issue":"5","key":"225_CR6","doi-asserted-by":"publisher","first-page":"931","DOI":"10.1534\/g3.114.015784","volume":"5","author":"DY Brandt","year":"2015","unstructured":"Brandt DY, Aguiar VR, Bitarello BD, Nunes K, Goudet J, Meyer D. Mapping bias overestimates reference allele frequencies at the HLA genes in the 1000 genomes project phase I data. G3 (Bethesda). 2015;5(5):931\u201341.","journal-title":"G3 (Bethesda)"},{"issue":"1","key":"225_CR7","doi-asserted-by":"publisher","first-page":"30","DOI":"10.1038\/s41588-018-0273-y","volume":"51","author":"RM Sherman","year":"2019","unstructured":"Sherman RM, Forman J, Antonescu V, Puiu D, Daya M, Rafaels N, Boorgula MP, Chavan S, Vergara C, Ortega VE, et al. Assembly of a pan-genome from deep sequencing of 910 humans of African descent. Nat Genet. 2019;51(1):30\u20135.","journal-title":"Nat Genet"},{"key":"225_CR8","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1016\/j.isci.2019.07.011","volume":"18","author":"L Denti","year":"2019","unstructured":"Denti L, Previtali M, Bernardini G, Sch\u00f6nhuth A, Bonizzoni P. MALVA: genotyping by mapping-free ALlele detection of known VAriants. iScience. 2019;18:20\u20137.","journal-title":"iScience"},{"issue":"17","key":"225_CR9","doi-asserted-by":"publisher","first-page":"538","DOI":"10.1093\/bioinformatics\/btw460","volume":"32","author":"A Shajii","year":"2016","unstructured":"Shajii A, Yorukoglu D, William Yu Y, Berger B. Fast genotyping of known SNPs through approximate k-mer matching. Bioinformatics. 2016;32(17):538\u201344.","journal-title":"Bioinformatics."},{"issue":"1","key":"225_CR10","doi-asserted-by":"publisher","first-page":"220","DOI":"10.1186\/s13059-018-1595-x","volume":"19","author":"J Pritt","year":"2018","unstructured":"Pritt J, Chen NC, Langmead B. FORGe: prioritizing variants for graph genomes. Genome Biol. 2018;19(1):220.","journal-title":"Genome Biol"},{"issue":"1","key":"225_CR11","doi-asserted-by":"publisher","first-page":"291","DOI":"10.1186\/s13059-019-1909-7","volume":"20","author":"S Chen","year":"2019","unstructured":"Chen S, Krusche P, Dolzhenko E, Sherman RM, Petrovski R, Schlesinger F, Kirsche M, Bentley DR, Schatz MC, Sedlazeck FJ, Eberle MA. Paragraph: a graph-based structural variant genotyper for short-read sequence data. Genome Biol. 2019;20(1):291.","journal-title":"Genome Biol"},{"issue":"9","key":"225_CR12","doi-asserted-by":"publisher","first-page":"875","DOI":"10.1038\/nbt.4227","volume":"36","author":"E Garrison","year":"2018","unstructured":"Garrison E, Sir\u00e9n J, Novak AM, Hickey G, Eizenga JM, Dawson ET, Jones W, Garg S, Markello C, Lin MF, Paten B, Durbin R. Variation graph toolkit improves read mapping by representing genetic variation in the reference. Nat Biotechnol. 2018;36(9):875\u20139.","journal-title":"Nat Biotechnol"},{"issue":"1","key":"225_CR13","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1186\/s13059-020-02229-3","volume":"22","author":"NC Chen","year":"2021","unstructured":"Chen NC, Solomon B, Mun T, Iyer S, Langmead B. Reference flow: reducing reference bias using multiple population genomes. Genome Biol. 2021;22(1):8.","journal-title":"Genome Biol"},{"issue":"6574","key":"225_CR14","doi-asserted-by":"publisher","first-page":"8871","DOI":"10.1126\/science.abg8871","volume":"374","author":"J n","year":"2021","unstructured":"n J, Monlong J, Chang X, Novak AM, Eizenga JM, Markello C, Sibbesen JA, Hickey G, Chang PC, Carroll A, Gupta N, Gabriel S, Blackwell TW, Ratan A, Taylor KD, Rich SS, Rotter JI, Haussler D, Garrison E, Paten B. Pangenomics enables genotyping of known structural variants in 5202 diverse genomes. Science. 2021;374(6574):8871.","journal-title":"Science"},{"issue":"7","key":"225_CR15","doi-asserted-by":"publisher","first-page":"1054","DOI":"10.1038\/s41588-018-0145-5","volume":"50","author":"JA Sibbesen","year":"2018","unstructured":"Sibbesen JA, Maretty L, Krogh A. Accurate genotyping across variant classes and lengths using variant graphs. Nat Genet. 2018;50(7):1054\u20139.","journal-title":"Nat Genet"},{"issue":"4","key":"225_CR16","doi-asserted-by":"publisher","first-page":"518","DOI":"10.1038\/s41588-022-01043-w","volume":"54","author":"J Ebler","year":"2022","unstructured":"Ebler J, Ebert P, Clarke WE, Rausch T, Audano PA, Houwaart T, Mao Y, Korbel JO, Eichler EE, Zody MC, et al. Pangenome-based genome inference allows efficient and accurate genotyping across a wide spectrum of variant classes. Nat Genet. 2022;54(4):518\u201325.","journal-title":"Nat Genet"},{"key":"225_CR17","doi-asserted-by":"crossref","unstructured":"Gagie T, Navarro G, Prezza N. Optimal-Time Text Indexing in BWT-runs Bounded Space. In: Proceedings of the 29th Annual Symposium on Discrete Algorithms (SODA), pp. 1459\u20131477; 2018.","DOI":"10.1137\/1.9781611975031.96"},{"issue":"4","key":"225_CR18","doi-asserted-by":"publisher","first-page":"500","DOI":"10.1089\/cmb.2019.0309","volume":"27","author":"A Kuhnle","year":"2020","unstructured":"Kuhnle A, Mun T, Boucher C, Gagie T, Langmead B, Manzini G. Efficient construction of a complete index for pan-genomics read alignment. J Comput Biol. 2020;27(4):500\u201313.","journal-title":"J Comput Biol"},{"issue":"6","key":"225_CR19","doi-asserted-by":"publisher","DOI":"10.1016\/j.isci.2021.102696","volume":"24","author":"O Ahmed","year":"2021","unstructured":"Ahmed O, Rossi M, Kovaka S, Schatz MC, Gagie T, Boucher C, Langmead B. Pan-genomic matching statistics for targeted nanopore sequencing. iScience. 2021;24(6): 102696.","journal-title":"iScience"},{"key":"225_CR20","unstructured":"Burrows M, Wheeler DJ. A block sorting lossless data compression algorithm. Technical Report 124, Digital Equipment Corporation 1994."},{"key":"225_CR21","unstructured":"Ferragina P, Manzini G. Opportunistic data structures with applications. In: Proceedings of the 41st Annual Symposium on Foundations of Computer Science (FOCS), pp. 390\u2013398; 2000."},{"issue":"1","key":"225_CR22","doi-asserted-by":"publisher","first-page":"1784","DOI":"10.1038\/s41467-018-08148-z","volume":"10","author":"MJP Chaisson","year":"2019","unstructured":"Chaisson MJP, Sanders AD, Zhao X, Malhotra A, Porubsky D, Rausch T, Gardner EJ, Rodriguez OL, Guo L, Collins RL, et al. Multi-platform discovery of haplotype-resolved structural variation in human genomes. Nat Commun. 2019;10(1):1784.","journal-title":"Nat Commun"},{"key":"225_CR23","doi-asserted-by":"publisher","first-page":"6537","DOI":"10.1126\/science.abf7117","volume":"372","author":"P Ebert","year":"2021","unstructured":"Ebert P, Audano PA, Zhu Q, Rodriguez-Martin B, Porubsky D, Bonder MJ, Sulovari A, Ebler J, Zhou W, Serra Mari R, et al. Haplotype-resolved diverse human genomes and integrated analysis of structural variation. Science. 2021;372:6537.","journal-title":"Science."},{"issue":"2","key":"225_CR24","doi-asserted-by":"publisher","first-page":"169","DOI":"10.1089\/cmb.2021.0290","volume":"29","author":"M Rossi","year":"2022","unstructured":"Rossi M, Oliva M, Langmead B, Gagie T, Boucher C. MONI: a pangenomic index for finding maximal exact matches. J Comput Biol. 2022;29(2):169\u201387.","journal-title":"J Comput Biol"},{"issue":"15","key":"225_CR25","doi-asserted-by":"publisher","first-page":"2156","DOI":"10.1093\/bioinformatics\/btr330","volume":"27","author":"P Danecek","year":"2011","unstructured":"Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, et al. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156\u20138.","journal-title":"Bioinformatics"},{"issue":"21","key":"225_CR26","doi-asserted-by":"publisher","first-page":"2987","DOI":"10.1093\/bioinformatics\/btr509","volume":"27","author":"H Li","year":"2011","unstructured":"Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011;27(21):2987\u201393.","journal-title":"Bioinformatics"},{"key":"225_CR27","doi-asserted-by":"crossref","unstructured":"Gog S, Beller T, Moffat A, Petri M. From theory to practice: Plug and play with succinct data structures. In: 13th International Symposium on Experimental Algorithms, (SEA 2014), pp. 326\u2013337; 2014.","DOI":"10.1007\/978-3-319-07959-2_28"},{"key":"225_CR28","doi-asserted-by":"crossref","unstructured":"Belazzougui D, Cunial F, Gagie T, Prezza N, Raffinot M. Flexible indexing of repetitive collections. In: Kari J, Manea F, Petre I., editors. Unveiling dynamics and complexity. vol. 10307, pp. 162\u2013174. Springer, Cham; 2017. Series Title: Lecture Notes in Computer Science.","DOI":"10.1007\/978-3-319-58741-7_17"},{"key":"225_CR29","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1016\/j.jbiotec.2017.07.017","volume":"261","author":"K Reinert","year":"2017","unstructured":"Reinert K, Dadi TH, Ehrhardt M, Hauswedell H, Mehringer S, Rahn R, Kim J, Pockrandt C, Winkler J, Siragusa E, Urgese G, Weese D. The SeqAn C++ template library for efficient sequence analysis: A resource for programmers. J Biotechnol. 2017;261:157\u201368.","journal-title":"J Biotechnol"},{"issue":"4","key":"225_CR30","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1038\/nmeth.1923","volume":"9","author":"B Langmead","year":"2012","unstructured":"Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357\u20139.","journal-title":"Nat Methods"},{"key":"225_CR31","doi-asserted-by":"crossref","unstructured":"Wagner J, Olson ND, Harris L, McDaniel J, Cheng H, Fungtammasan A, Hwang Y-C, Gupta R, Wenger AM, Rowell WJ, et al. Towards a comprehensive variation benchmark for challenging medically-relevant autosomal genes. 2021.","DOI":"10.1101\/2021.06.07.444885"},{"key":"225_CR32","unstructured":"NIST: Medically Relevant Genes. [Online]. Available from: https:\/\/github.com\/usnistgov\/cmrg-benchmarkset-manuscript\/tree\/master\/data\/gene_coords\/unsorted\/GRCh38_mrg_full_gene.bed. Accessed 19 Mar 2023."},{"key":"225_CR33","doi-asserted-by":"publisher","first-page":"33","DOI":"10.12688\/f1000research.29032.2","volume":"10","author":"F M\u00f6lder","year":"2021","unstructured":"M\u00f6lder F, Jablonski KP, Letcher B, Hall MB, Tomkins-Tinch CH, Sochat V, Forster J, Lee S, Twardziok SO, Kanitz A, Wilm A, Holtgrewe M, Rahmann S, Nahnsen S, K\u00f6ster J. Sustainable data analysis with Snakemake. F1000 Res. 2021;10:33.","journal-title":"F1000 Res"},{"issue":"17","key":"225_CR34","doi-asserted-by":"publisher","first-page":"2759","DOI":"10.1093\/bioinformatics\/btx304","volume":"33","author":"M Kokot","year":"2017","unstructured":"Kokot M, Dlugosz M, Deorowicz S. KMC 3: counting and manipulating k-mer statistics. Bioinformatics. 2017;33(17):2759\u201361.","journal-title":"Bioinformatics"},{"key":"225_CR35","unstructured":"Goga A, Bal\u00e1\u017e A, Petescia A, Gagie T. MARIA: multiple-alignment $$r$$-index with aggregation. 2022. arXiv 2209.09218."}],"container-title":["Algorithms for Molecular Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13015-023-00225-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13015-023-00225-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13015-023-00225-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,5]],"date-time":"2023-05-05T10:03:34Z","timestamp":1683281014000},"score":1,"resource":{"primary":{"URL":"https:\/\/almob.biomedcentral.com\/articles\/10.1186\/s13015-023-00225-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,5]]},"references-count":35,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["225"],"URL":"https:\/\/doi.org\/10.1186\/s13015-023-00225-3","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2022.05.19.492566","asserted-by":"object"}]},"ISSN":["1748-7188"],"issn-type":[{"value":"1748-7188","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,5,5]]},"assertion":[{"value":"31 March 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 April 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 May 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"2"}}