Association Mapping of Complex Diseases with Ancestral Recombination Graphs: Models and Efficient Algorithms | SpringerLink
Skip to main content

Association Mapping of Complex Diseases with Ancestral Recombination Graphs: Models and Efficient Algorithms

  • Conference paper
Research in Computational Molecular Biology (RECOMB 2007)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 4453))

Abstract

Association, or LD (linkage disequilibrium), mapping is an intensely-studied approach to gene mapping (genome-wide or in candidate regions) that is widely hoped to be able to efficiently locate genes influencing both complex and Mendelian traits. The logic underlying association mapping implies that the best possible mapping results would be obtained if the genealogical history of the sampled individuals were explicitly known. Such a history would be in the form of an “ancestral recombination graph (ARG)”. But despite the conceptual importance of genealogical histories to association mapping, few practical association mapping methods have explicitly used derived genealogical aspects of ARGs. Two notable exceptions are [35] and [23].

In this paper we develop an association mapping method that explicitly constructs and samples minARGs (ARGs that minimize the number of recombinations). We develop an ARG sampling method that provably samples minARGs uniformly at random, and that is practical for moderate sized datasets. We also develop a different, faster, ARG sampling method that still samples from a well-defined subspace of ARGs, and that is practical for larger sized datasets. We present novel efficient algorithms on extensions of the “phenotype likelihood” problem, a key step in the method in [35]. We also prove that computing the phenotype likelihood for a different natural extension of the penetrance model in [35] is NP-hard, answering a question unresolved in that paper. Finally, we put all of these results into practice, and examine how well the implemented methods perform, compared to the results in [35]. The empirical results show great speed ups, and definite but sometimes small, improvements in mapping accuracy. Speed is particularly important in doing genome-wide scans for causative mutations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 5719
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 7149
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Bafna, V., Bansal, V.: The number of recombination events in a sample history: conflict graph and lower bounds. IEEE/ACM Trans. on Computational Biology and Bioinformatics 1, 78–90 (2004)

    Article  Google Scholar 

  2. Bafna, V., Bansal, V.: Inference about Recombination from Haplotype Data: Lower Bounds and Recombination Hotspots. J. of Comp. Bio. 13, 501–521 (2006)

    Article  MathSciNet  Google Scholar 

  3. Bordewich, M., Semple, C.: On the computational complexity of the rooted subtree prune and regraft distance. Annals of Combinatorics 8, 409–423 (2004)

    Article  MATH  MathSciNet  Google Scholar 

  4. Brown, D.: Private communications.

    Google Scholar 

  5. Clark, A.G.: Finding genes underlying risk of complex disease by linkage disequilibrium mapping. Current Opinion in Genetics and Development 13, 296–302 (2003)

    Article  Google Scholar 

  6. Felsenstein, J.: Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol. 17, 368–376 (1981)

    Article  Google Scholar 

  7. Griffiths, R.C., Marjoram, P.: Ancestral inference from samples of DNA sequences with recombination. J. of Comp. Bio. 3, 479–502 (1996)

    Article  Google Scholar 

  8. Gusfield, D.: Optimal, efficient reconstruction of Root-Unknown phylogenetic networks with constrained and structured recombination. JCSS 70, 381–398 (2005)

    MATH  MathSciNet  Google Scholar 

  9. Gusfield, D., Eddhu, S., Langley, C.: Optimal, efficient reconstruction of phylogenetic networks with constrained recombination. J. Bioinformatics and Computational Biology 2, 173–213 (2004)

    Article  Google Scholar 

  10. Gusfield, D., Eddhu, S., Langley, C.: The fine structure of galls in phylogenetic networks. INFORMS J. on Computing 16, 459–469 (2004)

    Article  MathSciNet  Google Scholar 

  11. Hein, J.: Reconstructing evolution of sequences subject to recombination using parsimony. Math. Biosci. 98, 185–200 (1990)

    Article  MATH  MathSciNet  Google Scholar 

  12. Hein, J.: A heuristic method to reconstruct the history of sequences subject to recombination. J. Mol. Evol. 36, 396–405 (1993)

    Article  Google Scholar 

  13. Hein, J., Schierup, M., Wiuf, C.: Gene Genealogies, Variation and Evolution: A primer in coalescent theory. Oxford University Press, Oxford (2005)

    MATH  Google Scholar 

  14. Hinds, D., Stuve, L., Nilsen, G., Halperin, E., Eskin, E., Gallinger, D., Frazer, K., Cox, D.: Whole-Genome Patterns of Common DNA variation in three human populations. Science 307, 1072–1079 (2005)

    Article  Google Scholar 

  15. Hudson, R., Kaplan, N.: Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111, 147–164 (1985)

    Google Scholar 

  16. International HapMap Consortium: The HapMap project. Nature 426, 789–796 (2003)

    Article  Google Scholar 

  17. International HapMap Consortium: A haplotype map of the human genome. Nature 437, 1299–1320 (2005)

    Article  Google Scholar 

  18. Kerem, B., Rommens, J.M., Buchanan, J.A., Markiewicz, D., Cox, T.K., Chakravarti, A., Buchwald, M., Tsui, L.C.: Identification of the cystic fibrosis gene: genetic analysis. Science 245, 1073–1080 (1989)

    Article  Google Scholar 

  19. Larribe, F., Lessard, S., Schork, N.J.: Gene mapping via Ancestral Recombination Graph. Theor. Popul. Biol. 62, 215–229 (2002)

    Article  MATH  Google Scholar 

  20. Li, N., Stephens, M.: Modeling Linkage Disequilibrium, and identifying recombination hotspots using SNP data. Genetics 165, 2213–2233 (2003)

    Google Scholar 

  21. Lyngso, R., Song, Y.S., Hein, J.: Minimum Recombination Histories by Branch and Bound. In: Proceedings of Workshop on Algorithm of Bioinformatics (WABI), vol. 3692, pp. 239-250 (2005)

    Google Scholar 

  22. McPeek, M.S., Strahs, A.: Assessment of linkage disequilibrium by the decay of haplotype sharing, with application to fine-scale genetic mapping. Am. J. Hum. Genet. 65, 858–875 (1999)

    Article  Google Scholar 

  23. Minichiello, M., Durbin, R.: Mapping trait loci using inferred ancestral recombination graphs. Am. J. Hum. Genet. 79, 910–922 (2006)

    Article  Google Scholar 

  24. Morris, A.P., Whittaker, J.C., Balding, D.J.: Fine-scale mapping of disease loci via shattered coalescent modeling of genealogies. Am. J. Hum. Genet. 70, 686–707 (2002)

    Article  Google Scholar 

  25. Myers, S.R., Griffiths, R.C.: Bounds on the minimum number of recombination events in a sample history. Genetics 163, 375–394 (2003)

    Google Scholar 

  26. Norborg, M., Tavare, S.: Linkage disequilibrium: what history has to tell us. Trends in Genetics 18, 83–90 (2002)

    Article  Google Scholar 

  27. Rannala, B., Reeve, J.P.: High-resolution multipoint linkage-disequilibrium mapping in the context of a human genome sequence. Am. J. Hum. Genet. 69, 159–178 (2001)

    Article  Google Scholar 

  28. Risch, N., Merikangas, K.: The Future of Genetic Studies of Complex Human Diseases. Science 275, 1516–1517 (1996)

    Article  Google Scholar 

  29. Song, Y., Hein, J.: Parsimonious reconstruction of sequence evolution and haplotype blocks: Finding the minimum number of recombination events. In: Proc. of 2003 Workshop on Algorithms in Bioinformatics (WABI) (2003)

    Google Scholar 

  30. Song, Y., Hein, J.: On the Minimum Number of Recombination Events in the Evolutionary History of DNA Sequences. J. of Math. Biology 48, 160–186 (2003)

    Article  MathSciNet  Google Scholar 

  31. Song, Y.S., Wu, Y., Gusfield, D.: Efficient computation of close lower and upper bounds on the minimum number of needed recombinations in the evolution of biological sequences (Proceedings of ISMB 2005). Bioinformatics 421, i413–i422 (2005)

    Article  Google Scholar 

  32. Stephens, M., Smith, N., Donnelly, P.: A new statistical method for haplotype reconstruction from population data. Am. J. Hum. Genet. 68, 978–989 (2001)

    Article  Google Scholar 

  33. Templeton, A.R., Boerwinkle, E., Sing, C.F.: A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping. I. Basic theory and an analysis of alcohol dehydrogenase activity in Drosophila. Genetics 117, 343–351 (1987)

    Google Scholar 

  34. Wang, L., Zhang, K., Zhang, L.: Perfect Phylogenetic Networks with Recombination. J. of Comp. Bio. 8, 69–78 (2001)

    Article  Google Scholar 

  35. Zollner, S., Pritchard, J.K.: Coalescent-Based Association Mapping and Fine Mapping of Complex Trait Loci. Genetics 169, 1071–1092 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Terry Speed Haiyan Huang

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Wu, Y. (2007). Association Mapping of Complex Diseases with Ancestral Recombination Graphs: Models and Efficient Algorithms. In: Speed, T., Huang, H. (eds) Research in Computational Molecular Biology. RECOMB 2007. Lecture Notes in Computer Science(), vol 4453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71681-5_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-71681-5_34

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71680-8

  • Online ISBN: 978-3-540-71681-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics