Abstract
Association, or LD (linkage disequilibrium), mapping is an intensely-studied approach to gene mapping (genome-wide or in candidate regions) that is widely hoped to be able to efficiently locate genes influencing both complex and Mendelian traits. The logic underlying association mapping implies that the best possible mapping results would be obtained if the genealogical history of the sampled individuals were explicitly known. Such a history would be in the form of an “ancestral recombination graph (ARG)”. But despite the conceptual importance of genealogical histories to association mapping, few practical association mapping methods have explicitly used derived genealogical aspects of ARGs. Two notable exceptions are [35] and [23].
In this paper we develop an association mapping method that explicitly constructs and samples minARGs (ARGs that minimize the number of recombinations). We develop an ARG sampling method that provably samples minARGs uniformly at random, and that is practical for moderate sized datasets. We also develop a different, faster, ARG sampling method that still samples from a well-defined subspace of ARGs, and that is practical for larger sized datasets. We present novel efficient algorithms on extensions of the “phenotype likelihood” problem, a key step in the method in [35]. We also prove that computing the phenotype likelihood for a different natural extension of the penetrance model in [35] is NP-hard, answering a question unresolved in that paper. Finally, we put all of these results into practice, and examine how well the implemented methods perform, compared to the results in [35]. The empirical results show great speed ups, and definite but sometimes small, improvements in mapping accuracy. Speed is particularly important in doing genome-wide scans for causative mutations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bafna, V., Bansal, V.: The number of recombination events in a sample history: conflict graph and lower bounds. IEEE/ACM Trans. on Computational Biology and Bioinformatics 1, 78–90 (2004)
Bafna, V., Bansal, V.: Inference about Recombination from Haplotype Data: Lower Bounds and Recombination Hotspots. J. of Comp. Bio. 13, 501–521 (2006)
Bordewich, M., Semple, C.: On the computational complexity of the rooted subtree prune and regraft distance. Annals of Combinatorics 8, 409–423 (2004)
Brown, D.: Private communications.
Clark, A.G.: Finding genes underlying risk of complex disease by linkage disequilibrium mapping. Current Opinion in Genetics and Development 13, 296–302 (2003)
Felsenstein, J.: Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol. 17, 368–376 (1981)
Griffiths, R.C., Marjoram, P.: Ancestral inference from samples of DNA sequences with recombination. J. of Comp. Bio. 3, 479–502 (1996)
Gusfield, D.: Optimal, efficient reconstruction of Root-Unknown phylogenetic networks with constrained and structured recombination. JCSS 70, 381–398 (2005)
Gusfield, D., Eddhu, S., Langley, C.: Optimal, efficient reconstruction of phylogenetic networks with constrained recombination. J. Bioinformatics and Computational Biology 2, 173–213 (2004)
Gusfield, D., Eddhu, S., Langley, C.: The fine structure of galls in phylogenetic networks. INFORMS J. on Computing 16, 459–469 (2004)
Hein, J.: Reconstructing evolution of sequences subject to recombination using parsimony. Math. Biosci. 98, 185–200 (1990)
Hein, J.: A heuristic method to reconstruct the history of sequences subject to recombination. J. Mol. Evol. 36, 396–405 (1993)
Hein, J., Schierup, M., Wiuf, C.: Gene Genealogies, Variation and Evolution: A primer in coalescent theory. Oxford University Press, Oxford (2005)
Hinds, D., Stuve, L., Nilsen, G., Halperin, E., Eskin, E., Gallinger, D., Frazer, K., Cox, D.: Whole-Genome Patterns of Common DNA variation in three human populations. Science 307, 1072–1079 (2005)
Hudson, R., Kaplan, N.: Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111, 147–164 (1985)
International HapMap Consortium: The HapMap project. Nature 426, 789–796 (2003)
International HapMap Consortium: A haplotype map of the human genome. Nature 437, 1299–1320 (2005)
Kerem, B., Rommens, J.M., Buchanan, J.A., Markiewicz, D., Cox, T.K., Chakravarti, A., Buchwald, M., Tsui, L.C.: Identification of the cystic fibrosis gene: genetic analysis. Science 245, 1073–1080 (1989)
Larribe, F., Lessard, S., Schork, N.J.: Gene mapping via Ancestral Recombination Graph. Theor. Popul. Biol. 62, 215–229 (2002)
Li, N., Stephens, M.: Modeling Linkage Disequilibrium, and identifying recombination hotspots using SNP data. Genetics 165, 2213–2233 (2003)
Lyngso, R., Song, Y.S., Hein, J.: Minimum Recombination Histories by Branch and Bound. In: Proceedings of Workshop on Algorithm of Bioinformatics (WABI), vol. 3692, pp. 239-250 (2005)
McPeek, M.S., Strahs, A.: Assessment of linkage disequilibrium by the decay of haplotype sharing, with application to fine-scale genetic mapping. Am. J. Hum. Genet. 65, 858–875 (1999)
Minichiello, M., Durbin, R.: Mapping trait loci using inferred ancestral recombination graphs. Am. J. Hum. Genet. 79, 910–922 (2006)
Morris, A.P., Whittaker, J.C., Balding, D.J.: Fine-scale mapping of disease loci via shattered coalescent modeling of genealogies. Am. J. Hum. Genet. 70, 686–707 (2002)
Myers, S.R., Griffiths, R.C.: Bounds on the minimum number of recombination events in a sample history. Genetics 163, 375–394 (2003)
Norborg, M., Tavare, S.: Linkage disequilibrium: what history has to tell us. Trends in Genetics 18, 83–90 (2002)
Rannala, B., Reeve, J.P.: High-resolution multipoint linkage-disequilibrium mapping in the context of a human genome sequence. Am. J. Hum. Genet. 69, 159–178 (2001)
Risch, N., Merikangas, K.: The Future of Genetic Studies of Complex Human Diseases. Science 275, 1516–1517 (1996)
Song, Y., Hein, J.: Parsimonious reconstruction of sequence evolution and haplotype blocks: Finding the minimum number of recombination events. In: Proc. of 2003 Workshop on Algorithms in Bioinformatics (WABI) (2003)
Song, Y., Hein, J.: On the Minimum Number of Recombination Events in the Evolutionary History of DNA Sequences. J. of Math. Biology 48, 160–186 (2003)
Song, Y.S., Wu, Y., Gusfield, D.: Efficient computation of close lower and upper bounds on the minimum number of needed recombinations in the evolution of biological sequences (Proceedings of ISMB 2005). Bioinformatics 421, i413–i422 (2005)
Stephens, M., Smith, N., Donnelly, P.: A new statistical method for haplotype reconstruction from population data. Am. J. Hum. Genet. 68, 978–989 (2001)
Templeton, A.R., Boerwinkle, E., Sing, C.F.: A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping. I. Basic theory and an analysis of alcohol dehydrogenase activity in Drosophila. Genetics 117, 343–351 (1987)
Wang, L., Zhang, K., Zhang, L.: Perfect Phylogenetic Networks with Recombination. J. of Comp. Bio. 8, 69–78 (2001)
Zollner, S., Pritchard, J.K.: Coalescent-Based Association Mapping and Fine Mapping of Complex Trait Loci. Genetics 169, 1071–1092 (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Wu, Y. (2007). Association Mapping of Complex Diseases with Ancestral Recombination Graphs: Models and Efficient Algorithms. In: Speed, T., Huang, H. (eds) Research in Computational Molecular Biology. RECOMB 2007. Lecture Notes in Computer Science(), vol 4453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71681-5_34
Download citation
DOI: https://doi.org/10.1007/978-3-540-71681-5_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71680-8
Online ISBN: 978-3-540-71681-5
eBook Packages: Computer ScienceComputer Science (R0)