Abstract
Imputation methods have been suggested as an efficient way to increase both utility and coverage in genome-wide association studies, especially when combining data generated from different genotyping arrays. We aim to demonstrate that imputation results are extremely accurate and the association analysis from imputed data does not over-inflate the results. Instead imputation leads to an increase in the power of the dataset without introducing any systematic biases. The majority of common variants can be imputed with very high accuracy (r2>0.9) and we validated the accuracy of imputations by comparing actual genotypes from low-throughput genotyping assays against imputed genotypes. Imputation was performed using IMPUTE2 and the 1000 Genomes cosmopolitan reference panel, which results in about 38 million SNPs. After quality control and filtering we performed case-control associations with 3,159,556 markers. We show a comparison of results from genotyped and imputed data and also determine how accurate ancestry is determined by imputations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Hindorff, L.A., Sethupathy, P., Junkins, H.A., Ramos, E.M., Mehta, J.P., Collins, F.S., Manolio, T.A.: Potential Etiologic and Functional Implications of Genome-wide Association Loci for Human Diseases and Traits. Proceedings of the National Academy of Sciences of the United States of America 106, 9362–9367
Dudbridge, F., Gusnanto, A.: Estimation of Significance Thresholds for Genomewide Association Scans. Genetic Epidemiology 32, 227–234
Chanock, S.J., Manolio, T., Boehnke, M., Boerwinkle, E., Hunter, D.J., Thomas, G., Hirschhorn, J.N., et al.: Replicating Genotype-phenotype Associations. Nature 447, 655–660
Stranger, B.E., Stahl, E.A., Raj, T.: Progress and Promise of Genome-Wide Association Studies for Human Complex Trait Genetics. Genetics 187, 367–383
Evangelou, E., John P.A.: Ioannidis: Meta-analysis Methods for Genome-wide Association Studies and Beyond. Nature Reviews Genetics 14, 379–389
McCarty, C.A., Wilke, R.A., Giampietro P.F., Wesbrook S.D., Caldwell, M.D.: Marshfield Clinic Personalized Medicine Research Project (PMRP): design, methods and recruitment for a large population-based biobank. Personalized Medicine 2, 49–79
Turner, S., Armstrong, L.L., Bradford, Y., Carlson, C.S., Crawford, D.C., Crenshaw, A.T., de Andrade, M., et al.: Quality Control Procedures for Genome-wide Association Studies. Current Protocols in Human 68, 1.19.1–1.19.18
Cross, D.S., Ivacic, L.C., McCarty, C.A.: Development of a Fingerprinting Panel Using Medically Relevant Polymorphisms. BMC Medical Genomics 2, 17
Cross, D.S., Ivacic, L.C., Stefanski, E.L., McCarty, C.A.: Population Based Allele Frequencies of Disease Associated Polymorphisms in the Personalized Medicine Research Project. BMC Genetics 11, 51
Olivier, D., Marchini, J., Zagury, J.-F.: A Linear Complexity Phasing Method for Thousands of Genomes. Nature Methods 9, 179–181
Marchini, J., Howie, B., Myers, S., McVean, G., Donnelly, P.: A New Multipoint Method for Genome-wide Association Studies by Imputation of Genotypes. Nature Genetics 39, 906–913
Frazer, K.A., Ballinger, D.G., Cox, D.R., Hinds, D.A., Stuve, L.L., Gibbs, R.A., Belmont, J.W., et al.: A Second Generation Human Haplotype Map of over 3.1 Million SNPs. Nature 449, 851–861
Howie, B., Fuchsberger, C., Stephens, M., Marchini, J., Abecasis, G.R.: Fast and Accurate Genotype Imputation in Genome-wide Association Studies through Pre-phasing. Nature Genetics 44, 955–959
Altshuler, D.M., Gibbs, R.A., Peltonen, L., Altshuler, D.M., Gibbs, R.A., Peltonen, L., Dermitzakis, E., et al.: Integrating Common and Rare Genetic Variation in Diverse Human Populations. Nature 467, 52–58
Kho, A.N., Hayes, M.G., Rasmussen-Torvik, L., Pacheco, J.A., Thompson, W.K., Armstrong, L.L., Denny, J.C., et al.: Use of Diverse Electronic Medical Record Systems to Identify Genetic Risk for Type 2 Diabetes Within a Genome-wide Association Study. Journal of the American Medical Informatics Association: JAMIA 19, 212–218
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Manuel, A.R.F., Bender, D., Maller, J., et al.: PLINK: a Tool Set for Whole-genome Association and Population-based Linkage Analyses. American Journal of Human Genetics 81, 559–575
Zheng, X., Levine, D., Shen, J., Gogarten, S.M., Laurie, C., Weir, B.S.: A High- performance Computing Toolset for Relatedness and Principal Component Analysis of SNP Data. Bioinformatics 28, 3326–3328
Novembre, J., Stephens, M.: Interpreting principal component analyses of spatial population genetic variation. Nat. Genetics 40, 646–649
Pruim, R.J., Welch, R.P., Sanna, S., Teslovich, T.M., Chines, P.S., Gliedt, T.P., Boehnke, M., Abecasis, G.R., Willer, C.J.: LocusZoom: Regional Visualization of Genome-wide Association Scan Results. Bioinformatics 26, 2336–2337
Pendergrass, S.A., Dudek, S.M., Crawford, D.C., Ritchie, M.D.: Visually Integrating and Exploring High Throughput Phenome-Wide Association Study (PheWAS) Results Using PheWAS-View. BioData Mining 5, 5
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Verma, S.S. et al. (2014). Benefits of Accurate Imputations in GWAS. In: Esparcia-Alcázar, A., Mora, A. (eds) Applications of Evolutionary Computation. EvoApplications 2014. Lecture Notes in Computer Science(), vol 8602. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45523-4_71
Download citation
DOI: https://doi.org/10.1007/978-3-662-45523-4_71
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45522-7
Online ISBN: 978-3-662-45523-4
eBook Packages: Computer ScienceComputer Science (R0)