Efficiency and power in genetic association studies

Paul I W de Bakker; Roman Yelensky; Itsik Pe'er; Stacey B Gabriel; Mark J Daly; David Altshuler

doi:10.1038/ng1669

Efficiency and power in genetic association studies

Nat Genet. 2005 Nov;37(11):1217-23. doi: 10.1038/ng1669. Epub 2005 Oct 23.

Authors

Paul I W de Bakker¹, Roman Yelensky, Itsik Pe'er, Stacey B Gabriel, Mark J Daly, David Altshuler

Affiliation

¹ Center for Human Genetic Research, Massachusetts General Hospital, 185 Cambridge Street, CPZN-6818, Boston, Massachusetts 02114-2790, USA.

PMID: 16244653
DOI: 10.1038/ng1669

Abstract

We investigated selection and analysis of tag SNPs for genome-wide association studies by specifically examining the relationship between investment in genotyping and statistical power. Do pairwise or multimarker methods maximize efficiency and power? To what extent is power compromised when tags are selected from an incomplete resource such as HapMap? We addressed these questions using genotype data from the HapMap ENCODE project, association studies simulated under a realistic disease model, and empirical correction for multiple hypothesis testing. We demonstrate a haplotype-based tagging method that uniformly outperforms single-marker tests and methods for prioritization that markedly increase tagging efficiency. Examining all observed haplotypes for association, rather than just those that are proxies for known SNPs, increases power to detect rare causal alleles, at the cost of reduced power to detect common causal alleles. Power is robust to the completeness of the reference panel from which tags are selected. These findings have implications for prioritizing tag SNPs and interpreting association studies.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Case-Control Studies
Chromosome Mapping
Computer Simulation
Genetic Markers / genetics
Genetic Predisposition to Disease*
Haplotypes / genetics*
Humans
Linkage Disequilibrium / genetics
Models, Genetic
Polymorphism, Single Nucleotide / genetics*
Reference Standards

Substances

Genetic Markers