Prospects & Overviews
SNP ascertainment bias in population genetic analyses: Why it is important, and how to correct it
Version of Record online: 9 JUL 2013
© 2013 WILEY Periodicals, Inc.
Volume 35, Issue 9, pages 780–786, September 2013
How to Cite
Lachance, J. and Tishkoff, S. A. (2013), SNP ascertainment bias in population genetic analyses: Why it is important, and how to correct it. Bioessays, 35: 780–786. doi: 10.1002/bies.201300014
- Issue online: 14 AUG 2013
- Version of Record online: 9 JUL 2013
- NIH postdoctoral fellowship (F32HG006648-02) to JL and an NIH Pioneer Award (DP1 ES022577-04) to ST
- African hunter-gatherers;
- human genetics;
- population genetics;
- SNP ascertainment bias;
- whole genome sequencing
Whole genome sequencing and SNP genotyping arrays can paint strikingly different pictures of demographic history and natural selection. This is because genotyping arrays contain biased sets of pre-ascertained SNPs. In this short review, we use comparisons between high-coverage whole genome sequences of African hunter-gatherers and data from genotyping arrays to highlight how SNP ascertainment bias distorts population genetic inferences. Sample sizes and the populations in which SNPs are discovered affect the characteristics of observed variants. We find that SNPs on genotyping arrays tend to be older and present in multiple populations. In addition, genotyping arrays cause allele frequency distributions to be shifted towards intermediate frequency alleles, and estimates of linkage disequilibrium are modified. Since population genetic analyses depend on allele frequencies, it is imperative that researchers are aware of the effects of SNP ascertainment bias. With this in mind, we describe multiple ways to correct for SNP ascertainment bias.