Summary
- Top of page
- Summary
- Introduction
- Material and Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- Web resources
- References
- Supporting Information
Several studies have found strikingly different allele frequencies between continents. This has been mainly interpreted as being due to local adaptation. However, demographic factors can generate similar patterns. Namely, allelic surfing during a population range expansion may increase the frequency of alleles in newly colonised areas. In this study, we examined 772 STRs, 210 diallelic indels, and 2834 SNPs typed in 53 human populations worldwide under the HGDP-CEPH Diversity Panel to determine to which extent allele frequency differs among four regions (Africa, Eurasia, East Asia, and America). We find that large allele frequency differences between continents are surprisingly common, and that Africa and America show the largest number of loci with extreme frequency differences. Moreover, more STR alleles have increased rather than decreased in frequency outside Africa, as expected under allelic surfing. Finally, there is no relationship between the extent of allele frequency differences and proximity to genes, as would be expected under selection. We therefore conclude that most of the observed large allele frequency differences between continents result from demography rather than from positive selection.
Introduction
- Top of page
- Summary
- Introduction
- Material and Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- Web resources
- References
- Supporting Information
On a worldwide scale, human populations show a large phenotypic variability, particularly for skin colour, face and body shapes, susceptibility to pathogens, as well as for the prevalence of genetic diseases (Lewontin, 1995). However, most of the genetic variation in humans is found within populations rather than among populations or geographic regions (Lewontin, 1972, Barbujani et al. 1997, Rosenberg et al. 2002b). Still, many studies have focused on traits or loci showing geographically restricted distribution, or on loci showing drastic allele frequency differences between two regions. These particular cases can indeed reveal important information about local selective pressures or about the demographic histories of different populations (Balaresque et al. 2007). It is however difficult to disentangle the effects of positive selection from those of demography, since past demographic events such as population bottlenecks or range expansions can mimic the genetic signatures of a selective sweep like long range linkage disequilibrium and reduced allelic diversity.
The colonisation of the world by modern humans was probably accompanied by a series of founder effects with subsequent local population expansions (Handley et al. 2007). Strong bottlenecks have also certainly occurred during the exit out of Africa and at the onset of the colonisation of the Americas by people from Asia (Fagundes et al. 2007, Goebel et al. 2008). These bottlenecks, followed by a spatial expansion, can lead to the geographic spread of an allele that rides on the wave of advance of the spatial expansion, a phenomenon called allelic surfing (Edmonds et al. 2004, Klopfstein et al. 2006, Travis et al. 2007). New mutations arising on the wave front and extant alleles may surf successfully (Excoffier & Ray, 2008), spreading geographically and increasing in frequency in the newly colonised areas (Klopfstein et al. 2006). A combination of simulation, analytical and experimental studies have shown that the probability for an allele to successfully surf is increased in the presence of spatial bottlenecks, when local deme size is small, and when populations at the wave front grow rapidly and exchange few genes with their neighbours (Klopfstein et al. 2006, Hallatschek et al. 2007, Travis et al. 2007, Excoffier & Ray, 2008, Hallatschek & Nelson, 2008). This neutral process has received much attention recently because of its consequences on allele frequencies that mimic selective processes (Nielsen et al. 2007).
However, it is clear that human populations colonising novel habitats have been confronted by new selective pressures due to their exposure to different climate, food sources, and pathogens (Balaresque et al. 2007). Some of these selective pressures certainly triggered local adaptation that impacted on allele frequencies at several loci. However, neutral allele surfing, like selection, will also occur at only a few loci, and will therefore not affect all loci uniformly, like other demographic factors such as demographic expansions, inbreeding or bottlenecks.
Until recently, most human genes showing strong geographic structures were considered to be under positive selection (see Table 1, where 44 such genes are listed). Most of these genes show a marked difference in allele frequencies (typically larger than 20%) between African and non-African populations. In many of these studies, local selection outside Africa was thought to have promoted these large allele frequency differences. Prominent examples are two genes that are involved in the control of brain size, MCPH1 and ASPM (Evans et al. 2005, Mekel-Bobrov et al. 2005). Both genes showed an increased frequency of a derived allele outside Africa and high levels of linkage disequilibrium. The authors therefore hypothesised that the derived haplotypes were under local positive selection in non-African populations. However, Currat et al. (2006) showed by spatially-explicit simulations that similar geographic distributions of allele frequencies could be generated by neutral allelic surfing during the range expansion outside Africa.
In this study, we explore data from the HGDP-CEPH Diversity Panel consisting of 772 STRs, 210 insertion-deletion polymorphisms and 2834 SNPs typed in 53 populations worldwide to determine the prevalence of large allele frequency differences between regions. We find that large allele frequency differences between continental regions are extremely common, as they occur at almost one third of all loci. We discuss the respective role of selection and demographic factors for shaping these patterns in the light of geographic and genomic information.
Results
- Top of page
- Summary
- Introduction
- Material and Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- Web resources
- References
- Supporting Information
We tested whether populations belonging to the same region have more similar allele frequencies than expected by chance due to shared demographic history or shared selective events. Indeed, they show more similar allele frequencies than random populations, as the number of alleles showing ΔF > 0.2 for a given comparison is always significantly larger than expected by chance when tested with the random population permutation procedure (Tables 2–4 and Tables S2-S4). However, this is not always the case when tested with the geographically explicit permutation test, when randomized regions are made up of spatially neighbouring populations. In the STR dataset all positive frequency differences between America and the rest of the world that are larger than 0.2 are non-significant (Table 2 and Table S2). Additionally some of the larger frequency differences between America and the rest of the world in the indel dataset are also non-significant (Table 3 and Table S3). The geographically explicit permutation test is expected to be more stringent, as geographically close populations are genetically often more similar than random populations. However, if there are only few populations in a region, as is the case for the Americas, the geographically explicit permutation test is too stringent because the number of different random groups is reduced. Allele-specific ΔF was therefore tested with the random permutation procedure only and it is found significant in all cases as soon as ΔF > 0.25. We therefore chose an arbitrary threshold for ΔF of 0.3 to define a set of alleles with significant ΔF to summarise the results.
Table 2. STR allele frequency differences (ΔF) for the comparisons of Africa vs. the rest of the world and America vs. the rest of the world. Positive
(in the upper part of the table) indicate that the alleles have a lower frequency within African (or American) populations than in the non-African (or non-American) populations (because
). | ΔF | Africa vs. non-Africa | America vs. non-America |
|---|
| Allelesa | significantb | p-value 1c | p-value 2d | Locie | significantf | Allelesa | significantb | p-value 1c | p-value 2d | Locie | significantf |
|---|
|
| 0.65–0.7 | 0 | | | | | | 0 | | | | | |
| 0.6–0.65 | 1 | 1 | ** | ** | 1 | 1 | 0 | | | | | |
| 0.55–0.6 | 0 | | | | | | 0 | | | | | |
| 0.5–0.55 | 5 | 5 | ** | ** | 5 | 5 | 0 | | | | | |
| 0.45–0.5 | 9 | 9 | ** | ** | 9 | 9 | 1 | 1 | * | | 0 | 0 |
| 0.4–0.45 | 9 | 9 | ** | ** | 9 | 9 | 1 | 1 | * | | 0 | 0 |
| 0.35–0.4 | 19 | 19 | ** | ** | 17 | 17 | 6 | 6 | ** | | 6 | 6 |
| 0.3–0.35 | 24 | 24 | ** | ** | 22 | 22 | 13 | 13 | * | | 6 | 6 |
| (−0.3) −0.3 | 9122 | 4604 | | | 693 | 609 | 9049 | 3916 | | | 625 | 568 |
| (−0.3)–(−0.35) | 9 | 9 | ** | * | 6 | 6 | 53 | 53 | ** | ** | 49 | 49 |
| (−0.35)–(−0.4) | 8 | 8 | ** | * | 7 | 7 | 34 | 34 | ** | ** | 33 | 33 |
| (−0.4)–(−0.45) | 2 | 2 | ** | * | 1 | 1 | 24 | 24 | ** | ** | 24 | 24 |
| (−0.45)–(−0.5) | 1 | 1 | ** | * | 1 | 1 | 15 | 15 | ** | ** | 15 | 15 |
| (−0.5)–(−0.55) | 1 | 1 | ** | * | 1 | 1 | 5 | 5 | ** | ** | 5 | 5 |
| (−0.55)–(−0.6) | 0 | | | | | | 7 | 7 | ** | ** | 7 | 7 |
| (−0.6)–(−0.65) | 0 | | | | | | 2 | 2 | ** | ** | 2 | 2 |
Table 3. Indel absolute allele frequency differences for the comparisons of Africa vs. the rest of the world and America vs. the rest of the world. Since indels can be considered as diallelic loci, we directly report the number of loci with a given ΔFmax value. | ΔFmax | Africa vs. non-Africa | America vs. non-America |
|---|
| Locia | significantb | p-value 1c | p-value 2d | Loci a | significantb | p-value 1c | p-value 2d |
|---|
|
| 0.75–0.8 | 0 | | | | 0 | | | |
| 0.7–0.75 | 1 | 1 | ** | ** | 1 | 1 | ** | * |
| 0.65–0.7 | 2 | 2 | ** | ** | 0 | | | |
| 0.6–0.65 | 1 | 1 | ** | ** | 0 | | | |
| 0.55–0.6 | 4 | 4 | ** | ** | 0 | | | |
| 0.5–0.55 | 3 | 3 | ** | * | 2 | 2 | ** | * |
| 0.45–0.5 | 10 | 10 | ** | ** | 1 | 1 | * | |
| 0.4–0.45 | 14 | 14 | ** | ** | 6 | 6 | ** | * |
| 0.35–0.4 | 14 | 14 | ** | * | 8 | 8 | ** | |
| 0.3–0.35 | 12 | 12 | ** | * | 11 | 11 | ** | |
| 0–0.3 | 149 | 104 | | | 181 | 93 | | |
Table 4. SNP allele frequency differences for the comparisons of Africa vs. the rest of the world and America vs. the rest of the world. Since SNPs can be considered as diallelic loci, we directly report the number of loci with a given ΔFmax value. | ΔFmax | Africa vs. non-Africa | America vs. non-America |
|---|
| Locia | significantb | p-value 1c | p-value 2d | Locia | significantb | p-value 1c | p-value 2d |
|---|
|
| 0.75–0.8 | 3 | 3 | ** | ** | 0 | | | |
| 0.7–0.75 | 10 | 10 | ** | ** | 0 | | | |
| 0.65–0.7 | 1 | 1 | ** | * | 1 | 1 | ** | * |
| 0.6–0.65 | 14 | 14 | ** | ** | 5 | 5 | ** | * |
| 0.55–0.6 | 31 | 31 | ** | ** | 13 | 13 | ** | * |
| 0.5–0.55 | 38 | 38 | ** | ** | 19 | 19 | ** | * |
| 0.45–0.5 | 62 | 62 | ** | ** | 22 | 22 | ** | * |
| 0.4–0.45 | 89 | 89 | ** | ** | 60 | 60 | ** | * |
| 0.35–0.4 | 136 | 136 | ** | ** | 72 | 72 | ** | * |
| 0.3–0.35 | 129 | 129 | ** | * | 143 | 143 | ** | * |
| 0–0.3 | 2321 | 1484 | | | 2499 | 1303 | | |
Overall we find that large allele frequency differences between geographic regions are extremely frequent (Tables 2–4 and Tables S2–S4). Indeed, 215 of the 772 STR loci (27.9%), 90 out of 210 indel loci (42.9%) and 913 of the 2834 SNP loci (32.2%) have ΔFmax > 0.3 for at least one comparison. Among these, 18.1% of the STR loci with ΔFmax > 0.3 show such a large ΔFmax for more than one comparison, while for the indels and SNPs this fraction is 28.9% and 18.1%, respectively. Note that the total number of loci with ΔFmax > 0.3 is smaller than the sum of the number of loci with ΔFmax > 0.3 involved in the different comparisons that can be computed from Tables S2–S4, because a given locus can show large allele frequencies in more than one continental comparison. The largest observed ΔF (0.79) was found between African and non-African populations for the SNP locus ‘rs5972561’ (see below in Figure 4I).
In the comparisons of Africa and America to the rest of the World, the allele frequency differences are strikingly large (Tables S2-S4), as expected under the surfing out-of-Africa hypothesis. When Africa is contrasted to the rest of the world the fraction of loci with ΔFmax > 0.3 is 10.2%, 29.0%, and 18.1%, for STRs, indels, and SNPs, respectively, and these fractions are 19.0%, 13.8%, and 11.8%, respectively, for the Americas. For the Eurasian and East Asian regions, these numbers are much lower, and vary between 1.2% and 8.6%. In keeping with these results, ΔF's are actually never as large in the comparisons of Eurasia and East Asia as in other comparisons. For instance, STRs do not show any allele with ΔF > 0.45 in Eurasia or in East Asia, whereas ΔF reaches 0.6 in Africa and 0.65 in America.
Given their large mutation rate, it may seem surprising that STR alleles show ΔF as large as those observed for SNPs and for indels if these differences had been created during the expansion out-of-Africa some 50 to 60 thousand years ago. Over time, mutations are indeed expected to erode large initial frequency differences at neutral loci, and thus large ΔF (50% or more) could be better explained by their maintenance due to selection. In order to check how quickly mutations would lower the frequency of an allele initially fixed in a population, we have carried out simple simulations at STR loci of an unsubdivided population under a pure stepwise mutation model. We have reported this decrease over 2000 generations in Figure S1 for different mutation rates and different effective population sizes. As expected the rate of decrease is positively correlated with mutation rate, and its variance is negatively correlated with population size. However, for a mutation rate of 5×10−4, the allele frequency is still about 65% after 1,000 generations and 46% after 2,000 generations. For a lower mutation rate of 10−4, the mean expected frequencies are 91–92% and 83–85% after 1,000 and 2,000 generations, respectively, depending on the effective population size. Given the relatively large variance of mutation rates for human STR loci (Xu et al. 2005), it appears therefore likely that STR allele frequencies of more than 80% could still be observed after 2,000 generations if they were initially fixed by surfing or a strong bottleneck, without the need to invoke selection for their maintenance. Still, one would expect that loci with high mutation rates would show lower allele frequency differences today. Since heterozygosity is positively correlated with mutation rate for STRs (Kimmel & Chakraborty, 1996), we would expect loci with a low heterozygosity to have larger allele frequency differences than loci with a high heterozygosity, and this is exactly what we observe in Figure 1.
Surfing promotes the increase of allele frequencies in the direction of a spatial expansion. Therefore we expect to find more STR alleles with increased frequency in newly colonised areas than alleles with decreased frequency, since the decrease compensating the increase of a single allele will affect several other alleles at a given locus. This excess should be especially pronounced for Africa and America, because they are separated by spatial bottlenecks from the Eurasian continent. As shown in Figures 2 and 3, there is indeed a clear asymmetry in the distribution of STR allele frequency differences between regions. For instance, by considering only alleles with ΔF > 0.3, there are clearly more alleles that increased in frequency outside Africa than there are alleles that decreased in frequency. On the contrary, for East Asia and the Americas, there are more alleles at a higher frequency within these regions (Table S5). Since it is not possible to describe this pattern for diallelic loci like SNPs and indels, we tested for these markers whether the derived alleles show an asymmetry in frequency differences. We actually did not expect to find any asymmetry, as surfing does not discriminate between ancestral and derived alleles. For the indels the derived allele is about equally likely to increase in frequency as it is to decrease in frequency (Table S6). For SNPs however, we find that derived alleles have more often increased than decreased outside Africa for 0.15 < ΔF < 0.5, while we see the reverse situation in America for 0.3 < ΔF < 0.4 (Table S7). No clear pattern occurs for the other two regions (Table S7). This pattern is compatible with surfing, since most derived SNP alleles have low frequencies in Africa and could thus have had more room to increase in frequency by surfing than already frequent alleles.
Eberle et al. (2006) found that genic regions are enriched for signals of positive selection compared to non-genic regions (see also Hinds et al. 2005, Voight et al. 2006, Barreiro et al. 2008). If large ΔF were mainly created by the action of positive selection, it should be especially common close to genes. However, we find the correlation of ΔFmax and distance to the closest gene is only significant (at the 5% level) in three instances: for STR alleles in Eurasia, as well as for SNP alleles in Eurasia and America (Figures S2 and S4). In all three cases the explained variance (R2) is small and the p-values are above the 1% level. For indels there is no significant correlation between ΔFmax and distance to genes (Figure S3). However, the power to detect selection close to genic regions may be limited here by the lower density of markers than that available in previous genomic studies, which were however based on a much smaller number of populations.
Discussion
- Top of page
- Summary
- Introduction
- Material and Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- Web resources
- References
- Supporting Information
We have found an unexpectedly large fraction of loci showing strong differences in allele frequencies between continents in all three datasets. 43% of the indels, 32% of the SNPs and 28% of the STR loci show large frequency differences (ΔFmax > 0.3) between a given geographic region and the rest of the world. A visual inspection of the spatial distribution of some of these allele frequencies indeed reveals striking features (Figure 4), with strong differences between continents, either with very narrow or broader clines, which at first sight is difficult to attribute to pure neutral processes. However, the sheer number of loci showing such striking patterns makes it difficult to believe that these patterns have all been shaped by positive selection, as previously advocated (Evans et al. 2005, Mekel-Bobrov et al. 2005, Akey et al. 2006, Myles et al. 2008).
There is a clear excess of large ΔF between sub-Saharan Africa or the Americas and other regions as compared to ΔF between Eurasia or east Asia and other regions (Tables S2-S4). This is in line with previous genome scan studies, which detected more evidence of recent positive selection in Eurasian and East Asian populations as compared to African populations (Kayser et al. 2003, Akey et al. 2004, Storz et al. 2004, Carlson et al. 2005, Williamson et al. 2007). African populations seem therefore to have a deficit of recent positive selection (but see Hawks et al. 2007), which may be interpreted as evidence that selective pressures in recent times were more prevalent outside of Africa (Akey et al. 2004, Storz et al. 2004). In agreement with this hypothesis, Tang et al. (2007) found more genomic regions potentially influenced by selection when Africa was compared to Eurasian or to Asian populations than in the comparison of Eurasia to Asia. Under a selectionist view, this could be explained by the fact that the Eurasian continent has been colonized only recently and traces of selection would be easier to recognize. However, the populations remaining in Africa have also experienced drastic changes in their environment during the past 50,000 years (deMenocal, 2004), and prominent examples of recent genetic adaptations have been found in this continent as well (e.g. beta-globin (Hanchard et al. 2007), G6PD (Saunders et al. 2002), or lactose tolerance (Tishkoff et al. 2007)). Like Africa, the Americas are also strongly differentiated from the rest of the World, and here selection would have had little time to operate, especially given the overall small sizes of the populations, leading to large levels of differentiation among Amerindian populations (Wang et al. 2007b).
We believe that demographic factors can better explain the particular differentiation of both Africa and the Americas. These two continents are indeed geographically very isolated from the others, such that some spatial and demographic bottlenecks have certainly occurred during the exit out-of-Africa to colonize Eurasia and during the colonization of the Americas from North-East Asia (see e.g. Fagundes et al. 2007). Moreover, these spatial bottlenecks could have also enhanced the possibility of allelic surfing during subsequent spatial expansions (Travis et al. 2007). Allele surfing could also explain the asymmetry of the STR allele frequency distributions (Figures 2 and 3), since this phenomenon originally described the increase in frequency of rare alleles over large and recently colonized areas (Edmonds et al. 2004, Klopfstein et al. 2006). Therefore, the asymmetries shown in Figures 2 and 3 are expected after a range expansion out-of-Africa, as well as into Eurasia, East-Asia and the Americas.
If large allele frequency differences were mainly driven by positive selection acting on coding regions, one would expect to see a negative relationship between ΔF and the distance between gene and markers. Voight et al. (2006) indeed discovered more signals of selection in genic regions than in non-genic regions of the genome and Hinds et al. (2005) and Eberle et al. (2006) found that regions of extended linkage disequilibrium are enriched for genic SNPs. When testing for a correlation of allele frequency differences and distance to genes, however, we find only marginally significant results in three cases. We note however, that the relative lower number of loci examined here in a large number of populations is in contrast with previous genome scan studies, where hundreds of thousands of loci were studied in a very few populations. This low marker density may indeed prevent us from obtaining significant results, and it would be interesting to extend our analysis to new databases containing hundreds of thousands of markers (see e.g. Jakobsson et al. 2008, Li et al. 2008). In any case, the fact that markers showing high levels of differentiation between continents appear randomly scattered over the whole genome is more in line with surfing than with positive selection as a cause. It is, however, very likely that we observe the effects of diverse selective and neutral forces and their interaction. Positive selection, genetic drift and allelic surfing mainly lead to increased genetic differences between populations, while balancing selection and migration decrease differentiation. Our results suggest that local adaptation is certainly not the main acting force in promoting these large allele frequency changes between continental regions, but selection could certainly be involved at various loci.
Among the genes that are close to markers with high allele frequency differences between African and non-African populations, we could identify some that were already signalled as candidates for positive selection in previous studies using different criterion than mere allele frequency differences between continents. These are TCF15 (Storz et al. 2004), KRTAP23–1 (Williamson et al. 2007), PHACTR1 (Williamson et al. 2007), C20orf26 (Williamson et al. 2007), ANTXR2 (Kimura et al. 2007), UTRN (Tang et al. 2007), TYRP1 (Izagirre et al. 2006, Lao et al. 2007), LYST (Izagirre et al. 2006), DMD (Nachman & Crowell, 2000), SEMA4F (Nielsen et al. 2005), and E2F6 (Kayser et al. 2003). It suggests either that markers with geographic differentiation may indeed point to linked selected genes or that previous studies using allele frequency difference as a criterion to identify outlier loci have erroneously mistaken surfing for selection.
Since allele surfing looks very much like a selective sweep (Nielsen et al. 2007, Excoffier & Ray, 2008) it would affect other aspects of genetic diversity than the allele frequency spectrum, like linkage disequilibrium and extended homozygosity (Biswas & Akey, 2006). Previous studies aiming at detecting positively selected loci have attempted to control for past demography, either by 1) explicitly modelling some complex demography (Sabeti et al. 2007, Stajich & Hahn, 2005, Tang et al. 2007, Williamson et al. 2007), 2) by comparing diversity linked to derived or ancestral alleles (Voight et al. 2006), or 3) by contrasting coding to non-coding regions (Akey et al. 2002, Barreiro et al. 2008). To our knowledge, range expansions have never been used as a null model against which observed patterns were examined, and it is thus unclear (and would be worth examining) how the sensitivity of the first types of approaches would change under such a new null model. As mentioned above, derived and ancestral alleles show different frequencies in Africa (The International HapMap Consortium, 2007, Li et al. 2008) and the result of positive selection differs between new and standing variation (Przeworski et al. 2005, Teshima et al. 2006, Barrett & Schluter, 2008), so that tests based on the comparison of diversity associated to derived and ancestral alleles may indeed be sensitive to allele surfing, simply because these two allele categories have different initial frequencies. The comparison of genic to non-genic regions may indeed be the approach most robust against past demography. For instance, Barreiro et al. (2008) compared the proportion of loci with a high FST between genic and non-genic SNPs. They found that the proportion of genic SNPs with an FST>0.65 was about 2.8 fold larger than the proportion of non-genic SNPs with equally large FST, and they could identify several candidate genes based on this high level of differentiation between populations. However, since this class of high FST SNPs represents only about 0.35% of all genic SNPs, it suggests that most genic regions have not been influenced much by selection. While we find that positive selection is unlikely to have shaped the allele frequency spectrum at most loci, it may certainly have acted on fewer genes than previously believed, and our current results do not allow us to discriminate between the effects of demography and selection for an individual locus. Loci which are candidates for being under positive selection should therefore be more carefully scrutinized to find links between potentially selected alleles and a phenotypic effect (see e.g. Sabeti et al. 2007).
Supporting Information
- Top of page
- Summary
- Introduction
- Material and Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- Web resources
- References
- Supporting Information
Table S1. Populations sampled in the HGDP-CEPH Diversity Panel.
Table S2. STR allele frequency differences (ΔF) for all comparisons between major geographic regions.
Table S3. Indel absolute allele frequency differences (ΔFmax) for all comparisons between major geographic regions.
Table S4. SNP absolute allele frequency differences (ΔFmax) for all comparisons between major geographic regions.
Table S5. Asymmetric distribution of STR allele frequency differences between regions.
Table S6. Asymmetric distribution of indel derived allele frequency differences between regions.
Table S7. Asymmetric distribution of SNP derived allele frequency differences between regions.
Figure S1. Expected decrease of STR allele frequency.
Figure S2. Relationship between ΔFmax and distance to the closest genes for the STR loci.
Figure S3. Relationship between ΔFmax and distance to the closest genes for indel loci.
Figure S4. Relationship between ΔFmax and distance to the closest genes for SNP loci.
Please note: Wiley-Blackwell Publishing are not responsible for the content or functionality of any supporting materials supplied by the authors. Any queries (other than missing material) should be directed to the corresponding author for the article.
Please note: Wiley-Blackwell is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.