Theory, practice, and conservation in the age of genomics: The Galápagos giant tortoise as a case study

Abstract High‐throughput DNA sequencing allows efficient discovery of thousands of single nucleotide polymorphisms (SNPs) in nonmodel species. Population genetic theory predicts that this large number of independent markers should provide detailed insights into population structure, even when only a few individuals are sampled. Still, sampling design can have a strong impact on such inferences. Here, we use simulations and empirical SNP data to investigate the impacts of sampling design on estimating genetic differentiation among populations that represent three species of Galápagos giant tortoises (Chelonoidis spp.). Though microsatellite and mitochondrial DNA analyses have supported the distinctiveness of these species, a recent study called into question how well these markers matched with data from genomic SNPs, thereby questioning decades of studies in nonmodel organisms. Using >20,000 genomewide SNPs from 30 individuals from three Galápagos giant tortoise species, we find distinct structure that matches the relationships described by the traditional genetic markers. Furthermore, we confirm that accurate estimates of genetic differentiation in highly structured natural populations can be obtained using thousands of SNPs and 2–5 individuals, or hundreds of SNPs and 10 individuals, but only if the units of analysis are delineated in a way that is consistent with evolutionary history. We show that the lack of structure in the recent SNP‐based study was likely due to unnatural grouping of individuals and erroneous genotype filtering. Our study demonstrates that genomic data enable patterns of genetic differentiation among populations to be elucidated even with few samples per population, and underscores the importance of sampling design. These results have specific implications for studies of population structure in endangered species and subsequent management decisions.


| INTRODUCTION
The advent of high-throughput DNA sequencing has enabled the characterization of the genomes of model and nonmodel organisms alike. Genomewide data can improve the precision and accuracy of estimates of population parameters, enhancing our understanding of present-day structure, gene flow, and local adaptation (Funk, McKay, Hohenlohe, & Allendorf, 2012). These data have also facilitated more detailed reconstructions of historical events that impacted evolutionary trajectories within species (e.g., Emerson et al., 2010), and among closely related species (e.g., Chaves et al., 2016).
While whole-genome sequencing is still beyond the budget of many research programs, methods based on reducedrepresentation genomic libraries (e.g., double-digest restriction-site associated DNA sequencing, ddRAD-seq (Peterson, Weber, Kay, Fisher, & Hoekstra, 2012)) allow tens or hundreds of thousands of single nucleotide polymorphisms (SNPs) to be discovered and reliably genotyped at a much reduced cost (Andrews, Good, Miller, Luikart, & Hohenlohe, 2016). This is particularly beneficial for species of conservation concern, where limited resources and sampling constraints (i.e., few individuals are available) may be prevalent. No matter the application, though, well-designed population genetics studies aim to maximize their statistical power while minimizing costs.
Genomewide SNP data are currently being applied to a broad spectrum of conservation objectives. These range from informing captive breeding programs (e.g., Wright et al., 2015) and improving detection of hybridization and inbreeding depression (e.g., vonHoldt, Kays, Pollinger, & Wayne, 2016;Robinson et al., 2016), to delineating conservation units, assessing levels of adaptive genetic variation, and predicting viability in the face of anthropogenic impacts such as climate change (Brauer, Hammer, & Beheregaray, 2016;Henry & Russello, 2013;Rellstab, Gugerli, Eckert, Hancock, & Holderegger, 2015;Sork et al., 2016). The appeal of genomic approaches to conservation biology is heightened by indications that a large number of independent loci can alleviate issues associated with small sample sizes per population; when using thousands of loci, one can obtain reliable estimates of genetic diversity and population differentiation, so long as the true values of these parameters are sufficiently high (e.g., Li & Durbin, 2011;Willing, Dreyer, & van Oosterhout, 2012). Yet, as noted by Allendorf (2017), genomic datasets need to be analyzed within the context of a carefully considered sampling design. Shortcomings in sampling design can lead to erroneous conclusions (Meirmans, 2015), which can have profound consequences for any population-level study, but especially for those with direct management implications for threatened or endangered species.
Here, we explore the power of using thousands of SNP markers to study population structure, and the impact of sampling design and small sample sizes on detecting and describing that structure. To do this, we use genomic data from Galápagos giant tortoises (Chelonoidis spp.) as a case study, given that a recent study has questioned the genomic distinctiveness of several species within this genus (Loire et al., 2013). The Galápagos Islands are home to a radiation of endemic giant tortoises that includes 11 endangered and four extinct species ( Figure 1). Taxonomic designations are supported by differences in morphology, geographic isolation of most species, and evidence of F I G U R E 1 Distribution map of Galápagos giant tortoises throughout the archipelago. The islands with extant species are shown in gray, while the islands with extinct species are in white. Black triangles identify the location of the four volcanoes on Isabela Island, each with its own locally endemic tortoise species. Extinct species are identified by a cross symbol. Names of each species are in cursive with a black line pointing to the island or location within an island where they occur. The populations from the three species in this study are identified by two or three letter symbols in bold: CRU = C. porteri, Santa Cruz Island (La Caseta). VA = C. vandenburghi, Volcano Alcedo, central Isabela Island, and PBL = C. becki, Piedras Blancas, Volcano Wolf, northern Isabela Island evolutionary divergence based on mitochondrial DNA (mtDNA) and nuclear microsatellite data (Ciofi, Milinkovitch, Gibbs, Caccone, & Powell, 2002;Garrick et al., 2015;see Fig. S7a,b).
In contrast to previous studies (see section VIII in Appendix S1 for details; Ciofi et al., 2002;Beheregaray et al., 2004;Russello et al., 2005;Poulakakis, Russello, Geist, & Caccone, 2012;Garrick et al., 2015;Poulakakis et al., 2015), Loire et al. (2013) challenged the genetic distinctiveness of three Galápagos giant tortoise species. Those authors collected transcriptome-derived genotypic data from ~1,000 synonymous SNPs from five captive individuals representing three species (C. becki, C. porteri, and C. vandenburghi). They did not detect significant differentiation, as measured by F ST , when comparing two groups (one group of three C. becki individuals, and the second group consisting of two individuals, one C. porteri, and one C. vandenburghi). These two groups were constructed on what the authors identified as natural partitions, based on the observation that their samples fall into two different mtDNA clades (Fig. S6a;Poulakakis et al., 2012). Furthermore, Loire et al. (2013) did not detect homozygosity excess, as measured by F IT , for which positive values would indicate population structure. Given that previous population genetic studies have largely relied upon data from mtDNA and microsatellites, such a discrepancy between these traditional markers and genomic SNPs could have wide-ranging implications, beyond the case of Galápagos giant tortoises, and therefore warrants further investigation.
In this study, we investigate the agreement of population structure analyses based on genomewide SNPs compared to those based on mtDNA sequences and microsatellite genotypes. To do this, we generated a dataset of tens of thousands of genomewide SNPs from 30 individuals representing the same three species (C. becki, C. porteri, and C. vandenburghi) considered by Loire et al. (2013). As these species form a recently diverged species complex, we treat each species as a population to compare against the null hypothesis that all Galápagos giant tortoises belong to a single species with one panmictic population. First, we address whether or not there is significant genomic differentiation among these three Galápagos giant tortoise species using newly generated SNPs. Then, we subsample our data to explore the effects of using only a few individuals per population and of pooling individuals from different populations on estimating genetic differentiation. From these subsampling simulations, we predict the range of F ST estimates expected when using the sampling scheme of Loire et al. (2013). Finally, we re-analyze the raw RNA-seq data from Loire et al. (2013) to test our prediction.

| SNP calling
We used forward and reverse reads to generate a de novo assembly using the pyrad v.3.0.3 pipeline (Eaton, 2014). Reads were demultiplexed and assigned to each individual based on barcodes allowing for one mismatch. We replaced base calls of Q < 20 with an ambiguous base (N) and discarded sequences containing more than four ambiguities. We used 85% clustering similarity as a threshold to align the reads into loci. We set additional filtering parameters to allow for a maximum number of SNPs to be called: retaining clusters with a minimum depth of sequence coverage (Mindepth) >5 and a locus coverage (MinCov) >10, a maximum proportion of individuals with shared heterozygote sites of 20% (MaxSH = p. 20), and a maximum number of SNP per locus of 15 (maxSNP = 15). For subsequent analyses, we filtered this dataset using vcftools (Danecek et al., 2011) to generate a set of polymorphic loci (23,057 SNPs) with no missing data common to all three Galápagos giant tortoises populations of interest, abbreviated PBL, CRU, and VA and corresponding to the species C. becki, C. porteri, and C. vandenburghi, respectively (n = 10 individuals each).

| Analytical methods
F-statistics (F IT , F IS , global F ST , and pairwise F ST ) were calculated using the diveRsity package in R (Keenan, McGinnity, Cross, Crozier, & Prodöhl, 2013), which uses a weighted Weir and Cockerham (1984) estimator. The same package was used to assess the statistical significance of these estimates by bootstrapping across loci. Through this method, we established 95% confidence intervals for each estimate, accepting as significant those that did not include 0. Pairwise F ST calculated from thousands of subsamples of the data (described below) were carried out in vcftools (Danecek et al., 2011) to streamline computation. We also used vcftools to calculate the number of loci out of Hardy-Weinberg equilibrium (HWE) for each population and for pooled populations.
As F ST estimates rely on a priori assignment of individuals to groups that are typically based on geographic location, we used two methods that do not have this assumption to assess patterns of differentiation among our samples. To do this, we first carried out principal component analysis (PCA) on all 30 individuals, using the PLINK software (Chang et al., 2015). Principal components 1 and 2 were plotted against each other in R. To complement the multivariate analyses, we performed a Bayesian clustering analysis, implemented in the program Structure version 2.3.4 (Falush, Stephens, & Pritchard, 2003;Pritchard, Stephens, & Donnelly, 2000), also including all 30 individuals. Structure assumes a model with K unknown clusters representing genetic populations in HWE and then assigns individuals to each cluster based on allele frequencies. We ran 20 repetitions of Structure for K = 1-5, with a burn-in of 10,000 iterations and MCMC length of 50,000 iterations. These runs used the admixture model, correlated allele frequencies among populations, and did not assume prior population information. All other parameters were left at default values. Results were postprocessed and visualized using CLUMPAK (Kopelman, Mayzel, Jakobsson, Rosenberg, & Mayrose, 2015). We used mean log likelihood values (Pritchard et al., 2000) and the ΔK statistic (Evanno, Regnaut, & Goudet, 2005) to infer the best K (Fig. S5).
Both analyses considered the 23,057 SNPs common to all individuals.
To further assess the power of our SNPs to detect population structure, we randomly subsampled individuals from each of the species and calculated pairwise F ST for each species using these subsamples. We tested this for per-species sample sizes of n = 2, n = 3, and n = 5. This process was repeated 1,000 times for each sample size.
We also carried out a similar analysis maintaining all 10 individuals per population but randomly subsampling SNPs from our dataset. For these analyses, we used the following number of SNPs: 25, 50, 100, 200, 500, 1,000, 5,000, and 10,000. This was repeated 1,000 times for each sample size. Finally, we used a subsampling scenario that directly mimicked the one in Loire et al. (2013) to further evaluate the impact of limited sample sizes and pooling of samples from distinct species on F ST estimates. As was done in Loire et al. (2013), we compared a set of three individuals from C. becki to a grouping that included one C. porteri plus one C. vandenburghi individual. To account for sample variation, we repeated this grouping process 1,000 times (described in full in section IV in Appendix S1).

| Tortoise samples and ddRAD-seq dataset
Our sequencing generated a total of 3,094,399,092 retained reads (approximately 15-58 million reads per individual) after demultiplexing and filtering reads for quality and ambiguous barcodes and ddRAD-tags. de novo assembly of the data resulted in 48,004,056 ddRAD-tags (approximately 320,000-465,000 per individual). From these, we called SNPs and obtained 973,321 variable sites. We then narrowed those loci down to only loci with genotypes called in every individual in our three species dataset, for a total of 23,057 SNPs.
For the three species of interest, the number of loci retained within populations and between population pairs is presented in Table 1. The average coverage per locus per individual was 12X (minimum 9; maximum 15).  (Table 2). These estimates were similar to, though higher than, F ST estimates using 12 nuclear microsatellite markers  for these species comparisons (Table 2 and Table S5).

| F-statistics using ddRAD-seq data
T A B L E 1 Number of polymorphic loci present in all individuals (n = 10 per species) used for analyses of each population (diagonal) and population pair (below diagonal) T A B L E 2 Pairwise F ST values between given species pairs. Above the diagonal, values calculated using our dataset of SNPs with no missing data and common to the population pair, along with 95% confidence intervals. Below the diagonal, values calculated using 12 microsatellite loci from Garrick et al. (2015) (see section VIII in Appendix S1). Data were obtained using 10 samples for each population (PBL, VA, CRU) for the three species

| PCA and Structure
The first two principal components of the PCA showed clear differentiation among individuals from the three species. PC1 accounted for approximately 12.0% of the variation among individuals, and PC2 accounted for approximately 9.3% of the variation among individuals ( Figure 2). Similarly, both mean log likelihood values (Pritchard et al., 2000) and the ΔK statistic (Evanno et al., 2005) supported the existence of three distinct genetic units in the Structure analysis (Fig.   S5). These groups correspond to the a priori geographic groupings used in F ST estimates and to the three named species. Our separate analysis of loci out of HWE, the basis for the Structure algorithm, supported these findings as well. When each species was considered separately, out of 23,057 loci PBL showed 214 out of HWE, CRU showed 124 out of HWE, and VA showed 71 out of HWE.
When the CRU and VA samples were pooled, the number of loci out of HWE rose to 1,326. When all three species were pooled and treated as one population, 2,422 loci were found to be out of HWE.

| Sample size, number of loci, and the effect of individual samples
In all population comparisons for the three sample sizes (n = 2, 3, or 5), the majority of estimates were within 0.03 of the F ST value calculated using the complete dataset of 10 samples per population (Figure 3 and Figure S2). In every case, when the sample size was two, F ST tended to be underestimated, though with a long tail of overestimated outliers.
In all comparisons with sample sizes of three or five, this skew disappeared: We found that 95% of the estimates were within 0.05 of the estimate using 10 samples (Tables S1).
Our F ST estimates from subsampled SNPs ranging from 25 to 10,000 SNPs appeared to have the statistical power to detect population structure between these population pairs when 10 individuals were used, with 95% of all estimates above 0 (Table S2A-C and Figure   S3). However, as expected, with many fewer SNPs the range of 95% of the estimates was very wide (see section III in Appendix S1). For example, when only 100 SNPs were used to compare PBL and CRU, 95% of the F ST estimates were between 0.1 and 0.255, while using 1,000 SNPs gave 95% of the F ST estimates between 0.146 and 0.194 for the same comparison (Table S2A). We estimated nearly identical Fst values when using all SNPs common to a given population pair and when using the set of SNPs common to all three populations (Table S3).

| Effect of pooling samples
To test how pooling samples affected F-statistic estimates, we used This confirms that pooling samples from two populations, each representing different species, results in a strongly depressed F ST estimate.
However, these simulations highlight that the occurrence of genetic differentiation (i.e., positive F ST values) should still be detectable even with this grouping scheme.

| Re-analysis of Loire et al. transcriptome data
Given that our analyses of ddRAD-seq data showed clear genetic structure among the populations from the three species, and our subsampling simulations (Fig. S4)

| Strong evidence of population structure
Using genomewide SNP data we found evidence for significant differentiation among the three species considered (C. becki, C. porteri, and C. vandenburghi), consistent with the findings of decades of research in this system Beheregaray, Ciofi, Geist, et al., 2003;Beheregaray et al., 2004;Ciofi et al., 2002;Garrick et al., 2015;Poulakakis et al., 2008Poulakakis et al., , 2012Poulakakis et al., , 2015Russello et al., 2005;Russello, Beheregaray, et al., 2007;Russello, Hyseni, et al., 2007). Our estimate of F IT (0.257), which was a focal metric used in the previous study (Loire et al., 2013),   Fig. S1). Interpreting significantly positive F IS values, such as the one calculated from our ddRAD-seq dataset, can be difficult (Allendorf & Luikart, 2009). This could be due to substructure within one or more populations, sampling stochasticity, and/or recent demographic changes in relatively small populations. It could also be that such small populations are not necessarily expected to be in HWE due to the increased influence of genetic drift (Allendorf & Luikart, 2009).
To assess whether there was additional genetic structure outside of our a priori assignment of individuals based on their geographic location, we also analyzed the 30 samples in our ddRAD-seq dataset using two methods without prior assignment of each sample to a group. Both principal component ( Figure 2) and Bayesian clustering analyses (Fig. S5) clearly discerned three genetically distinct clusters that corresponded to the samples from the three species tested in our pairwise F ST estimates.
This echoed our per-locus analysis of HWE, which showed that treating all 30 individuals from the three named species as a single population dramatically increased the number of loci out of HWE.
F I G U R E 3 Boxplots of pairwise F ST estimates using 1,000 randomly drawn subsamples of individuals for each sample size (n = 2, 3, or 5) from each population. PBL, CRU and VA correspond to population samples from C. becki, C. porteri, and C. vandenburghi, respectively. (a) is the pairwise comparison of PBL and CRU, (b) is the pairwise comparison of PBL and VA, and (c) is the pairwise comparison of CRU and VA. The horizontal black line in each boxplot marks the F ST value calculated using all 10 individuals from each population in the pairwise comparison (see Table S1). Lower hinge corresponds to first quartile (25th percentile); upper hinge corresponds to third quartile (75th percentile). Whiskers indicate points within 1.5 times the interquartile range (IQR), with outliers indicated as points beyond that range Results of our analyses of population structure using tens of thousands of genomewide SNPs are concordant with earlier studies using mtDNA haplotypes and microsatellite genotypes Beheregaray, Ciofi, Geist, et al., 2003;Beheregaray et al., 2004;Ciofi et al., 2002;Garrick et al., 2015;Poulakakis et al., 2008Poulakakis et al., , 2012Poulakakis et al., , 2015Russello et al., 2005;Russello, Beheregaray, et al., 2007;Russello, Hyseni, et al., 2007). These findings definitively resolve concerns raised by Loire et al. (2013) regarding whether these traditional markers were accurately reflecting the genetic distinctiveness of Galápagos giant tortoise species. Importantly, our results not only revealed the same genetic clustering as earlier studies, but also showed the same patterns of genetic distance. As in the microsatellite studies, we found slightly greater genetic differenti-  Table 2). While qualitatively the same, our F ST estimates are notably higher than those calculated using microsatellites (Table 2), a finding predicted by the mathematics of using biallelic vs. multiallelic loci (Putman & Carbone, 2014), which has also been found in other systems (e.g., Payseur & Jing, 2009).

| Impact of sample size and number of loci on detecting population structure
Population genetic theory (Nei, 1978), simulations (Willing et al., 2012), and empirical work (Reich, Thangaraj, Patterson, Price, & Singh, 2009) support the idea that a dataset of thousands of loci should have the power to detect population structure with high precision, even when only a few individuals per population are analyzed.
We tested this idea with our Galápagos giant tortoise ddRAD-seq SNP data by estimating F ST from subsamples of two, three, and five individuals from each population and comparing them to the same estimates obtained from 10 individuals per population. All tested sample sizes were able to detect significant F ST values, though using three or five samples yielded more precise estimates than using only two ( Figure 3; Tables S1). These analyses are consistent with the idea that accurate F ST values can be estimated using as few as two or three samples per population if thousands of SNPs are analyzed.
Likewise, we found that for highly differentiated populations such as those studied here, hundreds of SNPs were sufficient to accurately describe population structure when ten individuals per population were used. This empirical evidence should be helpful in the design of future conservation genetics studies that aim to describe population structure, in which case additional samples may lead to diminishing returns for improving statistical power. This will be especially useful for endangered or elusive species for which sampling may present a severe limitation.

| Sampling design matters
Our genomewide SNP data detected high and significant differentiation among these three species, even when only two or three individuals from each were used in the analysis (Figure 3). While these results were strongly supported, they failed to explain the discrepancy described by Loire et al. (2013), who used over 1,000 synonymous SNPs from transcriptome sequencing data and found no differentiation between the same three species. Their sample size of five captive individuals does not by itself account for the discrepancy between the two studies, because, as we show above ( Fig. S4), using thousands of SNPs should give sufficient power to detect population structure in Galápagos giant tortoises, even when sample size is that small.  (Caccone et al., 1999;Russello, Beheregaray, et al., 2007;Russello, Hyseni, et al., 2007) compared to haplotypes found in the PBL C. becki population. This choice is problematic for several reasons (detailed in the Appendix S1 section VIII). Most importantly, F-statistics are a reflection of population differentiation, not of phylogenetic relatedness.
Treating the individuals from C. porteri and C. vandenburghi as belonging to the same population biased the F-statistics estimates by leading to an increase in within-group variation, and therefore depressed F ST values. This within-group structure, which distorts F-statistics, is known as Wahlund effect (Wahlund, 1928).
The problem outlined above is clear in our pairwise analysis using >20,000 SNPs, which shows that while the C. becki population sample is about equally differentiated from the C. porteri and C. vandenburghi ones, the ones from C. porteri and C. vandenburghi are more differentiated from each other than from the C. becki population sample (Table 2). To empirically test for the Wahlund effect under this sampling scheme, we simulated a scenario in which three samples from C. becki were compared to a population consisting of one C. porteri and one C. vandenburghi sample. Repeating this sampling scenario 1,000 times, we found significantly depressed mean F ST estimates, as low as 0.075, with 95% of comparisons ranging from 0.052 to 0.127 (Fig. S4a). Even more strikingly, when we limited the analysis to a similar number of markers as Loire et al. (2013) and used 1,000 randomly drawn SNPs, the range of 95% of the estimates increased to 0.031-0.134.

| RNA-seq data support population structure
While our subsampling simulations showed a clear Wahlund effect when samples from two different species (C. porteri and C. vandenburghi) were combined into one grouping, these F ST estimates were still positive (mean F ST = 0.075). We therefore would have expected Loire et al. (2013) to find a similar estimate in their analysis of RNA-seq data, but they reported no significantly positive F ST value. To investigate this discrepancy, we re-analyzed their raw sequencing data by aligning it to a Galápagos giant tortoise reference genome. Using the SNPs from this re-analysis, we estimated an F ST of 0.054, which is similar to our expected F ST under their sampling design (Fig. S4). Our estimates of F IS and F IT for the RNA-seq dataset were negative, a surprising result that may be related to the sampling design, the specific individuals included in that study, or the deviations from HWE that can occur in small populations (Kimura & Crow, 1963). This last point is due to the assumption of large numbers in HWE, which is violated in small populations (Allendorf & Luikart, 2009 (Fig. S6). This pattern of principal components mirrors the one that we found with our 30 sample dataset for the same populations ( Figure 2). These results, which match our expectations based on subsampling simulations (Fig. S4), suggest that the lack of significantly positive F ST values found by Loire et al. (2013) is due not just to small sample size and inappropriate grouping of samples, but also the genotype filters employed in their initial analysis. The original Loire et al. (2013) methods describe a genotype filter that assigns posterior probability to genotypes based on HWE. We suspect that this may not be a reliable method when genotyping a pool of individuals from different species, as these samples will not meet the assumption of HWE. Our SNP calls of their data may have also been improved by mapping the RNA sequence reads to a draft Galápagos giant tortoise reference genome, as suggested by others (Shafer et al., 2016). However, our ddRAD-seq SNP data were called without mapping to a reference, so this methodological difference cannot completely explain the loss of signal.

| CONCLUSIONS
Reduced-representation sequencing offers practical ways to take advantage of the power of population genomics, even when samples and funds are limited (Narum, Buerkle, Davey, Miller, & Hohenlohe, 2013 However, the heterogeneity of samples within a population can confound calculations using small sample sizes in unpredictable ways. Reduced sample size also limits the diversity of analyses that can be performed, especially limiting those that do not rely on a priori population designation, such as PCA and Bayesian clustering algorithms. Ultimately, we found that both our ddRAD-seq data and a re-analysis of RNA-seq data generated by Loire et al. (2013) were consistent with the findings of earlier microsatellite and mtDNA studies. We therefore expect genomewide SNPs to support the conclusions of population genetic studies of Galápagos giant tortoises beyond the three species considered here.
Distinguishing populations and evolutionary lineages, such as the giant tortoise species analyzed here, is a vital role for population genetic analyses to play in conservation (Funk et al., 2012). Results from such analyses can assist in protected area designation (Larson et al., 2014), inform appropriate legal protections (vonHoldt, Cahill, et al., 2016), and guide captive breeding strategies (de Cara, Fernández, Toro, & Villanueva, 2011;Lew et al., 2015). We show that, as long as population genetics theory is carefully taken into account, the use of genomewide data enabled by high-throughput sequencing can be a powerful tool in these conservation efforts, even when sample sizes are limited. proposing the initial problem and stimulating a more thorough investigation into the genomic evolution of these species. We appreciate constructive comments from three anonymous reviewers and L.

DATA ACCESSIBILITY
Raw data from Illumina sequencing are deposited to the NCBI Sequence Read Archive (SRA) for all individuals included in this study and are awaiting review. The vcf file used in the analyses is available on the Dryad Digital Repository: https://doi.org/10.5061/dryad.2hj75.
Microsatellite genotypes and mitochondrial DNA sequences used in the Appendix S1 are available upon request.