Genomic footprints of a biological invasion: Introduction from Asia and dispersal in Europe of the topmouth gudgeon (Pseudorasbora parva)

Abstract Facilitated by the intensification of global trading, the introduction and dispersal of species to areas in which they are historically non‐native is nowadays common. From an evolutionary standpoint, invasions are paradoxical: not only non‐native environments could be different from native ones for which introduced individuals would be ill‐adapted, but also small founding population size should be associated with reduced adaptive potential. As such, biological invasions are considered valuable real‐time evolutionary experiments. Here, we investigated the population structure and adaptive potential of the highly invasive topmouth gudgeon (Pseudorasbora parva) across Europe and East Asia. We RAD‐sequenced 301 specimens from sixteen populations and three distinct within‐catchment invaded regions as well as two locations in the native range. With 13,785 single nucleotide polymorphisms, we provide conclusive evidence for a genome‐wide signature of two distinct invasion events, in Slovakia and Turkey, each originating from a specific area in the native range. A third invaded area, in France, appears to be the result of dispersal within the invasive range. Few loci showed signs of selection, the vast majority of which being identified in the Slovakian region. Functional annotation suggests that faster early stage development, resistance to pollution and immunocompetence contribute to the invasion success of the local habitats. By showing that populations in the invasive range have different evolutionary histories, our study reinforces the idea that populations, rather than species, are the units to consider in invasion biology.


| INTRODUC TI ON
The rate at which species are reported outside their natural distribution boundaries continues to increase (Bock et al., 2015;Simberloff et al., 2013). Because non-native species can rapidly establish populations in new locations, they can exert a rapid and severe impact on native ecosystems (Ehrenfeld, 2010), yet not necessarily be detrimental to society and economy (Davis et al., 2011). To understand the whole impact of a biological invasion is complex, and evolutionary approaches should aim to characterize invasion pathways as well as the determinants underlying the successful establishment of non-native species (Colautti, Grigorovich, & MacIsaac, 2006;Cristescu, 2015). It is well known that ecological or demographic factors such as propagule pressure (the number of individual arriving at a non-native range) or plasticity in key life history traits (e.g., age at first maturity, reproductive success, faster growth) are important to colonization success and establishment of populations (Britton & Gozlan, 2013;Simberloff, 2009). The role of evolutionary mechanisms such as drift, selection, or adaptive phenotypic plasticity, on the other hand is less well understood (Bock et al., 2015). However, well documented cases do exist, including that of the invasive cane toad (Bufo marinus) in Australia, where the rapid evolution of adaptive plasticity facilitated dispersal success (Rollins, Richardson, & Shine, 2015).
The use of genetic tools is often applied to infer colonization processes and identify source/sink relationships between native and non-native range (Estoup & Guillemaud, 2010). The use of genomics in the context of biological invasion is still in its infancy, thus leaving unanswered questions about the role of fine-scale population structure, adaptive potential or hybridization as a predictor of invasion success. In this work, we intend to fill in this knowledge gap by investigating the genome-wide signature of one of the most successful cases of biological invasions worldwide, that of the topmouth gudgeon (Pseudorasbora parva).
This small freshwater cyprinid was introduced as a byproduct in several countries surrounding the Black Sea in the early 1960s (Gozlan et al., 2010). Chinese carp imports for aquaculture purposes acted as a transport vector from China to Europe. Romania, Hungary, Lithuania and Ukraine are amongst the first sites where it was introduced (Gozlan, 2011;Gozlan, Pinder, & Shelley, 2002).
The subsequent establishment of invasive topmouth gudgeon populations occurred at an extremely rapid rate: westward dispersal promoted by human activities, such as translocations of farm fish, recreational fishing or ornamental fish trade, introduced fishes from Hungary into former Czechoslovakia and Germany (1970s); southwards dispersal into Turkey occurred from an introduction point in the Black Sea most likely Bulgaria (Gozlan et al., 2010). First detections in western Europe date back as late as the 1970s, but colonization was only reported in the 1990s (Gozlan et al., 2010). As a result, in 50 years, the distribution of topmouth gudgeon in its invasive range spanned Central and Western Europe, the British Isles and the Turkish territory in Asia Minor and Central Asia. The successful spread of topmouth gudgeon populations in the new Eurasian range is hypothesized to be facilitated by the high plasticity of life history traits, such as short generation time, multiple spawning or small body sizes and reproductive biology that ensures a high reproductive success from as early as the first year of life (Britton, Davies, & Brazier, 2008;Rosecchi, Thomas, & Crivelli, 2001;Yan & Chen, 2009). The species also presents highly plastic responses to a series of environmental factors, such as temperature, population density, physical alterations of the habitat and predator density (Gozlan et al., 2010). The successful establishment of invasive populations of topmouth gudgeon had a series of negative impacts on native ecosystems. For example, the high densities exert intense competition for food resources with native fauna that shares the same trophic niche (Britton, Davies, & Harrod, 2010).
Topmouth gudgeon are also healthy carriers of the intracellular pathogen Rosette agent Sphaerothecum destruens, a deadly protist, which is virulent to a large number of European native freshwater fish species (Andreou & Gozlan, 2016;Combe & Gozlan, 2018).
Molecular studies on topmouth gudgeon populations have to date focused on the characterization of the introduction and dispersal pathways in Eurasia. Attempts to understand the invasion pathways from genetic signatures have used mitochondrial DNA (Simon et al., 2011), microsatellites (Simon, Gozlan, Britton, Van Oosterhout, & Hänfling, 2015) or both (Hardouin et al., 2018). The native topmouth gudgeon populations are composed of four deep mitochondrial lineages, two of which constitute the mitochondrial genetic background of all non-native populations in Europe (Simon et al., 2011). Those lineages originating from China represent a phylogeographic break promoted by the Qinling mountains (Hardouin et al., 2018). At the nuclear level, populations in the native range are more genetically diverse, an observation explained by traditional activities of translocation performed in China over the last 2,000 years (Hardouin et al., 2018). The complex population structure and high diversity observed in the native range of the species suggests, from a molecular perspective, an equally complex invasion scenario offering a unique opportunity to investigate an invasive species' adaptive potential. Increasing sequencing depth enables covering larger portions of the genomes and clarifying background genomic differentiation while at the same time increasing the chance of identifying candidate loci under selection. In this study, we performed restriction-associated DNA tags sequencing (RADseq) on specimens collected from 16 populations in Slovakia, Turkey and France representing a chronological gradient (from known records) of invasion. We also added two populations from the native range, identified by Hardouin et al. (2018) as representatives of the two putative invasive mitochondrial lineages.
By screening the genome-wide diversity of topmouth gudgeon along invaded regions, we (a) characterized the invasion process and hypothetical dispersal in the invasive range; (b) identified candidate loci under selection and (c) characterized their distribution to determine their role in the invasion success of this species. We hypothesize that successful establishment was facilitated by the same loci evolving in parallel across non-native regions.

| Sampling scheme of invaded regions
A total of 301 topmouth gudgeon were collected for this study.
Samples were distributed across four geographic areas, one in the native range and three in the invasive range, resulting in a total of 16 locations, hereafter populations. In the native range, we used fish collected from the two major catchments in China: the Yangtze (CN-Yan) and Yellow river (CN-Yel). Fish from these locations had their mitochondrial DNA partially sequenced and showed highly distinct lineages (Hardouin et al., 2018). In the invasive range, we collected fish from eight sites within the Danube catchment -from two tributaries in Slovakia: three sites from the Vah main catchment, two sites from a branch of the Vah that passes through Nitra and three other sites from the Hron; from four sites in the Sarçay stream, Muğla, Turkey and from two sites from the Garonne catchment in France (Table 1). This way, we aimed to analyse three distinct regions of the invasion range ( Figure 1).

| DNA extraction, library preparation and restriction-associated DNA tag sequencing
All fish samples were stored in 90% ethanol prior to extraction.
Genomic DNA was extracted with Qiagen DNeasy Kit Blood and Tissue kit (Hilden, Germany) following the manufacturer's instructions. Library preparation, sequencing and bioinformatic processing of raw reads (demultiplexing of individual barcodes, removal of adaptors and barcodes and trimming) was performed at the GeT-PlaGe core facility, INRA Toulouse, France. Libraries preparation followed Baird et al. (2008). Libraries were constructed by pooling 48 samples after individual barcoding. Samples were digested with SbfI restriction enzyme and individually barcoded. The sample pools were sonicated, size selection was performed using SPB beads with adaptors being ligated prior to sequencing. Ten PCR cycles were used to amplify libraries. Library quality was assessed using an Advanced Analytical Fragment Analyser and libraries were quantified by Quantitative PCR using the Kapa Library Quantification Kit.
Sequencing took place on an Illumina HiSeq3000 using a single-end read length of 150 bp Only reads with Q > 30 were used in this study.

| Filtering and processing of RADseq data
Since there is no reference genome of the topmouth gudgeon, we   The complete data set was then processed with the chosen parameter's combination of m = 3, M = 4 and n = 4. To avoid overrepresentation of rare loci, the catalogue of loci formed with the subsampled data set was used as reference (Rodríguez-Ezpeleta et al., 2016). At the populations step, we followed the 80% rule suggested by Rochette & Catchen (2017) for the parameter r. Only one SNP per loci was kept avoiding linkage disequilibrium bias in downstream analyses.
To identify candidate loci under selection in each of the three geographic regions of the invasive range, we created a subset of SNPs per region, hereafter rSNPs. This was performed by running StackS pipeline with fish from Slovakia, Turkey and France separately, but matching SNPs against the corrected catalogue loci. While we acknowledge we may have overlooked region-specific loci, grouping populations per region prior to outlier detection scans is in line with our hypothesis that successful establishment was facilitated by the same loci evolving in parallel across non-native regions. Importantly, we intended to relax the violations to baseline assumptions common to currently outlier detection methods: low overall differentiation, symmetric migration rates and similar evolutionary and demographic history (Lotterhos & Whitlock, 2014). Lastly, all data sets were filtered out for loci with more than 20% missing data with VcFtoolS

| Genetic diversity estimates -comparisons between invasive and native ranges
We calculated the observed heterozygosity (H o ), number of private alleles (pA) and number of variant sites (vS) in StackS. To account for possible confounding effects linked to individual coverages (Trucchi et al., 2016), genetic diversity indices were tested in a linear model that included the individual coverage and range as explanatory variables: fit = aov(diversity estimate ~ coverage + range).
Models were corrected for the use of sequential sum of squares analyses using drop1(fit, ~., test = "Chisq"). Absolute values of pA and vS were standardized for the depth of coverage at each population, and log transformed to normalize the distribution prior to statistical analyses.
Afterwards, we performed ANOVAs to compare diversity estimates between ranges and among regions. Here we define "range" as native and invasive and "region" as Slovakia, Turkey, France and China, under the null hypothesis that the native range harbours a higher genetic diversity. Note that to investigate the role of range and region in this work, two independent ANOVAs had to be performed. This is because the native range includes one region only. All statistical analyses were performed in r 3.4.1 (R Core Team, 2017).

| Population structure at the invasion front and relationship with native sites
To reconstruct the pathway of the topmouth gudgeon's introduction, investigate invasive range expansion and identify candidate loci associated with invasion success, we determined the genetic structure across the species' spatial distribution. Population structure was investigated by a multifold approach. We started by visually inspecting the distribution of pairwise F ST , obtained with StackS, in a heatmap and with a hierarchical cluster analysis on the between-individual genetic distance dissimilarity matrix -visualized in a dendrogram -using the Bioconductor's package SNPrelate (Zheng et al., 2012). We also investigated the distribution of molecular variance at two maximum hierarchic levels, with AMOVAs (10,000 permutations) implemented in arlequiN version 3.5 (Excoffier & Lischer, 2010 were also analysed in arlequiN (10,000 permutations).
We then used FaStStructure (Raj, Stephens, & Pritchard, 2014) to infer the likelihood of fine-scale population structure. Three independent iterations for values of K ranging from 1 to 17 were performed. The most likely number of Ks was assessed with the algorithm chooseK (Raj et al., 2014). Visualization of admixture proportions was done by constructing membership probabilities plots with the r package adegeNet (Jombart, 2008). In order to obtain information on allele frequency divergence between native/ non-native regions, we used Structure v2.3.4 (Pritchard, Stephens, & Donnelly, 2000) with K = 2 for the following pairwise comparisons: Yangtze River/France, Yangtze River/Turkey, Yellow River/ France and Yellow River/Slovakia. Conditions were set to default with a MCMC length of 5 × 10 6 generations and burnin of 5 × 10 5 .
Three independent iterations were performed. We used an admixture model where a, the degree of admixture, was inferred from a uniform prior, with initial a = 1, max = 10.0 and SD = 0.025. The frequency model was set to correlated allele frequencies amongst populations. Comparison of average allele frequency divergences was performed in r.
We then performed a discriminant analysis of principal components (DAPC) to clarify sublevels of population structure. While DAPC is a multivariate method that maximizes genetic differentiation between predefined groups, we rather explored the versatility of find.clusters function by removing a priori population assumptions, but restraining the search for K to a maximum of 16 (total number of our populations).
Best-fit number of clusters were verified by Bayesian Information criteria (BIC). DAPC is implemented in the r package adegeNet (Jombart, 2008). Upon identification of the best-fit model, we investigated which location, within each region, was probably the most recently colonized.
For that, we compared pairwise F ST between sampled locations and putative source, under the assumption that higher F ST would be a proxy for older introduction time.

| Detection of hybrid classes among and within invasive lineages
With the original data set, we investigated the possibility of hybrids (F1, F2 and backcrosses) between lineages as present in the native range. The objective was to test whether hybridization can contribute to the successful colonization in the invasive range.

| Scanning for candidate loci under selection
Outlier detection methods were independently applied to all combined populations within each region. We used the outFlaNk v0.2 which separates the variance caused by (a) the existence of a finite real number of demes, (b) sampling a finite number of individuals per deme and (c) spatial selection from a spatially heterogeneous selection on a specific locus (Whitlock & Lotterhos, 2015). Due to the recent evolutionary timeframe of the invasion, we expect that loci hypothetically under selection occur at lower frequencies.
Therefore, we relaxed the filtering step to 40% of missing data per locus and considered a Hmin of 0.05.

| Environmental correlations
Water velocity in rivers is a factor that might favour the establishment of fish populations, and particularly those of the topmouth gudgeon a species that prefer lentic conditions for reproduction (Boltachev, Danilyuk, Pakhorukov, & Bondarev, 2006). Because the Slovakian sampling sites were distributed between two separate tributaries, the river Váh, with an average river discharge of 196 m 3 /s and the river Hron, with an average discharge of 57.3 m 3 /s, it was possible to investigate if current speed acts as a selective pressure to the establishment of topmouth gudgeon populations. The rationale is that loci highly ranked in Bayes Factor (BF) are possibly affected by the presence of outliers. Hence, candidate loci whose allelic frequencies covary with environmental factor would be suggestive of selection. Environmental correlations were investigated on Slovakia's regional SNP panel and performed in bayeNV2 (Günther & Coop, 2013). BF threshold for positive correlations was defined by BF > 10 (Jeffreys, 1998).

| blast and gene ontology terms of candidate loci
Candidate loci were blasted against eNSembl database of annotated genomes, and if no results, again blasted against Ncbi database.
The search was restrained to the available genomes of fishes. We filtered out low complexity regions and defined sensitivity for short sequences. Meaningful hits were defined as those that overlapped at least 75 bp against the database while reporting an Evalue < 10 -4 (Altschul, Gish, Miller, Myers, & Lipman, 1990). Gene ontology terms (GO) of each successful hit from ENSEMBL were recorded.

| Sequencing statistics and variant calling
A total of 301 fishes were used in this study. On average, 1.3 × 10 6 reads (SE ± 1.31 × 10 5 reads) were used per individual. After inspection of coverage and number of loci, we kept the combination m = 4, M = 5 and maintaining M = N as parameters ( Figures S1 and S2). The average individual coverage was 28.7× (SE ± 2×). Coverage metrics divided by region and range are presented in Figures S3 and S4. The total number of loci kept after all filtering steps was 13,768.

| Genetic diversity estimates in native and invasive range
Comparisons  (Table S1).

| Population structure, pathways of invasion and hybridization
The population structure of sampled sites investigated with FaSt-Structure revealed the likely K to range between three and four, but visual representation of membership probabilities suggested K = 3 to be a more robust outcome (Figure 3). For K = 3, French populations appeared as a distinct cluster, with Slovakia and Turkey belonging to the same cluster of the Yellow River and the Yangtze, respectively.
Allelic divergence between tested range pairs revealed that Slovakian populations and that of its putative source, the Yellow River, were more genetically similar than any other native/non-native pairs and that French populations were more similar to those of the Yellow River rather than those of the Yangtze ( Figure   S6). Discriminant analysis revealed three clusters (retaining 200 PCs and plotting two loadings Figure S7a S l o v a k i a F r a n c e T u r k e y Y e l l o w S l o v a k i a F r a n c e T u r k e y Y e l l o w S l o v a k i a F r a n c e T u r k e y Y a n g t z e Y e l l o w S l o v a k i a F r a n c e T u r k e y Y a n g t z e Y a n g t z e Y a n g t z e Y e l l o w S l o v a k i a F r a n c e T u r k e y Y a n g t z e Y e l l o w S l o v a k i a F r a n c e T u r k e y Y a n g t z e Y e l l o w S l o v a k i a F r a n c e T u r k e y Y a n g t z e Y e l l o w S l o v a k i a F r a n c e T u r k e y Y a n g t z e Y e l l o w S l o v a k i a F r a n c e T u r k e y Y a n g t z e Y e l l o w regions = 0.258, average FST within regions = 0.008, t = 18.856 p < .001). Noteworthy, the two populations from China were significantly different from one another (F ST = 0.188, p < .01). This differentiation at the native range carried into the invasive range. This was confirmed by the clustering dendrogram, where two highly differentiated branches, each harbouring a Chinese population and either one or two regions from the non-native range, stem from a common ancestry ( Figure S8).
Comparing pairwise differences between putative native sources and respective sinks in the invasive range revealed that locations within Turkey diverge more from Yangtze river than the Slovakian ones do from the Yellow river (average FST Turkey/Yangtze = 0.105, average FST Slovakia/Yellow = 0.05, t = 3.769 p = .032). Within Slovakia, the Vah river populations revealed to be less divergent from that of the Yellow river, although differentiation was not significant.
Inferences on possible hybridization revealed the existence of two pure lines whose individuals possessed a 100% pure genome.
All individuals from Slovakia and the Yellow River (Pure 1) form the first line, while all individuals from Turkey and the Yangtze (Pure 2) belong to the second group. Interestingly, the large majority of individuals collected in France belong to Pure 1 genomic background and showed between 5%-75% of their genomes to have arisen from backcrosses. Noteworthy was a single individual exhibiting the genomic make-up of a F1 ( Figure S5). The allelic frequencies of a total of 30 loci correlated with river speed (for a BF > 10), one of which overlapped with outFlaNk.

| Candidate loci genetic diversity: Relationship among invasive regions in relation to native location
The comparison of Ho between neutral and candidate loci across ranges and regions revealed a model including region fits better than a model including range as variable (ANOVA: F (model range vs model region) = 15.60; df 1,3 ; p < .001, RSS model region = 1E −7 , model range = 5E −7 ). Thus, a model with regions was preferred and proven to overall explain 71% of Ho variation (R 2 = .71; F = 12.18; df 7,24 ; p < .001).
Furthermore, it revealed a negative and significant effect of candi-

| Candidate loci to discriminate population structure
We assumed that candidate loci could resolve fine-scale population structure at least within the Slovakian region, where candidates were found. Clustering analyses revealed K = 2 as the model complexity maximizing marginal likelihood scores. AMOVA suggested that Slovakian, French and Turkish populations formed a single group while the Chinese populations of the Yellow river and of the Yangtze formed two independent groups (F CT = 0.335, p < .001).
DAPC analyses were less conclusive than those of neutral loci.
Plotting the principal components for K = 3 (retaining 10 PCs and plotting two loadings (Figure S7b), as a term of comparison to neutral loci, revealed a more scattered graph where it is still possible to identify Turkish populations and respective source (China-Yangtze River), while French, Slovakian and the Chinese population of the Yellow River grouped together (Figure 4b).

| blast and gene ontology terms of candidate loci
Amongst the 21 candidates identified across invaded regions, seven detected in Slovakia showed a positive blaSt result against available fish genomes, supported by multiple species hits, amongst the most common were the cavefish (Astyanax mexicanus), codfish (Gadus morhua), zebrafish (Danio rerio) and the stickleback (Gasterosteus aculeatus) ( Table 2). Biological processes were linked to negative regulation of cell cycle, protein transport and cell differentiation of eye and limb developmental processes. Blast against Ncbi database produced two additional hits -one on the common carp Cyprinus carpio and one on a cyprinid endemic to the Chinese region of Yunan, Sinocyclocheilus rhinocerous -associated with mRNA transcripts-like involved in defence response to protozoan or regulation of T-cell differentiation and glutathione processing (Table 3).

| Genomic diversity between native and invasive range
Population genetics of biological invasions has mostly been built around the paradigm of low diversity in non-native populations as a consequence of a genetic bottleneck associated to founder effects (Allendorf & Lundquist, 2003). When comparing average observed heterozygosity estimates among invasive populations and those in the native range, we found that invasive populations have on average, a higher observed heterozygosity and a higher number of variant sites than those in the native range. Higher genetic diversity among invasive populations, however, is not an uncommon observation and is frequently attributed to multiple introductions from genetically distinct sources (Bock et al., 2015). Amongst the most well-known cases are wetland grass (Phalaris arundinacea) introduced in North America from European native regions (Lavergne & Molofsky, 2007) or that of the brown anole lizard (Anolis sagrei), whose invasive populations in Florida were traced back to eight genetically distinct native sources (Kolbe et al., 2004). Even though higher genetic diversity among invasive P. parva populations has previously been reported, it was later attributed to an unbalanced sampling design lacking native populations (Hardouin et al., 2018;Simon et al., 2015). Here, our data mainly show a source-sink population system. Because no ad-  genetically similar to those from Slovakia/Yellow River could have been introduced in France, perhaps from Armenia, according to empirical record (Gozlan et al., 2010), as opposed to scenarios with a direct sourcing either from Slovakia or from the Yellow River. It is possible that the time since introduction, approximately 30 years, was not enough for divergence to conspicuously manifest at the genome-wide level (Hey, 2006 (Lin et al., 2017) and the Asian tiger mosquito (Goubert et al., 2017).

| Signatures of the invasion processdisentangling introduction and dispersal
While range expansions are known to affect the genotypic composition of neutral standing genetic variation (Excoffier, Foll, & Petit, 2009), expectations for adaptive variation are extrapolated from heterozygosity-fitness correlations (Peischl & Excoffier, 2015). If that is correct, adaptive loci are predicted to have a relatively weak effect on the individual fitness component (Peischl & Excoffier, 2015 (Regoli & Principato, 1995;Ren et al., 2002;Timofeyev et al., 2004). Danube tributaries in the region of former Czechoslovakia, and specifically the river Vah, are documented to be polluted with heavy metal concentrations and chemicals released from the paper industry (Füllenbach, 2017;Gondová, Janiga, Hundža, & Solár, 2017). It is possible that in this case, pollution indicators could be a factor covarying with river flow speed where for instances more stagnant waters -which this species favours -tend to accumulate more pollutants residues over time. The other locus matched a mRNA transcript associated with adaptive immune response.
Given the relationship between topmouth gudgeon and the rosette agent Sphaerothecum destruens for which the topmouth gudgeon is a healthy carrier (Andreou & Gozlan, 2016), the identification of loci associated with immune response could be a signature of the fish immunocompetence, further facilitating the species' establishment among native fish communities (Combe & Gozlan, 2018).

| Candidate loci to discriminate population structure
No evidence for fine-scale population structure was found with candidate loci. The role of putative candidates in defining fine-scale population structure is probably hampered by the relatively young age of invasive populations. While recent works suggest that candidate loci increase the resolution at which population structure can be detected, i.e., hake (Merluccius merluccius) in the Mediterranean (Milano et al., 2014) or the Eastern Atlantic wrasse (Symphodus tinca) (Carreras et al., 2017), it may, however, not apply to newly established systems. Noteworthy, and contrary to what was observed with the full data set, molecular variance among candidate loci discriminated the native from the invasive range, suggesting that divergence at these loci is accumulating at a faster pace than that of the background genome. The observation that candidate loci can discriminate between native and non-native range suggests that this type of markers are potentially effective to document biological invasions whose history is not as well documented as that of the topmouth gudgeon.
In conclusion, the detection of occurrence and spread of biological invaders are dependent on human observation and documentation. Population genetics provide temporal and spatial depth to those observations and enable the characterization of invasion pathways, dispersal and establishment in the new range (Estoup & Guillemaud, 2010). Our study describes the genomic signature in one of the most successful fish invaders. We show that genetically dissimilar source populations, each with specific evolutionary histories, could prompt distinct genomic response of adaptation. Disentangling whether related to selective pressures in the introduced environments or to methodological caveats linked to recent divergence times, is open to future exploration. Nevertheless, considering the role of adaptive genetic variation in establishment success, we uphold the suggestion that factors underlying successful invasions should be decomposed at the population level. Together, we show that evolutionary histories in native and non-native ranges determine the genetic make-up of invasive populations contributing to their adaptive potential and successful establishment.

ACK N OWLED G EM ENTS
This study is part of the project PROBIS (Biodiversa) and financially supported by ONEMA to S.B., and J.C., DFG (EI 841/7-1) to C.E. received funding for this research from BIODIVERSA.

DATA AVA I L A B I L I T Y S TAT E M E N T
Raw data is available under the project number PRJNA560205 at NCBI's SRA.