Adapting legume crops to climate change using genomic approaches

Abstract Our agricultural system and hence food security is threatened by combination of events, such as increasing population, the impacts of climate change, and the need to a more sustainable development. Evolutionary adaptation may help some species to overcome environmental changes through new selection pressures driven by climate change. However, success of evolutionary adaptation is dependent on various factors, one of which is the extent of genetic variation available within species. Genomic approaches provide an exceptional opportunity to identify genetic variation that can be employed in crop improvement programs. In this review, we illustrate some of the routinely used genomics‐based methods as well as recent breakthroughs, which facilitate assessment of genetic variation and discovery of adaptive genes in legumes. Although additional information is needed, the current utility of selection tools indicate a robust ability to utilize existing variation among legumes to address the challenges of climate uncertainty.

After the cereals, legumes are the most agriculturally important crop family (Graham & Vance, 2003) as they have multiple uses, ranging from animal forage and aquaculture feed to human food. Legume grains are appreciated for their protein content, in particular among lowincome families or where people avoid eating meat for religious or ethical reasons (Young, Mudge, & Ellis, 2003;Zhu, Choi, Cook, & Shoemaker, 2005). Legumes contain substances beneficial to health such as folate, lignans, saponins, antioxidants, dietary fibre, and resistant starch, and have the potential to offer protection against some cancers (American Institute for Cancer Research, 2014), diabetes, and obesity (Dove et al., 2011). Due to their symbiotic nitrogen-fixing characteristics, legumes have a crucial role in natural ecosystems as well as in sustainable farming through their contribution in crop rotations and increasing soil fertility in arid areas and where nitrogen is low (Zahran, 1999).

| CLIMATE CHANGE EFFECT
Demand for agricultural products continues to rise due to the population growth and increased food consumption per capita. Land-use change and climatic variations are intensifying competition for resources (land, water, and energy;Abberton et al., 2015;Gomiero, 2016). Climate change impacts several aspects of agricultural systems, from altering flowering phenology, water availability, soil fertility and erosion, increase in pathogen spread, and host susceptibility (Rosenzweig & Hillel, 1995) to more subtle shifts in plant distribution and biodiversity (Bakkenes, Alkemade, Ihle, Leemans, & Latour, 2002), and plant-pollinator interactions (Bishop, Jones, O'Sullivan, & Potts, 2016).
The combined effects of climate change on our agricultural systems can cause crop failures worldwide and lead to food insecurity.
The complex challenge is best tackled with a joint approach that highlights the need for an increase not only in productivity (i.e., yield) and diversity of our crops, but also efficiency (i.e., water, land, and nutrient use; Abberton et al., 2015).

| ADAPTATION PRIORITY IN REGIONAL AREAS?
Although climate change is a global threat, its direction and severity is not spread equally across continents and even regions. For example, although Mediterranean countries in Europe will experience frequent droughts, northern Europe is expected to become more of a Mediterranean climate. In Asia, more floods are expected in countries such as Bangladesh, due to the increase in severity of monsoon rains, whereas some others may experience decline in precipitation. In Africa, rise in temperature is predicted to increase desertification (Hopkin, 2005).
As reconfirmed by the Global Climate Risk Index analyses, less developed countries are most vulnerable to climate change risk (Kreft, Eckstein, & Melchior, 2017), and hence their agriculture and food security will be also negatively affected, such as in Sub-Saharan Africa as reviewed by Kotir (2011). Although the fragile nature of these regions makes them of great priority for maintaining agricultural productivity, to develop effective plans and make the best use of funding, ecosystem integrity needs to be taken into the account as well. In this context, Hannah et al. (2013) introduced several regional and global adaptation priorities by modelling the changes in agricultural suitability of 15 major rainfed staple crops, as well as biodiversity changes of 1,263 bird species. However, concerning crop development and breeding, we suggest that identifying climate-related changes in biodiversity of crop wild relatives (CWR) along with farming suitability provides a more holistic approach to develop priority schemes.

| WHICH TRAITS ARE IMPORTANT AS A TARGET OF BREEDING?
Yield is often the primary target of breeding, however, domestication traits such as flowering time, alkaloid content, and pod indehiscence have been the long-term targets of breeding experiments as they contribute to yield total and quality. The multifaceted significance of flowering time in ecological, evolutionary, and adaptation processes makes it unique among traits that affect plant fitness (Elzinga et al., 2007;Franks, 2015;Weller & Ortega, 2015). A global search on 116 Northern Hemisphere plant families, including several species of legume, found global phylogenetic signals in the direction and magnitude of flowering time shifts, led by selection under climate change (Rafferty & Nabity, 2017). However, explaining the variation among or within species, and whether these shifts are sufficient for survival, remains unclear (Visser & Both, 2005).
Thus, unravelling the genetic basis of flowering time variation is of great importance for breeding purposes (Nelson, Berger, & Erskine, 2010).
Efforts to alleviate the impact of climate change have led to increased research into traits such as drought and heat tolerance, as well as biotic stresses (Abberton et al., 2015;Doebley, Gaut, & Smith, 2006;Gepts, 2010). Soil and water salinity, which have been exacerbated due to the climate change driven factors such as sea level rise and shifts in precipitation (Teh & Koh, 2016), are also one of the major restrictions in the production of crops including legumes (Russell, 1976;Shrivastava & Kumar, 2015).
Efficient capitalization of elevated CO 2 levels are of importance as this affects not only the plants, but also the rhizosphere microbial community structure and interaction (Drigo, Kowalchuk, & Van Veen, 2008). In addition to these adaptive traits, climate change is also forcing us to breed for novel traits. For example, genes that limit the production of methane in ruminants are being sought in forages, such as subterranean clover (Trifolium subterraneum), to mitigate greenhouse gas emissions .
Applying theoretical advances made possible through genomics studies can have practical outcomes, such as enabling plant breeders to accelerate the domestication of promising wild species. For example, in southwestern Australia where the legume farming system was dominated by clover and annual medics (Medicago spp.) prior to the 1990s, it became possible to expand legume diversity by incorporating traits such as deeper root systems, acid-soil tolerant root nodule symbiosis, and pest and disease tolerance, and eight species of legumes were newly domesticated including Ornithopus sativus, Biserrula pelecinus, Trifolium glanduliferum, Trifolium dasyurum, Trifolium spumosum, Trifolium purpureum, Medicago sphaerocarpos (Nichols et al., 2007;Nichols, Loi, Nutt, Snowball, & Revell, 2010), and Melilotus siculus (common name messina; Rogers et al., 2011).

| UNDERSTANDING GENETIC VARIATION
Knowledge of the extent and distribution of genetic diversity is essential for the efficient use of these resources in plant breeding programs.
In order to understand the adaptation process, we must enhance our knowledge of mutations, genetic diversity of adaptive traits, phenotypic effects of genetic variants, and the interaction between the environment and genetic variation (Wright & Gaut, 2005). Darwin (1859) considered domestication as a model of adaptation from which the nature of variation and selection could be inferred.
Domestication of plants has been crucial to the development of human civilization by enabling an abundance of food (Diamond, 2002). However, as a consequence of this human intervention and historic population bottlenecks associated with it, the genetic diversity of most domesticated crops has been vastly reduced compared with their wild progenitors (Diamond, 2002;Gepts, 2010;Glémin & Bataillon, 2009). Reduction in genetic diversity of cultivated legumes compared with their wild relatives and ancestors has been discussed for different plants such as common bean Bitocchi et al., 2013;Gepts, 1990), soybean (Hyten et al., 2006;Lam et al., 2010), narrow-leafed lupin (Lupinus angustifolius; Berger, Buirchell, Luckett, & Nelson, 2012), and chickpea (Varshney et al., 2013). Interestingly, a study illustrated that reduction in genetic diversity as a result of domestication could go beyond the plant itself, as lower sequence variation was observed in rhizobia from domesticated chickpea compared with those from the wild type, which may suggest the potential negative impact of chickpea domestication on symbiosis (Kim et al., 2014). Shifts in genetic variation as a result of domestication, crop expansion, and breeding highlight the need for conserving and management of genetic resources for future breeding attempts.
6 | RESOURCES AVAILABLE IN LEGUMES 6.1 | Germplasm collections Germplasm collections are the cornerstone of genetic resource conservation and management. Preserving wild plant populations in their natural habitat in situ will not only conserve diversity, but also ensures the extension of evolutionary processes that could lead to adaptive traits and new genetic and genotypic diversity for a wide range of species (Hawkes, 1991;National Research Council, 1993). The importance of in situ conservation of wild populations in particular for preservation of threatened Mediterranean legume genus such as Lens, Lupinus, and Cicer, due to the loss of natural habitats, ecosystems, and genetic diversity have been discussed and demonstrated (Maxted & Bennett, 2001;Walter & Gillett, 1998). However, there are currently few case studies of in situ conservation in legumes for wild species (Ajlouni, El-Oqlah, Al-Ghzawi, Al-Tawaha, & Amri, 2010). However, it is also important to consider the in situ/on farm conservation of landraces and heterogeneous populations as a crucial aspect of the conservation of crop germplasm (Brush, 2000).
In the early 20th century, Nikolai I. Vavilov was among the first to recognize the significance of collecting and preserving plant materials ex situ. Vavilov's scientific expeditions resulted in conservation of genetic resources across different plant species including legume family members, such as white lupin (Lupinus albus), mung bean (Vigna radiata), chickpea, lentil, and pea (Pisum sativum; Kurlovich et al., 2000). Today, the extent of legume ex situ germplasm collections stands second only to the cereals, with a total of 1,041,345 accessions, out of which common bean represents the biggest group with 261,968, followed by tepary bean (Phaseolus acutifolius), scarlet runner bean (Phaseolus coccineus), lima bean (Phaseolus lunatus), and soybean, which collectively represent 156,849 accessions (Smýkal et al., 2015).
For detailed information on the current number of accessions in legume germplasm collections, see Smýkal et al. (2015). These germplasm collections provide researchers with a great source of genetic variability that could be utilized in breeding for climate resilient crops (Hawkes, 1991).

| Molecular markers and whole-genome resequencing
In the last few decades, innovations in genomics-based techniques and platforms have provided a wealth of genetic and genomics resources (Varshney, Graner, & Sorrells, 2005) that revolutionized research in both model and crop legumes. In particular, the increased application of molecular markers and reference genome sequences have had a substantial impact in accelerating progress in plant breeding and helping to incorporate new genetic diversity from germplasm resources.

| FINDING ADAPTIVE GENES AND ADAPTIVE TRAITS
Currently, there are two general methods to identify genes and mechanisms related to important agronomic traits in plant species, known as "top-down" and "bottom-up." The top-down approach begins with a phenotype of interest followed by forward genetic analysis to identify candidate genes. Contrastingly, bottom-up approaches use population genetic analyses to identify signatures of adaptation in a set of potentially adaptive genes, and then apply bioinformatics and reverse genetic tools to associate selected genes to a phenotype (Ross-Ibarra, Morrell, & Gaut, 2007;Wright & Gaut, 2005). 7.1 | Top-down approach (linkage analysis) 7.1.1 | QTL mapping Two popular genetic analyses used in the top-down method are QTL and association or linkage disequilibrium (LD) mapping. QTL mapping is the more traditional approach and has been successful in identifying genomic regions associated with adaptive traits. For example, soil salinity is one of the major limitations for successful germination and plant growth in soybean (Essa, 2002), and several QTL mapping studies have identified loci conferring salinity tolerance (Do et al., 2017;Ha et al., 2013;Hamwieh et al., 2011;Lee et al., 2004).
Drought is a major limitation in the production of many legumes and has been targeted in various QTL studies to search for loci and genes conferring tolerance that has led to the breeding of crops with greater drought tolerance. A recent successful example is a study by Varshney et al. (2014) in chickpea, where they found several main effect and epistatic QTLs, among which, one QTL cluster was suggested as a "QTL-hotspot," a candidate genomic region for several drought tolerance and root traits in chickpea Kale et al., 2015). Later, applying a marker-assisted backcrossing approach, this QTL-hotspot was introgressed into a popular Indian chickpea variety (JG 11), which improved several root traits including rooting depth, root length density, and root dry weight (Varshney et al., 2016). This work was extended to several elite varieties in India and Africa . Applying a QTL-seq approach, that is, the identification of QTLs by whole genome resequencing from two bulked populations (Takagi et al., 2013), candidate genes for several traits under rainfed conditions (100-seed weight, root/total plant dry weight) were rapidly identified in chickpea, and three genes have since been validated (Singh, Khan, Jaganathan, et al., 2016).
QTL analysis in cowpea led to the identification of five genomic regions accounting for 11.5-18.1% of phenotypic variation for heat tolerance (Lucas et al., 2013) as well as three loci associated with heat-induced browning of seed coats (Pottorff et al., 2014). QTL mapping accompanied by synteny analysis revealed candidate genes for resistance to Macrophomina phaseolina, a fungal pathogen of cowpea (Muchero, Ehlers, Close, & Roberts, 2011). A recent study in pigeonpea used three different mapping populations and genotyping by sequencing to construct dense genetic maps that revealed 14 significant QTLs for resistance to fusarium wilt . Similarly, QTLs have been identified for sterility mosaic disease in pigeonpea . In addition, a QTL-Seq approach has been used to map Fusarium wilt and sterility mosaic disease in pigeonpea (Singh, Khan, Saxena, et al., 2016). In wild lentil (Lens ervoides), a recent study by Bhadauria, Ramsay, Bett, and Banniza Although the identification of QTLs and candidate genes is relatively routine when a suitable population is available with good quality genotypic and phenotypic information, the translation of this information to the development of improved varieties can be challenging, and this method does have several limitations. For instance, developing mapping populations is difficult for some plants, such as those propagated vegetatively, perennial and polyploid species. In tetraploid alfalfa, which is a perennial species, availability of limited number of markers in a polyploid genome restricts the saturation of linkage maps.
Furthermore, fewer recombination events are captured in a tetraploid population compared with the diploid, which affects the precision of linkage maps (Li & Brummer, 2012). Furthermore, QTL results are dependent on environment/experimental design and the allelic variations in parents of the experimental population (two parents in most studies). For example, identification of QTLs in regions with lower recombination rate, such as centromeric regions, will be more challenging. Additional drawbacks of QTL arise from what is known as the Beavis effect that is overestimation of phenotypic variances associated with QTL in a population of small size (Beavis, 1998;Korte & Farlow, 2013;Weinig & Schmitt, 2004;Xu, 2003).
Although identifying QTLs has its own challenges, narrowing down the QTL region to find the loci responsible for the trait of interest may not be easy. In addition to these challenges, lack of a thorough data management system for storing, combining, and reusing QTL data is an additional hurdle for efficient use of available information that could avoid doubling the efforts and expenses. Although this information is available for some major legumes through the legume information system (Dash et al., 2015), in cooperation with SoyBase (http:// soybase.org) and PeanutBase (http://peanutbase.org), and cool season food legume (https://www.coolseasonfoodlegume.org), many of the minor legumes of great potential (such as Lupinus species) are not receiving enough attention in this regard.
An interesting study providing both opportunities and limitations of QTL is by Książkiewicz et al. (2017) in white lupin (Lupinus albus). Early flowering in white lupin was known to be controlled by the locus brevis, regulating vernalisation response, however, Książkiewicz et al. (2017) found multiple QTLs responsible for vernalisation responsiveness, yet the specific genes in these QTLs remain unknown. In addition, although overlapping QTLs were found in Australian and Polish experiments, identification of an additional small effect QTL in an Australian trial is a reflection that QTL results are environmental dependant, and hence, to capture the whole picture for a trait of interest, it is necessary to integrate and compare QTL data from different experiments.

| LD mapping (association mapping)
LD mapping has several benefits and can be considered as a complementary approach to QTL mapping. First, it may allow faster progress than QTL analyses as it does not always involve making experimental populations. Second and most importantly, LD can provide higher mapping resolution as it takes into the account the accumulation of historic recombination events (Korte & Farlow, 2013;Xu, Li, Yang, & Xu, 2017).
LD mapping can be classified into two types, including (a) broad genome-wide studies seeking variation associated with phenotypic diversity and (b) narrower investigations attempting to identify causal genes and mutations in a small number of candidate genes within a specified genomic region (Ross-Ibarra et al., 2007). Examples of where LD mapping has been applied for identification of both novel and previously characterized genes responsible for agronomic traits include genome-wide association studies (GWAS) in model legume Medicago truncatula (Stanton-Geddes et al., 2013), common bean (Kamfwa, Cichy, & Kelly, 2015;Moghaddam et al., 2016), and soybean (Contreras-Soto et al., 2017;Zhou et al., 2015). GWAS has also proven to be successful in identifying candidate genes for ascochyta blight resistance  and heat and drought tolerant loci in chickpea (Thudi, Upadhyaya, et al., 2014), and Aphanomyces euteiches resistance in Medicago truncatula (Bonhomme et al., 2014). Applying GWAS in a population comprising 292 pigeonpea accessions using data over several years enabled identification of association between candidate genes and traits, including 100-seed weight, days to 50% flowering, and plant height (Varshney et al., 2017). Hoyos-Villegas, Song, and Kelly (2017) investigated the genetic basis of variation for drought tolerance and related traits in a diversity panel including 96 Middle American genotypes of common bean, and the GWAS analysis allowed identification of significant marker-trait associations for traits related to drought tolerance and candidate genes associated with wilting.
In cowpea, salinity has become an increasing threat to production, and Ravelombola et al. (2018) identified SNPs associated with salt tolerance at germination and seedling stages. These markers can be applied as a tool for selecting tolerant lines to be included in breeding programs of this crop. One of the most successful applications of GWAS is in peanut. Peanut is one of the important crop of the semiarid tropics, where climate change is posing a threat to crop productivity due to the increase in range of abiotic (e.g., drought and heat) and biotic stresses. Although the complex and tetraploid nature of the peanut genome makes QTL mapping studies a challenging task, GWAS enhances the chance of characterizing candidate genes for production related traits. A comprehensive study by Pandey et al. (2014) analysed marker-trait associations for a wide range of economically important traits in peanut, such as yield components, oil components, drought, and disease tolerance. Several markers with significant allelic effects (>20% phenotypic variation) were identified for different traits such as pod yield, seed weight (under well-watered and drought stress), oil content, and quality. Another GWAS study in 158 peanut accessions found a total of 51 SNPs associated with various traits including seed weight and pod weight, and identified candidate genes related to the domestication of peanut , and this information will facilitate the genomic assisted breeding of peanut cultivars.
Despite the potential that LD mapping offers to identify adaptive genes, the tendency for spurious association, that is, false association with genomic regions, missing genotypes, identification of small effect variants, and genetic heterogeneity remain as limitations (Korte & Farlow, 2013). Another limiting factor of LD mapping is that resolution is dependent on the rate of LD decay, so using wild relatives of crops could serve as a better foundation.
In our opinion, because GWAS requires extensive phenotypic and genotypic information, it might be more usefully applied for major legume crops, where resources might already be available and the development of the future resources might be of interest of a wider research community. Additionally, accessing and integrating GWAS results from various studies is currently a cumbersome task due to the lack of a dedicated GWAS database in legumes that would enable cross referencing of resources from different experiments. The development of such a database would greatly benefit legume adaptation research.

| Genome-environment association mapping
GWAS has been applied in genome-environment association mapping to provide a new avenue to identify climate-adaptive genetic loci and the genetic basis of local adaptation (Hancock et al., 2011), assuming "association between conditionally adaptive mutations and the environmental conditions with which they interact" (Turner, Bourne, Von Wettberg, Hu, & Nuzhdin, 2010 p. 262). Hence, genome-environment associations along with genome-phenotype associations can be applied to efficiently select for climate resilience traits (Lasky et al., 2015). In Arabidopsis lyrata, the association between polymorphisms and soil type was shown to be enriched in some functional annotation terms such as metal ion transmembrane transporter activity, providing novel candidate genes for soil adaptation (Turner et al., 2010).
In legumes, genome-environment association analyses have been applied in Medicago truncatula, which identified candidate genes associated with adaptation to annual mean temperature and precipitation in the wettest month, and isothermality (Yoder et al., 2014). A recent study in narrow-leafed lupin investigated the association between SNPs and climatic gradients and found significant associations between some SNPs with annual mean temperature and precipitation (Mousavi-Derazmahalleh et al., 2018). Although availability of genotyping by sequencing data along with georeferenced genetic material makes genome-environment association mapping an interesting avenue to explore, such analyses are most useful based on collections made directly from the wild habitat, ensuring good correspondence between climatic records and collection site. Hence, they may be less informative for legumes that have been domesticated a long time ago and diverged substantially from their wild ancestors. For example, in chickpea, which is one of the oldest domesticated legumes, hybrids from crosses between domesticated chickpea with Cicer echinospermum, a wild relative that is believed to have contributed gene flow to cultivated chickpea, are infertile (Ladizinsky & Adler, 1976).

| Bottom-up approach (population genomics)
The limitations of the QTL and LD mapping methods highlight the need for a complementary approach. Molecular population genetics, which forms the basis of bottom-up approaches, appears to be promising for advancing our knowledge of the molecular signature of adaptation (Wright & Gaut, 2005).
Population genomics studies in Medicago truncatula demonstrated local adaptation of Tunisian populations to soil salinity, and revealed candidate genes with regulatory roles in abscisic and jasmonic acid signalling as well as genes associated with biotic stress and flowering time (Friesen et al., 2010;Friesen et al., 2014). A genome-wide study of artificial selection in soybean revealed candidate genes for some domestication traits such as seed-coat colour, growth habit, flowering time, and seed size (Li et al., 2013). Another study in soybean identified 159 putative domestication sweep accounting for 4.9% of the genome, containing 4,414 genes (Valliyodan et al., 2016). Recently, comparison of results of four different studies of varying sizes, data types, and methodologies Bitocchi, Rau, Benazzo, et al., 2017;Rodriguez et al., 2016;Schmutz et al., 2014), all based on population genomics approaches to search for signatures of selection during common bean domestication, provided evidence of domestication candidate genes for four genes (i.e., AN-Pv33, AN-Pv69, AN-DNAJ, and Leg223). Investigation of these genes highlighted their involvement in plant resistance/ tolerance to biotic and abiotic stresses, including heat, drought, and salinity (Bitocchi, Rau, Benazzo, et al., 2017).
Population genomics has great potential for identifying candidate genes harbouring adaptive mutations. However, careful consideration must be taken to exclude demographic effects such as population size and structure, which could bias the results by increasing the statistical variance applied to detect the selection signature. In addition, methods that consider demography may still not be able to detect recent selective sweeps (Nielsen, 2005). An example of where this can be seen is in narrow-leafed lupin, a domesticated crop of the In order to fully understand adaptive evolution, we should understand all possible changes leading to the adaptation. Therefore, whilst focusing on the role of genomics for producing climate-ready crops, it is worthwhile to take into consideration the phenotypic alterations as a result of epigenetics variation. Epigenetics refers to mechanisms such as modification of DNA methylation and histones, and noncoding RNAs, which do not alter DNA sequences but could affect gene expression and trait phenotypes (Springer & Schmitz, 2017). Research in Arabidopsis thaliana has shown substantial heritable DNA methylation variation for several plant traits and plasticity, such as root allocation and drought tolerance, as well as the act of natural selection on some of the variation, such as plant biomass and height (Y.Y. Zhang, Fischer, Colot, & Bossdorf, 2013). A recent study by Song, Zhang, Stelly, and Chen (2017)

| High throughput phenotyping
Recent decades witnessed a tremendous progress in DNA sequencing technologies; however, successful crop improvement plans are also dependant on accurately measuring plant traits to identify genetic loci associated with traits. Along with innovative and high throughout phenotyping strategies (such as near-infrared spectroscopy on agricultural harvesters and spectral reflectance of plant canopy), analysis can be extended to the molecular phenotype using transcriptomic, metabolomics, and proteomic approaches (Beleggia et al., 2016;Bitocchi, Rau, Benazzo, et al., 2017). Together, these will improve our capacity to explore the phenotypic space from large multilocation field trials (Fahlgren, Gehan, & Baxter, 2015;Montes, Melchinger, & Reif, 2007). A recent study on a large population of rice (consists of 1,568 samples), using both field-based high throughput phenotyping (HTP) and manual phenotyping, confirmed the efficiency and accuracy of HTP in detecting QTLs associated with grain yield and yield components (Tanger et al., 2017). Of relevance is also the development of a high-throughput phenotyping system to study root systems , and integrating HTP methods with high-throughput genotyping hold a potential to

| Predictive modelling
Exploring collections of CWR from regions that are likely to be enriched for target traits (e.g., warm, dry areas for heat and drought tolerance) ensure this available genetic diversity can be identified and harnessed when needed (Mousavi-Derazmahalleh et al., 2018;Phillips, Asdal, Magos Brehm, Rasmussen, & Maxted, 2016)

| Pangenomes
Although the availability of reference genomes has greatly assisted plant genetics research and breeding, these reference genomes capture only a portion of diversity present in the species. A solution is the development of pangenome assemblies that more comprehensively capture sequence and structural diversity in a species. In legumes, pangenomes have been constructed for soybean (Lam et al., 2010;Li et al., 2014) and Medicago truncatula (Zhou et al., 2017). A pangenome of soybean based on seven accessions of wild soybean suggested faster evolution and greater variability in dispensable genes compared with the core genes, which may be associated with adaptation to biotic and abiotic stresses (Li et al., 2014).
Pangenome construction takes into the account the structural variations, and so they capture the genetic variation of the species more comprehensively rather than a single reference genome. They also enable comprehensive identification of SNP variation. This simplifies discovery of rare variants, which might be associated with QTLs for agronomic traits. Pangenomes also allow the differentiation between SNPs occurring in core (present in all individuals of the species) and variable (present in a subset of individuals) genomes, the latter of which was found in several studies to influence adaptation to biotic and abiotic stresses (Hurgobin & Edwards, 2017). These resources can be of value for legume breeding, based on novel gene identification and discovery of nucleotide diversity that enables molecular marker design for introgression of previously untapped genes into crop improvement programs.
Considering the fact that the selection of appropriate individuals with enough variation is an important element to a successful pangenome study, we suggest that pangenomes may be smaller in size in crops such as narrow-leafed lupin, which went through severe genetic bottleneck during its recent domestication (Berger et al., 2012). In addition, pangenome construction requires extensive sequence data and computational resources, and its quality is dependent on the assembly precision. This makes the development of pangenomes challenging in the case of crops with complex and large repetitive genomes such as pea (Macas, Neumann, & Navrátilová, 2007), as well as polyploid genomes such as tetraploid alfalfa (Medicago sativa).

| Genome editing
A relatively new technology for mutagenesis is the clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-Cas9 system.
Originally discovered as bacteria's adaptive immune system, CRISPRs' repeat-spacer-repeat sequence pattern was found to be involved in an RNA intereference-like mechanism that can identify and cut foreign DNA. Genome editing modification by CRISPR uses a guide RNA that is complementary to a target gene, induces double-strand breaks usually by a Cas9 nuclease, followed by a non-homologous end joining or homology-directed repair mechanism (Jinek et al., 2012;Xiong, Ding, & Li, 2015). Legumes hold great promise to mitigate the effect of climate change through their contribution in sustainable farming, capitalization of elevated level of CO2, and broadening the crop base, which is currently dominated by a small number of major crops, mainly from the cereal family. In addition, enormous progress has been made in legumes to identify novel alleles for adaptive traits. However, deployment of these findings in applied breeding remain a major limitation to release climate-ready cultivars. As stated by Gready (2014), disruptive thinking and technologies are required to take advantage of best of the old and the new. We believe availability of genome editing tools such as CRISPR provide an excellent example of this. The full potential of legume crops remains yet to be explored, with genomics as a powerful enabling tool. Choice of approaches to create new cultivars is dependent on various factors such as plant information, availability of geno- We also thank Candy Taylor for the comments on the manuscript.

CONFLICT OF INTEREST
None.