Abstract
- Top of page
- Abstract
- Introduction
- Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- References
- Biosketch
- Supporting Information
Aim Investigate the relative abilities of different bioclimatic models and data sets to project species ranges in novel environments utilizing the natural experiment in biogeography provided by Australian Acacia species.
Location Australia, South Africa.
Methods We built bioclimatic models for Acacia cyclops and Acacia pycnantha using two discriminatory correlative models (MaxEnt and Boosted Regression Trees) and a mechanistic niche model (CLIMEX). We fitted models using two training data sets: native-range data only (‘restricted’) and all available global data excluding South Africa (‘full’). We compared the ability of these techniques to project suitable climate for independent records of the species in South Africa. In addition, we assessed the global potential distributions of the species to projected climate change.
Results All model projections assessed against their training data, the South African data and globally were statistically significant. In South Africa and globally, the additional information contained in the full data set generally improved model sensitivity, but at the expense of increased modelled prevalence, particularly in extrapolation areas for the correlative models. All models projected some climatically suitable areas in South Africa not currently occupied by the species. At the global scale, widespread and biologically unrealistic projections by the correlative models were explained by open-ended response curves, a problem which was not always addressed by broader background climate space or by the extra information in the full data set. In contrast, the global projections for CLIMEX were more conservative. Projections into 2070 indicated a polewards shift in climate suitability and a decrease in model interpolation area.
Main conclusions Our results highlight the importance of carefully interpreting model projections in novel climates, particularly for correlative models. Much work is required to ensure bioclimatic models performed in a robust and ecologically plausible manner in novel climates. We explore reasons for variations between models and suggest methods and techniques for future improvements.
Introduction
- Top of page
- Abstract
- Introduction
- Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- References
- Biosketch
- Supporting Information
Understanding the potential impacts of novel climates on native and alien species distributions is critical for conservation planning and management, but projecting ecological futures is highly uncertain. Studies that model species ranges can encounter methodological, conceptual and theoretical difficulties, making interpretation of results problematic for both current and future environments (Dormann, 2007; Coreau et al., 2009; Rodda et al., 2011).
Bioclimatic models are commonly used for projecting the potential range of invasive species for risk assessment and more generally for species range shifts under the influence of climate change (Guisan & Zimmermann, 2000; Kriticos & Randall, 2001). These models define the potential limits of species distributions using various combinations of the species known range, physiological tolerances, biotic interactions and dispersal potential (Elith & Leathwick, 2009; Kearney & Porter, 2009; Soberón & Nakamura, 2009). The models are then transferred or projected to other regions or times to identify additional areas suitable for occupation by the species in question.
The most commonly used bioclimatic models are correlative, linking readily available species distribution records with spatial environmental data, using either statistical or machine learning techniques (Elith & Leathwick, 2009). An alternative, but more time- and data-intensive approach is to link the ecophysiological responses of species to environmental covariates in mechanistic bioclimatic models (Kriticos & Randall, 2001; Sutherst, 2003; Kearney & Porter, 2009).
Which components of the species niche are represented in different modelling techniques depends on the framing of the research question (Venette et al., 2010), the modelling method and the training data used (Kearney, 2006; Hirzel & Le Lay, 2008; Soberón & Nakamura, 2009). These choices, in turn, can influence model projections. Novel climates are key areas of interest for invasion ecology and climate change, as well as for the management and policy frameworks built on such knowledge. The three primary determinants of a species range are climate, biotic interactions and dispersal (Soberón & Nakamura, 2009). Because biotic and dispersal drivers of distributions can change rapidly owing to anthropogenic influences, the goal for bioclimatic models exploring habitat suitability in novel climates should be to approximate the Grinnellian fundamental niche (sensuSoberón, 2007). At the very least, they should be able to characterize the realized Hutchinsonian niche (sensuSoberón, 2007) underlying the species’ native range.
Various approaches aim to develop bioclimatic models that more closely approximate fundamental niches, while recognizing that perfect matches are not possible. Theoretically, the ability of correlative and mechanistic models to project suitable novel climate space should be improved by greater model complexity and the inclusion of more species-relevant data, but this construct is rarely tested. For example, mechanistic models are potentially able to get closer to understanding the determinants of the fundamental niche by considering ecophysiological processes. Alternatively, it has been suggested that models of invasive species fitted with pooled data from alien and native ranges may improve the descriptive performance of the models relative to models fitted with native-range data only (Mau-Crimmins et al., 2006; Broennimann & Guisan, 2008; Sanchez-Fernandez et al., 2011). Current consensus suggests that correlative model outputs align more closely with species’ realized distributions, while mechanistic models more closely approximate their fundamental climate niche and therefore robustly project species ranges into novel climates (Soberón, 2010; Rodda et al., 2011). Yet few studies have rigorously investigated or tested this proposition (but see Sutherst & Bourne, 2009; Elith et al., 2010; Kearney et al., 2010).
The expertise and resources required to parameterize mechanistic models may not be available for many species, thus limiting their application to high profile questions. In contrast, correlative models are quick to parameterize, have minimal requirements and use widely available species distribution records and spatial environmental data. It is therefore highly likely that they will continue to be used. Concerns about extrapolation issues (e.g. Sutherst & Bourne, 2009) have prompted some authors to argue for careful and critical evaluation of the performance of correlative models in novel environments to identify and address problems requiring resolution (Elith et al., 2010; Venette et al., 2010; Rodda et al., 2011).
Acacia cyclops A.Cunn. ex G.Don and Acacia pycnantha Benth. (subgenus Phyllodineae, Mimosoideae: Fabaceae; Miller et al., 2011) are native to Australia and major invasive species in South Africa, the Iberian Peninsula and California (Turpie et al., 2003; Gaertner et al., 2009; Le Maitre et al., 2011; Richardson & Rejmánek, 2011). The life histories of both species are well characterized, and their native and naturalized distributions, well documented (see review in Appendix S2 in Supporting Information). In Australia, both species have become naturalized outside their historical native distributions (Maslin & McDonald, 2004). In South Africa, both species had been introduced by the mid-19th century and widely planted for land rehabilitation and commercial purposes. This long invasion history, widespread colonization and well-documented distributions may be viewed as a ‘natural experiment in biogeography’ from which much can be learnt about species range dynamics (Richardson et al., 2011). This makes these Acacia species useful model systems for comparing the ability of different approaches to project potential geographic ranges for species in novel environments. As such, Australian acacias in South Africa provide a unique opportunity to investigate tools for invasive species and climate change risk assessment.
In this study, we compare the ability of different modelling methods to make projections of species potential ranges in novel environments. Because we cannot test this directly, beyond qualitative assessments based on theoretical expectations, we use invasions of novel environments in South Africa as a proxy for future novel climates. Specifically, we built bioclimatic models for A. cyclops and A. pycnantha using the mechanistic niche model CLIMEX (Sutherst & Maywald, 1985; Sutherst et al., 2007) and two discriminative correlative modelling techniques MaxEnt (Phillips et al., 2006) and Boosted Regression Trees (BRT; Ridgeway, 2007; Elith et al., 2008). We fitted models for the three techniques utilizing two training data sets: native-range data only (‘restricted’) and all available global data excluding South African distribution records (‘full’). We compared the ability of the six techniques (three bioclimatic models× two training data sets) to project the climate suitability for observed records of the two species in South Africa. In addition, we assessed the impacts of climate change on the potential distributions of the two species in both native and alien habitats. Our intention is to motivate developments that improve bioclimatic modelling of novel climates by investigating what differences in the models or training data sets are responsible for variation in model behaviour. Finally, we explore how correlative and mechanistic modelling approaches can complement each other or, together, facilitate the development of more robust bioclimatic modelling techniques.
Discussion
- Top of page
- Abstract
- Introduction
- Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- References
- Biosketch
- Supporting Information
The distribution patterns of Australian acacias provide a unique opportunity for investigating many practical and theoretical issues associated with bioclimatic modelling and its application to novel climates, such as invasive species and future climate change risk assessments. For example, A. cyclops and A. pycnantha both have elements of native-range expansion, alien invasions within their native continent, and broad-scale alien invasions in other continents. Our study found substantial variation in the projected range limits for A. cyclops and A. pycnantha among the three modelling techniques and between models fitted with restricted or full training data sets.
There are obviously many issues that this manuscript could discuss with respect to bioclimatic modelling, such as data quality and appropriate statistics for testing models. We have focussed our discussion on (1) the interpretation of model projections, (2) explaining differences between models and (3) how we might best proceed with modelling species ranges in novel climates.
Model interpretation
All models were statistically significant when tested against their training data, with independent South African data and globally, but there was considerable variation among modelling techniques and between models fitted with restricted or full training data sets in both their sensitivity and modelled prevalence. Generally, the extra information in full training data sets encompassed a broader environmental range and increased the sensitivity of the models. Gains were marginal for both species using MaxEnt, moderate for both species using BRT and substantial for A. cyclops and marginal for A. pycnantha using CLIMEX. It is likely that the CLIMEX projections based on the restricted data set, particularly for A. cyclops, were too conservative and therefore potentially underestimated suitable climate space as indicated by the low sensitivity value when projected to South Africa (Table 2). However, the method used to construct the restricted data set models goes against recommended practice for CLIMEX modelling because of the potential value in the data being withheld from the model (Kriticos & Randall, 2001; Sutherst & Bourne, 2009). As might be expected, the increases in the sensitivity of the models fitted with full training data sets were also accompanied by increases in model prevalence scores, as they included information from locations occupying more extreme climatic conditions than the native-range data set. Interestingly, this additional information had a different impact on the correlative and mechanistic models.
The CLIMEX modelling technique allows new distribution records or ecophysiological information to be included in the model by iterative adjustments to the restricted data set parameters, with a coincident change in the projection area to encompass ecoclimatically similar locations. For our acacia models, the prevalence of the CLIMEX models increased to encompass the points in the full data set with only moderate increases in model prevalence elsewhere confined to ecologically reasonable climates (compare Figs 4c & 4f). In contrast, the additional information included in the full training data set sometimes adversely affected the discriminative correlative models; increased sensitivity came at the expense of proportionally greater increases in modelled prevalence and consequently reduced the statistical significance of the models. Where the alien distribution data included locations outside of the climatic range spanned by the native distribution records, it also included additional areas of model background that were incompletely invaded (Fig. S1). Therefore, models trained on the complete data sets used a background sample that included areas of high climatic suitability that were occupied in the native range and unoccupied in the exotic range. This pattern of increasing confusion in correlative models trained on full vs. restricted range data is apparent in both the changes in the potential range boundaries and changes in the relative suitability patterns within the boundaries for the models (e.g. compare Fig. 4a with 4d, Fig. 4b with 4e). In both of these correlative model comparisons, the full data set models classified considerably less area as highly suitable, indicating that the model’s ability to discriminate relative climate suitability was reduced. Moreover, the extra information in the full data sets did not necessarily result in closed Bioclim variable response curves, even when the variable range of the background was broad relative to that of the distribution points (Fig. S1). Taken together, our results indicate that combining native and alien distribution records in discriminative correlative models does not consistently improve model projections. If undertaken as a methodological choice, careful interpretation of the input data and the model results is imperative.
A significant advantage of testing the models in South Africa is that we were able to observe how the models performed against an independent set of high-quality data, where the ecological implications of prevalence and sensitivity can be interpreted with reasonable confidence. Of particular interest to invasion ecologists and biosecurity managers are regions of projected suitable climates that do not currently have distribution records. In South Africa, the correlative models projected areas of climatic suitability in interpolation (MESS+) space beyond the current distribution in the Eastern Cape, particularly for A. cyclops (Figs S4 & S5). In contrast, the CLIMEX model projected areas of climatic suitability in the central provinces west and north of Lesotho (Figs S4f & S5f). If these projections are plausible, the absence of these two acacias may be due to factors not included in the models. For example, regions north and west of Lesotho are known to have up to 60 nights of frost annually (Schulze, 1965), a range-limiting climate variable not captured well by climate averages (Zimmermann et al., 2009). When assessed against high-quality data, models can be used to generate testable hypotheses to provide insight into the relative importance of climate, dispersal and biotic interactions on the present range limits of these species.
Irrespective of the model performance in South Africa, for invasive species risk assessments, bioclimatic models ideally need to be able to perform in a robust manner globally. It was clear from the mapped model results that model performance in South Africa was not representative of model performance globally, indicating performance in South Africa cannot be usefully generalized elsewhere. This study found substantial differences in the global projections of range limits for A. cyclops and A. pycnantha among the three bioclimatic models (Figs 4 & 5). Both MaxEnt and BRT models projected implausible areas of high suitability for A. cyclops in the tropics, subtropics and deserts and for A. pycnantha at very high latitudes in the Northern Hemisphere. In contrast, the CLIMEX model projections were more constrained and restricted to the native range and closely matched climates, incidentally more closely resembling the MESS+ regions from the correlative models.
Explaining model differences
The differences in projections between the correlative models and CLIMEX are influenced, in part, by the respective methods for fitting species ranges. MaxEnt and BRT models seek to discriminate between distribution records and the background using the entire presence data set to fit the response functions. In contrast, the CLIMEX model fitting process explicitly focuses attention on the peripheral distribution records and their relationship with adjacent, apparently climatically unsuitable regions. The CLIMEX user’s challenge is to identify solutions that accord with knowledge across multiple domains. When a conflict is discovered between the information at hand and the model, all data and knowledge sources (distribution records, ecophysiological data, climate data, theoretical precepts and the relevance of model mechanisms) are scrutinized to identify plausible, parsimonious explanations. In this way, we were able to identify and actively exclude distribution record outliers that were found in apparently climatically implausible locations, but that were not detected in the Bioclim variable exploration used for the correlative models (Fig. S1).
A second influence on model performance is how the models handle extrapolation. In the global projections, it was clear that the MaxEnt and BRT models were extrapolating (MESS−) beyond their training data to project climatic suitability in parts of the world where it is implausible for large woody shrubs to grow (e.g. the Sahara desert and Greenland; Figs 4 & 5). MESS maps also made it very clear that regions of correlative model extrapolation dominated global projections in future climate scenarios (Figs 6, 7, S6 & S7). The three modelling techniques applied here differ considerably in how they deal with extrapolation (Fig. 8). MaxEnt provides the user with four options to control response curves: ‘no extrapolation’, ‘clamping’ (maintaining the suitability value at the limits of the training data), ‘don’t clamp’ (continuing the trajectory of the response curve at the limits of the training data) and ‘fade by clamping’ (reducing the suitability value by the difference between clamped and don’t clamped output; Fig. 8a). BRT models produce the equivalent of clamping in MaxEnt (Fig. 8b). In this study, MaxEnt models used the clamping option, meaning that open-ended response functions often maintained high suitability values throughout the extrapolation space (Figs S2 & S3). These factors account for much of the implausible model projections clearly evident at the global scale. CLIMEX models, on the other hand, are fitted over the entire environmental domain (Fig. 8c). Parameters that contribute to the growth index (GI) consist of closed curves that are used by the model in a different way to correlative models (Sutherst et al., 2007). CLIMEX also uses stress functions to further define unsuitable climate space. Stress functions largely define the species range, whereas the annual growth index (GIA) and stresses indices, in combination, define the climate suitability (EI) within that range. Critically, these stress functions have the property of explicitly penalizing conditions that are more extreme than those which are inferred by the model to be unsuitable.
Improving bioclimatic models for novel climates
If we are to properly understand species invasions and the effects of anthropogenic climate change, we need to be confident that our models are capturing key determinants of the fundamental niche and that projections beyond the training regions are meaningful and reliable. Our study is not the first to describe differences among models in the limits to species distributions (Pearson et al., 2006; Elith & Graham, 2009) or to highlight the problems with using correlative bioclimatic models for extrapolation (e.g. Hirzel & Le Lay, 2008; Sutherst & Bourne, 2009). Although important advances have been made in tools that facilitate careful interpretation of model outputs (Elith et al., 2010, 2011), there are still avenues for research and development that could improve the ability of bioclimatic models to handle novel climates. Based on our experience, we suggest four areas of endeavour: (1) defining the background for training the MaxEnt models and the background or sampling area for pseudo-absence points in the BRT models, (2) adding an ability to create more ecologically realistic response functions, (3) developing more relevant variables for bioclimatic modelling and (4) further integrating mechanistic and correlative model techniques.
The choice of the background (MaxEnt and BRT) or sampling area for the generation of pseudo-absences (BRT) remains a matter of judgement and involves many considerations (Elith et al., 2010). If the background is either too narrowly or too broadly defined, it can compromise model performance (VanDerWal et al., 2009) and its ability to accurately capture or project distributions. For example, increasing the size of the background will increase the background climate span relative to the climatic span of the distribution records and reduce the area where models are extrapolating. However, the reduced area of extrapolation comes at the expense of discriminating suitable environments at local scales, and places misleading emphasis on a reduced set of variables less relevant to the species being modelled (VanDerWal et al., 2009; Elith et al., 2010). There are no hard and fast rules for defining backgrounds, yet avoiding extrapolation using the whole world as a background would be clearly inappropriate. Our study is among the first to consider bioclimatic rules for defining the background (applying the Köppen–Geiger climate zones), and we demonstrate new tools for visualizing extrapolation space (MESS− overlays) and the interplay between the climate space spanned by the distribution records and the model training domains (Figs S1–S3). We chose to use Köppen–Geiger zones because of their strong climatic basis but, even so, we found that there was substantial variation among the bioclimatic variables in the ranges and frequency distributions spanned by the distribution records relative to the training domain (Fig. S1). Our choice of background deliberately avoided the inclusion of climates well outside of the range spanned by the distribution records. However, the training data may have had a truncated environmental domain because of species ranges abutting continental boundaries (Figs 1 & 2), and the models fitted open-ended response functions (Figs S2 & S3), leading to inappropriate modelled suitability in MESS− areas. Clearly, more thought needs to be given to defining the background in terms of how geographic space translates to climate space.
Both our study and others show that extrapolation needs to be treated with caution in correlative models (e.g. Kriticos & Randall, 2001; Sutherst & Bourne, 2009; Elith et al., 2010). Many of the problems arise because of open-ended response curves. Extrapolating into novel climates with open-ended response curves in discriminatory correlative models can give biologically unrealistic projections when the response functions of particular variables are dominating model behaviour. One solution may be to incorporate options that determine response function behaviour in an ecologically meaningful way. Possibly of greater importance for the models of invasive species is the greater likelihood that in selecting models with high specificity, the model becomes over-fitted. Any methods that control response function behaviour should be ecophysiologically based (Austin, 1987; Austin & Meyers, 1996).
A further area for research would be to develop more relevant bioclimatic variables. The 35 Bioclim variables available in the CliMond data set (Kriticos et al., 2011) are an expansion on the 19 core variables used for many models up to this point. However, new variables could be developed that more closely match critical stress mechanisms for organisms, such as frost or drought. Alternatively, variables could be developed based on extreme values rather than means (Zimmermann et al., 2009), especially given that most projections of future climates indicate that the frequencies of events currently considered extreme will increase (Frei et al., 2006).
The performance of the full data set CLIMEX models in this study highlights its utility for invasive species risk assessment and climate change studies. Outside of specific ecophysiological studies to populate parameters (e.g. Scott & Yeoh, 1999), the time and skill required to fit these models is similar to that of well-constructed correlative models. Nonetheless, efforts to improve the ability of CLIMEX models to be machine-fitted should continue while retaining one of the model’s strengths, namely its ability to confront the user with conflicting evidence of habitat suitability and provide many of the means to resolve conflicts. The tools developed by Elith et al. (2010, 2011) allow for a better understanding of correlative model behaviour, insightful model critique and improved transparency. This research group is also exploring methods to incorporate non-climatic physiological layers into correlative models and move beyond climate-matching the realized species niche. The approach taken in this study highlights parallels between correlative modelling and the established methods of mechanistic models such as CLIMEX, and we recommend further exploration of the different insights they can provide.
Acknowledgements
- Top of page
- Abstract
- Introduction
- Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- References
- Biosketch
- Supporting Information
We thank the Australian Department of Climate Change and Energy Efficiency (through the Bilateral Climate Change Partnership Program) and the CSIRO Climate Adaptation Flagship (B.L.W., D.J.K. & J.K.S.) for funding this work. We thank the South African Working for Water Programme (WfW) in collaboration with the CSIR (D.C.LeM.) and the DST-NRF Centre of Excellence for Invasion Biology (the latter through their collaborative research project with the Working for Water programme on ‘Research for Integrated Management of Invasive Alien Species’; J.J.LeR.) for funding research that contributed to this work. The Oppenheimer Memorial Trust and Stellenbosch University provided financial support for the workshop in Stellenbosch in October 2010 at which an early version of this paper was presented. We thank Danni Guo for technical support, Jean-Marc Dufour-Dror, Stephen Mifsud, Elizabete Marchante, Jean-Marc Tison, Luc Willemse, Giuseppe Brundu, Bruno Foggi, Marcos Salas, Juan Luis Luengo, Elias Sanchez, Luís González, Andrew Pearson and Anne Ovington for species distribution data and Jane Elith, Bob Sutherst, Simon Ferrier and two anonymous reviewers for feedback on earlier drafts of this manuscript.
Biosketch
- Top of page
- Abstract
- Introduction
- Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- References
- Biosketch
- Supporting Information
This project is part of a multi-institution collaboration between Australian and South African scientists using native and alien range studies to explore exchanged invasive plants.
Bruce Webber is a plant ecophysiologist with research interests in plant resource allocation and plant–animal interactions. He currently works with the CSIRO Climate Adaptation Flagship to apply these interests to understanding the dynamics of plant invasions within the context of a rapidly changing climate.
Author contributions: C.J.Y. and G.F.M. initiated the collaboration, G.F.M., C.J.Y., J.K.S., B.L.W. and D.C.LeM. conceived the research ideas, J.J.LeR. and N.O. contributed core data; A.McN., B.L.W., D.C.LeM. and C.J.Y. processed the distribution data, B.L.W., D.C.LeM., D.J.K., C.J.Y., J.K.S., and A.McN. ran the models, B.L.W., A.McN., D.J.K., N.O., D.C.LeM. and C.J.Y. analysed the model output, and B.L.W. led the writing.
Supporting Information
- Top of page
- Abstract
- Introduction
- Methods
- Results
- Discussion
- Conclusions
- Acknowledgements
- References
- Biosketch
- Supporting Information
Appendix S1 Species distribution data processing for South Africa.
Appendix S2 Supporting information for physiological parameters used in the CLIMEX models.
Figure S1 Bioclim variable box and whisker plots.
Figure S2 Bioclimatic parameter response curves for Acacia cyclops.
Figure S3 Bioclimatic parameter response curves for Acacia pycnantha.
Figure S4 Recent historical projections for Acacia cyclops in South Africa.
Figure S5 Recent historical projections for Acacia pycnantha in South Africa.
Figure S6 Future (2070) global projections for Acacia cyclops modelled with the restricted dataset.
Figure S7 Future (2070) global projections for Acacia pycnantha modelled with the restricted dataset.
Figure S8 Spatial distribution of limiting factors for MaxEnt modelling.
Table S1 Physiological parameters used in the CLIMEX models.
As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer-reviewed and may be re-organized for online delivery, but are not copy-edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.
Please note: Wiley-Blackwell is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.