Volume 39, Issue 4
Software note
Free Access

sdm: a reproducible and extensible R platform for species distribution modelling

Babak Naimi

E-mail address: naimi.b@gmail.com

Imperial College London, Silwood Park, Buckhurst Road, Ascot, Berkshire, SL5 7PY UK

Center for Macroecology, Evolution and Climate, Natural History Museum of Denmark, Univ. of Copenhagen, Denmark

Search for more papers by this author
Miguel B. Araújo

Imperial College London, Silwood Park, Buckhurst Road, Ascot, Berkshire, SL5 7PY UK

Center for Macroecology, Evolution and Climate, Natural History Museum of Denmark, Univ. of Copenhagen, Denmark

Dept of Biogeography and Global Change, National Museum of Natural Sciences, CSIC, c/Jose Gutierrez Abascal, ES‐28006 Madrid Spain

InBio‐CIBIO, Univ. of Évora, Largo dos Colegiais, PT‐7000 Évora Portugal

Search for more papers by this author
First published: 02 February 2016
Citations: 121

Abstract

sdm is an object‐oriented, reproducible and extensible, platform for species distribution modelling. It uses individual species and community‐based approaches, enabling ensembles of models to be fitted and evaluated, to project species potential distributions in space and time. It provides a standardized and unified structure for handling species distributions data and modelling techniques, and supports markedly different modelling approaches, including correlative, process‐based (mechanistic), agent‐based, and cellular automata. The object‐oriented design of software is such that scientists can modify existing methods, extend the framework by developing new methods or modelling procedures, and share them to be reproduced by other scientists. sdm can handle spatial and temporal data for single or multiple species and uses high performance computing solutions to speed up modelling and simulations. The framework is implemented in R, providing a flexible and easy‐to‐use GUI interface.

Species distributions models (SDMs), also known as bioclimatic envelope models, ecological niche models and habitat suitability models, explore the relationship between geographical occurrences of species and corresponding environmental variables (Guisan and Zimmermann 2000, Peterson et al. 2011). SDMs are widely used in a range of fields and applications including regional biodiversity assessments, spatial conservation prioritization, evolutionary biology, epidemiology, global change biology, and wildlife management (Araújo and Peterson 2012). There are several SDM techniques available. They differ in their ability to summarize the relationships between response and predictor variables (Segurado and Araújo 2004, Elith et al. 2006), and when used for transferring the distributions of species into different geographical (Randin et al. 2006) or temporal contexts (Thuiller et al. 2004, Araújo et al. 2005b, Pearson et al. 2006) projections can vary startlingly among techniques.

SDMs also vary with regards to the type of response variables used (e.g. presence and absence versus presence only), the types of predictor variables handled (e.g. continuous versus categorical), the type of output provided (e.g. probabilities, continuous indices of suitability, or binary predictions of presence and absence), the type of species–environment relationship assumed (e.g. simple linear to complex nonlinear), the approach used to estimate species distributions (e.g. parametric versus nonparametric approaches), and the approach to select relevant predictor variables (e.g. whether predictor contributions are weighted, and whether they allow for interactions among variables) (Segurado and Araújo 2004, Elith et al. 2006, Austin 2007, Naimi et al. 2011, Peterson et al. 2011).

The outputs of SDMs are sensitive to the specific rules used to parameterize them. When models are implemented in different platforms, rules used to fit them may not be comparable. For example, Domain (Carpenter et al. 1993), DesktopGARP (Stockwell and Peters 1999), and Maxent (Phillips et al. 2006) are typically implemented with different off‐the‐shelf software making cross‐model comparisons challenging. Models are also generally implemented following different protocols for pre‐processing of data and post‐processing of the results, even when they are implemented within the same computer platform. Given the difficulties in comparing the results of different models, conclusions from model comparison studies are difficult to generalise beyond the specific case studies (Segurado and Araújo 2004, Elith et al. 2006).

An integrated framework enabling multiple SDMs to be fitted and compared simultaneously is required to move the field of species distribution modeling forward. Three off‐the‐shelf software including openModeller (de Souza Muñoz et al. 2009), BIOENSEMBLES (Diniz‐Filho et al. 2009), and ModeEco (Guo and Liu 2010) have been independently developed to provide such frameworks. They enable several modelling algorithms to be fitted simultaneously and they perform the most common tasks related to species distribution modelling (e.g. data evaluation, prediction). They also provide graphical user interface (GUI) making them click‐and‐run software and particularly friendly to users with less computational expertise. Simultaneously, they provide limited flexibility as users can only use algorithms, model comparison and evaluation procedures that are implemented therein. Moreover, insufficient understanding of what such click‐and‐run software is doing and how they were implemented makes users fret over whether they are doing what is expected (Joppa et al. 2013).

R (R Development Core Team) is a general‐purpose high‐level programming language and a free (under the GNU general public license) open source environment. It is widely used for statistical analysis and graphical visualization and, recently, its suitability for mathematical computing (Soetaert et al. 2010), manipulation and analysis of complex spatial data sets and modelling (Bivand et al. 2008) has increased. R can be extended through user‐created packages, which allow developing new and specialized analytical techniques, graphical devices, import/export capabilities, reporting tools, etc. Growing collections of tools are explicitly being developed to bridge R and the known modelling software (Naimi and Voinov 2012). All of these capabilities make R very powerful. Despite the advantages of R, there are also some disadvantages. R involves a steep learning curve preventing beginners and script‐avert scientists from taking advantage of its capabilities. Moreover, different packages are not equivalent regarding their computational efficiency (García‐Callejas and Araújo unpubl.) or capability for handling errors. Sometimes they simply do not work under given circumstances and users have to struggle with errors and bugs. When users apply and compare alternative models, it becomes difficult to keep track of the syntactical nuances implemented in different packages (Kuhn 2008).

R provides an increasing number of packages for modelling (e.g. gbm, gam, maxlike, deSolve, simecol). At least two R platforms have been developed for fitting (e.g. BIOMOD and dismo; Thuiller et al. 2009, Hijmans and Elith 2013) and processing of species distributions modelling outputs (e.g. SDMTools; VanDerWal et al. 2011). BIOMOD (Thuiller et al. 2009), including its recent version biomod2, offers several functions for ensemble modelling of species distributions (Araújo and New 2007). The other package, dismo (Hijmans and Elith 2013), can be used to fit several SDMs including maxent (Phillips et al. 2006) in R, and facilitates using common spatial data in the procedure of modelling and predicting species distributions. However, it does not support fitting and comparison of multiple SDMs as in BIOMOD.

BIOMOD and dismo combine a limited number of packages and modelling techniques. Even if technically feasible to add more techniques into these platforms, the task is beyond reach by most users. The platforms also lack convenient GUI interfaces, thereby being unpalatable to users with very basic knowledge of R. More importantly, because implementation of the different techniques is not standardized, lessons learned from comparing outputs of different SDMs are impaired. Are results of a particular modelling technique better because the technique is superior to other, or because of particular default implementations in the software? Developing model‐independent methods (i.e. procedures that can be applied with any SDM) for common tasks in species distribution modelling (e.g. variable selection, variable importance) followed with a good software design would override such shortcomings for comparing modeling outputs in existing SDM platforms.

We introduce a new R package, sdm, that solves the limitations of existing platforms for species distributions modelling. sdm an extendable framework that enables fitting of individual and community‐based SDM approaches, while supporting markedly different modelling approaches, including correlative, process‐based (mechanistic), agent‐based, and cellular automata. It generates ensembles of models, and several options for evaluation of model results and and projection of species potential distributions in space and time. The generic design of sdm is object‐oriented making it flexible and amenable to efficient handling of errors. The object‐oriented design also makes it easily extended by users wanting to support additional models and/or procedures for any of the main steps in species distribution modelling. Finally, sdm provides a graphical user interface (GUI) making it easy‐to‐use even for users who are not familiar with R.

Design of the sdm package

The sdm package is designed to create a comprehensive modelling and simulation framework that: 1) provides a standardised and unified structure for handling species distributions data and modelling techniques (e.g. a unified interface is used to fit different models offered by different packages); 2) is able to support markedly different modelling approaches, including correlative, process‐based (mechanistic), agent‐based, cellular automata, etc.; 3) enables scientists to modify the existing methods, extend the framework by developing new methods or procedures, and share them to be reproduced by the other scientists; 4) handles spatial as well as temporal data for single or multiple species; 5) employs high performance computing solutions to speed up modelling and simulations, and finally; 6) uses flexible and easy‐to‐use GUI interface.

sdm was built following a fully object‐oriented design. The object‐oriented approach enables formulation of problems using interacting objects rather than sets of functions (Alfons et al. 2010). The properties of these objects are defined by general and extensible class description, suitable for species distributions models and their corresponding data. Their behavior and interactions are modeled with generic functions and methods. One of the most important concepts of object‐oriented programming is class inheritance, i.e., subclasses inherit properties and behavior from their super‐classes. Thus, code can be shared for related classes, which is the main advantage of inheritance (Alfons et al. 2010). In addition, subclasses may have additional properties and behavior, so in this sense they extend their super‐classes.

In the sdm framework, we used S4 and reference class systems (Chambers 2014), which provide mechanisms for object‐oriented programming in R. The reference class system allows the use of encapsulated object‐oriented programming, and their objects behave more like objects in the other object‐oriented programming language such as Java and C++. We defined several classes to handle species data, different methods, and settings for modelling and simulation. There are some container classes whose instances are collections of the methods for a specific purpose (e.g. model fitting, evaluation). These classes are extensible by users (i.e. a new method can be included to the collection by a user). Furthermore, the specific container classes were designed to handle the chain of processes (workflows). They are followed by some methods to facilitate their reproducibility on a new machine (i.e. they can be shared and reproduced by a new user on a new workstation). Reproducibility of an experiment refers to not only its’ exact repetition (repeatability), but also using the general idea and settings of the experiment in a new experiment. There is also a class to manage the metadata can be used for both methods and data in the framework. An object of the metadata class keeps some information (e.g. authors, date of creation, citation, and website) about the corresponding data or method. A user can find, for example, how to cite a new data, method, or process that has been created and shared by another user. An example of a data class and a container class in the sdm framework is provided in Fig. 1. A class may contain several subclasses and itself being a subclass of a superclass. A set of methods is defined for each class and can be used to handle the class during the simulation.

image

Class diagrams of a species data object (a), and a method container (b); each class contains several data, known as attributes or fields, kept in different slots (@slot‐name), and several methods defined as a list of functions to access the data objects in the class.

How does sdm work?

The sdm framework helps constructing and executing a chain of procedures that constitute the backbone of species distributions modelling. These procedures can be grouped into three steps: pre‐processing; processing; and post‐processing. Pre‐processing includes all procedures by which data becomes available for processing, when SDMs are fitted. After being processed, the model results are post‐processed given user‐specified settings (Fig. 2). An extensible set of functions (methods) is available for each step, which can be included into the chain by a user.

image

A schematic representation of a chain including the main classes and pre‐processing, processing, and post‐processing procedures for species distribution modelling in sdm.

Data management and pre‐processing

A set of utility functions is available in the sdm framework to read and handle species and environmental data in a flexible and automated way. Species data are usually available as a list of coordinates, or as a spatial point dataset. Environmental variables are mostly available as spatial data in the form of spatial vectors (e.g. lines, points, polygons) or rasters (i.e. spatial grids). GIS (Geographic Information Systems) operations are typically required to convert these kinds of data into a structure that is suitable for species distribution modelling. Such process of data manipulation is usually a challenge for non‐GIS experts, especially when the data vary in their extent or their coordinate systems. sdm can read species and environmental data with different common structures (spatial or non‐spatial), and is not sensitive to these problematic issues as they are automatically handled and fixed through the pre‐processing step. For instance, sdm uses several procedures to manipulate data when spatial datasets are introduced as the input data (e.g. species data as spatial points, and environmental predictors as a set of raster datasets) including: checking whether all the data use the same coordinate system, and if not, a project transformation is called to convert them into a unique coordinate system; checking whether they are spatially match and whether they use the same spatial extent, and if not, the extent will be matched and also the records outside of the main extent are recognized.

Data used in species distributions modeling typically carry a number of statistical problems (e.g. lack of absence data, multicollinearity among predictors, spatial autocorrelation in both response and predictor variables, positional uncertainty). Whilst solutions have been proposed to deal with these problems (Dormann 2011), current platforms for SDM tend to ignore them. The pre‐processing phase includes all procedures through which data are controlled for problematic issues and prepared for the processing (modeling) phase. These procedures are implemented as functions according to state‐of‐the‐art methods for the corresponding issues. We briefly describe some important procedures.

Pseudo‐absence – some models required absences as well as presences to be fitted. Yet all too often presence data alone are available. One option to deal with this problem is to generate pseudo absences. Pseudo absences tend either to be randomly drawn from a studied region, or environmentally or spatially stratified (Barbet‐Massin et al. 2012). These procedures for pseudo‐absence generation are available in sdm and can be used separately or within the modelling procedure. Furthermore, one can generate several replications of pseudo absences to explore the variability of the process through a simulation.

Collinearity – correlation between two or more predictor variables in a statistical model can cause problems of collinearity (also called multicollinearity). Many statistical models (especially regression‐type models) are sensitive to collinearity for it may cause instability in parameter estimation and biases in inference statistics (Dormann et al. 2013). Several approaches have been provided in the statistical literature to detect collinearity. Pairwise correlation coefficients and the variance inflation factor (VIF) (Marquardt 1970) are, perhaps, the most widely used approaches. The Pearson (r) or Spearman (ρ) correlation coefficients between a pair of variables can simply show whether two variables are correlated and, if so (usually when its value is greater than a threshold e.g. 0.7), having both variables in the modelling procedure may cause problems of collinearity. The VIF is a more precise method as it measures how strongly each predictor can be explained by the rest of predictors: if all information regarding a predictor is provided by other predictors why keep the predictor? The VIF is based on the square of the multiple correlation coefficient (R2) resulting from regressing the predictor variable against all other predictor variables. A VIF greater than 10 (as a rule of thumb) is a signal that the model has a collinearity problem (Chatterjee and Hadi 2006). All of the above measures are implemented in sdm and can be used to detect collinearity. To avoid collinearity in the modelling, one approach is to remove the collinear variable prior to model fitting. We developed two stepwise procedures to detect and exclude collinear variables: one based on VIF measure; the other using both correlation coefficients and VIF. The former approach calculates VIF for all predictors and excludes the one with the greatest VIF (if it is greater than a threshold). The procedure is repeated until all strongly collinear variables are excluded. The second approach calculates the correlation coefficients between variables and identifies a strongly correlated pair with the highest coefficient. Then the variable with a highest VIF is excluded from the pair, and the procedure is repeated until no strongly correlated pair remains.

Principle component analysis (PCA) can be used as a data reduction technique to reduce dimensionality in predictor variables (Heikkinen et al. 2006) and is available in sdm.

Positional uncertainty – increasing amounts of species data, especially presence‐only data from museum or herbarium collections (Graham et al. 2004) or from volunteer observation networks (Wood et al. 2011), are becoming available on the Internet. One of the problems with these data is the uncertainty regarding the exact position of the occurrence records (Graham et al. 2004, Rowe 2005). Examining spatial autocorrelation in predictor variables is one possible strategy to investigate whether positional uncertainty in species occurrences is problematic (Naimi et al. 2011, 2014). Spatial autocorrelation in predictors can give insight into how similar the nearby locations are to the uncertain species location. Strong spatial autocorrelation indicates that the errors in species locations matter less, because nearby locations have similar environmental characteristics to the true location. Spatial autocorrelation can be measured globally, over the entire study area (e.g. using a variogram; Naimi et al. 2011), or locally at each species location (e.g. using a local spatial autocorrelation measure; Naimi et al. 2014). The former can give insight into the level of positional uncertainty under which the models will be sensitive (by assuming that the spatial structure is the same over the study area), while the latter leads to identify the species locations that are likely to be problematic as a consequence of positional uncertainty. sdm, implements the two methods.

Feature construction – before processing the models, user‐defined features are established that determine how species distributions data are related to environmental variables. Several features are available in sdm including linear, quadratic, polynomial, product, hinge, threshold, spline, and factor that can be extended by the user according to the needs. sdm treats features as model‐independent, which is an important advantage over other SDM platforms as it makes it possible to use and compare unique set of feature classes across all models (subject to the being supported by modelling algorithm). The ability to set common features across different models helps overcoming one of the main drawbacks of existing model comparisons: not controlling for varying features across models. For example, while Maxent software (Phillips et al. 2006) supports hinge and threshold features in fitting a maximum entropy algorithm, the other SDM software do not support them.

Processing (model fitting)

Model fitting is a step in modelling species distributions, whereby one or several model(s) is fitted to relate response variables (species distributions) to predictor (environmental) variables. A user can select any (or all) of available methods (modelling algorithms). Several instances of a model may be used with different settings, and/or ensembles of several models can also be generated for each species to generate a consensus among them. Currently, sdm supports 15 modelling methods including generalized linear model (GLM; McCullagh and Nelder 1989), generalized additive model (GAM; Hastie and Tibshirani 1990), classification and regression trees (CART; Breiman et al. 1984), boosted regression trees (BRT; Friedman 2001), multivariate adaptive regression spline (MARS; Friedman 1991), mixture discriminant analysis (MAD; Hastie et al. 1994), random forests (RF; Breiman 2001), support vector machine (SVM; Vapnik 1995), artificial neural networks (ANN; Rosenblatt 1958), environmental niche factor analysis (ENFA; Hirzel et al. 2002), maximum entropy (Maxent; Phillips et al. 2006), maxlike (Royle et al. 2012), Bioclim (Busby 1991), Domain (Carpenter et al. 1993), and Mahalanobis (Farber and Kadmon 2003). Furthermore, several community‐based models (Baselga and Araújo 2009) and consensus techniques (Garcia et al. 2012), derived from fitting multiple (i.e. ensembles) of models (Araújo and New 2007), are implemented in sdm. Most of these modelling methods were available through different packages in R (e.g. GAM, BRT, SVM). sdm depends on and uses these packages to fit the models based on such methods that are selected by a user. Several modelling methods (Table 1) as well as all of the procedures in the pre‐ and post‐processing (e.g. multicollinearity test, variable importance, model evaluation), are implemented in the sdm package. The programme also provides some facilitator functions enabling the user to include (and use) new methods as they become available. The new method, or the specific settings for using an existing one, can then be exported and published (for example on Internet) for other users.

Table 1. A list of implemented modeling methods in the first release of the sdm package and their dependent packages
Modelling methods Depends on
Generalized linear models (GLM) stats
Generalized additive models (GAM) mgcv; gam
Boosted regression trees (BRT) gbm
Support vector machine (SVM) kernlab
Classification and regression trees (CART) tree
Multivariate adaptive regression spline (MARS) earth
Mixture discriminant analysis (MAD) mda
Random forests (RF) randomForest
Artificial neural networks (ANN) nnet; neuralnet
Environmental niche factor analysis (ENFA) adehabitatHS
Maximum entropy (maxent) Java software: maxent.jar
Maxlike maxlike
Bioclim NONE
Domain NONE
Mahalanobis NONE
Ensemble modelling NONE
Community‐based models gdm; mda

Post‐processing

When the models are fitted, there are several additional processes that can be employed, including model evaluation, prediction, and variable importance assessment. sdm also offers specific functions to analyse geographically the outputs when multiple species are modelled (e.g. calculation of species richness, beta diversity, and niche similarity), or to assess the temporal changes when species records are available in multiple time periods.

Model evaluation (accuracy assessment) – a comprehensive set of model evaluation procedures are implemented in sdm. Ideally, statistically independent data (test data) should be used to evaluate model predictions (Araújo et al. 2005a), otherwise a data‐splitting method is often used as an alternative by which a randomly drawn sample of the data are used to train the models and the remaining data are used for model evaluation (but see for alternative approaches, Madon et al. 2013). A one‐time data‐splitting has been widely used for this purpose, although it may introduce a bias to the parameter estimation (Araújo et al. 2005a). This issue can be overcome by using a family of resampling methods including random subsampling, K‐fold cross‐validation, Jackknife (leave‐one‐out), and bootstrapping (Hastie et al. 2009). Subsampling repeats the random data splitting into training and testing proportions K times (uses sampling without replacement). K‐fold cross‐validation, first, splits the data into K roughly equal‐sized parts, and then fits the models K times. Each time one part is used as test data and the other K – 1 parts of the data are used as training data. Leave‐one‐out is equal to the K‐folds cross‐validation when K is equal to the number of observations. This means that only one observation is used to evaluate the model at each run. Bootstrapping repeats a sampling with replacement method, each time a sample with equal size as the original data is drawn and used for training data. The observations that are not selected are used for the evaluation at each run. In sdm, all these procedures are implemented and can be used (one or all) in the evaluation procedure. Many state‐of‐the‐art statistics for evaluating SDMs (Fielding and Bell 1997) are implemented that include threshold‐dependent statistics (e.g. TSS, Sensitivity, Specificity), threshold‐independent statistics (e.g. AUC, COR), and methods developed to calculate p‐values through Jackknife for data sets with small sample size (Pearson et al. 2007).

Variable importance and response curve – determining the role of predictor variables in explaining the species distribution is of practical relevance to researchers concerned with interpreting the outputs of the models. Evaluating how important each variable is (Murray and Conner 2009) and/or visualizing the predicted response of species to the predictor variable (Elith et al. 2005) are two known methods to determine predictor variable importance. In sdm, several model‐independent techniques were implemented to evaluate the importance of variables and visualize species response curves. In sdm, response curves are generated according to the procedure proposed by Elith et al. (2005). Additional model‐independent techniques were also implemented to evaluate the relative variable importance. For example, a method is to calculate the improvement of the model performance over inclusion of each variable comparing to when the variable is excluded through a cross‐validation procedure. Another method is a randomization procedure that measures the correlation between the predicted values and predictions where the variable under investigation is randomly permutated. If the contribution of a variable to the model is high, then it is expected that the prediction is more affected by a permutation and therefore the correlation is lower. Therefore, ‘1 – correlation’ can be considered as a measure of variable importance (Thuiller et al. 2009).

Other capabilities

Apart from the functionalities corresponding to the main steps in species distribution modelling, sdm implements additional functions and classes for different purposes including controlling the data and procedures for handling the errors, facilitating the extensibility and reproducibility of the methods and procedures (by allowing to include or modify a method or procedure and distribute to the wider community), providing graphical user interface (GUI) and making the framework easy‐to‐use, generating dynamic reports, and implicitly parallelize the procedures to boost them through high performance computing, etc. Figure 3 provides a simple example on interfacing sdm through command line and GUI as well as some outputs. A tutorial contains further examples is provided with the package as a vignette, demostrating the main capabilities of sdm (as listed in ‘design of the sdm package’).

image

An example of using sdm package in R that demonstrating reading data, fitting species distribution models and predicting; (a) species dataset is provided as a spatial shapefile including presence–absence records, and two predictor variables (NDVI and precipitation) are in the format of raster Ascii files; (b) shows how the data are loaded and the model is fitted in the command line interface; (c) shows how the predict function can be used, and its outputs are visualized in geographic and niche space; (d) shows an example of GUI interface (as an alternative to the command line interface).

Conclusion

sdm is an object‐oriented reproducible and extensible framework for species distribution modelling in R that unified different implementations of SDMs in a single framework. sdm provides an easy‐to‐use comprehensive framework to perform the entire modelling process within the same environment using different state‐of‐the‐art approaches. The software is designed such to enable users to extend it and share the new data, methods or procedures to reproduce them by other users.

To cite sdm or acknowledge its use, cite this Software note as follows, substituting the version of the application that you used for ‘version 0’:

Naimi, B. and Araújo, M. B. 2016. sdm: a reproducible and extensible R platform for species distribution modelling. – Ecography 39: 368–375 (ver. 0).

    Number of times cited according to CrossRef: 121

    • ENMTML: An R package for a straightforward construction of complex ecological niche models, Environmental Modelling & Software, 10.1016/j.envsoft.2019.104615, (104615), (2020).
    • Under predicted climate change: Distribution and ecological niche modelling of six native tree species in Gilgit-Baltistan, Pakistan, Ecological Indicators, 10.1016/j.ecolind.2019.106049, 111, (106049), (2020).
    • Habitat suitability analysis reveals high ecological flexibility in a “strict” forest primate, Frontiers in Zoology, 10.1186/s12983-020-00352-2, 17, 1, (2020).
    • Seafloor geomorphic features as an alternative approach into modelling the distribution of cetaceans, Ecological Informatics, 10.1016/j.ecoinf.2020.101092, (101092), (2020).
    • A standard protocol for reporting species distribution models, Ecography, 10.1111/ecog.04960, 43, 9, (1261-1277), (2020).
    • An open-source framework to model present and future marine species distributions at local scale, Ecological Informatics, 10.1016/j.ecoinf.2020.101130, (101130), (2020).
    • Increasing Cervidae populations have variable impacts on habitat suitability for threatened forest plant and lichen species, Forest Ecology and Management, 10.1016/j.foreco.2020.118286, 473, (118286), (2020).
    • A comparison between Ensemble and MaxEnt species distribution modelling approaches for conservation: A case study with Egyptian medicinal plants, Ecological Informatics, 10.1016/j.ecoinf.2020.101150, 60, (101150), (2020).
    • Modelling human health vulnerability using different machine learning algorithms in stone quarrying and crushing areas of Dwarka river Basin, Eastern India, Advances in Space Research, 10.1016/j.asr.2020.05.032, (2020).
    • Identifying sources of dust aerosol using a new framework based on remote sensing and modelling, Science of The Total Environment, 10.1016/j.scitotenv.2020.139508, 737, (139508), (2020).
    • Modelling the impact of tidal range energy on species communities, Ocean & Coastal Management, 10.1016/j.ocecoaman.2020.105221, 193, (105221), (2020).
    • Biogeographical characterisation of Egypt based on environmental features and endemic vascular plants distribution, Applied Geography, 10.1016/j.apgeog.2020.102208, 119, (102208), (2020).
    • Toward community predictions: Multi‐scale modelling of mountain breeding birds' habitat suitability, landscape preferences, and environmental drivers, Ecology and Evolution, 10.1002/ece3.6295, 10, 12, (5544-5557), (2020).
    • A fine-tuned global distribution dataset of marine forests, Scientific Data, 10.1038/s41597-020-0459-x, 7, 1, (2020).
    • Bioclimatic modeling of the European distribution of the invasive Asian tiger mosquito, Aedes (Stegomyia) albopictus (Skuse, 1895), with special reference to Ukraine, Reports of the National Academy of Sciences of Ukraine, 10.15407/dopovidi2020.03.088, 3, (88-93), (2020).
    • Bioclimatic modeling of the distribution of brown marmorated stink bug Halyomorpha halys (Stål, 1855), with special reference to Ukraine, Reports of the National Academy of Sciences of Ukraine, 10.15407/dopovidi2020.02.082, 2, (82-86), (2020).
    • Revisiting the North-South genetic discontinuity in Central African tree populations: the case of the low-density tree species Baillonella toxisperma, Tree Genetics & Genomes, 10.1007/s11295-019-1408-8, 16, 1, (2020).
    • Relating modelled habitat suitability for Abies balsamea to on-the-ground species structural characteristics in naturally growing forests, Ecological Indicators, 10.1016/j.ecolind.2019.105981, 111, (105981), (2020).
    • Integrated machine learning methods with resampling algorithms for flood susceptibility prediction, Science of The Total Environment, 10.1016/j.scitotenv.2019.135983, 705, (135983), (2020).
    • ntbox: An r package with graphical user interface for modelling and evaluating multidimensional ecological niches, Methods in Ecology and Evolution, 10.1111/2041-210X.13452, 11, 10, (1199-1206), (2020).
    • Testing macroecological abundance patterns: The relationship between local abundance and range size, range position and climatic suitability among European vascular plants, Journal of Biogeography, 10.1111/jbi.13926, 47, 10, (2210-2222), (2020).
    • The continuing march of Common Green Iguanas: arrival on mainland Asia, Journal for Nature Conservation, 10.1016/j.jnc.2020.125888, (125888), (2020).
    • Assessing, mapping, and optimizing the locations of sediment control check dams construction, Science of The Total Environment, 10.1016/j.scitotenv.2020.139954, 739, (139954), (2020).
    • Integrating Computational Methods to Investigate the Macroecology of Microbiomes, Frontiers in Genetics, 10.3389/fgene.2019.01344, 10, (2020).
    • Ecological Niche Models of Four Hard Tick Genera (Ixodidae) in Mexico, Animals, 10.3390/ani10040649, 10, 4, (649), (2020).
    • Ensemble models of GLM, FDA, MARS, and RF for flood and erosion susceptibility mapping: a priority assessment of sub-basins, Geocarto International, 10.1080/10106049.2020.1829101, (1-16), (2020).
    • Assessing Preble’s Meadow Jumping Mouse (Zapus hudsonius preblei) Habitat and Connectivity for Conservation and Restoration, Wetlands, 10.1007/s13157-020-01374-6, (2020).
    • Flood susceptibility prediction using four machine learning techniques and comparison of their performance at Wadi Qena Basin, Egypt, Natural Hazards, 10.1007/s11069-020-04296-y, (2020).
    • Ensemble Modelling of Skipjack Tuna (Katsuwonus pelamis) Habitats in the Western North Pacific Using Satellite Remotely Sensed Data; a Comparative Analysis Using Machine-Learning Models, Remote Sensing, 10.3390/rs12162591, 12, 16, (2591), (2020).
    • Predicting the Potential Current and Future Distribution of the Endangered Endemic Vascular Plant Primula boveana Decne. ex Duby in Egypt, Plants, 10.3390/plants9080957, 9, 8, (957), (2020).
    • Using species distribution models to locate the potential cradles of the allopolyploid Gypsophila bermejoi G. López (Caryophyllaceae), PLOS ONE, 10.1371/journal.pone.0232736, 15, 5, (e0232736), (2020).
    • Muddying the waters: investigating the generality of silt-resistance in mound-building Nocomis spp. using hornyhead chub (Nocomis biguttatus) and redspot chub (Nocomis asper), Environmental Biology of Fishes, 10.1007/s10641-020-00984-7, (2020).
    • The dark side of raptors’ distribution ranges under climate change, Landscape Ecology, 10.1007/s10980-020-01025-5, (2020).
    • Margay (Leopardus wiedii) in the southernmost Atlantic Forest: Density and activity patterns under different levels of anthropogenic disturbance, PLOS ONE, 10.1371/journal.pone.0232013, 15, 5, (e0232013), (2020).
    • Multi-source spatial data-based invasion risk modeling of Striga ( Striga asiatica ) in Zimbabwe , GIScience & Remote Sensing, 10.1080/15481603.2020.1744250, (1-19), (2020).
    • An Effective Way to Map Land-Use Intensity with a High Spatial Resolution Based on Habitat Type and Environmental Data, Remote Sensing, 10.3390/rs12060969, 12, 6, (969), (2020).
    • Impact assessment of seven alien invasive bird species already introduced to South Africa, Biological Invasions, 10.1007/s10530-020-02221-9, (2020).
    • Evaluation of Recent Advanced Soft Computing Techniques for Gully Erosion Susceptibility Mapping: A Comparative Study, Sensors, 10.3390/s20020335, 20, 2, (335), (2020).
    • Soil Erosion Susceptibility Mapping in Kozetopraghi Catchment, Iran: A Mixed Approach Using Rainfall Simulator and Data Mining Techniques, Land, 10.3390/land9100368, 9, 10, (368), (2020).
    • Isolation by geographical distance after release from Pleistocene refugia explains genetic and phenotypic variation in Xylotrupes siamensis (Coleoptera: Scarabaeidae), Zoological Journal of the Linnean Society, 10.1093/zoolinnean/zlaa106, (2020).
    • Ensemble modeling of the potential distribution of the whale shark in the Atlantic Ocean, Ecology and Evolution, 10.1002/ece3.5884, 10, 1, (175-184), (2019).
    • Biogeography and conservation status of the pineapple family (Bromeliaceae), Diversity and Distributions, 10.1111/ddi.13004, 26, 2, (183-195), (2019).
    • A comparison of macroecological and stacked species distribution models to predict future global terrestrial vertebrate richness, Journal of Biogeography, 10.1111/jbi.13696, 47, 1, (114-129), (2019).
    • An ecological barrier between the Himalayas and the Hengduan Mountains maintains the disjunct distribution of Roscoea, Journal of Biogeography, 10.1111/jbi.13729, 47, 2, (326-341), (2019).
    • Alternative reproductive adaptations predict asymmetric responses to climate change in lizards, Scientific Reports, 10.1038/s41598-019-41670-8, 9, 1, (2019).
    • Habitat suitability, live abundance and their link to road mortality of Tasmanian wildlife, Wildlife Research, 10.1071/WR18128, 46, 3, (236), (2019).
    • Species Distribution Modeling in Latin America: A 25-Year Retrospective Review, Tropical Conservation Science, 10.1177/1940082919854058, 12, (194008291985405), (2019).
    • Is Myotis lavali (Chiroptera, Vespertilionidae) endemic to the South American dry diagonal?, Journal of Mammalogy, 10.1093/jmammal/gyz141, 100, 6, (1879-1888), (2019).
    • Update of the Scientific Opinion on the risks to plant health posed by Xylella fastidiosa in the EU territory, EFSA Journal, 10.2903/j.efsa.2019.5665, 17, 5, (2019).
    • Pest survey card on Xylella fastidiosa, EFSA Supporting Publications, 10.2903/sp.efsa.2019.EN-1667, 16, 6, (2019).
    • esdm: A tool for creating and exploring ensembles of predictions from species distribution and abundance models, Methods in Ecology and Evolution, 10.1111/2041-210X.13283, 10, 11, (1923-1933), (2019).
    • Spatial distribution and spread potential of sixteen Leptospira serovars in a subtropical region of Brazil, Transboundary and Emerging Diseases, 10.1111/tbed.13306, 66, 6, (2482-2495), (2019).
    • How does spatial resolution affect model performance? A case for ensemble approaches for marine benthic mesophotic communities, Journal of Biogeography, 10.1111/jbi.13581, 46, 6, (1249-1259), (2019).
    • Assessing the environmental and dispersal controls on Fagus grandifolia distributions in the Great Lakes region, Journal of Biogeography, 10.1111/jbi.13491, 46, 2, (405-419), (2019).
    • Projected 21st‐century distribution of canopy‐forming seaweeds in the Northwest Atlantic with climate change, Diversity and Distributions, 10.1111/ddi.12897, 25, 4, (582-602), (2019).
    • A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD, Diversity and Distributions, 10.1111/ddi.12892, 25, 5, (839-852), (2019).
    • The MIAmaxent R package: Variable transformation and model selection for species distribution models, Ecology and Evolution, 10.1002/ece3.5654, 9, 21, (12051-12068), (2019).
    • Species distribution modelling to support forest management. A literature review, Ecological Modelling, 10.1016/j.ecolmodel.2019.108817, 411, (108817), (2019).
    • Climate change induced range shifts in seaweeds distributions in Europe, Marine Environmental Research, 10.1016/j.marenvres.2019.04.012, (2019).
    • Modelling current and future potential distributions of two desert jerboas under climate change in Iran, Ecological Informatics, 10.1016/j.ecoinf.2019.04.003, (2019).
    • Climate change and opposing spatial conservation priorities for anuran protection in the Brazilian hotspots, Journal for Nature Conservation, 10.1016/j.jnc.2019.04.003, (2019).
    • Land subsidence modelling using tree-based machine learning algorithms, Science of The Total Environment, 10.1016/j.scitotenv.2019.03.496, 672, (239-252), (2019).
    • Spatial Modeling of Gully Erosion, Spatial Modeling in GIS and R for Earth and Environmental Sciences, 10.1016/B978-0-12-815226-3.00030-2, (653-669), (2019).
    • A Comparative Study of Functional Data Analysis and Generalized Linear Model Data-Mining Techniques for Landslide Spatial Modeling, Spatial Modeling in GIS and R for Earth and Environmental Sciences, 10.1016/B978-0-12-815226-3.00021-1, (467-484), (2019).
    • Regional Groundwater Potential Analysis Using Classification and Regression Trees, Spatial Modeling in GIS and R for Earth and Environmental Sciences, 10.1016/B978-0-12-815226-3.00022-3, (485-498), (2019).
    • An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines, Science of The Total Environment, 10.1016/j.scitotenv.2018.10.064, 651, (2087-2096), (2019).
    • Predicting the impacts of climate change, soils and vegetation types on the geographic distribution of Polyporus umbellatus in China, Science of The Total Environment, 10.1016/j.scitotenv.2018.07.465, 648, (1-11), (2019).
    • Land subsidence hazard modeling: Machine learning to identify predictors and the role of human activities, Journal of Environmental Management, 10.1016/j.jenvman.2019.02.020, 236, (466-480), (2019).
    • Regional adaptation of European beech (Fagus sylvatica) to drought in Central European conditions considering environmental suitability and economic implications, Regional Environmental Change, 10.1007/s10113-019-01472-0, (2019).
    • Challenges and opportunities in developing decision support systems for risk assessment and management of forest invasive alien species, Environmental Reviews, 10.1139/er-2019-0024, (1-28), (2019).
    • Identifying and applying an optimum set of environmental variables in species distribution models, Inland Waters, 10.1080/20442041.2019.1653111, (1-18), (2019).
    • New distribution records from the Brazilian Cerrado and species distribution modelling of Boana crepitans, Lithobates palmipes, Pipa pipa , and Micrurus h. hemprichii , Biodiversity, 10.1080/14888386.2019.1664931, (1-12), (2019).
    • The phylogeography of Vellozia auriculata (Velloziaceae) supports low zygotic gene flow and local population persistence in the campo rupestre, a Neotropical OCBIL, Botanical Journal of the Linnean Society, 10.1093/botlinnean/boz051, (2019).
    • Modeling the Influence of Groundwater Exploitation on Land Subsidence Susceptibility Using Machine Learning Algorithms, Natural Resources Research, 10.1007/s11053-019-09490-9, (2019).
    • Variation in floral morphology in a hybrid complex of Cyclamen in Sardinia , Plant Ecology & Diversity, 10.1080/17550874.2019.1593545, (1-11), (2019).
    • Climate Change and Geographic Ranges: The Implications for Russian Forests, Frontiers in Ecology and Evolution, 10.3389/fevo.2019.00057, 7, (2019).
    • Floristic patterns and ecological drivers of sand dune ecosystem along the Mediterranean coast of Egypt, Arid Land Research and Management, 10.1080/15324982.2018.1564147, (1-24), (2019).
    • Multi-Temporal Analysis of Forest Fire Probability Using Socio-Economic and Environmental Variables, Remote Sensing, 10.3390/rs11010086, 11, 1, (86), (2019).
    • Niche Modeling of Economically Important Mahanarva (Hemiptera, Cercopidae) Species in South and Central America: Are Brazilian Spittlebug Sugarcane Pests Potential Invaders of South and Central America?, Journal of Economic Entomology, 10.1093/jee/toz252, (2019).
    • Global warming decreases the morphological traits of germination and environmental suitability of Dipteryx alata (Fabaceae) in Brazilian Cerrado, Acta Botanica Brasilica, 10.1590/0102-33062018abb0288, (2019).
    • Urban flood risk mapping using data-driven geospatial techniques for a flood-prone case area in Iran, Hydrology Research, 10.2166/nh.2019.090, (2019).
    • The effect of sample size on the accuracy of species distribution models: considering both presences and pseudo‐absences or background sites, Ecography, 10.1111/ecog.03188, 42, 3, (535-548), (2018).
    • Wallace: A flexible platform for reproducible modeling of species niches and distributions built for community expansion, Methods in Ecology and Evolution, 10.1111/2041-210X.12945, 9, 4, (1151-1156), (2018).
    • Guidance on quantitative pest risk assessment, EFSA Journal, 10.2903/j.efsa.2018.5350, 16, 8, (2018).
    • Inferring climatic controls of rice stem borers’ spatial distributions using maximum entropy modelling, Journal of Applied Entomology, 10.1111/jen.12493, 142, 4, (388-396), (2018).
    • Plant biodiversity patterns along a climatic gradient and across protected areas in West Africa, African Journal of Ecology, 10.1111/aje.12517, 56, 3, (641-652), (2018).
    • Parapatric subspecies of Macaca assamensis show a marginal overlap in their predicted potential distribution: Some elaborations for modern conservation management, Ecology and Evolution, 10.1002/ece3.4405, 8, 19, (9712-9727), (2018).
    • Expected impacts of climate change threaten the anuran diversity in the Brazilian hotspots, Ecology and Evolution, 10.1002/ece3.4357, 8, 16, (7894-7906), (2018).
    • Model-R: A Framework for Scalable and Reproducible Ecological Niche Modeling, High Performance Computing, 10.1007/978-3-319-73353-1_15, (218-232), (2018).
    • Molluscan indicator species and their potential use in ecological status assessment using species distribution modeling, Marine Environmental Research, 10.1016/j.marenvres.2018.05.020, 140, (10-17), (2018).
    • Using species distribution models at local scale to guide the search of poorly known species: Review, methodological issues and future directions, Ecological Modelling, 10.1016/j.ecolmodel.2018.07.018, 385, (124-132), (2018).
    • Mapping invasion potential using ensemble modelling. A case study on Yushania maling in the Darjeeling Himalayas, Ecological Modelling, 10.1016/j.ecolmodel.2018.07.001, 385, (35-44), (2018).
    • Species Distributions, Spatial Ecology and Conservation Modeling, 10.1007/978-3-030-01989-1, (213-269), (2018).
    • A Statistical Comparison between Less and Common Applied Models to Estimate Geographical Distribution of Endangered Species (Felis margarita) in Central Iran, Contemporary Problems of Ecology, 10.1134/S1995425518060148, 11, 6, (687-696), (2018).
    • Modeling the distribution of Populus euphratica in the Heihe River Basin, an inland river basin in an arid region of China, Science China Earth Sciences, 10.1007/s11430-017-9241-2, 61, 11, (1669-1684), (2018).
    • ELSA: Entropy-based local indicator of spatial association, Spatial Statistics, 10.1016/j.spasta.2018.10.001, (2018).
    • Ensemble species distribution modelling with transformed suitability values, Environmental Modelling & Software, 10.1016/j.envsoft.2017.11.009, 100, (136-145), (2018).
    • Does a correlation exist between environmental suitability models and plant population parameters? An experimental approach to measure the influence of disturbances and environmental changes, Ecological Indicators, 10.1016/j.ecolind.2017.12.009, 86, (1-8), (2018).
    • A novel machine learning-based approach for the risk assessment of nitrate groundwater contamination, Science of The Total Environment, 10.1016/j.scitotenv.2018.07.054, 644, (954-962), (2018).
    • Incorporating knowledge uncertainty into species distribution modelling, Biodiversity and Conservation, 10.1007/s10531-018-1675-y, (2018).
    • See more

    The full text of this article hosted at iucr.org is unavailable due to technical difficulties.