New measures for assessing model equilibrium and prediction mismatch in species distribution models
Abstract
Models based on species distributions are widely used and serve important purposes in ecology, biogeography and conservation. Their continuous predictions of environmental suitability are commonly converted into a binary classification of predicted (or potential) presences and absences, whose accuracy is then evaluated through a number of measures that have been the subject of recent reviews. We propose four additional measures that analyse observation‐prediction mismatch from a different angle – namely, from the perspective of the predicted rather than the observed area – and add to the existing toolset of model evaluation methods. We explain how these measures can complete the view provided by the existing measures, allowing further insights into distribution model predictions. We also describe how they can be particularly useful when using models to forecast the spread of diseases or of invasive species and to predict modifications in species’ distributions under climate and land‐use change.
Introduction
Models based on species distributions are increasingly used in ecology, conservation and management, serving a number of important purposes (see e.g. Jiménez‐Valverde & Lobo, 2007 for a brief review). The predictions from such models, usually continuous values of environmental suitability or similar, are often converted into a binary classification of presence or absence, determined by a threshold above which the model is considered to predict the species to be present (Jiménez‐Valverde & Lobo, 2007; Nenzén & Araújo, 2011). After this binary conversion, a confusion matrix (Fig. 1) can be generated from the numbers of observed and predicted presences and absences (e.g. Fielding & Bell, 1997; Manel et al., 2001; Anderson et al., 2003). From this matrix, several measures can be calculated to evaluate the capacity of a model to correctly classify presences and absences, including measures of match and of mismatch between predictions and observations; such measures have been recently reviewed (Liu et al., 2009, 2011). Among the measures of mismatch are the omission and commission rates (Anderson et al., 2003), also known as false‐negative and false‐positive rates (Fielding & Bell, 1997; Liu et al., 2009, 2011): omission refers to species’ presences that are missed by the model (i.e. classified as absences), and commission refers to the presences that are predicted outside the area where the species was observed (i.e. absences classified as presences).

We would first like to point out that, although these measures (especially omission) are commonly referred to as errors (e.g. Guisan & Zimmermann, 2000; Teixeira et al., 2001; Anderson et al., 2003; Bulluck et al., 2006; Elith et al., 2006; Liu et al., 2011; Nenzén & Araújo, 2011; Peterson et al., 2011), neither omission nor commission are necessarily shortcomings of a model. Models are meant to infer, from the recorded distribution, the environmentally suitable areas for the species. As we detail below, a species may be absent from suitable areas, or present in less adequate areas, without this meaning that the model has made a mistake (see also Sillero et al., 2010).
Omission (presences not predicted by the model), while being more likely to reflect prediction error than commission, may also result from errors of identification or georeferencing of particular species records, as no data set can be deemed completely error free. Omissions may also reveal areas where a species is present under suboptimal conditions (e.g. sinks in the source–sink theory; Pulliam, 1988) due to spatially contagious processes such as dispersal or immigration. In the case of generalist or widespread species, it is common to observe presences in regions below the putative presence–absence (or suitable–unsuitable) threshold, as well as absences above this threshold, because generalists can usually tolerate a wider range of environmental conditions, and effective thresholds are difficult to define.
Commission (presences predicted outside the observed occurrence area) can point to areas where the modelled species occurs but has not been detected or sufficiently surveyed (again, no data set is guaranteed to be complete and error free). Commissions may also represent suitable areas to where the species has not managed to disperse (due to physical barriers, insufficient dispersal ability or lack of time), or where it has become temporarily extinct due to recent disturbance events (e.g. suitable unoccupied patches in metapopulation theory; Levins, 1969); or areas that are suitable on the basis of the environmental variables that were included in the model, but that are unsuitable on the basis of other factors such as biotic interactions (Anderson et al., 2003; Real et al., 2009; Barbosa et al., 2009, 2010).
Hence, rather than a drawback, model misclassifications can allow the extraction of ecological and evolutionary inferences by comparison of the observed and the predicted (potential) distributions of species (Anderson et al., 2003). As such, omission and commission should generally be referred to as rates rather than errors; this may also help in distinguishing error associated with the accuracy of the field data.
That said, additional informative measures can be calculated, regarding under‐ or over‐predicted presences and absences, that are not included in the published reviews on the evaluation measures of binary‐converted models (Fielding & Bell, 1997; Liu et al., 2009, 2011). We present four new measures that can be added to the existing suite of model evaluation metrics and provide useful insights into the potential or predicted distributions of species.
Rationale and Calculation
The omission and commission rates are calculated in relation to the observed data: omission is the proportion of predicted absences in the recorded presence area, and commission is the proportion of predicted presences in the observed (or assumed) absence area (Fielding & Bell, 1997; Anderson et al., 2003; Liu et al., 2009, 2011). In other words, omission and commission measure how many of the observations are incorrectly classified by the model. Omission is calculated based on the number of observed presences, and commission is calculated based on the number of observed/assumed absences (Fig. 1).
However, this procedure may pose some problems. Firstly, the omission and commission rates are the complements of model sensitivity and specificity (i.e. the proportions of correctly classified presences and absences, respectively), which are widely used in species distribution modelling. Hence, if we have sensitivity [Se = a/(a + c)], the omission rate is redundant (Om = c/(a + c) = 1−Se), and the same goes for specificity [Sp = d/(b + d)] and the commission rate (Co = b/(b + d) = 1−Sp; see Fig. 1 for the meanings of a, b, c and d).
Secondly, calculating omission and commission in this manner can sometimes lead to unrealistic assessments of model fit. For example, for a species with a restricted distribution within the studied territory, even a model that predicts more than twice the number of recorded presences may exhibit a low commission rate, given the high number of (assumed) absences relative to which this rate is calculated (e.g. Teixeira et al., 2001).
Thirdly, as a result of the frequent and generally recommended procedure of optimizing the binary conversion threshold to maximize both sensitivity and specificity (or to minimize the difference between the two), generally with a preference towards sensitivity (Manel et al., 2001; Jiménez‐Valverde & Lobo, 2007), sensitivity and specificity often show similar values, with sensitivity being slightly higher. Therefore, their complements omission and commission also take similar values, with commission being generally slightly higher. It is thus difficult to gauge, from omission and commission, whether a model mainly tends to either under‐ or over‐predict a species’ distribution.


The under‐ and over‐prediction rates are the complements of the negative and positive predictive power (NPP and PPP; Fielding & Bell, 1997), also called negative and positive predictive value (NPV and PPV; Liu et al., 2009, 2011), respectively. However, although NPP and PPP are relatively popular in fields such as medical diagnostics, they are seldom used in species distribution modelling (Liu et al., 2009). This could be because NPP and PPP are measures of goodness of fit, for which distribution modellers tend to prefer sensitivity and specificity. Distribution modellers are, however, interested in counterbalancing sensitivity and specificity with measures of disagreement between predictions and observations. While omission and commission are not suitable in this case, given that they do not add any information to sensitivity and specificity, the under‐ and over‐prediction rates are useful to assess lack of model fit while completing the view provided by sensitivity and specificity.


Case Studies and Potential Applications
We illustrate the use of these measures on the Iberian mole (Talpa occidentalis), an insectivorous mammal endemic to the Iberian Peninsula (SW Europe), whose distribution in Spain was modelled previously (Ribas et al., 2006; Fig. 2). More details on the data and modelling method are provided in Appendix S1 (see Supporting Information), where we also describe a series of additional case studies on species with varying range sizes (restricted to widespread) and biogeographical characteristics (native, invasive, metapopulational).

For the Iberian mole, omission and commission (like their complements sensitivity and specificity) become balanced near the 0.5 favourability threshold, and their values, which are calculated relative to the observed occurrence area, denote high model accuracy. However, from the perspective of the predicted occurrence area, over‐prediction is substantial at the same threshold, with 66% of the predicted occurrence area not being actually occupied. The potential presence increment is also relatively high at this threshold, as the model predicts more than twice the observed occurrence area. Equilibrium between observed and predicted occupancy is not attained until the 0.72 threshold, where the potential increments in presences and absences approach zero (Fig. 2).
Further insights arise from analysing species with varying prevalence or relative occurrence area (see Appendix S1). While omission and commission (following sensitivity and specificity) had similar values for medium thresholds within every model, the under‐ and over‐prediction rates were often visibly different from each other. Moreover, over‐prediction was higher than under‐prediction for some species and lower than under‐prediction for others (Figs S1 and S2) and, except for the most widespread species, this occurred along most of the range of possible thresholds separating predicted presences from predicted absences (Figs S3 and S4).
For restricted‐range species, although commission rates were low (following the high specificity), over‐prediction rates were substantial, reflecting the fact that a high proportion of the predicted favourable areas are not actually occupied (Figs S1 and S3). The most widespread species, on the other hand, have relatively high rates of under‐prediction (unfavourable localities that are actually occupied), despite the substantially lower omission (Figs S2 and S4). Equilibrium between potential and occupied area (i.e. null presence and absence increment) is achieved at very high favourability thresholds for restricted species (Fig. S3). This reflects specialists with low‐entropy distributions, requiring excellent environmental thresholds to occupy the whole suitable area; under those thresholds, there are always more favourable than actually occupied sites. As species prevalence increases, this equilibrium threshold decreases, approaching the sensitivity–specificity balance threshold. Middle favourability thresholds thus provide equilibrium between potential and observed distributions for these widespread species, reflecting less environmentally demanding occurrence patterns (Fig. S4). This information is not provided by the omission–commission (nor by the sensitivity–specificity) plots. The proposed measures thus allow further insights into the models’ tendency for either under‐ or over‐predicting species’ occurrence areas, from a novel point of view, independently of the information provided by omission and commission (or their complements sensitivity and specificity).
The measures presented here, as well as their variation across the range of decision thresholds (Fig. 2), can be easily calculated for analogous data sets (binary observations versus continuous predictions) with the modEvA package for R (Barbosa et al., 2013), which is currently in beta version. Until a stable version is officially released, the package (along with a set of simple instructions for users inexperienced with R) is available upon request to the authors.
Concluding Comments
Model performance measures such as sensitivity and specificity should be complemented with assessments of prediction mismatch, which does not necessarily indicate model failure and is useful for understanding species’ distributions, their equilibrium with the environment, or their potential for change. While omission and commission do not add any information to the widely used sensitivity and specificity, the proposed under‐ and over‐prediction rates analyse the problem from a different perspective, by assessing prediction mismatch over the potential rather than the observed occurrence area. They thus allow more complete assessments of model classification performance. In species distribution modelling, where sensitivity and specificity tend to be optimized, under‐prediction and over‐prediction can be particularly useful to assess misclassification rates without repeating information. The potential increments in presences and absences, in addition, measure the equilibrium between the observed and the potential area of occupancy – that is, between the model and the species’ distribution.
There is an increasing use of models for forecasting modifications in species’ distributions under climate and land‐use changes, especially to inform conservation planning for threatened species, as well as a growing interest in predicting and monitoring the spread of diseases and invasive species. We expect these measures to be particularly useful in such studies, as they assess how the potential distribution compares with the observed one and may thus provide clues on how a species’ range may be expected to expand or contract.
Acknowledgements
François Guilhaumon and Hedvig Nenzén provided useful R tips. A.M.B. received a postdoctoral fellowship (SFRH/BPD/40387/2007) from Fundação para a Ciência e a Tecnologia (FCT, Portugal), co‐financed by the European Social Fund, and a research visit grant from the New Zealand Institute of Mathematics and its Applications (NZIMA) and the University of Canterbury. Support was also received from Ministerio de Ciencia e Innovación (Spain), Junta de Andalucía, FEDER and QREN/INALENTEJO through projects CGL2008/01549/BOS, CGL2009‐11316/BOS, P09‐RNM‐5187 and ALENT‐07‐0224‐FEDER‐001755.
References
Biosketch
A. Márcia Barbosa is a postdoctoral fellow jointly hosted by CIBIO – University of Évora (Portugal) and Imperial College London (UK) – and was a visitor to the University of Canterbury (New Zealand) while preparing this article. Her research interests include biogeography, macroecology, distribution modelling, comparative phylogeography, biodiversity patterns and conservation. The team have a common interest in the analysis of biogeographical patterns and its applications to conservation and management.
Author contributions: A.M.B. and R.R. conceived the ideas. A.M.B. gathered and analysed the data, programmed the R functions and led the writing. A.R.M., J.A.B. and R.R. provided ideas for additional analyses and improved writing, interpretation and presentation.
Citing Literature
Number of times cited according to CrossRef: 32
- Rula Domínguez, Elsa Vázquez, Sarah A. Woodin, David S. Wethey, Laura G. Peteiro, Gonzalo Macho, Celia Olabarria, Sublethal responses of four commercially important bivalves to low salinity, Ecological Indicators, 10.1016/j.ecolind.2019.106031, 111, (106031), (2020).
- Nicole M. Herzog, Christopher Parker, Earl Keefe, Kristen Hawkes, Fire's impact on threat detection and risk perception among vervet monkeys: Implications for hominin evolution, Journal of Human Evolution, 10.1016/j.jhevol.2020.102836, 145, (102836), (2020).
- Darío Chamorro, Raimundo Real, Antonio-Román Muñoz, Fuzzy sets allow gaging the extent and rate of species range shift due to climate change, Scientific Reports, 10.1038/s41598-020-73509-y, 10, 1, (2020).
- Jennifer Bourque, Jean-Pierre Desforges, Milton Levin, Todd C. Atwood, Christian Sonne, Rune Dietz, Trine H. Jensen, Erin Curry, Melissa A. McKinney, Climate-associated drivers of plasma cytokines and contaminant concentrations in Beaufort Sea polar bears (Ursus maritimus), Science of The Total Environment, 10.1016/j.scitotenv.2020.140978, 745, (140978), (2020).
- Yohannes O. Kidane, Manuel Jonas Steinbauer, Carl Beierkuhnlein, Dead end for endemic plant species? A biodiversity hotspot under pressure, Global Ecology and Conservation, 10.1016/j.gecco.2019.e00670, (e00670), (2019).
- Ana Fagoaga, Hugues-Alexandre Blain, Rafael Marquina-Blasco, César Laplana, Neftalí Sillero, Cristo M. Hernández, Carolina Mallol, Bertila Galván, Francisco J. Ruiz-Sánchez, Improving the accuracy of small vertebrate-based palaeoclimatic reconstructions derived from the Mutual Ecogeographic Range. A case study using geographic information systems and UDA-ODA discrimination methodology, Quaternary Science Reviews, 10.1016/j.quascirev.2019.105969, 223, (105969), (2019).
- Gregorio Sánchez‐Montes, Ernesto Recuero, A. Márcia Barbosa, Íñigo Martínez‐Solano, Complementing the Pleistocene biogeography of European amphibians: Testimony from a southern Atlantic species, Journal of Biogeography, 10.1111/jbi.13515, 46, 3, (568-583), (2019).
- Matteo Pecchi, Maurizio Marchi, Vanessa Burton, Francesca Giannetti, Marco Moriondo, Iacopo Bernetti, Marco Bindi, Gherardo Chirici, Species distribution modelling to support forest management. A literature review, Ecological Modelling, 10.1016/j.ecolmodel.2019.108817, 411, (108817), (2019).
- Laure Gallien, Andrew H. Thornhill, Damaris Zurell, Joseph T. Miller, David M. Richardson, Global predictors of alien plant establishment success: combining niche and trait proxies, Proceedings of the Royal Society B: Biological Sciences, 10.1098/rspb.2018.2477, 286, 1897, (20182477), (2019).
- Juliane Friedrich, Per Arvelius, Erling Strandberg, Zita Polgar, Pamela Wiener, Marie J. Haskell, The interaction between behavioural traits and demographic and management factors in German Shepherd dogs, Applied Animal Behaviour Science, 10.1016/j.applanim.2018.12.004, (2018).
- Celine Bellard, Jonathan M. Jeschke, Boris Leroy, Georgina M. Mace, Insights from modeling studies on how climate change affects invasive alien species geography, Ecology and Evolution, 10.1002/ece3.4098, 8, 11, (5688-5700), (2018).
- Juan A. Sarquis, Maximiliano A. Cristaldi, Vanesa Arzamendia, Gisela Bellini, Alejandro R. Giraudo, Species distribution models and empirical test: Comparing predictions with well‐understood geographical distribution of Bothrops alternatus in Argentina, Ecology and Evolution, 10.1002/ece3.4517, 8, 21, (10497-10509), (2018).
- Antonio Pulido‐Pastor, Ana Luz Márquez, Enrique García‐Barros, Raimundo Real, Identification of potential source and sink areas for butterflies on the Iberian Peninsula, Insect Conservation and Diversity, 10.1111/icad.12297, 11, 5, (479-492), (2018).
- Boris Leroy, Robin Delsol, Bernard Hugueny, Christine N. Meynard, Chéïma Barhoumi, Morgane Barbet‐Massin, Céline Bellard, Without quality presence–absence data, discrimination metrics such as TSS can be misleading measures of model performance, Journal of Biogeography, 10.1111/jbi.13402, 45, 9, (1994-2002), (2018).
- Renata Ferrari, Hamish Malcolm, Joe Neilson, Vanessa Lucieer, Alan Jordan, Tim Ingleton, Will Figueira, Nicola Johnstone, Nicole Hill, Integrating distribution models and habitat classification maps into marine protected area planning, Estuarine, Coastal and Shelf Science, 10.1016/j.ecss.2018.06.015, 212, (40-50), (2018).
- Natacha Nikolic, Matthew Lauretta, Audrey Patucca, Gilles Morandeau, Characterization and standardization of the Atlantic albacore French pelagic trawl fishery, Aquatic Living Resources, 10.1051/alr/2018012, 31, (27), (2018).
- Paulo De Marco, Caroline Corrêa Nóbrega, Evaluating collinearity effects on species distribution models: An approach based on virtual species simulation, PLOS ONE, 10.1371/journal.pone.0202403, 13, 9, (e0202403), (2018).
- Rebeca de Jesús Crespo, Pablo Méndez Lázaro, Susan H. Yee, Linking Wetland Ecosystem Services to Vector-borne Disease: Dengue Fever in the San Juan Bay Estuary, Puerto Rico, Wetlands, 10.1007/s13157-017-0990-5, (2018).
- Jorge Gutiérrez-Rodríguez, A. Márcia Barbosa, Íñigo Martínez-Solano, Integrative inference of population history in the Ibero-Maghrebian endemic Pleurodeles waltl (Salamandridae), Molecular Phylogenetics and Evolution, 10.1016/j.ympev.2017.04.022, 112, (122-137), (2017).
- Darío Chamorro, Jesús Olivero, Raimundo Real, Antonio‐Román Muñoz, Environmental factors determining the establishment of the African Long‐legged Buzzard Buteo rufinus cirtensis in Western Europe, Ibis, 10.1111/ibi.12451, 159, 2, (331-342), (2017).
- Duccio Rocchini, Carol X Garzon-Lopez, Cartograms tool to represent spatial uncertainty in species distribution, Research Ideas and Outcomes, 10.3897/rio.3.e12029, 3, (e12029), (2017).
- Jorge Gutiérrez‐Rodríguez, A. Márcia Barbosa, Íñigo Martínez‐Solano, Present and past climatic effects on the current distribution and genetic diversity of the Iberian spadefoot toad (Pelobates cultripes): an integrative approach, Journal of Biogeography, 10.1111/jbi.12791, 44, 2, (245-258), (2016).
- Luís Reino, Mário Ferreira, Íñigo Martínez‐Solano, Pedro Segurado, Chi Xu, A. Márcia Barbosa, Favourable areas for co‐occurrence of parapatric species: niche conservatism and niche divergence in Iberian tree frogs and midwife toads, Journal of Biogeography, 10.1111/jbi.12850, 44, 1, (88-98), (2016).
- Jesús Olivero, John E. Fa, Raimundo Real, Miguel Ángel Farfán, Ana Luz Márquez, J. Mario Vargas, J. Paul Gonzalez, Andrew A. Cunningham, Robert Nasi, Mammalian biogeography and the Ebola virus in Africa, Mammal Review, 10.1111/mam.12074, 47, 1, (24-37), (2016).
- Sharad K. Baral, Gabriel Danyagri, Monique Girouard, François Hébert, Gaëtan Pelletier, Effects of suppression history on growth response and stem quality of extant northern hardwoods following partial harvests, Forest Ecology and Management, 10.1016/j.foreco.2016.04.023, 372, (236-246), (2016).
- Amy J. S. Davis, Kunwar K. Singh, Jean‐Claude Thill, Ross K. Meentemeyer, Accounting for residential propagule pressure improves prediction of urban plant invasion, Ecosphere, 10.1002/ecs2.1232, 7, 3, (2016).
- Raimundo Real, A. Márcia Barbosa, Joseph W. Bull, Species Distributions, Quantum Theory, and the Enhancement of Biodiversity Measures, Systematic Biology, 10.1093/sysbio/syw072, (syw072), (2016).
- Jesús Olivero, John E. Fa, Miguel A. Farfán, Jerome Lewis, Barry Hewlett, Thomas Breuer, Giuseppe M. Carpaneto, María Fernández, Francesco Germi, Shiho Hattori, Josephine Head, Mitsuo Ichikawa, Koichi Kitanaishi, Jessica Knights, Naoki Matsuura, Andrea Migliano, Barbara Nese, Andrew Noss, Dieudonné Ongbwa Ekoumou, Pascale Paulin, Raimundo Real, Mike Riddell, Edward G. J. Stevenson, Mikako Toda, J. Mario Vargas, Hirokazu Yasuoka, Robert Nasi, Distribution and Numbers of Pygmies in Central African Forests, PLOS ONE, 10.1371/journal.pone.0144499, 11, 1, (e0144499), (2016).
- David Romero, Jesús Olivero, José Carlos Brito, Raimundo Real, Comparison of approaches to combine species distribution models based on different sets of predictors, Ecography, 10.1111/ecog.01477, 39, 6, (561-571), (2015).
- M. Gies, M. Sondermann, D. Hering, C. K. Feld, A comparison of modelled and actual distributions of eleven benthic macroinvertebrate species in a Central European mountain catchment, Hydrobiologia, 10.1007/s10750-015-2280-7, 758, 1, (123-140), (2015).
- Maurizio Sarà, Spatial analysis of lanner falcon habitat preferences: Implications for agro-ecosystems management at landscape scale and raptor conservation, Biological Conservation, 10.1016/j.biocon.2014.08.004, 178, (173-184), (2014).
- Alberto Jiménez-Valverde, Threshold-dependence as a desirable attribute for discrimination assessment: implications for the evaluation of species distribution models, Biodiversity and Conservation, 10.1007/s10531-013-0606-1, 23, 2, (369-385), (2014).




