How to make more out of community data? A conceptual framework and its implementation as models and software
Abstract
Community ecology aims to understand what factors determine the assembly and dynamics of species assemblages at different spatiotemporal scales. To facilitate the integration between conceptual and statistical approaches in community ecology, we propose Hierarchical Modelling of Species Communities (HMSC) as a general, flexible framework for modern analysis of community data. While non‐manipulative data allow for only correlative and not causal inference, this framework facilitates the formulation of data‐driven hypotheses regarding the processes that structure communities. We model environmental filtering by variation and covariation in the responses of individual species to the characteristics of their environment, with potential contingencies on species traits and phylogenetic relationships. We capture biotic assembly rules by species‐to‐species association matrices, which may be estimated at multiple spatial or temporal scales. We operationalise the HMSC framework as a hierarchical Bayesian joint species distribution model, and implement it as R‐ and Matlab‐packages which enable computationally efficient analyses of large data sets. Armed with this tool, community ecologists can make sense of many types of data, including spatially explicit data and time‐series data. We illustrate the use of this framework through a series of diverse ecological examples.
Introduction
Ecology has been described as the scientific understanding of factors determining the abundance and distribution of species (Smith 1966; Begon et al. 1986). This understanding can hardly be achieved by studying species one by one since their abundances and distributions depend not only on their individual responses to the abiotic environment, but also on their interactions (Wisz et al. 2013). Thus, a key aim in modern community ecology is to gain an integrative understanding of how biotic and abiotic factors mould local species pools at different spatiotemporal scales.
Community ecology began as a descriptive science in which communities were classified based on the identities and sizes of local species pools (e.g. Clements 1936; Elton 1966). Modern community ecology is progressing from the description of patterns towards a mechanistic perspective, which seeks to understand the processes determining the identities and abundances of the species from local to global spatiotemporal scales (Agrawal et al. 2007; Logue et al. 2011). During the last few decades, experimental ecologist have used observations and experiments to assess the relative influences of stochasticity, competition and niche differentiation (see Logue et al. 2011), theoretical ecologists have developed models for predicting community dynamics (e.g. Tilman 1990, 2004; Holt et al. 1994; Bolker et al. 2003; Leibold et al. 2004; Holyoak et al. 2005), and statistical ecologists have developed metrics for assessing compositional changes among local communities (e.g. Gauch 1982; ter Braak & Prentice 1988; Legendre & Legendre 2012).
While a general theory to explain how communities are assembled across space and time is still lacking, community ecologists have converged towards a synthesis acknowledging that local species communities are a result of both stochastic and deterministic processes, henceforth called assembly processes (Gravel et al. 2006; Leibold & McPeek 2006; Stokes & Archer 2010; Weiher et al. 2011; Götzenberger et al. 2012). These forces encompass neutral processes, historical contingencies such as speciation, dispersal, abiotic environmental factors and biotic interactions (Vellend 2010; Weiher et al. 2011; Götzenberger et al. 2012).
As proposed by Zobel (1997) and illustrated in Fig. 1, the assembly processes can be envisaged as ‘filters’ operating at different scales. In this scheme, the ‘global species pool’ consists of all existing species, the ‘regional pool’ of all species able to colonise a given area, and the ‘local species pool’ of the set of species found at the finest scale considered (Cornell & Harrison 2014). Clearly, the species pools found at finer scales are filtered also by assembly processes acting at broader scales (Cornell & Lawton 1992). Of these hierarchical sieves, ‘environmental filters’ correspond to those abiotic factors which prevent the establishment or persistence of species in local communities (Kraft et al. 2015), and thus outline the fundamental niche of a species. ‘Biotic filters’ refer to interspecific and intraspecific competitive and facilitative interactions that determine the set of species in local communities (Wisz et al. 2013; Garnier et al. 2016), and thus determine their realised niches. These two types of forces may interact, as environmental filters may modify biotic interactions (e.g. Callaway & Walker 1997).

Beyond the deterministic processes selecting species from regional to local scales, stochastic processes create additional variation in the local communities. These processes – related to colonisation, extinction, ecological drift and environmental stochasticity – generate divergence among communities occupying identical environments (Chase & Myers 2011). The responses of the species to abiotic and biotic filters vary depending on species‐specific characteristics known as the response traits, including e.g. their dispersal propensity and competitive abilities (Lavorel & Garnier 2002) (Fig. 1). Thus, they will determine what species reach and colonise given areas, and what species succeed in securing adequate resources (McGill et al. 2006; Bolnick et al. 2011).
As a net outcome of the assembly processes outlined above, we find variation in the number, abundance, identities, and traits of the species present over a set of replicate communities observed in space and/or time (Fig. 2). While faced with a variety of data types, community ecologists have so far been armed with rather disparate statistical tools for connecting them with theories on community assembly. In particular, we lack a statistical frameworks that would enable us to infer actual assembly processes from community samples (Logue et al. 2011), and thus, a gap remains between theoretical predictions and data. Currently, the most popular tools used to study community structure are distance‐based ordinations (e.g. Gauch 1982; ter Braak & Prentice 1988; Legendre & Legendre 2012) and diversity measures (see Magurran 2004). While such approaches provide insights into patterns of diversity and community composition at different spatiotemporal scales (Legendre et al. 2005; Dray et al. 2012; Legendre & Gauthier 2014), they provide little quantitative insight into the relative contributions of different assembly processes. To overcome these limitations, community ecologists are showing increasing interest in model‐based approaches (Warton et al. 2015a,b).

Single‐species distribution models have been widely used to explain and predict how different taxa respond to environmental variation (Guisan & Thuiller 2005; Elith & Leathwick 2009). To capitalise on this success, there is a growing interest in extending species distribution models to community‐level models (Guisan & Rahbek 2011). One way of predicting community‐level properties is simply to add predictions of single‐species models into ‘stacked’ species distribution models (Guisan & Rahbek 2011; Calabrese et al. 2014). Another way is the use of joint species distribution models, which explicitly acknowledge the multivariate nature of species assemblages, allowing one to gather more mechanistic and predictive insights into assembly processes (Warton et al. 2015b). By simultaneously drawing on the information from multiple species, these models allow one to seek community‐level patterns in how species respond to their environment (e.g. Ovaskainen & Soininen 2011; Wang et al. 2012; Hui et al. 2015; Ovaskainen et al. 2016b), to relate such patterns to species traits and phylogenies (Pollock et al. 2012; Brown et al. 2014; Abrego et al. 2017a) and to quantify co‐occurrence patterns among species (Pollock et al. 2014; Ovaskainen et al. 2016a). Recent method development has made it possible to apply joint species distribution models e.g. to presence–absence data or abundance data (Hui et al. 2015; Clark et al. 2017), as well as to study designs of spatial (Thorson et al. 2015; Ovaskainen et al. 2016b), temporal (Sebastián‐González et al. 2010) or spatio‐temporal (Thorson et al. 2016) nature.
Owing to the ongoing revolution in sequencing technology, the development of statistical methodology for macro‐organisms (reviewed above) has been paralleled by the rapid development of statistical methodology for microbial community ecology. Here, much of the focus has been on inferring association networks from sequence count data (Steele et al. 2011; Weiss et al. 2016), and on asking how such association networks relate to environmental conditions or ecosystem processes (e.g. Guidi et al. 2016). An important challenge with the analysis of such data is that the information is in relative rather than in absolute sequence counts. If not properly accounted for, this feature will lead to spurious correlations (Friedman & Alm 2012). As is the case with macro‐organisms, inferring association‐networks from sparse data on species rich communities is one of the main methodological challenges in current microbial community ecology (Weiss et al. 2016).
The aim of this paper is to facilitate the integration of theoretical and empirical approaches in community ecology, and of methods previously developed separately for micro‐ and macro‐organisms. We suggest that recent advances in joint species distribution models can be transformed into a general framework for modern statistical analysis of community data. As the framework is based on a hierarchical joint species distribution approach, we call it Hierarchical Modelling of Species Communities (HMSC). HMSC integrates much of the recent methodological progress on joint species distribution models, and provides a unified platform which we hope will facilitate further amalgamation. To help community ecologists select a specific model which fits the nature of their data and the aims of their study, we offer a practical guide: we first describe the typical data types collected by empirical community ecologists, subsequently introduce the statistical HMSC framework, and then show with examples how the framework can be used to address topical questions in community ecology. Finally, we discuss some limitations of the framework, as well as challenges to be targeted by future research.
A statistical framework for hierarchical modelling of species communities (HMSC)
Typical community data include observations on the occurrence of species in a set of temporal and/or spatial replicates, henceforth called occurrence data and referred to as the
matrix (Fig. 3). Depending on the study/experimental design, on our objectives and the subject organisms, the occurrence of the species can be recorded in various ways, such as through direct visual or audial encounters, indirect cues such as tracks or droppings, or molecular identification of environmental samples. The occurrence matrix may thus describe e.g. presences and absences of the species, species counts, a percentage covered by each species or an estimate of its biomass. The amount and nature of observation error also depends on the method used. Common problems of species surveys are imperfect detection, creating false negatives, and misidentification, creating false positives (Guillera‐Arroita 2017).

The occurrence data are usually accompanied by environmental data consisting of a set of measured covariates that the ecologist hypothesised to be important in explaining community composition (
matrix, Fig. 3). Beyond the effects of these environmental covariates, the spatiotemporal context may generate a structure to the data. In studies where the data have been collected in a hierarchical way (e.g. plots within sites), we call the finest scale (i.e., each row of the data matrices
and
) the ‘sampling unit’. In studies treating space and/or time as continuous, the study design may be described by spatial or temporal coordinates.
To relate community‐level responses to environmental variation to response traits, one may wish to include data on species‐specific traits (
matrix, Fig. 3). These data may range from morphological traits such as body size, or physiological traits such as tolerance to salinity, to functional traits such as feeding type, or to the actual position of the species within the surrounding food web. Furthermore, we may combine trait data with phylogenetic data (C matrix, Fig. 3). The availability of phylogenetic data is rapidly increasing, allowing the construction of quantitative matrices of phylogenetic correlations within many organisms groups. Where quantitative phylogenies are lacking, data on taxonomic identity (at the level of genus, family, order, class, phylum…) can be used as a proxy of phylogenetic relatedness (but see Whitfeld et al. 2012).
) in each sampling unit (denoted as
, where
), i.e. the data summarised by matrix
in Fig. 3. For this, we use a generalised linear model,
(1)
Here,
is a statistical distribution tailored to the kind of data being used,
is the linear predictor, and
is a variance term, which is excluded for some distributions (e.g. probit or Poisson) and included for others (e.g. normal or over‐dispersed Poisson).
is modelled with the help of fixed (F) and random (R) parts as
. The fixed effects are modelled as the regression
(2)
denotes the environmental covariate
measured at site
(i.e., the information summarised in matrix
, Fig. 3; with
modelling the intercept), and the regression parameter
denotes the response of species j to covariate
. As the regression parameters measure how the occurrence of the species depends on the environmental conditions, we interpret these as describing species' environmental niches. To allow the statistical framework to generate a community‐level synthesis of how species respond to their environment, we assume that their responses to the environment (i.e. their regression parameters) adhere to a multivariate normal distribution,
(3)We use a dot to single out a row or a column in a matrix, so that βj. denotes the vector of regression coefficients for species j. As βj. describes how species
responds to environmental covariates, it characterises its environmental niche. The expected environmental niche of species j is denoted by vector μj., and variation around this expectation is captured by the variance‐covariance matrix
(Ovaskainen & Soininen 2011). The expected niche μj. can either be assumed to be the same for all species, or alternatively it can model the influence of species‐specific traits on species' responses. In the latter case, we write
, where
is the value of trait
for species
(matrix
, Fig. 3; with
modelling the intercept) and the parameter
measures the effect of trait l on response to covariate
(Abrego et al. 2017a). We may use this model also to ask what percentage of variation in species' environmental niches can be attributed to species' traits (see Supporting Information for details).
, Fig. 3), we model the covariance structure of the multivariate normal distribution as
(4)
the residual variance is independent among the species (as described by the identity matrix
), implying that closely related species do not have more similar environmental niches than do distantly related ones. When
approaches
, species' environmental niches are fully structured by their phylogeny, with related species having more similar niches than expected by random, implying niche conservatism.
Let us then turn to the random term
, which models the variation in species occurrences and co‐occurrences that cannot be attributed to the responses of the species to the measured covariates. If the study design consists of sampling units without any hierarchical, spatial or temporal structure,
will simply be
, where the superscript
refers to a random effect
that operates at the level of the sampling unit. These random effects are modelled as
where
is a residual species‐to‐species variance‐covariance matrix. Here, the word ‘residual’ refers to the fact that we have removed the influences of environmental covariates by the fixed effect part of the model. The diagonal element
describes the amount of random variation that species
shows at the level of the sampling unit, whereas the off‐diagonal element
describes the amount of covariation among the two species
and
. For hierarchical study designs, the model may include several association matrices
(see Ovaskainen et al. 2016a), each of which may have a spatial or temporal structure.
With
species, an association matrix
has
parameters, making its parameterisation with sparse data on species‐rich communities challenging. To facilitate the estimation of such matrices, we use a latent variable approach, which allows a parameter‐sparse representation of the matrix
through latent factors and their loadings (for mathematical details see Warton et al. 2015b; Ovaskainen et al. 2016a,b). The factor loadings themselves do not have a straightforward interpretation in terms of ecological interactions, but are useful in revealing patterns where two species either co‐occur more often than expected by chance: if the factor loadings have the same sign, the species respond to the latent variable in the ‘same’ way and thus increase in concert, whereas if the factor loadings have opposite signs, the species respond in ‘different’ ways, and thus one species declines when the other increases. We represent the species‐to‐species association network by the correlation matrix
defined by
. The correlation
measures to what extent species
and
are found together more or less often than expected by chance, after controlling for the environmental covariates. We note that one could alternatively measure species‐to‐species associations based on the precision matrix (inverse of
), which is more likely to identify direct links among species than the correlation matrix, as the latter one is also influenced by indirect links (Biswas et al. 2016; Ovaskainen et al. 2016a). As a further alternative, we note that instead of the latent variable approach, the correlation matrix could be parameterised through a mixture modelling approach (Pledger & Arnold 2014).
Let us note at this point that the much‐used multivariate ordination approaches (Legendre & Legendre 2012) are also based on patterns of species co‐occurrences. If we exclude the environmental covariates
from the analysis, then the latent variables behind an association matrix can be viewed as a model‐based ordination (Warton et al. 2015b). Species close to each other in the ordination space show positive co‐occurrence in the association matrix, whereas species at the opposite ends of the ordination space show negative co‐occurrence. If environmental covariates are included in the analysis, then an association matrix corresponds to a residual ordination, which describes those co‐occurrences that cannot be explained by shared responses to environmental covariates (Hui et al. 2015; Warton et al. 2015b).
and
, then the covariance between species occurrences (at the level of the linear predictor) is given by
(5)
denotes the mean value of the off‐diagonal elements of the matrix
(see Supporting Information for the derivation of eqn 5). In eqn 5, the terms
and (1‐ρm(C))V partition interspecific variation in responses to environmental covariates to components that can and cannot be explained by the measured traits, respectively.
Let us finally note that, as with any statistical model, it is important to validate the above‐described model in terms of its structural assumptions as well as the generality of the parameter estimates. In particular, to exclude the possibility of spurious inference due to model overfitting, we recommend examining the model's predictive power through a cross‐validation approach, as we will illustrate below in some of the examples.
In the Supporting Information, we provide a Bayesian approach for parameterising the model in a computationally efficient manner which allows the analyses of large data sets. We also provide an implementation of the statistical framework as R‐ and Matlab‐packages, which include a user manual.
Applying HMSC to topical questions in community ecology
The key value of the modelling framework laid out above (Fig. 4) is in how its different components relate to processes of community assembly (Figs 1 and 2). To illustrate how they can be extracted from real data sets, and how they can be interpreted in relation to key questions in community ecology, we next turn to three contrasting case studies. These studies were specifically chosen to bring out the different strengths and uses of HMSC. The design of the first study is spatially hierarchical, the second study is spatially explicit, and the third study involves time‐series data. The first case study also illustrates the utility of HMSC in addressing applied questions. To illustrate the comprehensiveness of the HSMC approach, we frame our treatise as a series of topical questions in community ecology which relate to the general theory outlined in the Introduction as well as a series of applied questions. These questions are summarised in Table 1, which also describes how HMSC can been applied to derive an answer. We note that this list is not exhaustive, and that the line between fundamental and applied questions in Table 1 is somewhat blurred. Furthermore, we remind the reader that the statistical framework of HMSC is correlative, and that the possibility of confounding effects should always be considered. Most importantly, the framework assumes (1) that environmental filtering is properly captured by the environmental covariates included in the model, (2) that biotic filtering is properly captured by residual species‐to‐species correlation matrices, or – in the case of time‐series data – by matrices modelling the influence of each species in the previous year on the occurrence of each other species in the following year, (3) that any residual variation (i.e., variation not predicted by environmental or biotic filtering) can be attributed to random processes, such as dispersal limitation, environmental stochasticity or ecological drift. In the Discussion, we return to these assumptions and limitations of the modelling framework, and describe future challenges for its further development.
| Question | How to address the question statistically? | Illustration | |
|---|---|---|---|
| FQ1 | How much variation in species occurrence is due to environmental filtering, biotic interactions and random processes, and how do these impacts vary across spatial and temporal scales? | By assessing the explanatory power of models and by variance partitioning among fixed and random effects at different scales (see Supporting Information for details) | All case studies |
| FQ2 | How do species' traits and phylogenetic relationships correlate with ecological niches? | By modelling responses to environmental covariates ( ) as a function of species' traits ( ) and phylogenetic correlations ( )
|
Bryophyte, butterfly and fungal case studies |
| FQ3 | What are the structures of species interaction networks? | By estimating the species‐to‐species association matrices or ![]() |
All case studies |
| FQ4 | How does community similarity depend on environmental similarity and/or geographic distance? | By decomposing community similarity into similarity due to responses to environmental covariates and/or spatial covariance, eqn 5 | Bryophyte and butterfly case studies |
| FQ5 | How does community structure change over time due to predictable succession or stochastic ecological drift? | By including time since environmental perturbation as a predictor, or by including temporally varying random effects | Bryophyte and water bird case studies |
| AQ1 | Do some species indicate the presence of others? | By testing how much the predictive power of the model increases for a focal species when accounting for the occurrences of other species | Butterfly and fungal case studies |
| AQ2 | How can geographic areas be classified into communities of common profile? | By clustering predicted communities based on their similarity | Butterfly case study |
| AQ3 | Which processes have been central in determining the response of a community to environmental change | By decomposing the response to environmental change to components related to species niches and random effects | Bryophyte case study |
| AQ4 | How can species be classified in terms of their response to abiotic environment? | By clustering parameters or predictions measuring the species responses to environmental covariates | Bryophyte case study |
| AQ5 | How is community structure predicted to change under various scenarios of e.g. environmental change | By use of scenario simulations | Bryophyte case study |
- All questions relate to variation in the number, abundance, identities and traits of the species present over a set of replicate communities observed in space and/or time (Fig. 2). The questions have been grouped into fundamental questions in basic science (FQ1–5) and questions of applied interest (AQ1–5), though we note that these two categories overlap. See text for further details. The fungal case study is presented in Supporting Information.
While all our examples are based on published studies, the novelty of this section is in illustrating how a wide range of questions and data types can be analysed with the help of the encompassing statistical framework of HMSC. More information on all case studies, including on their ecological context, can be found in the primary publications referred to below.
A spatially hierarchical study design: how do epiphytic bryophytes respond to forest management?
As community ecologists, a fundamental question to ask is what processes create observed variation in community structure (Fig. 2): what proportion of variance in species occurrence can be attributed to environmental filtering, biotic filtering and random processes, respectively (Table 1). If replicate samples are available in space or time, we may further ask how the relative importance of different assembly processes change over spatial or temporal scales. In general, biotic interactions are likely to be more important at the finest scales, where species physically meet, whereas environmental covariates (such as macroclimate and soil types) are likely to be more important at broader scales (Araújo & Rozenfeld 2014).
In a study of epiphytic bryophytes, the species were surveyed on aspen trees on a set of forest sites (Fig. 5a, for original study see Oldén et al. 2014). The fitted model explained 10% of species occurrences at the tree level (measured by Tjur's
(Tjur 2009), averaged over species) and 60% at the site level (measured as correlation in abundance, averaged over species). Thus, while it is difficult to predict which species are present or absent on a particular tree, more predictable patterns emerge at the level of the forest site. With the intent of identifying the strength of different processes shaping the local communities, we partitioned the explained variation into components attributable to the environmental covariates vs. variation assigned to random effects operating at different spatial scales (Fig. 5b; see Supporting Information for how this was technically done). Forest management type and age had a major influence on epiphytic bryophyte composition, as on average almost half of the variation was attributed to these forest site‐level covariates (Fig. 5b). These environmental aspects proved to be the main agents in structuring epiphytic bryophyte assemblages at the level of forests, since once the effects of these covariates were accounted for, the species no longer showed strong associative patterns (Fig. 5c; forest site level). In contrast, the covariate measured at the tree level (tree diameter), explained only about half of the variation (Fig. 5b; fixed vs. random effects at the tree level), and thus much of the community variation was captured by the random effects (Fig. 5c; tree level).

Species vary in their morphology, life‐history, behaviour and other traits. Relating this variation to community‐level responses is a key challenge in community ecology (Weiher et al. 2011) (Table 1, Fig. 2). Thus, we may wish to address the fourth corner problem (Legendre et al. 1997), i.e. to ask to what extent and in which way some traits influence the responses of the community to environmental variation. In such an analysis, each species is essentially one data point. Due to phylogenetic relationships, related species are expected to be similar both concerning their traits and occurrence patterns, making the data points not independent (Harvey & Pagel 1991) (Fig. 2). Thus, to ask about the adaptive significance of traits, it is necessary to account for phylogenetic constraints. Accounting for phylogenetic signal can also yield further evolutionary insights, as it allows one to examine niche evolution by asking whether related species have more similar niches than expected by chance (Warren et al. 2008).
To assess how the traits of the bryophytes influenced their responses to the environmental covariates, we classified species by their life‐form, a trait which has been previously suggested to be of functional importance (Bates 1998). However, this trait explained only 9% of the variation among species' environmental niches. The analysis yielded no strong support for the hypothesis of niche conservatism, with the posterior mean of the phylogenetic correlation parameter ρ being 0.26 (with a 95% credible interval from 0 to 0.77). These results suggest that in this particular study, the niches of bryophytes are to a limited extent related to their life‐form, but mainly structured by some other traits that are not strongly correlated with their phylogeny, pointing out a knowledge gap in their functional ecology.
To determine how biotic filters structure communities, the first step is to know which species interact with each other and how, i.e. to determine the structure of the network of interspecific interactions (Morales‐Castilla et al. 2015) (Table 1). Such an interaction network can be used, e.g. to examine whether the links among species within the community are organised in modular or nested ways (Fortuna et al. 2010). While non‐manipulative data on species occurrence do not allow conclusive inference on ecological interactions, the HMSC framework makes it possible to infer residual species‐to‐species associations, which can be considered as hypotheses of such interactions. To examine how epiphytic bryophytes respond to each other, we included community‐level random effects at the two hierarchical spatial scales included in our study (tree and site), and consequently estimated the species‐to‐species association networks at each of these scales (Fig. 5c). The resultant networks showed clear structure especially at the tree scale, where the species sorted essentially into three groups. The first group of species (species 11–31 in Fig. 5c) tend to occur together but not with species from the second group (species 32–45); the second group of species tend to occur together but not with species from the first group, and the third group of species (the remaining species) occur essentially independent of the other species. The estimated association networks differed between the two spatial scales, suggesting scale‐dependent assembly mechanisms.
For applied ecologists dealing with the management or conservation of ecological communities, there is a need to identify the processes behind community responses to environmental change (Table 1). For example, a delayed response to habitat loss may indicate an extinction debt, whereas species lacking from areas with a suitable climate may indicate dispersal limitation and thus a colonisation credit (Hanski 2000). Another primary goal of applied community ecology is to classify species in relation to their vulnerability to environmental changes such as climate warming, habitat fragmentation and pollution – as that allows one to identify species requiring special attention in conservation management (e.g. Pacifici et al. 2015). In this case study, we were interested in testing the potential of Green‐Tree Retention cutting (GTR) for conserving epiphytic bryophytes. GTR is a modification of traditional clear‐cutting, aimed at mitigating effects on biodiversity (Rosenvald & Lõhmus 2008). To evaluate its success, we fitted the statistical framework to data on epiphytic bryophytes growing on aspens found in natural forests of varying stand age, as well as on retention aspens in logged forest with variation in time since logging. The model‐based approach then allowed us to predict the influence of GTR through scenario simulation (Fig. 5d). After logging, species richness is predicted to drop temporarily, but then to quickly recover to the level of uncut forests. However, community composition is unlikely to return to the state typical of old‐growth forest, but rather to follow a different trajectory (Fig. 5d). Thus, while retention aspens in old loggings may host as many species as aspen trees in natural forests, the identities of these species are likely to differ.
Based on the predicted responses of the bryophyte species to forest type and age, we may identify species groups of relevance to forest management and restoration (Fig. 5e). Some species predominantly occur in natural forests and thus benefit only little from retention trees, whereas others are primarily found in logged forests, and can thus be considered as disturbance specialists of little concern for conservation. In between, there are two groups of species that benefit from retention trees: Some species (called recolonising species in Fig. 5e) appear on retention aspens only once forest has re‐grown around the retention tree. Other species (called life‐boated species in Fig. 5e) can persist on the retention trees immediately after logging.
A spatially explicit study design: classifying Great Britain based on butterfly communities
When comparing local communities, we may be interested in how community similarity depends on the similarity of environmental conditions, and on the distance between the communities (Soininen et al. 2007). In addition to repeating the questions already addressed with the bryophyte data, we next illustrate questions on community similarity with data from a butterfly survey conducted in Great Britain. From the larger survey, we utilise presence–absence data for 55 species acquired at a 10 × 10 km resolution (Fig. 6a, for original study see Ovaskainen et al. 2016b).

In a spatial study, the species occurrences can be expected to be spatially autocorrelated, with the similarity of observations decreasing with distance. To account for this possibility, we assumed for the butterfly data that the underlying latent factors have a spatial covariance structure, and consequently that also the species‐to‐species association matrix
depends on the distance
between the sampling units (see Supporting Information for details). Thus, we assumed that species share the same set of spatially structured latent variables, but differ in their factor loadings, i.e. in their responses to the latent variables. The latent factors may represent hidden environmental covariates, and are thus also interesting per se (see Ovaskainen et al. 2016b for illustrations of such maps).
A partitioning of variance among fixed effects (Fig. 6c) revealed that on average, environmental filters related to climatic conditions were somewhat more important than those related to habitat characteristics in explaining species occurrences, but that species differed substantially in their individual responses. More than half of the variation explained was attributed to the spatially structured random effects, suggesting that location and distance have strong effects on patterns of species occurrence and co‐occurrence. To examine how species' traits influence their responses to environmental factors and thus their distributions within Britain, we classified species into those that are habitat specialists vs. those that are not, and to those that are migrants vs. those that are not. These categorical traits explained 19% of variation in species' environmental niches. As with the bryophyte study, there was no strong evidence for niche conservatism (posterior mean of
, 95% credible interval 0.00–0.78). Association patterns among species were predominantly positive (Fig. 6d). This implies more variation in species richness than one would expect from independent occurrences, as species are either simultaneously present or absent from sampling units. Considering the general ecology of butterflies, where strong competition is unlikely but habitat specialisation frequent, these patterns are more likely due to shared responses to missing environmental covariates (or variation in sampling effort) than to true ecological interactions among them.
For applied purposes, we may wish to classify entire communities, to e.g. produce maps of distinct vegetation types, or more generally to define regions of common profile in terms of their community structure (Foster et al. 2013). Such regions can then be considered as management units, or be used in reserve selection (Margules & Pressey 2000). For the butterfly survey, regions characterised by similar communities (Foster et al. 2013) are shown in Fig. 6e, suggesting that British butterfly communities are primarily structured along the North–South gradient. To address how community similarity depends on the similarity of environmental conditions, and on the distance between the communities (Soininen et al. 2007), we may decompose the distance decay in community similarity into components that can and cannot be explained by similarity in environmental covariates (Fig. 6e). Importantly, the decay in similarity with distance gets steeper when we account for not only spatial distance but also for decreasing similarity in environmental conditions, again supporting the role of the environment in structuring these communities across space.
As applied ecologists, we may also wish to predict the community composition under different environmental conditions (Table 1), which may be either within the training data (interpolation) or outside it (extrapolation). For example, a scientist working with species distribution maps may wish to predict the ranges of all species based on the data points available (Guisan et al. 2013). The inclusion of spatially structured latent factors greatly improved the model's ability to predict butterfly communities at the validation sites (prediction for spatial model vs. prediction for non‐spatial model in Fig. 6b), and thus of interpolating species distribution maps from sparse observations. This is because with spatial latent factors, the predictions are based both on measured environmental covariates and on interpolated species occurrences. Furthermore, as the spatial latent factors include information about species co‐occurrence, the improved predictions not only account for the occurrences of the focal species in the nearby training sites, but also for the occurrences of all the other species at those sites.
In conservation biology, we frequently use indicator, umbrella and keystone species to indicate the presence of other species or of specific environmental conditions (Caro 2010). To identify such marker species, we should ask how well their occurrences predict the occurrences of target species or conditions. As shown in Fig. 6b (spatial vs. non‐spatial prediction) and discussed above, the inclusion of other species in the model greatly increases the model's predictive power. As we may use any subset of the species as predictors and any other subset as response variables, the modelling framework permits the systematic identification of indicator species, i.e. species which are relatively common and easy to survey, but whose occurrence correlates with the occurrence of species of particular interest (Caro 2010).
A temporally explicit study design: heterospecific attraction in water birds
When replicate samples are available in time, we may also wish to address if and how local communities change, either through predictable successional pathways such as priority effects, or through stochastic ecological drift – both of which may either create new niches or result in extinctions (Dickie et al. 2012) (Table 1). Our third case study consists of a time‐series containing data on birds' presence vs. absence across a set of 215 water ponds (Fig. 7a, for original study see Sebastián‐González et al. 2010). Temporally replicated data provide an additional dimension for quantifying the extent to which communities are structured by biotic interactions, as we may use the occurrences of the species in the previous year as predictors of their occurrences in the following year. Such predictors can simply be included in the environmental covariate matrix
(Fig. 3), in which case their influences are estimated by the regression parameters
. However, to separate species' responses to other species from species' responses to the shared environment, we denote by
the influence of occurrence of species
in the previous year on the occurrence of species
in the present year. In this study, variation in species occurrence is explained almost equally much by biotic filters, i.e. by the occurrences of the species in the previous year, as by environmental filters, i.e., by the environmental covariates measured (Fig. 7b). Among the random effects, the most important is the one associated with the level of the sampling unit (a pond‐year combination), representing heterospecific (i.e. between species) aggregation of the birds within the ponds.

To evaluate species associations over time, we examine the temporal association matrix
. As illustrated in Fig. 7c, this matrix is not necessarily symmetric, unlike the association networks measured by the covariance matrices
. The diagonal elements of the temporal association matrix
are positive, signalling that same species tend to return to the same ponds year after year. The estimated matrices also show strong support for the heterospecific attraction hypothesis (see review by Mönkkönen & Forsman 2002) both within and among years. Within a sampling unit, all species pairs are positively associated. Moreover, for many species pairs the occurrence of a species in one year increases the occurrence probability of another species in the next. For example, the Common shelduck is particularly likely to be found at the ponds where the Black‐winged stilt was breeding in the previous year (Fig. 7c). Associations among species both between years and within a year may be due to unmeasured aspects of habitat quality or to the benefits that water birds may obtain from settling in sites already occupied by heterospecifics.
Future directions and challenges
As we have illustrated by the above three examples and a further example on wood‐decaying fungi (see Supporting Information), Hierarchical Modelling of Species Communities (HMSC) offers a coherent framework for addressing a wide range of questions in community ecology, with its key advantages summarised in Box 1. However, as we discuss in this section, the core framework outlined here should be considered just a starting point, as it comes with important caveats and incitements for future work.
Box 1. Ten reasons for applying Hierarchical Modelling of Species Communities (HMSC)
- HMSC is a unifying framework which encompasses classic approaches such as single‐species distribution models and model‐based ordinations as special cases.
- HMSC provides simultaneous inferences at the species and community levels.
- HMSC offers the general advantages of model‐based approaches, such as tools for model validation and prediction.
- HMSC overcomes previous problems of modelling communities with sparse data.
- HMSC overcomes the long‐standing challenge in species distribution modelling of how to account for species interactions in explaining and predicting species occurrences.
- HMSC allows one to partition observed variation in species occurrences into components related to environmental variation measured vs. random processes (or unmeasured variation) at different spatial scales – both at the species and community levels.
- HMSC tackles the fourth corner problem (the influence of species traits to their occurrences, see Legendre et al. 1997) in a way which accounts for the phylogenetic signal in the data.
- HMSC can be applied to many kinds of study designs (including hierarchical, temporal or spatial) and many types of data (such as presence–absence, counts and continuous measurements).
- HMSC can generate predictions at the species, community or trait levels, while propagating uncertainty in parameter values to the level of the prediction.
- HMSC is computationally efficient, being able to analyse small data sets (with a few hundreds of sampling units and a few tens of species) in seconds, and large data sets (with some tens of thousands of sampling units and a few thousands of species) in few days.
While conceptually our modelling framework focuses on the mechanisms behind community assembly (Figs 1-4), it is technically a hierarchical generalised linear mixed model. Hence, the results are of correlative nature, not directly implying causation. For example, we emphasise that species association networks should not be interpreted as proven ecological interactions, but that ecological interactions may have a major influence on the associations. As another example, niche or interaction‐based processes do not necessarily always cause predictable community patterns (Munoz & Huneman 2016). These and other confounding issues are not a limitation of the statistical model, but of the nature of the data as such. For instance, data on species occurrence do not allow one to establish conclusively whether co‐occurrences are due to ecological interactions or to other confounding factors, such as missing environmental covariates or sampling biases (Kissling et al. 2012). Only experimental data can bring conclusive evidence on this issue. Having said that, our modelling approach offers a particularly efficient avenue for deriving data‐driven hypotheses on ecological interactions, as it decomposes the observed co‐occurrences into those that can be explained by the environmental covariates explicitly measured vs. those that cannot.
While the framework presented here already allows one to address some of the most fundamental questions in community ecology (Table 1), it should clearly be seen as a starting point for building the next generation of joint species distribution models. As we have based HMSC on hierarchical generalised linear mixed models, adding additional layers is both conceptually and technically straightforward. Below, we discuss four perspectives that we consider especially fruitful in this context.
The first perspective relates to the need to explicitly account for the observation process when modelling the data, thus allowing one to separate the processes of interest from biases introduced by the observer (Guillera‐Arroita 2017; Beissinger et al. 2016; Warton et al. 2016). As the simplest solution, one may include covariates reflecting observational error (such as variation in sampling effort) in the
matrix. However, for many kinds of data there is a need to incorporate a more elaborate observation model, as is commonly done in e.g. single‐species occupancy modelling (Guillera‐Arroita 2017). Within the HMSC framework, the observed data
could be modelled as a function of the underlying true occurrences
as e.g.
where the statistical distribution
describes the observation process. For example, much data on species distributions consist of presences only. Within the HMSC framework, such data could be analysed e.g. by linking the linear predictor to a spatially varying intensity of an underlying point process reflecting the sampling effort (Renner et al. 2015). As another example, sequence data that are commonly utilised in microbial community ecology are constrained by the total sequence count, which can create spurious correlations if not accounted for (Friedman & Alm 2012). Within the HMSC framework, such data could be modelled e.g. through multinomial logistic regression. With such an extension, HMSC would help merging the statistical methods developed thus far partly separately for micro‐organisms and macro‐organisms (see Introduction).
The second perspective relates to the need for more versatile treatments of association networks. One of the key strengths of the HMSC approach is that it allows us to estimate species association networks at different spatial or temporal scales, and to utilise the inferred associations in predictions and simulated scenarios. In the examples above, we have estimated the association matrices solely from the occurrence data. However, community ecologists will oftentimes hold interaction matrices constructed from other types of data, arising e.g. from experiments (Wootton & Emmerson 2005) or information on species traits which might influence species interactions (e.g. Schöb et al. 2013). In the present formulation of the model, we have assumed that species traits
and phylogenetic correlations
influence the species responses to the abiotic environment, but ignored the link from these matrices to co‐occurrence. Finding the best statistical approaches to utilising such data in the HMSC framework remains a challenge. One possible solution would be to model the factor loadings underlying species co‐occurrences as functions of the
and the
matrices in the same way that we have done for species niches described by the
parameters. Linking species co‐occurrences to traits, phylogenies, and their responses to environmental variation would provide new tools for addressing several intriguing questions, e.g. whether species with few strong interactions are more vulnerable to disturbances than species with many weak interactions (Fortuna & Bascompte 2006; Aizen et al. 2012; Abrego et al. 2017b), or whether the extent to which prey species within communities are coupled by predators translate into correlated abundances in space or time (Morris et al. 2004; Tack et al. 2011; Kaartinen & Roslin 2013).
We also note that the estimated association networks are conditional on species occurrence: even if two species show a strong positive co‐occurrence, the association is not realised in areas where neither of the species occurs. Less trivially, we have assumed the structure of association networks to be constant across all areas where they are realised. However, the type and strength of ecological interactions may be context‐dependent (e.g. Poisot et al. 2015). For example, the stress‐gradient hypothesis predicts that positive interactions are accentuated under stressful abiotic environmental conditions (Callaway & Walker 1997). In the context of HMSC, context dependence of the association matrices can be incorporated by modelling the underlying latent variable structure as a function of environmental covariates (Tikhonov et al. 2017), or more generally as functions of space or time.
The third perspective relates to the need of utilising the full potential of community‐level time‐series data. With our case study on water birds (Fig. 7), we illustrated time‐series modelling in the context of presence–absence data – but note that abundance data are likely to yield much stronger signals on heterospecific interactions. With abundance data, one option for seeking for the effect of such interactions is to use Gompertz–type models, which account for the intraspecific interactions through density dependence (Mutshinda et al. 2009). One technical challenge when dealing with species rich communities is that interaction matrices between years have very high dimensionality. In close analogy with our approach to high‐dimensional association matrices
, this problem could be solved by a latent variable approach. Here, the latent factors would correspond to community level summaries of species abundances, whereas factor loadings would model how the dynamics of individual species depend on those summaries.
The fourth and final perspective is a merger of genetic and evolutionary perspectives on community assembly. While the relevance of eco‐evolutionary feedbacks is increasingly recognised in studies of single species, the quantification of such feedbacks in studies targeting the community level is still scarce (Johnson & Stinchcombe 2007). In the HMSC framework presented above, we have only touched upon evolutionary aspects by asking whether species' niches show a phylogenetic signal. An important challenge is to adopt a more micro‐evolutionary perspective, e.g. by asking if and how the amount and type of genetic variation influences variation in species occurrence, either among species, or in space or through time. A related challenge is bringing the analyses to the individual level rather than operating only at the species level – as, for example, traits are often measured at the individual level (McGill et al. 2006). While the HMSC framework will in principle allow for this through its hierarchical structure, developing a general and computationally feasible approach that builds up communities from the individual level is not straightforward.
Having identified a need for integrating individual‐ and genetic‐level perspectives with community ecology, we end by highlighting the status quo: for merging perspectives from genes, individuals and communities, the largest challenge is currently in the lack of appropriate data. But for the kind of community data illustrated in Fig. 3, we believe that the opposite is true: here we have data in ample supply, whereas progress has been hampered by a lack of integrative modelling frameworks to synthesise these data. We hope that the HMSC approach provides a partial solution to this problem, and that it inspires future research to overcome some of the caveats listed above.
Acknowledgements
We thank Anna Oldén for permit to use of the bryophyte data as well providing the bryophyte photographs (Fig. 5e), Ester Sebastián‐González for permit to use the water bird data, Albert Phillimore for providing the butterfly phylogeny, and Andreas Trepte (www.photo-natur.de) for the photograph on Black‐winged stilt (Fig. 7a). We are grateful for valuable comments provided by four anonymous referees. The research was funded by the Academy of Finland (CoE grant 284601 to OO and grant 285803 to TR), the Research Council of Norway (CoE grant 223257), the Research Foundation of the University of Helsinki (PhD grant to AN), and the LUOVA graduate school of the University of Helsinki (PhD grant for GT).
HMSC‐software. The Matlab and R‐versions of the HMSC‐software, the user manual, and scripts for replicating the analyses presented here are found from https://www.helsinki.fi/en/researchgroups/metapopulation-research-centre/hmsc
Authorship
OO, LD and DD developed the statistical methods. GT implemented the Matlab package, GB the R‐package and AN, GT, GB, NA and OO wrote the user manual. OO, NA and TR wrote the first draft of the manuscript and all authors contributed substantially to the revisions.
References
Citing Literature
Number of times cited according to CrossRef: 215
- Alienor Jeliazkov, Darko Mijatovic, Stéphane Chantepie, Nigel Andrew, Raphaël Arlettaz, Luc Barbaro, Nadia Barsoum, Alena Bartonova, Elena Belskaya, Núria Bonada, Anik Brind’Amour, Rodrigo Carvalho, Helena Castro, Damian Chmura, Philippe Choler, Karen Chong-Seng, Daniel Cleary, Anouk Cormont, William Cornwell, Ramiro de Campos, Nicole de Voogd, Sylvain Doledec, Joshua Drew, Frank Dziock, Anthony Eallonardo, Melanie J. Edgar, Fábio Farneda, Domingo Flores Hernandez, Cédric Frenette-Dussault, Guillaume Fried, Belinda Gallardo, Heloise Gibb, Thiago Gonçalves-Souza, Janet Higuti, Jean-Yves Humbert, Boris R. Krasnov, Eric Le Saux, Zoe Lindo, Adria Lopez-Baucells, Elizabeth Lowe, Bryndis Marteinsdottir, Koen Martens, Peter Meffert, Andres Mellado-Díaz, Myles H. M. Menz, Christoph F. J. Meyer, Julia Ramos Miranda, David Mouillot, Alessandro Ossola, Robin Pakeman, Sandrine Pavoine, Burak Pekin, Joan Pino, Arnaud Pocheville, Francesco Pomati, Peter Poschlod, Honor C. Prentice, Oliver Purschke, Valerie Raevel, Triin Reitalu, Willem Renema, Ignacio Ribera, Natalie Robinson, Bjorn Robroek, Ricardo Rocha, Sen-Her Shieh, Rebecca Spake, Monika Staniaszek-Kik, Michal Stanko, Francisco Leonardo Tejerina-Garro, Cajo ter Braak, Mark C. Urban, Roel van Klink, Sébastien Villéger, Ruut Wegman, Martin J. Westgate, Jonas Wolff, Jan Żarnowiec, Maxim Zolotarev, Jonathan M. Chase, A global database for metacommunity ecology, integrating species, traits, environment and space, Scientific Data, 10.1038/s41597-019-0344-7, 7, 1, (2020).
- Gleb Tikhonov, Øystein H. Opedal, Nerea Abrego, Aleksi Lehikoinen, Melinda M. J. Jonge, Jari Oksanen, Otso Ovaskainen, Joint species distribution modelling with the r‐package Hmsc, Methods in Ecology and Evolution, 10.1111/2041-210X.13345, 11, 3, (442-447), (2020).
- Hao Ran Lai, Kwek Yan Chong, Alex Thiam Koon Yee, Hugh Tiang Wah Tan, Michiel Breugel, Functional traits that moderate tropical tree recruitment during post‐windstorm secondary succession, Journal of Ecology, 10.1111/1365-2745.13347, 108, 4, (1322-1333), (2020).
- Peter Vermeiren, Peter Reichert, Nele Schuwirth, Integrating uncertain prior knowledge regarding ecological preferences into multi-species distribution models: Effects of model complexity on predictive performance, Ecological Modelling, 10.1016/j.ecolmodel.2020.108956, 420, (108956), (2020).
- Jonathan F. Jupke, Ralf B. Schäfer, Should ecologists prefer model‐ over distance‐based multivariate methods?, Ecology and Evolution, 10.1002/ece3.6059, 10, 5, (2417-2435), (2020).
- Christopher Krapu, Mark Borsuk, A spatial community regression approach to exploratory analysis of ecological data, Methods in Ecology and Evolution, 10.1111/2041-210X.13371, 11, 5, (608-620), (2020).
- Kadeem J. Gilbert, Leonora S. Bittleston, Wenfei Tong, Naomi E. Pierce, Tropical pitcher plants (Nepenthes) act as ecological filters by altering properties of their fluid microenvironments, Scientific Reports, 10.1038/s41598-020-61193-x, 10, 1, (2020).
- Raphaëlle Momal, Stéphane Robin, Christophe Ambroise, Tree‐based inference of species interaction networks from abundance data, Methods in Ecology and Evolution, 10.1111/2041-210X.13380, 11, 5, (621-632), (2020).
- Marco Basile, Thomas Asbeck, Marlotte Jonker, Anna K. Knuff, Jürgen Bauhus, Veronika Braunisch, Grzegorz Mikusiński, Ilse Storch, What do tree-related microhabitats tell us about the abundance of forest-dwelling bats, birds, and insects?, Journal of Environmental Management, 10.1016/j.jenvman.2020.110401, 264, (110401), (2020).
- Angela M. Reid, Brett P. Murphy, Tom Vigilante, David M. J. S. Bowman, Distribution and abundance of large herbivores in a northern Australian tropical savanna: A multi‐scale approach, Austral Ecology, 10.1111/aec.12860, 45, 5, (529-547), (2020).
- F. Guillaume Blanchet, Kevin Cazelles, Dominique Gravel, Co‐occurrence is not evidence of ecological interactions, Ecology Letters, 10.1111/ele.13525, 23, 7, (1050-1063), (2020).
- Øystein H. Opedal, Mikael von Numers, Gleb Tikhonov, Otso Ovaskainen, Refining predictions of metacommunity dynamics by modeling species non‐independence, Ecology, 10.1002/ecy.3067, 101, 8, (2020).
- Otso Ovaskainen, Nerea Abrego, , Joint Species Distribution Modelling, 10.1017/9781108591720, (2020).
- Jasmine S.M. Veitch, Jeff Bowman, Albrecht I. Schulte-Hostedde, Non-random ectoparasite species co-occurrence patterns on deer mice (Peromyscus maniculatus): Joint species distribution modelling, International Journal for Parasitology: Parasites and Wildlife, 10.1016/j.ijppaw.2020.04.011, (2020).
- Nerea Abrego, Tomas Roslin, Tea Huotari, Ayco J. M. Tack, Björn D. Lindahl, Gleb Tikhonov, Panu Somervuo, Niels Martin Schmidt, Otso Ovaskainen, Accounting for environmental variation in co‐occurrence modelling reveals the importance of positive interactions in root‐associated fungal communities, Molecular Ecology, 10.1111/mec.15516, 29, 14, (2736-2746), (2020).
- Jesús N. Pinto-Ledezma, Jeannine Cavender-Bares, Using Remote Sensing for Modeling and Monitoring Species Distributions, Remote Sensing of Plant Biodiversity, 10.1007/978-3-030-33157-3, (199-223), (2020).
- Chongliang Zhang, Yong Chen, Binduo Xu, Ying Xue, Yiping Ren, Temporal transferability of marine distribution models in a multispecies context, Ecological Indicators, 10.1016/j.ecolind.2020.106649, 117, (106649), (2020).
- Chongliang Zhang, Yong Chen, Binduo Xu, Ying Xue, Yiping Ren, Improving prediction of rare species’ distribution from community data, Scientific Reports, 10.1038/s41598-020-69157-x, 10, 1, (2020).
- Nicole Hill, Skipton N. C. Woolley, Scott Foster, Piers K. Dunstan, John McKinlay, Otso Ovaskainen, Craig Johnson, Determining marine bioregions: A comparison of quantitative approaches, Methods in Ecology and Evolution, 10.1111/2041-210X.13447, 11, 10, (1258-1272), (2020).
- Christian Damgaard, Rikke Reisner Hansen, Francis K.C. Hui, Model-based ordination of pin-point cover data: Effect of management on dry heathland, Ecological Informatics, 10.1016/j.ecoinf.2020.101155, (101155), (2020).
- Márcio F.A. Leite, Eiko E. Kuramae, You must choose, but choose wisely: Model-based approaches for microbial community analysis, Soil Biology and Biochemistry, 10.1016/j.soilbio.2020.108042, 151, (108042), (2020).
- Nerea Abrego, Tea Huotari, Ayco J. M. Tack, Björn D. Lindahl, Gleb Tikhonov, Panu Somervuo, Niels Martin Schmidt, Otso Ovaskainen, Tomas Roslin, Higher host plant specialization of root‐associated endophytes than mycorrhizal fungi along an arctic elevational gradient, Ecology and Evolution, 10.1002/ece3.6604, 10, 16, (8989-9002), (2020).
- Christian Damgaard, Measurement Uncertainty in Ecological and Environmental Models, Trends in Ecology & Evolution, 10.1016/j.tree.2020.07.003, (2020).
- Ming-Chih Chiu, Sicheng Ao, Fengzhi He, Vincent H. Resh, Qinghua Cai, Elevation shapes biodiversity patterns through metacommunity-structuring processes, Science of The Total Environment, 10.1016/j.scitotenv.2020.140548, 743, (140548), (2020).
- Casper Kraan, Simon F. Thrush, Carsten F. Dormann, Co-occurrence patterns and the large-scale spatial structure of benthic communities in seagrass meadows and bare sand, BMC Ecology, 10.1186/s12898-020-00308-4, 20, 1, (2020).
- Mervi Laaksonen, Pekka Punttila, Juha Siitonen, Otso Ovaskainen, Saproxylic beetle assemblages in recently dead Scots pines: How traits modulate species’ response to forest management?, Forest Ecology and Management, 10.1016/j.foreco.2020.118300, 473, (118300), (2020).
- Limoilou-A. Renaud, F. Rousseu, F. Guillaume Blanchet, Alan A. Cohen, Marco Festa-Bianchet, Fanie Pelletier, Milk composition in a wild mammal: a physiological signature of phenological changes, Oecologia, 10.1007/s00442-020-04684-y, 193, 2, (349-358), (2020).
- Heidi K. Mod, Mathieu Chevalier, Miska Luoto, Antoine Guisan, Scale dependence of ecological assembly rules: Insights from empirical datasets and joint species distribution modelling, Journal of Ecology, 10.1111/1365-2745.13434, 108, 5, (1967-1977), (2020).
- Boris R. Krasnov, Michal Stanko, Marcela Lareschi, Irina S. Khokhlova, Species co‐occurrences in ectoparasite infracommunities: Accounting for confounding factors associated with space, time, and host community composition, Ecological Entomology, 10.1111/een.12900, 45, 5, (1158-1171), (2020).
- Ji Yoon Kim, Jun Nishihiro, Responses of lake macrophyte species and functional traits to climate and land use changes, Science of The Total Environment, 10.1016/j.scitotenv.2020.139628, 736, (139628), (2020).
- Catherine M. McClure, Kelly L. Smalling, Vicki S. Blazer, Adam J. Sperry, Megan K. Schall, Dana W. Kolpin, Patrick J. Phillips, Michelle L. Hladik, Tyler Wagner, Spatiotemporal variation in occurrence and co-occurrence of pesticides, hormones, and other organic contaminants in rivers in the Chesapeake Bay Watershed, United States, Science of The Total Environment, 10.1016/j.scitotenv.2020.138765, 728, (138765), (2020).
- Jonas J Lembrechts, L Broeders, J De Gruyter, D Radujković, I Ramirez-Rojas, J Lenoir, E Verbruggen, A framework to bridge scales in distribution modeling of soil microbiota, FEMS Microbiology Ecology, 10.1093/femsec/fiaa051, 96, 5, (2020).
- Zheng Han, Lishi Zhang, Yunlei Jiang, Haitao Wang, Frédéric Jiguet, Unravelling species co‐occurrence in a steppe bird community of Inner Mongolia: Insights for the conservation of the endangered Jankowski’s Bunting, Diversity and Distributions, 10.1111/ddi.13061, 26, 7, (843-852), (2020).
- R. Arraiano-Castilho, M.I. Bidartondo, T. Niskanen, S. Zimmermann, B. Frey, I. Brunner, B. Senn-Irlet, E. Hörandl, S. Gramlich, L.M. Suz, Plant-fungal interactions in hybrid zones: Ectomycorrhizal communities of willows (Salix) in an alpine glacier forefield, Fungal Ecology, 10.1016/j.funeco.2020.100936, 45, (100936), (2020).
- Elildo A.R. Carvalho, Eloísa N. Mendonça, Alexandre Martins, Torbjørn Haugaasen, Effects of illegal logging on Amazonian medium and large-sized terrestrial vertebrates, Forest Ecology and Management, 10.1016/j.foreco.2020.118105, 466, (118105), (2020).
- Tamara Münkemüller, Laure Gallien, Laura J. Pollock, Ceres Barros, Marta Carboni, Loïc Chalmandrier, Florent Mazel, Karel Mokany, Cristina Roquet, Jan Smyčka, Matthew V. Talluto, Wilfried Thuiller, Dos and don'ts when inferring assembly rules from diversity patterns, Global Ecology and Biogeography, 10.1111/geb.13098, 29, 7, (1212-1229), (2020).
- Jack D. Shutt, James A. Nicholls, Urmi H. Trivedi, Malcolm D. Burgess, Graham N. Stone, Jarrod D. Hadfield, Albert B. Phillimore, Gradients in richness and turnover of a forest passerine's diet prior to breeding: A mixed model approach applied to faecal metabarcoding data, Molecular Ecology, 10.1111/mec.15394, 29, 6, (1199-1213), (2020).
- Yifei JIA, Qing ZENG, Yuyu WANG, Neil SAINTILAN, Guangchun LEI, Li WEN, Processes shaping wintering waterbird communities in an intensive modified landscape: Neutral assembly with dispersal limitation and localized competition, Ecological Indicators, 10.1016/j.ecolind.2020.106330, 114, (106330), (2020).
- Julie Vercelloni, Benoit Liquet, Emma V. Kennedy, Manuel González‐Rivero, M. Julian Caley, Erin E. Peterson, Marji Puotinen, Ove Hoegh‐Guldberg, Kerrie Mengersen, Forecasting intensifying disturbance effects on coral reefs, Global Change Biology, 10.1111/gcb.15059, 26, 5, (2785-2797), (2020).
- Oskar Löfgren, Karin Hall, Barbara Christine Schmid, Honor Clare Prentice, Grasslands ancient and modern: Soil nutrients, habitat age and their relation to Ellenberg N, Journal of Vegetation Science, 10.1111/jvs.12856, 31, 3, (367-379), (2020).
- Ben J. Kefford, Paul K. Botwe, Andrew J. Brooks, Stefan Kunz, Richard Marchant, Sally Maxwell, Leon Metzeling, Ralf B. Schäfer, Ross M. Thompson, An integrated database of stream macroinvertebrate traits for Australia: concept and application, Ecological Indicators, 10.1016/j.ecolind.2020.106280, 114, (106280), (2020).
- Nicholas J. Clark, Kei Owada, Eugene Ruberanziza, Giuseppina Ortu, Irenee Umulisa, Ursin Bayisenge, Jean Bosco Mbonigaba, Jean Bosco Mucaca, Warren Lancaster, Alan Fenwick, Ricardo J. Soares Magalhães, Aimable Mbituyumuremyi, Parasite associations predict infection risk: incorporating co-infections in predictive models for neglected tropical diseases, Parasites & Vectors, 10.1186/s13071-020-04016-2, 13, 1, (2020).
- Anders Pape Møller, Quantifying rapidly declining abundance of insects in Europe using a paired experimental design, Ecology and Evolution, 10.1002/ece3.6070, 10, 5, (2446-2451), (2020).
- Linda K. Petersson, Daniel C. Dey, Annika M. Felton, Emile S. Gardiner, Magnus Löf, Influence of canopy openness, ungulate exclosure, and low‐intensity fire for improved oak regeneration in temperate Europe, Ecology and Evolution, 10.1002/ece3.6092, 10, 5, (2626-2637), (2020).
- Colin Southwell, Louise Emmerson, Density dependence forces divergent population growth rates and alters occupancy patterns of a central place foraging Antarctic seabird, Ecology and Evolution, 10.1002/ece3.6037, 10, 5, (2339-2351), (2020).
- Quresh S. Latif, Jacob S. Ivan, Amy E. Seglund, David L. Pavlacky, Richard L. Truex, Avian relationships with bark beetle outbreaks and underlying mechanisms in lodgepole pine and spruce-fir forests of Colorado, Forest Ecology and Management, 10.1016/j.foreco.2020.118043, 464, (118043), (2020).
- Alwin A. Hardenbol, Kaisa Junninen, Jari Kouki, A key tree species for forest biodiversity, European aspen (Populus tremula), is rapidly declining in boreal old-growth forest reserves, Forest Ecology and Management, 10.1016/j.foreco.2020.118009, 462, (118009), (2020).
- Fabrice Stephenson, Kimberly Goetz, Ben R. Sharp, Théophile L. Mouton, Fenna L. Beets, Jim Roberts, Alison B. MacDiarmid, Rochelle Constantine, Carolyn J. Lundquist, Modelling the spatial distribution of cetaceans in New Zealand waters, Diversity and Distributions, 10.1111/ddi.13035, 26, 4, (495-516), (2020).
- Jan Jansen, Piers K. Dunstan, Nicole A. Hill, Philippe Koubbi, Jessica Melbourne‐Thomas, Romain Causse, Craig R. Johnson, Integrated assessment of the spatial distribution and structural dynamics of deep benthic marine communities, Ecological Applications, 10.1002/eap.2065, 30, 3, (2020).
- Francisco Javier Murillo, Benjamin Weigel, Marieve Bouchard Marmen, Ellen Kenchington, Marine epibenthic functional diversity on Flemish Cap (north‐west Atlantic)—Identifying trait responses to the environment and mapping ecosystem functions, Diversity and Distributions, 10.1111/ddi.13026, 26, 4, (460-478), (2020).
- Nicholas M. Fountain‐Jones, Nicholas J. Clark, Amy C. Kinsley, Michelle Carstensen, James Forester, Timothy J. Johnson, Elizabeth A. Miller, Seth Moore, Tiffany M. Wolf, Meggan E. Craft, Microbial associations and spatial proximity predict North American moose (Alces alces) gastrointestinal community composition, Journal of Animal Ecology, 10.1111/1365-2656.13154, 89, 3, (817-828), (2020).
- Matthias Schleuning, Eike Lena Neuschulz, Jörg Albrecht, Irene M.A. Bender, Diana E. Bowler, D. Matthias Dehling, Susanne A. Fritz, Christian Hof, Thomas Mueller, Larissa Nowak, Marjorie C. Sorensen, Katrin Böhning-Gaese, W. Daniel Kissling, Trait-Based Assessments of Climate-Change Impacts on Interacting Species, Trends in Ecology & Evolution, 10.1016/j.tree.2019.12.010, (2020).
- Samantha K. Dawson, Jane A. Catford, Peter Berney, Richard T. Kingsford, Samantha Capon, Land use alters soil propagule banks of wetlands down the soil-depth profile, Marine and Freshwater Research, 10.1071/MF18438, 71, 2, (191), (2020).
- Gianluigi Ottaviani, Gunnar Keppel, Lars Götzenberger, Susan Harrison, Øystein H. Opedal, Luisa Conti, Pierre Liancourt, Jitka Klimešová, Fernando A.O. Silveira, Borja Jiménez-Alfaro, Luka Negoita, Jiří Doležal, Michal Hájek, Thomas Ibanez, Francisco E. Méndez-Castro, Milan Chytrý, Linking Plant Functional Ecology to Island Biogeography, Trends in Plant Science, 10.1016/j.tplants.2019.12.022, (2020).
- PH Wepfer, Y Nakajima, FKC Hui, S Mitarai, EP Economo, Metacommunity ecology of Symbiodiniaceae hosted by the coral Galaxea fascicularis, Marine Ecology Progress Series, 10.3354/meps13177, 633, (71-87), (2020).
- Jay Calvert, Chris McGonigle, Benthic community structure at a remote temperate rocky reef in the Gulf of Maine, Cashes Ledge, Seafloor Geomorphology as Benthic Habitat, 10.1016/B978-0-12-814960-7.00026-9, (461-471), (2020).
- Rui Morgado, Joana Santana, Miguel Porto, Juan S. Sánchez-Oliver, Luís Reino, José M. Herrera, Francisco Rego, Pedro Beja, Francisco Moreira, A Mediterranean silent spring? The effects of olive farming intensification on breeding bird communities, Agriculture, Ecosystems & Environment, 10.1016/j.agee.2019.106694, 288, (106694), (2020).
- Lélis A. Carlos-Júnior, Joel C. Creed, Rob Marrs, Rob J. Lewis, Timothy P. Moulton, Rafael Feijó-Lima, Matthew Spencer, Generalized Linear Models outperform commonly used canonical analysis in estimating spatial structure of presence/absence data, PeerJ, 10.7717/peerj.9777, 8, (e9777), (2020).
- Warren Paul, Covariate-adjusted species response curves derived from long-term macroinvertebrate monitoring data using classical and contemporary model-based ordination methods, Ecological Informatics, 10.1016/j.ecoinf.2020.101159, (101159), (2020).
- Samantha K. Dawson, Håkan Berglund, Otso Ovaskainen, Tord Snäll, Bengt G. Jonsson, Mari Jönsson, Convergence of fungal traits over time in natural and forestry-fragmented patches, Biological Conservation, 10.1016/j.biocon.2020.108789, 251, (108789), (2020).
- Øystein H. Opedal, Adriana A. Martins, Emma-Liina Marjakangas, A database and synthesis of euglossine bee assemblages collected at fragrance baits, Apidologie, 10.1007/s13592-020-00739-4, (2020).
- Chloé Mathieu, Syrie M. Hermans, Gavin Lear, Thomas R. Buckley, Kevin C. Lee, Hannah L. Buckley, A Systematic Review of Sources of Variability and Uncertainty in eDNA Data for Environmental Monitoring, Frontiers in Ecology and Evolution, 10.3389/fevo.2020.00135, 8, (2020).
- Chongliang Zhang, Yong Chen, Binduo Xu, Ying Xue, Yiping Ren, Evaluating the influence of spatially varying catchability on multispecies distribution modelling, ICES Journal of Marine Science, 10.1093/icesjms/fsaa068, (2020).
- Tyler Wagner, Gretchen J. A. Hansen, Erin M Schliep, Bethany J. Bethke, Andrew E Honsey, Peter C Jacobson, Benjamen C Kline, Shannon L White, Improved understanding and prediction of freshwater fish communities through the use of joint species distribution models, Canadian Journal of Fisheries and Aquatic Sciences, 10.1139/cjfas-2019-0348, (2020).
- Martin Ingram, Damjan Vukcevic, Nick Golding, Multi‐output Gaussian processes for species distribution modelling, Methods in Ecology and Evolution, 10.1111/2041-210X.13496, 0, 0, (2020).
- Nerea Abrego, Brittni Crosier, Panu Somervuo, Natalia Ivanova, Arusyak Abrahamyan, Amir Abdi, Karoliina Hämäläinen, Kaisa Junninen, Minna Maunula, Jenna Purhonen, Otso Ovaskainen, Fungal communities decline with urbanization—more in air than in soil, The ISME Journal, 10.1038/s41396-020-0732-1, (2020).
- Maria H. Hällfors, Laura H. Antão, Malcolm Itter, Aleksi Lehikoinen, Tanja Lindholm, Tomas Roslin, Marjo Saastamoinen, Shifts in timing and duration of breeding for 73 boreal bird species over four decades, Proceedings of the National Academy of Sciences, 10.1073/pnas.1913579117, (201913579), (2020).
- Filip HARABIS, Jana HRONKOVA, European database of the life-history, morphological and habitat characteristics of dragonflies (Odonata), European Journal of Entomology, 10.14411/eje.2020.035, 117, (302-308), (2020).
- Martin Wiemers, Nicolas Chazot, Christopher Wheat, Oliver Schweiger, Niklas Wahlberg, A complete time-calibrated multi-gene phylogeny of the European butterflies, ZooKeys, 10.3897/zookeys.938.50878, 938, (97-124), (2020).
- Arliss J. Winship, James T. Thorson, M. Elizabeth Clarke, Heather M. Coleman, Bryan Costa, Samuel E. Georgian, David Gillett, Arnaud Grüss, Mark J. Henderson, Thomas F. Hourigan, David D. Huff, Nissa Kreidler, Jodi L. Pirtle, John V. Olson, Matthew Poti, Christopher N. Rooper, Michael F. Sigler, Shay Viehman, Curt E. Whitmire, Good Practices for Species Distribution Modeling of Deep-Sea Corals and Sponges for Resource Management: Data Collection, Analysis, Validation, and Communication, Frontiers in Marine Science, 10.3389/fmars.2020.00303, 7, (2020).
- Kadambari Devarajan, Toni Lyn Morelli, Simone Tenan, Multi‐species occupancy models: review, roadmap, and recommendations, Ecography, 10.1111/ecog.04957, 0, 0, (2020).
- Otso Ovaskainen, Nerea Abrego, Panu Somervuo, Isabella Palorinne, Bess Hardwick, Juha-Matti Pitkänen, Nigel R. Andrew, Pascal A. Niklaus, Niels Martin Schmidt, Sebastian Seibold, Juliane Vogt, Evgeny V. Zakharov, Paul D. N. Hebert, Tomas Roslin, Natalia V. Ivanova, Monitoring Fungal Communities With the Global Spore Sampling Project, Frontiers in Ecology and Evolution, 10.3389/fevo.2019.00511, 7, (2020).
- Øystein H. Opedal, Otso Ovaskainen, Marjo Saastamoinen, Anna‐Liisa Laine, Saskya Nouhuys, Host‐plant availability drives the spatiotemporal dynamics of interacting metapopulations across a fragmented landscape, Ecology, 10.1002/ecy.3186, 0, 0, (2020).
- Sonja Saine, Otso Ovaskainen, Panu Somervuo, Nerea Abrego, Data collected by fruit body‐ and DNA‐based survey methods yield consistent species‐to‐species association networks in wood‐inhabiting fungal communities, Oikos, 10.1111/oik.07502, 0, 0, (2020).
- Torsti Schulz, Jarno Vanhatalo, Marjo Saastamoinen, Long‐term demographic surveys reveal a consistent relationship between average occupancy and abundance within local populations of a butterfly metapopulation, Ecography, 10.1111/ecog.04799, 43, 2, (306-317), (2019).
- Daniel Scherrer, Heidi K. Mod, Antoine Guisan, How to evaluate community predictions without thresholding?, Methods in Ecology and Evolution, 10.1111/2041-210X.13312, 11, 1, (51-63), (2019).
- Emma‐Liina Marjakangas, Nerea Abrego, Vidar Grøtan, Renato A. F. Lima, Carolina Bello, Ricardo S. Bovendorp, Laurence Culot, Érica Hasui, Fernando Lima, Renata Lara Muylaert, Bernardo Brandão Niebuhr, Alexandre A. Oliveira, Lucas Augusto Pereira, Paulo I. Prado, Richard D. Stevens, Maurício Humberto Vancine, Milton Cezar Ribeiro, Mauro Galetti, Otso Ovaskainen, Fragmented tropical forests lose mutualistic plant–animal interactions, Diversity and Distributions, 10.1111/ddi.13010, 26, 2, (154-168), (2019).
- Gleb Tikhonov, Li Duan, Nerea Abrego, Graeme Newell, Matt White, David Dunson, Otso Ovaskainen, Computationally efficient joint species distribution modeling of big spatial data, Ecology, 10.1002/ecy.2929, 101, 2, (2019).
- Andrew Hoegh, David W. Roberts, Evaluating and presenting uncertainty in model‐based unconstrained ordination, Ecology and Evolution, 10.1002/ece3.5752, 10, 1, (59-69), (2019).
- Christopher R. Haak, Francis K. C. Hui, Geoffrey W. Cowles, Andy J. Danylchuk, Positive interspecific associations consistent with social information use shape juvenile fish assemblages, Ecology, 10.1002/ecy.2920, 101, 2, (2019).
- Teresa Morán‐López, Walter D. Espíndola, Benjamin S. Vizzachero, Antonio Fontanella, Letty Salinas, César Arana, Guillermo Amico, Marco A. Pizo, Tomás A. Carlo, Juan M. Morales, Can network metrics predict vulnerability and species roles in bird‐dispersed plant communities? Not without behaviour, Ecology Letters, 10.1111/ele.13439, 23, 2, (348-358), (2019).
- Guilhem Sommeria‐Klein, Lucie Zinger, Eric Coissac, Amaia Iribar, Heidy Schimann, Pierre Taberlet, Jérôme Chave, Latent Dirichlet Allocation reveals spatial and taxonomic structure in a DNA‐based census of soil biodiversity from a tropical forest, Molecular Ecology Resources, 10.1111/1755-0998.13109, 20, 2, (371-386), (2019).
- Camilla Maciel Rabelo Pereira, Álvaro López‐García, Danielle Karla Alves Silva, Leonor Costa Maia, Tobias Guldberg Frøslev, Rasmus Kjøller, Søren Rosendahl, Tropical forest type influences community assembly processes in arbuscular mycorrhizal fungi, Journal of Biogeography, 10.1111/jbi.13739, 47, 2, (434-444), (2019).
- David W. Roberts, Comparison of distance‐based and model‐based ordinations, Ecology, 10.1002/ecy.2908, 101, 1, (2019).
- Dajana Radujković, Rudy Diggelen, Roland Bobbink, Maaike Weijters, Jim Harris, Mark Pawlett, Sara Vicca, Erik Verbruggen, Initial soil community drives heathland fungal community trajectory over multiple years through altered plant–soil interactions, New Phytologist, 10.1111/nph.16226, 225, 5, (2140-2151), (2019).
- Øystein H. Opedal, Stein J. Hegland, Using hierarchical joint models to study reproductive interactions in plant communities, Journal of Ecology, 10.1111/1365-2745.13301, 108, 2, (485-495), (2019).
- Damaris Zurell, Niklaus E. Zimmermann, Helge Gross, Andri Baltensweiler, Thomas Sattler, Rafael O. Wüest, Testing species assemblage predictions from stacked and joint species distribution models, Journal of Biogeography, 10.1111/jbi.13608, 47, 1, (101-113), (2019).
- Shuo Jiao, Yahai Lu, Soil pH and temperature regulate assembly processes of abundant and rare bacterial communities in agricultural ecosystems, Environmental Microbiology, 10.1111/1462-2920.14815, 22, 3, (1052-1065), (2019).
- Jonne Kotta, Jarno Vanhatalo, Holger Jänes, Helen Orav-Kotta, Luca Rugiu, Veijo Jormalainen, Ivo Bobsien, Markku Viitasalo, Elina Virtanen, Antonia Nyström Sandman, Martin Isaeus, Sonja Leidenberger, Per R. Jonsson, Kerstin Johannesson, Integrating experimental and distribution data to predict future species patterns, Scientific Reports, 10.1038/s41598-018-38416-3, 9, 1, (2019).
- Tad A. Dallas, Anna-Liisa Laine, Otso Ovaskainen, Detecting parasite associations within multi-species host and parasite communities, Proceedings of the Royal Society B: Biological Sciences, 10.1098/rspb.2019.1109, 286, 1912, (20191109), (2019).
- Cajo J. F. ter Braak, New robust weighted averaging‐ and model‐based methods for assessing trait–environment relationships, Methods in Ecology and Evolution, 10.1111/2041-210X.13278, 10, 11, (1962-1971), (2019).
- Jenni Niku, Francis K. C. Hui, Sara Taskinen, David I. Warton, gllvm: Fast analysis of multivariate abundance data with generalized linear latent variable models in r, Methods in Ecology and Evolution, 10.1111/2041-210X.13303, 10, 12, (2173-2182), (2019).
- Chongliang Zhang, Binduo Xu, Ying Xue, Yiping Ren, Evaluating multispecies survey designs using a joint species distribution model, Aquaculture and Fisheries, 10.1016/j.aaf.2019.11.002, (2019).
- James T. Thorson, Lorenzo Ciannelli, Michael A. Litzow, Defining indices of ecosystem variability using biological samples of fish communities: A generalization of empirical orthogonal functions, Progress in Oceanography, 10.1016/j.pocean.2019.102244, (102244), (2019).
- Yuichi Yamaura, F. Guillaume Blanchet, Motoki Higa, Analyzing community structure subject to incomplete sampling: hierarchical community model vs. canonical ordinations, Ecology, 10.1002/ecy.2759, 100, 8, (2019).
- Gordana C. Popovic, David I. Warton, Fiona J. Thomson, Francis K. C. Hui, Angela T. Moles, Untangling direct species associations from indirect mediator species effects with graphical models, Methods in Ecology and Evolution, 10.1111/2041-210X.13247, 10, 9, (1571-1583), (2019).
- Otso Ovaskainen, Danielle Leal Ramos, Eleanor M. Slade, Thomas Merckx, Gleb Tikhonov, Juho Pennanen, Marco Aurélio Pizo, Milton Cezar Ribeiro, Juan Manuel Morales, Joint species movement modeling: how do traits influence movements?, Ecology, 10.1002/ecy.2622, 100, 4, (2019).
- Duarte S. Viana, Jonathan M. Chase, Spatial scale modulates the inference of metacommunity assembly processes, Ecology, 10.1002/ecy.2576, 100, 2, (2019).
- Martin Stjernman, Ullrika Sahlin, Ola Olsson, Henrik G. Smith, Estimating effects of arable land use intensity on farmland birds using joint species modeling, Ecological Applications, 10.1002/eap.1875, 29, 4, (2019).
- Res Altwegg, James D. Nichols, Occupancy models for citizen‐science data, Methods in Ecology and Evolution, 10.1111/2041-210X.13090, 10, 1, (8-21), (2019).
- See more



) as a function of species' traits (
) and phylogenetic correlations (
)
or 

