Current Address: Department of Fisheries and Wildlife, Oregon State University, Corvallis, Oregon, USA
‘Exciting statistics’: the rapid development and promising future of hierarchical models for population ecology
Article first published online: 29 MAR 2012
© 2012 The Authors. Animal Conservation © 2012 The Zoological Society of London
Volume 15, Issue 2, pages 133–135, April 2012
How to Cite
Halstead, B. J., Wylie, G. D., Coates, P. S., Valcarcel, P. and Casazza, M. L. (2012), ‘Exciting statistics’: the rapid development and promising future of hierarchical models for population ecology. Animal Conservation, 15: 133–135. doi: 10.1111/j.1469-1795.2012.00540.x
- Issue published online: 29 MAR 2012
- Article first published online: 29 MAR 2012
Many biologists might indeed find statistics dull or even terrifying (van Emden, 2008), but ecologists and wildlife biologists should be downright ecstatic with recent developments in statistical modeling. One of the most exciting advances in recent decades is the development of hierarchical models. Hierarchical or multi-level models are becoming increasingly common in population analysis, and with good reason. Indeed, Kéry & Schaub (2011) devote much of their final chapter to the ‘Power and Beauty of Hierarchical Models’, and among the many virtues they list for hierarchical models are simpler fitting of complex models, clarity of thinking about these models and a ‘step-up’ approach to model building (Kéry & Schaub, 2011). Simpler and clearer model construction is an important advantage of hierarchical models, and although this may be reason enough to choose a hierarchical approach to modeling ecological data, the benefits do not stop with the fitting of the model to data.
The property of shrinkage is a particularly important advantage of hierarchical models in ecology (Gelman et al., 2004; Gelman & Hill, 2006; Ntzoufras, 2011; Cam, 2012; Schaub & Kéry, 2012). Shrinkage (or ‘borrowing strength’) refers to the assumption of random effects to be realizations of a process governed by a statistical distribution (Gelman et al., 2004; Cam, 2012; Schaub & Kéry, 2012). This stochastic assumption means that realizations of the process (such as daily risk of mortality at sites, in the present case) that are poorly estimated because of small sample size, short study duration, etc., but that fall far from the population mean will be pulled toward the mean; those realizations that are strongly supported by data contribute more information about the distribution from which the stochastic realizations are drawn, and are pulled less strongly toward the mean (Gelman et al., 2004; Gelman & Hill, 2006). Including random effects in models therefore allows the estimation of parameters for realizations with small sample sizes, guards against spurious conclusions, and increases the precision of parameter estimates for all realizations of the process of interest (Gelman & Hill, 2006; Cam, 2012; Schaub & Kéry, 2012).
Another important benefit of hierarchical models is reduction in bias. Where unmeasured (or unmeasurable) variables affect a parameter of interest, estimates of that parameter can be biased; applying a random effect allows for such unobserved heterogeneity and reduces the bias associated with unmeasured covariates (Gelman & Hill, 2006; Cam, 2012). The many advantages of hierarchical models for animal conservation apply not only to shared frailty models (Heisey et al., 2010; Griffin et al., 2011; Halstead et al., 2012), but also to modeling individual frailties in survival and reproduction (Cam et al., 2002; Wintrebert et al., 2005), accounting for heterogeneity in individual capture probabilities (Dorazio & Royle, 2003), discerning patterns among demographic parameters (Link & Barker, 2005) and combining inference for mark-recapture models (Rivot & Prévost, 2002; Calvert et al., 2009; Papadatou et al., 2010, 2012) among many, many other applications.
Although all commentators correctly acknowledge that algebraically identical hierarchical models can be analyzed under a frequentist or Bayesian approach, the latter offers the ecologist several important advantages. Modern software allows the simple implementation of complex models that would be difficult to analyze in a likelihood framework (Royle & Dorazio, 2008; Kéry & Schaub, 2011; Schaub & Kéry, 2012). This frees ecologists to focus on developing interesting, relevant models for their systems rather than writing complex likelihood functions (Kéry, 2010; Kéry & Schaub, 2011; Schaub & Kéry, 2012). The consistent treatment of fixed effects, random effects, and missing data is another advantage of the Bayesian paradigm (Link & Barker, 2010; Heisey, 2012). Calculation of derived parameters and out-of-sample predictions, and propagating error associated with these quantities, is simple and does not rely upon asymptotic approximations (Kéry, 2010; Link & Barker, 2010); such derived parameters and predictions are commonly of interest in population models. Updating prior information with data to produce posterior distributions, a centerpiece of the Bayesian paradigm, is consistent with the way most conservation science progresses (Clark, 2007). Further consistency is gained by the intuitive interpretation of posterior distributions as simple probability statements, which is often how resource managers and the public erroneously interpret frequentist confidence intervals. For these benefits, the cost is the specification of priors (Royle & Dorazio, 2008) and computing time, the latter of which can be substantial for some hierarchical models.
Hierarchical models and their analysis by Bayesian methods are increasingly common in ecology, but we have likely only seen the tip of the iceberg with regard to the utility of these models for the ecologist. Hierarchical models are an area of intense statistical research, and many applications relevant for the ecologist have yet to be realized (Cam, 2012; Heisey, 2012). For many ecologists, training in Bayesian analysis consists of a workshop or self-teaching with one (or more) of the several excellent texts on the subject (e.g., McCarthy, 2007; Royle & Dorazio, 2008; King et al., 2009; Kéry, 2010; Link & Barker, 2010; Kéry & Schaub, 2011). Treatment of Bayesian analysis as standard methodology in biometry and statistical ecology courses is in its infancy, and we reinforce the call of Cam (2012) for advanced courses in statistical modeling for wildlife biologists. As Bayesian methods are incorporated into mainstream statistical training for ecologists, we expect even greater growth in the use and development of hierarchical models for ecological applications.
- 2009). A hierarchical Bayesian approach to multi-state mark – recapture: simulations and applications. J. Appl. Ecol. 46, 610–620. , , , , & (
- 2012). ‘Each site has its own survival probability, but information is borrowed across sites to tell us about survival in each site’: random effects models as means of borrowing strength in survival studies of wild vertebrates. Anim. Conserv. 15, 129–132. (
- 2002). Individual covariation in life-history traits: seeing the trees despite the forest. Am. Nat. 159, 96–105. , , , & (
- 2007). Models for ecological data: an introduction. Princeton: Princeton University Press. (
- 2003). Mixture models for estimating the size of a closed population when capture rates vary among individuals. Biometrics 59, 351–364. & (
- 2008). Statistics for terrified biologists. Malden: Wiley–Blackwell. (
- 2004). Bayesian data analysis. 2nd edn. Boca Raton: Chapman & Hall. , , & (
- 2006). Data analysis using regression and multilevel/hierarchical models. Cambridge: Cambridge University Press. & (
- 2011). Neonatal mortality of elk driven by climate, predator phenology and predator community composition. J. Anim. Ecol. 80, 1246–1257. , , , , , , , , , , , , , , , , (
- 2012). Bayesian shared frailty models for regional inference about wildlife survival. Anim. Conserv. 15, 117–124. , , , & (
- 2012). Commentary: Bayesian shared frailty models for regional inference about wildlife survival. Anim. Conserv. 15, 127–128. (
- 2010). Linking process to pattern: estimating spatiotemporal dynamics of a wildlife epidemic from cross-sectional data. Ecol. Monogr. 80, 221–240. , , , , & (
- 2010). Introduction to winbugs for ecologists: Bayesian approach to regression, ANOVA, mixed models and related analyses. Burlington: Academic Press. (
- 2011). Bayesian population analysis using winbugs: a hierarchical perspective. Waltham: Academic Press. & (
- 2009). Bayesian analysis for population ecology. Boca Raton: Chapman and Hall/CRC. , , & (
- 2005). Modeling association among demographic parameters in analysis of open population capture-recapture data. Biometrics 61, 46–54. & (
- 2010). Bayesian inference: with ecological applications. London: Academic Press. & (
- 2007). Bayesian methods for ecology. Cambridge: Cambridge University Press. (
- 2011). Bayesian modeling using winbugs. Hoboken: John Wiley and Sons. (
- 2010). Assessing survival in a multi-population system: a case study on bat populations. Oecologia 165, 925–933. , , , & (
- 2012). Comparing survival among species with imperfect detection using multilevel analysis of mark—recapture data: a case study on bats. Ecography 35, 153–161. , , , , , , , , , , , (
- 2002). Hierarchical Bayesian analysis of capture-mark-recapture data. Can. J. Fish. Aquat. Sci. 59, 1768–1784. (
- 2008). Hierarchical modeling and inference in ecology: the analysis of data from populations, metapopulations and communities. London: Academic Press. & (
- 2012). Combining information in hierarchical models improves inferences in population ecology and demographic population analyses. Anim. Conserv. 15, 125–126. & (
- 2005). Joint modelling of breeding and survival in the kittiwake using frailty models. Ecol. Model. 181, 203–213. , , , & (