Abstract
 Top of page
 Abstract
 Introduction
 A simple example: h^{2} of horn length in unicorns
 Implications of model dependence
 Recommendations for evolutionary ecologists
 Acknowledgments
 References
Over the last decade, there has been a rapid growth in the application of quantitative genetic techniques to evolutionary studies of natural populations. Whereas this work yields enormous insight into evolutionary processes in the wild, the use of modelling techniques and strategies adopted from animal breeders means that estimates of trait heritabilities (h^{2}) are highly vulnerable to misinterpretation. Specifically, when estimated using animal models, h^{2} will not generally be comparable across studies and must be interpreted as being conditioned on any fixed effects included in the model. Failure to realize the model dependency of published h^{2} estimates will give a very misleading, and in most cases upwardly biased, impression of the potential for trait evolution.
Introduction
 Top of page
 Abstract
 Introduction
 A simple example: h^{2} of horn length in unicorns
 Implications of model dependence
 Recommendations for evolutionary ecologists
 Acknowledgments
 References
The heritability (h^{2}) of a phenotypic trait is defined as the proportion of phenotypic variance that is attributable to additive genetic effects, and is a key parameter in determining the evolutionary response to selection (Falconer & Mackay, 1996). As such, evolutionary biologists have long been interested in heritability, both for predicting selection responses, and as a tool to assess more general questions. For example, can laboratorybased estimates be usefully extrapolated to wild populations (Weigensberg & Roff, 1996)? Are some types of trait more heritable than others (Merilä & Sheldon, 1999)? Does heritability change consistently with the quality of the environment (Hoffman & Merila, 1999; Charmantier & Garant, 2005)? However, whereas it is well known that heritability is specific to a trait and a population, it is rather less appreciated that parameter estimates are also heavily determined by the structure of the model used for their estimation. This issue has enormous potential to confuse and mislead evolutionary ecologists, particularly researchers who are less familiar with quantitative genetic techniques.
Over recent years, there has been a surge of interest in the application of quantitative genetic models to data from natural populations (Kruuk, 2004; Postma & Charmantier, 2007). To a large extent, this endeavour has been facilitated by the adoption of the animal model, a form of mixed effects model that has long been used by animal breeders (Henderson, 1984). Mixed effects models contain both fixed and random effects. Fixed effects are used to model populationlevel average responses to explanatory variables, whereas random effects allow remaining variance to be partitioned into components attributable to any grouping factors present in the data (Galwey, 2006). By definition, an animal model includes an individual’s additive genetic merit (or breeding value, Lynch & Walsh, 1998; as a random effect, and as genes are shared among relatives, this allows estimation of the additive genetic variance (V_{A}) for a trait of interest. By comparison with more traditional analytical techniques (e.g. parent–offspring regression), this method offers greater power and flexibility, particularly when dealing with complex pedigree structures typical of natural populations (Kruuk, 2004).
A further advantage of animal models is that fixed effects can readily be included, such that an individual’s phenotype is ‘corrected’ for known sources of variation, such as age and sex, or environmental conditions, such as density or food abundance. For animal breeding applications, the inclusion of fixed effects is used to protect against downward bias in heritability estimates. For example, if one is interested in estimating the heritability of body weight, but individual animals have been measured at different stages of growth, then fitting age as a fixed effect in the model corrects for this allowing a more meaningful comparison among individuals. This same argument also applies to studies of natural populations and a brief survey of the literature shows that studies using animal models to estimate h^{2} in the wild have, almost without exception, included fixed effects in the models (for some recent examples, see Wilson et al., 2005; Qvarnström et al., 2006; Thériault et al., 2007).
Whereas heritability is determined as the ratio of additive (V_{A}) to phenotypic (V_{P}) variance, V_{P} is most often defined as being the variance around the fixed effects mean. That is, to say, V_{P} is determined as the sum of the variance components associated with each random effect, and will therefore not include any variance explained by the fixed effects of a model. Consequently, heritability estimates will be critically determined by the fixed effects structure of the model used. The purpose of this note is to draw attention to the consequences of this practice, encourage discussion on the appropriate use of fixed effects, and demonstrate the vital importance of interpreting h^{2} estimates in the context of the model used for estimation.
A simple example: h^{2} of horn length in unicorns
 Top of page
 Abstract
 Introduction
 A simple example: h^{2} of horn length in unicorns
 Implications of model dependence
 Recommendations for evolutionary ecologists
 Acknowledgments
 References
This effect is most readily demonstrated by example. After defining arbitrary G (additive genetic) and R (residual) covariance matrices for a pair of traits, horn length and body mass in unicorns, phenotypes were simulated across a hypothetical pedigree structure (containing 1900 individuals drawn from three generations) using PEDANTICS (Morrissey et al., 2007). Horn length and body mass were simulated such that both traits are heritable, and positively genetically correlated. Individual phenotypes were then completed by adding additional effects to horn length such that males have (on average) longer horns than females, horn length increases with age, and horns grow faster in males. Using ASReml, phenotypic variance was decomposed into additive genetic (V_{A}), and residual (environmental) variance (V_{R}), using a series of animal models differing only in their fixed effects structure (Table 1). Heritability (h^{2}) was then estimated as the ratio of V_{A} to phenotypic variance (V_{P}) under each model. Note that only a single data set was analysed as the purpose is simply to illustrate the effect described above (i.e. not to perform a rigorous simulation exercise).
Table 1. Variance components and heritability for horn length in unicorns estimated under models of differing fixed effects. Model  Fixed effects  V_{A}  V_{R}  V_{P}  h^{2}  V_{P(obs)}  V_{A}/V_{P(obs)} 


1a  Mean  0.361 (0.115)  3.117 (0.138)  3.478 (0.115)  0.116 (0.040)  3.466  0.104 
1b  Mean + age + sex  0.362 (0.052)  0.653 (0.039)  1.014 (0.038)  0.554 (0.106)  3.466  0.104 
1c  Mean + age + sex + age:sex  0.351 (0.049)  0.599 (0.036)  0.950 (0.035)  0.585 (0.110)  3.466  0.101 
2a  Mean + weight  0.239 (0.097)  3.042 (0.129)  3.281 (0.1074)  0.078 (0.034)  3.466  0.069 
2b  Mean + age + sex + weight  0.234 (0.039)  0.601 (0.033)  0.836 (0.030)  0.389 (0.080)  3.466  0.068 
2c  Mean + age + sex + age:sex + weight  0.228 (0.036)  0.546 (0.029)  0.774 (0.028)  0.418 (0.083)  3.466  0.066 
Following the common practice of defining V_{P} as the sum of variance components (i.e. V_{A} + V_{R}), it is clear that inclusion of fixed effects causes the expected increase in heritability estimates by reducing the magnitude of V_{R} and hence V_{P} (Table 1). Thus, the heritability of horn length increases from 0.116 (model 1a) to 0.554 when age and sex are included (model 1b) and to 0.585 when the interaction of age and sex is fit (model 1c).
Inclusion of these fixed effects necessarily produces a better fitting model as the age and sex effects are truly present in the simulated data. Here, statistical testing would therefore confirm the significance of the fixed effects and by conventional model selection procedures we would choose model 1c as the ‘best’ model. However, it does not necessarily follow that the heritability estimate is ‘better’ under model 1c. More generally, a common objective in biological modelling is to try and maximize the explanatory power of a model. Thus, a model is usually preferred if it explains more of the variance in the response (e.g. higher R^{2} in a general linear model), and leaves less unexplained residual variance. With V_{P} calculated as the sum of variance components, a model selection strategy based on explaining as much variance as possible renders a naive interpretation of h^{2} (as the proportion of variance explained by additive effects) useless. The more the knowledge about environmental effects on phenotype, the higher the heritability becomes.
Comparing the variance component estimates under models 1a–c, it is clear that fixed effects reduce V_{R} and hence increase h^{2}. However, the inclusion of fixed effects should not, in general, cause systematic changes in the estimated additive variance, and this is reflected by similar estimates of V_{A} across models 1a–c. Consequently, if V_{A} is scaled by the observed phenotypic variance in the data (as opposed to the sum of the variance components), then heritability is effectively constant across models (Table 1). Nevertheless, it is important to note that fixed and random effects are jointly estimated in an animal model and changing the fixed effects can therefore influence V_{A} estimates. This will especially be the case if the incidence of fixed effects is nonrandom with respect to the pedigree structure. For example, if related individuals are more likely to be measured under similar environmental conditions, then failure to include environment effects could upwardly bias V_{A} (Kruuk & Hadfield, 2007).
Furthermore, a particular situation also exists where inclusion of a fixed effect can decrease the estimated genetic variance for a trait. In our example, this occurs when body weight is included as a fixed effect in the animal model of horn length (Table 1). Comparing models 2a–c with models 1a–c shows that including weight yields lower estimates of V_{A}, with concomitant declines in the heritability. This is because the structure of the G matrix used to simulate the phenotypes was such that a positive genetic correlation exists between weight and horn length. Thus, accounting for differences among individuals in the weight actually removes a portion of additive variance for the horn length. Thus, the estimates of V_{A} (and heritability) under models 2a–c are conditioned on the second trait of weight.
From a quantitative genetic perspective, a more sensible strategy here might be to model horn length and body weight as two traits in a bivariate animal model, thus estimating h^{2} for each trait and the genetic correlation between them. More generally, a good rule of thumb might be not to include variables as fixed effects if they would reasonably be expected to have an additive genetic component of variance (and potentially additive covariance with the focal trait) themselves. However, a blanket application of this rule may perhaps be problematic where researchers need to use biological measures as surrogates for environmental conditions. For example, models of reproductive traits may attempt to control for amongindividual differences in environments experienced by including weight or body condition as a proxy for food abundance. If carried out in a quantitative genetic model, it would at least seem prudent to estimate (co)variance components associated with such environmental proxies in order to correctly interpret results.
Recommendations for evolutionary ecologists
 Top of page
 Abstract
 Introduction
 A simple example: h^{2} of horn length in unicorns
 Implications of model dependence
 Recommendations for evolutionary ecologists
 Acknowledgments
 References
Above, I have highlighted the pitfalls relating to interpreting published estimates of trait heritability from studies of wild populations. The question is what could or should be done? Fixed effects are included in animal models for good and sensible reasons and I certainly do not suggest a wholesale change to this practice. However, in the light of these issues, it is hoped that researchers performing quantitative genetic analyses will give greater consideration to the possibility of their results being misinterpreted. In general, the method of determining the denominator (i.e. V_{P}) has been stated explicitly in the methods sections of published work. However, it is fair to say that the full implications of this could, and should, be made more explicit. Furthermore, I suggest that researchers always provide a simple estimate of the phenotypic variance observed in the raw data, thereby allowing a standardized version of h^{2} to be calculated by a reader if desired. More generally, it is vital that increased attention is given to the model dependency of heritability estimates. Thus, although heritability is certainly a population and traitspecific parameter, it should also be viewed as model specific. This does not negate a straightforward interpretation of h^{2}, but it is vital to understand that the phenotypic variance being partitioned is usually that which remains after conditioning on any fixed effects. Failure to realize this will lead, in some cases, to grotesquely biased expectations for evolutionary change.