The full text of this article hosted at iucr.org is unavailable due to technical difficulties.

Free Access

The genetical theory of kin selection

A. GARDNER

Department of Zoology, University of Oxford, Oxford, UK

Balliol College, University of Oxford, Oxford, UK

Search for more papers by this author
S. A. WEST

Department of Zoology, University of Oxford, Oxford, UK

Search for more papers by this author
G. WILD

Department of Applied Mathematics, University of Western Ontario, London, ON, Canada

Search for more papers by this author
First published: 04 March 2011
Cited by: 173
Andy Gardner, Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, UK.
Tel.: +44 (0) 1865 271 271; Fax: +44 (0) 1865 271 249; e‐mail: andy.gardner@zoo.ox.ac.uk

Abstract

Natural selection operates both directly, via the impact of a trait upon the individual’s own fitness, and indirectly, via the impact of the trait upon the fitness of the individual’s genetically related social partners. These effects are often framed in terms of Hamilton’s rule, rb − c > 0, which provides the central result of social‐evolution theory. However, a number of studies have questioned the generality of Hamilton’s rule, suggesting that it requires restrictive assumptions. Here, we use Fisher’s genetical paradigm to demonstrate the generality of Hamilton’s rule and to clarify links between different studies. We show that confusion has arisen owing to researchers misidentifying model parameters with the b and c terms in Hamilton’s rule, and misidentifying measures of genotypic similarity or genealogical relationship with the coefficient of genetic relatedness, r. More generally, we emphasize the need to distinguish between general kin‐selection theory that forms the foundations of social evolution, and streamlined kin‐selection methodology that is used to solve specific problems.

Introduction

Natural selection is the part of evolutionary change in heri characters that is driven by the differential reproductive success of individual organisms (Darwin, 1859; Fisher, 1930; Price, 1970). It is not the sole factor in evolution, nor is it necessarily the major driver of genetic change. However, it receives much attention because it is the part of evolutionary change that generates biological adaptation, and hence leads organisms to appear designed. Darwin (1859) argued that, as natural selection favours those individuals who achieve the greatest reproductive success, individuals will consequently appear designed to maximize their reproductive success (Darwinian fitness). This link between natural selection (dynamics) and phenotypes optimized for fitness maximization (design) has been formalized by Grafen (2002), for a large class of models that excludes social interaction between relatives.

The only fundamental change to Darwin’s theory of adaptation by natural selection has been to take account of social interactions between relatives. Darwin (1859, pp. 236–238) launched this avenue of inquiry by asking how the exquisite adaptations of sterile workers could have evolved in the social insects, given that these individuals enjoy no reproductive success and so cannot be directly acted upon by natural selection. His solution was to suppose that their fertile family members carry heritable tendencies for these traits, in latent form, and may pass these on to future generations. He suggested that natural selection acts upon worker traits indirectly, according to their fitness consequences for reproductive family members. The separation of direct fitness (impact upon personal reproductive success) and indirect fitness (impact upon the reproductive success of genetically similar individuals) was emphasized by Fisher (1930, p. 27), who understood that the latter could drive the evolution of altruistic characters such as aposematic coloration in gregarious insects (Fisher, 1930; pp. 158–159). However, Fisher otherwise neglected such effects in his formalization of Darwinian adaptation (Fisher, 1930, p. 27).

The theory of indirect effects was later fully developed by Hamilton (1963, 1964, 1970), whose analyses suggested that natural selection leads the individual to appear designed to maximize the sum of her direct and indirect fitness, or ‘inclusive fitness’. The formal link between natural selection and inclusive‐fitness maximization has been established by Grafen (2006a), in a large class of models that allows for social interactions between relatives. Hamilton also showed that the principle of inclusive fitness can be encapsulated in a pleasingly simple form, which Charnov (1977) termed ‘Hamilton’s rule’. This states that altruistic behaviour (Table 1) can be favoured by natural selection if rb − c > 0, where c is the fitness cost to the actor, b is the fitness benefit to the recipient, and r is the genetic relatedness of the actor and recipient. Maynard Smith (1964) coined the phrase ‘kin selection’ to describe the process of natural selection operating through indirect fitness effects.

Table 1. Classification of social behaviours. After Hamilton (1964, 1970) and West et al. (2007a,b).
Fitness effect for recipient
+
Fitness effect for actor + Mutual benefit Selfishness
Altruism Spite

The theory of kin selection describes how natural selection may be separated into direct and indirect components. Hamilton’s rule captures this basic separation, and hence appears to have the same generality and explanatory power as the theory of natural selection itself. Specifically, −c and rb represent, respectively, the direct and indirect fitness consequences of a trait, and apply to any character of interest, not just altruism (i.e. r, b & c could all be positive or negative or zero; Table 1). The explanatory power of this approach has been clearly demonstrated in numerous areas of behavioural and evolutionary research, where it allows clear predictions to be made about social adaptations, which can then be tested empirically (Krebs & Davies, 1993; Bourke & Franks, 1995; Mock & Parker, 1997; Frank, 1998; Haig, 2002; Burt & Trivers, 2006; West, 2009).

However, a number of theoretical studies have questioned the validity and generality of Hamilton’s rule. First, it has been suggested that Hamilton’s rule requires restrictive assumptions, such as genetic variation segregating at only a single locus, weak selection and absence of gene interactions within and between individuals (Cavalli‐Sforza & Feldman, 1978; Charlesworth, 1980; Uyenoyama & Feldman, 1980, 1981; Colwell, 1981; Uyenoyama et al., 1981; Michod, 1982; Queller, 1984, 1985; Avilés, 1993; Bulmer, 1994; Wolf et al., 1998, 1999; Gintis, 2000; Wolf, 2003; Wilson, 2005; Doebeli & Hauert, 2006; Fletcher & Doebeli, 2006, 2009; Fletcher et al., 2006; Fletcher & Zwick, 2006; Killingback et al., 2006; Nowak, 2006; Queller & Strassmann, 2006; Traulsen & Nowak, 2006; Wenseleers, 2006; Wilson & Wilson, 2007; Bijma & Wade, 2008; Goodnight et al., 2008; Traulsen et al., 2008; Wilson, 2008; Nowak et al., 2010a,b; Traulsen, 2010; Table 2). Second, where apparent contradictions of Hamilton’s rule have been resolved, practitioners of kin‐selection analysis have been accused of redefining the cost, benefit and relatedness terms as a post hoc fix (Doebeli & Hauert, 2006; Fletcher et al., 2006; Nowak et al., 2010a,b), and departing from original formulations of kin‐selection theory.

Table 2. Difficulties that have been suggested to render Hamilton’s rule invalid.
# Suggested difficulty References Resolution References
1 Genetic interactions (dominance, epistasis, synergy) Cavalli‐Sforza & Feldman, 1978; Uyenoyama & Feldman, 1980, 1981; Uyenoyama et al., 1981; Michod, 1982; Bulmer, 1994; Fletcher & Doebeli, 2006; Fletcher et al., 2006; Fletcher & Zwick, 2006; Queller & Strassmann, 2006; Wenseleers, 2006; van Veelen, 2009; Traulsen, 2010; Nowak et al., 2010b A Hamilton's rule regression analysis of additive effects does not require the absence of nonadditive effects Queller, 1992a; Frank, 1997a, 1998; Lehmann & Keller, 2006a,b; Gardner et al., 2007
2 Strong selection Charlesworth, 1980; Bulmer, 1994; Wenseleers, 2006; Traulsen et al., 2008; Traulsen, 2010; Nowak et al., 2010a,b Weak selection is often assumed for simplicity, but is not required in general Queller, 1992a; Lehmann & Keller, 2006a,b; Gardner et al., 2007
3 Frequency‐dependent selection Bulmer, 1994; Doebeli & Hauert, 2006; Nowak et al., 2010b; Traulsen, 2010 Costs and benefits may be frequency dependent Hamilton, 1967; Frank, 1986, 1998; Lehmann & Keller, 2006a,b; Gardner et al., 2007
4 Pure strategies Wenseleers, 2006 Pure strategies are allowed Gardner et al., 2007
5 Altruism among nonrelatives Killingback et al., 2006; Nowak, 2006; Fletcher & Doebeli, 2009; Nowak et al., 2010b Either the behaviour is not altruistic, or else genetic relatedness has been confused with genealogical relationship/DNA sequence similarity/etc. Grafen, 2007a,b; Taylor et al., 2007b; Lehmann et al., 2007a,b; this article
6 Conditional strategies Fletcher & Zwick, 2006; Fletcher et al., 2006 Hamilton's rule readily allows for conditional strategies Hamilton, 1967; Charnov, 1978; Craig, 1979; Maynard Smith, 1980; Parker, 1989; Crozier, 1992
7 Greenbeards Queller, 1985 Relatedness is defined with respect to role, not with respect to neighbours Gardner & West, 2010
8 Evolutionary bifurcations Fletcher & Doebeli, 2006; Doebeli & Hauert, 2006; Wenseleers, 2006 Hamilton's rule correctly describes the change in average genetic value even in the context of a bifurcation This article
9 Indirect genetic effects (IGEs) Wolf et al., 1998, 1999; Wolf, 2003; Bijma & Wade, 2008 IGEs are implicitly allowed for in Hamilton's rule This article
10 Multi‐generation effects Goodnight et al., 2008 Hamilton's rule captures reproductive value effects (long‐term genetic contributions to the future) Rousset & Ronce, 2004; Alizon & Taylor, 2008; Wild et al., 2009; Lehmann, 2010
11 Group selection Colwell, 1981; Avilés, 1993; Gintis, 2000; Wilson, 2005; Nowak, 2006; Traulsen & Nowak, 2006; Taylor et al., 2007a,b; Traulsen, 2010; Wilson & Wilson, 2007; Wilson, 2008 The multilevel selection approach is mathematically equivalent to the kin selection approach Hamilton, 1975; Grafen, 1984, 2006a; Wade, 1985; Frank, 1986, 1995; Queller, 1992b; Gardner et al., 2007; Lehmann et al., 2007a,b; Gardner & Grafen, 2009
12 More than two interacting individuals Nowak et al., 2010b Hamilton’s rule can be applied across any number of interacting individuals Hamilton, 1964; West & Gardner, 2010

Our aim in this article is to clarify the generality of Hamilton’s rule, as a statement of how natural selection drives changes in gene frequencies, and to make explicit the links between different theoretical approaches. In the first section, we employ a genetical approach to natural selection, developed by Fisher and used by Hamilton, to provide a derivation of Hamilton’s rule. This shows how selection can be divided into direct and indirect components, and makes explicit exactly what the different terms of Hamilton’s rule represent. In subsequent sections, we examine the validity and generality of Hamilton’s rule, considering the various complexities that have been suggested to make the rule fail to correctly predict the action of natural selection (summarized in Table 2). For illustrative purposes, we work through two issues in detail: nonadditive fitness effects and altruism among nonkin. Finally, in the Discussion, we show that confusion has arisen owing to a failure to distinguish between the general theory of kin selection that forms the foundations of social evolution vs. the streamlined kin‐selection methodologies that are used to solve specific problems.

The genetical theory of kin selection

Natural selection is the part of evolutionary change that results from the differential reproductive success of individual organisms (Darwin, 1859; Fisher, 1930; Price, 1970; Box 1). In particular, it concerns change in the heritable component of the phenotype (g; the individual’s ‘genetic value’ for a trait of interest; Price, 1970; Falconer, 1981; Grafen, 1985a; Box 2), and how this is mediated by differences in individual fitness (v; more generally, ‘reproductive value’; Fisher, 1930; Price & Smith, 1972; Charlesworth, 1994; Grafen, 2006b; Box 3). A formal statement of the action of natural selection is provided by Price’s (1970) equation. This is a change in the arithmetic average of genetic value between successive generations, and is given by:

image(1)

where v is an individual’s relative fitness, g is her genetic value for the character of interest, and cov(v,g) is the covariance of these two quantities taken over all individuals in the population (derivation provided in Box 1; Price, 1970; see also Robertson, 1966, 1968). This covariance form has been termed the ‘secondary theorem of natural selection’ (Robertson, 1968). For ease of exposition, we will not consider class structure in this article; the action of natural selection in class‐structured models is given a very general treatment by Grafen (2006b; see also Fisher, 1930; Price & Smith, 1972; Taylor, 1990, 1996; Charlesworth, 1994; Taylor & Frank, 1996; Frank, 1997a, 1998; and Taylor et al., 2007a).

Upon the assumption that heritable variation in the character of interest exists (var(g) > 0), the RHS of equation (1) may be expressed as the product of the additive genetic variance and a multiplying coefficient:

image(2)

The coefficient βv,g≡ cov(v,g)/var(g) has a useful interpretation: if we were to write down the genetic value (g) and relative fitness (v) of every individual in the population, then the coefficient βv,g is equal to the slope of the straight line fitted to these (g,v) data by the method of least squares, i.e.:

image(3)

(see Box 4 for details). The linearity of this least‐squares regression is sometimes confused with the assumption of linearity in an underlying evolutionary model (see Frank, 1998; Rice, 2004 for more discussion). However, this multiplicative partition of covariance is fully general, and holds irrespective of the degree of nonadditivity of gene‐fitness relations. Any impact of dominance, epistasis, etc. upon the immediate action of natural selection is fully accounted for in the least‐squares linear regression. Put another way, the goodness of fit of the linear regression model to the population data is irrelevant, because natural selection only ‘sees’ the additive (i.e. average) effects.

Equation (2) emphasizes that natural selection will occur wherever there is heritable variation (var(g) > 0) in a character that is correlated with fitness (βv,g ≠ 0; Darwin, 1859; Price, 1970). A special case is recovered by taking relative fitness as the character of interest (Fisher, 1941). This gives βv,g = 1 and hence ΔSE(g) = var(g) (Frank, 1997a). This is the ‘fundamental theorem of natural selection’, which states that the change in the average genetic value for relative fitness that can be ascribed to the action of natural selection is equal to the additive genetic variance in relative fitness. As variances are nonnegative, this result captures the directionality of the selective process (Fisher, 1930, 1941; Price, 1972a).

Fisher (1930), following Darwin’s (1859) comments on social insects, distinguished two routes for a correlation between an individual’s genetic value and her reproductive value. First, an individual’s genes (g) can have a direct impact on her fitness, owing to their influence on the phenotype. Second, an individual’s genes can have an indirect impact on her fitness, owing to the individual interacting with genetically similar social partners, whose genes (g′; and hence phenotypes) also mediate the focal individual’s fitness. We may separate the direct and indirect components of the action of natural selection, using the language of least‐squares linear regression (Queller, 1992a; Frank, 1997a, 1998), by imagining that we are fitting the equation:

image(4)

to population data (see Box 4 for details). Substituting this multi‐regression model into equation (1) obtains:

image(5)

where r ≡ cov(g′,g)/cov(g,g) is the genetic relatedness between social partners (Hamilton, 1972; Orlove, 1975; Orlove & Wood, 1978; Michod & Hamilton, 1980; Queller, 1992a; Frank, 1997a, 1998; Box 5). To be clear, the coefficient of genetic relatedness measures the genetic similarity of two individuals, irrespective of why they are similar (e.g. owing to coancestry). For simplicity, we have considered social partners interacting in only one ‘role’; a general treatment of potentially multiple roles is provided by Grafen (2006a).

Equation (5) captures the personal (or neighbour‐modulated) fitness view of kin selection (Hamilton, 1964; Orlove, 1975; Maynard Smith, 1983; Taylor & Frank, 1996; Frank, 1998). Here, we are concerned with the two routes (βv,g|g vs. βv,g′|gr) by which an individual’s genes may impact upon her own personal fitness: through her own expression of these genes, and through her neighbours’ expression of copies of the same genes.

Alternatively, we may reassign the indirect fitness effects to the actors responsible for them, giving:

image(6)

Equation (6) captures the inclusive‐fitness view, showing how an individual’s genes impact upon her inclusive fitness either by impacting upon her personal fitness (βv,g|g; direct fitness effect), or by impacting upon the fitness of individuals to which she is genetically related (βv′,g|g′r, where v′ is the relative fitness of her social partner; indirect fitness effect) (Hamilton, 1964; Orlove, 1975; Frank, 1998; Rousset, 2004).

The personal‐fitness and inclusive‐fitness approaches to kin selection are dynamically equivalent: they assign the indirect components of fitness to individuals in different ways, but they predict the same overall response to natural selection when fitness effects are aggregated over the whole population (Hamilton, 1964; Orlove, 1975; Maynard Smith, 1983; Taylor & Frank, 1996; Frank, 1997b, 1998; Rousset, 2004; Grafen, 2006a; Gardner et al., 2007; Taylor et al., 2007a). Hence, irrespective of the approach taken, we may denote the direct effect of an individual’s genes on her fitness by −c = βv,g|g and the effect of an individual’s genes on her social partner’s fitness by b = βv,g′|g = βv′,g|g. Consequently, the condition for natural selection to favour an increase in any character of interest (ΔSE(g) > 0) is given by Hamilton’s rule: rb − c > 0.

Box 1 – Selection and transmission

Natural Selection is not EvolutionFisher (1930, p. vii)

In very general terms, evolutionary change over any specified period of time can be expressed as a sum of selection and transmission components. This is captured by Price’s equation, based upon a general mapping between two populations of entities (Price, 1972b, 1995; Frank, 1995). Typically, we suppose that one population is descended from the other, and we denote these ‘parent’ and ‘offspring’ populations accordingly. The terms ‘parent’ and ‘offspring’ suggest that we are considering evolutionary change over a single generation; however, we are free to consider any two points in time.inline image

To derive Price’s equation, we assign every individual in the parent population a unique index iinline imageI, and we assign indices to every individual in the offspring population according to which parent individual they are descended from. When a given individual in the offspring population has more than one ancestor in the parent population (e.g. as in a nonselfing sexual population), each ancestor is awarded their genetic share of the offspring (e.g. mother and father would each be awarded a half share of the offspring, in a diploid population). We denote the ‘relative abundance’ of the ith parent as qi, where ∑Iqi = 1. Typically, qi = 1/N, where N is the number of individuals in the parent population, but we are free to assign different individuals different relative abundances, for example according to their relative biomass. Similarly, we denote the relative abundance of the ith parent’s descendants in the offspring population as qi′, and this allows us to define the relative fitness of any individual in the parent population as vi = qi′/qi (i.e. the relative growth factor of her immediate lineage). Finally, we can assign each individual in the parent population a value zi for any character of interest, and we denote the average character value of ith parent’s offspring as zi′= zi + Δzi. The average character value over the parent and offspring populations is E(z) = ∑Iqizi and E(z′) = ∑Iqizi′, respectively. (Strictly, this is the expectation of the random variable defined as the phenotypic value of an individual drawn at random from the population, with the probability of drawing a particular individual i being equal to her weighting qi.) Having described the parent and offspring populations, and the mapping between these two, we may write the change in the population average value of the character of interest (ΔE(z) = E(z′) − E(z)) as:

image((B1.1))

where: cov is the covariance and E is the expectation (arithmetic average), each taken over the set of all individuals in the population. The covariance term describes the change that can be ascribed to the statistical association between an individual’s character and her relative fitness, and defines selection. (Strictly, this is the covariance of the random variables defined as the phenotypic value and relative fitness of an individual drawn at random from the population, with the probability of drawing a particular individual i being equal to her weighting qi.) The expectation term describes the change that can be ascribed to character differences between a parent and her offspring, and defines changes associated with transmission.

Natural selection is a particular type of selection that involves genes, the fundamental units of heredity. In Fisher’s (1930) scheme, the character of interest is not an individual’s phenotype per se, but rather her genetic value for any phenotypic character of interest i.e. the heritable portion of the phenotype (Box 2). Moreover, we define change across a single generation. Denoting the genetic value by gi, then from equation (B1.1) the action of natural selection is given by:

image((B1.2))

For ease of exposition, we neglect some complicating factors. First, chance differences in individual reproductive success can drive genetic change (random drift). To remove this non‐Darwinian evolutionary factor, natural selection is properly defined as an average over future uncertainty (Grafen, 2000). Second, in class‐structured populations, one must keep track of differences in offspring class (e.g. male vs. female) as well as offspring number (reviewed by Grafen, 2006b). In such contexts, fitness is properly defined in terms of asymptotic genetic contribution to future generations, i.e. reproductive value (Box 3). In this article, we assume no class structure, for ease of presentation.

Box 2 – Genetic value

The contribution of imperfectly additive genetic factors divide themselves for statistical purposes into two parts: an additive part which reflects the genetic nature without distortion, and gives rise to the correlations which one obtains; and a residue which acts in much the same way as an arbitrary error introduced into the measurementsFisher (1918, p. 139)

Natural selection can drive phenotypic change only insofar as organismal characters are heritable. Thus, a proper statement of natural selection concerns changes in the heritable portion of the phenotype. This corresponds to the individual’s genetic value, which is the value we predict for her phenotype based upon a least‐squares linear regression model that takes as its predictor variables all the genes underlying the character of interest. Formally, the model can be written as:

image((B2.1))

where: Γ is the set of genic predictors (i.e. gene positions within an individual’s genome) impacting upon her phenotype; xγ is an individual’s genic value for the predictor γ inline image Γ; E(xγ) is the population‐average genic value for this predictor; z is the individual’s phenotypic value; and E(z) is the population‐average phenotypic value. Here, the regression terms β map the set of genic values to phenotypic value, and these can be determined by fitting the model (B2.1) to the set of genic and phenotypic data presented by any population, according to the usual method of least‐squares (Box 4).

Having fitted the model, an individual’s genetic value is simply the phenotypic value that the model predicts for her, based upon her genic data:

image((B2.2))

(Price, 1970; Falconer, 1981; Grafen, 1985a).

The assumptions and generality of Hamilton’s rule

The previous section has shown that Hamilton’s rule of kin selection emerges directly from a general expression of natural selection, simply by partitioning the latter into its direct and indirect components. It would appear to be a very general result. However, within the literature that has grown up around Hamilton’s rule, a number of complications have been suggested to limit its generality (summarized in Table 2). We now address each of these in turn.

First, it is often claimed that Hamilton’s rule assumes the absence of gene interactions, both within and between individuals (Cavalli‐Sforza & Feldman, 1978; Uyenoyama & Feldman, 1980, 1981; Uyenoyama et al., 1981; Michod, 1982; Bulmer, 1994; Fletcher & Doebeli, 2006; Fletcher & Zwick, 2006; Fletcher et al., 2006; Queller & Strassmann, 2006; Wenseleers, 2006; Nowak et al., 2010b; Traulsen, 2010). However, the analysis of the previous section did not require any assumptions to that effect, so this claim is incorrect (see also Frank, 1997a, 1998; Lehmann & Keller, 2006a,b; Gardner et al., 2007). This misunderstanding owes mainly to a confusion of the notion of the additive (average) effect of a gene, on the one hand, with the assumption of no gene interactions, on the other hand (Frank, 1998; pp. 18–19). The additive fitness effect of a gene is an average taken over genotypes, which allows for its dosage in each genotype, using the method of least‐squares linear regression (Fisher, 1918). Gene interactions – such as dominance, epistasis and synergy – may mediate the overall additive effect of the gene, and hence impact upon the action of natural selection, but these effects are already taken into account within the regression analysis (they need not be assumed absent). In the context of the linear regression analysis, a nonadditive effect is the discrepancy between the actual and predicted fitness of a genotype and, in any generation, such effects simply do not impact upon the action of natural selection.

Second, it is often claimed that Hamilton’s rule breaks down if selection is not vanishingly weak (Charlesworth, 1980; Bulmer, 1994; Wenseleers, 2006; Traulsen et al., 2008; Nowak et al., 2010a,b; Traulsen, 2010). However, the analysis of the previous section allows for selection of arbitrary strength, so again this claim is incorrect (see also Lehmann & Keller, 2006a,b; Gardner et al., 2007). This misconception seems to have arisen for two reasons. In some cases, weak selection can render nonadditive fitness effects negligible, and so (unnecessary) concerns regarding nonadditive effects have sometimes been alleviated by additional assumptions of weak selection. The other reason is that, although relatedness coefficients can be difficult to compute in the context of selection (Bulmer, 1994; but see Ajar, 2003; Gardner et al., 2007; Rousset & Roze, 2007), they are often much easier to compute in the context of neutral populations not undergoing selection. Hence, kin‐selection analyses often approximate relatedness coefficients by their neutral counterparts, with the proviso that the approximation is valid only when selection is weak (Taylor, 1996; Taylor & Frank, 1996; Frank, 1998). This simplification is made, just as any other simplifying model assumption, on the basis that it leads to an easier analysis. Furthermore, the assumption of weak selection is often appropriate from an empirical perspective, because many traits are influenced by multiple genes each having only a small effect (Lynch & Walsh, 1998). However, it is incorrect to interpret this simplifying assumption in a specific model as a general failure of Hamilton’s rule in the face of strong selection.

Third, it is often claimed that Hamilton’s rule breaks down in the context of frequency‐dependent selection (Bulmer, 1994; Doebeli & Hauert, 2006; Nowak et al., 2010b; Traulsen, 2010). Moreover, no assumption of frequency independence is built into the analysis of the previous section, so this claim is also incorrect (see also Lehmann & Keller, 2006a,b; Gardner et al., 2007; Ross‐Gillespie et al., 2007). A possible reason for this misconception is that Hamilton (1964) framed his principle of inclusive fitness as an extension of Fisher’s fundamental theorem of natural selection, which was commonly (though erroneously) assumed to fail in the context of frequency‐dependent selection (reviewed by Edwards, 1994). The analysis of the previous section makes clear the generality of the fundamental and secondary theorems of natural selection. Another possible reason for this misconception is the emphasis given by Hamilton (1988) to the frequency independence of kin selection under the assumption of weak selection, which he described as a ‘gift from God’. Although frequency independence does allow for an easier kin‐selection analysis, frequency‐dependent selection does not invalidate Hamilton’s rule.

Fourth, Wenseleers (2006) has claimed that Hamilton’s rule can only be applied to the evolution of ‘mixed’ (i.e. probabilistic) strategies, and cannot be applied to the evolution of ‘pure’ (i.e. hard‐wired discrete) strategies. This view appears to arise out of concerns relating to strong selection. High phenotypic variance can potentially translate into high fitness variance between individuals, and this can translate into high fitness variance between alleles (strong selection) if genes are consistently associated with particular phenotypes (pure strategies). By contrast, high fitness variance between individuals may translate into low fitness variance between alleles (weak selection) if genes are not consistently associated with particular phenotypes (mixed strategies). Hence, mixed strategies are a way of imposing weak selection at the gene level even in the context of large fitness differences at the phenotypic level. However, as outlined above, weak selection is not a general requirement for Hamilton’s rule to correctly predict the action of natural selection, so this point is moot. Gardner et al., 2007); and this article) provide illustrations of how Hamilton’s rule may be applied to models incorporating pure strategies.

Fifth, it is sometimes claimed that altruism can be favoured in the absence of relatedness, implying that Hamilton’s rule may fail to correctly predict the action of natural selection (Fletcher & Zwick, 2006; Killingback et al., 2006; Nowak, 2006; Fletcher & Doebeli, 2009). However, in each of the corresponding models, it has subsequently been shown that actors either derive a direct benefit from their behaviour (i.e. the behaviour is not really altruistic), or that they are in fact related to the recipients of their behaviour (relatedness is not really absent) (Lehmann & Keller, 2006a,b; Grafen, 2007a,b; Taylor et al., 2007b; West et al., 2007a,b, 2011; this article). Considering the latter case, relatedness is missed for two broad reasons. One reason is that authors sometimes assume that the lack of an explicit relatedness parameter, or lack of kin discrimination, means that relatedness is not important (e.g. Killingback et al., 2006; Nowak, 2006; Nowak et al., 2010b). In such cases, more formal analyses have demonstrated that altruism is driven by positive relatedness, often because limited dispersal leads to individuals interacting with their close relatives (Grafen, 1984, 2007a,b; Lehmann & Keller, 2006a,b; Lehmann et al., 2007a,b; Taylor et al., 2007b). Another reason that genetic relatedness can be missed is when it is assumed that it requires DNA sequence similarity (e.g. Fletcher & Doebeli, 2009). However, as discussed above, genetic relatedness measures the statistical association between social partners’ heritable tendencies for a trait of interest, which does not require that they share the same DNA sequence. The related point that genetical relatedness is distinct from genealogical relationship was first made by Hamilton (1964, 1970, 1975), and has since been analysed in a number of ‘greenbeard’ models (Dawkins, 1976, 1982; Ridley & Grafen, 1981; Lehmann & Keller, 2006a,b; Gardner & West, 2010; West & Gardner, 2010).

Sixth, Fletcher and colleagues (Fletcher & Zwick, 2006; Fletcher et al., 2006) have suggested that Hamilton’s rule cannot capture conditional strategies. However, there is a long history of Hamilton’s rule being successfully applied to facultative behaviours (Hamilton, 1967; Charnov, 1978; Craig, 1979; Maynard Smith, 1980; Parker, 1989; Crozier, 1992), so it is difficult to see how this erroneous view has arisen. Moreover, this error is readily demonstrated by the field of sex allocation, which provides a huge theoretical and empirical literature illustrating how kin‐selection theory can be used to explain and predict conditional behaviour (Charnov, 1982; Hardy, 2002; West, 2009).

Seventh, Queller (1984) suggested that Hamilton’s rule fails to correctly predict the action of natural selection when greenbeard genes underlie social behaviour. These are genes that identify copies of themselves carried in other individuals, and cause their bearer to behave in ways that preferentially benefit carriers of the gene (Hamilton, 1964, 1971, 1975; Dawkins, 1976; Gardner & West, 2010; McGlothlin et al., 2010). Greenbeards can be favoured even in fully mixed infinite populations, whereby neighbouring individuals are not related (r = 0). Hence, Queller (1984) suggested that there are no regular indirect effects (rb = 0), and added ‘correction’ terms into Hamilton’s rule to capture the greenbeard effect. However, relatedness to neighbours is not the appropriate coefficient of relatedness to use in this context. Because greenbeard individuals can identify which of their neighbours carry the gene, and which do not, then formally we should consider that there are two types of neighbour – carriers and noncarriers – and we should calculate relatedness coefficients appropriate for individuals in each role, separately. As there is a positive relatedness between the greenbeard actor and his bearded neighbours (r > 0), and as the actor provides a fitness benefit for these bearded neighbours (b > 0), there is an indirect fitness benefit (rb > 0), and hence no correction term is needed in order for Hamilton’s rule to capture the greenbeard effect (Gardner & West, 2010).

Eighth, there is a concern that Hamilton’s rule fails to predict evolutionary dynamics in the context of evolutionary bifurcations, where disruptive selection drives a bimodal distribution of phenotypes (Doebeli & Hauert, 2006; Fletcher & Doebeli, 2006; Wenseleers, 2006). For example, Doebeli & Hauert (2006) have considered that, at a bifurcation point with respect to level of cooperation, there may be no directional selection and mutants exhibiting either a higher or lower level of cooperation may simultaneously invade from rarity. They point out that, in such circumstances, Hamilton’s rule is either satisfied (cooperation favoured) or not satisfied (cooperation not favoured), and they suggest that this is inconsistent with both more‐cooperative and less‐cooperative mutants being able to invade. However, Hamilton’s rule, as a statement of the action of natural selection, concerns the average genetic change of a population, not the invasion success of rare mutants per se, so there is no contradiction here.

Ninth, it has been suggested that Hamilton’s rule neglects indirect genetic effects (IGEs), which occur when genes carried by one individual have phenotypic consequences for other individuals (Wolf et al., 1998; Wolf, 2003). Thus, analysis of IGEs has been suggested to generalize beyond a Hamilton’s‐rule analysis (Wolf et al., 1998; Bijma & Wade, 2008). However, the analysis of the previous section provides an exact and general derivation of Hamilton’s rule, based upon the relationship between an individual’s genes and her relative fitness, with all phenotypic effects remaining implicit. Hence, the terms of Hamilton’s rule already include the impact of IGEs, and no additional terms are needed to account for these. Note that we are not claiming that IGEs are unimportant, just that their existence does not render Hamilton’s rule invalid. IGEs can be interesting in their own right, and it will often be helpful to make them explicit within theoretical models, and use these as a means to make broader connections between the theories of kin selection and quantitative genetics, or to make specific predictions for certain situations (Cheverud, 1984; Wolf et al., 1998, 1999; Wolf, 2003; Bijma & Wade, 2008; McGlothlin et al., 2010).

Tenth, it has been suggested that social effects that accrue over multiple generations after the original behaviour was enacted are not capable of being captured by Hamilton’s rule (Goodnight et al., 2008). For the sake of clarity, the analysis of the previous section focuses on a single generation of genetic change and no class structure (Box 3). However, kin‐selection analysis readily applies to such problems. One can either consider the impact upon reproductive success of recipients in future generations (Lehmann, 2010) or, equivalently, the changing reproductive values (long‐term genetic legacy) of their ancestors in the present generation (Rousset & Ronce, 2004; Alizon & Taylor, 2008; Wild et al., 2009).

Eleventh, a recurring claim is that Hamilton’s rule fails to capture the effects of group selection. Such claims usually manifest as suggestions that kin selection cannot account for various phenomena arising in the context of multilevel selection (Colwell, 1981; Avilés, 1993; Gintis, 2000; Wilson, 2005; Nowak, 2006; Traulsen & Nowak, 2006; Taylor et al., 2007a,b; Traulsen, 2010; Wilson & Wilson, 2007; Wilson, 2008; reviewed by West et al., 2008, 2011). Some authors have also attempted to correct Hamilton’s rule by adding an additional ‘group selection’ term (Wilson, 2005; Bijma & Wade, 2008). However, it has long been understood that the kin selection and multilevel (group) selection approaches to social evolution are mathematically equivalent, and merely represent different partitions of the same evolutionary process (i.e. natural selection; Hamilton, 1975; Grafen, 1984, 2006a; Wade, 1985; Frank, 1986, 1995; Queller, 1992b; Rousset, 2004; Gardner et al., 2007; Lehmann et al., 2007b; Gardner & Grafen, 2009). No model of multilevel selection has ever delivered a (correct) prediction that could not be reformulated in terms of kin selection – despite repeated claims to the contrary.

Twelfth, it has been claimed that Hamilton’s rule requires pairwise interactions (Nowak et al., 2010b). However, pairwise interactions have not been assumed since the earliest developments of kin‐selection theory: Hamilton (1964) made clear that inclusive fitness is defined by summing effects over potentially multiple recipients, and the rule is that natural selection will favour a trait when the total inclusive‐fitness effect is positive. For example, if an altruistic act incurs a fitness cost c for the actor and provides a benefit b1 to one relative and a benefit b2 to another relative, then the condition for natural selection to favour an increase in altruism is r1b1 + r2b2 − c > 0, where r1 and r2 are the relatedness of the actor to the former and latter relatives, respectively. For this reason, the condition for altruism to be favoured need not be reducible to the form b/c > 1/r, as there may be multiple benefit terms and multiple relatedness terms involved (e.g. see the supplementary online material of West & Gardner, 2010). More generally, kin‐selection models of sex allocation and dispersal evolution have long considered fitness consequences for potentially large numbers of recipients (Frank, 1998).

In the next two sections of this article, we consider several of these issues in more detail, by focusing upon two recently proposed models of social evolution in which Hamilton’s rule has been argued to fail to correctly predict the action of natural selection. First, we consider games of synergistic cooperation discussed by van Veelen (2009). Our genetical analysis of these models demonstrates that Hamilton’s rule correctly predicts the action of natural selection, despite perceived complications of nonadditive fitness effects, strong selection, frequency dependence and pure strategies. We reveal that suggestions to the contrary owe to incorrect computation of cost and benefit terms in Hamilton’s rule. Second, we consider Fletcher & Doebeli’s (2009) model of altruism among nonkin. Our genetical analysis of this model demonstrates that Hamilton’s rule again correctly predicts the action of natural selection, and emphasizes that there may be genetic relatedness even in the absence of genealogical relationship or DNA‐sequence similarity. In particular, we reveal that Fletcher & Doebeli’s (2009) suggestion to the contrary owes to their incorrect computation of the relatedness term of Hamilton’s rule.

Box 3 – Reproductive value

To what extent will persons of this age, on average, contribute to the ancestry of future generations? The question is of some interest, as the direct action of Natural Selection must be proportional to this contributionFisher (1930, p. 27)

The reproductive value of an individual expresses, in relative terms, the number of copies of that individual’s genes expected to be found in the population in the very distant future (Taylor, 1996). Hence, it corresponds to the individual’s expected asymptotic share of the future gene pool (Fisher, 1930; Price & Smith, 1972; Grafen, 2006b). Genes carried by individuals with higher reproductive value are expected to be more prevalent in the future population than those carried by individuals with lower reproductive value. When all individuals are equivalent (no class structure), then an individual’s (expected) reproductive value is equivalent to her relative fitness, i.e. number of offspring expressed relative to the average. When individuals are not equivalent (class structure) then it is not possible to simply count offspring, as offspring may vary in their quality (i.e. class). Here, reproductive value provides an exchange rate that translates all offspring into a universal currency.

As an example of no class structure, consider a population made up of N asexual individuals. If this population is closed to migration, then the ancestors of its future inhabitants must be among the group of N individuals currently found there. In other words, 100% of the genetic material comprising the future gene pool (i.e. 100% of the ‘future success’) must have originated from a gene carried by at least one of the population’s N present‐day residents. Determining the reproductive value of the N present‐day residents, then, amounts to determining how to distribute the ‘future success’ among them. In the absence of selection, the reproductive value of each individual is 1: every individual has average fitness. Alternatively, if one individual monopolizes the reproduction, then its reproductive value is N and the reproductive value of all other individuals is zero. In general, the long‐term dynamics of a population determine the reproductive value of each individual.inline image

In a class‐structured population, individuals are grouped according to particular qualities they share. For example, a class‐structured population could consist of a class that contains only males and another class that contains only females. A class‐structured population might alternatively consist of a continuum of classes that corresponds to a continuous range of ages. Because of the intrinsic differences among classes, genetic change in one class may not have the same evolutionary consequences as the same amount of genetic change in another class. To understand how class‐specific genetic change reflects evolutionary success in the long term, these class‐specific changes must be considered alongside the total reproductive value of the individuals in that class (i.e. the total share of the future gene pool given to that class). In particular, multiplying the total reproductive value of a class by a class‐specific gene frequency change, converts the latter into a change in the total share of the future gene pool awarded to the gene in question (Taylor, 1989; Rousset, 2004). Summing all such changes in future shares yields an overall measure of the change in the long‐term, evolutionary success of the gene.

Box 4 – Least‐squares regression

Nature discovered least squares long before Gauss – Crow (2008)

Suppose we have a set of empirical observations relating a predictor variable x to a target variable y, summarized in a data set (xi,yi). We may assume a functional correspondence between x and y, denoted by y = f(x, (αj)), where (αj) denotes a set of parameters. How are we to choose parameter values for the functional form, in a way that is informed by our empirical observations? In other words, how should we fit the functional relation to our data?

If there is only a single data point (x1,y1), then a natural approach is to determine the set of parameter values that minimize the quantity ε1 = y1 − f(x1, (αj)), i.e. the discrepancy (distance) between the predicted and observed values of y. Assuming differentiability, this set (αj)* of parameter values must satisfy:

image((B4.1))

and:

image((B4.2))

for all k.

If there are several data points, then the parameterization that minimizes the discrepancy for the first data point may not coincide with the parameterizations that minimize the discrepancies for the other data points. What aggregate of the discrepancies for the whole set of n data should we choose as our minimand?

Plotting the discrepancy relating to each data point as a separate, orthogonal axis in n‐dimensional space, we can express the overall discrepancy for a given parameterization (αj) as a single point in that space. A natural metric for the aggregate discrepancy is the Euclidean distance of this point from the origin (a distance of zero representing a perfect fit). From Pythagoras’ theorem, this Euclidean distance is given by:

image((B4.3))

Minimizing the aggregate discrepancy inline image is equivalent to minimizing the sum of the squared discrepancies in each dimension S = ∑iεi2. This leads to the following conditions for the best (least‐squares) fit parameterization (αj)*:

image((B4.4))

and:

image((B4.5))

The invention of this least‐squares method of fitting is credited to Carl Friedrich Gauss. Sheynin (1993) provides an overview of the historical development of this method.

The simplest functional form of any explanatory value is the equation of the straight line: E(y|x) = m x + c. This has two parameters: an intercept c and a slope m. For every data point, the discrepancy is of the form εi = yi − E(yi|xi) = yi − m xi − c, and hence the sum of the squared discrepancies is:

image((B4.6))

The intercept parameter that minimizes the sum of the squares is found by differentiating the RHS of (B4.6) by c, setting the result equal to zero, and solving for c. This obtains:

image((B4.7))

Hence, the least‐squares linear regression satisfies E(y|x) = mx − (E(y) − mE(x)) or, equivalently, E(y|x) − E(y) = m(x−E(x)). Note that the best‐fit equation passes through the centre of the data: E(y|x) = E(y) when x = E(x).

With this in mind, the sum of squares is given by:

image((B4.8))

The slope parameter that minimizes the sum of squares is found by differentiating the RHS of (B4.8) by m, setting the result equal to zero, and solving for m. This obtains:

image((B4.9))

Hence, the ratio of the covariance of two random variables x and y and the variance in x is equal to the slope of the least‐squares linear regression line fitted through these (x,y) data. In the main text, we have denoted this ratio βy,x, and here we have shown the relation to least‐squares linear regression.

The least‐squares linear regression approach is readily extended to multiple predictors (xj). Here, the simplest functional form is E(y|(xj)) = ∑jmijxij + c, and the sum of squares is given by:

image((B4.10))

Following the same procedure as above, we can differentiate the RHS of (B4.10) by c, set the result equal to zero and solve for c to obtain the best‐fit intercept parameter:

image((B4.11))

Hence, the least‐squares regression line satisfies E(y|{xj}) = E(y) + ∑jmj(xj − E(xj)). Note that, as above, this line passes through the centre of the data: E(y|{xj}) = E(y) when xj = E(xj) for every j.

With this in mind, the sum of squares is given by:

image((B4.12))

The best‐fit value for each slope parameter can be obtained in the usual way. Differentiating the RHS of (B4.12) by mk, setting the result equal to zero, and solving for mk, obtains:

image((B4.13))

This expression defines as many equations as there are slope parameters to be fitted, and these can be simultaneously solved to obtain values for every j. For example, if there are two predictors of y (say, x1 and x2) then the equations for the two slope parameters are:

image((B4.14))
image((B4.15))

Simultaneously solving (B4.14) and (B4.15) for m1 and m2 yields:

image((B4.16))
image((B4.17))

where ρ12 = cov(x1,x2)/√(var(x1)var(x2)) is the correlation coefficient. In the main text, we use partial regression notation such that inline image and inline image. Equations (B4.16) and (B4.17) show how these may be computed for any model directly from covariances, variances and correlation coefficients, i.e. without having to use least‐squares minimization methods.

Hamilton’s rule with synergy

In the previous sections, we provided a derivation of Hamilton’s rule, and discussed the extent to which this relies upon restrictive assumptions. In this section, we give a detailed analysis of the issue of nonadditive gene interaction, as the idea that this invalidates Hamilton’s rule is an entrenched misconception that continues to manifest in the literature. As a specific example, we consider van Veelen’s (2009) claim that Hamilton’s rule fails to correctly predict the action of natural selection in the context of three‐player evolutionary games with synergistic interaction between the genes of social partners. Here we reanalyse van Veelen’s (2009) model, recovering his results, and we show that these are not really in conflict with Hamilton’s rule. For completeness, we first analyse the case of two‐player games, considering a model developed by Queller (1985).

Two‐player games

Queller (1985) suggested that synergistic interaction between social partners could not be captured within the canonical Hamilton’s rule, and suggested that an additional correction factor is needed to correctly predict genetic change. The following model is based upon his analysis of a two‐player game with synergy. On a historical note, although Queller (1992a) later emphasized the generality of a genetical Hamilton’s rule – i.e. our equations (5) and (6)– others have continued to argue that Queller’s (1985) result is more general than Hamilton’s rule and/or that Hamilton’s rule breaks down with synergistic interactions (e.g. Fletcher & Zwick, 2006; van Veelen, 2009).

We assume an infinite population of haploid individuals engaged in two‐player games. A single locus controls the cooperation phenotype, with a proportion p of individuals carrying an allele A which encodes the cooperator strategy, and the remaining 1 − p carrying an allele a which encodes the noncooperator strategy. Individuals are paired with a genetically identical social partner with probability α, and are paired with a random social partner with probability 1 − α. Social interaction mediates fecundity F, which is given by a unit baseline plus the individual’s payoff from the game – payoffs are given in Table 3. Cooperators incur a fecundity cost C and provide their partners with a fecundity benefit B. Additionally, when two cooperators are paired, they both receive a synergistic benefit of D. We assume that the individual’s relative fitness is equal to their relative fecundity: v = F/E(F).

Table 3. Fecundity payoffs in a two‐player game of synergistic cooperation. Based upon the model of Queller (1985).
Fecundity payoff, P
Cooperator Noncooperator
Cooperator B − C + D C
Noncooperator B 0

Natural selection is formally defined according to change in genetic values (Box 1), and so we begin our analysis of this model by defining the genetic values of each type of individual (Box 2). We denote the phenotypic values of cooperators and noncooperators by z = 1 and z = 0 respectively (and hence E(z) = p). The genic values can be chosen arbitrarily, so long as they are different. We choose x = 1 for the A allele and x = 0 for the a allele (and hence E(x) = p). From equation (B2.1), we fit the linear model E(z|x) = E(z) + βz,x (x − E(x)) to the population data (summarized in Table 4) by the method of least squares, to obtain βz,x = 1. From equation (B2.2), an individual’s genetic value is given by g = E(z) + βz,x (x – E(x)), and this is g = 1 for cooperators and g = 0 for noncooperators.

Table 4. Population data for genetic value calculations in Queller’s (1985) and van Veelen’s (2009) models.
Frequency Genic value (x) Phenotype (z)
1 − p 0 0
p 1 1
E(x) = p E(z) = p

Having obtained genetic values, we can write a linear model of social‐partner genetic value g′ as a function of an individual’s own genetic value g, to compute the coefficient of relatedness between social partners. From equation (B4.1), this is E(g′|g) = E(g′) + βg′,g (g − E(g)). Fitting this linear model to the population data (Table 5) obtains βg′,g = α. Hence, the kin‐selection coefficient of relatedness is given by:

image(7)

Table 5. Data for cost, benefit and relatedness calculations in Queller’s (1985) model.
Frequency Own genetic value (g) Partner’s genetic value (g′) Relative fitness (v)
(1 − p)2 + αp(1 − p) 0 0 1/E(F)
(1 − α)p(1 − p) 0 1 (1 + B)/E(F)
(1 − α)p(1 − p) 1 0 (1 − C)/E(F)
p 2 + αp(1 − p) 1 1 (1 + B − C + D)/E(F)
E(g) = p E(g′) = p E(v) = 1

Finally, we may write a linear model of the individual’s relative fitness v as a function of her genetic value g and the genetic value of her social partner g′. From equation (4), this is E(v|g,g′) = E(v) + βv,g|g (g − E(g)) + βv,g′|g (g′ − E(g′)). Fitting this linear model to the population data (Table 5) obtains the cost (c = −βv,g|g) and benefit (b = βv,g′|g) terms of Hamilton’s rule:

image(8)
image(9)

where inline image, inline image and inline image. Note that, of the synergistic relative fecundity benefit inline image that accrues to the proportion α + (1 − α)p of cooperators who have cooperative social partners, a fraction 1/(1 + α) is assigned by the regression analysis to direct fitness (i.e. −c = −inline image + [1/(1 + α)][α + (1 − α)p] inline image) and the remaining fraction α/(1 + α) is assigned to indirect fitness (i.e. rb = αinline image + [α/(1 + α)][α + (1 − α)p] inline image).

Having computed the three terms (r, b and c) of Hamilton’s rule, we can write a condition for natural selection to increase the average genetic value for cooperation. This is rb − c > 0, or:

image(10)

Condition (10) is precisely that which was derived by Queller (1985) for cooperation to be favoured by natural selection. Although Queller (1985) interpreted this result as a demonstration of the failure of Hamilton’s rule to correctly predict genetic change when gene action is not additive, we have derived it directly from Hamilton’s rule. How are we to account for this difference in conclusion?

Queller (1985) identified the first two terms on the LHS of condition (10) (i.e. αB − C, or equivalently, rB − C) as the LHS of Hamilton’s rule. The condition rB − C > 0 does not correctly predict genetic change when there is a deviation from additivity (D ≠ 0), so Queller (1985) suggested that deviations from additivity result in Hamilton’s rule’s failure. However, Hamilton’s rule is rb − c > 0, i.e. the costs and benefits are fitness effects, properly defined by a least‐squares multi‐regression analysis, rather than arbitrary fecundity parameters that just happened to be represented by similar letters of the alphabet (Grafen, 1985b; Queller, 1992a; Rousset, 2004; Gardner et al., 2007). We have shown that this canonical Hamilton’s rule correctly predicts the action of natural selection, irrespective of the degree of synergy.

A general point here is that we are not suggesting that alternative partitions, such as that provided by Queller (1985), are incorrect or not useful. Indeed, Queller’s result helps to make the mechanistic components of fitness effects more explicit (Gardner et al., 2007). Instead, our point is that Hamilton’s rule remains a correct statement, albeit one in which the cost and benefit terms may be somewhat complicated.

Three‐player games

van Veelen (2009) has argued that, although the issue of synergy can be resolved for two‐player games, Hamilton’s rule cannot be guaranteed to give a correct prediction of the action of natural selection in games of three or more players. This is based upon the idea that Hamilton’s rule contains fewer parameters than are needed to model such complexity. van Veelen (2009) illustrated this point with an example of a three‐player ‘stag hunt’ game. Here, we provide a detailed reanalysis of this game, from a Hamilton’s‐rule perspective.

We consider an infinite population of haploid individuals, with cooperation strategies genetically encoded as in the two‐player game, described above. We now assume that social groups contain three individuals, with the proportion of groups containing one, two and three cooperators being denoted p1, p2 and p3 respectively (hence, the proportion of groups containing zero cooperators is p0 = 1 − p1 − p2 − p3, and the population frequency of cooperators is p = 1/3 p1 + 2/3 p2 + p3. Again, we assume that social interaction mediates individual fecundity, with individuals having a unit baseline fecundity plus a payoff that is determined by their own strategy and the strategies of their two social partners, according to Table 6. According to the three‐player stag‐hunt game, assumed by van Veelen (2009), an individual loses a unit of fecundity if she cooperates, and additionally gains two units of fecundity if all three individuals in her social group cooperate. As before, we assume that an individual’s relative fitness is given by v = F/E(F).

Table 6. Fecundity payoffs in a three‐player game of synergistic cooperation. Based upon the model of van Veelen (2009).
Fecundity payoff, P
Non‐C/Non‐C C/Non‐C C/C
Cooperator −1 −1 1
Noncooperator 0 0 0

As we have assumed a genotype‐phenotype map that is exactly the same as for the above analysis of Queller’s (1985) model (Table 4), we already know that the genetic value for cooperation is g = 1 for individuals carrying the A allele and is g = 0 for individuals carrying the a allele. We compute relatedness by writing a linear model of social‐partner genetic value g′ as a function of an individual’s own genetic value g. From equation (B4.1), this is E(g′|g) = E(g′) + βg′,g (g − E(g)), and fitting this model to the population data (Table 7) obtains βg′,g = 1 − (p1 + p2)/3p(1 − p). Hence, the kin‐selection coefficient of relatedness is:

image(11)

Table 7. Population data for cost, benefit and relatedness calculations in van Veelen’s (2009) model.
Frequency Own genetic value (g) Partner’s genetic value (g′) Other partner’s genetic value (g′′) Relative fitness (v)
p 0 0 0 0 1/E(F)
p 1/3 0 0 1 1/E(F)
p 1/3 0 1 0 1/E(F)
p 2/3 0 1 1 1/E(F)
p 1/3 1 0 0 0
p 2/3 1 0 1 0
p 2/3 1 1 0 0
p 3 1 1 1 2/E(F)
E(g) = p E(g′) = p E(g′′) = p E(v) = 1

Finally, we may write a linear model of the individual’s relative fitness v as a function of her genetic value g and the genetic values of her social partners g′ and g′′. This is E(v|g,g′,g′′) = E(v) + βv,g|g′,g′′ (g − E(g)) + βv,g′|g,g′′ (g′ − E(g′)) + βv,g′′|g,g (g′′ − E(g′′)). Fitting this linear model to the population data (Table 7) obtains the cost (c = −βv,g|g′,g′′) and benefit (b = βv,g′|g,g′′ + βv,g′′|g,g) terms of Hamilton’s rule:

image(12)
image(13)

Note that, of the synergistic relative fecundity benefit 2/E(F) that accrues to the proportion p3/p of cooperators who have two cooperative social partners, a fraction 1/(1 + 2r) = 3p(1 − p)/[9p(1 − p) − 2(p1 + p2)] is assigned by the regression analysis to direct fitness, and the remaining fraction 2r/(1 + 2r) = 6pr(1 − p)/[9p(1 − p) − 2(p1 + p2)] is assigned to indirect fitness, analogous to the result obtained in our reanalysis of Queller’s (1985) model.

Having computed the three terms (r, b and c) of Hamilton’s rule, we can now write a condition for natural selection to increase the average genetic value for cooperation. This is rb − c > 0, and simplifies to give the condition:

image(14)

Hence, if more cooperators are present in groups of three cooperators than are not, then natural selection favours an increase in cooperation.

This is exactly the result derived by van Veelen (2009). However, van Veelen (2009) claimed that this result cannot be formulated in terms of Hamilton’s rule (in particular, ‘cannot be given in a formula with costs, benefits and relatedness only’). He argued that the same is true of any model with three or more social partners interacting in a synergistic way. This conclusion follows from an argument that: (i) in such models, details of population structure in addition to relatedness (genetic similarity of pairs of individuals) must be known to predict the action of natural selection; and (ii) such details do not feature as terms in Hamilton’s rule, so this cannot generally provide correct predictions. For example, two model populations can be set up such that the relatedness between pairs of social partners is the same but other moments of population structure are different, such that cooperation is selected for in one model but selected against in the other. van Veelen (2009) asserted that the corresponding Hamilton’s rules for both models will be identical, and hence unable to account for the difference in selective outcome.

However, as we have shown, this is based upon an incorrect understanding of kin selection and Hamilton’s rule. We have derived van Veelen’s (2009) condition for cooperation to be selectively favoured, using Hamilton’s rule. Irrespective of higher moments of population structure, Hamilton’s rule correctly predicts the action of natural selection. This is because the cost and benefit terms are also mediated by population structure, so that although higher moments of genetic structure are not captured in the relatedness coefficient, they may exert their effects through the cost and benefit terms of Hamilton’s rule. van Veelen (2009) has therefore confused the cost and benefit terms in Hamilton’s rule with arbitrary payoff parameters in an evolutionary game (another recent example of this occurs in Nowak et al., 2010b). In reality, the cost and benefit terms in Hamilton’s rule are the direct and indirect average fitness effects of a genetic predisposition for a character of interest, which are prone to be mediated by the genetic and ecological milieu.

Box 5 – Relatedness

The appropriate regression coefficient must be very near to Sewall Wright’s Coefficient of RelationshipHamilton (1963)

In the main text, we have defined the kin‐selection coefficient of genetic relatedness between social partners as r≡ cov(g′,g)/cov(g,g), where g is an individual’s genetic value for a character of interest, g′ is the genetic value of her social partner and the covariances are taken over all individuals in the population. This definition emerges directly from the application of Price’s (1970) theorem to social evolution, as shown in equation (4) of the main text (see also Hamilton, 1970). The quantity cov(g′,g)/cov(g,g) has a simple interpretation within the context of the least‐squares linear regression approach: it is equal to the least‐squares linear regression coefficient βg′,g in the model:

image((B5.1))

Hence, we can view the kin‐selection coefficient of relatedness as arising from a least‐squares linear regression analysis, in which we are attempting to predict the genetic value of an individual’s social partner given that we know the individual’s own genetic value for a character of interest.

Relatedness has been conceived as a regression coefficient from the very beginning of Hamilton’s work on kin‐selection theory. In the very first of these articles, Hamilton (1963) realized that the impact of a social partner’s increased reproductive success, as a consequence of being the recipient of the actor’s altruistic behaviour, on the genetic evolution of a population, does not simply depend upon the expected number of genes identical to those of the actor that will be added to the gene pool. In addition, those genes that are not identical to those of the actor and that will also be added to the gene pool will have a ‘diluting’ effect, reducing the extent to which the actor’s genes increase in frequency within the population. Consequently, Hamilton (1963) pointed out that the appropriate coefficient of relatedness is one that measures the concentration of the actor’s genes in the recipient, and he suggested that this would in principle take the form of a regression coefficient – albeit one that is well‐approximated by Wright’s correlation coefficient of relationship under certain simplifying assumptions (see also Hamilton, 1964, 1970). Orlove (1975) appears to have been the first to explicitly express the coefficient of relatedness as the slope of the least‐squares linear regression line through data describing the genetics of actors and their recipients. Orlove & Wood (1978) were the first to explicitly express relatedness as the ratio of covariances of genetic values, reproduced above. The geometrical aspects of the regression approach to relatedness was developed by Grafen (1985a,b), and it forms the basis of the widely used Kinship software that provides estimates of relatedness from microsatellite data (Queller & Goodnight, 1989).

Orlove & Wood (1978) also provided an alternative conception of relatedness in terms of covariances of genetic and phenotypic values, which gained some traction in the literature (e.g. Michod & Hamilton, 1980; Seger, 1981; Michod, 1982). Queller (1992a) provided both purely genetical and also phenotypic covariance forms for relatedness, suggesting that Hamilton’s rule is exact and completely general if the genetical form is used but special assumptions are required in order for the phenotypic form to provide the correct answer. Frank (1997a,b) showed that the kin‐selection approach can be extended to give a general theory of ‘correlated selection’, in which any set of predictors that correlate with genes and fitness can be incorporated into the regression analysis, giving rise to relatedness‐like coefficients that describe the association between gene and arbitrary predictor. However, throughout this article, we have focused upon purely genetical predictors, in the spirit of Fisher’s (1918, 1930) paradigm. This is so that the associated partition of natural selection into direct and indirect components will exactly correspond with the direct and indirect components of inclusive fitness, the quantity that organisms are designed to maximize (Hamilton, 1964, 1996; Grafen, 2006a), which better clarifies the link between the process and purpose of Darwinian adaptation.

Altruism among nonkin

The previous section considered claims for the invalidity of Hamilton’s rule that arise from misconceptions regarding costs and benefits. In this section, we consider a recent claim for the invalidity of Hamilton’s rule that concerns the coefficient of genetic relatedness. Fletcher & Doebeli (2009) devised a thought experiment, which they claimed has shown that altruism can be favoured by natural selection even when relatedness is zero. Thus, Fletcher & Doebeli (2009) argue that relatedness is not the key to altruism, and Hamilton’s rule can fail to correctly predict the action of natural selection. Here, we show that their conclusion arises because they have incorrectly computed the coefficient of relatedness in their model.

Fletcher & Doebeli (2009) considered a hypothetical strain of bacterium, for which an altruistic trait is encoded by two bi‐allelic loci. Individuals with allele A at the first locus and allele b at the second locus, or allele a at the first locus and allele B at the second locus, have a probability q of committing suicide, an altruistic act that provides a social partner with a fecundity benefit B (see Table 8). Individuals with the a allele at the first locus and the b allele at the second locus never commit suicide, and individuals with the A allele at the first locus and the B allele at the second locus are presumed absent (owing to lethality of this genotype). Fletcher & Doebeli (2009) considered that an experimenter always pairs Ab individuals with aB individuals, and ab with ab (hence, there is an implicit constraint that the frequency of A and B alleles must be equal; we denote these frequencies by p/2, i.e. the frequency of individuals with either the Ab or the aB genotype is p). Fletcher & Doebeli (2009) showed that the genes for altruism (A and B) can be favoured, despite the recipients of altruism being dissimilar at both loci (Ab helps aB, or aB helps Ab) and hence, they claim, having a relatedness of zero.

Table 8. Fecundity payoffs in a game of altruistic suicide. Based upon the model of Fletcher & Doebeli (2009).
Fecundity payoff, P
Suicider Nonsuicider
Suicider −1 −1
Nonsuicider B 0

As the preceding sections have made clear, natural selection is defined according to genetic – not phenotypic or genotypic – change. So, a kin selection analysis of this model begins by computing individuals’ genetic values for the altruistic character in question. We denote: the altruistic suicide phenotype z = 1 and the nonsuiciding phenotype z = 0; the A allele x1 = 1 and the a allele x1 = 0; and the B allele x2 = 1 and the b allele x2 = 0. From equation (B2.1), we can write a linear regression model of the phenotype as a function of its genic predictors: E(z|x1,x2) = E(z) + βz,x1|x2 (x1 − E(x1)) + βz,x2|x1 (x2 − E(x2)) and, fitting this linear model to the population data (Table 9), we find the least‐square regression coefficients are βz,x1 = βz,x2 = q. From equation (B2.2), genetic value is defined as g = E(z) + βz,x1 (x1 − E(x1)) + βz,x2 (x2 − E(x2)), which is g = 0 for ab individuals (x1 = x2 = 0) and g = q for both Ab individuals (x1 = 1, x2 = 0) and aB individuals (x1 = 0, x2 = 1).

Table 9. Population data for genetic value calculations in Fletcher & Doebeli’s (2009) model.
Proportion Genic value for locus 1 (x1) Genic value for locus 2 (x2) Phenotype (z)
1 − p 0 0 0
p(1 − q)/2 0 1 0
pq/2 0 1 1
p(1 − q)/2 1 0 0
pq/2 1 0 1
E(x1) = p/2 E(x2) = p/2 E(z) = pq

Having computed genetic values, we may compute the kin‐selection coefficient of genetic relatedness. From equation (B4.1), we write a linear‐regression model E(g′|g) = E(g′) + βg′,g(g − E(g)). Fitting this to population data (Table 10) obtains βg′,g = 1. Hence, the kin‐selection coefficient of relatedness is given by:

image(15)

Table 10. Population data for cost, benefit and relatedness calculations in Fletcher & Doebeli’s (2009) model. Here we have generalized the degree of assortment 0 ≤ α ≤ 1; Fletcher’s & Doebeli’s model is recovered by setting α = 1.
Frequency Own genetic value (g) Partner genetic value (g′) Own phenotype (z) Partner phenotype (z′) Relative fitness (v)
(1 − p)2 + αp(1 − p) 0 0 0 0 1/E(F)
(1 − α)p(1 − p)(1 − q) 0 q 0 0 1/E(F)
(1 − α)p(1 − p)q 0 q 0 1 (1+B)/E(F)
(1 − α)p(1 − p)(1 − q) q 0 0 0 1/E(F)
(1 − α)p(1 − p)q q 0 1 0 0
(p2 + αp(1 − p))(1 − q)2 q q 0 0 1/E(F)
(p2 + αp(1 − p))q(1 − q) q q 0 1 (1+B)/E(F)
(p2 + αp(1 − p))q(1 − q) q q 1 0 0
(p2 + αp(1 − p))q2 q q 1 1 0
E(g) = pq E(g′) = pq E(z) = pq E(z′) = pq E(v) = 1

We may now compute the cost and benefit terms of Hamilton’s rule. From equation (4), we write a linear‐regression model of relative fitness as a function of genetic predictors: E(v|g,g′) = E(v) + βv,g|g(g − E(g)) + βv,g′|g(g′ − E(g′)) and, fitting this to the population data (Table 10), we obtain the cost (c = −βv,g|g) and benefit (b = βv,g′|g) terms of Hamilton’s rule:

image(16)
image(17)

Note that, of the antagonistic relative‐fecundity cost qB/E(F) that accrues to potential suiciders paired with other potential suiciders (owing to suciders being unable to enjoy the fecundity benefits of their partner’s suicide), a fraction 1/(1+r) = ½ is assigned by the regression analysis to direct fitness, and the remaining fraction r/(1+r) = ½ is assigned to indirect fitness, analogous to the results obtained in our reanalysis of Queller’s (1985) and van Veelen’s (2009) models.

Having calculated r, b and c, we can substitute these into Hamilton’s rule rb − c > 0 to arrive at a condition for natural selection to favour an increase in altruistic suicide:

image(18)

Thus, we have recovered Fletcher & Doebeli’s (2009) result that natural selection can act to favour altruism in the context of their model. Fletcher & Doebeli (2009) argued that this result was not obtainable from a Hamilton’s‐rule perspective. However, we have derived their result directly from a Hamilton’s‐rule reanalysis of their model, and indeed Hamilton (1971) already provided an analysis of such ‘assortation’ effects.

The difference in conclusion arises because Fletcher & Doebeli (2009) incorrectly calculated the coefficient of genetic relatedness, treating it as a measure of genotypic similarity per se, rather than a measure of genetic similarity. Although social partners under their unusual pairing scheme carry different alleles at both of the loci underlying the altruistic trait, their genetic predisposition for altruism is exactly the same, and hence the coefficient of genetic relatedness is unity (r = 1), and not zero (r = 0) as Fletcher & Doebeli (2009) suggested. Hence, relatedness remains the key to altruism. More generally, Fletcher & Doebeli’s (2009) thought that experiment can be seen as an example of when cooperation is favoured due to assortative interactions between altruists, as discussed in the greenbeard literature (reviewed by Gardner & West, 2010).

Although the formal genetical definition of relatedness used here may appear somewhat alien, it is in fact natural for a number of reasons. First, as students of organismal adaptation, we are not interested in molecular evolution per se, so it is understandable that descriptions of natural selection should not hinge upon molecular similarity. Indeed, Darwinism was developed and formalized before the molecular genetics revolution of the 1950s, so it should be no surprise that such details are extraneous to the basic theory (Grafen, 1999, 2009). Second, natural selection can only ‘see’ phenotypes to the extent that they are heritable (Fisher, 1930). Consequently, it is understandable that descriptions of natural selection should hinge upon the heritable portion of the phenotype (i.e. the individual’s genetic value). Thirdly, relatedness is a population‐level statistic, and should not change according to which individuals we choose to focus upon. Fletcher & Doebeli (2009) focused upon the pairing of genotypically dissimilar Ab and aB individuals, and neglected the concomitant pairing of genotypically identical ab individuals. If relatedness were measured genotypically, then the relatedness defined by this pairing scheme would lurch from r = 0 to r = 1, depending upon whose genotypes we focused our attention. Defining relatedness properly (i.e. genetically) obtains r = 1 for both Ab/aB and ab/ab pairings.

Box 6 – How to make a kin‐selection model

The standard maximization method for solving ESS problems can be adapted to analyse models with interactions among relativesTaylor & Frank (1996, p. 28)

Optimality models of phenotypic evolution rest upon a level of abstraction that neglects the precise genetic architecture of a trait to facilitate the generation of testable predictions when such genetic details are unknown. This has been termed the ‘phenotypic gambit’ (Grafen, 1984), and it has been a hugely successful approach to developing evolutionary theory (Krebs & Davies, 1978, 1984, 1991, 1993, 1997).

From equation (2), the change in average genetic value owing to the action of natural selection is given by:

image((B6.1))

i.e. it is proportional to the least‐squares linear regression of relative fitness against genetic value for the trait in question, where the coefficient of proportionality is the additive genetic variance. Assuming a differentiable functional relationship between phenotypic value for a trait z and relative fitness v (i.e. v = W(z)), and the simplest relationship between genetic and phenotypic values (i.e. g = z), then, in the limit of vanishingly small variation around the population average trait value inline image, the regression is equal to the tangent of the fitness function, giving:

image((B6.2))

The optimality approach typically begins by assuming (or deriving) a fitness function W that relates an individual’s phenotypic trait value to its relative fitness, and differentiates this to obtain the direction of natural selection operating on this trait. If the tangent to the fitness function at the point inline image occupied by a population is positive (dW/dz > 0) then natural selection acts to favour the trait of interest, and if the tangent is negative (dW/dz < 0) then natural selection acts to disfavour the trait of interest. If an intermediate trait value is favoured by natural selection, then this is found by setting the derivative equal to zero and solving for inline image (and checking, from the second derivative, that this corresponds to a fitness maximum and not a fitness minimum). Hence, simple models relating trait value to fitness can be readily analysed using the tools of calculus to obtain predictions for evolutionarily and convergence stable strategies (see Taylor, 1996 for more details).

This basic approach has been adapted by Taylor & Frank (1996) for application to social evolutionary problems (see also Taylor, 1996; Frank, 1997b, 1998; Rousset, 2004; Taylor et al., 2007a,b). They allow for the individual’s fitness to be mediated by the phenotypes expressed by neighbours as well as by its own phenotype i.e. v = W(z,z′). Applying the chain rule of differential calculus, they partition the fitness derivative of equation (B6.2) into partial derivative effects of own and social partner phenotype:

image((B6.3))

As above, in the limit of vanishing trait variation the regression terms in the personal fitness formulation of Hamilton’s rule (equation 5) are equal to the derivative terms in equation (B6.3), i.e. inline image. Hence, the action of natural selection predicted by this simple chain rule approach corresponds exactly to Hamilton’s rule: the trait is favoured when rb − c > 0, disfavoured when rb − c < 0 and intermediate optima satisfy rb − c = 0. Here, we have considered that all individuals in the model population are equivalent. However, class structure is readily allowed for by this simple approach (Taylor & Frank, 1996; and see Gardner et al., 2009 for a recent worked example).

As an illustrative example, consider a simple model of the evolution of parasite virulence (Frank, 1996a). We assume a simple tradeoff between the exploitation of host resources for the production of parasite propagules vs. keeping the host alive to achieve sustained exploitation. We assume the overall transmission of infectious propagules from an infected host is a linearly decreasing function of the rate z′ of exploitation by its parasites. And we assume that the share of these infectious propagules that derive from a focal parasite within the infection is given by its relative exploitation strategy within the host, z/z′. Hence, the focal parasite’s transmission success is given by:

image((B6.4))

Applying the approach of Taylor & Frank (1996), we have partial derivatives inline image. Thus, the optimal level of exploitation satisfies inline image. The reduction in host longevity owing to parasite exploitation is proportional to V = az′, and at the parasite optimum this is V = 1 − r. Thus, higher parasite relatedness r and higher host susceptibility to parasite exploitation a favour reduced exploitation by parasites. In this simple model, the level of host susceptibility a has no effect on the virulence suffered by the host at the parasite optimum: if hosts evolve reduced susceptibility to parasite exploitation, parasites respond by increasing their level of exploitation until hosts are suffering as much as they were before (Frank, 1996a).

In this virulence example, we have assumed numerous parasites per infection, such that z′ is both the average exploitation phenotype exhibited by the social partners of the focal parasite and also the average exploitation phenotype exhibited by the infection as a whole. However, for social groups comprising a smaller number of individuals, we must distinguish the average for social partners from the group average – as the latter contains the former and additionally includes a contribution from the focal individual herself. In particular, if the group contains the focal individual with phenotype z and n − 1 other individuals with average phenotype z′, the group’s average phenotypic value is z′′ = (1/n)z + ((n − 1)/n)z′. For ‘whole group’ traits, whose social effects impact equally on all individual’s the group, it is often easier to use z′′ rather than z′ as the second variable in the chain rule analysis. However, it is important to note that the corresponding relatedness coefficient dz′′/dz is a whole‐group relatedness, i.e. the relatedness of the focal individual to the average member of her group, including herself. In the absence of relatedness between social partners, whole‐group relatedness will not be zero but rather 1/n. In other scenarios, an ‘others only’ approach will be more natural. Importantly, the partition of natural selection made by Hamilton’s rule (i.e. direct vs. indirect fitness effects) corresponds to the others‐only partition (any impact of social group that owes to the individual being a part of its own group counts towards the direct, rather than the indirect, fitness effect) and the coefficient of relatedness in Hamilton’s rule is an others‐only relatedness (Frank, 1996b; Pepper, 2000).

The personal‐fitness methodology developed by Taylor & Frank (1996) has hugely increased the ease of performing social‐evolutionary analyses. Although an inclusive‐fitness approach can be used to solve mathematical models, it can sometimes be tricky for the modeller to keep track of all the inclusive‐fitness effects of an action, whereas it is often easier to write down a complete personal‐fitness function and simply follow the recipe laid out by Taylor & Frank (1996). The ease with which the approach accommodates such complexities such as class structure and relatedness varying at multiple scales helps to ensure that biology leads the analysis. Moreover, the methodology allows analyses to be developed at the desired level of complexity – for example, the fitness function can be assumed or else derived from a mechanistic model, and relatedness can be treated as a parameter (‘open’ model) or else derived from an explicit model of population processes (‘closed’ model; Gardner & West, 2006).

Discussion

Hamilton’s (1963, 1964, 1970) partitions natural selection into two components: direct‐fitness effects and indirect‐fitness effects. Being a simple statement of natural selection, that makes no mechanistic assumptions of how these direct and indirect fitness effects come about, Hamilton’s rule has the same generality as the theory of natural selection itself. In particular, Hamilton’s rule does not break down in response to factors such as gene interactions, strong selection, frequency dependence, pure strategies, mechanisms that favour altruism between nongenealogical relatives, facultative strategies, greenbeards, evolutionary bifurcations, indirect genetic effects, fitness consequences over multiple generations or group selection or non‐pairwise interactions (Table 2). All of these factors are implicitly allowed for in Hamilton’s rule.

General theory vs. streamlined methodology

Evolutionary theory falls into two broad categories: that which aims for generality, and that which aims for applicability to specific problems. These two strands are present in kin‐selection theory, and our main focus in this article has been to emphasize the general theory of kin selection and the corresponding general formulation of Hamilton’s rule. However, kin‐selection theorists also make use of streamlined methodologies which – owing to simplifying assumptions – provide straightforward tools that are readily applied to specific problems to yield concrete, testable results. These streamlined methodologies often employ the phenotypic gambit (Grafen, 1984; see below), i.e. the assumption of a straightforward genetic architecture, and vanishingly low genetic variation is often assumed so that regression coefficients can be replaced with derivatives (e.g. Taylor, 1996; Taylor & Frank, 1996; Frank, 1997b, 1998; Rousset & Billiard 2000; Rousset, 2004; Taylor et al., 2007a; Box 6), allowing the modeller to draw upon the power of calculus.

We have employed Fisher’s (1918, 1930) statistical, genetical paradigm to show that Hamilton’s rule provides a completely general partition of natural selection. Although they have clear meanings, the cost, benefit and relatedness terms of Hamilton’s rule are potentially very complicated functions of genotypic, demographic and ecological parameters (Grafen, 1985b, 2006a; Frank, 1998; Rousset, 2004; Gardner et al., 2007). The cost and benefit terms are strictly defined as partial additive fitness effects obtained by a least‐squares regression analysis, which fits straight lines through population data of arbitrary complexity (equations 5 & 6; Queller, 1992a; Frank, 1997a, 1998; Gardner et al., 2007), and should not be confused with arbitrary model parameters such as fecundity effects or abstract utility payoffs in an evolutionary game (Grafen, 1985b; Rousset, 2004). Likewise, the coefficient of relatedness is a statistical term, emerging from least‐squares linear regression analysis of social partners’ genetic predispositions for characters of interest (Hamilton, 1963, 1970, 1972; Orlove, 1975; Orlove & Wood, 1978; Michod & Hamilton, 1980; Grafen, 1985a; Queller, 1992a; Frank, 1997a, 1998; Box 5), and should not be confused with arbitrary quantities such as those relating to genealogical or genotypic or phenotypic closeness. All of these points follow if we regard Hamilton’s rule as a straightforward extension of Fisher’s (1918, 1930) genetical paradigm of natural selection, which represents the mathematical foundations of Darwinian theory itself. It is simply incorrect to claim that Hamilton’s rule requires restrictive assumptions or that it almost never holds (e.g. van Veelen, 2009; Nowak et al., 2010b).

By contrast, the use of more streamlined methodology is to develop testable predictions for specific problems. The general form of Hamilton’s rule applies to any scatter of genetic and fitness data, irrespective of the underlying causes of this variation. It makes clear that, whatever the relationship between trait and reproductive success, the resulting action of natural selection can be decomposed into cost, benefit and relatedness terms. This is an important result to have obtained, but in most cases the aim of the evolutionary theorist will be to develop readily testable, concrete predictions for particular scenarios, and here the mechanistic causes of the relationship between trait and reproductive success will be of more interest. In such cases, it is more appropriate to build a mechanistic, dynamically sufficient model and to use any standard evolutionary methodology, such as neighbour‐modulated‐fitness, theoretical population genetics, quantitative genetics or evolutionary game theory to analyse the model and derive predictions (e.g. Taylor & Frank, 1996). Hamilton’s rule may then be used as a conceptual aid for understanding the results delivered by these analyses, and connecting these with the results of other models analysed using different methods, as it provides a general framework and universal language for social‐evolution theory (Taylor & Frank, 1996; Frank, 1998; Rousset, 2004; Gardner et al., 2007). Thus, kin‐selection theory is not a specific methodological approach for making and solving models, distinct from population genetics or evolutionary game theory (contraNowak et al., 2010b and Traulsen, 2010). Rather, it is the very foundation of social‐evolution theory.

The streamlined methodologies used to analyse kin‐selection models make simplifying assumptions, which are not required for Hamilton’s rule to work but which are made to achieve dynamical sufficiency or simply for ease of analysis. For example, selection distorts genealogies, such that relatedness can be difficult to compute in populations undergoing strong selection (Hamilton, 1964; Bulmer, 1994; but see Ajar, 2003; Gardner et al., 2007 and Rousset & Roze, 2007), and so a standard simplifying assumption is that of weak selection, which enables neutral‐population coefficients of relatedness to be used as close approximations to the real thing (Taylor 1992, 1996; Frank, 1986, 1998; Rousset, 2004). However, the frequent use of this assumption does not mean that relatedness is only defined in neutral populations or that Hamilton’s rule fails when selection is not weak. More generally, very many authors have not been careful to separate the general theory of kin selection from the streamlined methods, when discussing the limiting assumptions of the latter. In Table 1, we have restricted ourselves to listing only those articles in which the suggested limitations of the kin‐selection approach have concerned its foundations rather than the more heuristic versions of the theory.

Statistics and evolutionary theory

Our account of the general theory of kin selection has been framed in statistical terms. The use of statistical methods in the mathematical development of Darwinian theory has itself been subjected to recent criticism (van Veelen, 2005; Nowak et al., 2010b), so we address this criticism here. The concern is that statistical terms – such as covariances and least‐squares regressions – should properly be reserved for conventional statistical analyses, where hypotheses are tested against explicit data, and that they are out of place in the foundations of evolutionary theory (van Veelen, 2005; Nowak et al., 2010b). However, this concern is misplaced. First, natural selection is a statistical process, and it is therefore natural that this should be defined in terms of aggregate statistics, even if only strictly by analogy (Frank, 1997a, 1998). Second, Fisher (1930, p198) coined the term ‘covariance’ in the context of his exposition of the genetical theory of natural selection, so the evolutionary usage of this term has precedent over the way the term is used in other fields. Third, it is a mistake to assume that it must be the evolutionary theorist, writing out covariances, who is performing the equivalent of a statistical analysis. A better analogy is to regard Mother Nature in the role of statistician, analysing fitness effects of genes by the method of least‐squares, and driving genetic change according to the results of her analyses (cf. Crow, 2008). More generally, analogy is the basis of all understanding, so when isomorphisms arise unexpectedly between different branches of mathematics (in this case, theoretical population genetics and statistical least‐squares analysis) this represents an opportunity for advancing scientific progress and not an anomaly that is to be avoided.

The phenotypic gambit

The phenotypic gambit is a simplifying assumption that phenotypes of biological interest are controlled by a simple genetic architecture, such that evolution proceeds on an essentially adaptationist basis without being upset by recombination between epistatically‐interacting loci and other such factors (Grafen, 1984). This gambit is central to the streamlined kin‐selection methodologies (Box 6), and the evolutionarily stable strategy (ESS) approach more generally. It is clearly a caricature of biological reality, and its users acknowledge that population genetics provides the formal basis for evolutionary theory, and that exact prediction of how a trait will evolve does require detailed knowledge of its true underlying genetic architecture. However, this concession is then set aside on the basis that such genetic details are not known for even a single trait in a single species, so that such exactness is unattainable in practice. Instead, the phenotypic gambit trades exactness for generality, and is employed to deliver approximate predictions that may be applied to a wide range of species that face similar adaptive problems but potentially encode their phenotypes with altogether different loci. Thus, the phenotypic gambit is a pragmatic approach to problems that evade exact solution.

However, and more importantly, the phenotypic gambit facilitates the interplay between theory and data. First, working out the exact solution to a specific model is not very helpful, because in reality we will have a cloud of possible models in mind, and want to get at their robust and general features. The phenotypic gambit provides a robust approximation to a wide range of models, that can tell us more than the exact solution of any one model (for a discussion of such issues more generally in mathematical biology, see May, 2004). Second, the phenotypic gambit provides readily derived predictions, the robustness of which can be easily investigated in response to variation in both the number and values of model parameters. For example, because population‐genetic methods require specific equations for each particular set of assumptions about how relatives interact, the ESS approach has been the only general way to study sex allocation for arbitrary population structures (Frank, 1995, 1998). It is for this reason that kin‐selection theory has had such a larger impact than population‐genetic theory on empirical areas such as sex allocation (West, 2009, pp. 359–361). More generally, decades of theoretical and empirical research activity in disciplines such as behavioural and evolutionary ecology have demonstrated the clear usefulness of the phenotypic gambit (Krebs & Davies, 1978, 1984, 1991, 1997; Cockburn, 1991; Stearns, 1992; Alcock, 2005; Mayhew, 2006; Westneat & Fox, 2010).

We have taken a subtly different approach in this article, using Price’s (1970, 1972b) theorem to explicitly separate the selection and transmission components of evolutionary change, and setting aside the transmission effect and focusing upon the selection effect. Thus, we have not made any limiting assumptions about the genetic architecture underlying traits of interest, but we have paid for this generality by limiting ourselves to only partial results, corresponding to only the selective component of evolutionary change. Hamilton’s rule is explicitly understood to be a theorem that concerns natural selection only, and not the totality of evolutionary change. Hamilton’s rule would be trivially violated by factors such as mutation pressure if we were to understand it as a strong statement about evolutionary change as a whole. Given that Hamilton’s rule is a statement about natural selection only, then it must be formally framed in terms of genetic change, rather than genotypic or phenotypic change, as natural selection only ‘sees’ the heritable component of traits. The phenotypic gambit provides a means of directly translating these genetical results into phenotypic terms that are of more immediate use to empirical biologists.

Kin selection

The term ‘kin selection’ is used in at least three different ways. Hamilton (1963, 1964, 1970, 1971, 1975) developed Hamilton’s rule as a general partition of natural selection, as we have described above, with r being genetic relatedness. The term ‘kin selection’ was introduced in a short commentary by Maynard Smith (1964), to distinguish Hamilton’s theory from Wynne‐Edwards (1962) ideas about group‐level adaptations. Since then, kin selection has been used in three ways: (i) traits are described as being favoured by kin selection if they are favoured due to the consequences of interactions between close genealogical relatives (i.e. the rb term is important, and r is defined in terms of genealogy; e.g. Maynard Smith, 1964; Grafen, 2006a); (ii) traits are described as being favoured by kin selection if they are favoured due to the indirect‐fitness consequences of the trait (i.e. the rb term is important, and r is defined as relative genetic similarity per se, following Hamilton, 1970; e.g. Foster et al., 2006); (iii) kin‐selection theory or the kin‐selection partition describes how the total action of natural selection can be separated into two components – direct and indirect selection (e.g. Frank, 1998).

The difference between the first two uses is that the second (and third) allows for when indirect benefits can accrue owing to interaction between individuals who are not genealogically close relatives, such as through greenbeards effects (Hamilton, 1964; Dawkins, 1976; Gardner & West, 2010). Although the first use fits best with the use of ‘kin’ in the English language, it is the second use that best describes Hamilton’s theory. Although this could be used to argue that kin selection is not the best phrase to describe Hamilton’s theory (see also Hamilton, 1975), its usage is far too common for dropping it to be a realistic option. By way of analogy, the term ‘linkage disequilibrium’ refers to any statistical association between genes, and not just those that are due to physical linkage of genes on the same chromosome. However, rather than get rid of the term, population geneticists credit each other with enough intelligence to see past the misnomer, and do not waste time trying to bring down hitchhiking theory by pointing out examples where genes are situated on different chromosomes. Furthermore, in practice, the distinction between relatedness and relationship is rarely important, because the major cause of significant relatedness between individuals in the real world is genealogical closeness (Okasha, 2002), with greenbeard effects providing only very infrequent exceptions (reviewed by Gardner & West, 2010). Nonetheless, confusion is best avoided by realizing that different authors may mean different things by the phrase kin selection (or, indeed, Hamilton’s rule), and by being clear about what is intended by the phrase, whenever it is used (West et al., 2007a,b).

Alternative partitions

Other ways of partitioning natural selection are possible, and indeed it can often be useful to look at the same process from different perspectives (Maynard Smith, 1983; Frank, 1998). Here, we have focused upon the kin‐selection partition of natural selection into direct and indirect components. An alternative partition is that of multilevel selection, which partitions total change into between‐group and within‐group components (Price, 1972b; Hamilton, 1975; Wade, 1985; Okasha, 2006). Another is the Barton‐Turelli multilocus methodology, which partitions natural selection into a sum of products of selection coefficients and generalized linkage disequilibria (Barton & Turelli, 1991; Kirkpatrick et al., 2002; Gardner et al., 2007).

The usefulness of these different partitions depends upon the purpose at hand. The major use of the kin‐selection approach is that it facilitates the linking of theory to empirical data, as demonstrated by research in fields such as sex allocation, cooperation and conflict in animal societies, cooperation in microbes, parent‐offspring conflict, sibling conflict, selfish genetic elements and genomic imprinting (Mock & Parker, 1997; Haig, 2002; Sachs et al., 2004; Burt & Trivers, 2006; Ratnieks et al., 2006; West et al., 2006, 2007a,b; Strassmann & Queller, 2007; Boomsma, 2009; Clutton Brock, 2009; Hatchwell, 2009; West, 2009). The reason for this empirical success is that the kin‐selection approach emphasises how an individual’s behaviour influences her own reproductive success (and hence her direct fitness, c) and also the reproductive success of her social partners (and hence her indirect fitness, rb). Thus, we can imagine the individual adjusting her inclusive fitness (the sum of her direct and indirect fitness) by altering her behaviour. Natural selection leads organisms to appear designed to maximize their inclusive fitness (Hamilton, 1964, 1996; Grafen, 2006a; Gardner, 2009) and, as a consequence, kin‐selection theory provides an excellent means for understanding the rationale of social adaptations.

Acknowledgments

We thank: N. Barton, J. Biernaskie, S. Frank, A. Grafen, A. Herre, L. Lehmann, S. Lion, N. Mideo, A. Moore, H. Ohtsuki, I. Pen, D. Queller, A. Rodrigues and F. Rousset for helpful discussion; the Royal Society, Balliol College, Leverhulme Trust and ERC for funding.

      Number of times cited: 173

      • , Bifurcated HR practices in family firms: Insights from the normative-adaptive approach to stepfamilies, Human Resource Management Review, 28, 1, (68), (2018).
      • , Mate limitation and sex ratio evolution, Royal Society Open Science, 5, 2, (171135), (2018).
      • , Hamilton's inclusive fitness maintains heritable altruism polymorphism throughrb=c, Proceedings of the National Academy of Sciences, (201710215), (2018).
      • , Demography, life history and the evolution of age‐dependent social behaviour, Journal of Evolutionary Biology, 31, 9, (1340-1353), (2018).
      • , The benefits of grouping as a main driver of social evolution in a halictine bee, Science Advances, 10.1126/sciadv.1700741, 4, 10, (e1700741), (2018).
      • , Inclusive fitness for in-laws, Biology Letters, 10.1098/rsbl.2018.0515, 14, 10, (20180515), (2018).
      • , Who suppresses female sexuality? An examination of support for Islamic veiling in a secular Muslim democracy as a function of sex and offspring sex, Evolution and Human Behavior, 10.1016/j.evolhumbehav.2018.06.006, 39, 6, (632-638), (2018).
      • , Modeling relatedness and demography in social evolution, Evolution Letters, 2, 4, (260-271), (2018).
      • , The Evolution of Psychological Altruism, Philosophy of Science, 10.1086/699743, 85, 5, (1054-1064), (2018).
      • , Mutual altruism and long-term optimization of the inclusive fitness in multilocus genetic systems, Theoretical Population Biology, 10.1016/j.tpb.2018.10.005, (2018).
      • , Assortment and the analysis of natural selection on social traits, Evolution, 71, 11, (2693-2702), (2017).
      • , Kin Selection in the RNA World, Life, 7, 4, (53), (2017).
      • , The general form of Hamilton’s rule makes no predictions and cannot be tested empirically, Proceedings of the National Academy of Sciences, 10.1073/pnas.1701805114, 114, 22, (5665-5670), (2017).
      • , Interpreting selection when individuals interact, Methods in Ecology and Evolution, 8, 6, (688-699), (2017).
      • , Synergy Drives the Evolutionary Dynamics in Biology and Economics, First Complex Systems Digital Campus World E-Conference 2015, 10.1007/978-3-319-45901-1_34, (311-331), (2016).
      • , Metabolism and the Evolution of Social Behavior, Molecular Biology and Evolution, 34, 9, (2367), (2017).
      • , Cooperation, Evolution of ☆, Reference Module in Life Sciences, 10.1016/B978-0-12-809633-8.02340-2, (2017).
      • , Human Sociobiology and Group Selection Theory, On Human Nature, 10.1016/B978-0-12-420190-3.00023-5, (383-396), (2017).
      • , Inclusive fitness in finite populations—effects of heterogeneity and synergy, Evolution, 71, 3, (508-525), (2017).
      • , How to make a sterile helper, BioEssays, 39, 1, (2016).
      • , Synthesizing perspectives on the evolution of cooperation within and between species, Evolution, 71, 4, (814-825), (2017).
      • , Beyond Hamilton's rule, Science, 356, 6337, (485), (2017).
      • , Synergistic cooperation promotes multicellular performance and unicellular free-rider persistence, Nature Communications, 8, (15707), (2017).
      • , Intrafamily and intragenomic conflicts in human warfare, Proceedings of the Royal Society B: Biological Sciences, 284, 1849, (20162699), (2017).
      • , Neutral aggregation in finite-length genotype space, Physical Review E, 95, 1, (2017).
      • , Inbreeding, but not seed availability, affects dispersal and reproductive success in a seed-inhabiting social beetle, Behavioral Ecology and Sociobiology, 71, 12, (2017).
      • , The inclusive fitness controversy: finding a way forward, Royal Society Open Science, 10.1098/rsos.170335, 4, 7, (170335), (2017).
      • , Inclusive Fitness Theory and the Evolution of Mind and Language, Erkenntnis, (2017).
      • , What’s wrong with evolutionary biology?, Biology & Philosophy, 10.1007/s10539-016-9557-8, 32, 2, (263-279), (2016).
      • , The evolution of cooperation in simple molecular replicators, Proceedings of the Royal Society B: Biological Sciences, 284, 1864, (20171967), (2017).
      • , One equation to rule them all: a philosophical analysis of the Price equation, Biology & Philosophy, 32, 1, (97), (2017).
      • , Group Selection ☆, Reference Module in Life Sciences, 10.1016/B978-0-12-809633-8.06485-2, (2017).
      • , By genes alone: a model selectionist argument for genetical explanations of cooperation in non-human organisms, Biology & Philosophy, (2017).
      • , Hamilton's rule, Journal of Theoretical Biology, 10.1016/j.jtbi.2016.08.019, 414, (176-230), (2017).
      • , The evolution of mutualism with modifiers, Ecology and Evolution, 7, 16, (6114-6118), (2017).
      • , Hamilton's rule, inclusive fitness maximization, and the goal of individual behaviour in symmetric two‐player games, Journal of Evolutionary Biology, 29, 3, (473-482), (2016).
      • , Asymmetry within social groups: division of labour and intergroup competition, Journal of Evolutionary Biology, 29, 3, (560-571), (2015).
      • , The influence of space and time on the evolution of altruistic defence: the case of ant slave rebellion, Journal of Evolutionary Biology, 29, 5, (874-886), (2016).
      • , Invasion fitness, inclusive fitness, and reproductive numbers in heterogeneous populations, Evolution, 70, 8, (1689-1702), (2016).
      • , Genes as Cues of Relatedness and Social Evolution in Heterogeneous Environments, PLOS Computational Biology, 12, 6, (e1005006), (2016).
      • , Aggression in Columbian ground squirrels: relationships with age, kinship, energy allocation, and fitness, Behavioral Ecology, (arw098), (2016).
      • , The Mechanistic, Genetic, and Evolutionary Basis of Worker Sterility in the Social Hymenoptera, , 10.1016/bs.asb.2016.03.002, (251-317), (2016).
      • , Multilevel Selection in Kin Selection Language, Trends in Ecology & Evolution, 10.1016/j.tree.2016.07.006, 31, 10, (752-762), (2016).
      • , Evolution of worker policing, Journal of Theoretical Biology, 399, (103), (2016).
      • , References, The Primate Origins of Human Nature, (443-490), (2016).
      • , Game theoretic treatments for the differentiation of functional roles in the transition to multicellularity, Journal of Theoretical Biology, 395, (161), (2016).
      • , The causal meaning of Hamilton’s rule, Royal Society Open Science, 10.1098/rsos.160037, 3, 3, (160037), (2016).
      • , Inclusive Fitness and the Maximizing-Agent Analogy, The British Journal for the Philosophy of Science, (axw003), (2016).
      • , There is no fitness but fitness, and the lineage is its bearer, Philosophical Transactions of the Royal Society B: Biological Sciences, 10.1098/rstb.2015.0085, 371, 1687, (20150085), (2016).
      • , There is no inclusive fitness at the level of the individual, Current Opinion in Behavioral Sciences, 10.1016/j.cobeha.2016.10.002, 12, (122-128), (2016).
      • , Inclusive Fitness 101: Math Not Required, Trends in Ecology & Evolution, 31, 3, (178), (2016).
      • , Interference Competition Among Household Strains of Pseudomonas, Microbial Ecology, 72, 4, (821), (2016).
      • , Extended inclusive fitness theory: synergy and assortment drives the evolutionary dynamics in biology and economics, SpringerPlus, 5, 1, (2016).
      • , Drosophila As a Genetically Tractable Model for Social Insect Behavior, Frontiers in Ecology and Evolution, 4, (2016).
      • , Kin-directed food sharing promotes lifetime natal philopatry of both sexes in a population of fish-eating killer whales, Orcinus orca, Animal Behaviour, 115, (81), (2016).
      • , Natural selection and the maximization of fitness, Biological Reviews, 91, 3, (712-727), (2015).
      • , Evolutionary Dynamics of Homophily and Heterophily, Scientific Reports, 6, 1, (2016).
      • , William D. Hamilton’s Brazilian lectures and his unpublished model regarding Wynne-Edwards’s idea of natural selection. With a note on ‘pluralism’ and different philosophical approaches to evolution, History and Philosophy of the Life Sciences, 38, 4, (2016).
      • , The Relation between Kin and Multilevel Selection: An Approach Using Causal Graphs, The British Journal for the Philosophy of Science, 67, 2, (435), (2016).
      • , Group-level events are catalysts in the evolution of cooperation, Journal of Theoretical Biology, 410, (125), (2016).
      • , Hamilton׳s Rule in finite populations with synergistic interactions, Journal of Theoretical Biology, 397, (151), (2016).
      • , Triggering Kin Selection, Encyclopedia of Evolutionary Psychological Science, 10.1007/978-3-319-16999-6_3089-1, (1-8), (2016).
      • , What is inclusive fitness theory, and what is it for?, Current Opinion in Behavioral Sciences, 10.1016/j.cobeha.2016.09.015, 12, (103-108), (2016).
      • , Indirect Fitness Benefits Enable the Spread of Host Genes Promoting Costly Transfer of Beneficial Plasmids, PLOS Biology, 14, 6, (e1002478), (2016).
      • , Kin and multilevel selection in social evolution: a never-ending controversy?, F1000Research, 10.12688/f1000research.8018.1, 5, (776), (2016).
      • , Fluctuation driven fixation of cooperative behavior, Biosystems, 127, (60), (2015).
      • , Sex investment ratios in eusocial Hymenoptera support inclusive fitness theory, Journal of Evolutionary Biology, 28, 11, (2106-2111), (2015).
      • , Regression, least squares, and the general version of inclusive fitness, Evolution, 69, 11, (2963-2970), (2015).
      • , Understanding Honey Bee Worker Self-Sacrifice, Genomics, Physiology and Behaviour of Social Insects, 10.1016/bs.aiip.2014.12.002, (325-354), (2015).
      • , Sociobiology: Overview, International Encyclopedia of the Social & Behavioral Sciences, 10.1016/B978-0-08-097086-8.81065-X, (908-911), (2015).
      • , The genetical theory of multilevel selection, Journal of Evolutionary Biology, 28, 2, (305-319), (2015).
      • , Multilevel selection theory and evidence: a critique of Gardner, 2015, Journal of Evolutionary Biology, 28, 9, (1734-1746), (2015).
      • , More on the genetical theory of multilevel selection, Journal of Evolutionary Biology, 28, 9, (1747-1751), (2015).
      • , The interpretation of selection coefficients, Evolution, 69, 5, (1101-1112), (2015).
      • , Relatedness, Conflict, and the Evolution of Eusociality, PLOS Biology, 13, 3, (e1002098), (2015).
      • , Kinship, Evolution of, International Encyclopedia of the Social & Behavioral Sciences, 10.1016/B978-0-08-097086-8.81041-7, (44-52), (2015).
      • , Desperately Seeking Meaning, BioScience, 65, 3, (328), (2015).
      • , Games among relatives revisited, Journal of Theoretical Biology, 378, (103), (2015).
      • , Extensive fitness and human cooperation, Theory in Biosciences, 134, 3-4, (127), (2015).
      • , Conceptualization and Operationalization of an Altruistic and Egoistic Continuum of Organizational Citizenship Behavior Motivations, Journal of Hospitality & Tourism Research, (109634801561941), (2015).
      • , Inclusive Fitness Theory Becomes an End in Itself, BioScience, 65, 11, (1103), (2015).
      • , Social genetic and social environment effects on parental and helper care in a cooperatively breeding bird, Proceedings of the Royal Society B: Biological Sciences, 282, 1810, (20150689), (2015).
      • , Kin Selection and Its Critics, BioScience, 65, 1, (22), (2015).
      • , A microbial model of economic trading and comparative advantage, Journal of Theoretical Biology, 364, (326), (2015).
      • , An experimental approach to test the source of information on kinship in butterfly immatures, Ethology Ecology & Evolution, 27, 3, (294), (2015).
      • , Kin Selection, International Encyclopedia of the Social & Behavioral Sciences, 10.1016/B978-0-08-097086-8.81009-0, (26-31), (2015).
      • , Does microchimerism mediate kin conflicts?, Chimerism, 10.4161/chim.29122, 5, 2, (53-55), (2014).
      • , Measures of success in a class of evolutionary models with fixed population size and structure, Journal of Mathematical Biology, 68, 1-2, (109), (2014).
      • , Gene mobility and the concept of relatedness, Biology & Philosophy, 29, 4, (445), (2014).
      • , Hamilton's Rule and Its Discontents, The British Journal for the Philosophy of Science, 65, 2, (381), (2014).
      • , Hamilton's rule and the causes of social evolution, Philosophical Transactions of the Royal Society B: Biological Sciences, 369, 1642, (20130362), (2014).
      • , Seven Misconceptions About the Mereological Fallacy: A Compilation for the Perplexed, Erkenntnis, 79, 5, (1077), (2014).
      • , Inclusive fitness: 50 years on, Philosophical Transactions of the Royal Society B: Biological Sciences, 369, 1642, (20130356), (2014).
      • , Hunter–Gatherer population structure and the evolution of contingent cooperation, Evolution and Human Behavior, 35, 3, (219), (2014).
      • , NONADAPTIVE PROCESSES CAN CREATE THE APPEARANCE OF FACULTATIVE CHEATING IN MICROBES, Evolution, 68, 3, (816-826), (2013).
      • , Mating ecology explains patterns of genome elimination, Ecology Letters, 17, 12, (1602-1612), (2014).
      • , Cultural transmission and the evolution of human behaviour: a general approach based on the Price equation, Journal of Evolutionary Biology, 27, 2, (231-241), (2013).
      • , Effects of genetic similarity on the life‐history strategy of co‐infecting trematodes: are parasites capable of intrahost kin recognition?, Journal of Evolutionary Biology, 27, 8, (1623-1630), (2014).
      • , Natural selection drives the evolution of ant life cycles, Proceedings of the National Academy of Sciences, 111, 35, (12585), (2014).
      • , War and peace: social interactions in infections, Philosophical Transactions of the Royal Society B: Biological Sciences, 369, 1642, (20130365), (2014).
      • , Quantitative genetic versions of Hamilton's rule with empirical applications, Philosophical Transactions of the Royal Society B: Biological Sciences, 369, 1642, (20130358), (2014).
      • , A Conceptual Framework for Organismal Biology: Linking Theories, Models, and Data, Integrative and Comparative Biology, 54, 5, (736), (2014).
      • , Site-specific group selection drives locally adapted group compositions, Nature, 514, 7522, (359), (2014).
      • , A simple model of group selection that cannot be analyzed with inclusive fitness, Journal of Theoretical Biology, 360, (279), (2014).
      • , Generalizations of Hamilton's rule applied to non-additive public goods games with random group size, Frontiers in Ecology and Evolution, 2, (2014).
      • , Inclusive fitness maximization: An axiomatic approach, Journal of Theoretical Biology, 350, (24), (2014).
      • , Continuous-time models of group selection, and the dynamical insufficiency of kin selection models, Journal of Theoretical Biology, 349, (22), (2014).
      • , Inclusive fitness and the sociobiology of the genome, Biology & Philosophy, 29, 4, (477), (2014).
      • , Origins of evolutionary transitions, Journal of Biosciences, 39, 2, (303), (2014).
      • , Resolving Conflicts During the Evolutionary Transition to Multicellular Life, Annual Review of Ecology, Evolution, and Systematics, 45, 1, (599), (2014).
      • , The quantitative genetics of indirect genetic effects: a selective review of modelling issues, Heredity, 112, 1, (61), (2014).
      • , The genetical theory of social behaviour, Philosophical Transactions of the Royal Society B: Biological Sciences, 369, 1642, (20130357), (2014).
      • , Genetic information transfer promotes cooperation in bacteria, Proceedings of the National Academy of Sciences, 111, 30, (11103), (2014).
      • , Evolutionary dynamics of n-player games played by relatives, Philosophical Transactions of the Royal Society B: Biological Sciences, 369, 1642, (20130359), (2014).
      • , Kim Sterelny, Richard Joyce, Brett Calcott and Ben Fraser (eds): Cooperation and Its Evolution, Acta Biotheoretica, 62, 1, (109), (2014).
      • , Using genomic tools to maintain diversity and fitness in conservation programmes, Molecular Ecology, 22, 24, (6091-6099), (2013).
      • , Edward O. Wilson and the Organicist Tradition, Journal of the History of Biology, 46, 4, (599), (2013).
      • , Biological information: why we need a good measure and the challenges ahead, Interface Focus, 3, 6, (20130030), (2013).
      • , MULTILEVEL SELECTION WITH KIN AND NON‐KIN GROUPS, EXPERIMENTAL RESULTS WITH JAPANESE QUAIL (COTURNIX JAPONICA), Evolution, 67, 6, (1598-1606), (2013).
      • , The green beards of language, Ecology and Evolution, 3, 4, (1104-1112), (2013).
      • , Plants are least suppressed by their frequent neighbours: the relationship between competitive ability and spatial aggregation patterns, Journal of Ecology, 101, 5, (1313-1321), (2013).
      • , Natural selection. VII. History and interpretation of kin selection theory, Journal of Evolutionary Biology, 26, 6, (1151-1184), (2013).
      • , Task specialization in two social spiders, tegodyphus sarasinorum (Eresidae) and nelosimus eximius (Theridiidae), Journal of Evolutionary Biology, 26, 1, (51-62), (2012).
      • , Can natural selection favour altruism between species?, Journal of Evolutionary Biology, 26, 9, (1854-1865), (2013).
      • , Kin recognition and competition in plants, Functional Ecology, 27, 4, (898-906), (2013).
      • , Unity and disunity in the search for a unified reproductive skew theory, Animal Behaviour, 85, 6, (1137), (2013).
      • , Adaptive Dynamics with Interaction Structure, The American Naturalist, 181, 6, (E139), (2013).
      • , Altruism Can Proliferate through Population Viscosity despite High Random Gene Flow, PLoS ONE, 8, 8, (e72043), (2013).
      • , Comprehensive analysis of the HEPN superfamily: identification of novel roles in intra-genomic conflicts, defense, pathogenesis and RNA processing, Biology Direct, 8, 1, (2013).
      • , Disentangling Direct and Indirect Fitness Effects of Microbial Dormancy, The American Naturalist, 182, 2, (147), (2013).
      • , Adaptation and Inclusive Fitness, Current Biology, 23, 13, (R577), (2013).
      • , The Altruism Paradox: A Consequence of Mistaken Genetic Modeling, Biological Theory, 8, 1, (103), (2013).
      • , Group Selection, Brenner's Encyclopedia of Genetics, 10.1016/B978-0-12-374984-0.00663-X, (362-364), (2013).
      • , Hamiltonian inclusive fitness: a fitter fitness concept, Biology Letters, 9, 6, (20130335), (2013).
      • , Inclusive and personal fitness in synergistic evolutionary games on graphs, Journal of Theoretical Biology, 325, (76), (2013).
      • , Ultimate explanations concern the adaptive rationale for organism design, Biology & Philosophy, 28, 5, (787), (2013).
      • , Limitations of inclusive fitness, Proceedings of the National Academy of Sciences, 110, 50, (20135), (2013).
      • , Acyl-Homoserine Lactone Quorum Sensing: From Evolution to Application, Annual Review of Microbiology, 67, 1, (43), (2013).
      • , Social Revolution, Biology & Philosophy, 27, 4, (571), (2012).
      • , Clash of the Titans, BioScience, 62, 11, (987), (2012).
      • , Evolving cooperation, Journal of Theoretical Biology, 299, (1), (2012).
      • , THE EFFECTS OF KIN SELECTION ON RATES OF MOLECULAR EVOLUTION IN SOCIAL INSECTS, Evolution, 66, 7, (2080-2093), (2012).
      • , Social systems: demographic and genetic issues, Molecular Ecology, 21, 3, (443-446), (2012).
      • , Evolution of maternal care in diploid and haplodiploid populations, Journal of Evolutionary Biology, 25, 8, (1479-1486), (2012).
      • , A conceptual model for the origin of worker behaviour and adaptation of eusociality, Journal of Evolutionary Biology, 25, 1, (1-19), (2011).
      • , The sociobiology of sex: inclusive fitness consequences of inter-sexual interactions, Philosophical Transactions of the Royal Society B: Biological Sciences, 367, 1600, (2314), (2012).
      • , Group selection and inclusive fitness are not equivalent; the Price equation vs. models and statistics, Journal of Theoretical Biology, 299, (64), (2012).
      • , Does synergy rescue the evolution of cooperation? An analysis for homogeneous populations with non-overlapping generations, Journal of Theoretical Biology, 307, (20), (2012).
      • , Promiscuity and the evolution of cooperative breeding, Proceedings of the Royal Society B: Biological Sciences, 279, 1732, (1405), (2012).
      • , An inclusive fitness analysis of synergistic interactions in structured populations, Proceedings of the Royal Society B: Biological Sciences, 279, 1747, (4596), (2012).
      • , The coevolution of toxin and antitoxin genes drives the dynamics of bacterial addiction complexes and intragenomic conflict, Proceedings of the Royal Society B: Biological Sciences, 279, 1743, (3706), (2012).
      • , Kin selection, quorum sensing and virulence in pathogenic bacteria, Proceedings of the Royal Society B: Biological Sciences, 279, 1742, (3584), (2012).
      • , Suppression of Social Conflict and Evolutionary Transitions to Cooperation, The American Naturalist, 179, 2, (293), (2012).
      • , Evolution of discrimination in populations at equilibrium between selfishness and altruism, Journal of Theoretical Biology, 313, (162), (2012).
      • , Collective action in the fraternal transitions, Biology & Philosophy, 27, 3, (363), (2012).
      • , Life: social to its core, Evolution and Human Behavior, 33, 1, (79), (2012).
      • , Kinship Shapes Affiliative Social Networks but Not Aggression in Ring-Tailed Coatis, PLoS ONE, 7, 5, (e37301), (2012).
      • , Monogamy and haplodiploidy act in synergy to promote the evolution of eusociality, Nature Communications, 2, (397), (2011).
      • , Evolution in structured populations: beyond the kin versus group debate, Trends in Ecology & Evolution, 26, 4, (193), (2011).
      • , Group selection and kin selection: formally equivalent approaches, Trends in Ecology & Evolution, 26, 7, (325), (2011).
      • , A formal theory of the selfish gene, Journal of Evolutionary Biology, 24, 8, (1801-1813), (2011).
      • , The validity and value of inclusive fitness theory, Proceedings of the Royal Society B: Biological Sciences, 278, 1723, (3313), (2011).
      • , A Test of Evolutionary Policing Theory with Data from Human Societies, PLoS ONE, 6, 9, (e24350), (2011).
      • , The Meaning of Death: Evolution and Ecology of Apoptosis in Protozoan Parasites, PLoS Pathogens, 7, 12, (e1002320), (2011).
      • , Kin selection under blending inheritance, Journal of Theoretical Biology, 284, 1, (125), (2011).
      • , Direct fitness for dynamic kin selection, Journal of Evolutionary Biology, 24, 7, (1598-1610), (2011).
      • , Theory of Cooperation, eLS, (1-8), (2016).
      • , Cheating, facilitation and cooperation regulate the effectiveness of phage‐encoded exotoxins as antipredator molecules, MicrobiologyOpen, e00636, (2018).
      • , The levels of selection debate: taking into account existing empirical evidence, Acta Biológica Colombiana, 10.15446/abc.v21n3.54596, 21, 3, (467), (2016).
      • , The Nature of Programmed Cell Death, Biological Theory, 10.1007/s13752-018-0311-0, (2018).
      • , Clonality and non-linearity drive facultative-cooperation allele diversity, The ISME Journal, 10.1038/s41396-018-0310-y, (2018).
      • , Can Hamilton's rule be violated?, eLife, 10.7554/eLife.41901, 7, (2018).
      • , The causes of evolvability and their evolution, Nature Reviews Genetics, 10.1038/s41576-018-0069-z, (2018).