The general version of Hamilton’s rule
Figures

Two Price-like equations and two models.
In Model A, the number of offspring follows a binomial distribution with an expected number of offspring of . This means that the model can be summarized as (see Detailed Calculations 1.6 at the end of Appendix 1 for details). For the transition depicted in panels a and c, we generated an offspring generation using Model A, with and . Combining the Generalized Price equation in regression form with Model A includes choosing and so that they minimize the sum of squared differences between and . In Model B, the number of offspring follows a binomial distribution with an expected number of offspring of , which means that . For the transition depicted in panels b and d, we generated an offspring generation using Model B, with , , and . Combining the Generalized Price equation in regression form with Model B includes choosing , , and so that they minimize the sum of squared differences between and . Reproduction is asexual, so parents and their offspring are always identical, and by definition. For both of these models, we started with a population consisting of 2500 parents for each p-score, ranging from 0 to 1 in increments of 0.1. The four panels represent the four combinations of the two Price-like equations and the two datasets. The red lines in all panels represent the estimated fitnesses as a function of the p-score, as implied by the respective Price-like equations. The aim of this example is not to show that the estimated fitnesses from Price-like equation A match the data generated by Model A, and the estimated fitnesses from Price-like equation B match the data generated by Model B, but the estimated fitnesses from Price equation A do not match the data generated by Model B; that part is obvious. The purpose of this example, instead, is to illustrate that both Price-like equations remain identities, also when they are overspecified (panel c) or underspecified (panel b) with respect to the transition between parent and offspring population they are applied to. With underspecification, it is also visually clear that the estimated fitnesses do not match the data, even though Price-like equation A remains an identity, also when combined with data generated by Model B.
-
Figure 1—source data 1
Simulation data generated by a simple computer algorithm.
- https://cdn.elifesciences.org/articles/105065/elife-105065-fig1-data1-v1.xlsx

Dependence on the parent population.
We assume that the population is infinitely large, so that the average fitness matches the expected values. On the left, a parent population, all members of which either have a p-score of 0 or a p-score of . With expected fitnesses belonging to these p-scores of 0 and , respectively, that results in a of 1. On the right, a parent population, all members of which either have a p-score of , or a p-score of 1. With expected fitnesses belonging to these p-scores of and 2, respectively, that results in a of 3. In the literature, the dependence of the on the parent population is sometimes referred to as dynamical insufficiency. That, however, is not what this is in this case, as here, this is really a symptom of misspecification. If the model that generated the data would have been linear (model A), then, absent the noise, the would have been the same, regardless of the composition of the parent population. Dynamic (in)sufficiency is discussed in Section 6 of this appendix.

A graphical representation of the life cycle for the equilibrium state of the population.
Step 1: differential survival. All individuals in the parent generation with a p-score of 0 die, and half of those with a p-score of 1 do. None of the parents with a p-score of die, so the heterozygote has the highest fitness. Step 2: mothers and fathers. Half of all surviving parents are female, and half are male. Step 3 and 4: random matching. The parents are randomly matched. Step 5: fair meiosis. In each pair, the offspring inherits one allele from either parent. Step 6: scaling up. Parent pairs have on average three kids.

Nested models and their Price-like equations.
There are different models, and each model has its own Price-like equation. These Price-like equations are general, in the sense that they can be written for any dataset, regardless of the underlying data generating process or, in a theory context, for any model. The terms in it, however, only have a meaningful interpretation if the data are generated by a model in the set that the Price-like equation belongs to. In line with the setup in this appendix, set A would represent models that are linear in the p-score (). This is the set of models for which the regression coefficient in the original Price equation in regression form has a meaningful interpretation. Set B could consist of models that are quadratic in the p-score (), set C could consist of models that also include a coefficient for the p-score to the power 3 (), set D models that are linear in the p-score and the q-score (), and set E models that include an interaction term between the p-score and the q-score (). The Price-like equations for these different models are all different.

Symptoms of underspecification.
The red bars indicate frequencies of parents with a p-score of 0, and with 0, 1, …, 10 offspring for the example described in the text. The blue bars indicate frequencies of parents with a p-score of 1, and with 0, 1, …, 10 offspring. The composition of the parent population differs between the panels; in the top panel, 1 out of 4 parents has a p-score of 1; in the middle panel, this is 2 out of 4 parents; and in the bottom panel, it is 3 out of 4 parents. For the correctly specified model , the red and the blue dotted lines indicate the values for and for , if those are chosen so as to minimize the sum of squared errors relative to the correct model. The is the expected value of the number of offspring for parents with a p-score of 0, while is the expected value of the number of offspring for parents with a p-score of and 1. The and that minimize the sum of squared errors do not depend on the composition of the population. For the mis-specified model , the black dotted lines indicate the ’s for which the sum of squared errors is minimized. These do move around. The sum of squared errors when using the mis-specified model is also much larger at the minimum.

Fitnesses for Model 2.
On the left, , and the likelihood of being matched with a cooperator is the same for cooperators and defectors, and equal to the frequency of cooperators. On the right, , where the probability of being matched with a cooperator is for cooperators, and for defectors, where is the frequency of cooperators.

Fitnesses for Model 3.
On the left,, and on the right, . The lines are still straight, but no longer parallel.

Applying the Generalized Price equation for Model 2 to Model 3.
For this figure, we chose Model 3 with , , and . The red lines are (the lower one), and (the higher one), which, when applied to Model 2, would have been the effect on the individual itself and the effect on the partner. The average effect on the individual itself is the lower unbroken blue line, which is accompanied by the average effect for the defectors (the dashed line below it) and the average effect for cooperators (the dashed line immediately above it). The average effect on the partner is the higher unbroken blue line, which is accompanied by the average effect for the defectors (the dashed line immediately below it) and the average effect for cooperators (the dashed line above it).
Tables
Three rules and three models.
This table gives all combinations of the three rules and the three models discussed in the example. All rules indicate the direction of selection correctly for all models. Yellow indicates a combination of a rule and a model, where the rule is more general than is needed for the model. This leads to one or more ’s being 0. These are relatively harmless overspecifications. Red indicates a combination of a rule and a model, where the rule is not general enough (underspecified) for the model. This leads to ’s and ’s that depend on the population state. Terms that depend on the population state are abbreviated as follows: , , , and (see Detailed Calculations 2.3 at the end of Appendix 2 for calculations). Rule 1 is the standard rule for non-social evolution for linear non-social traits. Rule 2 is the classical Hamilton’s rule. Rule 3 is Queller’s rule (Queller, 1985), which is a rule that allows for an interaction effect. Rule 1 is nested in Rule 2, which is nested in Rule 3, which can be nested in more general rules as well.
Model 1: | Model 2: | Model 3: | ||
---|---|---|---|---|
Rule 1: | ||||
Rule 2: | ||||
Rule 3: | ||||
Four transitions and the values for the terms in the Price equation.
This is an overview of all terms that are in the different versions of the Price equation.
Parent 1 | Parent 2 | |||||||
---|---|---|---|---|---|---|---|---|
Transition 1 | ||||||||
Transition 2 | ||||||||
Transition 3 | ||||||||
Transition 4 | ||||||||
Average |
Three rules and three models.
This table shows all combinations of the three rules and the three models. All rules indicate the direction of selection correctly for all models. Yellow indicates a combination of a rule and a model, where the rule is more general than is needed for the model. This leads to one or more ’s being 0. Red indicates a combination of a rule and a model, where the rule is not general enough for the model. This leads to one or more ’s and ’s that depend on the population state. Terms that depend on the population state are abbreviated as follows: , , , and . Rule 1 is the standard rule for non-social traits. Rule 2 is the classical Hamilton’s rule. Rule 3 is a rule that allows for an interaction effect. An appropriate name for this rule would be Queller’s rule (Queller, 1985). This rule can in turn be nested in a sequence of ever more general rules if we allow for p- and q-scores that are not restricted to be binary.
Model 1: | Model 2: | Model 3: | ||
---|---|---|---|---|
Rule 1: | ||||
Rule 2: | ||||
Rule 3: | ||||