Phylogenetic divergence of cell biological features
Abstract
Most cellular features have a range of states, but understanding the mechanisms responsible for interspecific divergence is a challenge for evolutionary cell biology. Models are developed for the distribution of mean phenotypes likely to evolve under the joint forces of mutation and genetic drift in the face of constant selection pressures. Mean phenotypes will deviate from optimal states to a degree depending on the effective population size, potentially leading to substantial divergence in the absence of diversifying selection. The steady-state distribution for the mean can even be bimodal, with one domain being largely driven by selection and the other by mutation pressure, leading to the illusion of phenotypic shifts being induced by movement among alternative adaptive domains. These results raise questions as to whether lineage-specific selective pressures are necessary to account for interspecific divergence, providing a possible platform for the establishment of null models for the evolution of cell-biological traits.
https://doi.org/10.7554/eLife.34820.001
eLife digest
When most people think about evolution, they commonly think of natural selection: the evolutionary force that helps populations to develop toward an optimum state for their environment. The observable traits and features of a cell or organism are known as its phenotype. Under natural selection, genes that produce phenotypes that help a cell or organism to thrive and reproduce are more likely to be passed on to future generations. This means that over several generations the population becomes – on average – better adapted to its environment.
Other ‘nonadaptive’ evolutionary forces also influence phenotype. For example, damage to DNA can introduce mutations into the genes that a cell or organism passes on to their offspring. Some mutations are more likely to produce working variants of a gene than others; this is known as a mutation bias. In addition, even in the absence of natural selection, the proportion of particular gene variants in a population changes over the generations because genes are randomly transmitted and not all individuals reproduce. This is known as genetic drift. Together, mutation bias and genetic drift could prevent a population’s average phenotype from reaching an optimal state.
Lynch has now developed mathematical models that describe how certain biological features of cells – such as the structure of the proteins they produce – are likely to evolve due to mutation bias and genetic drift. These models show that these evolutionary processes can cause the features of the cells in a population to diversify, which often leads to a suboptimal average phenotype. Lynch calculated that two alternative phenotypes could even emerge in isolated populations in cases where there is only one optimum phenotype. For example, a mutation bias could drive some cells in one population to evolve one phenotype, while natural selection drives another population towards the other phenotype.
Overall, the model emphasizes that natural selection is not the only force that drives diversity in cells. Future research into cell biology needs to take a broad view of the joint roles played by natural selection, mutation bias and genetic drift.
https://doi.org/10.7554/eLife.34820.002
Introduction
As with nearly all biological traits, most cellular features vary among individuals within populations in a nearly continuous fashion, owing to genetic differences among individuals and the myriad of stochastic factors experienced by all organisms (ranging from intrinsic cellular noise to external environmental forces; Lynch and Walsh, 1998). This is true, for example, for catalytic rates, rates of gene expression and intracellular transport, numbers and sizes of organelles, etc. Ultimately, some fraction of within-species genetic variation is transformed into among-species divergence as alternative alleles arise by mutation and in some cases proceed to fixation (Wright, 1969; Walsh and Lynch, 2018). The magnitude of such divergence is dictated by three major evolutionary factors: the pattern of selection (the phenotypic fitness function), which imposes a directional and/or stabilizing force on the mean phenotype; the rate of origin and distribution of mutational effects, which define the raw materials upon which natural selection operates; and the power of random genetic drift, which imposes noise on the selective process.
Although considerable effort has been devoted to understanding the divergence of mean phenotypes among lineages (Walsh and Lynch, 2018), most of this work is focused on the evolution of morphological phenotypes in response to external pressures, which can vary greatly depending on the ecological setting. In contrast, owing to homeostatic effects, the internal environment of cells remains largely constant over long time scales and broad geographic locations, raising the possibility of establishing general evolutionary principles that transcend the imposition of transient ecological changes. (The same might be true for the internal organs of multicellular species).
The goal here is to derive general expressions for the divergence of mean phenotypes among species under scenarios that are likely to hold for a wide variety of cellular traits. The specific focus will be on the magnitude of divergence expected among lineages in the face of identical evolutionary forces, as this helps clarify the degree to which phenotypic diversification can proceed in the absence of lineage-specific selection pressures. Such a perspective is essential to establishing the degree to which adaptive explanations need to be sought to explain patterns of variation among populations.
The general approach will draw from well-established constructs employed in the field of quantitative genetics (the study of continuously distributed traits with a multifactorial genetic basis; Lynch and Walsh, 1998; Walsh and Lynch, 2018). The traditional focus of this field has been on complex traits in multicellular species, but these same methods can be profitably applied to intracellular morphological and molecular features, such as those involved in the cytoskeleton, gene expression, binding energy, and metabolic rates (Nourmohammad et al., 2013; Farhadifar et al., 2015; Phillips and Bowerman, 2015). Indeed, although most work in phenotypic evolution proceeds as though cellular details are irrelevant, the models employed may be equally if not more relevant to cell-biological traits, owing to their potentially less temporally variable fitness effects.
Theory
The distribution of mean phenotypes
All genetically encoded traits are subject to the recurrent forces of mutation and random genetic drift, and potentially to selection. Selection favors some genotypes over others, while mutation modifies existing genotypes independent of the selective process, and random genetic drift causes stochastic variation in gene transmission across generations. Owing to this latter factor, even if the forces of selection and mutation remain constant, the population mean phenotype of a trait will wander within a certain range over evolutionary time, with the frequency of occurrence of alternative mean phenotypes depending on patterns and strengths of selective and mutational effects (Figure 1).
The focus of this study, the stationary distribution of mean phenotypes, can be viewed as a summary distribution of: (1) phenotypic means across a large number of replicate populations exposed to identical conditions for a very long period; or (2) a historical survey of mean phenotypes in a single population over a long time period, again under constant environmental and population-genetic conditions. Among many other applications, such an approach has long been exploited in attempts to understand the steady-state distribution of allele frequencies expected under a constant regime of selection, mutation, and random genetic drift (e.g. Wright, 1969). From an empirical perspective, this steady-state view of evolution implicitly assumes that enough time has elapsed between observed taxa that the dynamics of the evolutionary process are of negligible significance (which would not be the case for closely related species).
The approach taken here relies on the Kolmogorov forward equation for a diffusion process (Appendix 1, Walsh and Lynch, 2018), the assumption being that the trait of interest is continuously distributed, with $z$ denoting the phenotypic value of an individual. The population mean, $\overline{z},$ moves in arbitrarily small increments each generation via the deterministic forces of selection and mutation and the stochastic process of drift. Under most reasonable biological conditions, independent of the starting conditions, a stationary distribution of mean phenotypes (among hypothetical replicate populations) is eventually converged upon, at which point there is an exact balance between opposing forces. The probability that a population’s mean phenotype will reside at any particular point is defined by this distribution, which has the general form
$$\Phi(\overline{z}) = \frac{C}{V(\overline{z})}\exp\left\{2\int^{\overline{z}}\frac{M(x)}{V(x)}\,dx\right\} \tag{1a}$$
where $M\left(x\right)$ defines the rate of directional change (resulting from selection and/or mutation) for a population with mean phenotype $x$, and $V\left(x\right)$ is the variance in change (resulting from drift). $C$ is the normalization constant (containing only terms that are independent of $\overline{z}$) that ensures that the entire probability density sums to 1.0.
For a quantitative trait, the directional term can be subdivided into independent selection and mutation components, ${M}_{s}\left(x\right)$ and ${M}_{m}\left(x\right)$, both of which will be discussed in detail below. Under the assumption of negligible genotype $\times $ environment interaction and epistasis, the variance of the change in means, which results from the sampling of heritable genotypic values of individuals, is equal to the underlying additive genetic variance for the trait, ${\sigma}_{A}^{2},$ divided by the effective population size, ${N}_{e}$, in the case of haploidy (assumed here; and $2{N}_{e}$ in the case of diploidy). The effective population size is typically far below the number of reproductive individuals in the population and is defined by various demographic features and by interference imposed by chromosomal linkage, with values ranging from $\sim {10}^{5}$ for multicellular eukaryotes to $\sim {10}^{9}$ for bacteria (Charlesworth, 2009; Lynch et al., 2016; Walsh and Lynch, 2018).
Individual phenotypes are composed of the sum of a heritable additive genetic component ($A$) and a non-heritable residual deviation ($e$, which includes environmental and non-additive genetic effects), such that $z=A+e,$ with the within-population phenotypic variance being partitioned as ${\sigma}_{z}^{2}={\sigma}_{A}^{2}+{\sigma}_{e}^{2}.$ For cellular features, a large fraction of ${\sigma}_{e}^{2}$ may be a consequence of stochastic gene expression, imprecise placement of cell-division septa, etc. Assuming that both ${\sigma}_{A}^{2}$ and ${N}_{e}$ remain constant, which is the model adhered to here, Equation (1a) can be rewritten as
$$\Phi(\overline{z}) = C\cdot\exp\left\{\frac{2N_e}{\sigma_A^2}\int^{\overline{z}}M_s(x)\,dx\right\}\cdot\exp\left\{\frac{2N_e}{\sigma_A^2}\int^{\overline{z}}M_m(x)\,dx\right\} \tag{1b}$$
showing that the stationary distribution of mean phenotypes (conditional on a particular level of genetic variance, a point that will be returned to below) is proportional to the product of the distributions expected under selection alone and under mutation alone. With extremely weak selection, ${M}_{s}\left(x\right)$ would be essentially a flat function, with the overall distribution reflecting the biases due to mutation alone. Conversely, with a flat mutation function, an unlikely scenario, the distribution will follow that expected under selection alone.
The process of selection
The influence of selection on the mean phenotype (the response to selection) is embodied in the breeder’s equation,
$$\Delta\overline{z}(t) = h^2\left[\overline{z}_s(t)-\overline{z}(t)\right] \tag{2}$$
a general statement about the connection between directional selection within generations and the transmission of such change across generations (Walsh and Lynch, 2018). Here, $\overline{z}\left(t\right)$ and $\overline{z}_{s}\left(t\right)$ denote the mean phenotypes before and after selection in generation $t$, the difference being the selection differential. The heritability of the trait, ${h}^{2}={\sigma}_{A}^{2}/{\sigma}_{z}^{2}$, which equals the proportion of the total phenotypic variance, ${\sigma}_{z}^{2}$, associated with additive genetic variation, ${\sigma}_{A}^{2},$ constitutes the fraction of the within-generation change in the mean transmitted to the next generation.
Critical to everything that follows, the selection differential can be described in terms of the within-population phenotype distribution, $p(z,t)$, and the function relating individual fitness to phenotype, $W\left(z\right)$. The mean fitness in generation $t$ is
$$\overline{W}(t) = \int W(z)\,p(z,t)\,dz \tag{3}$$
The mean phenotype after selection (but before inheritance) is then obtained by weighting the pre-selection phenotypes by their relative fitnesses,
$$\overline{z}_s(t) = \frac{1}{\overline{W}(t)}\int z\,W(z)\,p(z,t)\,dz \tag{4}$$
We will make use of the fact that most quantitative traits have an approximately normal phenotype distribution on some scale of measurement, which follows from the central limit theorem (Lynch and Walsh, 1998). The distribution of individual measures is therefore described completely by the phenotypic mean and variance,
$$p(z,t) = \frac{1}{\sqrt{2\pi\sigma_z^2}}\exp\left\{-\frac{[z-\overline{z}(t)]^2}{2\sigma_z^2}\right\} \tag{5}$$
Substituting Equation (5) into (3) and differentiating, the change in mean fitness with respect to mean phenotype is
$$\frac{\partial\overline{W}}{\partial\overline{z}(t)} = \frac{1}{\sigma_z^2}\int\left[z\,W(z)\,p(z,t)-\overline{z}(t)\,W(z)\,p(z,t)\right]dz \tag{6}$$
(Lande, 1976). From Equation (4), the first term to the right of the integral is equal to $\overline{z}_{s}\left(t\right)\cdot \overline{W}$, and the second term is $\overline{z}\left(t\right)\cdot \overline{W}$. This provides a direct link to Equation (2), which upon rearrangement becomes
$$M_s(\overline{z}) = \sigma_A^2\,\frac{\partial\ln\overline{W}}{\partial\overline{z}} \tag{7}$$
This expression states that, provided the phenotype distribution is normal, the change in mean phenotype caused by selection is equal to the product of the genetic variance for the trait and the gradient in the logarithm of mean fitness with respect to mean phenotype. Evolution by natural selection comes to a standstill when there is no genetic variance for the trait or the phenotypic mean resides at a point where the slope of the function of mean fitness with respect to mean phenotype is zero. To endow this expression with practical utility, specific expressions for the fitness function, $W\left(z\right),$ will be considered below.
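The equivalence described above can be checked numerically. The following is a minimal sketch, assuming normally distributed phenotypes and, purely for illustration, a Gaussian fitness function with optimum 0 and width 2; the function names, parameter values, and the midpoint integration scheme are all hypothetical choices rather than anything taken from the source. It compares the breeder's-equation response, $h^2(\overline{z}_s-\overline{z})$, with the additive variance times the gradient of log mean fitness.

```python
import math

def normal_pdf(z, mean, var):
    """Normal density for the within-population phenotype distribution."""
    return math.exp(-(z - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def gaussian_fitness(z, theta_s=0.0, omega=2.0):
    """Illustrative fitness function (assumed values for optimum and width)."""
    return math.exp(-(z - theta_s) ** 2 / (2.0 * omega ** 2))

def mean_fitness_and_selected_mean(z_bar, var_z, fitness, half_width=10.0, steps=4000):
    """Midpoint-rule estimates of mean fitness and the mean phenotype after selection."""
    sd = math.sqrt(var_z)
    lo = z_bar - half_width * sd
    dz = 2.0 * half_width * sd / steps
    w_sum = 0.0
    zw_sum = 0.0
    for i in range(steps):
        z = lo + (i + 0.5) * dz
        weight = fitness(z) * normal_pdf(z, z_bar, var_z) * dz
        w_sum += weight
        zw_sum += z * weight
    return w_sum, zw_sum / w_sum

def response_breeder(z_bar, var_a, var_z, fitness):
    """Heritability times the selection differential."""
    _, z_bar_s = mean_fitness_and_selected_mean(z_bar, var_z, fitness)
    return (var_a / var_z) * (z_bar_s - z_bar)

def response_gradient(z_bar, var_a, var_z, fitness, eps=1e-4):
    """Additive variance times a central-difference gradient of log mean fitness."""
    w_hi, _ = mean_fitness_and_selected_mean(z_bar + eps, var_z, fitness)
    w_lo, _ = mean_fitness_and_selected_mean(z_bar - eps, var_z, fitness)
    return var_a * (math.log(w_hi) - math.log(w_lo)) / (2.0 * eps)

breeder = response_breeder(1.5, 0.3, 1.0, gaussian_fitness)
gradient = response_gradient(1.5, 0.3, 1.0, gaussian_fitness)
print(breeder, gradient)
```

For these assumed values the two routes should agree to numerical precision, and both should match the closed-form Gaussian result $-\sigma_A^2(\overline{z}-\theta_s)/(\omega^2+\sigma_z^2)$.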
The process of mutation
Most attempts to consider the long-term evolutionary features of quantitative traits have assumed one of two mutation models: (1) a distribution of mutational effects always having a mean equal to zero and a constant variance, independent of the starting genotype (Kimura, 1965; Lande, 1975; Lynch and Hill, 1986); or (2) a rate of appearance of each type of mutant allele being independent of the ancestral type (Cockerham, 1984; Turelli, 1984). Under the first scenario, mutation has no directional effect on the mean phenotype, and there are no bounds on the possible mutational effects or the physical limits to which the trait can evolve. Under the second scenario, there is a physical limit to phenotypic divergence, and because the directional effect of mutations depends on the current location, more extreme alleles generate mutations with effects biased back toward the center of the distribution.
Neither of these mutational schemes captures the features of a wide variety of cell biological traits, which often have finite numbers of possible states and state-dependent spectra of mutational effects. A few examples will suffice to make this point. Protein-protein interactions (e.g. the interfaces between dimeric molecules) typically depend on no more than a few dozen amino-acid sites. The same is true for intramolecular interactions such as the constellation of backbone residues that assemble during protein folding. In both cases, the underlying residues operate in an approximately binary manner, for example, hydrophobic vs. hydrophilic, or hydrogen-bonding vs. non-hydrogen-bonding. Likewise, the catalytic sites of enzymes often consist of a small-to-moderate number of residues that either facilitate or inhibit catalytic rates, and the sizes of intracellular organelles and cytoskeletal components are constrained by cell size. Many other examples could be cited, including those involved in RNA-RNA and DNA-protein interactions.
The approximate structure of a mutation function with a bounded range can be arrived at by considering a trait determined by $n$ binary factors (or sites), each with state b having effect 0, and state B having effect $m$. For a trait with an additive genetic basis, the mean phenotype in a haploid population can then be represented as
$$\overline{z} = z_0 + nm\overline{q} \tag{8}$$
where ${z}_{0}$ is an arbitrary baseline value for the trait, and $\overline{q}$ is the mean frequency of Btype alleles averaged over all $n$ factors in the population (Lynch and Walsh, 1998).
Letting $u$ be the mutation rate from B to b alleles, and $v$ be the reciprocal rate, the per-generation change in the mean phenotype resulting from mutation is
$$M_m(\overline{z}) = nm\left[v(1-\overline{q})-u\overline{q}\right] \tag{9}$$
With $\widehat{q}=v/(u+v)$ being the equilibrium frequency of B alleles under mutation pressure alone, and ${\theta}_{m}={z}_{0}+nm\widehat{q}$ being the expected mean phenotype under neutrality, Equation (9) further reduces to
$$M_m(\overline{z}) = -(u+v)(\overline{z}-\theta_m) \tag{10}$$
This expression is quite general in that $(\overline{z}-{\theta}_{m})$ is simply the distance of the mean phenotype from that expected under mutation equilibrium, and $(u+v)$ is a measure of the mutational restoring force per locus. The essential feature of Equation (10) is that mutation acts to reduce the distance between the mean phenotype and ${\theta}_{m}$ to a degree that depends on the magnitude of this deviation. Charlesworth (2013) implemented a similar mutation model in an investigation of genomic features.
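The restoring behaviour of this two-state mutation model can be illustrated with a short sketch. It assumes the allelic form of the mutational change, $nm[v(1-\overline{q})-u\overline{q}]$, which follows from the definitions of $u$, $v$, and $\overline{q}$ given above, and compares it with the linear restoring form $-(u+v)(\overline{z}-\theta_m)$. The function names are hypothetical, and the mutation rates are deliberately unrealistic (far too large) so that convergence is visible in a few thousand generations.

```python
def mutational_change_allelic(q_bar, n, m, u, v):
    """Change in the mean from mutation: gain of B alleles minus loss of B alleles."""
    return n * m * (v * (1.0 - q_bar) - u * q_bar)

def mutational_change_restoring(z_bar, theta_m, u, v):
    """Same change written as a linear pull toward the neutral expectation theta_m."""
    return -(u + v) * (z_bar - theta_m)

# Illustrative (assumed) parameter values.
n, m, z0 = 20, 0.5, 1.0
u, v = 3e-3, 1e-3                    # B -> b and b -> B rates, exaggerated for speed
q_hat = v / (u + v)                  # equilibrium frequency of B under mutation alone
theta_m = z0 + n * m * q_hat         # neutral expectation for the mean phenotype

q_start = 0.9
z_start = z0 + n * m * q_start
allelic = mutational_change_allelic(q_start, n, m, u, v)
restoring = mutational_change_restoring(z_start, theta_m, u, v)

# Iterate the mean under mutation pressure alone (no selection, no drift).
z = z_start
for _ in range(5000):
    z += mutational_change_restoring(z, theta_m, u, v)

print(allelic, restoring, z, theta_m)
```

The two forms give the same per-generation change, and the iterated mean decays geometrically toward $\theta_m$ at rate $(u+v)$ per generation.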
The stationary distribution of mean phenotypes
Application of Equations (7) and (10) to (1b) yields a useful simplification of the stationary distribution that will be adhered to below,
$$\Phi(\overline{z}) = C\cdot\overline{W}^{\,2N_e}\cdot\exp\left\{-\frac{(\overline{z}-\theta_m)^2}{2\sigma_N^2}\right\} \tag{11}$$
with ${\sigma}_{N}^{2}={\sigma}_{A}^{2}/\left[2{N}_{e}(u+v)\right]$. As will be discussed below, under neutrality, the genetic variance ${\sigma}_{A}^{2}$ often scales directly with ${N}_{e}$, and population size would have no influence on the distribution in this limiting case, as ${\sigma}_{N}^{2}$ would be independent of ${N}_{e}$. More generally, ${\sigma}_{A}^{2}$ is also a function of the intensity of selection, but the bulk of the steady-state distribution will be represented by mean phenotypes that are in the range of effective neutrality with respect to each other, so the scaling relationship of ${\sigma}_{A}^{2}$ under neutrality is expected to be a reasonable first-order approximation.
Equation (11) shows that, provided the genetic variance remains roughly constant, the stationary distribution is equal to the product of the expectation under neutrality (where mutation and drift are the only operable evolutionary forces) and the mean fitness function exponentiated by $2{N}_{e},$ that is, the stationary distribution is equivalent to a transformation of the neutral expectation by a function of the fitness landscape. Thus, to obtain the overall distribution in the following applications, we require an expression for mean population fitness in terms of the trait mean.
In what follows, insight into the approximate magnitude of ${\sigma}_{N}^{2}$ will be useful. This can be achieved by noting that $2{N}_{e}(u+v)$ will have values of the order of magnitude of $4{N}_{e}\mu $, where $\mu $ is the mutation rate per nucleotide site. This composite parameter is equivalent to the amount of standing heterozygosity at neutral nucleotide sites in natural populations under mutation-drift equilibrium, and generally ranges from 0.001 to 0.1, with the lower and higher ends of the range being typical in vertebrates and microbes, respectively (Lynch, 2007). Thus, because heritabilities (${\sigma}_{A}^{2}/{\sigma}_{z}^{2}$) of traits are typically on the order of 0.1 to 0.5 (Lynch and Walsh, 1998), ${\sigma}_{N}^{2}$ is expected to be in the range of $1\times $ to $100\times $ the average within-population phenotypic variance for the trait.
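The order-of-magnitude argument above reduces to simple arithmetic, since dividing the definition of $\sigma_N^2$ by $\sigma_z^2$ gives $\sigma_N^2/\sigma_z^2 = h^2/[2N_e(u+v)]$. The sketch below evaluates this ratio at the corners of the parameter box quoted in the text; the function name is hypothetical, and treating the two quoted ranges as independent is an assumption made only for illustration.

```python
def neutral_variance_ratio(heritability, two_ne_mutation):
    """sigma_N^2 / sigma_z^2 = h^2 / [2 N_e (u + v)]."""
    return heritability / two_ne_mutation

# Ranges quoted in the text: h^2 of 0.1 to 0.5; 2 N_e (u + v) of roughly 0.001 to 0.1.
corners = {
    (h2, x): neutral_variance_ratio(h2, x)
    for h2 in (0.1, 0.5)
    for x in (0.001, 0.1)
}
for (h2, x), ratio in sorted(corners.items()):
    print(f"h2={h2}, 2Ne(u+v)={x}: sigma_N^2 is about {ratio:.0f} x sigma_z^2")
```

At a heritability of 0.1 the ratio spans 1 to 100, and the higher heritability raises both ends by a factor of five, so the corner values remain of the same order of magnitude as the range stated above.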
Selection for an intermediate optimum
A commonly assumed form of selection, probably relevant to many cellular features, is the Gaussian (bell-shaped) fitness function with an intermediate optimum phenotype, ${\theta}_{s}$, and a width, $\omega $, determining the strength of selection around the optimum,
$$W(z) = \exp\left\{-\frac{(z-\theta_s)^2}{2\omega^2}\right\} \tag{12}$$
Application of this expression to Equations (3) and (4) leads to the expression for mean population fitness, which when applied to Equation (7) yields the expression for ${M}_{s}\left(\overline{z}\right)$ necessary for obtaining the stationary distribution (Table 1). The latter expression shows that the change in the mean phenotype resulting from selection is directly proportional to the deviation of the current mean phenotype from the optimum and inversely proportional to the sum of the squared width of the fitness function and the total phenotypic variance (Lande, 1976). As will be seen repeatedly below, phenotypic variance (an inevitable consequence of external environmental and internal cellular effects) generally reduces the efficiency of selection by diminishing the correspondence between genotype and phenotype. If the mean phenotype were to evolve to the optimum, $\overline{z}={\theta}_{s}$, which is highly unlikely with biased mutation pressure, selection would be purely stabilizing in nature, operating only to reduce the variation around the mean.
With both the selection and mutation terms in Equation (11) being Gaussian functions, the product is also Gaussian (Lande, 1976), in this case leading to a stationary distribution of mean phenotypes
$$\Phi(\overline{z}) = \frac{1}{\sqrt{2\pi\sigma^2(\overline{z})}}\exp\left\{-\frac{[\overline{z}-\mu(\overline{z})]^2}{2\sigma^2(\overline{z})}\right\} \tag{13a}$$
with overall mean
$$\mu(\overline{z}) = \frac{\kappa\theta_s+\theta_m}{\kappa+1} \tag{13b}$$
and variance
$$\sigma^2(\overline{z}) = \frac{\sigma_N^2}{\kappa+1} = \frac{1}{(1/\sigma_S^2)+(1/\sigma_N^2)} \tag{13c}$$
where $\kappa ={\sigma}_{N}^{2}/{\sigma}_{S}^{2},$ with ${\sigma}_{S}^{2}=({\omega}^{2}+{\sigma}_{z}^{2})/\left(2{N}_{e}\right)$ and ${\sigma}_{N}^{2}$ (as defined above) being the variances of the contributions associated with selection and mutation.
Equation (13b) states that the grand mean is equal to a weighted average of the expectations under mutation and selection alone (each component being weighted by the inverse of the variance of the function). Equation (13c) states that the variance of means is equal to half the harmonic mean of the variances associated with selection and mutation alone. As ${\sigma}_{S}^{2}\to \infty ,$ which implies a flatter fitness function and hence an approach toward neutrality, the mean and variance converge on the expectations for a purely mutationally driven process, ${\theta}_{m}$ and ${\sigma}_{N}^{2}$. As ${\sigma}_{N}^{2}\to \infty ,$ which implies a weakening influence of mutation on the overall distribution, the mean and variance converge on the expectations for a purely selection-driven process, ${\theta}_{s}$ and ${\sigma}_{S}^{2}$.
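The verbal description above (an inverse-variance weighted mean, and a variance equal to half the harmonic mean of the two component variances) can be sketched as follows. The closed forms used here are the standard result for a product of two Gaussian kernels with variances $\sigma_S^2$ and $\sigma_N^2$ and with $\kappa=\sigma_N^2/\sigma_S^2$; the function names, the grid-based check, and all numerical values are hypothetical illustrations.

```python
import math

def stationary_gaussian(theta_s, theta_m, var_s, var_n):
    """Mean and variance of the product of the selection and mutation kernels."""
    kappa = var_n / var_s
    mean = (kappa * theta_s + theta_m) / (kappa + 1.0)   # inverse-variance weighting
    var = var_n / (kappa + 1.0)                          # equals 1 / (1/var_s + 1/var_n)
    return mean, var

def grid_moments(theta_s, theta_m, var_s, var_n, lo=-20.0, hi=20.0, steps=20000):
    """Brute-force moments of the unnormalized product, as an independent check."""
    dx = (hi - lo) / steps
    total = first = second = 0.0
    for i in range(steps):
        x = lo + (i + 0.5) * dx
        w = math.exp(-(x - theta_s) ** 2 / (2.0 * var_s)
                     - (x - theta_m) ** 2 / (2.0 * var_n))
        total += w
        first += x * w
        second += x * x * w
    mean = first / total
    return mean, second / total - mean ** 2

closed = stationary_gaussian(2.0, 0.0, 0.5, 2.0)
numeric = grid_moments(2.0, 0.0, 0.5, 2.0)
near_neutral = stationary_gaussian(2.0, 0.0, 1e12, 2.0)    # very flat fitness function
selection_only = stationary_gaussian(2.0, 0.0, 0.5, 1e12)  # negligible mutation bias
print(closed, numeric, near_neutral, selection_only)
```

With these assumed inputs the closed form and the grid estimate agree, and the two limiting calls recover $(\theta_m, \sigma_N^2)$ and $(\theta_s, \sigma_S^2)$ respectively, as the text describes.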
As can be seen from Equations (13b, c), a key determinant of the form of the stationary distribution of means is the composite parameter $\kappa ={\sigma}_{A}^{2}/\left[2(u+v)({\omega}^{2}+{\sigma}_{z}^{2})\right],$ which the following observations suggest is generally $\gg 1.$ First, the width of the fitness function $\omega $ can be expected to be generally greater than the phenotypic standard deviation ${\sigma}_{z}$, else the selective load on the trait would be enormous, and this is indeed generally observed (Walsh and Lynch, 2018). Given the range of heritability estimates noted above, this implies that the ratio ${\sigma}_{A}^{2}/({\omega}^{2}+{\sigma}_{z}^{2})$ is unlikely to be greater than 0.1 under strong selection, and can become one to two orders of magnitude smaller than 0.1 under weak selection. Second, mutation rates at the single nucleotide level are typically in the range of ${10}^{-11}$ to ${10}^{-8}$, with the former being approached in microbes and the latter in large multicellular species (Lynch et al., 2016). Thus, keeping in mind that individual targets of mutation may comprise more than single nucleotide sites, $1/\left[2(u+v)\right]$ is still likely to be in the range of ${10}^{7}$ to ${10}^{10}$. Together, these results suggest a likely range for $\kappa $ of ${10}^{4}$ to ${10}^{9},$ which simplifies Equations (13b, c) to
With these parameter values in mind, Figure 2 shows that the form of the stationary distribution varies dramatically with the value of ${\sigma}_{N}^{2}/\kappa ={\sigma}_{S}^{2}$, becoming extremely narrow and extremely flat at opposite ends of the spectrum for this key composite parameter. The degree to which ${\theta}_{m}$ deviates from ${\theta}_{s}$ for cellular features is unknown, but there is no reason to expect them to be equal. If they differ greatly, $\mu \left(\overline{z}\right)$ can substantially deviate from the optimum to a degree that depends on the weighting factor $\kappa $ (Figure 2).
Hyperbolic fitness function
Many cellular features are likely to be primarily under continuous selection for an extreme optimum, but with diminishing strength of selection as the optimum is approached. For example, many enzymes are likely to be selected for as high a catalytic rate as possible, protein structures for as high folding rates and stability as possible, binding interfaces with as high affinities as possible, etc. One way of representing this type of selection involves the hyperbolic function,
$$W(z) = 1-\alpha e^{-\beta z} \tag{15}$$
where the constants $0\le \alpha \le 1$ and $\beta \ge 0$, respectively, define the amplitude and rapidity of the fitness response to increasing $z$. Fitness is equal to $1-\alpha $ when $z=0$, and asymptotically approaches one as $z\to \infty .$
Expressions for the mean population fitness and the change in the mean resulting from selection, obtained by the procedures noted above, are provided in Table 1, and substitution of the former into Equation (11) yields the stationary distribution of mean phenotypes. Because of the asymmetry of this fitness function, the resultant distribution is no longer perfectly Gaussian, but setting $\partial \Phi \left(\overline{z}\right)/\partial \overline{z}=0$ yields an expression for the single mode of the distribution, $\widehat{z}$
with $\varphi =\alpha \exp\left({\beta}^{2}{\sigma}_{z}^{2}/2\right)$. Despite the monotonic increase in fitness with $z$, the distribution of mean phenotypes is prevented from progressive increase by the counteraction of mutation and the diffusive action of drift. Because selection is always in the positive direction, the expected mode always exceeds the neutral expectation ${\theta}_{m}$, to a degree that increases with the effective population size. Equation (16a) is readily solved numerically, but provided $\beta \hat{z}<1$, in the limit of large ${N}_{e}$,
Although the hyperbolic fitness function generates a slightly asymmetric distribution of means (with tail to the right), the bulk of the distribution is approximately normal, and an excellent approximation to the variance can be obtained from the curvature of the stationary distribution around the mode (using the negative of the inverse of the second derivative of the stationary distribution),
As in the case of the Gaussian fitness function, Equation (13c), the two terms in the denominator are respectively the inverses of the variances expected under the limits of strong selection and neutrality.
An example of the influence of population size on the stationary distribution is given in Figure 3, where there is a strong mutational bias away from the optimum. The distributions progressively move to the right with an increase in ${N}_{e}$, with the mean phenotype increasing fivefold over a three-order-of-magnitude range of ${N}_{e}$. As can be seen from Equation (16b), equal changes in either ${N}_{e}$ or the neutral variance ${\sigma}_{N}^{2}$ have identical effects on the mean, although effects on the variance are opposite in direction.
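The rightward movement of the mode with increasing ${N}_{e}$ can be reproduced with a small numerical sketch. It rests on several assumptions that are not taken verbatim from the source: the fitness function is taken to be $1-\alpha e^{-\beta z}$ (the form consistent with the stated limits of $1-\alpha$ at $z=0$ and one as $z\to\infty$); mean fitness under normally distributed phenotypes is then $1-\varphi e^{-\beta\overline{z}}$ with $\varphi$ as defined above; and the log of the stationary density is $2N_e\ln\overline{W}-(\overline{z}-\theta_m)^2/(2\sigma_N^2)$, with $\sigma_N^2$ held constant across population sizes. The function name and all parameter values are hypothetical.

```python
import math

def mode_hyperbolic(n_e, var_n, theta_m, alpha, beta, var_z):
    """Mode of the stationary distribution, found by bisection on the slope of its log."""
    phi = alpha * math.exp(beta ** 2 * var_z / 2.0)
    if phi * math.exp(-beta * theta_m) >= 1.0:
        raise ValueError("mean fitness must be positive at theta_m")

    def slope(z):
        # d/dz of [2 N_e ln(1 - phi e^{-beta z}) - (z - theta_m)^2 / (2 var_n)]
        decay = phi * math.exp(-beta * z)
        return 2.0 * n_e * beta * decay / (1.0 - decay) - (z - theta_m) / var_n

    # The slope is positive at theta_m and strictly decreasing, so bracket the root.
    step = 1.0
    while slope(theta_m + step) > 0.0:
        step *= 2.0
    lo, hi = theta_m, theta_m + step
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if slope(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Illustrative (assumed) values: weak selection, neutral expectation at zero.
population_sizes = (1e3, 1e4, 1e5, 1e6)
modes = [mode_hyperbolic(n_e, 1.0, 0.0, 1e-4, 1.0, 0.1) for n_e in population_sizes]
for n_e, z_hat in zip(population_sizes, modes):
    print(f"N_e = {n_e:.0e}: mode = {z_hat:.3f}")
```

Under these assumptions the mode always lies above $\theta_m$ and increases monotonically with ${N}_{e}$, in qualitative agreement with the behaviour described for Figure 3; the specific numbers depend entirely on the illustrative parameter choices.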
Sigmoid fitness function
Finally, we consider a variant of the fitness function just noted. With the previous fitness function, Equation (15), the selection gradient progressively declines with increasing phenotypic value over the full range of $z$, with increasing $z$ resulting in an asymptotic approach to maximum fitness. With a sigmoid fitness function, sometimes called a mesa function (Gerland and Hwa, 2002; Berg et al., 2004), there is an inflection point such that the fitness landscape becomes progressively flatter at both higher and lower values. This means that adjacent variants become increasingly similar in fitness (i.e. more neutral with respect to each other) at both extremes of the phenotype distribution.
The sigmoid fitness function for individual phenotypes can be described as
$$W(z) = \frac{1}{1+e^{-\beta(z-z^{\ast})}} \tag{17a}$$
where ${z}^{\ast}$ denotes the inflection point at which $W\left(z\right)=0.5$. This function is closely approximated by
$$W(z) \simeq \frac{1}{2}\left\{1+\mathrm{erf}\left[\frac{\sqrt{\pi}\,\beta(z-z^{\ast})}{4}\right]\right\} \tag{17b}$$
where erf is the error function (a rescaled form of the cumulative standard normal distribution), which facilitates integration with Equation (4). The resultant expression for mean population fitness is also sigmoid, but with phenotypic variance reducing the strength of selection from $\beta $ to $\beta /\gamma $, where $\gamma =\sqrt{1+({\beta}^{2}{\sigma}_{z}^{2}\pi /8)}$ (Table 1).
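The quality of the error-function approximation, and the resulting dilution of selection from $\beta$ to $\beta/\gamma$, can be examined numerically. The sketch below assumes a logistic fitness function with inflection point $z^{\ast}$ and the slope-matched approximation $\tfrac{1}{2}\{1+\mathrm{erf}[\sqrt{\pi}\,\beta(z-z^{\ast})/4]\}$, which is the form that reproduces the expression for $\gamma$ given above; the function names, integration scheme, and parameter values are hypothetical.

```python
import math

def logistic(x):
    """Sigmoid fitness as a function of x = beta * (z - z_star)."""
    return 1.0 / (1.0 + math.exp(-x))

def erf_sigmoid(x):
    """Error-function approximation with the same slope as the logistic at x = 0."""
    return 0.5 * (1.0 + math.erf(math.sqrt(math.pi) * x / 4.0))

def mean_fitness_numeric(z_bar, z_star, beta, var_z, half_width=10.0, steps=4000):
    """Midpoint-rule average of logistic fitness over a normal phenotype distribution."""
    sd = math.sqrt(var_z)
    lo = z_bar - half_width * sd
    dz = 2.0 * half_width * sd / steps
    total = 0.0
    for i in range(steps):
        z = lo + (i + 0.5) * dz
        density = math.exp(-(z - z_bar) ** 2 / (2.0 * var_z)) / math.sqrt(2.0 * math.pi * var_z)
        total += logistic(beta * (z - z_star)) * density * dz
    return total

def mean_fitness_approx(z_bar, z_star, beta, var_z):
    """Sigmoid mean fitness with selection strength reduced from beta to beta / gamma."""
    gamma = math.sqrt(1.0 + beta ** 2 * var_z * math.pi / 8.0)
    return logistic(beta * (z_bar - z_star) / gamma)

# Largest pointwise gap between the two sigmoid forms on a grid from -10 to 10.
max_gap = max(abs(logistic(k / 100.0) - erf_sigmoid(k / 100.0)) for k in range(-1000, 1001))
numeric = mean_fitness_numeric(1.0, 0.0, 2.0, 1.0)
approx = mean_fitness_approx(1.0, 0.0, 2.0, 1.0)
print(max_gap, numeric, approx)
```

The pointwise gap between the two curves stays below about 0.02 on this grid, so the approximate mean fitness differs from the numerically integrated value by at most a few percent for the assumed parameters.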
As in the case of the hyperbolic fitness function, the mesa function does not yield a perfectly Gaussian distribution of mean phenotypes, but an expression for the mode ($\widehat{z}$) can be acquired using the methods noted above,
which again has a single solution, indicating a unimodal stationary distribution. If $\beta \hat{z}/\gamma <1$, in the limit of large ${N}_{e}$,
which has a form similar to the expression noted with the hyperbolic fitness function. From the form of these equations, it can again be seen that there are several equivalent effects of the underlying parameters. For example, a doubling of ${N}_{e}$ has the same effect as a doubling of ${\sigma}_{N}^{2}$ on the mode, and a doubling of $\beta $ the same effect as a reduction in $\gamma $ by 50%. Although more complicated, the expected variance in means under the sigmoid model is similar in form to that noted above for the Gaussian and hyperbolic fitness functions,
Discussion
The preceding models are meant to provide heuristic guidance into the evolutionary mechanisms responsible for the dispersion of mean phenotypes of a diversity of subcellular and molecular features. Although such traits may sometimes be under selection for an intermediate optimum, selection may often operate in a continuous directional fashion. In either case, there are two reasons why mean phenotypes are unlikely to commonly achieve states that endow a population with maximum fitness. First, if mutation bias conflicts with the directional effects of selection, the optimum phenotype will not coincide with the mean phenotype. Second, even in the absence of mutation bias and regardless of the form of the fitness function, a drift barrier exists beyond which the gradient of the selection function is not steep enough to overcome the vagaries of genetic drift, thereby preventing further adaptive progress. Within the confines of the drift barrier, the mean phenotype will wander to a degree that depends on the strength of local patterns of mutation and selection.
These points have implications for the degree to which the ‘adaptive paradigm’ should be embraced as an explanatory framework for diversification at the cellular level. For example, with mutation bias encouraging the mean phenotype to deviate from the optimum, the result will be a population under persistent directional selection despite the existence of an attainable (but not sustainable) phenotype with maximum fitness. Even without mutation pressure and in the face of intrinsic directional selection, for example, a hyperbolic or mesa fitness function, the most common mean phenotype will not be equivalent to the optimum phenotype, and the drift barrier will ensure variation in mean phenotypes among populations exposed to identical selection pressures.
An attempt has been made to couch the stationary forms of mean-phenotype distributions in terms of underlying parameters that are at least in principle observable empirically. Consider, for example, the model for stabilizing selection for a specific optimum. From Equation (14a), the expected deviation of the mean phenotype from the optimum resulting from mutation bias is ${\theta}_{m}/\kappa ,$ which expands to ${\theta}_{m}\left[\,2(u+v)({\omega}^{2}+{\sigma}_{z}^{2})/{\sigma}_{A}^{2}\,\right],$ a somewhat complex function that may not be immediately transparent. However, a wide variety of models suggest that ${\sigma}_{A}^{2}$ scales directly with ${N}_{e}\mu $ provided selection is weak (Bürger et al., 1989; Zeng and Cockerham, 1993; Charlesworth, 2013), and because $u$ and $v$ (the forward and reverse mutation rates) are both proportional to $\mu $ (the total mutation rate per site), this implies that the average deviation of the mean from the optimum scales as ${\theta}_{m}({\omega}^{2}+{\sigma}_{z}^{2})/{N}_{e},$ or approximately as ${\theta}_{m}{\omega}^{2}/{N}_{e}$ assuming weak selection. Thus, the deviations of phenotypic means from the selective optimum are expected to be inversely proportional to ${N}_{e}$, a point also made by Charlesworth (2013) in a somewhat different analysis. Note, however, that this is only the expected pattern, as the mean phenotype is still expected to drift above and below the expectation to a degree depending on the effective strength of selection. As noted in Equation (14b), and previously pointed out by Lande (1975) and Lande (1976), the magnitude of this drift variance is also inversely proportional to ${N}_{e}$, which implies that the standard deviation with respect to the expected mean scales as $\sim 1/\sqrt{{N}_{e}}.$
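The inverse scaling with ${N}_{e}$ can be checked numerically. The following is a minimal sketch rather than a definitive implementation: it assumes the optimum sits at zero, takes ${\sigma}_{A}^{2}=c{N}_{e}\mu$ with a hypothetical proportionality constant `c`, and splits $\mu$ into forward and reverse rates in an arbitrary 3:1 ratio. All function names and parameter values are illustrative and do not come from the models above.

```python
import math

def additive_variance(n_e, mu, c=1.0):
    """Assumed weak-selection scaling sigma_A^2 = c * N_e * mu (c is hypothetical)."""
    return c * n_e * mu

def expected_deviation(theta_m, u, v, omega2, sigma_z2, sigma_a2):
    """theta_m * 2(u + v)(omega^2 + sigma_z^2) / sigma_A^2, as quoted in the text."""
    return theta_m * 2.0 * (u + v) * (omega2 + sigma_z2) / sigma_a2

def deviation_at(n_e, theta_m=1.0, mu=1e-9, omega2=100.0, sigma_z2=1.0):
    """Expected deviation of the mean from the optimum for a given N_e."""
    # Hypothetical split of the per-site rate into forward and reverse parts.
    u, v = 0.75 * mu, 0.25 * mu
    sigma_a2 = additive_variance(n_e, mu)
    return expected_deviation(theta_m, u, v, omega2, sigma_z2, sigma_a2)

if __name__ == "__main__":
    for n_e in (1e4, 1e5, 1e6):
        print(f"N_e = {n_e:.0e}: expected deviation = {deviation_at(n_e):.3e}")
```

Because $u$, $v$ and ${\sigma}_{A}^{2}$ are all proportional to $\mu$ under these assumptions, the mutation rate cancels and only ${\theta}_{m}$, ${\omega}^{2}+{\sigma}_{z}^{2}$ and ${N}_{e}$ remain, so doubling ${N}_{e}$ halves the expected deviation.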
Of course, ${\theta}_{m}$ (the mean phenotype expected under neutrality) may differ among lineages and the within-population genetic variance ${\sigma}_{A}^{2}$ is sensitive to the strength of selection, in which case detecting such relationships may be challenging. In addition, the linear scaling of ${\sigma}_{A}^{2}$ with ${N}_{e}$ is unlikely to continue indefinitely, unless ${N}_{e}$ in natural populations rarely attains levels where all constituent loci are saturated with segregating mutations. The salient issue is that the preceding expressions provide qualitative insight into the behavior of mean phenotypes in alternative population-genetic environments, while also revealing the types of measurements that need to be made if we are to understand such behavior. For example, we know essentially nothing about the key mutational (${\theta}_{m}$) and selection (${\omega}^{2}$) parameters for cell biological features and how these might vary among species. This is not a trivial issue, as the influence of both parameters in determining the most likely locations of mean phenotypes is just as central as the role played by ${N}_{e}$.
Applying the same logic to results for plateaued fitness functions leads to the prediction that the expected mode of mean phenotypes will scale fairly strongly with the effective population size, in the limit approaching proportionality to $\sqrt{{N}_{e}},$ that is, a 10-fold increase in the mean phenotype with a 100-fold increase in ${N}_{e}.$ As shown in Figure 3, a simple change in the mutational variance ${\sigma}_{M}^{2}$ (with no associated change in mutational bias) can also cause a substantial shift in the position of the mean phenotype. These sorts of observations raise the significant possibility that species with substantially different population-genetic environments may commonly exhibit measurable differences in trait means despite experiencing identical forms of directional selection, again raising challenging issues for those who wish to interpret phenotypic differences as reflections of different underlying processes of selection.
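The square-root scaling can be illustrated with a minimal sketch, assuming a hypothetical hyperbolic fitness function $W(z)=z/(K+z)$ and the heuristic that adaptive progress stalls where the fitness gradient falls to $1/{N}_{e}$. Neither this function, the constant `k`, nor the function names below are taken from the models above, and the stall point is used only as a stand-in for the mode of the stationary distribution, not as the expression derived for it.

```python
import math

def fitness(z, k=1.0):
    """Hypothetical hyperbolic fitness function W(z) = z / (k + z)."""
    return z / (k + z)

def fitness_gradient(z, k=1.0):
    """dW/dz = k / (k + z)^2, which declines steadily as z increases."""
    return k / (k + z) ** 2

def drift_barrier(n_e, k=1.0):
    """Phenotype at which dW/dz equals 1/N_e: z = sqrt(k * N_e) - k."""
    return math.sqrt(k * n_e) - k

if __name__ == "__main__":
    for n_e in (1e4, 1e6, 1e8):
        z = drift_barrier(n_e)
        print(f"N_e = {n_e:.0e}: barrier at z = {z:.1f}, W = {fitness(z):.6f}")
```

Under these assumptions the barrier sits at $\sqrt{K{N}_{e}}-K$, so for $K{N}_{e}\gg 1$ a 100-fold increase in ${N}_{e}$ moves it by very nearly a factor of 10, in line with the limiting proportionality to $\sqrt{{N}_{e}}$.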
Although the data are not extensive, several lines of evidence support the idea that the mean phenotypes of cellular attributes are indeed modulated by the power of random genetic drift. The most compelling example derives from observations on the mutation rate (per nucleotide site per generation), which scales approximately inversely with the 1000-fold range of variation in ${N}_{e}$ across the Tree of Life (Lynch et al., 2016). Such a scaling is qualitatively consistent with the drift-barrier hypothesis for mutation-rate evolution (Lynch, 2010; Lynch, 2011), which postulates that because most mutations are deleterious, selection will typically operate to improve replication fidelity, with refinements in molecular performance eventually being thwarted by random genetic drift: as the mutation rate is progressively lowered, there is less room for improvement and hence a narrower range of selectively advantageous replication-fidelity variants accessible by selection.
Enzyme efficiency provides a second broad category of traits with evolutionary behavior seemingly in accordance with the theory outlined above. For example, Bar-Even et al. (2011) have found that enzymes involved in secondary metabolism are on average $\sim 30\times$ less efficient than those involved in central metabolism, suggesting that selection operates less effectively on enzymes further removed from core energetic determinants. More directly relevant to the points made above, Bar-Even et al. (2011) also found that prokaryotic enzymes have slightly better kinetics than those from eukaryotes, as expected for species with higher effective population sizes and consistent with the prediction that improvement of enzyme efficiencies will stall once the gradient of the fitness surface is on the order of $1/{N}_{e}$ (Hartl et al., 1985). The fact that bacteria utilize transcription-factor binding-site motifs with stronger affinity to their cognate transcription factors than is the case in eukaryotes is also plausibly related to a higher efficiency of selection in the former (Lynch and Hagner, 2015).
Finally, proteins typically evolve to the ‘margin of stability,’ such that only one or two mutations are usually enough to destabilize the folding process (Taverna and Goldstein, 2002; Tokuriki and Tawfik, 2009). Protein stability is deemed to be positively associated with fitness because destabilized proteins are prone to loss of function, aggregation, and/or direct toxicity. Strikingly, however, it is relatively easy to obtain more stable proteins by mutagenesis (Matsuura et al., 1999; Bershtein et al., 2013; Sullivan et al., 2012), with the contributing residues typically interacting in an additive fashion (Wells, 1990; Serrano et al., 1993; Zhang et al., 1995). Moreover, although it is commonly argued that marginal stability is required for proper protein function, with excess stability somehow reducing protein performance, this has not held up to close scrutiny. Many examples exist in which increased stability has been achieved in laboratory modifications of proteins with few if any consequences for enzyme efficiency (e.g. Giver et al., 1998; Zhang et al., 1995; Taverna and Goldstein, 2002; Borgo and Havranek, 2012; Moon et al., 2014).
These observations suggest that despite persistent selection for high folding stability, the plateau-like nature of the fitness landscape results in diminishing fitness advantages of increasing stability. A hyperbolic relationship between fitness and the binding energy involving protein stability follows from biophysical principles (Govindarajan and Goldstein, 1997; Taverna and Goldstein, 2002; Bloom et al., 2005; Zeldovich and Shakhnovich, 2008; Wylie and Shakhnovich, 2011; Serohijos and Shakhnovich, 2014), and under this model, proteins are expected to be pushed by natural selection to more stable configurations until reaching the point where any further fitness improvement is small enough to be offset by the vagaries of random genetic drift and/or mutation pressure towards less stable states. Notably, proteins of equivalent length fold at least ten times more rapidly in bacteria than in eukaryotes (Galzitskaya et al., 2011). Moreover, an in vitro evaluation of the folding stability of the dihydrofolate reductase enzyme from 36 species of mesophilic bacteria illustrates the existence of a substantial range of variation among species, with the standard deviation being roughly 10% of the mean (Bershtein et al., 2015). In principle, such a distribution may reflect the dispersion in mean phenotypes associated with drift around the drift barrier.
Although the mutation function employed here likely comes closer to approximating the situation for cellular features than do previous functions relied on in quantitative genetics, in reality we do not know the exact form of this function for any cellular feature. Thus, the mathematical theory developed here is best viewed as a guide to approaching the problem at hand rather than as an indelible platform for quantitative analysis. Despite such uncertainties, however, the central feature of the theory presented above is that, regardless of the form of the underlying mutation and selection functions, the stationary distribution of mean phenotypes can generally be viewed as the product of the pattern expected under neutrality alone and the associated function for mean population fitness taken to the $2{N}_{e}$ power, as described by Equations (1a,b) and (11). Similar behavior was previously pointed out for the stationary distribution of allele frequencies (Wright, 1969). Thus, once the key underlying functions have been elucidated, the precise details of the theory can be readily modified with alternative mathematical functions.
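The product form of the stationary distribution lends itself to a simple numerical check. The sketch below is illustrative only: it assumes a Gaussian neutral distribution centred on a hypothetical ${\theta}_{m}=2$ whose variance does not change with ${N}_{e}$, and a Gaussian mean-fitness function with its optimum at zero. The helper names, grid, and parameter values are assumptions made for the example rather than quantities from Equations (1a,b) and (11).

```python
import math

def make_grid(lo, hi, n):
    """n evenly spaced candidate values of the mean phenotype."""
    step = (hi - lo) / (n - 1)
    return [lo + i * step for i in range(n)]

def log_neutral(z, theta_m=2.0, var_n=1.0):
    """Hypothetical neutral log-density: Gaussian centred on theta_m."""
    return -((z - theta_m) ** 2) / (2.0 * var_n)

def log_mean_fitness(z, width2=1.0e4):
    """Hypothetical log mean fitness: Gaussian with optimum at z = 0."""
    return -(z ** 2) / (2.0 * width2)

def stationary_density(grid, log_neutral_fn, log_fitness_fn, n_e):
    """Density proportional to neutral(z) * mean_fitness(z)**(2 * n_e)."""
    log_phi = [log_neutral_fn(z) + 2.0 * n_e * log_fitness_fn(z) for z in grid]
    peak = max(log_phi)  # subtract the maximum to avoid overflow in exp
    phi = [math.exp(value - peak) for value in log_phi]
    dz = grid[1] - grid[0]
    total = sum(phi) * dz
    return [p / total for p in phi]

def mode(grid, density):
    """Grid value at which the density is highest."""
    return grid[max(range(len(grid)), key=lambda i: density[i])]

if __name__ == "__main__":
    grid = make_grid(-2.0, 4.0, 6001)
    for n_e in (0, 5000, 50000):
        dens = stationary_density(grid, log_neutral, log_mean_fitness, n_e)
        print(f"N_e = {n_e}: mode of the mean phenotype = {mode(grid, dens):.3f}")
```

For this Gaussian-by-Gaussian case the mode has the closed form ${\theta}_{m}/(1+2{N}_{e}\,\mathrm{var}_{n}/\mathrm{width2})$, so the numerical mode moves from the neutral expectation toward the optimum as ${N}_{e}$ grows. Swapping in other log-density and log-fitness functions requires no change to `stationary_density`.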
Finally, a key issue that is not formally evaluated here, but is arguably relevant to a number of cellular features, concerns the matter of peak shifts across the stationary distribution. Questions regarding this matter are typically inspired by Wright’s (1932) metaphor of an adaptive topography, with multiple fitness peaks and valleys of various depths over the phenotypic landscape. However, unless the distribution of mutational effects is completely flat, the relevant topography is not simply defined by the fitness landscape but by the joint action of both selection and mutation. Although the stationary distribution was unimodal in all of the cases examined above, plausible cases exist in which the stationary distribution exhibits two peaks, one largely driven by selection and the other by mutation pressure. For this to occur, the gradient of mutation pressure in one direction has to be of a form such that its product with the selection gradient has an internal minimum (Figure 4). In principle, this can happen when at the intersection of intermediate phenotypes the two functions are sufficiently upwardly concave that their product reaches a local minimum.
Under such a scenario, the population mean phenotype is expected to reside in two alternative semi-stable domains for extended periods of time, with the rates of transitions between domains depending on the relative heights of the two peaks, the depth of the distributional valley, and the curvatures of the stationary distribution at the inflection points (Lande, 1985; Barton and Rouhani, 1987). Over long evolutionary time periods, such a system will exhibit detailed balance: the net fluxes will be equal in both directions, with the ratio of the occupancy of the two alternative domains being inversely related to the ratio of the transition rates between them, that is, with the less frequent domain having a higher conditional rate of transition to the more frequent domain.
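The detailed-balance argument can be made concrete with a two-state caricature, in which each peak of the stationary distribution is collapsed into a single domain. This is a minimal sketch under that simplifying assumption; the function name and the transition rates used in the example are hypothetical.

```python
def stationary_occupancy(rate_1_to_2, rate_2_to_1):
    """Long-run fractions of time spent in domains 1 and 2 of a two-state process."""
    total = rate_1_to_2 + rate_2_to_1
    p1 = rate_2_to_1 / total  # domain 1 is occupied more when exits from it are rare
    p2 = rate_1_to_2 / total
    return p1, p2

if __name__ == "__main__":
    # Hypothetical per-generation transition rates between the two domains.
    a, b = 1e-6, 4e-6
    p1, p2 = stationary_occupancy(a, b)
    print(f"occupancy: domain 1 = {p1:.2f}, domain 2 = {p2:.2f}")
    print(f"flux 1->2 = {p1 * a:.3e}, flux 2->1 = {p2 * b:.3e}")
```

With these rates the less frequently occupied domain is the one with the higher rate of exit, and the two fluxes match, which is the balance condition described above.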
Although the frequency of stationary distributions with multimodal forms is unknown, they have been predicted to arise in some situations involving transcription (Lynch and Hagner, 2015; Tuğrul et al., 2015). Should they exist, the picture from comparative analyses would be one of qualitative changes in mean phenotypes in adjacent lineages. Tempting as it might be to invoke shifting ecological pressures to explain such patterns, they would be occurring in the absence of any underlying changes in selection, being a simple consequence of the multiplicity of mutational opportunities in one direction balanced by selective pressures in the other. Such ideas may be helpful in attempts to decipher the substantial and seemingly disorganized diversity of certain cellular features such as open vs. closed mitosis (Sazer et al., 2014), the structure of the centrosome (Carvalho-Santos et al., 2011), and the variable multimeric states of proteins (Dayhoff et al., 2010; Lynch, 2013; Ahnert et al., 2015) across the Tree of Life.
References

The frequency of shifts between alternative equilibria. Journal of Theoretical Biology 125:397–418. https://doi.org/10.1016/S00225193(87)802102

Adaptive evolution of transcription factor binding sites. BMC Evolutionary Biology 4:42. https://doi.org/10.1186/14712148442

Evolving strategies for enzyme engineering. Current Opinion in Structural Biology 15:447–452. https://doi.org/10.1016/j.sbi.2005.06.004

Evolution: Tracing the origins of centrioles, cilia, and flagella. The Journal of Cell Biology 194:165–175. https://doi.org/10.1083/jcb.201011152

Effective population size and patterns of molecular evolution and variation. Nature Reviews Genetics 10:195–205. https://doi.org/10.1038/nrg2526

Evolution of protein binding modes in homooligomers. Journal of Molecular Biology 395:860–870. https://doi.org/10.1016/j.jmb.2009.10.052

On the selection and evolution of regulatory DNA motifs. Journal of Molecular Evolution 55:386–400. https://doi.org/10.1007/s002390022335z

Evolution of model proteins on a foldability landscape. Proteins: Structure, Function, and Genetics 29:461–466. https://doi.org/10.1002/(SICI)10970134(199712)29:4<461::AIDPROT6>3.0.CO;2B

Limits of adaptation: the evolution of selective neutrality. Genetics 111:655–674.

Genetic drift, selection and the evolution of the mutation rate. Nature Reviews Genetics 17:704–714. https://doi.org/10.1038/nrg.2016.104

Book: Genetics and Analysis of Quantitative Traits. Sunderland, MA: Sinauer Assocs., Inc.

Evolution of the mutation rate. Trends in Genetics 26:345–352. https://doi.org/10.1016/j.tig.2010.05.003

The lower bound to the evolution of mutation rates. Genome Biology and Evolution 3:1107–1118. https://doi.org/10.1093/gbe/evr066

Evolutionary molecular engineering by random elongation mutagenesis. Nature Biotechnology 17:58–61. https://doi.org/10.1038/5232

An integrated approach for thermal stabilization of a mesophilic adenylate kinase. Proteins: Structure, Function, and Bioinformatics 82:1947–1959. https://doi.org/10.1002/prot.24549

Universality and predictability in molecular quantitative genetics. Current Opinion in Genetics & Development 23:684–693. https://doi.org/10.1016/j.gde.2013.11.001

Cell biology: scaling and the emergence of evolutionary cell biology. Current Biology 25:R223–R225. https://doi.org/10.1016/j.cub.2015.01.049

Deciphering the evolutionary history of open and closed mitosis. Current Biology 24:R1099–R1103. https://doi.org/10.1016/j.cub.2014.10.011

Contribution of selection for protein folding stability in shaping the patterns of polymorphisms in coding regions. Molecular Biology and Evolution 31:165–176. https://doi.org/10.1093/molbev/mst189

Why are proteins marginally stable? Proteins: Structure, Function, and Genetics 46:105–109. https://doi.org/10.1002/prot.10016

Stability effects of mutations and protein evolvability. Current Opinion in Structural Biology 19:596–604. https://doi.org/10.1016/j.sbi.2009.08.003

Heritable genetic variation via mutation-selection balance: Lerch's zeta meets the abdominal bristle. Theoretical Population Biology 25:138–193. https://doi.org/10.1016/00405809(84)900170

Dynamics of transcription factor binding site evolution. PLoS Genetics 11:e1005639. https://doi.org/10.1371/journal.pgen.1005639

Additivity of mutational effects in proteins. Biochemistry 29:8509–8517. https://doi.org/10.1021/bi00489a001

The roles of mutation, inbreeding, crossbreeding, and selection in evolution. Proc. Sixth Internat. Cong. Genetics pp. 355–366.

The Theory of Gene Frequencies. Evolution and the Genetics of Populations, The Theory of Gene Frequencies, 2, Chicago, IL, Univ. Chicago Press.

Understanding protein evolution: from protein physics to Darwinian selection. Annual Review of Physical Chemistry 59:105–127. https://doi.org/10.1146/annurev.physchem.58.032806.104449

Enhancement of protein stability by the combination of point mutations in T4 lysozyme is additive. Protein Engineering, Design and Selection 8:1017–1022. https://doi.org/10.1093/protein/8.10.1017
Decision letter

Naama Barkai, Reviewing Editor; Weizmann Institute of Science, Israel
In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.
Thank you for submitting your article "Phylogenetic Divergence of Cell Biological Features" for consideration by eLife. Your article has been reviewed by two peer reviewers, and the evaluation has been overseen by Naama Barkai as the Senior and Reviewing Editor. The following individual involved in review of your submission has agreed to reveal his identity: Joe Felsenstein (Reviewer #2).
The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.
Please address all comments below, most of which can be addressed with suitable changes in the text. Others require a response which the reviewers will consider in rendering a final determination.
Reviewer #1:
I reviewed this manuscript with a PhD student from my laboratory.
Whether cellular traits are at the optimal configuration allowed by biophysical constraints in any organism is unknown. As was shown in population genetics, genetic drift could play an important role in preventing traits from reaching this optimum. However, biased mutation pressures could also contribute to drive average phenotypes away from their optimum. The extent of this effect is unknown. This paper addresses the evolution of cellular traits, particularly the relative contribution of natural selection, mutation and genetic drift on their evolution and divergence among populations or species.
Lynch develops a theoretical framework to identify the conditions in which phenotypes can be driven away from optima by biased mutational pressure. He also introduces different fitness functions that likely apply to cellular traits and shows how these fitness functions interact with various mutational effects to affect the expected stationary trait distribution.
The paper is well written and the findings are important because, among other things, they stimulate new avenues of investigation in cell biology. The presentation however could limit the accessibility of the manuscript to the broad audience of eLife.
It would be useful in the Introduction to mention why cellular traits require a special treatment and approach, and why they cannot just be considered as other quantitative traits. The author mentions that since they evolve in a relatively stable environment due to homeostasis, cellular traits could be under relatively uniform selective pressure, even among distinct lineages. However, wouldn't that be true for internal organs also in multicellular species? Also, homeostasis is the result of these cellular traits interacting with the environment, so it cannot be considered as an independent factor. Maybe cellular traits are just a special case of slowly evolving traits?
The first paragraph of the Introduction appears to be reporting observations that are commonly known for evolutionary biologists. However, eLife not being specialized in evolution, it could be useful to cite some general references for these statements. The second paragraph of the subsection “The Process of Mutation” could also be supported by more references, same thing for the Discussion. Along the same line, the readership of eLife would appreciate some graphical and simplified representations of the processes described with equations, some sort of graphical summary of the paper.
The Introduction mentions examples of cellular traits. It could be useful to mention them earlier to introduce what cellular traits are.
The paragraph above Equation 1B: does this partition of the variance of z implicitly assume that the parental average (A) does not covary with the deviation from additivity (e)? It would be clearer to briefly state this assumption. Some stochastic effects in cell biology could be noise factors that are correlated positively or negatively with the average effects.
Finally, I do not have the skills required to fully verify the validity of the different derivations and mathematical assumptions presented. I assume another reviewer will have done so.
Reviewer #2:
This is a very interesting and important paper. Where a lot of us have been deterred from thinking hard about this by the worry that characters at the molecular level might involve too few genes to be successfully modeled by the machinery of evolutionary quantitative genetics, Lynch has plunged in and obtained some very interesting results.
I have several substantive questions, and then a bunch of suggestions for clarity. 1) “distributions expected under selection alone and under mutation alone” are invoked. The latter I can see. But the former is not obvious. I get the impression from the equations that this is the distribution that would result under selection-versus-drift when the additive genetic variance is held constant at ${\sigma}_{A}^{2}$. So there is some kind of assumption that, in the case of “selection alone”, selection does not erode the additive genetic variance.
2) “evolution by natural selection comes to a standstill where … the phenotypic mean resides at the point where the slope of the fitness function is zero.” I wonder whether that is true. I recall that if the fitness function (curve of fitness as a function of phenotype) is a mixture of two Gaussian peaks, one smaller than the other, where the phenotypic mean will come to a standstill depends on the slope, not of that fitness curve, but of the fitness curve where each component is fattened by the additive genetic variance. That can be quite different. Or do I misunderstand what is being said here?
3) The mathematics here uses compound parameters which determine the final equilibrium distribution. However, should it not be noted that these do not determine the time dynamics in a population? If one is looking at a phylogeny of species, rather than species that have diverged long enough that each is in its equilibrium distribution, one does need the parameters that determine the time dynamics. This should at least be mentioned, as multispecies data tend to come to us at the tips of phylogenies.
https://doi.org/10.7554/eLife.34820.010
Author response
Reviewer #1:
[…] It would be useful in the Introduction to mention why cellular traits require a special treatment and approach, and why they cannot just be considered as other quantitative traits. The author mentions that since they evolve in a relatively stable environment due to homeostasis, cellular traits could be under relatively uniform selective pressure, even among distinct lineages. However, wouldn't that be true for internal organs also in multicellular species? Also, homeostasis is the result of these cellular traits interacting with the environment, so it cannot be considered as an independent factor. Maybe cellular traits are just a special case of slowly evolving traits?
I have attempted to reword the Introduction a bit, and elaborate elsewhere, to accommodate the reviewer’s points here. I do not mean to imply that cellular traits need to be treated differently than other classical quantitative traits, but rather that due to relative homeostasis (and the likely more constant selective environment), they may actually fulfill the assumptions of the models better. The point about internal organs is interesting, and has now been mentioned.
The first paragraph of the Introduction appears to be reporting observations that are commonly known for evolutionary biologists. However, eLife not being specialized in evolution, it could be useful to cite some general references for these statements. The second paragraph of the subsection “The Process of Mutation” could also be supported by more references, same thing for the Discussion. Along the same line, the readership of eLife would appreciate some graphical and simplified representations of the processes described with equations, some sort of graphical summary of the paper.
As requested, additional references are cited. The Lynch and Walsh / Walsh and Lynch books give a very broad and up-to-date overview of the theoretical and empirical basis of quantitative genetics. I have also attempted to produce a figure that I hope will soften things a bit and provide additional clarity.
The Introduction mentions examples of cellular traits. It could be useful to mention them earlier to introduce what cellular traits are.
This is now mentioned explicitly in the first paragraph of the paper.
The paragraph above Equation 1B: does this partition of the variance of z implicitly assume that the parental average (A) does not covary with the deviation from additivity (e)? It would be clearer to briefly state this assumption. Some stochastic effects in cell biology could be noise factors that are correlated positively or negatively with the average effects.
This is now stated explicitly at the designated location.
Finally, I do not have the skills required to fully verify the validity of the different derivations and mathematical assumptions presented. I assume another reviewer will have done so.
Reviewer #2:
[…] I have several substantive questions, and then a bunch of suggestions for clarity. 1) “distributions expected under selection alone and under mutation alone” are invoked. The latter I can see. But the former is not obvious. I get the impression from the equations that this is the distribution that would result under selection-versus-drift when the additive genetic variance is held constant at ${\sigma}_{A}^{2}$. So there is some kind of assumption that, in the case of “selection alone”, selection does not erode the additive genetic variance.
Yes, I believe that is the correct interpretation, and this is now stated explicitly and elaborated upon more in the Discussion, with further justification noted in the section on the stationary distribution.
2) “evolution by natural selection comes to a standstill where … the phenotypic mean resides at the point where the slope of the fitness function is zero.” I wonder whether that is true. I recall that if the fitness function (curve of fitness as a function of phenotype) is a mixture of two Gaussian peaks, one smaller than the other, where the phenotypic mean will come to a standstill depends on the slope, not of that fitness curve, but of the fitness curve where each component is fattened by the additive genetic variance. That can be quite different. Or do I misunderstand what is being said here?
I have reworded things to make clear that the formulation is referring to mean population fitness with respect to the mean phenotype, under the stated assumption that the phenotype distribution is normal.
3) The mathematics here uses compound parameters which determine the final equilibrium distribution. However, should it not be noted that these do not determine the time dynamics in a population? If one is looking at a phylogeny of species, rather than species that have diverged long enough that each is in its equilibrium distribution, one does need the parameters that determine the time dynamics. This should at least be mentioned, as multispecies data tend to come to us at the tips of phylogenies.
Yes, this is an important point, and at the beginning of the section on the model, I now state “From an empirical perspective, this steady-state view of evolution implicitly assumes that enough time has elapsed between observed taxa that the dynamics of the evolutionary process are of negligible significance (which would not be the case for closely related species).”
https://doi.org/10.7554/eLife.34820.011
Article and author information
Author details
Funding
Army Research Office (W911NF0910444)
 Michael Lynch
Army Research Office (W911NF1410411)
 Michael Lynch
National Institutes of Health (R01GM036827)
 Michael Lynch
National Institutes of Health (R35GM122566)
 Michael Lynch
National Science Foundation (PHY1125915)
 Michael Lynch
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
I thank M Bauer, J Felsenstein, P Higgs, P Johri, M Lässig, M Manhart, A Moses, and D Needleman for helpful comments. This research was supported in part by the National Science Foundation under Grant No. PHY1125915 to the Kavli Institute of Theoretical Physics. Support was also provided by the Multidisciplinary University Research Initiative awards W911NF0910411 and W911NF0910444 from the US Army Research Office, National Institutes of Health awards R01GM036827 and R35GM12256601, and National Science Foundation award MCB1518060.
Reviewing Editor
 Naama Barkai, Weizmann Institute of Science, Israel
Publication history
 Received: January 4, 2018
 Accepted: May 10, 2018
 Version of Record published: June 21, 2018 (version 1)
Copyright
© 2018, Lynch et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.