Measurements of damage and repair of binary health attributes in aging mice and humans reveal that robustness and resilience decrease with age, operate over broad timescales, and are affected differently by interventions
Abstract
As an organism ages, its health-state is determined by a balance between the processes of damage and repair. Measuring these processes requires longitudinal data. We extract damage and repair transition rates from repeated observations of binary health attributes in mice and humans to explore robustness and resilience, which respectively represent resisting or recovering from damage. We assess differences in robustness and resilience using changes in damage rates and repair rates of binary health attributes. We find a conserved decline with age in robustness and resilience in mice and humans, implying that both contribute to worsening aging health – as assessed by the frailty index (FI). A decline in robustness, however, has a greater effect than a decline in resilience on the accelerated increase of the FI with age, and a greater association with reduced survival. We also find that deficits are damaged and repaired over a wide range of timescales ranging from the shortest measurement scales toward organismal lifetime timescales. We explore the effect of systemic interventions that have been shown to improve health, including the angiotensin-converting enzyme inhibitor enalapril and voluntary exercise for mice. We have also explored the correlations with household wealth for humans. We find that these interventions and factors affect both damage and repair rates, and hence robustness and resilience, in age and sex-dependent manners.
Editor's evaluation
The key contribution of this study is to evaluate the longitudinal change in frailty indices by tracking both accumulation of damage and repair of deficits (damage and repair transition rates), using a sophisticated mathematical modeling and a translational approach that spans mice and humans. A second key achievement of this study is to evaluate change in frailty indices and damage and repair transition in interventions that improve health in mice. Collectively this advances progress in translational geroscience by providing new insight regarding how we measure biological age that can aid assessment of aging-relevant interventions. The authors have provided extensive details that support the research frameworks presented in this report.
https://doi.org/10.7554/eLife.77632.sa0Introduction
As organisms age, they can be described by health states that evolve according to dynamical processes of damage and repair. A health state is the net result of accumulated damage and subsequent repair (Howlett and Rockwood, 2013). Studies of aging have mostly focused on discrete health-states rather than the underlying continuous dynamic processes, due the difficulty of their measurement. Two common approaches to measuring individual health-states, the Frailty Index (FI) (Mitnitski et al., 2001) and the Frailty Phenotype (Fried et al., 2001), are assembled from health state data at a specific age and do not separate dynamic damage and repair processes. Nevertheless, strong associations between frailty measures and adverse health outcomes (Hoogendijk et al., 2019; Howlett et al., 2021) indicate that frailty affects the underlying dynamical processes. This is supported by the increasing rate of net accumulation of health deficits with worsening health (Mitnitski et al., 2007; Kojima et al., 2019).
Reduced resilience, or the decreasing ability to repair damage (or recover from stressors), is increasingly seen as a key manifestation of organismal aging (Ukraintseva et al., 2021; Kirkland et al., 2016; Hadley et al., 2017). Resilience is often assessed by the ability to repair following an acute stressor, such as a heat/cold shock, viral infection, or anesthesia; or a non-specific stressor such as a change of the health state, typically within a short timeframe (Scheffer et al., 2018; Gijzel et al., 2019; Rector et al., 2021; Colón-Emeric et al., 2020; Pyrkov et al., 2021). Robustness, or an organism’s resistance to damage, has not been as well studied – but there is also evidence for its average decline with age (Arbeev et al., 2019; Kriete, 2013). Both resilience and robustness sustain organismal health during aging, but their relative importance and their timescales of action remain largely unexplored.
While cellular and molecular damage and dysfunction are classic ‘hallmarks’ of aging (López-Otín et al., 2013), damage and dysfunction at organismal scales may exhibit distinct behavior (Gems and de Magalhães, 2021; Howlett and Rockwood, 2013). Indeed, from a complex systems perspective we may expect qualitatively distinct emergent phenomena at tissue or organismal scales (Cohen et al., 2022). However, a significant amount of organismal health data is discrete and cannot be approached with existing techniques used to study resilience or robustness. It is important both to study resilience and robustness at organismal scales and to be able to use discrete data while doing so.
To simultaneously study both resilience and robustness during aging with binarized health-deficits, we have here developed a novel method of analysis that uses longitudinal data from mice and humans to obtain summary measures of organismal damage and repair processes over time. This approach can be adapted to use any discrete biomarker. We apply our method to study how resilience and robustness evolve with age and how they differ between species, between sexes, and under different health interventions.
Our approach and results are limited to binarized health attributes; for our purposes damage and repair correspond to discrete transitions of these binarized attributes. Since our attributes are at the clinical or organismal scale of health, we do not consider cellular or molecular damage directly. Although our approach could be applied to binarized attributes at any organismal scale, we do not investigate whether our conclusions generalize to different sets of binarized attributes, nor do we consider continuous attributes.
Developing interventions to extend lifespan and healthspan is the goal of geroscience (Kennedy et al., 2014; Sierra, 2016; Sierra et al., 2021). While some interventions that affect aging health have been identified, how they differentially affect damage and repair, and their timescales of action, is less understood. We consider interventions in mice that have previously been shown to have a positive impact on frailty, the angiotensin converting enzyme (ACE) inhibitor enalapril (Keller et al., 2019) and voluntary exercise (Bisset et al., 2022). In humans, we stratify individuals within the English Longitudinal Study of Aging by net household wealth (Phelps et al., 2020; Steptoe et al., 2014). Wealth is a socioeconomic factor associated with aging health (Zimmer et al., 2021; Niederstrasser et al., 2019). Understanding how various interventions affect aging health by affecting resilience and robustness will better enable us to fulfill the geroscience agenda.
Results
Measuring resilience and robustness with binarized data
A well-established approach to quantify health in both humans and in animal models is to count binarized health deficits in an FI (Mitnitski et al., 2001; Whitehead et al., 2014). In longitudinal studies, the FI can be assessed at each follow-up. Here, we use longitudinal binarized health attribute data from mice and humans that can be used to evaluate the FI to also quantify organismal damage and repair processes over time. As illustrated in the schematic in Figure 1, the change in number of deficits from one follow-up to the next is determined by the number of new deficits (indicating damage, with deficit values transitioning from 0 to 1, red arrow) minus the number of repaired deficits that were previously in a damaged state (with transitions of deficit values from 1 to 0, green arrow). These counts of damaged and repaired deficits between follow-ups represent summary measures of the underlying damage and repair processes. We model this process with a Bayesian Poisson model for counts of damaged and repaired deficits, using age-dependent damage and repair rates. For mice we use a joint longitudinal-survival model, which couples the damage and repair rates together with mortality. For humans, we use a similar model but without the survival component due to having no mortality data.

Extracting damage and repair from the longitudinal observation of binary health deficits.
Instead of just considering the Frailty Index (FI) or net count of deficits at each age nt (i.e. FI multiplied by the total number N of deficits considered) as a measure of health, we separately consider the number of deficits damaged or repaired within a time interval . Time-dependent damage and repair rates are extracted using Poisson models for the counts of repaired or damaged deficits.
In our approach, damage rates are the probability of acquiring a new deficit per unit of time, and repair rates are the probability of repairing a deficit per unit time. These are aggregate measures of susceptibility to damage (lack of robustness), and ability to repair (resilience). The FI is a whole organism-level summary measure of health; accordingly, these aggregate damage and repair rates are also whole organism-level measures of robustness and resilience. Note that since these rates are per available deficit, repair rates may exceed damage rates while the FI is still increasing. This can occur due to relatively rapid repair per deficit of a small number of deficits, with a slower damage rate per deficit of a much larger number of undamaged attributes.
While damage of binarized health attributes with age necessarily follows from declining health, repair does not. However, almost all health attributes used in our mouse data have been previously shown to reverse either spontaneously or through extrinsic interventions such as drug treatments or lifestyle changes – see Supplementary file 1, with references. In Figure 5—figure supplement 4 we also show repair counts per deficit type. Nevertheless, for our data, not all deficits repair equally, and some rarely or never repair (‘Cataracts’, ‘Tumours’, ‘diarrhea’, and ‘vaginal/uterine/penile prolapses’).
Both resilience and robustness decline in aging populations
We first establish the trends of repair and damage rates in aging. In Figure 2, we plot the age-dependence of the repair and damage processes in mice and humans for three mouse datasets (a) 1 Keller et al., 2019; (b) 2 Bisset et al., 2022; and (c) 3 Schultz et al., 2020; and (d) humans from the ELSA dataset (Phelps et al., 2020; Steptoe et al., 2014). Humans are plotted by decade of baseline age at entry to the study to separate out recruitment effects. Points are binned averages from the data, and lines are posterior samples from the model of the rates. Posterior predictive checks show good model quality, seen in Figure 2—figure supplement 1 for mice (a-c) and humans (d).

Repair rates decrease and damage rates increase with age.
Repair rates vs age (top), damage rates vs age (bottom) for (a) Mouse dataset 1 (Keller et al., 2019), (b) Mouse dataset 2 (Bisset et al., 2022), (c) Mouse dataset 3 (Schultz et al., 2020) and (d) ELSA humans (Phelps et al., 2020; Steptoe et al., 2014) plotted by decades of baseline age. Points in all plots represent binned averages of rates from the data with standard errors, and lines represent posterior samples from Bayesian models of the rates (see Methods). For each plot, the mean Spearman’s rank correlation between the rate and age is indicated by the median of the posterior and a 95% posterior credible interval in parenthesis. (e-g) Posterior distributions of log hazard ratios of death for damage and repair rates are shown as violin plots for the mouse datasets. These hazard ratios correspond to a 1 standard deviation increase in the damage or repair rates. The black interval shows a 95% credible interval around the median point.
-
Figure 2—source data 1
Data used in figure.
- https://cdn.elifesciences.org/articles/77632/elife-77632-fig2-data1-v2.zip
In each of these datasets, there is a strong decrease in repair rates and increase in damage rates with age (except for damage rates in mouse dataset 2). Spearman rank correlations for each plot are also shown in Figure 2, highlighting the increase or decrease in rates with age, and 95% posterior credible intervals of these correlations are shown in brackets. Overall, we observe decreasing repair rates and increasing damage rates with age which signify decreasing resilience and robustness with age in both mice and humans. Decreasing repair and increasing damage both contribute to an increasing FI with age in mice and humans (shown in Figure 2—figure supplement 2a–d). We also observe higher FI scores in females versus males in both mice and humans, as reported previously (Kane et al., 2019; Gordon and Hubbard, 2020; Kane and Howlett, 2021).
We evaluate the contributions of damage and repair rates to survival using a joint longitudinal-survival model in mice. In Figure 2e–g, we show that damage rates have much larger hazard ratios for death than repair rates. These hazard ratios are for a fixed FI, itself a strong predictor of mortality in mice and in people (Rockwood et al., 2017), which shows that an increasing susceptibility to damage leads to larger decreases in survival than a comparable decline in resilience. Individuals survive longer when damage is avoided altogether, as compared to damage that is subsequently repaired. This intuitive result indicates that there may be lingering effects of the original damage and suggests that interventions that focus on robustness may be more effective than those that focus on resilience.
The acceleration of damage accumulation is determined by a decline in robustness
The plots of FI vs. age shown in Figure 2—figure supplement 2 (see also Mitnitski et al., 2001; Mitnitski et al., 2005; Mitnitski et al., 2012; Mitnitski et al., 2013) has a positive curvature, accelerating upwards near death (Stolz et al., 2021). This positive curvature is also seen in other summary measures such as Physiological Dysregulation (Arbeev et al., 2019). However, the origin of this curvature is unknown – whether it is due to a late-life decrease in resilience or a decline in robustness.
We measure the curvature of the FI with the second time-derivative, which can be computed with the age-slopes of the damage and repair rates (see Materials and methods). In Figure 3, we show the separate contributions to this curvature, separated into terms involving damage (pink) and terms involving repair (green). Summing these terms, we observe the typical positive curvature that indicates an acceleration of damage accumulation.

Frailty Index curvature is dominated by declining robustness.
(a) Frailty Index curvature measures the rate of accumulation of damage. Positive curvature indicates an acceleration of damage accumulation, zero curvature indicates a constant accumulation of damage, and negative curvature indicates a decelerating accumulation of damage. Curvature is computed with the second time-derivative of the Frailty Index (Methods Equation 22). Terms of the curvature involving the repair rate (green) and the damage rate (pink) are separately shown. Lines represent posterior samples from our Bayesian models for (b) Mouse dataset 1 (Keller et al., 2019), (c) Mouse dataset 2 (Bisset et al., 2022), (d) Mouse dataset 3 (Schultz et al., 2020) and (e) ELSA humans (Phelps et al., 2020; Steptoe et al., 2014), plotted separately by decades of baseline age. On all plots, we indicate for which ages the proportion of the posterior for the difference in these terms that is negative is below 0.05; the Bayesian analogue of a p-value testing the contributions of robustness and resilience.
-
Figure 3—source data 1
Data used in figure.
- https://cdn.elifesciences.org/articles/77632/elife-77632-fig3-data1-v2.zip
We find that the decline in robustness (indicated by the damage rate terms) has the strongest effect on the curvature of the FI. In mice, this is seen in every dataset and is significant at the indicated ages (Figure 3b–d) and for humans at older ages (Figure 3e). This observed effect indicates that it is the increase in damage with age, rather than the decline of repair, that causes this acceleration of net damage accumulation. The significance of this effect is evaluated by computing the posterior distribution of the difference between damage and repair terms. When at least 95% of this distribution is above/below zero, we take the effect to be significant. Credible intervals visualizing the difference are shown in Figure 3—figure supplement 1.
In Figure 2e–g we had shown that the decline in robustness has the strongest effect on survival in mice. Together with the results shown in Figure 3, these results highlight the important role of declining robustness in aging.
Interventions modify damage and repair rates in mice and wealth correlates with rates in humans
Mouse datasets 1 (Keller et al., 2019) and 2 Bisset et al., 2022 have additional intervention groups treated with either the ACE inhibitor enalapril, or voluntary aerobic exercise, respectively. In Figure 4—figure supplement 1a and b, we show that these interventions target both repair and damage processes, resulting in lower FI damage accumulation over time for the treated groups. In Figure 4a and b, we investigate the effects of these interventions on the curvature of the FI. This curvature is strongly reduced by exercise in mouse dataset 2, with a weaker effect for enalapril the credible intervals of the intervention effects are shown in Figure 3—figure supplement 1d and e. Notably, exercise stops the acceleration in damage accumulation in both male and female mice by reducing the curvature to zero.

Interventions both increase resilience and decrease damage.
(a, b) The effect of enalapril and exercise on Frailty Index curvature for mice for mouse datasets 1 and 2. 95% credible intervals for these curvatures are shown by errorbars around the median (point). Asterisks (*) indicate credible intervals for the difference between intervention and control fully exclude zero. (c ,d) Repair rates and damage rates time-slopes vs time since intervention for the effect of enalapril and exercise for mouse datasets 1 and 2. 95% credible intervals for these curvatures are shown by errorbars around the median (point). Asterisks (*) indicate credible intervals for the difference between intervention and control fully exclude zero. (e) Spearman rank correlation between wealth and repair rate (green) and damage rate (pink), vs age for ELSA humans. Individuals are separated by decades of baseline age, and 95% credible intervals for these correlations are shown as coloured regions around the median (thick line). We restrict this plot to ages with at least 3 individuals.
-
Figure 4—source data 1
Data used in figure.
- https://cdn.elifesciences.org/articles/77632/elife-77632-fig4-data1-v2.zip
The effect of these interventions on the repair and damage rates is seen in Figure 4c and d, where 95% credible intervals for the age-slopes show the rate of increase or decrease of the repair and damage rates as age increases. These slopes include both the change in the rate with age, and the effect due to increasing FI with age. Interventions affect the rate of decrease of both repair and damage rates with time, resulting in less cumulative damage.
As shown in Figure 4c, enalapril attenuates the rate of decrease of repair rates in both male and female mice, resulting in age-slopes closer to zero than for controls. Significance is evaluated by computing the posterior distribution of the difference between control and intervention. Significance is shown with asterisks (*) when at least 95% of the distribution is above/below zero. In Figure 4—figure supplement 1 we show a significant reduction in damage rate (but not slope) for male and female mice with enalapril. A sex-specific effect is seen for voluntary exercise. For female mice, voluntary exercise leads to stoppage of the decline in repair rates (to an approximately zero slope), whereas for male mice it just attenuates the decline (Figure 4d). For damage rates, female mice exhibit an attenuation of the rise with age whereas in male mice exercise stops the age-dependent rise exhibited by control mice.
For humans, we use net household wealth as a socioeconomic environmental factor that serves as a proxy for medical and behavioural interventions that are not individually tracked. This factor is not an intervention as in mice, and is simply correlational. As such we report correlations of wealth with repair and damage rates with age, rather than age-slopes after a specific intervention is initiated. In Figure 4—figure supplement 2, we show rates stratified by terciles of net household wealth, where the lowest tercile exhibits lower repair rates and higher damage rates for younger ages. Correspondingly, the FI is lower for individuals with a higher net household wealth. Treating the wealth variable as continuous, Figure 4e shows that repair rates are positively correlated with net household wealth, while damage rates are negatively correlated – with significant and stronger effects at younger ages. These results reinforce the findings in mice, where interventions impact both damage and repair rates. In humans, we also see some evidence of decreasing effects of wealth with age – although these may be confounded by recruitment effects depending on baseline age.
Damage and repair have broad timescales
In the results above, we considered the average damage and repair transition rates vs age. Since individual deficits undergo stochastic transitions between damaged and repaired states, we can also measure the lifetime of these individual deficit states (see Figure 5a). These lifetimes are interval censored (transitions typically occur between observation times) and can be right-censored (death or drop-out before transition occurs). We use an interval censored-analogue to the standard Kaplan-Meier estimator for right censored data (see Maerials and methods) to estimate state-survival curves of individual damaged or repaired states. These state-survival curves in Figure 5, considering all possible deficits, represent the probability of a deficit remaining undamaged vs time since a repair transition, or remaining damaged vs time since a damage transition.

Resilience and robustness occur over both short and long time-scales in both mice and humans.
(a) The time-scale of resilience is measured as the lifetime of the damaged state. The time-scale of robustness is measured as the lifetime of the undamaged state. Deficit state-survival curves, showing the probability of remaining in the current damaged or undamaged state since time of transition, are shown for (b) Mouse dataset 1, (c) Mouse dataset 2, (d) Mouse dataset 3, and (e) ELSA human dataset. The shaded regions are 95% posterior credible intervals. p-Values are shown on the lower left of each plot for generalized log-rank tests for the equality of the survival functions between the intervention or wealth groups (Zhao et al., 2008 and Zhaeo, 2012).
-
Figure 5—source data 1
Data used in figure.
- https://cdn.elifesciences.org/articles/77632/elife-77632-fig5-data1-v2.zip
We generally observe a significant drop of state-survival probability at early times, indicating some rapid state transitions at or below the interval between measurements. However, all the curves also extend to very long times – towards the scale of organismal lifetime – indicating that both robustness and resilience operate over a broad range of timescales. These results highlight that repair can occur a long time after damage originally occurred. Note that the timescale of robustness as measured here is not robustness after a specific extrinsic stressor, but robustness from the implicit stressors of aging. A similar form of non-specific robustness has been measured in a previous study, using the onset age of disease (Arbeev et al., 2019).
As shown by exponential time-scales of resilience and robustness for individual deficits in Figure 5—figure supplement 1 and Figure 5—figure supplement 2, mice deficits and human deficits exhibit a variety of time-scales of resilience and robustness. Some deficits repair soon after damage (or damage soon after repair), and some repair (or damage) over a broad range of time-scales. The combination of all of these deficits result in the shape of the state-survival curves in Figure 5.
We evaluate the significance of the difference between state-survival curves with a generalized log-rank test (Zhao et al., 2008; Zhaeo, 2012). For the interventions studied, there are no dramatic changes of resilience or robustness timescales exhibited in mice. Exercise in the male mice slightly shifted the timescale of resilience, such that deficits were repaired faster in mice that were exercised compared to controls. We expect that we would observed stronger effects of the interventions on these time-scales if we had sufficient data to resolve the impact of the time at which the initial damage or repair event occurred – here we have grouped all times together. For humans (see Figure 5e), we see strong and significant effects on resilience and robustness timescales from household wealth in females, but not males. These effects are particularly strong for damage timescales, which characterize robustness: states remain healthy longer at higher wealth terciles.
Discussion
We have presented a new approach for the assessment of damage (robustness) and repair (resilience) rates in longitudinal aging studies with binarized health attributes. With this approach, we have shown that both humans and mice exhibit increasing damage and decreasing repair rates with age, corresponding to decreasing robustness and resilience, respectively. We also demonstrate that decreasing robustness and resilience with age contribute to the acceleration of deficit accumulation for organisms. Decreasing robustness has approximately twice as large an effect when compared to declining resilience; decreasing robustness also has a stronger and significant effect on survival. While much of the focus in previous work has been on the decline of resilience in aging (Ukraintseva et al., 2021), our results indicate that decreasing robustness and decreasing resilience are both important processes underlying the increasing accumulation of health-related deficits with age, and the increasing rate of accumulation at older ages.
In the current study, the observed damage is assumed to occur due to natural processes, rather than a specific applied stressor (Kirkland et al., 2016; Colón-Emeric et al., 2020). Resilience measured by the observed repair also occurs without targeted interventions (certainly in mice, due to their absence of health-care), and so is likely to represent intrinsic resilience with respect to spontaneous damage due to the natural stressors of aging. Our approach has some similarities with recent approaches to measuring resilience by the autocorrelation timescale of intrinsic variations of continuous physiological state variables (Pyrkov et al., 2021; Gijzel et al., 2017; Rector et al., 2021). An advantage of our approach, which uses binarized variables, is that we can estimate both resilience and robustness using similar methods on the same data – so we can compare their relative effects. Previous work has modeled the change in the total count of discrete deficits with age (Mitnitski et al., 2006; Mitnitski et al., 2007; Mitnitski et al., 2010; Mitnitski et al., 2012; Mitnitski et al., 2014), but did not separately measure damage and repair. With our approach, we observe decreasing resilience and robustness with age in both mice and humans.
There are caveats to our approach. We may miss fast damage and repair dynamics that occur on time-scales shorter than the separation between observed time-points, for example, we cannot observe daily or weekly changes in deficit states in mice or monthly changes in humans. Therefore, our measurements of damage and repair can only be interpreted as the net damage and net repair between observed time-points. Furthermore, since we have defined damage and repair (and robustness and resilience) as average rates with respect to binarized attributes it remains an open question how they relate to damage and repair rates assessed from continuous health attributes. Our approach results in summary measures of damage and repair rates.
We are not aware of any selection bias in our mouse studies, and we applied joint modelling to mitigate survivor bias effects. To mitigate selection bias in the human data, we treated onset ages distinctly. There are nevertheless well-known ‘healthy volunteer’ effects that would bias the original population that we did not consider. Furthermore, we did not have human mortality data so we could not treat human survivor bias effects. Measurement errors could also contribute to both damage and repair rates – although presumably not in an age-dependent fashion. In contrast, we observe decreasing repair rates and increasing damage rates with age. Errors in deficit assessment are known to be small for mice (Feridooni et al., 2017; Kane et al., 2017). Supporting this, a sensitivity analysis of pruning putative mouse measurement errors, shown in Figure 5—figure supplement 3, finds no qualitative changes.
We find that both damage and repair processes are targeted by interventions in mice. As a result, developing interventions to target either damage or repair separately is conceivable. While targeting either would affect net deficit accumulation, we found that the damage rate has a stronger effect on both mortality and the acceleration of damage accumulation than the repair rate. Consistent with this, recent work has shown that FI damage is also more associated with mortality than FI repair in humans (Shi et al., 2021). We predict that interventions that facilitate robustness (resistance to damage) may be more important at older ages, where damage accumulation normally accelerates. More broadly, rather than just targeting deficit accumulation or FI (Howlett et al., 2021), our results indicate that interventions could be improved by targeting an appropriate balance of damage and repair processes – in an age- and sex-dependent manner. Since both damage and repair occur on long timescales, this raises the possibility that these rates could be manipulated by interventions over a similarly broad range of timescales from the shortest times to organismal lifetimes. How to optimally deploy available interventions is not yet clear.
The effects of age on both damage and repair, in mice and humans, are qualitatively similar in male and female populations. Nevertheless, we have found that systemic interventions can have qualitatively distinct sex effects in mice. The ACE inhibitor enalapril has stronger effects in female mice. Voluntary exercise stopped the decline in repair rate with age for female mice, but not male mice, and stopped the increase in damage rate with age for male mice, but not female mice. These differences suggest that assessing both damage and repair rates, together with accumulated damage as a FI, in interventional aging studies can provide a clearer assessment of sex differences. Further studies are needed to tease out the sex-dependent effects of other aging interventions, and to provide quantitative insight into the mortality-morbidity paradox, where females live longer but have higher FI scores than males (Kane and Howlett, 2021; Oksuzyan et al., 2008).
Summary measures of health such as the FI exhibit an accelerating accumulation of health deficits with age (Mitnitski et al., 2001; Mitnitski et al., 2005; Mitnitski et al., 2012; Mitnitski et al., 2013). This universally observed behavior must be reflected in either increasing damage rates with age, decreasing repair rates, or – as we find – both. However, the question of whether, and by what mechanisms, damage and repair processes are coupled during aging remains unanswered. Both damage and repair rates have been typically modelled as functions of the health state in descriptive models of aging (Taneja et al., 2016; Farrell et al., 2016; Farrell et al., 2018), but without a mechanistic relationship between them apart from that imposed statistically by the observed accumulated damage. The precise relationships – and whether they are a universal feature of all aging organisms – remains to be determined. Studies of interventions should prove useful in this regard, because they can separately target damage and repair.
Our observations that repair timescales are broadly distributed, up to lifespan-scales, raise three fundamental questions for resilience studies. First, are interventions that facilitate recovery similarly effective after a broad range of timescales? This would imply that we may be able to target resilience with interventions over a longer timeframe than just acutely when damage occurs. Damage propagation may nevertheless limit the benefits of such late repair. Second, what determines the recovery timescales? As we have shown (Figure 5—figure supplement 1), different health attributes can have quite different recovery times. Third, would a similar broad range of resilience timescales be observed for challenge experiments with an induced stressor, and how might that depend on the magnitude and scale of the damage?
We have defined damage in terms of discrete transitions of dichotomized variables. It is possible that dichotomized deficits probe qualitatively different timescales than the continuous measures that are often considered in resilience studies. Future experimental resilience studies across a range of health attributes should explore longer timescales. It will also be important to assess how the broad range of recovery timescales we have uncovered compare to timescales extracted from auto-correlations of physiological state variables – which have also been limited to shorter times (Pyrkov et al., 2021).
We have also limited our study to ‘clinical’ variables at organismal scales. Further studies of resilience and robustness at different biological scales from cellular to organismal, with both continuous and discrete variables, and over organismal timescales, will help us to better understand how damage and repair at cellular scales influences and is influenced by similar processes at organismal scales. It is easier to conceive of how damage can propagate from cellular to organismal (Howlett and Rockwood, 2013), but harder to conceptualize how cellular repair processes such as DNA repair pathways and autophagy (Kirkwood, 2011) might similarly propagate.
Most of the mouse health deficits analyzed here have previously been shown to reverse either spontaneously or in response to interventions including drug treatment or exercise (Supplementary file 1). There are only a few deficits that rarely or never repaired in the current study: cataracts, tumours, diarrhea, and prolapses. Both this study and the literature suggest that most deficits that make up the mouse frailty index can be reversed or repaired. Investigation of the specific mechanisms responsible for the spontaneous repair of deficits, and how they scale from the cell to the organism, should be the focus of future work. We speculate that spontaneous repair occurs by the same pathways that are targeted by health interventions. It is likely that deficits reverse if interventions target one or more of the molecular/cellular pillars of aging, including macromolecular damage, dysregulated stress response, disruption in proteostasis, metabolic dysregulation, epigenetic drift, inflammaging, and stem cell exhaustion (Goh et al., 2022). In terms of the specific interventions investigated here, previous studies have shown that beneficial effects of enalapril treatment and exercise on frailty are attributable, at least in part, to effects on chronic inflammation (Bisset et al., 2022; Keller et al., 2019).
The increasing availability of longitudinal health data over the lifespan of model aging organisms facilitates the analysis of damage and repair rates, and how they extend and change over the organismal lifespan. These damage and repair rates underlie the accumulation of damage that describes aging. Here we have shown the value of considering both resilience and robustness over the lifespan. Further studies will be able to determine how widespread organismal and sex differences in these effects are, and how universal they may prove to be. Studies of the effects on damage and repair rates of both targeted and systemic interventions will also be crucial. We have studied only three interventions or conditions so far (e.g. enalapril and exercise in mice, and wealth in humans). There are many other possibilities, including treatment with geroprotectors (Gonzalez-Freire et al., 2020) and lifestyle interventions, that could be deployed both in humans and in aging animal models.
Materials and methods
Mouse data
For the mouse portion of this manuscript, published data on longitudinal health-related deficits in C57BL/6 mice from three papers was used (Keller et al., 2019; Bisset et al., 2022; Schultz et al., 2020). A brief summary of the methods of each paper is below.
Mouse dataset 1 (Keller et al., 2019)
Male and female C57BL/6 mice were assessed for deficits approximately every 4 weeks from 16 to either 21 months of age (females) or 25 months of age (males). Mice were fed either a diet containing enalapril (280 mg/kg) or control diet for the duration of the experiment. After pre-processing (below), this data contains 21 female control mice, 25 female enalapril mice, 13 male control mice, and 25 male enalapril mice.
Mouse dataset 2 (Bisset et al., 2022)
Male and female C57BL/6 mice were assessed for deficits approximately every 2 weeks from 21 to 25 months of age. Mice were all singly housed, and half were provided a running wheel for voluntary exercise. After pre-processing (below), this data contains 11 female control mice, 11 female exercise mice, 6 male control mice, and 6 male exercise mice.
Mouse dataset 3 (Schultz et al., 2020)
Male C57BL/6Nia mice were assessed for deficits approximately every 6 weeks from 21 months of age until their natural deaths. After pre-processing (below), this data contains 44 male control mice.
Mouse clinical frailty index assessment
Each of the papers above assessed health deficits using the mouse clinical frailty index as described previously (Whitehead et al., 2014). Briefly, this assessment involves scoring 31 non-invasive health-related measures in mice. Most measures are scored as 1 for a severe deficit, 0.5 for a moderate deficit and a 0 for no deficit. Deficits in body weight and temperature were scored based on deviation from reference values in young adult animals, such that a difference of less than 1 SD was scored 0, a difference of ±1 SD was scored 0.25, a difference of ±2 SD was scored 0.5, a difference of ±3 SD was scored 0.75, and a difference of more than 3 SD received the maximal deficit value of 1 (Whitehead et al., 2014). The deficits of malocclusions and body temperature were not assessed in mouse group 3 (Schultz et al., 2020), leaving only 29 deficits for this dataset.
The variables in the Frailty Index are, ‘Alopecia’, ‘Fur color loss’, ‘Dermatitis’, ‘Coat condition’, ‘Loss of whiskers’, ‘Kyposis’, ‘Distended abdomen’, ‘Vestibular disturbance’, ‘Cataracts/corneal capacity’, ‘Eye discharge/swelling’, ‘Microphthalmia’, ‘Malocclusions’, ‘Rectal prolapse’, ‘Penile prolapse’, ‘Mouse grimace scale’, ‘Piloerection’, ‘Tail stiffening’, ‘Gait’, ‘Grip strength’, ‘Body condition’, ‘Hearing loss’, ‘Vision loss’, ‘Menace reflex’, ‘Tremor’, ‘Tumors/growths’, ‘Nasal discharge’, ‘Diarrhoea’, ‘Breathing rate/depth’, ‘Body temperature’, ‘Body weight’. The FI reference sheet at https://github.com/SinclairLab/frailty, shows examples of mice corresponding to the different levels of the deficits.
Mouse data pre-processing
For mouse dataset 1, we impute missing deficit values by propagating the last observed value forward. If the first observed deficit is missing, it is imputed by propagating the first observed value backward. Less than 1% of all total deficit values are missing in this dataset. No values in the other datasets are missing.
Deficits are scored on a fractional scale, with deficit having values . To treat these as binary, we represent each fractional deficit di by 4 ordered binary deficits, . Fractional deficits are then represented by setting of these ordered binary deficits to 1. For example if , this is represented as .
A new Frailty Index is then created by taking all of these new binary deficits, representing a item Frailty Index ( for mouse dataset 3). This process preserves the FI scores, and a single repair or damage transition on this scale can be interpreted as taking a step of size 0.25 on the fractional deficit scale. Each step of a deficit originally on the scale corresponds to 2 steps of size 0.25 on this new scale.
Measurement times with abnormally short or long intervals are removed. In mouse dataset 2, measurement times less than 0.1 months from the previous time are removed. In mouse dataset 3, measurement times more than 2 months from the previous time are removed.
In each dataset, mice with less than 2 observed time-points are removed.
Human data and pre-processing
We use human data from the English Longitudinal Study of Aging (Phelps et al., 2020; Steptoe et al., 2014). We select individuals that have full data for net household wealth and activities of daily living (ADL) and instrumental activities of daily living (IADL). A Frailty Index is created from the count of 10 possible ADLs and 13 possible IADLs, giving a fraction out of 23. Each of these variables are binary with values .
The ADLs are ‘Have difficulty’: ‘Walking 100 yards’, ‘Sitting for about two hours’, ‘Getting up from a chair after sitting for long periods’, ‘Climbing several flights of stairs without resting’, ‘Climbing one flight of stairs without resting’, ‘Stooping, kneeling, or crouching’, ‘Reaching or extending arms above shoulder level’, ‘Pulling/pushing large objects like a living room chair’, ‘Lifting/carrying over 10 lbs, like a heavy bag of groceries’, and ‘Picking up a 5 p coin from a table’. The IADLs are ‘Have difficulty’: ‘Dressing, including putting on shoes and socks’, ‘Walking across a room’, ‘Bathing or showering’, ‘Eating, such as cutting up your food’, ‘Getting in or out of bed’, ‘Using the toilet, including getting up or down’, ‘Using a map to get around a strange place’, ‘Preparing a hot meal’, ‘Shopping for groceries’, ‘Making telephone calls’, ‘Taking medications’, ‘Doing work around the house or garden’, and ‘Managing money, e.g. paying bills and keeping track of expenses’.
We restrict individuals to those that were recruited to the study between the ages of 50 and 89 years. We drop individuals with follow-up time intervals above 4 years and individuals with fewer than 6 follow-ups. This removes 15399 individuals from the dataset, 4326 of which only had a single time-point, 2291 had 2 time-points, 2095 had 3 time-points. The final selected individuals are followed for between 13 and 18 years, with 60% of the individuals being followed for 16 years.
We use net household wealth, as determined in the financial assessment in wave 5 of the ELSA data. We drop individuals that have parts of this assessment imputed. The raw value of net household wealth spans several orders of magnitude (and includes negatives for individuals in debt), and so is transformed by .
After pre-processing, this data contains 1049 males and 1300 females with time-intervals of approximately 2 years between observations. There are 1222 individuals from baseline ages in , 827 with baseline ages in , 281 with baseline ages in , and 19 with baseline ages in .
Extracting damage and repair counts
In each dataset, we observe the state of binary health deficits for each subject at a set of observation times . Summing up the number of deficits at each time, we get deficit counts for each observation time, , which is used to compute the Frailty Index .
We compute the number of deficits damaged ( transitions) and repaired ( transitions) between two time points tj and , denoted as or . These counts satisfy , linking these damage and repair processes with the Frailty Index.
Modelling
We model deficit repair and damage as Poisson point processes with time-dependent rates, and . The count of deficits repaired or damaged in an interval is assumed to follow a Poisson distribution, with mean equal to the instantaneous rate or integrated over this interval times the number of possible deficits available to be repaired , or damaged . For computational convenience, we use constant rates within each time-interval to approximate these integrals, or .
We perform Bayesian inference on our models by inferring the posterior distribution of the parameters given the observed data.
Joint longitudinal-survival model for mice data
We use a joint modelling framework to model repair and damage rates, while assessing their effect on survival. We decompose the multivariate joint distribution of the observed longitudinal damage and repair counts and survival outcome by coupling survival with the repair and damage rates and (Hickey et al., 2016; Brilleman et al., 2020),
We indicate final follow-up times for individual as with a censoring indicator ci where 1 is censored and 0 is an observed death.
Longitudinal component
We use a linear Poisson model for the longitudinal damage and repair rates. A Softplus function, , is used to enforce positive rates. This function is chosen because it is approximately linear for larger values of , in contrast to which is often used for Poisson models (which resulted in poor behaviour for our models). The form of this model is,
The first two equations describe the time-dependent repair and damage rates, and . These rates represent the probability of repair or damage, per deficit per unit time. These rates are multiplied by the number of deficits that can repair or the number that can damage and the time-interval to compute the mean count of repaired or damaged deficits for Poisson distributions. The last Equation 5 shows how we can compute the total count of deficits from this model, allowing this model to be used to model the Frailty Index as well.
The full-cohort parameters are denoted and the subject-specific intercept and time-slopes . The variables include the covariates and their interactions with sex and intervention group,
The “treatment" variable is a 0/1 indicator for enalapril in mouse group 1 or exercise in mouse group 2. The other variables are the time from baseline , the Frailty Index , baseline age a0, and sex (M/F). These interactions allow sex and intervention group specific time-slopes.
The repair and damage processes are linked by including correlations between the subject-specific parameters .
Survival component
We jointly model these repair and damage processes with survival, with proportional hazards and a baseline hazard parameterized with M-splines (Ramsay, 1988) (which are always non-negative). The damage and repair processes are linked with survival by including damage rate and repair rate in the hazard rate,
The first equation describes the hazard rate in terms of the covariates and the repair and damage rates. The baseline hazard is modeled with sex-specific splines, due to the large disparity in survival by sex. The covariates are . The functions are M-spline basis functions of order 3 with an -component knot vector .
Priors and hyperparameters
We use weakly informative priors to regularize parameters,
Broad priors are used on intercept parameters and narrow priors on covariate coefficients (Equation 8). The covariance matrix for the coupling of the subject-specific parameters is decomposed in terms of a correlation matrix with an LKJ prior and standard deviations with half-Cauchy distributions (Equation 9). The LKJ distribution is a standard prior for correlation matrices, where is a uniform distribution over correlation matrices (Lewandowski et al., 2009). Increasing results in a sharper peak at the identity matrix.
These weak priors have the effect of making large parameter values unlikely (all parameters are put on the same scale by standardizing the values of all covariates first to mean 0 and variance 1), which improves the computational speed of the MCMC sampler. Choice of such weak priors affect quantitative results to a small extent, but do not affect our qualitative results – like other hyperparameters.
Spline coefficients use a Dirichlet distribution with concentration 1, representing a uniform prior on the simplex . We use spline knots with knots at the minimum last follow-up age, the maximum, and 15 uniformly spaced quantiles from 0.05 to 0.95 of the last follow-up age (Equation 10).
Integrals of the hazard rate are computed with 5-point Gaussian Quadrature between each observed time interval.
Non-linear modeling for human data
There is much more human data than mice and the data is more complex, where linear effects are not sufficient to capture the combined influence of wealth, baseline age, and time. We use a non-linear Poisson model with non-constant coefficients to include additional degrees of freedom. We parameterize these non-constant coefficients with B-splines. The individuals selected from ELSA with wealth data do not have mortality data available, simplifying the model from the joint model used above for mice.
Our model has the form,
where denotes wealth and a0 denotes baseline age and,
The non-constant coefficients are implemented as 2D B-splines for wealth and baseline age with 5 wealth knots and 5 baseline age knots at the minimum, maximum and terciles of these variables. We use smoothing 2D random-walk priors on the spline coefficients,
All other priors are the same as in the mouse modelling.
Note, in the human data we do not include subject-specific time-slopes and as we did in the mouse data, since we have much shorter time-series. When these slopes are included, we see evidence of the model over-fitting to the data by the proportion of residuals including zero within 95% credible intervals being much higher than 0.95 – nearing 0.99 to 1.00.
Derivatives
We can compute the derivative of the Frailty Index according to the modelled repair and damage rates,
To understand the effect of interventions, we compute the derivative with respect to time for the repair and damage rates,
This is the slope of these rates vs time, with the increase in the Frailty Index included. While we only include explicit linear effects of time in the model, the increase in Frailty Index with time can influence the derivative to change.
We can compute the curvature as the second derivative of the Frailty Index with age, written in terms of first derivatives of the rates,
The first group of terms are those involving damage rate (robustness) and the second group of terms are those involving repair (resilience). These terms are plotted in Figure 3.
Repair and damage timescale mice and human data
We observe the amount of time that has passed between damage and repair events, and vice versa. This can be used to determine the time-scales of these damage and repair processes. However, since a deficit might damage and the individual dies before the deficit is ever repaired, there is right censoring. Additionally, since observations are only made at specific time-points so that we cannot determine the exact time at which a deficit damaged or repaired, there is interval censoring. To estimate the distribution of repair and damage times, we treat repair and damage events for each deficit as a mixture of interval and right censored events (Zhao et al., 2008; Zhaeo, 2012). Accordingly, we model state-survival curves for the damaged state (time-scale of resilience) and undamaged state (time-scale of robustness).
We use a Bayesian survival model with M-splines for the baseline hazard,
This is fit separately for sex and control/intervention groups.
We include both interval-censoring and right-censoring in the likelihood for individual ,
where is the lower interval bound, is the upper interval bound, is a time of right censoring, and is the right censoring indicator (c=1 event censored, c=0 event occurs). We use 32 knots set at 30 evenly spaced quantiles of the event times from 0.1 to 0.9 together with the minimum and maximum event time. A uniform prior is used for the spline coefficients and a broad prior for .
MCMC sampling
We use the STAN no U-turn sampler (NUTS) (Stan Development Team, 2020). We use 4000 warm-up iterations and 6000 sampling iterations on four separate chains for the mouse joint models. For mouse dataset 2, we use the sampler settings adapt_delta = 0.95, max_treedepth = 20 to avoid divergent sampler transitions. For the human model, we use two separate chains with 1000 warm-up iterations and 3000 sampling iterations. For the interval-censored Bayesian survival models, we use 2000 warm-up iterations and 3000 sampling iterations for four separate chains. Number of sampling iterations are chosen to achieve adequate effective sample sizes, while remaining computationally feasible.
In Figure 2—figure supplement 1 we perform posterior predictive checks (Gabry et al., 2019; Gelman et al., 2020) for the mice and human models by plotting observed and simulated distributions of counts. We also compute statistics (Vehtari et al., 2021) and the coverage of credible intervals for residuals.
Sensitivity analysis
Some of the observed damage and repair transitions may be due to measurement error or data entry errors. In particular, we believe this may be the case for some of the isolated damage and repair transitions. For example, if we consider 5 time-points where a variable has values , this may be an erroneous transition.
Under the assumption that such erroneous transitions will most likely occur isolated from true damage/repair, we prune the data by removing these transitions, e.g. becomes (erroneous damage) or becomes (erroneous repair). We only prune damage/repair events isolated by 2 of the opposite state on either side, as shown here. In Figure 5—figure supplement 3, we show that pruning these values only has a limited effect.
Code and data availability
Our code for data pre-processing, modelling, and plotting is available https://github.com/Spencerfar/aging-damagerepair, (Farrell, 2022 copy archived at swh:1:rev:4fe6f883d37dda6b9059c53aa9366f4ff2665a43). Human ELSA data can be accessed by agreeing to an End User Licence https://www.elsa-project.ac.uk/accessing-elsa-data and downloading waves 1–9. Mouse dataset 3 is freely available at https://github.com/SinclairLab/frailty, (Sinclair Lab, 2022; Schultz et al., 2020).
Data availability
Source data files for all figures and summary statistics for all fitting parameters and diagnostics of the models are provided. Only pre-existing datasets were used in this study. Information about the datasets and data cleaning is in the methods section. Raw data for mouse dataset 3 are freely available from https://github.com/SinclairLab/frailty. Raw human data are available from https://www.elsa-project.ac.uk/accessing-elsa-data after registering. All code is available at https://github.com/Spencerfar/aging-damagerepair, (copy archived at swh:1:rev:4fe6f883d37dda6b9059c53aa9366f4ff2665a43). Our code for cleaning these raw datasets is provided.
References
-
“ physiological dysregulation ” as a promising measure of robustness and resilience in studies of aging and a new indicator of preclinical diseaseThe Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 74:462–468.https://doi.org/10.1093/gerona/gly136
-
Aerobic exercise attenuates frailty in aging male and female c57bl/6 mice and effects systemic cytokines differentially by sexThe Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 77:41–46.https://doi.org/10.1093/gerona/glab297
-
A complex systems approach to aging biologyNature Aging 2:580–591.https://doi.org/10.1038/s43587-022-00252-6
-
Two approaches to classifying and quantifying physical resilience in longitudinal dataThe Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 75:731–738.https://doi.org/10.1093/gerona/glz097
-
Network model of human aging: frailty limits and information measuresPhysical Review. E 94:052409.https://doi.org/10.1103/PhysRevE.94.052409
-
SoftwareDamage and repair processes in aging, version swh:1:rev:4fe6f883d37dda6b9059c53aa9366f4ff2665a43Software Heritage.
-
The impact of age and frailty on ventricular structure and function in C57BL/6J miceThe Journal of Physiology 595:3721–3742.https://doi.org/10.1113/JP274134
-
Frailty in older adults: evidence for a phenotypeThe Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 56:M146–M156.https://doi.org/10.1093/gerona/56.3.m146
-
Visualization in bayesian workflowJournal of the Royal Statistical Society 182:389–402.https://doi.org/10.1111/rssa.12378
-
The hoverfly and the wasp: a critique of the hallmarks of aging as a paradigmAgeing Research Reviews 70:101407.https://doi.org/10.1016/j.arr.2021.101407
-
Dynamical resilience indicators in time series of self-rated health correspond to frailty levels in older adultsThe Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 72:991–996.https://doi.org/10.1093/gerona/glx065
-
Resilience in clinical care: getting a grip on the recovery potential of older adultsJournal of the American Geriatrics Society 67:2650–2657.https://doi.org/10.1111/jgs.16149
-
The road ahead for health and lifespan interventionsAgeing Research Reviews 59:101037.https://doi.org/10.1016/j.arr.2020.101037
-
Differences in frailty in older men and womenThe Medical Journal of Australia 212:183–188.https://doi.org/10.5694/mja2.50466
-
Report: NIa workshop on measures of physiologic resiliencies in human agingThe Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 72:980–990.https://doi.org/10.1093/gerona/glx015
-
Joint modelling of time-to-event and multivariate longitudinal outcomes: recent developments and issuesBMC Medical Research Methodology 16:117.https://doi.org/10.1186/s12874-016-0212-5
-
New horizons in frailty: ageing and the deficit-scaling problemAge and Ageing 42:416–423.https://doi.org/10.1093/ageing/aft059
-
Implementation of the mouse frailty indexCanadian Journal of Physiology and Pharmacology 95:1149–1155.https://doi.org/10.1139/cjpp-2017-0025
-
A murine frailty index based on clinical and laboratory measurements: links between frailty and pro-inflammatory cytokines differ in a sex-specific mannerThe Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 74:275–282.https://doi.org/10.1093/gerona/gly117
-
Sex differences in frailty: comparisons between humans and preclinical modelsMechanisms of Ageing and Development 198:111546.https://doi.org/10.1016/j.mad.2021.111546
-
Chronic treatment with the ACE inhibitor enalapril attenuates the development of frailty and differentially modifies pro- and anti-inflammatory cytokines in aging male and female C57BL/6 miceThe Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 74:1149–1157.https://doi.org/10.1093/gerona/gly219
-
Resilience in aging miceThe Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 71:1407–1414.https://doi.org/10.1093/gerona/glw086
-
Systems biology of ageing and longevityPhilosophical Transactions of the Royal Society of London. Series B, Biological Sciences 366:64–70.https://doi.org/10.1098/rstb.2010.0275
-
Generating random correlation matrices based on vines and extended onion methodJournal of Multivariate Analysis 100:1989–2001.https://doi.org/10.1016/j.jmva.2009.04.008
-
Accumulation of deficits as a proxy measure of agingTheScientificWorldJournal 1:323–336.https://doi.org/10.1100/tsw.2001.58
-
Relative fitness and frailty of elderly men and women in developed countries and their relationship with mortalityJournal of the American Geriatrics Society 53:2184–2189.https://doi.org/10.1111/j.1532-5415.2005.00506.x
-
Going from bad to worse: a stochastic model of transitions in deficit accumulation, in relation to mortalityMechanisms of Ageing and Development 127:490–493.https://doi.org/10.1016/j.mad.2006.01.007
-
Trajectories of changes over twelve years in the health status of canadians from late middle ageExperimental Gerontology 47:893–899.https://doi.org/10.1016/j.exger.2012.06.015
-
A multi-state model for the analysis of changes in cognitive scores over a fixed time intervalStatistical Methods in Medical Research 23:244–256.https://doi.org/10.1177/0962280211406470
-
Men: good health and high mortality sex differences in health and agingAging Clinical and Experimental Research 20:91–102.https://doi.org/10.1007/BF03324754
-
Monotone regression splines in actionStatistical Science 3:425–441.https://doi.org/10.1214/ss/1177012761
-
Changes in a frailty index and association with mortalityJournal of the American Geriatrics Society 69:1057–1062.https://doi.org/10.1111/jgs.17002
-
The emergence of geroscience as an interdisciplinary approach to the enhancement of health span and life spanCold Spring Harbor Perspectives in Medicine 6:a025163.https://doi.org/10.1101/cshperspect.a025163
-
Moving geroscience from the bench to clinical care and health policyJournal of the American Geriatrics Society 69:2455–2463.https://doi.org/10.1111/jgs.17301
-
Cohort profile: the english longitudinal study of ageingInternational Journal of Epidemiology 42:1640–1648.https://doi.org/10.1093/ije/dys168
-
Dynamical network model for age-related health deficits and mortalityPhysical Review. E 93:022309.https://doi.org/10.1103/PhysRevE.93.022309
-
Decline in biological resilience as key manifestation of aging: potential mechanisms and role in health and longevityMechanisms of Ageing and Development 194:111418.https://doi.org/10.1016/j.mad.2020.111418
-
A clinical frailty index in aging mice: comparisons with frailty index data in humansThe Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 69:621–632.https://doi.org/10.1093/gerona/glt136
-
BookInterval-Censored Time-to-Event Data: Methods and ApplicationsFlorida, United States: CRC Press.
-
Generalized log-rank tests for partly interval-censored failure time dataBiometrical Journal. Biometrische Zeitschrift 50:375–385.https://doi.org/10.1002/bimj.200710419
-
Education, wealth, and duration of life expected in various degrees of frailtyEuropean Journal of Ageing 18:393–404.https://doi.org/10.1007/s10433-020-00587-2
Decision letter
-
Jamie JusticeReviewing Editor; Wake Forest School of Medicine, United States
-
Carlos IsalesSenior Editor; Augusta University, United States
-
Konstantin ArbeevReviewer; Duke University, United States
Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.
Decision letter after peer review:
Thank you for submitting your article "Measurements of damage and repair in aging mice and humans reveals that robustness and resilience decrease with age, operate over broad timescales, and are affected differently by interventions" for consideration by eLife. Your article has been reviewed by 4 peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Carlos Isales as the Senior Editor. The following individual involved in the review of your submission has agreed to reveal their identity: Konstantin Arbeev (Reviewer #3).
The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.
Essential revisions:
1. A key assumption is that an increase in FI is equivalent to a rise in damage, but this relationship is unknown and may ignore a number of important biological processes – including the dynamic interplay between both damage, compensatory processes/feedback, and protection/fitness. There is lingering concern that this may be over-conceptualized, and the true relationship between frailty index and damage is unknown. This should be addressed in the manuscript rationale. Language in discussion and conclusions should be tempered.
2. Additionally, 'repair' assumes a biological process to restore homeostasis, but in this manuscript refers to a switch from out-of-range to a return to in-range for variables and cut-points that are not defined in the manuscript. The issue of measurement error in individual components that switch from out-of-range to in-range or vice versa needs to be addressed. Variables and cutpoints must be detailed. Authors should also distinguish biological processes from mechanical definitions in rationale and discussion.
3. Some variables may represent irreversible processes of deterioration (e.g. loss of whiskers, kyphosis, palpable/visible tumor, etc.), while others may be more dynamic or have cutpoints with unknown validity for in vs out of range. It is anticipated that only a subset of those queried contributes to 'repair' while others irreversibly increase over time. This would challenge the validity of 'repair' models (and interpretation of Figure 5). As above, please include details on specific variables and their cutpoints. Please indicate which of these variables change dynamically over time – with evidence of both 'deficit' and 'repair' – versus those that are irreparable by definition or thresholds.
4. Additional conceptual issues are present in mouse models given typically shorter time scales and acute stress stimulus – in addition to the issues above. Most reviewers have requested additional details to support assumptions made in mouse models.
5. Detailed total follow-up time/time-course in both animal and human data is needed. For example, statistics on total follow-up times in human data. This is important for relative time courses across species and how many times deficits can be damaged/repaired within a given time or relative period of life.
6. Concern over the impact of selection bias/survivor bias in models (especially mice). Please address in the model and discussion.
7. Abstract requires clarity. Data, tests performed, and specific results should be presented as appropriate.
8. Please address specific recommendations and editorial suggestions made by each reviewer below.
Reviewer #1 (Recommendations for the authors):
I commend the authors for their efforts. The modeling of longitudinal data is an important area in aging that needs more attention. I also like the approach at a conceptual level and think the stratification of resilience vs robustness could be quite valuable. I appreciate the idea of making models that are generalizable and think the approach is most appropriate for the human data, but I think more attention needs to be paid to the technical and biological aspects of the variables being modeled for the mouse data.
General comments:
The mouse frailty index is a very useful tool for efficiently measuring the organismal state in large cohorts. A tradeoff for quickly measuring a broad range of health domains is that the individual measurements are low resolution and involve inherent subjectivity (which may be considered measurement error). It is very important to keep this in mind when using individual scores from the index. Some transitions are due to random measurement error and I believe this is especially likely with decreases (or 'resilience' transitions).
While I find the 'robustness' modeling quite interesting, I am skeptical as to how valid the 'resilience' modeling is. The first reason for this is that I do not believe many of the deficits in the mouse index are reversible under normal physiologic conditions. For example, it is exceptionally unlikely for a palpable/visible tumor to resolve in an aged mouse, thus any reversal that was observed must be due to random measurement error. I would conservatively estimate that the following components are not truly reversible in the contexts in which they were studied here: alopecia, loss of fur color, loss of whiskers, tumors, kyphosis, hearing loss, cataracts, corneal capacity, vision loss, rectal prolapse, genital prolapse.
When used as a sum of its components, the mouse frailty index is highly robust to these measurement errors as demonstrated many times in the literature. However, breaking apart the index into its components compromises this robustness, and I am not aware of literature which has validated the use of individual components of the index. How do the authors account for the measurement error which is inherent to the individual components of the mouse frailty index, and how might this error affect their models? I would be particularly concerned with how this would affect the models in figure 5.
Additionally, the authors state that "the observed damage is thought to occur due to natural stochastic transitions." I do not necessarily agree with that characterization for many of the pathologies measured here. For example, I would not consider vision loss a stochastic transition but rather a gradual degenerative process. Stochasticity involves when it crosses a threshold that can be measured/detected by the researcher. The authors state the following assumption: "We assume these time intervals are small so that we use constant rates within each time-interval to approximate these integrals." Small is subjective and the time intervals are up to 2 months which is 7-10% of a mouse lifespan. Can the authors justify this assumption and/or discuss the implications of such an assumption to their models? The time scale should also be considered in the context of the specific measurements being taken.
My final high-level comment regards selection bias. As these studies progress and individuals die, the study population drastically changes. How do the authors' models account for this selection bias? Can the results of the modelling tell us anything about the nature of the populations as selection occurs? And finally, would there be value in performing this modeling as a function of "biological age" (ie. time to death) versus chronological age?
Specific comments:
The authors should specify which packages and versions were used for the development of their models. When I ran some of the scripts from Github they threw pandas warnings, leading me to suspect that I am not using the development versions. This is important if the authors expect future use of their models. I would encourage the authors to include package information both in the manuscript and on the Github.
I would recommend the authors be specific when describing their results. Many complex models and statistical tests are generated and it sometimes is unclear what is being referred to. For example, in the section "The acceleration of damage accumulation…" the statement "In mice, this is seen in every dataset and is significant at the indicated ages" is unclear what statistical test is being referred to. I would also recommend the authors be more precise in their result descriptions. For example, in the section "Interventions modify damage…" the statement "This curvature is strongly reduced…" is subjective and would be better supported by a description of specific statistical results.
The authors should be careful with the language used regarding wealth in the human dataset. Wealth is not an intervention. And wealth per se does not influence frailty as worded in methods. In these data wealth is a variable which is often associated with health outcomes including frailty.
In discussion: "…where female live longer but have higher FI scores than males." Female humans live longer but female B6 mice usually live shorter.
The citation the authors use for ELSA points to the database. PMID 23143611 also needs to be cited where the study and data collection procedures are described. It also seems like an omission to not cite PMID 31665163 which explores the relationship between various factors, including wealth, and frailty in the same dataset. I would also recommend the authors include a supplemental table with the specific mouse measurements and human questionnaire items that comprise the respective frailty indices.
The authors should provide more detail on the human frailty variables in the methods. I am assuming the binary responses to questions HEADLA and HEADLB were used, but this should be clarified/explicitly stated in the methods.
Comments on mouse pre-processing methods: In the mouse frailty index, 29 out of 31 parameters are measured on a [0, 0.5, 1] scale, with only weight and temperature using [0, 0.25, 0.5, 0.75, 1] scale, and it looks to me like the Schultz dataset still used [0, 0.5, 1] for these variables. So I don't find the statement "a single repair or damage transition can be interpreted as taking a step of size 0.25 on the fractional deficit scale" to be accurate because 29 out of 31 parameters only have the resolution to move in 0.5 unit increments.
Page 5: Authors state that "in each of the datasets, there is a strong decrease in repair rates and increase in damage rates with age (except in mouse dataset 2 for damage rates)." This doesn't seem to be the case for repair rates in humans entering ELSA study at age 50-70 (CI for p cross 0).
I am also somewhat unclear about the values for the rates in figure 2. In figure S2-1 the authors show that the model fits the data reasonably well. However, when looking at the rates in figure 2, the repair rates appear to be consistently higher than the damage rates. For example, in the Keller dataset males at 22 months, as frailty is consistently increasing, the modeled repair rate appears to be nearly double the modelled damage rate. Overall, the comparative magnitudes of the rates don't seem consistent with the mouse data, where there it is relatively rare to see a "population level" reduction in frailty at a subsequent timepoint (tends to stay the same or increase over time as shown in Figure 2S-2). Same comment for figure 4S2 human data.
Figure S2-1. It is hard to see the bars of the histogram in comparison to the dots. I would recommend making the dots smaller. Quantifying and plotting the deviation of observed vs posterior samples may also make interpretation easier.
The repair rate in the human 70-80 data (2d) appears to have a negative curvature, but this doesn't seem to be reflected in the second derivative plot (3e, teal line centered on zero).
I find the models in figure 5 somewhat difficult to interpret considering the extensive censoring that is occurring. Could the authors attempt a more 'plain language' interpretation of these curves considering the censoring? It seems like the interpretation would be that at 3 months after damage there is only a 75% probability of that damage being repaired? Also, could the authors interpret the drastic reduction in probability occurring in both damage and repair of the Schultz data at ~3 months?
Reviewer #3 (Recommendations for the authors):
I find the result on sex differences in effects of interventions on robustness and resilience very interesting so consider mentioning this in the abstract.
I would like to see more discussion on why deficits can be repaired, especially for some deficits which may seem to represent irreversible processes of deterioration. One example that is not trivial for me to understand (as I am not a specialist in mice) is the repair of the deficit "loss of whiskers." Do they just grow up again after a while? If they do, what constitutes the event of repair (just when they start growing again or when they are fully grown, assuming there is a measure for this)?
Some statistics on the total follow-up times in human data would be helpful. It can help put the estimates of repair/damage scales in the context to provide insight on how many times the deficit can potentially be damaged/repaired in the time period of data collection.
As you indicated in Methods, "The individuals selected from ELSA with wealth data do not have mortality data available, simplifying the model from the joint model used above for mice." In Results, you wrote, "This is a joint longitudinal-survival model, which couples the damage and repair rates together with mortality." This narrative should be changed then to reflect the fact that the joint model was only used in mice data.
Some readers may not be familiar with Bayesian methods so it would help provide in Methods some discussion on the selection of priors – is this a convenience/traditional choice or some other reasons? Also, some narrative on the sensitivity of results to a different choice of priors would be helpful, in my view.
https://doi.org/10.7554/eLife.77632.sa1Author response
Essential revisions:
1. A key assumption is that an increase in FI is equivalent to a rise in damage, but this relationship is unknown and may ignore a number of important biological processes – including the dynamic interplay between both damage, compensatory processes/feedback, and protection/fitness. There is lingering concern that this may be over-conceptualized, and the true relationship between frailty index and damage is unknown. This should be addressed in the manuscript rationale. Language in discussion and conclusions should be tempered.
The frailty index (FI) is a summary health measure that is useful to predict (and correlates with) many adverse outcomes. It is a percentage of individual health attributes that are considered “damaged”. Individual binarized attributes are not absolute: “damaged” (or “deficit”) attributes are not entirely damaged, while “healthy” attributes are not entirely healthy. Nevertheless, we are considering transitions between these binary variables, so we use the language of damage and repair. Since the FI is the average of the binary attributes, we use the same language for changes of the FI to distinguish increases of FI (damage) from decreases (repair).
While the FI is a summary measure of health it is not the only one, and different summary measures will provide different natural measures of damage and repair. We have tempered our language throughout to make it more clear what we have done (with binary variables) and what questions this raises (with the connection to non-binary approaches and conceptualizing damage). In particular we ignore nonlinearities (compensatory processes, feedback, protection) and do not assess overall fitness apart from the FI. We have also added a caveat paragraph in the discussion (p.6 lines 251-258). See also penultimate paragraph in the introduction (p.2 lines 56-64).
2. Additionally, 'repair' assumes a biological process to restore homeostasis, but in this manuscript refers to a switch from out-of-range to a return to in-range for variables and cut-points that are not defined in the manuscript. The issue of measurement error in individual components that switch from out-of-range to in-range or vice versa needs to be addressed. Variables and cutpoints must be detailed. Authors should also distinguish biological processes from mechanical definitions in rationale and discussion.
As mentioned above, we define “repair” to be a binary transition from deficit (1) to healthy (0) state. We report aggregate repair or damage rates of individual binarized attributes. Whether these correspond to organismal repair or damage conceptualized differently is an interesting question that we do not address. Given the proven utility of the FI to predict (or associate with) adverse outcomes, we do expect that aggregate repair (resilience) of individual binarized attributes is associated with (but not the same as) improved health at the population level as assessed by other health measures – and similarly with damage.
To be clear, we start with binarized variables (a mechanical definition) and analyze aging in that context. We obtain results that capture biological processes of aging, but through the lens of the binarized variables. We are not asserting that the biological processes are binary, but simply that binary variables can be used to study aging. Given that much of medicine uses binary variables (in diagnosis and treatment), and that much aging data is only binary (from e.g. self-report data) we feel that this is a useful and promising approach. We have clarified these points in the text (see e.g. p.2 lines 56-64 and p.6 lines 251-258). In the methods (p.8 and 9), we have also included more details about the mouse and human deficits used.
Transitions of the binarized variables could correspond to small changes of underlying continuous processes, which also raises the question of measurement error resulting from small changes in the underlying biological state. While we cannot avoid measurement error, we do not feel that it causes our key results. We have two reasons. The first is that we do not expect age-dependence in measurement error, so we do not feel that the age-dependence we observe for damage and repair are due to measurement error. Second, we have performed a sensitivity analysis by removing potentially erroneous damage and repair events. For adjoining and isolated damage and repair events characterized by a sequence of variable states {0,0,1,0,0} or {1,1,0,1,1} (i.e. isolated states surrounded by at least two previous and two subsequent of the opposite state), we remove the damage/repair event resulting in states {0,0,0,0,0} or {1,1,1,1,1}. With these isolated events removed, we see little change in the time-dependence of the repair and damage rates vs age and the time-scales of resilience and robustness when compared to our original results. We have added Figure 5 —figure supplements 3 and 4 and Section 5 in the methods describing this approach. We have also added discussion of measurement errors in the discussion (p.6 lines 259-268). We also note that measurement error for mice has been assessed previously, and is small (Feridooni et al., 2015; Kane et al., 2017).
3. Some variables may represent irreversible processes of deterioration (e.g. loss of whiskers, kyphosis, palpable/visible tumor, etc.), while others may be more dynamic or have cutpoints with unknown validity for in vs out of range. It is anticipated that only a subset of those queried contributes to 'repair' while others irreversibly increase over time. This would challenge the validity of 'repair' models (and interpretation of Figure 5). As above, please include details on specific variables and their cutpoints. Please indicate which of these variables change dynamically over time – with evidence of both 'deficit' and 'repair' – versus those that are irreparable by definition or thresholds.
The reviewers have pointed out that some of the variables measured in the mouse frailty index may be “irreversible” processes and thus not susceptible to any extrinsic or intrinsic repair processes. This is a good question that we have addressed in our revised manuscript. We have considered all the deficits in the mouse frailty index to determine whether there is evidence that they can repair, either spontaneously through intrinsic repair processes or extrinsically by interventions such as drug treatment or lifestyle changes (e.g. exercise, caloric restriction, drug treatment etc). Interestingly, we found that virtually all the deficits in the frailty index can potentially reverse, especially in response to an intervention. The only deficits that may not reverse on their own or in response to interventions are microphthalmia (but this is an active area of drug discovery) and vaginal/uterine or penile prolapses (but these can be corrected surgically). We have created Supplementary File 1 (with references) to support the concept that almost all deficits in the murine frailty index can repair either on their own or in response to interventions, including many interventions that are used in aging research. We have added a new paragraph that discusses this to our revised manuscript (p.3 lines 99-105). We have also listed all health attributes used in this study in the methods (p.8 and 9).
Despite ubiquitous repair, not all deficits repair equally. We have included Figure 5 —figure supplement 4 showing the repair counts for each deficit in the 3 mouse datasets. In particular, we find deficits like “Cataracts”, “Tumours”, “diarrhea”, and “vaginal/uterine/penile prolapses” are rarely repaired. We also show which damage/repair events are removed in the sensitivity analysis of isolated damage/repair events. This shows that some of these deficits that are rarely repaired may also be the result of measurement error, but as above, this small amount error does not effect the results due to the rarity of repair for these deficits.
A broader question is how our results depend on the underlying binarization thresholds of individual variables. We have not explored that question here (and cannot for the majority of our variables that are only measured as binary or ordinal values), rather we have used the binarized attributes as previously published. While we do not expect that they will drastically change our aggregate results, we do expect that detailed transition rates of individual health attributes will depend on the binarization thresholds. Previous work on binarization thresholds of the FI has shown weak dependence of predictive power on small changes of the thresholds (Stubbings, G., Farrell, S., Mitnitski, A., Rockwood, K. and Rutenberg, A. Informative frailty indices from binarized biomarkers. Biogerontology 21, 345–355 (2020)). This is an interesting question for future study in the context of robustness and resilience.
4. Additional conceptual issues are present in mouse models given typically shorter time scales and acute stress stimulus – in addition to the issues above. Most reviewers have requested additional details to support assumptions made in mouse models.
As discussed in the paper (p.6 lines 239-250) we do not use acute stress stimuli, and our timescales are limited to the available data (p.6 lines 251-258).
5. Detailed total follow-up time/time-course in both animal and human data is needed. For example, statistics on total follow-up times in human data. This is important for relative time courses across species and how many times deficits can be damaged/repaired within a given time or relative period of life.
We have included follow-up times (p. 8 and 9).
6. Concern over the impact of selection bias/survivor bias in models (especially mice). Please address in the model and discussion.
The mouse study was performed without replacement and studied with a joint-model to capture survivor effects. The human study contained dropout and replacement, but without knowledge of mortality (so the joint survival model was not used for humans). We did not model drop-out effects in the human study. We have added a paragraph on potential population biases in the discussion (p.6 lines 259-268).
7. Abstract requires clarity. Data, tests performed, and specific results should be presented as appropriate.
We have clarified our abstract.
8. Please address specific recommendations and editorial suggestions made by each reviewer below.
See below.
Reviewer #1 (Recommendations for the authors):
I commend the authors for their efforts. The modeling of longitudinal data is an important area in aging that needs more attention. I also like the approach at a conceptual level and think the stratification of resilience vs robustness could be quite valuable. I appreciate the idea of making models that are generalizable and think the approach is most appropriate for the human data, but I think more attention needs to be paid to the technical and biological aspects of the variables being modeled for the mouse data.
General comments:
The mouse frailty index is a very useful tool for efficiently measuring the organismal state in large cohorts. A tradeoff for quickly measuring a broad range of health domains is that the individual measurements are low resolution and involve inherent subjectivity (which may be considered measurement error). It is very important to keep this in mind when using individual scores from the index. Some transitions are due to random measurement error and I believe this is especially likely with decreases (or 'resilience' transitions).
While I find the 'robustness' modeling quite interesting, I am skeptical as to how valid the 'resilience' modeling is. The first reason for this is that I do not believe many of the deficits in the mouse index are reversible under normal physiologic conditions. For example, it is exceptionally unlikely for a palpable/visible tumor to resolve in an aged mouse, thus any reversal that was observed must be due to random measurement error. I would conservatively estimate that the following components are not truly reversible in the contexts in which they were studied here: alopecia, loss of fur color, loss of whiskers, tumors, kyphosis, hearing loss, cataracts, corneal capacity, vision loss, rectal prolapse, genital prolapse.
All mouse deficits are demonstrably reversible (see our new Supplementary File 1).
When used as a sum of its components, the mouse frailty index is highly robust to these measurement errors as demonstrated many times in the literature. However, breaking apart the index into its components compromises this robustness, and I am not aware of literature which has validated the use of individual components of the index. How do the authors account for the measurement error which is inherent to the individual components of the mouse frailty index, and how might this error affect their models? I would be particularly concerned with how this would affect the models in figure 5.
Measurement error is an understudied effect in aging research; it is not typically assessed or reported. We expect that measurement error would be a source of “white noise”, that would contribute to both damage and repair at the same rate. Effects of age or intervention should not be affected by such measurement errors, and these effects make up our results. We have checked the sensitivity of the results of Figure 5 to exclude (“fix”) damage or repair events that occur in isolation away from the first or last element in a time-series (so “11011” would be fixed to “11111”). The results are shown in Figure 5 —figure supplements 3 and 4, and do not affect our qualitative conclusions. We also note that measurement error for mice has been assessed previously, and is small (Feridooni et al., 2015; Kane et al., 2017).
Additionally, the authors state that "the observed damage is thought to occur due to natural stochastic transitions." I do not necessarily agree with that characterization for many of the pathologies measured here. For example, I would not consider vision loss a stochastic transition but rather a gradual degenerative process. Stochasticity involves when it crosses a threshold that can be measured/detected by the researcher.
We have improved the wording. We were contrasting natural processes vs applied stressors.
The authors state the following assumption: "We assume these time intervals are small so that we use constant rates within each time-interval to approximate these integrals." Small is subjective and the time intervals are up to 2 months which is 7-10% of a mouse lifespan. Can the authors justify this assumption and/or discuss the implications of such an assumption to their models? The time scale should also be considered in the context of the specific measurements being taken.
This approximation is implemented for computational convenience so that we can use Poisson statistics despite having non-zero timesteps. While this approximation is common in the modelling literature, it is uncontrolled. Systematic effects of this approximation do not appear to be large, however, since the average model damage and repair rates (lines in Figure 2) are close to the measured rates (points in Figure 2) – and the individual distributions (Figure 2 supplement 1) are close too. Furthermore we reran our mouse analysis using rates evaluated at endpoints rather than the start of time intervals (methods equation 3 and 4 with λ(tj+1) rather than λ(tj)). We see no strong effects (compare with Figure 4).
My final high-level comment regards selection bias. As these studies progress and individuals die, the study population drastically changes. How do the authors' models account for this selection bias? Can the results of the modelling tell us anything about the nature of the populations as selection occurs? And finally, would there be value in performing this modeling as a function of "biological age" (ie. time to death) versus chronological age?
For the mouse studies there was no replacement after mortality. Our analysis was with a joint model that modelled both health and mortality, so any survivor effect is not a bias but rather is the result of aging. To mitigate selection bias in the human data, we treated onset ages distinctly. There are nevertheless well known “healthy volunteer” effects that would bias the original population that we did not consider. Furthermore, we did not have human mortality data so we could not treat survivor bias effects. Studying the selection effects themselves (due to mortality or due to selection bias) would be interesting, but beyond the scope of this study. Modelling with respect to time to death (rather than chronological age) would also be interesting. Such a study would be limited to individuals who die, and so would exclude all of the human data we used – where we had no mortality information.
Specific comments:
The authors should specify which packages and versions were used for the development of their models. When I ran some of the scripts from Github they threw pandas warnings, leading me to suspect that I am not using the development versions. This is important if the authors expect future use of their models. I would encourage the authors to include package information both in the manuscript and on the Github.
The Github includes package information. This has also been added to the manuscript (p.14 lines541-544). A version update in pandas resulted in warnings. We have fixed these for the latest version of pandas.
I would recommend the authors be specific when describing their results. Many complex models and statistical tests are generated and it sometimes is unclear what is being referred to. For example, in the section "The acceleration of damage accumulation…" the statement "In mice, this is seen in every dataset and is significant at the indicated ages" is unclear what statistical test is being referred to. I would also recommend the authors be more precise in their result descriptions. For example, in the section "Interventions modify damage…" the statement "This curvature is strongly reduced…" is subjective and would be better supported by a description of specific statistical results.
We have described in the text and figure captions the statistical tests done. The first example ("seen in every dataset") is addressed p.4 lines 148-151. The second example ("strongly reduced") is quantitatively addressed in Figure 3 —figure supplement 1. We use strongly here to summarize that the credible intervals of exercise effect (in panel e) exclude zero effect.
The authors should be careful with the language used regarding wealth in the human dataset. Wealth is not an intervention. And wealth per se does not influence frailty as worded in methods. In these data wealth is a variable which is often associated with health outcomes including frailty.
We agree; we have cleared up language regarding wealth to emphasize it is not an intervention.
In discussion: "…where female live longer but have higher FI scores than males." Female humans live longer but female B6 mice usually live shorter.
We have fixed this; in our results we also observe these sex effects in the FI.
The citation the authors use for ELSA points to the database. PMID 23143611 also needs to be cited where the study and data collection procedures are described. It also seems like an omission to not cite PMID 31665163 which explores the relationship between various factors, including wealth, and frailty in the same dataset. I would also recommend the authors include a supplemental table with the specific mouse measurements and human questionnaire items that comprise the respective frailty indices.
We have added the two references, and have listed all of the mouse and human items in the methods.
The authors should provide more detail on the human frailty variables in the methods. I am assuming the binary responses to questions HEADLA and HEADLB were used, but this should be clarified/explicitly stated in the methods.
All variables are now listed in the methods. All of our human variables are binary (with values 0 or 1).
Comments on mouse pre-processing methods: In the mouse frailty index, 29 out of 31 parameters are measured on a [0, 0.5, 1] scale, with only weight and temperature using [0, 0.25, 0.5, 0.75, 1] scale, and it looks to me like the Schultz dataset still used [0, 0.5, 1] for these variables. So I don't find the statement "a single repair or damage transition can be interpreted as taking a step of size 0.25 on the fractional deficit scale" to be accurate because 29 out of 31 parameters only have the resolution to move in 0.5 unit increments.
We have clarified this section (p.8-9 lines 367-379). To keep the FI values comparable between studies, we have used the same [0, 0.25, 0.5, 0.75, 1] scale for all health attributes, with a step of size 0.25. This means that for attributes with values [0, 0.5, 1], the only observed steps will be of size 2. For human data, all attributes are binarized and on the same [0, 1] scale with steps of size 1.
Page 5: Authors state that "in each of the datasets, there is a strong decrease in repair rates and increase in damage rates with age (except in mouse dataset 2 for damage rates)." This doesn't seem to be the case for repair rates in humans entering ELSA study at age 50-70 (CI for p cross 0).
The points are binned averages, which are useful as guides to the eye. The lines are from our model. The error bars on the points (and the variability of the lines) indicates uncertainties. As humans enter the ELSA study at ages 50 and 70 there are large uncertainties.
I am also somewhat unclear about the values for the rates in figure 2. In figure S2-1 the authors show that the model fits the data reasonably well. However, when looking at the rates in figure 2, the repair rates appear to be consistently higher than the damage rates. For example, in the Keller dataset males at 22 months, as frailty is consistently increasing, the modeled repair rate appears to be nearly double the modelled damage rate. Overall, the comparative magnitudes of the rates don't seem consistent with the mouse data, where there it is relatively rare to see a "population level" reduction in frailty at a subsequent timepoint (tends to stay the same or increase over time as shown in Figure 2S-2). Same comment for figure 4S2 human data.
Our repair/damage rates are calculated per the number of variables available to damage/repair. As such, repair rates can be above damage rates (per variable) even when the total damage rate (summed over all variables) exceeds the total repair rate. This can occur when most variables are undamaged. We have emphasized this at the end of the first section of the results.
Figure S2-1. It is hard to see the bars of the histogram in comparison to the dots. I would recommend making the dots smaller. Quantifying and plotting the deviation of observed vs posterior samples may also make interpretation easier.
Done.
The repair rate in the human 70-80 data (2d) appears to have a negative curvature, but this doesn't seem to be reflected in the second derivative plot (3e, teal line centered on zero).
The second derivatives of the human age 70-80 repair rate are small and are consistent with zero. The lines in Figure 2d should be compared with the lines in Figure 3e.
I find the models in figure 5 somewhat difficult to interpret considering the extensive censoring that is occurring. Could the authors attempt a more 'plain language' interpretation of these curves considering the censoring? It seems like the interpretation would be that at 3 months after damage there is only a 75% probability of that damage being repaired? Also, could the authors interpret the drastic reduction in probability occurring in both damage and repair of the Schultz data at ~3 months?
(See also detailed point 22 from reviewer #3) While we have tried to correct for interval censoring, the available methods only makes the survival curves self-consistent – it does not impose a model for short measurement intervals that are not observed. The sharp drop of the survival curves reflect the absence of measurements at shorter measurement intervals. As such, the survival curves after the sharp drops are more reliable – but also comparisons in Figure 5 between intervention and control, between male and female, or in Figure 5 -- supplemental Figures1 and 2 between different health attributes.
We have labeled the axes with the plain language interpretation. Yes, for mice most damaged deficits (approx. 75%) are repaired by 3 months.
Reviewer #3 (Recommendations for the authors):
I find the result on sex differences in effects of interventions on robustness and resilience very interesting so consider mentioning this in the abstract.
Done.
I would like to see more discussion on why deficits can be repaired, especially for some deficits which may seem to represent irreversible processes of deterioration. One example that is not trivial for me to understand (as I am not a specialist in mice) is the repair of the deficit "loss of whiskers." Do they just grow up again after a while? If they do, what constitutes the event of repair (just when they start growing again or when they are fully grown, assuming there is a measure for this)?
See new Supplementary file 1 on repairability of mouse deficits. All deficits are evaluated with respect to standard criteria. For example, for whiskers the score is 0 if all whiskers are present, 0.5 if a reduced number are present, and 1 if no whiskers are present.
Some statistics on the total follow-up times in human data would be helpful. It can help put the estimates of repair/damage scales in the context to provide insight on how many times the deficit can potentially be damaged/repaired in the time period of data collection.
We have added follow-up information in the methods (p.9 in Human data and pre-processing).
As you indicated in Methods, "The individuals selected from ELSA with wealth data do not have mortality data available, simplifying the model from the joint model used above for mice." In Results, you wrote, "This is a joint longitudinal-survival model, which couples the damage and repair rates together with mortality." This narrative should be changed then to reflect the fact that the joint model was only used in mice data.
Fixed.
Some readers may not be familiar with Bayesian methods so it would help provide in Methods some discussion on the selection of priors – is this a convenience/traditional choice or some other reasons? Also, some narrative on the sensitivity of results to a different choice of priors would be helpful, in my view.
We have added discussion in the methods (p.11 lines 458-468) describing the choice of prior. Our choice of weak (or “noninformative”) priors are a matter of computational convenience. We have chosen standard weak priors. Changing weak priors does affect the quantitative results, but weakly, and does not change the qualitative results. Any model or analysis pipeline has many such assumptions and hyperparameters; changing any of them will have similar effects. One can think of choice of weak priors as a choice of hyperparameter.
https://doi.org/10.7554/eLife.77632.sa2Article and author information
Author details
Funding
Natural Sciences and Engineering Research Council of Canada (RGPIN-2019-05888)
- Andrew D Rutenberg
Canadian Institutes of Health Research (PJT 155961)
- Susan E Howlett
Heart and Stroke Foundation of Canada (G-22-0031992)
- Susan E Howlett
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Senior Editor
- Carlos Isales, Augusta University, United States
Reviewing Editor
- Jamie Justice, Wake Forest School of Medicine, United States
Reviewer
- Konstantin Arbeev, Duke University, United States
Version history
- Received: February 5, 2022
- Preprint posted: March 3, 2022 (view preprint)
- Accepted: November 20, 2022
- Accepted Manuscript published: November 21, 2022 (version 1)
- Version of Record published: December 6, 2022 (version 2)
Copyright
© 2022, Farrell et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 517
- Page views
-
- 103
- Downloads
-
- 1
- Citations
Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
- Neuroscience
Previous research has highlighted the role of glutamate and gamma-aminobutyric acid (GABA) in perceptual, cognitive, and motor tasks. However, the exact involvement of these neurochemical mechanisms in the chain of information processing, and across human development, is unclear. In a cross-sectional longitudinal design, we used a computational approach to dissociate cognitive, decision, and visuomotor processing in 293 individuals spanning early childhood to adulthood. We found that glutamate and GABA within the intraparietal sulcus (IPS) explained unique variance in visuomotor processing, with higher glutamate predicting poorer visuomotor processing in younger participants but better visuomotor processing in mature participants, while GABA showed the opposite pattern. These findings, which were neurochemically, neuroanatomically and functionally specific, were replicated ~21 mo later and were generalized in two further different behavioral tasks. Using resting functional MRI, we revealed that the relationship between IPS neurochemicals and visuomotor processing is mediated by functional connectivity in the visuomotor network. We then extended our findings to high-level cognitive behavior by predicting fluid intelligence performance. We present evidence that fluid intelligence performance is explained by IPS GABA and glutamate and is mediated by visuomotor processing. However, this evidence was obtained using an uncorrected alpha and needs to be replicated in future studies. These results provide an integrative biological and psychological mechanistic explanation that links cognitive processes and neurotransmitters across human development and establishes their potential involvement in intelligent behavior.
-
- Computational and Systems Biology
- Neuroscience
Cerebellar climbing fibers convey diverse signals, but how they are organized in the compartmental structure of the cerebellar cortex during learning remains largely unclear. We analyzed a large amount of coordinate-localized two-photon imaging data from cerebellar Crus II in mice undergoing ‘Go/No-go’ reinforcement learning. Tensor component analysis revealed that a majority of climbing fiber inputs to Purkinje cells were reduced to only four functional components, corresponding to accurate timing control of motor initiation related to a Go cue, cognitive error-based learning, reward processing, and inhibition of erroneous behaviors after a No-go cue. Changes in neural activities during learning of the first two components were correlated with corresponding changes in timing control and error learning across animals, indirectly suggesting causal relationships. Spatial distribution of these components coincided well with boundaries of Aldolase-C/zebrin II expression in Purkinje cells, whereas several components are mixed in single neurons. Synchronization within individual components was bidirectionally regulated according to specific task contexts and learning stages. These findings suggest that, in close collaborations with other brain regions including the inferior olive nucleus, the cerebellum, based on anatomical compartments, reduces dimensions of the learning space by dynamically organizing multiple functional components, a feature that may inspire new-generation AI designs.