Evaluating the applicability of replication success metrics in animal-to-human translation: A simulation study

Carolyne Jie Huang; Samuel Pawel; Kimberley Elaine Wever; Benjamin Victor Ineichen; Rachel Heyard

doi:10.7554/eLife.109853.1

Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, and public reviews.

Reviewing Editor
George Okoli
University of Hong Kong, Hong Kong, Hong Kong
Senior Editor
Eduardo Franco
McGill University, Montreal, Canada

Reviewer #1 (Public review):

A well-designed and preregistered simulation study investigating whether replication-success metrics can be applied to assess animal-to-human translation. The study is comprehensive, uses realistic parameter settings, and provides valuable insights into how different metrics behave under varied conditions.

Strengths:

(1) Methodologically rigorous and transparently preregistered.

(2) Comprehensive simulation design covering a wide range of plausible scenarios.

(3) Clear description of metrics and decision rules.

(4) Valuable contribution to understanding the limitations of applying replication metrics to translation questions.

Weaknesses:

(1) The conceptual distinction between replication and translation could be more clearly emphasized.

(2) Interpretation of results is dense and can be challenging to follow without a clear and summarized.

(3) Some simulation parameters (effect sizes, heterogeneity, and number of animal studies) require more substantial justification.

(4) Practical recommendations could be more explicit to guide applied researchers.

https://doi.org/10.7554/eLife.109853.1.sa2

Reviewer #2 (Public review):

Summary:

The authors attempt to address the issue of high rates of translation failure from animal studies to humans in the literature, where promising results in animal studies fail when conducting human clinical trials. Using parameters from a previous meta-analysis on prenatal amino acid supplementation and the effects it has on maternal blood pressure, the authors assessed the performance of the metrics used and whether they can quantify translation success. Performing a simulation study, the authors compared nine translation success metrics and found that no one method was uniformly optimal. The authors list several limitations of the study, such as comparability of effect sizes between animal and human studies, different goals of animal studies versus human studies, and the focus of the study on one aspect (statistics of translation) is part of a broader, more complex decision-making process before proceeding to human trials. The authors recommend using multiple metrics in combination while taking into consideration their strengths and weaknesses to assess the translation of animal studies to human outcomes. The paper achieves the aim of providing a model with several metrics to evaluate translation success from animal studies to humans.

Strengths:

(1) Utilizing 9 different translation success metrics in combination provides strong flexibility in evaluating whether results in animal studies can translate to humans. This would allow researchers to evaluate translation success using multiple different metrics according to the context of the study.

(2) The authors accommodated for the limited sample size in animal studies, which are typically underpowered, and also caution that special attention should be given to heterogeneity when interpreting translation results.

(3) Overall, this approach has the potential to be applied to other biomedical studies, provided the limitations for each of the metrics are considered. It would provide a useful tool in assessing translation from animals to humans, in addition to other factors such as safety, pharmacokinetics, etc.

Weaknesses:

While the study has several strengths, there are some limitations.

(1) Preclinical animal study sizes tend to be much smaller than human studies, which results in underpowered results. The authors adjusted for this by pooling animal study data. However, high heterogeneity in the animal studies can affect translation results.

(2) The study focuses only on evaluating the statistical component of translation, which is only one aspect of the decision-making process to move on to human trials. The study does not take into account safety and toxicological profiles, pharmacokinetics, or genetics, which are important considerations that influence the overall effect in humans.

https://doi.org/10.7554/eLife.109853.1.sa1

Reviewer #3 (Public review):

Summary:

This paper focused on how to navigate the complex decision-making process of whether to go into human trials. This is a critical topic considering the well-documented challenges in replicating and translating findings. While these are two distinct topics (i.e., replication and translation), they are related, and the authors simulated many conditions to assess the utility of replication assessment metrics.

Strengths:

A major strength of the study is the detailed approach to identifying relevant conditions and metrics, and to providing rich results that outline the strengths and weaknesses of each metric. Any simulation study is challenged by trying to identify the most relevant variables of interest, and this study provided sound justification for its chosen variables of interest. While this study does not make a strong recommendation (which I see as a strength), it does provide a comprehensive overview of the various metrics and conditions that were investigated.

Weaknesses:

The weaknesses of the study are the limited focus on specific metrics, the assumptions, particularly in the limited number of human study variables, and the less-than-ideal approachable summary of findings for a non-technical audience.

Conclusion:

This paper provides a much-needed investigation and discussion of how decisions are made when assessing whether to go into human trials. This is an important topic that productively challenges the status quo, considering documented challenges in replication and translation in biomedical research.

https://doi.org/10.7554/eLife.109853.1.sa0

Evaluating the applicability of replication success metrics in animal-to-human translation: A simulation study

Peer review process

Editors

Be the first to read new articles from eLife