Epidemiological characteristics and prevalence rates of research reproducibility across disciplines: a scoping review of articles published in 2018-2019
Abstract
Introduction: Reproducibility is a central tenant of research. We aimed to synthesize the literature on reproducibility and describe its epidemiological characteristics, including how reproducibility is defined and assessed. We also aimed to determine and compare estimates for reproducibility across different fields.
Methods: We conducted a scoping review to identify English language replication studies published between 2018-2019 in economics, education, psychology, health sciences and biomedicine. We searched Medline, Embase, PsycINFO, Cumulative Index of Nursing and Allied Health Literature – CINAHL, Education Source via EBSCOHost, ERIC, EconPapers, International Bibliography of the Social Sciences (IBSS), and EconLit. Documents retrieved were screened in duplicate against our inclusion criteria. We extracted year of publication, number of authors, country of affiliation of the corresponding author, and whether the study was funded. For the individual replication studies, we recorded whether a registered protocol for the replication study was used, whether there was contact between the reproducing team and the original authors, what study design was used, and what the primary outcome was. Finally, we recorded how reproducibilty was defined by the authors, and whether the assessed study(ies) successfully reproduced based on this definition. Extraction was done by a single reviewer and quality controlled by a second reviewer.
Results: Our search identified 11,224 unique documents, of which 47 were included in this review. Most studies were related to either psychology (48.6%) or health sciences (23.7%). Among these 47 documents, 36 described a single reproducibility study while the remaining 11 reported at least two reproducibility studies in the same paper. Less than the half of the studies referred to a registered protocol. There was variability in the definitions of reproduciblity success. In total, across the 47 documents 177 studies were reported. Based on the definition used by the author of each study, 95 of 177 (53.7%) studies reproduced.
Conclusion: This study gives an overview of research across five disciplines that explicitly set out to reproduce previous research. Such reproducibility studies are extremely scarce, the definition of a successfully reproduced study is ambiguous, and the reproducibility rate is overall modest.
Funding: No external funding was received for this work.
Data availability
Data and materials are available on the Open Science Framework (https://osf.io/wn7gm/).
-
Scoping review data on reproducibility of research in a 2018-2019 multi-discipline sampleOpen Science FrameworkDOI 10.17605/OSF.IO/WN7GM.
Article and author information
Author details
Funding
The authors declare that there was no funding for this work.
Reviewing Editor
- David B Allison, Indiana University, United States
Version history
- Preprint posted: March 9, 2022 (view preprint)
- Received: March 10, 2022
- Accepted: June 20, 2023
- Accepted Manuscript published: June 21, 2023 (version 1)
- Version of Record published: July 5, 2023 (version 2)
Copyright
© 2023, Cobey et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 978
- views
-
- 89
- downloads
-
- 4
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Medicine
Caesarean section scar diverticulum (CSD) is a significant cause of infertility among women who have previously had a Caesarean section, primarily due to persistent inflammatory exudation associated with this condition. Even though abnormal bacterial composition is identified as a critical factor leading to this chronic inflammation, clinical data suggest that a long-term cure is often unattainable with antibiotic treatment alone. In our study, we employed metagenomic analysis and mass spectrometry techniques to investigate the fungal composition in CSD and its interaction with bacteria. We discovered that local fungal abnormalities in CSD can disrupt the stability of the bacterial population and the entire microbial community by altering bacterial abundance via specific metabolites. For instance, Lachnellula suecica reduces the abundance of several Lactobacillus spp., such as Lactobacillus jensenii, by diminishing the production of metabolites like Goyaglycoside A and Janthitrem E. Concurrently, Clavispora lusitaniae and Ophiocordyceps australis can synergistically impact the abundance of Lactobacillus spp. by modulating metabolite abundance. Our findings underscore that abnormal fungal composition and activity are key drivers of local bacterial dysbiosis in CSD.
-
- Medicine
- Neuroscience
Gait is impaired in musculoskeletal conditions, such as knee arthropathy. Gait analysis is used in clinical practice to inform diagnosis and to monitor disease progression or intervention response. However, clinical gait analysis relies on subjective visual observation of walking, as objective gait analysis has not been possible within clinical settings due to the expensive equipment, large-scale facilities, and highly trained staff required. Relatively low-cost wearable digital insoles may offer a solution to these challenges. In this work, we demonstrate how a digital insole measuring osteoarthritis-specific gait signatures yields similar results to the clinical gait-lab standard. To achieve this, we constructed a machine learning model, trained on force plate data collected in participants with knee arthropathy and controls. This model was highly predictive of force plate data from a validation set (area under the receiver operating characteristics curve [auROC] = 0.86; area under the precision-recall curve [auPR] = 0.90) and of a separate, independent digital insole dataset containing control and knee osteoarthritis subjects (auROC = 0.83; auPR = 0.86). After showing that digital insole derived gait characteristics are comparable to traditional gait measurements, we next showed that a single stride of raw sensor time series data could be accurately assigned to each subject, highlighting that individuals using digital insoles can be identified by their gait characteristics. This work provides a framework for a promising alternative to traditional clinical gait analysis methods, adds to the growing body of knowledge regarding wearable technology analytical pipelines, and supports clinical development of at-home gait assessments, with the potential to improve the ease, frequency, and depth of patient monitoring.