Research: Bias in the reporting of sex and age in biomedical research on mouse models

  1. Oscar Flórez-Vargas  Is a corresponding author
  2. Andy Brass  Is a corresponding author
  3. George Karystianis
  4. Michael Bramhall
  5. Robert Stevens
  6. Sheena Cruickshank
  7. Goran Nenadic
  1. The University of Manchester, United Kingdom
  2. University of Manchester, United Kingdom
  • Download
  • Cite
  • CommentOpen annotations (there are currently 0 annotations on this page).
5 figures, 2 tables and 3 additional files

Figures

Figure 1 with 1 supplement
General distribution and historical change of reporting and non-reporting of sex and/or age in mouse-model experiments.

Pie-chart (a) showing an overview of the reporting and non-reporting (none) of sex only, age, or both sex and age in a set of 15,311 studies published between 1994 and 2014 by stating the number and …

https://doi.org/10.7554/eLife.13615.003
Figure 1—source data 1

PubMed search terms used for each disease group and their approaches.

https://doi.org/10.7554/eLife.13615.004
Figure 1—source data 2

Example rules for identification of sex and age.

https://doi.org/10.7554/eLife.13615.005
Figure 1—figure supplement 1
Reporting of sex or age in mouse-model experiments by journal.

The figure shows the top 70 journals from a total of 628 journals in which were published 30 or more articles of the corpus; corresponding to 81.05% of papers assessed. The journals are organised in …

https://doi.org/10.7554/eLife.13615.006
Distribution of reporting of the sex and age in mouse model of a group of diseases.

The reporting of these variables was assessed for six groups of diseases from the top 10 causes of death according to the W.H.O. This analysis was performed in the set of 14,225 articles published …

https://doi.org/10.7554/eLife.13615.007
Distribution of reporting of the sex in mouse model of a group of diseases by research approach.

The reporting of sex was assessed for each disease by the topic of research whether genetics (a), immunology (b), physiopathology (c), or therapy (d). This analysis was performed in the set of …

https://doi.org/10.7554/eLife.13615.008
Distribution of reporting of the sex in mouse model of diseases.

The graph shows the reporting in particular diseases. All these diseases that are among the most frequently reported causes of death world-wide or commonly used models. The distribution is presented …

https://doi.org/10.7554/eLife.13615.009
Scatter plots showing the relationship between the reporting and the bibliometric indices.

Journal impact factor in which the papers were published (a) and h-index of journals (b). Spearman’s rank correlation coefficient r square is shown alongside the regression lines. The scatter plots …

https://doi.org/10.7554/eLife.13615.010

Tables

Table 1

Evaluation of the performance of the text mining system.

https://doi.org/10.7554/eLife.13615.002
CharacteristicsTrue-
positives
True-
negatives
False-
positives
False-
negatives
Precision (%)Recall (%)F-score (%)
Sex29163290.693.592.0
Age31141496.888.592.4
  1. A total of 50 articles were used as the data set to evaluate the performance of the text mining system (Supplementary file 2D). The precision (P), calculated as TP/(TP+FP), determines the accuracy of the system in recognizing desirable terms. The recall (R), calculated as TP/(TP+FN), produces the coverage of the system. F-score is the harmonic mean of precision and recall and it is calculated as 2*P*R/(P+R).

Table 2

Summary of the data sets used in this study.

https://doi.org/10.7554/eLife.13615.011
Sets of articlesNumber of articlesTaskFile
Data 115,311Corpus for assessing reporting of the sex and age of the miceSupplementary file 1*
Data 240Creating the text-mining rulesSupplementary file 2A
Data 340Manual inspection for finding the location of the mention of the sex and age of the miceSupplementary file 2B
Data 470Enhancing the performance of the text-mining rulesSupplementary file 2C
Data 550Evaluating the text-mining systemSupplementary file 2D
  1. *Supplementary file 1 also contains data sets of the six groups of diseases analyzed (cardiovascular diseases; cancer; diabetes mellitus; lung diseases; infectious diseases; and neurological disorders), as well as of the different approaches to assess the disease models (i.e. genetics, immunology, physiopathology and therapy), and the disease example for each of the six disease groups.

Additional files

Supplementary file 1

Corpus for assessing reporting of the sex and age of the mice.

https://doi.org/10.7554/eLife.13615.012
Supplementary file 2

(A) Set of articles for creating the text-mining rules.

(B) Set of articles for finding the location of the mention of the sex and age of the mice. (C) Set of articles for enhancing the performance of the text-mining rules. (D) Set of articles for evaluating the text-mining system.

https://doi.org/10.7554/eLife.13615.013
Supplementary file 3

Rules used to identify the sex and age of experimental mouse models.

https://doi.org/10.7554/eLife.13615.014

Download links