Meta-Research: Gender variations in citation distributions in medicine are very small and due to self-citation and journal prestige

  1. Jens Peter Andersen  Is a corresponding author
  2. Jesper Wiborg Schneider
  3. Reshma Jagsi
  4. Mathias Wullum Nielsen
  1. Aarhus University, Denmark
  2. University of Michigan, United States
  • Download
  • Cite
  • CommentOpen annotations (there are currently 0 annotations on this page).
7 figures, 2 tables and 2 additional files

Figures

Density distributions of the log-transformed, per-paper NCS for the matched set of male and female first authors (Sample 1), female and male last authors (Sample 2), and female first and last authors vs. other author combinations (Sample 3).

Dashed lines indicate the mean NCS for each sample. The y-axis indicates the proportion of papers found in that area of the NCS, equivalent to a smoothed histogram. The x-axis gives the per-paper …

https://doi.org/10.7554/eLife.45374.003
Standardized, exponentiated coefficients for the predictors included in the Tweedie regressions.

Error bars represent 95% confidence intervals (see Figure 2—source data 1 for estimate specifications and dispersion parameters). All regressions are based on matched samples. Sample 1 compares …

https://doi.org/10.7554/eLife.45374.004
Figure 2—source data 1

Tweedie regression results.

https://doi.org/10.7554/eLife.45374.005
Figure 2—source data 2

Regression results for Tweedie regressions on the full, unmatched sample, using NCS as outcome.

https://doi.org/10.7554/eLife.45374.006
Figure 2—source data 3

Regression results for the three negative binomial regressions with times cited (CS) as outcome.

https://doi.org/10.7554/eLife.45374.007
Figure 2—source data 4

Tweedie regression of standardized parameters, using MNCS Journal quantiles rather than measurements.

https://doi.org/10.7554/eLife.45374.008
Plot of estimated marginal means for the case and control groups in Samples 1, 2 and 3.

The error bars display 95% confidence intervals. The figure visualizes the predicted, average, differences in per-paper citation scores for the case and control groups after adjusting for …

https://doi.org/10.7554/eLife.45374.009
Odds ratios for the standardized predictors included in the logistic regressions.

Error bars represent 95% confidence intervals (see Figure 4—source data 1 for information on estimates and dispersion parameters). All regressions are based on matched samples. Sample 1 compares …

https://doi.org/10.7554/eLife.45374.010
Figure 4—source data 1

Logistic regression results.

https://doi.org/10.7554/eLife.45374.011
The upper panel shows the distribution of self-citations by five-percentile bins of NCS for each sample.

The average proportions of self-citations are given on the y-axis, the five-percentile bins of NCS on the x-axis. The lower panel displays the distribution of the upper bounds of NCS across the …

https://doi.org/10.7554/eLife.45374.013
The upper panel shows the proportions of papers with female first authors in Sample 1, female last authors in Sample 2, combinations of female first and last authors in Sample 3, by five-percentile bins of MNCS.

The proportions of case papers are given on the y-axis, and the five-percentile bins of MNCS journal on the y-axis. The lower-left panel displays the upper bounds of MNCS journal by five-percentile …

https://doi.org/10.7554/eLife.45374.014
Figure 7 with 3 supplements
Flowchart of data collection, inclusion and exclusion.
https://doi.org/10.7554/eLife.45374.015
Figure 7—source data 1

Excluded countries due to unreliable gender assignments from first name.

https://doi.org/10.7554/eLife.45374.019
Figure 7—source data 2

List of specialty and main specialty designation, and number of papers per specialty for the full sample.

https://doi.org/10.7554/eLife.45374.020
Figure 7—source data 3

Groupings of countries by geographical region.

https://doi.org/10.7554/eLife.45374.021
Figure 7—figure supplement 1
Percentage of papers per journal included in the analysis.

The excluded papers are a combination of missing document types in Web of Science and missing name information. Journals publishing document types which are included in PubMed Medline but not Web of …

https://doi.org/10.7554/eLife.45374.016
Figure 7—figure supplement 2
Reliability of gender assignment per country, shown as the rank of countries.

Gender determination: The online tool Gender-API was used to estimate the gender of all first-name and country pairings. This pairing is important as the gender connotations for some first names …

https://doi.org/10.7554/eLife.45374.017
Figure 7—figure supplement 3
Proportion of papers with gender assignment for all authors.

Reported as function of all sampled papers (p_pubmed) and proportion of all papers matched to Web of Science (p_wos).

https://doi.org/10.7554/eLife.45374.018

Tables

Table 1
Women’s share of authorships overall, across five main specialties, institutional prestige, and geocultural area.

f_w is the weighted proportion of women per paper, f_first the proportion of female first authorships, f_last the proportion of female last authorships, f_both the proportion of papers where women …

https://doi.org/10.7554/eLife.45374.002
Overallf_wf_firstf_lastf_both
0.350.400.260.15
Main specialtyf_wf_firstf_lastf_both
Basic science0.390.460.300.18
Hospital based0.370.430.280.16
Medical0.330.380.240.13
Pediatric0.460.520.370.24
Surgical/procedural0.290.320.210.11
Institutional prestigef_wf_firstf_lastf_both
Top-100 University0.360.420.270.16
Other university0.350.390.250.14
Geographic locationf_wf_firstf_lastf_both
Arab countries0.330.340.270.16
Commonwealth of Independent States0.400.450.300.17
East Asia0.190.190.090.04
Latin America0.460.520.390.25
North America0.360.400.270.15
Oceania0.400.480.310.20
South and Central Europe0.400.440.310.18
Sub-Saharan Africa0.360.390.310.20
South-West Asia0.290.310.240.10
Western Europe0.350.420.240.14
Table 2
Means, standard deviations, medians, Cohen’s d, and Weitzman’s for case-control comparisons of self-citations and MNCS journal in Samples 1, 2 and 3.

Cohen’s d and Weitzman’s are calculated with two and one decimal respectively. Weitzman’s is not calculated for self-citations, as it is a discrete count variables. For sample 1, female first …

https://doi.org/10.7554/eLife.45374.012
X¯ case (σ)X¯ control (σ)X~ caseX~ controld
Sample 1Self-citations1.91 (3.18)2.16 (3.93)11-0.07
MNCS journal1.16 (.90)1.21 (1.04).991.00-0.0596.4%
Sample 2Self-citations1.84 (3.22)2.08 (3.77)11-0.07
MNCS journal1.14 (.98)1.20 (.99).981.00-0.0695.6%
Sample 3Self-citations1.74 (2.84)2.13 (3.91)11-0.11
MNCS journal1.12 (.97)1.20 (1.02).971.0-0.0893.4%

Additional files

Download links