Addressing cultural and knowledge barriers to enable preclinical sex inclusive research

  1. Brianna N Gaskill
  2. Benjamin Phillips
  3. Jonathan Ho
  4. Holly Rafferty
  5. Oladele Olajide Onada
  6. Andrew Rooney
  7. Amrita Ahluwalia
  8. Natasha A Karp  Is a corresponding author
  1. Novartis Biomedical Research, United States
  2. Data Sciences and Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, United Kingdom
  3. Faculty of Medicine & Dentistry, Queen Mary University of London, United Kingdom
  4. University of Glasgow, United Kingdom
  5. Obafemi Awolowo University and Obafemi Awolowo University Teaching Hospitals Complex, Nigeria
  6. Institute of Pharmacy and Biomedical Science, University of Strathclyde, United Kingdom
5 figures, 13 tables and 13 additional files

Figures

Exploration of the perceived barriers and benefits of inclusion of males and females in in vivo research from study 1.

Evaluation of survey data collected in study 1 with 39 participants in the baseline group, 51 in the interested group, and 15 in the intervention group. (A) Percentage of study participants who selected each barrier, displayed by treatment group. (B) Percentage of study participants who selected each benefit, displayed by treatment group. (C) Percentage of study participants who selected each barrier in the general population (baseline and interested group combined). (D) Percentage of study participants who selected each benefit in the general population (baseline and interested group combined). A Pearson’s chi-squared test was used to compare the proportion of participants selecting each barrier/benefit between the treatment groups. Statistical significance is highlighted with a horizontal bar and if the p-value is less than 0.05, it is flagged with one star (*). If the p-value is less than 0.01, it is flagged with 2 stars (**). If a p-value is less than 0.001 it is flagged with 3 stars (***).

Exploration of the perceived barriers and benefits on the topic of inclusion of females and males in in vivo research for study 2.

Evaluation of data collected from study 2 with 29 participants completing the barrier question pre-intervention and 28 in the post-intervention. (A) Percentage of study participants who selected each barrier, pre- and post-intervention, for the barriers targeted by the intervention. (B) Percentage of study participants who selected all other barriers, displayed pre-and post-intervention. (C) Percentage of study participants that selected each benefit, displayed pre- and post-intervention. (D) Comparison of the percentage of study participants selecting benefits in study 1 general population (baseline plus interested group) versus study 2 pre-intervention group. (E) Comparison of the percentage of study participants selecting barriers in study 1 general population (baseline plus interested groups) versus study 2 pre-intervention group. A McNemar’s test of association was used to compare the proportions between the pre- and post-intervention data in study 2. A chi-squared test was used to compare proportions between the two studies. Statistical significance is highlighted with a horizontal bar, and if the p-value is less than 0.05, it is flagged with one star (*). If the p-value is less than 0.01, it is flagged with 2 stars (**). If a p-value is less than 0.001, it is flagged with 3 stars (***).

Exploration of intent for significant and critical predictor variables for study 1 data.

A full model, with all potential predictors and demographics, was fitted to explore the variation in intent and assess for evidence of predictive behaviour (N=39 Baseline group, 51 Interested group and 15 intervention group). The baseline group was set as the reference group. If main effects were significant, the variation by treatment group was explored with Tukey post hoc testing. (A) Relationship between intention and attitude. (B) Relationship between intention and behavioural control. (C) Relationship between intention and social norm. For panels A, B, and C, the grey area indicates the 95% confidence interval for the fitted linear relationship (blue) and the text indicates the statistical significance of the relationship. (D) Model estimated means (Least Square Means) for each treatment group with a standard error bar estimated from the model. Vertical bars represent the planned comparison between groups, with * representing statistical significance <0.05. (E) Violin plot showing the distribution of intent as a function of the ability to influence the design. Points indicate individual study participants, and the red box indicates the calculated mean for each group. The text indicates the statistical assessment for that variable.

Exploration of intent for significant and critical predictor variables for study 2 data.

A full model, with all potential predictors and demographics, was fitted to explore the variation in intent and assess for evidence of predictive behaviour. (A) Relationship between intention and attitude. (B) Relationship between intention and behavioural control. (C) Relationship between intention and social norm. For panels A, B, and C, the grey area indicates the 95% confidence interval for the fitted linear relationship (blue) and the text indicates the statistical significance of the relationship. (D) Exploration of intention between pre- and post-intervention where a line links each individual participants score and shaded area indicates the density of the Box-Cox transformed measure of intent. (E) Relationship between intention and age. For graphs A, B, C, and E, the grey area indicates the 95% confidence interval for the fitted linear relationship (blue) for Box-Cox transformed intent (y-axis) against key predictors in the TPB model (x-axis) (A, B, C) and significant predictor ‘age’ (E).

Exploration of intervention impact on the proportion of correctly answered knowledge questions.

(A) Study 1: Cumulative knowledge score (cumulative questions answered correctly) displayed by treatment group (Baseline: N=39, interest: N=51 and intervention: N=15). Statistical significance assessed with a Poisson regression. (B) Study 1: Percentage of correct answers for each question, displayed by treatment group (Baseline: N=39, interest: N=51, and intervention: N=15). Statistical significance assessed with a Pearson’s chi-squared test. (C) Study 2: Impact of intervention on the cumulative knowledge score. Where a line links each individual participants score, and shaded area indicates the density of the Box-Cox transformed measure of intent. Statistical significance assessed with a paired t-test (N=26 with pre and post data available). (D) Study 2: Percentage correct answers for each question, displayed by pre- and post-intervention group. (Pre: N=29, Post: N=28). Statistical significance assessed with McNemar’s test of association. Statistical significance shown as * for p-value<0.05, ** for p<0.01 and *** for p<0.001.

Tables

Table 1
Workshop Intervention construct.
SectionContent
LectureTerminology
Reflection that clinical sex matters.
Exploration of the status of sex inclusion in preclinical research.
Exploration of what is sex-inclusive research.
The role of factorial analysis in sex-inclusive research.
Interactive elementMultiple choice questions with voting and discussion on appropriate analysis strategy and understanding factorial output.
LectureExploration of the perceived barriers to sex-inclusive research.
Evaluation toolIntroduction of the Sex Inclusive Research Framework (SIRF) which can be used to evaluate research proposals from a sex inclusive perspective.
Interactive elementMultiple choice classification using the SIRF to classify justifications given for one-sex designs.
Table 2
Statistical model output for the full model for study 1 data exploring the predictors ability to explain variation in average intent.
SourceNparmDfSum of SquaresF RatioProb >FSignificanceEta2
Attitude1118.565034.9548<.0001***0.15
Beh_Control110.08140.15340.69640.00
Soc_Norm1112.222923.0137<.0001***0.10
Treatment_Group223.86433.63800.0311*0.03
Age110.09950.18740.66640.00
Gender222.61962.46620.09180.02
Geography330.19550.12270.94640.00
Year_Work110.18280.34420.55920.00
Type_Work331.19580.75050.52550.01
Education222.93392.76210.06960.02
Stats_Training220.38100.35870.69980.00
Factorial_Fam110.03320.06250.80330.00
Factorial_Incor110.26140.49210.48520.00
Ability_Influence112.32534.37820.0398*0.02
  1. Where Beh_control represents the average behavioural control score, Soc_norm the average social normal score, Year_Work represented the number of years the participants have worked in animal research, Type_Work represents the type of research conducted by the participant, Education the highest level of education obtained, Stats_Training represents the level of statistical training received, Factorial_Fam represents how familiar the participants were with factorial experimental design, Factorial_Incor represents how often the participants incorporated males and females into their experiments while studying an intervention, attitude represents the average attitude score, and Ability_Influence represents how often the participants were involved or could influence the planning of experiments involving animals. Nparm stands for the number of parameters, Df represents the degrees of freedom, and Prob >F represents the p-value associated with the F ratio. Statistical significance shown as * for p-value<0.05, ** for p<0.01 and *** for p<0.001.

Table 3
Statistical model output for the full model for study 2 data exploring the predictors ability to explain variation in average intent.
SourceNparmDfDf DenominatorF RatioProb >FSignificanceEta2
Attitude1138.780.02680.87086.91e-04
Beh_Control1142.471.90800.17440.04
Soc_Norm1129.1519.6580.0001***0.40
Intervention1140.060.07580.78451.89e-03
Age1128.25.34280.0283*0.16
Gender2233.930.29730.74470.02
Ability_Influence1127.912.26180.14380.07
Factorial_Fam1142.980.38220.53978.81e-03
Factorial_Incor1135.720.07250.78932.03e-03
  1. Where Beh_control represent the average behavioural control score, Soc_norm the average social normal score, Year_Work represented the number of years the participants have worked in animal research, Type_Work represents the type of research conducted by the participant, Education the highest level of education obtained, Stats_Training represents the level of statistical training received, Factorial_Fam represents how familiar the participants were with factorial experimental design, Factorial_Incor represents how often the participants incorporated males and females into their experiments while studying an intervention, attitude represents the average attitude score, and Ability_Influence represents how often the participants were involved or could influence the planning of experiments involving animals. Nparm stands for the number of parameters, Df represents the degrees of freedom, and Prob > F represent the p-value associated with the F ratio. Statistical significance is flagged with one star (*) if the p-value is less than 0.05, with 2 stars (**) if less than 0.01, and with 3 stars (***) if less than 0.001.

Appendix 1—table 1
Study 1.
Ability_InfluenceFactorial_FamFactorial_IncorAttitudeBeh_ControlSoc_Norm
Ability_Influence1.00000.34990.2581–0.01380.22180.1514
Factorial_Fam0.34991.00000.33170.21720.23710.2341
Factorial_Incor0.25810.33171.00000.25390.44960.4384
Attitude–0.01380.21720.25391.00000.20810.1555
Beh_Control0.22180.23710.44960.20811.00000.3691
Soc_Norm0.15140.23410.43840.15550.36911.0000
Appendix 1—table 2
Study 2.
Ability_InfluenceFactorial_FamFactorial_IncorAttitudeBeh_ControlSoc_NormAge
Ability_Influence1.00000.13610.18620.13050.38180.29261.0000
Factorial_Fam0.13611.0000–0.23190.07140.30410.26710.1361
Factorial_Incor0.1862–0.23191.0000–0.06890.11390.04070.1862
Attitude0.13050.0714–0.06891.00000.26640.07100.1305
Beh_Control0.38180.30410.11390.26641.00000.16240.3818
Soc_Norm0.29260.26710.04070.07100.16241.00000.2926
Age0.2047–0.0511–0.0616–0.36040.10780.16590.2047
Appendix 2—table 1
Test of association of the perceived barriers between treatment groups for study 1.

This survey question provided several pre-defined options and ability to enter a free-text option. Participants were asked to choose all that applied. Exploration of the free-text has grouped the barriers into three additional categories: convention, logistic, and none or no barriers. To test for a statistically significant difference (association) between the treatment groups, a Pearson’s Chi-square test was applied for all options where the total N>10.

Barrier offeredTotal (count) possible N=105Baseline (count) possible N=39Interested (count) possible N=51Intervention (count) possible N=15Test of association test statistic & p-value
Cost4414246Χ2 = 1.157; p = 0.561
Male animals are more likely to fight and may lead to premature euthanasia348215Χ2 = 4.317; p = 0.115
Female animals are more variable296185Χ2 = 4.668; p = 0.097
Complexity of experimental design287183Χ2 = 3.789; p = 0.149
Model behavior may be different in the other sex2710143Χ2 = 0.337; p = 0.844
Sample size concerns247134Χ2 = 0.857; p = 0.651
Not relevant to the research question19991Χ2 = 1.98; p = 0.371
Availability of sample/test material12273Χ2 = 2.884; p = 0.236
Welfare issues5221NA
Data analysis concerns0000NA
Other - Convention7232NA
Other – Logistics1100NA
Other - None3102NA
Appendix 3—table 1
Test of association of the perceived benefits between treatment groups for study 1.

This question provided several pre-defined options and the ability to enter a free-texted option. Participants were asked to choose all that applied. No free-text advantages were provided by survey takers for this question. To test for a statistically significant difference (association) between the treatment groups, a Pearson’s chi-square test of association was applied for all options where the total N>10.

Benefits offeredTotal possible N=105Baseline (count) possible N=39Interested (count) possible N=51Intervention (count) possible N=15Test of association test statistic & p-value
Understanding sex differences91334414Χ2=0.726; p=0.695
Translatability79293515Χ2=6.149; p=0.046
Efficient use of all animals from breeding42121812Χ2=11.855; p=0.0026
Reproducibility4217169Χ2=4.291; p=0.117
3Rs – Reduction3311148Χ2=3.902; p=0.142
Animal welfare221075Χ2=3.48; p=0.176
Other0000NA
Appendix 4—table 1
Test of association of the perceived barriers between pre- and post-intervention for study 2.

This question provided several pre-defined options and the ability to enter a free-texted option. Participants were asked to choose all that applied. No free-text advantages were provided by survey takers for this question. To test for a statistically significant difference (association) between the treatment groups, a McNemar’s test was applied for all options where the total N>10.

Barrier offeredPre-Intervention (count)Possible N=29Post-Intervention (count)Possible N=28McNemar’s testTest statistic & p value
Cost158Χ2=3.57; P=0.060
Complexity of experimental design145Χ2=5.33; P=0.021
Sample size concerns134Χ2=4.45; P=0.035
Model behaviour may be different in the other sex122Χ2=5.44; P=0.020
Availability of sample/test material109Χ2=0.0; P=1.0
Female animals are more variable92Χ2=5.0; P=0.025
Experiment would take longer45NA
Male animals are more likely to fight and may lead to premature euthanasia32NA
Convention24NA
Data analysis concerns24NA
Logistics34NA
Welfare issues21NA
Not relevant to the research question04NA
Free-text: Sponsor request01NA
Free-text: Model induction issue01NA
Appendix 5—table 1
Test of association of the perceived benefits between pre- and post-intervention for study 2.

This survey question provided several pre-defined options and ability to enter a free-texted option. Participants were asked to choose all that applied. Participants were given the option of a free-text response, but none were submitted. To test for a statistically significant difference (association) between the pre- and post-measures, a McNemar’s test was applied for all options where the total N>10. This test accounts for the repeat nature of the data which required the list-wise deletion of those individuals with missing data.

Advantages offeredPre-intervention (count) possible N=29Post-intervention (count) possible N=28McNemar’s test- statistic & p-value
Understanding sex differences2625Χ2=0.0; p=1.0
Translatability2122Χ2=1.0; p=0.32
Efficient use of all animals from breeding1317Χ2=1.6; p=0.21
3Rs - Reduction813Χ2=2.78; p=0.096
Reproducibility613Χ2=6.0; p=0.014
Animal welfare35NA
Appendix 6—table 1
Test of association for study 1 for each knowledge question.
Question(Answer)Total (Baseline & Interest)N=90Baseline(% correct)N=39Interest(% correct)N=51Intervention (% correct)N=15Test of association[Baseline v Interest]Test statistic & p-valueTest of association[across the 3 treatment groups]test statistic & p-value
Do you think inclusion of both sexes requires doubling a study’s sample size? (No)20
(30.3%)
13
(33%)
7
(14%)
10
(67%)
Χ2=5.973; p=0.014Χ2=16.85; p=0.00021
Do you think sex influences data variability, therefore, when you include both sexes, more animals are needed? (No)9
(10%)
5
(13%)
4
(8%)
6
(60%)
Χ2 = 0.608; p=0.4355Χ2=7.994; p=0.01837
When analyzing in vivo data collected from both sexes, do you think sex should be included in the statistical model? (Yes)68
(75.6%)
34
(87%)
34
(67%)
12
(80%)
Χ2=5.035; p=0.0248Χ2 = 5.266; p=0.072
When analyzing in vivo data, do you think data from the two sexes should be pooled (combined) for an intervention into a single group for the analysis? (No)17
(18.9%)
6
(15%)
11
(21%)
6
(40%)
Χ2 = 0.552; p=0.457Χ2 = 3.844; p=0.1463
When analyzing in vivo data collected from both sexes, do you think the analysis should be run independently for each sex through separate statistical tests? (No)18
(20%)
7
(18%)
11
(21%)
8
(53%)
Χ2 = 0.181; p=0.671Χ2=7.823; p=0.020
Appendix 7—table 1
Test of association for study 2 for each knowledge question.
Question(Answer)Pre(% correct)N=29Post(% correct)N=28McNemar’s test of associationtest statistic & p-value
Do you think inclusion of both sexes requires doubling a study’s sample size? (No)6
(21%)
17
(61%)
Χ2=7.14; p=0.007
Do you think sex influences data variability, therefore, when you include both sexes, more animals are needed? (No)2
(7%)
13
(46%)
Χ2=9.31; p=0.0023
When analyzing in vivo data collected from both sexes, do you think sex should be included in the statistical model? (Yes)21
(72%)
21
(75%)
Χ2 = 0.143; p=0.71
When analyzing in vivo data collected from both sexes, do you think the analysis should be disaggregated (for example, run a t-test comparing control and intervention on the male data and then running a separate test on the data from females)? (No)8
(28%)
22
(79%)
Χ2=8.0; p=0.0047
When analyzing in vivo data, do you think data from the two sexes should be pooled (combined) for the analysis (for example run a single t-test comparing control and intervention group)? (No)5
(17%)
18
(64%)
Χ2=10.29; p=0.0013
Appendix 8—table 1
Study 1 free-text response.
StatementClassifiedGroup
Biomed staff generally keep the male rats for experimental use. Generally, I think because it is the way it has been done.Other - ConventionBaseline
ConventionOther -ConventionBaseline
This is how my PI is doing it for many yearsOther -ConventionBaseline
My supervisorsOther – ConventionBaseline
All historical data generated in make animals in a very narrow fieldOther -ConventionBaseline
NoneOther – NoneBaseline
Logistics (duration of experiments)Other - LogisticsBaseline
Larger male is preferred for implant of deviceOther - LogisticsBaseline
After the workshop, I no longer have concerns about including both sexes in future experiments.Other - NoneIntervention
Expectation of boss!Other - ConventionIntervention
Regulatory standard designs that are traditionally usedOther - ConventionIntervention
Cost of female animalsOther - LogisticIntervention
NoneOther – NoneIntervention
Appendix 8—table 2
Study 2 free-text response.
StatementClassifiedGroup
Availability of transgenics that are bredOther - LogisticPre
Experimental model phenotypeOther - LogisticPre
We have to stagger groups in order to test all animalsOther - LogisticPost
Disease exists in female (humans), however, experimental model in female mice protectedOther - LogisticPost
Need a better understanding of stats - already look at genotype, time, and treatment intersections, not sure how to add sex as another variableOther - LogisticPost
Sponsor requestOther - ConventionPost

Additional files

Supplementary file 1

Summary of demographic and potential predictors between groups for all survey 1 contributors who met the inclusion criteria.

The demographic information included is for the full 105 participants who met the inclusion criteria. While all 105 met the inclusion criteria, seven participants left the question about age blank. As age was included as a predictor in the analysis of intent, the missing values were managed with listwise deletion, assuming missing at random, reducing the dataset size to 98 participants (N=35 baseline, N=48 interested, and N=15 intervention). To test for a statistically significant difference (association) between the treatment groups, a Pearson’s Chi-square test was used for categorical variables, ordered logistic regression for nominal variables, and ANOVA test for continuous variables. Institute type was collected as a demographic, the resulted population sampled was predominately academic and this variable was therefore removed from downstream analysis due to the lack of predictive ability to assess institute type on the outcome of interest. The abbreviation name in bracket, within the demographic column, indicates the term used within the statistical model and associated output.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp1-v1.xlsx
Supplementary file 2

Summary of demographic and potential predictors for all participants who met the inclusion criteria for Study 2.

The demographic information is for the full 31 participants who met the inclusion criteria in study 2. While 31 unique individuals met the criteria, an individual may not have responded to both surveys (N=29 pre-survey and N=28 in the post-survey). For instance, two participants did not meet the inclusion criteria for the pre-survey but met the criteria for the post-intervention survey. For the intention analysis, some missing data was observed in the demographic data. To reduce survey burden, the post-survey only included one demographic question (the participant’s age) to support alignment of data, just in case duplicate initials were used as an identifier. Two responders did not include the age information in the study and were managed with listwise deletion, assuming missing at random. The abbreviation name in bracket, within the demographic column, indicates the term used within the statistical model and associated output. For McNemar’s paired analysis, we conducted listwise deletion, assuming missing at random, reducing the dataset size to 26.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp2-v1.xlsx
Supplementary file 3

Cumulative knowledge score for study 1.

For each participant of study 1, the cumulative knowledge score is reported along with metadata of treatment group and reported gender. Participant identity is masked.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp3-v1.xlsx
Supplementary file 4

Pre- and post-cumulative knowledge scores for study 2.

For each participant in study 2, the pre- and post-cumulative knowledge score is reported. Participant identity is masked.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp4-v1.xlsx
Supplementary file 5

Study 1 survey.

The survey that was used in study 1 for all participants.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp5-v1.pdf
Supplementary file 6

Pre-workshop survey.

The survey that was used in study 2 for all participants prior to the intervention.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp6-v1.pdf
Supplementary file 7

Postworkshop survey.

The survey that was used in study 2 for all participants post the intervention.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp7-v1.pdf
Supplementary file 8

Workshop training material.

The training material used in study 2.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp8-v1.pdf
Supplementary file 9

Cumulative knowledge score for study 1.

For each participant of study 1, the cumulative knowledge score is reported alone with meta data of treatment group and reported gender. Participant identity is masked.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp9-v1.xlsx
Supplementary file 10

Pre- and post-cumulative knowledge scores for study 2.

For each participant of study 2, the pre and post cumulative knowledge score is reported. Participant identity is masked.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp10-v1.xlsx
Supplementary file 11

Intention analysis for both study 1 and 2.

The data and SAS code which can reproduce the analysis of intention for study 1 and study 2.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp11-v1.pdf
Supplementary file 12

Analysis of cumulative knowledge score for study 1.

The Rmarkdown output from the analysis of the cumulative knowledge score for study 1.

https://cdn.elifesciences.org/articles/106545/elife-106545-supp12-v1.pdf
MDAR checklist
https://cdn.elifesciences.org/articles/106545/elife-106545-mdarchecklist1-v1.docx

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Brianna N Gaskill
  2. Benjamin Phillips
  3. Jonathan Ho
  4. Holly Rafferty
  5. Oladele Olajide Onada
  6. Andrew Rooney
  7. Amrita Ahluwalia
  8. Natasha A Karp
(2025)
Addressing cultural and knowledge barriers to enable preclinical sex inclusive research
eLife 14:RP106545.
https://doi.org/10.7554/eLife.106545.3