1. Computational and Systems Biology

Groundbreaking text mining project highlights ‘gender gap’ in scientific research

An analysis of 15,000 mouse studies showed that around half of them failed to report the sex and age of the animals involved, highlighting the issue of reproducibility in scientific research.
Press Pack
  • Views 34
  • Annotations

A project at The University of Manchester to analyse 15,000 mouse studies – the largest of its kind ever undertaken – has revealed that about half of these studies failed to report the sex and age of the mice involved, despite these being recognised as key variables that can affect the outcome of scientific studies. The project utilised text mining software developed at the University, which can analyse large collections of documents to unearth information which would otherwise have been virtually impossible to discover. The software relies on a number of rules, which automatically scan the method section of papers to identify mentions of gender and age.

The results of the project, published last week in eLife, highlight the issue of reproducibility of scientific research – around £20 billion is spent every year on research which is not reproducible, and over 80% of potential therapeutics fail in humans after being tested in mice. Previously published studies have suggested that research done on female animals may not be applicable for men, and in many of the studies analysed in this project, the animals used were overwhelmingly female. This may be due to female mice being less aggressive, which makes them easier to use in the studies. This is important, because the sexes can have markedly different responses to the same investigations – for example, in infection research. This may significantly reduce the reliability of studies, and lead to drugs that won’t work for half of the population.

The reproducibility of studies often focuses on the interpretation of statistics, but this project has highlighted that the methods used may not be reported rigorously enough to assess whether they were done correctly. By looking at the methods used, it is possible to infer whether or not the statistics produced are sound, and reproducible in the future. Without knowing these methods, this cannot be inferred at all, which hampers cross-disciplinary research and longevity of data.

The project has produced a vital tool to measure the reproducibility of scientific studies, but there is a long way to go – failure to consider gender in research is still very much the norm, and according to one analysis of scientific studies published in 2009, only 45% of animal studies involving depression or anxiety and only 38 percent involving strokes used females, even though these conditions are more common in women.

“The opportunity to use text mining to cover such a broad portfolio of research was brilliant, and vital to see the bigger picture,” said Sheena Cruickshank, Senior Lecturer in Immunology at The University of Manchester. “We are an interdisciplinary team, and it was this which enabled us to spot this issue and then explore it. The paper builds on several pieces of work we have done together, and highlights the importance of the scientific community to come together and define what is important in the current reproducibility crisis.”

“This study has demonstrated how state-of-the-art computer science technology is instrumental for a large-scale and systematic analysis of literature,” said Dr Goran Nenadic from The University of Manchester’s School of Computer Science. “It avoids small sample bias, and allows us to explore the research landscape on a large scale to identify key issues in reporting details of scientific methodologies, which are necessary for reproducibility, transparency and fidelity of research.”

##

Media contacts

  1. Emily Packer
    eLife
    e.packer@elifesciences.org
    +441223855373

  2. Joe Paxton
    University of Manchester
    joe.paxton@manchester.ac.uk

About

About The University of Manchester

The University of Manchester, a member of the prestigious Russell Group of British universities, is the largest and most popular university in the UK. It has 20 academic schools and hundreds of specialist research groups undertaking pioneering multi-disciplinary teaching and research of worldwide significance. 



The University of Manchester is one of the country’s major research institutions, rated fifth in the UK in terms of ‘research power’ (REF 2014), and has had no fewer than 25 Nobel laureates either work or study there. The University had an annual income of £886 million in 2013/14.

About eLife

eLife is a unique collaboration between the funders and practitioners of research to improve the way important research is selected, presented and shared. eLife publishes outstanding works across the life sciences and biomedicine — from basic biological research to applied, translational, and clinical studies. All papers are selected by active scientists in the research community. Decisions and responses are agreed by the reviewers and consolidated by the Reviewing Editor into a single, clear set of instructions for authors, removing the need for laborious cycles of revision and allowing authors to publish their findings quickly. eLife is supported by the Howard Hughes Medical Institute, the Max Planck Society and the Wellcome Trust. Learn more at elifesciences.org.