Risk factors relate to the variability of health outcomes as well as the mean: a GAMLSS tutorial

  1. David Bann  Is a corresponding author
  2. Liam Wright
  3. Tim J Cole
  1. University College London, United Kingdom

Abstract

Background: Risk factors or interventions may affect the variability as well as the mean of health outcomes. Understanding this can aid aetiological understanding and public health translation, in that interventions which shift the outcome mean and reduce variability are typically preferable to those which affect only the mean. However, most commonly used statistical tools do not test for differences in variability. Tools that do have few epidemiological applications to date, and fewer applications still have attempted to explain their resulting findings. We thus provide a tutorial for investigating this using GAMLSS (Generalised Additive Models for Location, Scale and Shape).

Methods: The 1970 British birth cohort study was used, with body mass index (BMI; N=6,007) and mental wellbeing (Warwick-Edinburgh Mental Wellbeing Scale; N=7,104) measured in midlife (42-46 years) as outcomes. We used GAMLSS to investigate how multiple risk factors (sex, childhood social class and midlife physical inactivity) related to differences in health outcome mean and variability.

Results: Risk factors were related to sizable differences in outcome variability-for example males had marginally higher mean BMI yet 28% lower variability; lower social class and physical inactivity were each associated with higher mean and higher variability (6.1% and 13.5% higher variability, respectively). For mental wellbeing, gender was not associated with the mean while males had lower variability (-3.9%); lower social class and physical inactivity were each associated with lower mean yet higher variability (7.2% and 10.9% higher variability, respectively).

Conclusions: The results highlight how GAMLSS can be used to investigate how risk factors or interventions may influence the variability in health outcomes. This underutilised approach to the analysis of continuously distributed outcomes may have broader utility in epidemiologic, medical, and psychological sciences. A tutorial and replication syntax is provided online to facilitate this (https://osf.io/5tvz6/).

Funding: DB is supported by the Economic and Social Research Council (grant number ES/M001660/1), The Academy of Medical Sciences / Wellcome Trust ('Springboard Health of the Public in 2040' award: HOP001/1025); DB and LW are supported by the Medical Research Council (MR/V002147/1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Data availability

All data are available to download from the UK Data Archive: https://beta.ukdataservice.ac.uk/datacatalogue/series/series?id=200001

Article and author information

Author details

  1. David Bann

    Centre for Longitudinal Studies, Social Research Institute, University College London, london, United Kingdom
    For correspondence
    david.bann@ucl.ac.uk
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-6454-626X
  2. Liam Wright

    Centre for Longitudinal Studies, Social Research Institute, University College London, london, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  3. Tim J Cole

    Great Ormond Street Institute of Child Health, University College London, london, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.

Funding

Medical Research Council (MR/V002147/1)

  • David Bann
  • Liam Wright

Economic and Social Research Council (ES/M001660/1)

  • Liam Wright

Wellcome Trust (HOP001/1025)

  • David Bann

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Ethics

Human subjects: This paper uses secondary data analysis using data from a cohort study which has been followed-up since birth in 1970. Cohort members provided informed consent, and the study received full ethical approval - most recently from the NRES Committee South East Coast-Brighton and Sussex.

Copyright

© 2022, Bann et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 1,542
    views
  • 208
    downloads
  • 7
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. David Bann
  2. Liam Wright
  3. Tim J Cole
(2022)
Risk factors relate to the variability of health outcomes as well as the mean: a GAMLSS tutorial
eLife 11:e72357.
https://doi.org/10.7554/eLife.72357

Share this article

https://doi.org/10.7554/eLife.72357

Further reading

    1. Epidemiology and Global Health
    Xiaoning Wang, Jinxiang Zhao ... Dong Liu
    Research Article

    Artificially sweetened beverages containing noncaloric monosaccharides were suggested as healthier alternatives to sugar-sweetened beverages. Nevertheless, the potential detrimental effects of these noncaloric monosaccharides on blood vessel function remain inadequately understood. We have established a zebrafish model that exhibits significant excessive angiogenesis induced by high glucose, resembling the hyperangiogenic characteristics observed in proliferative diabetic retinopathy (PDR). Utilizing this model, we observed that glucose and noncaloric monosaccharides could induce excessive formation of blood vessels, especially intersegmental vessels (ISVs). The excessively branched vessels were observed to be formed by ectopic activation of quiescent endothelial cells (ECs) into tip cells. Single-cell transcriptomic sequencing analysis of the ECs in the embryos exposed to high glucose revealed an augmented ratio of capillary ECs, proliferating ECs, and a series of upregulated proangiogenic genes. Further analysis and experiments validated that reduced foxo1a mediated the excessive angiogenesis induced by monosaccharides via upregulating the expression of marcksl1a. This study has provided new evidence showing the negative effects of noncaloric monosaccharides on the vascular system and the underlying mechanisms.

    1. Epidemiology and Global Health
    2. Microbiology and Infectious Disease
    Amanda C Perofsky, John Huddleston ... Cécile Viboud
    Research Article

    Influenza viruses continually evolve new antigenic variants, through mutations in epitopes of their major surface proteins, hemagglutinin (HA) and neuraminidase (NA). Antigenic drift potentiates the reinfection of previously infected individuals, but the contribution of this process to variability in annual epidemics is not well understood. Here, we link influenza A(H3N2) virus evolution to regional epidemic dynamics in the United States during 1997—2019. We integrate phenotypic measures of HA antigenic drift and sequence-based measures of HA and NA fitness to infer antigenic and genetic distances between viruses circulating in successive seasons. We estimate the magnitude, severity, timing, transmission rate, age-specific patterns, and subtype dominance of each regional outbreak and find that genetic distance based on broad sets of epitope sites is the strongest evolutionary predictor of A(H3N2) virus epidemiology. Increased HA and NA epitope distance between seasons correlates with larger, more intense epidemics, higher transmission, greater A(H3N2) subtype dominance, and a greater proportion of cases in adults relative to children, consistent with increased population susceptibility. Based on random forest models, A(H1N1) incidence impacts A(H3N2) epidemics to a greater extent than viral evolution, suggesting that subtype interference is a major driver of influenza A virus infection ynamics, presumably via heterosubtypic cross-immunity.