Demographic history mediates the effect of stratification on polygenic scores

  1. Arslan A Zaidi  Is a corresponding author
  2. Iain Mathieson  Is a corresponding author
  1. University of Pennsylvania, United States

Abstract

Population stratification continues to bias the results of genome-wide association studies (GWAS). When these results are used to construct polygenic scores, even subtle biases can cumulatively lead to large errors. To study the effect of residual stratification, we simulated GWAS under realistic models of demographic history. We show that when population structure is recent, it cannot be corrected using principal components of common variants because they are uninformative about recent history. Consequently, polygenic scores are biased in that they recapitulate environmental structure. Principal components calculated from rare variants or identity-by-descent segments can correct this stratification for some types of environmental effects. While family-based studies are immune to stratification, the hybrid approach of ascertaining variants in GWAS but re-estimating effect sizes in siblings reduces but does not eliminate stratification. We show that the effect of population stratification depends not only on allele frequencies and environmental structure but also on demographic history.

Data availability

The data used in this study were generated through simulations. The code for these simulations is freely available at https://github.com/Arslan-Zaidi/popstructure and can be used to reproduce all simulations and carry out all analyses in the manuscript.

Article and author information

Author details

  1. Arslan A Zaidi

    Genetics, University of Pennsylvania, Philadelphia, United States
    For correspondence
    aazaidi@pennmedicine.upenn.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-2155-8367
  2. Iain Mathieson

    Department of Genetics, University of Pennsylvania, Philadelphia, United States
    For correspondence
    mathi@pennmedicine.upenn.edu
    Competing interests
    The authors declare that no competing interests exist.

Funding

National Institute of General Medical Sciences (R35GM133708)

  • Iain Mathieson

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. George H Perry, Pennsylvania State University, United States

Publication history

  1. Received: July 29, 2020
  2. Accepted: November 16, 2020
  3. Accepted Manuscript published: November 17, 2020 (version 1)
  4. Version of Record published: December 23, 2020 (version 2)

Copyright

© 2020, Zaidi & Mathieson

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 3,679
    Page views
  • 363
    Downloads
  • 32
    Citations

Article citation count generated by polling the highest count across the following sources: Crossref, Scopus, PubMed Central.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Arslan A Zaidi
  2. Iain Mathieson
(2020)
Demographic history mediates the effect of stratification on polygenic scores
eLife 9:e61548.
https://doi.org/10.7554/eLife.61548

Further reading

    1. Epidemiology and Global Health
    2. Immunology and Inflammation
    Zaki A Sherif, Christian R Gomez ... RECOVER Mechanistic Pathway Task Force
    Review Article

    COVID-19, with persistent and new onset of symptoms such as fatigue, post-exertional malaise, and cognitive dysfunction that last for months and impact everyday functioning, is referred to as Long COVID under the general category of post-acute sequelae of SARS-CoV-2 infection (PASC). PASC is highly heterogenous and may be associated with multisystem tissue damage/dysfunction including acute encephalitis, cardiopulmonary syndromes, fibrosis, hepatobiliary damages, gastrointestinal dysregulation, myocardial infarction, neuromuscular syndromes, neuropsychiatric disorders, pulmonary damage, renal failure, stroke, and vascular endothelial dysregulation. A better understanding of the pathophysiologic mechanisms underlying PASC is essential to guide prevention and treatment. This review addresses potential mechanisms and hypotheses that connect SARS-CoV-2 infection to long-term health consequences. Comparisons between PASC and other virus-initiated chronic syndromes such as myalgic encephalomyelitis/chronic fatigue syndrome and postural orthostatic tachycardia syndrome will be addressed. Aligning symptoms with other chronic syndromes and identifying potentially regulated common underlining pathways may be necessary for understanding the true nature of PASC. The discussed contributors to PASC symptoms include sequelae from acute SARS-CoV-2 injury to one or more organs, persistent reservoirs of the replicating virus or its remnants in several tissues, re-activation of latent pathogens such as Epstein–Barr and herpes viruses in COVID-19 immune-dysregulated tissue environment, SARS-CoV-2 interactions with host microbiome/virome communities, clotting/coagulation dysregulation, dysfunctional brainstem/vagus nerve signaling, dysautonomia or autonomic dysfunction, ongoing activity of primed immune cells, and autoimmunity due to molecular mimicry between pathogen and host proteins. The individualized nature of PASC symptoms suggests that different therapeutic approaches may be required to best manage specific patients.

    1. Epidemiology and Global Health
    Mette Hartmann Nonboe, George Napolitano ... Elsebeth Lynge
    Research Article

    Background:

    Denmark was one of the few countries where it was politically decided to continue cancer screening during the COVID-19 pandemic. We assessed the actual population uptake of mammography and cervical screening during this period.

    Methods:

    The first COVID-19 lockdown in Denmark was announced on 11 March 2020. To investigate possible changes in cancer screening activity due to the COVID-19 pandemic, we analysed data from the beginning of 2017 until the end of 2021. A time series analysis was carried out to discover possible trends and outliers in the screening activities in the period 2017–2021. Data on mammography screening and cervical screening were retrieved from governmental pandemic-specific monitoring of health care activities.

    Results:

    A brief drop was seen in screening activity right after the first COVID-19 lockdown, but the activity quickly returned to its previous level. A short-term deficit of 43% [CI –49 to –37] was found for mammography screening. A short-term deficit of 62% [CI –65 to –58] was found for cervical screening. Furthermore, a slight, statistically significant downward trend in cervical screening from 2018 to 2021 was probably unrelated to the pandemic. Other changes, for example, a marked drop in mammography screening towards the end of 2021, also seem unrelated to the pandemic.

    Conclusions:

    Denmark continued cancer screening during the pandemic, but following the first lockdown a temporary drop was seen in breast and cervical screening activity.

    Funding:

    Region Zealand (R22-A597).