Demographic history mediates the effect of stratification on polygenic scores

  1. Arslan A Zaidi  Is a corresponding author
  2. Iain Mathieson  Is a corresponding author
  1. University of Pennsylvania, United States

Abstract

Population stratification continues to bias the results of genome-wide association studies (GWAS). When these results are used to construct polygenic scores, even subtle biases can cumulatively lead to large errors. To study the effect of residual stratification, we simulated GWAS under realistic models of demographic history. We show that when population structure is recent, it cannot be corrected using principal components of common variants because they are uninformative about recent history. Consequently, polygenic scores are biased in that they recapitulate environmental structure. Principal components calculated from rare variants or identity-by-descent segments can correct this stratification for some types of environmental effects. While family-based studies are immune to stratification, the hybrid approach of ascertaining variants in GWAS but re-estimating effect sizes in siblings reduces but does not eliminate stratification. We show that the effect of population stratification depends not only on allele frequencies and environmental structure but also on demographic history.

Data availability

The data used in this study were generated through simulations. The code for these simulations is freely available at https://github.com/Arslan-Zaidi/popstructure and can be used to reproduce all simulations and carry out all analyses in the manuscript.

Article and author information

Author details

  1. Arslan A Zaidi

    Genetics, University of Pennsylvania, Philadelphia, United States
    For correspondence
    aazaidi@pennmedicine.upenn.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-2155-8367
  2. Iain Mathieson

    Department of Genetics, University of Pennsylvania, Philadelphia, United States
    For correspondence
    mathi@pennmedicine.upenn.edu
    Competing interests
    The authors declare that no competing interests exist.

Funding

National Institute of General Medical Sciences (R35GM133708)

  • Iain Mathieson

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2020, Zaidi & Mathieson

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 5,474
    views
  • 476
    downloads
  • 94
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Citations by DOI

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Arslan A Zaidi
  2. Iain Mathieson
(2020)
Demographic history mediates the effect of stratification on polygenic scores
eLife 9:e61548.
https://doi.org/10.7554/eLife.61548

Share this article

https://doi.org/10.7554/eLife.61548