Background selection and biased gene conversion affect more than 95% of the human genome and bias demographic inferences

Abstract

Disentangling the effect on genomic diversity of natural selection from that of demography is notoriously difficult, but necessary to properly reconstruct the history of species. Here, we use high-quality human genomic data to show that purifying selection at linked sites (i.e. background selection, BGS) and GC-biased gene conversion (gBGC) together affect as much as 95% of the variants of our genome. We find that the magnitude and relative importance of BGS and gBGC are largely determined by variation in recombination rate and base composition. Importantly, synonymous sites and non-transcribed regions are also affected, albeit to different degrees. Their use for demographic inference can lead to strong biases. However, by conditioning on genomic regions with recombination rates above 1.5 cM/Mb and mutation types (C↔G, A↔T), we identify a set of SNPs that is mostly unaffected by BGS or gBGC, and that avoids these biases in the reconstruction of human history.

Data availability

All data generated and script to analyse them is provided on the dryad repesitory: http://datadryad.org/review?doi=doi:10.5061/dryad.t76fk80

The following data sets were generated
The following previously published data sets were used

Article and author information

Author details

  1. Fanny Pouyet

    Institute of Ecology and Evolution, University of Bern, Berne, Switzerland
    For correspondence
    fanny.pouyet@iee.unibe.ch
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-5614-6998
  2. Simon Aeschbacher

    Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
    Competing interests
    The authors declare that no competing interests exist.
  3. Alexandre Thiéry

    Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
    Competing interests
    The authors declare that no competing interests exist.
  4. Laurent Excoffier

    Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
    For correspondence
    laurent.excoffier@iee.unibe.ch
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-7507-6494

Funding

Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (310030B-166605)

  • Laurent Excoffier

University of Berkeley (Visiting Miller Professorship)

  • Laurent Excoffier

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Krishna Veeramah, Stony Brook University, United States

Version history

  1. Received: March 1, 2018
  2. Accepted: August 17, 2018
  3. Accepted Manuscript published: August 20, 2018 (version 1)
  4. Accepted Manuscript updated: August 23, 2018 (version 2)
  5. Version of Record published: October 9, 2018 (version 3)

Copyright

© 2018, Pouyet et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 8,127
    Page views
  • 999
    Downloads
  • 70
    Citations

Article citation count generated by polling the highest count across the following sources: Scopus, Crossref, PubMed Central.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Fanny Pouyet
  2. Simon Aeschbacher
  3. Alexandre Thiéry
  4. Laurent Excoffier
(2018)
Background selection and biased gene conversion affect more than 95% of the human genome and bias demographic inferences
eLife 7:e36317.
https://doi.org/10.7554/eLife.36317

Further reading

    1. Genetics and Genomics
    2. Immunology and Inflammation
    Nadja S Katheder, Kristen C Browder ... Heinrich Jasper
    Research Article

    Disruption of epithelial barriers is a common disease manifestation in chronic degenerative diseases of the airways, lung and intestine. Extensive human genetic studies have identified risk loci in such diseases, including in chronic obstructive pulmonary disease (COPD) and inflammatory bowel diseases (IBD). The genes associated with these loci have not fully been determined, and functional characterization of such genes requires extensive studies in model organisms. Here, we report the results of a screen in Drosophila melanogaster that allowed for rapid identification, validation and prioritization of COPD risk genes that were selected based on risk loci identified in human genome-wide association studies (GWAS) studies. Using intestinal barrier dysfunction in flies as a readout, our results validate the impact of candidate gene perturbations on epithelial barrier function in 56% of the cases, resulting in a prioritized target gene list. We further report the functional characterization in flies of one family of these genes, encoding for nicotinic acetylcholine receptor subunits (nAchR). We find that nAchR signaling in enterocytes of the fly gut promotes epithelial barrier function and epithelial homeostasis by regulating the production of the peritrophic matrix. Our findings identify COPD associated genes critical for epithelial barrier maintenance, and provide insight into the role of epithelial nAchR signaling for homeostasis.

    1. Genetics and Genomics
    2. Microbiology and Infectious Disease
    William Matlock, Samuel Lipworth ... REHAB Consortium
    Research Article Updated

    Plasmids enable the dissemination of antimicrobial resistance (AMR) in common Enterobacterales pathogens, representing a major public health challenge. However, the extent of plasmid sharing and evolution between Enterobacterales causing human infections and other niches remains unclear, including the emergence of resistance plasmids. Dense, unselected sampling is essential to developing our understanding of plasmid epidemiology and designing appropriate interventions to limit the emergence and dissemination of plasmid-associated AMR. We established a geographically and temporally restricted collection of human bloodstream infection (BSI)-associated, livestock-associated (cattle, pig, poultry, and sheep faeces, farm soils) and wastewater treatment work (WwTW)-associated (influent, effluent, waterways upstream/downstream of effluent outlets) Enterobacterales. Isolates were collected between 2008 and 2020 from sites <60 km apart in Oxfordshire, UK. Pangenome analysis of plasmid clusters revealed shared ‘backbones’, with phylogenies suggesting an intertwined ecology where well-conserved plasmid backbones carry diverse accessory functions, including AMR genes. Many plasmid ‘backbones’ were seen across species and niches, raising the possibility that plasmid movement between these followed by rapid accessory gene change could be relatively common. Overall, the signature of identical plasmid sharing is likely to be a highly transient one, implying that plasmid movement might be occurring at greater rates than previously estimated, raising a challenge for future genomic One Health studies.