Background selection and biased gene conversion affect more than 95% of the human genome and bias demographic inferences
Abstract
Disentangling the effect on genomic diversity of natural selection from that of demography is notoriously difficult, but necessary to properly reconstruct the history of species. Here, we use high-quality human genomic data to show that purifying selection at linked sites (i.e. background selection, BGS) and GC-biased gene conversion (gBGC) together affect as much as 95% of the variants of our genome. We find that the magnitude and relative importance of BGS and gBGC are largely determined by variation in recombination rate and base composition. Importantly, synonymous sites and non-transcribed regions are also affected, albeit to different degrees. Their use for demographic inference can lead to strong biases. However, by conditioning on genomic regions with recombination rates above 1.5 cM/Mb and mutation types (C↔G, A↔T), we identify a set of SNPs that is mostly unaffected by BGS or gBGC, and that avoids these biases in the reconstruction of human history.
Data availability
All data generated and script to analyse them is provided on the dryad repesitory: http://datadryad.org/review?doi=doi:10.5061/dryad.t76fk80
-
Data from: Background selection and biased gene conversion affect more than 95% of the human genome and bias demographic inferencesAvailable at Dryad Digital Repository under a CC0 Public Domain Dedication.
Article and author information
Author details
Funding
Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (310030B-166605)
- Laurent Excoffier
University of Berkeley (Visiting Miller Professorship)
- Laurent Excoffier
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Copyright
© 2018, Pouyet et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 8,583
- views
-
- 1,034
- downloads
-
- 126
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Cell Biology
- Genetics and Genomics
Single-nucleus RNA sequencing (snRNA-seq), an alternative to single-cell RNA sequencing (scRNA-seq), encounters technical challenges in obtaining high-quality nuclei and RNA, persistently hindering its applications. Here, we present a robust technique for isolating nuclei across various tissue types, remarkably enhancing snRNA-seq data quality. Employing this approach, we comprehensively characterize the depot-dependent cellular dynamics of various cell types underlying mouse adipose tissue remodeling during obesity. By integrating bulk nuclear RNA-seq from adipocyte nuclei of different sizes, we identify distinct adipocyte subpopulations categorized by size and functionality. These subpopulations follow two divergent trajectories, adaptive and pathological, with their prevalence varying by depot. Specifically, we identify a key molecular feature of dysfunctional hypertrophic adipocytes, a global shutdown in gene expression, along with elevated stress and inflammatory responses. Furthermore, our differential gene expression analysis reveals distinct contributions of adipocyte subpopulations to the overall pathophysiology of adipose tissue. Our study establishes a robust snRNA-seq method, providing novel insights into the biological processes involved in adipose tissue remodeling during obesity, with broader applicability across diverse biological systems.
-
- Developmental Biology
- Genetics and Genomics
Paternal obesity has been implicated in adult-onset metabolic disease in offspring. However, the molecular mechanisms driving these paternal effects and the developmental processes involved remain poorly understood. One underexplored possibility is the role of paternally induced effects on placenta development and function. To address this, we investigated paternal high-fat diet-induced obesity in relation to sperm histone H3 lysine 4 tri-methylation signatures, the placenta transcriptome, and cellular composition. C57BL6/J male mice were fed either a control or high-fat diet for 10 weeks beginning at 6 weeks of age. Males were timed-mated with control-fed C57BL6/J females to generate pregnancies, followed by collection of sperm, and placentas at embryonic day (E)14.5. Chromatin immunoprecipitation targeting histone H3 lysine 4 tri-methylation (H3K4me3) followed by sequencing (ChIP-seq) was performed on sperm to define obesity-associated changes in enrichment. Paternal obesity corresponded with altered sperm H3K4me3 at promoters of genes involved in metabolism and development. Notably, altered sperm H3K4me3 was also localized at placental enhancers. Bulk RNA-sequencing on placentas revealed paternal obesity-associated sex-specific changes in expression of genes involved in hypoxic processes such as angiogenesis, nutrient transport, and imprinted genes, with a subset of de-regulated genes showing changes in H3K4me3 in sperm at corresponding promoters. Paternal obesity was also linked to impaired placenta development; specifically, a deconvolution analysis revealed altered trophoblast cell lineage specification. These findings implicate paternal obesity effects on placenta development and function as one potential developmental route to offspring metabolic disease.