Background selection and biased gene conversion affect more than 95% of the human genome and bias demographic inferences
Abstract
Disentangling the effect on genomic diversity of natural selection from that of demography is notoriously difficult, but necessary to properly reconstruct the history of species. Here, we use high-quality human genomic data to show that purifying selection at linked sites (i.e. background selection, BGS) and GC-biased gene conversion (gBGC) together affect as much as 95% of the variants of our genome. We find that the magnitude and relative importance of BGS and gBGC are largely determined by variation in recombination rate and base composition. Importantly, synonymous sites and non-transcribed regions are also affected, albeit to different degrees. Their use for demographic inference can lead to strong biases. However, by conditioning on genomic regions with recombination rates above 1.5 cM/Mb and mutation types (C↔G, A↔T), we identify a set of SNPs that is mostly unaffected by BGS or gBGC, and that avoids these biases in the reconstruction of human history.
Data availability
All data generated and script to analyse them is provided on the dryad repository: http://dx.doi.org/10.5061/dryad.t76fk80
-
Data from: Background selection and biased gene conversion affect more than 95% of the human genome and bias demographic inferencesAvailable at Dryad Digital Repository under a CC0 Public Domain Dedication.
Article and author information
Author details
Funding
Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (310030B-166605)
- Laurent Excoffier
University of Berkeley (Visiting Miller Professorship)
- Laurent Excoffier
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Copyright
© 2018, Pouyet et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 8,616
- views
-
- 1,035
- downloads
-
- 127
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Genetics and Genomics
- Microbiology and Infectious Disease
The ability to analyze the function of all genes in a genome is highly desirable, yet challenging in Leishmania due to a repetitive genome, limited DNA repair mechanisms, and lack of RNA interference in most species. While our introduction of a cytosine base editor (CBE) demonstrated potential to overcome these limitations (Engstler and Beneke, 2023), challenges remained, including low transfection efficiency, variable editing rates across species, parasite growth effects, and competition between deleterious and non-deleterious mutations. Here, we present an optimized approach addressing these issues. We identified a T7 RNAP promoter variant ensuring high editing rates across Leishmania species without compromising growth. A revised CBE single-guide RNAs (sgRNAs) scoring system was developed to prioritize STOP codon generation. Additionally, a triple-expression construct was created for stable integration of CBE sgRNA expression cassettes into a Leishmania safe harbor locus using AsCas12a ultra-mediated DNA double-strand breaks, increasing transfection efficiency by ~400-fold to 1 transfectant per 70 transfected cells. Using this improved system for a small-scale proof-of-principle pooled screen, we successfully confirmed the essential and fitness-associated functions of CK1.2, CRK2, CRK3, AUK1/AIRK, TOR1, IFT88, IFT139, IFT140, and RAB5A in Leishmania mexicana, demonstrating a significant improvement over our previous method. Lastly, we show the utility of co-expressing AsCas12a ultra, T7 RNAP, and CBE for hybrid CRISPR gene replacement and base editing within the same cell line. Overall, these improvements will broaden the range of possible gene editing applications in Leishmania species and will enable a variety of loss-of-function screens in the near future.
-
- Genetics and Genomics
- Neuroscience
Spinal muscular atrophy (SMA) is caused by mutations in the Survival Motor Neuron 1 (SMN1) gene. While traditionally viewed as a motor neuron disorder, there is involvement of various peripheral organs in SMA. Notably, fatty liver has been observed in SMA mouse models and SMA patients. Nevertheless, it remains unclear whether intrinsic depletion of SMN protein in the liver contributes to pathology in the peripheral or central nervous systems. To address this, we developed a mouse model with a liver-specific depletion of SMN by utilizing an Alb-Cre transgene together with one Smn2B allele and one Smn1 exon 7 allele flanked by loxP sites. Initially, we evaluated phenotypic changes in these mice at postnatal day 19 (P19), when the severe model of SMA, the Smn2B/- mice, exhibit many symptoms of the disease. The liver-specific SMN depletion does not induce motor neuron death, neuromuscular pathology or muscle atrophy, characteristics typically observed in the Smn2B/- mouse at P19. However, mild liver steatosis was observed, although no changes in liver function were detected. Notably, pancreatic alterations resembled that of Smn2B/-mice, with a decrease in insulin-producing β-cells and an increase in glucagon-producingα-cells, accompanied by a reduction in blood glucose and an increase in plasma glucagon and glucagon-like peptide (GLP-1). These changes were transient, as mice at P60 exhibited recovery of liver and pancreatic function. While the mosaic pattern of the Cre-mediated excision precludes definitive conclusions regarding the contribution of liver-specific SMN depletion to overall tissue pathology, our findings highlight an intricate connection between liver function and pancreatic abnormalities in SMA.