Natural variation in the consequences of gene overexpression and its implications for evolutionary trajectories
Abstract
Copy number variation (CNV) through gene or chromosome amplification provides a route for rapid phenotypic variation and supports long-term evolution of gene functions. Although the evolutionary importance of CNV is known, little is understood about how genetic background influences CNV tolerance. Here, we measured fitness costs of over 4,000 over-expressed genes in 15 Saccharomyces cerevisiae strains representing different lineages, to explore natural variation in tolerating gene overexpression (OE). Strain-specific effects dominated the fitness costs of gene OE. We report global differences in the consequences of gene OE, independent of the amplified gene, as well as gene-specific effects that were dependent on the genetic background. Natural variation in the response to gene OE could be explained by several models, including strain-specific physiological differences, resource limitations, and regulatory sensitivities. This work provides new insight on how genetic background influences tolerance to gene amplification and the evolutionary trajectories accessible to different backgrounds.
Data availability
Barcode sequencing data are available in the Short Read Archive under accession number GSE171586. RNA-Seq data are available in GEO accession number GSE171585.
Article and author information
Author details
Funding
National Cancer Institute (R01CA229532)
- James Hose
- Adam Jochem
- Audrey P Gasch
U.S. Department of Energy (DE-SC0018409)
- Mike Place
- Audrey P Gasch
National Institutes of Health (GT32GM007133)
- DeElegant Robinson
National Human Genome Research Institute (5T32HG002760)
- DeElegant Robinson
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Reviewing Editor
- Kevin J Verstrepen, VIB-KU Leuven Center for Microbiology, Belgium
Publication history
- Preprint posted: May 19, 2021 (view preprint)
- Received: May 21, 2021
- Accepted: July 30, 2021
- Accepted Manuscript published: August 2, 2021 (version 1)
- Version of Record published: August 9, 2021 (version 2)
Copyright
© 2021, Robinson et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,493
- Page views
-
- 137
- Downloads
-
- 7
- Citations
Article citation count generated by polling the highest count across the following sources: PubMed Central, Crossref, Scopus.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Genetics and Genomics
The b-hemoglobinopathies, such as sickle cell disease and b-thalassemia, are one of the most common genetic diseases worldwide and are caused by mutations affecting the structure or production of β-globin subunits in adult hemoglobin. Many gene editing efforts to treat the β-hemoglobinopathies attempt to correct β-globin mutations or increase γ-globin for fetal hemoglobin production. δ-globin, the subunit of adult hemoglobin A2, has high homology to β-globin and is already pan-cellularly expressed at low levels in adult red blood cells. However, upregulation of δ-globin is a relatively unexplored avenue to increase the amount of functional hemoglobin. Here, we use CRISPR-Cas9 to repair non-functional transcriptional elements in the endogenous promoter region of δ-globin to increase overall expression of adult hemoglobin 2 (HbA2). We find that insertion of a KLF1 site alone is insufficient to upregulate δ-globin. Instead, multiple transcription factor elements are necessary for robust upregulation of δ-globin from the endogenous locus. Promoter edited HUDEP-2 immortalized erythroid progenitor cells exhibit striking increases of HBD transcript, from less than 5% to over 20% of total β-like globins in clonal populations. Edited CD34+ hematopoietic stem and progenitors (HSPCs) differentiated to primary human erythroblasts express up to 46% HBD in clonal populations. These findings add mechanistic insight to globin gene regulation and offer a new therapeutic avenue to treat β-hemoglobinopathies.
-
- Genetics and Genomics
Principal Component Analysis (PCA) and the Linear Mixed-effects Model (LMM), sometimes in combination, are the most common genetic association models. Previous PCA-LMM comparisons give mixed results, unclear guidance, and have several limitations, including not varying the number of principal components (PCs), simulating simple population structures, and inconsistent use of real data and power evaluations. We evaluate PCA and LMM both varying number of PCs in realistic genotype and complex trait simulations including admixed families, subpopulation trees, and real multiethnic human datasets with simulated traits. We find that LMM without PCs usually performs best, with the largest effects in family simulations and real human datasets and traits without environment effects. Poor PCA performance on human datasets is driven by large numbers of distant relatives more than the smaller number of closer relatives. While PCA was known to fail on family data, we report strong effects of family relatedness in genetically diverse human datasets, not avoided by pruning close relatives. Environment effects driven by geography and ethnicity are better modeled with LMM including those labels instead of PCs. This work better characterizes the severe limitations of PCA compared to LMM in modeling the complex relatedness structures of multiethnic human data for association studies.