Systematic identification of cis-regulatory variants that cause gene expression differences in a yeast cross

  1. Kaushik Renganaath
  2. Rockie Chong
  3. Laura Day
  4. Sriram Kosuri
  5. Leonid Kruglyak  Is a corresponding author
  6. Frank Wolfgang Albert  Is a corresponding author
  1. University of Minnesota, United States
  2. University of California, Los Angeles, United States

Abstract

Sequence variation in regulatory DNA alters gene expression and shapes genetically complex traits. However, the identification of individual, causal regulatory variants is challenging. Here, we used a massively parallel reporter assay to measure the cis-regulatory consequences of 5,832 natural DNA variants in the promoters of 2,503 genes in the yeast Saccharomyces cerevisiae. We identified 451 causal variants, which underlie genetic loci known to affect gene expression. Several promoters harbored multiple causal variants. In five promoters, pairs of variants showed non-additive, epistatic interactions. Causal variants were enriched at conserved nucleotides, tended to have low derived allele frequency, and were depleted from promoters of essential genes, which is consistent with the action of negative selection. Causal variants were also enriched for alterations in transcription factor binding sites. Models integrating these features provided modest, but statistically significant, ability to predict causal variants. This work revealed a complex molecular basis for cis-acting regulatory variation.

Data availability

Raw data and barcode assignments to oligos are available under GEO accession GSE155944. Source Data is provided for Figures 2, 3, 4, 5, and 6. Additional processed data and the MPRA design are available as Supplementary Files.

The following data sets were generated
The following previously published data sets were used

Article and author information

Author details

  1. Kaushik Renganaath

    Department of Genetics, Cell Biology, & Development, University of Minnesota, Minneapolis, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-1010-3604
  2. Rockie Chong

    Department of Chemistry & Biochemistry, University of California, Los Angeles, Los Angeles, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. Laura Day

    Department of Human Genetics, University of California, Los Angeles, Los Angeles, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Sriram Kosuri

    Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-4661-0600
  5. Leonid Kruglyak

    Department of Human Genetics, University of California, Los Angeles, Los Angeles, United States
    For correspondence
    LKruglyak@mednet.ucla.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-8065-3057
  6. Frank Wolfgang Albert

    Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, United States
    For correspondence
    falbert@umn.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1380-8063

Funding

National Institutes of Health (R35GM124676)

  • Frank Wolfgang Albert

Howard Hughes Medical Institute

  • Leonid Kruglyak

Pew Charitable Trusts

  • Frank Wolfgang Albert

Alfred P. Sloan Foundation

  • Frank Wolfgang Albert

Kinship Foundation

  • Sriram Kosuri

Department of Energy, Labor and Economic Growth (DE-FC02-02ER63421)

  • Sriram Kosuri

National Institutes of Health (R01GM102308)

  • Leonid Kruglyak

National Institutes of Health (DP2GM114829)

  • Sriram Kosuri

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Christian R Landry, Université Laval, Canada

Version history

  1. Received: September 2, 2020
  2. Accepted: November 11, 2020
  3. Accepted Manuscript published: November 12, 2020 (version 1)
  4. Version of Record published: November 24, 2020 (version 2)

Copyright

© 2020, Renganaath et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 2,726
    views
  • 287
    downloads
  • 12
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Kaushik Renganaath
  2. Rockie Chong
  3. Laura Day
  4. Sriram Kosuri
  5. Leonid Kruglyak
  6. Frank Wolfgang Albert
(2020)
Systematic identification of cis-regulatory variants that cause gene expression differences in a yeast cross
eLife 9:e62669.
https://doi.org/10.7554/eLife.62669

Share this article

https://doi.org/10.7554/eLife.62669

Further reading

    1. Genetics and Genomics
    2. Neuroscience
    Bohan Zhu, Richard I Ainsworth ... Javier González-Maeso
    Research Article

    Genome-wide association studies have revealed >270 loci associated with schizophrenia risk, yet these genetic factors do not seem to be sufficient to fully explain the molecular determinants behind this psychiatric condition. Epigenetic marks such as post-translational histone modifications remain largely plastic during development and adulthood, allowing a dynamic impact of environmental factors, including antipsychotic medications, on access to genes and regulatory elements. However, few studies so far have profiled cell-specific genome-wide histone modifications in postmortem brain samples from schizophrenia subjects, or the effect of antipsychotic treatment on such epigenetic marks. Here, we conducted ChIP-seq analyses focusing on histone marks indicative of active enhancers (H3K27ac) and active promoters (H3K4me3), alongside RNA-seq, using frontal cortex samples from antipsychotic-free (AF) and antipsychotic-treated (AT) individuals with schizophrenia, as well as individually matched controls (n=58). Schizophrenia subjects exhibited thousands of neuronal and non-neuronal epigenetic differences at regions that included several susceptibility genetic loci, such as NRG1, DISC1, and DRD3. By analyzing the AF and AT cohorts separately, we identified schizophrenia-associated alterations in specific transcription factors, their regulatees, and epigenomic and transcriptomic features that were reversed by antipsychotic treatment; as well as those that represented a consequence of antipsychotic medication rather than a hallmark of schizophrenia in postmortem human brain samples. Notably, we also found that the effect of age on epigenomic landscapes was more pronounced in frontal cortex of AT-schizophrenics, as compared to AF-schizophrenics and controls. Together, these data provide important evidence of epigenetic alterations in the frontal cortex of individuals with schizophrenia, and remark for the first time on the impact of age and antipsychotic treatment on chromatin organization.

    1. Computational and Systems Biology
    2. Genetics and Genomics
    Weichen Song, Yongyong Shi, Guan Ning Lin
    Tools and Resources

    We propose a new framework for human genetic association studies: at each locus, a deep learning model (in this study, Sei) is used to calculate the functional genomic activity score for two haplotypes per individual. This score, defined as the Haplotype Function Score (HFS), replaces the original genotype in association studies. Applying the HFS framework to 14 complex traits in the UK Biobank, we identified 3619 independent HFS–trait associations with a significance of p < 5 × 10−8. Fine-mapping revealed 2699 causal associations, corresponding to a median increase of 63 causal findings per trait compared with single-nucleotide polymorphism (SNP)-based analysis. HFS-based enrichment analysis uncovered 727 pathway–trait associations and 153 tissue–trait associations with strong biological interpretability, including ‘circadian pathway-chronotype’ and ‘arachidonic acid-intelligence’. Lastly, we applied least absolute shrinkage and selection operator (LASSO) regression to integrate HFS prediction score with SNP-based polygenic risk scores, which showed an improvement of 16.1–39.8% in cross-ancestry polygenic prediction. We concluded that HFS is a promising strategy for understanding the genetic basis of human complex traits.