Abstract

Shotgun metagenomic sequencing is a powerful approach to study microbiomes in an unbiased manner and of increasing relevance for identifying novel enzymatic functions. However, the potential of metagenomics to relate from microbiome composition to function has thus far been underutilized. Here, we introduce the Metagenomics Genome-Phenome Association (MetaGPA) study framework, which allows linking genetic information in metagenomes with a dedicated functional phenotype. We applied MetaGPA to identify enzymes associated with cytosine modifications in environmental samples. From the 2365 genes that met our significance criteria, we confirm known pathways for cytosine modifications and proposed novel cytosine-modifying mechanisms. Specifically, we characterized and identified a novel nucleic acid modifying enzyme, 5-hydroxymethylcytosine carbamoyltransferase, that catalyzes the formation of a previously unknown cytosine modification, 5-carbamoyloxymethylcytosine, in DNA and RNA. Our work introduces MetaGPA as a novel and versatile tool for advancing functional metagenomics.

Data availability

All raw and processed sequencing data generated in this study have been submitted to the NCBI Sequence Read Archive (SRA; https://www.ncbi.nlm.nih.gov/sra) under accession number PRJNA714147.

The following data sets were generated

Article and author information

Author details

  1. Weiwei Yang

    Research department, New England Biolabs Inc, Ipswich, United States
    Competing interests
    Weiwei Yang, The author is employee of New England Biolabs Inc. a manufacturer of restriction enzymes and molecular reagents..
  2. Yu-Cheng Lin

    Research department, New England Biolabs Inc, Ipswich, United States
    Competing interests
    Yu-Cheng Lin, The author was an employee of New England Biolabs Inc. a manufacturer of restriction enzymes and molecular reagents..
  3. William Johnson

    Research department, New England Biolabs Inc, Ipswich, United States
    Competing interests
    William Johnson, The author was an employee of New England Biolabs Inc. a manufacturer of restriction enzymes and molecular reagents..
  4. Nan Dai

    RNA Biology, New England Biolabs Inc, Ipswich, United States
    Competing interests
    Nan Dai, The author is an employee of New England Biolabs Inc. a manufacturer of restriction enzymes and molecular reagents..
  5. Romualdas Vaisvila

    Research department, New England Biolabs Inc, Ipswich, United States
    Competing interests
    Romualdas Vaisvila, The author is an employee of New England Biolabs Inc. a manufacturer of restriction enzymes and molecular reagents..
  6. Peter Weigele

    Research department, New England Biolabs Inc, Ipswich, United States
    Competing interests
    Peter Weigele, The author is an employee of New England Biolabs Inc. a manufacturer of restriction enzymes and molecular reagents..
  7. Yan-Jiun Lee

    Research department, New England Biolabs Inc, Ipswich, United States
    Competing interests
    Yan-Jiun Lee, The author is an employee of New England Biolabs Inc. a manufacturer of restriction enzymes and molecular reagents..
  8. Ivan R Corrêa Jr

    RNA Biology, New England Biolabs Inc, Ipswich, United States
    Competing interests
    Ivan R Corrêa, The author is an employee of New England Biolabs Inc. a manufacturer of restriction enzymes and molecular reagents..
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-3169-6878
  9. Ira Schildkraut

    Research department, New England Biolabs Inc, Ipswich, United States
    Competing interests
    Ira Schildkraut, The author is an employee of New England Biolabs Inc. a manufacturer of restriction enzymes and molecular reagents..
  10. Laurence Ettwiller

    Research department, New England Biolabs Inc, Ipswich, United States
    For correspondence
    laurence.ettwiller@gmail.com
    Competing interests
    Laurence Ettwiller, The author is employee of New England Biolabs Inc. a manufacturer of restriction enzymes and molecular reagents..
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-3957-6539

Funding

New England Biolabs (no data)

  • Weiwei Yang
  • Yu-Cheng Lin
  • William Johnson
  • Nan Dai
  • Romualdas Vaisvila
  • Peter Weigele
  • Yan-Jiun Lee
  • Ivan R Corrêa Jr
  • Ira Schildkraut
  • Laurence Ettwiller

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2021, Yang et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 1,713
    views
  • 212
    downloads
  • 4
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Weiwei Yang
  2. Yu-Cheng Lin
  3. William Johnson
  4. Nan Dai
  5. Romualdas Vaisvila
  6. Peter Weigele
  7. Yan-Jiun Lee
  8. Ivan R Corrêa Jr
  9. Ira Schildkraut
  10. Laurence Ettwiller
(2021)
A Genome-Phenome Association study in native microbiomes identifies a mechanism for cytosine modification in DNA and RNA
eLife 10:e70021.
https://doi.org/10.7554/eLife.70021

Share this article

https://doi.org/10.7554/eLife.70021

Further reading

    1. Genetics and Genomics
    Wenjing Liu, Shujin Li ... Xianjun Zhu
    Research Article

    Familial exudative vitreoretinopathy (FEVR) is a severe genetic disorder characterized by incomplete vascularization of the peripheral retina and associated symptoms that can lead to vision loss. However, the underlying genetic causes of approximately 50% of FEVR cases remain unknown. Here, we report two heterozygous variants in calcyphosine-like gene (CAPSL) that is associated with FEVR. Both variants exhibited compromised CAPSL protein expression. Vascular endothelial cell (EC)-specific inactivation of Capsl resulted in delayed radial/vertical vascular progression, compromised endothelial proliferation/migration, recapitulating the human FEVR phenotypes. CAPSL-depleted human retinal microvascular endothelial cells (HRECs) exhibited impaired tube formation, decreased cell proliferation, disrupted cell polarity establishment, and filopodia/lamellipodia formation, as well as disrupted collective cell migration. Transcriptomic and proteomic profiling revealed that CAPSL abolition inhibited the MYC signaling axis, in which the expression of core MYC targeted genes were profoundly decreased. Furthermore, a combined analysis of CAPSL-depleted HRECs and c-MYC-depleted human umbilical vein endothelial cells uncovered similar transcription patterns. Collectively, this study reports a novel FEVR-associated candidate gene, CAPSL, which provides valuable information for genetic counseling of FEVR. This study also reveals that compromised CAPSL function may cause FEVR through MYC axis, shedding light on the potential involvement of MYC signaling in the pathogenesis of FEVR.

    1. Developmental Biology
    2. Genetics and Genomics
    Mitchell Bestry, Alexander N Larcombe ... David Martino
    Research Article

    Alcohol consumption in pregnancy can affect genome regulation in the developing offspring but results have been contradictory. We employed a physiologically relevant murine model of short-term moderate prenatal alcohol exposure (PAE) resembling common patterns of alcohol consumption in pregnancy in humans. Early moderate PAE was sufficient to affect site-specific DNA methylation in newborn pups without altering behavioural outcomes in adult littermates. Whole-genome bisulfite sequencing of neonatal brain and liver revealed stochastic influence on DNA methylation that was mostly tissue-specific, with some perturbations likely originating as early as gastrulation. DNA methylation differences were enriched in non-coding genomic regions with regulatory potential indicative of broad effects of alcohol on genome regulation. Replication studies in human cohorts with fetal alcohol spectrum disorder suggested some effects were metastable at genes linked to disease-relevant traits including facial morphology, intelligence, educational attainment, autism, and schizophrenia. In our murine model, a maternal diet high in folate and choline protected against some of the damaging effects of early moderate PAE on DNA methylation. Our studies demonstrate that early moderate exposure is sufficient to affect fetal genome regulation even in the absence of overt phenotypic changes and highlight a role for preventative maternal dietary interventions.