Global biogeographic sampling of bacterial secondary metabolism

  1. Zachary Charlop-Powers
  2. Jeremy G Owen
  3. Boojala Vijay B Reddy
  4. Melinda A Ternei
  5. Denise O Guimarães
  6. Ulysses A de Frias
  7. Monica T Pupo
  8. Prudy Seepe
  9. Zhiyang Feng
  10. Sean F Brady  Is a corresponding author
  1. Howard Hughes Medical Institute, Rockefeller University, United States
  2. Universidade Federal do Rio de Janeiro, Brazil
  3. University of São Paulo, Brazil
  4. Nelson R Mandela School of Medicine, South Africa
  5. Nanjing Agricultural University, China

Abstract

Recent bacterial (meta)genome sequencing efforts suggest the existence of an enormous untapped reservoir of natural-product-encoding biosynthetic gene clusters in the environment. Here we use the pyro-sequencing of PCR amplicons derived from both nonribosomal peptide adenylation domains and polyketide ketosynthase domains to compare biosynthetic diversity in soil microbiomes from around the globe. We see large differences in domain populations from all except the most proximal and biome-similar samples, suggesting that most microbiomes will encode largely distinct collections of bacterial secondary metabolites. Our data indicate a correlation between two factors, geographic distance and biome-type, and the biosynthetic diversity found in soil environments. By assigning reads to known gene clusters we identify hotspots of biomedically relevant biosynthetic diversity. These observations not only provide new insights into the natural world, they also provide a road map for guiding future natural products discovery efforts.

Article and author information

Author details

  1. Zachary Charlop-Powers

    Laboratory for Genetically Encoded Small Molecules, Howard Hughes Medical Institute, Rockefeller University, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Jeremy G Owen

    Laboratory of Genetically Encoded Small Molecules, Howard Hughes Medical Institute, Rockefeller University, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. Boojala Vijay B Reddy

    Laboratory of Genetically Encoded Small Molecules, Howard Hughes Medical Institute, Rockefeller University, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Melinda A Ternei

    Laboratory of Genetically Encoded Small Molecules, Howard Hughes Medical Institute, Rockefeller University, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  5. Denise O Guimarães

    Laboratório de Produtos Naturais, Curso de Farmácia, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
    Competing interests
    The authors declare that no competing interests exist.
  6. Ulysses A de Frias

    School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, São Paulo, Brazil
    Competing interests
    The authors declare that no competing interests exist.
  7. Monica T Pupo

    School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, São Paulo, Brazil
    Competing interests
    The authors declare that no competing interests exist.
  8. Prudy Seepe

    KwaZulu-Natal Research Institute for Tuberculosis and HIV, Nelson R Mandela School of Medicine, Durban, South Africa
    Competing interests
    The authors declare that no competing interests exist.
  9. Zhiyang Feng

    College of Food Science and Technology, Nanjing Agricultural University, Nanjing, China
    Competing interests
    The authors declare that no competing interests exist.
  10. Sean F Brady

    Laboratory of Genetically Encoded Small Molecules, Howard Hughes Medical Institute, Rockefeller University, New York, United States
    For correspondence
    sbrady@rockefeller.edu
    Competing interests
    The authors declare that no competing interests exist.

Reviewing Editor

  1. Jon Clardy, Harvard Medical School, United States

Version history

  1. Received: October 5, 2014
  2. Accepted: January 7, 2015
  3. Accepted Manuscript published: January 19, 2015 (version 1)
  4. Version of Record published: February 13, 2015 (version 2)

Copyright

© 2015, Charlop-Powers et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 5,851
    views
  • 963
    downloads
  • 101
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Zachary Charlop-Powers
  2. Jeremy G Owen
  3. Boojala Vijay B Reddy
  4. Melinda A Ternei
  5. Denise O Guimarães
  6. Ulysses A de Frias
  7. Monica T Pupo
  8. Prudy Seepe
  9. Zhiyang Feng
  10. Sean F Brady
(2015)
Global biogeographic sampling of bacterial secondary metabolism
eLife 4:e05048.
https://doi.org/10.7554/eLife.05048

Share this article

https://doi.org/10.7554/eLife.05048

Further reading

    1. Computational and Systems Biology
    2. Genetics and Genomics
    Weichen Song, Yongyong Shi, Guan ning Lin
    Tools and Resources

    We propose a new framework for human genetic association studies: at each locus, a deep learning model (in this study, Sei) is used to calculate the functional genomic activity score for two haplotypes per individual. This score, defined as the Haplotype Function Score (HFS), replaces the original genotype in association studies. Applying the HFS framework to 14 complex traits in the UK Biobank, we identified 3619 independent HFS–trait associations with a significance of p < 5 × 10−8. Fine-mapping revealed 2699 causal associations, corresponding to a median increase of 63 causal findings per trait compared with single-nucleotide polymorphism (SNP)-based analysis. HFS-based enrichment analysis uncovered 727 pathway–trait associations and 153 tissue–trait associations with strong biological interpretability, including ‘circadian pathway-chronotype’ and ‘arachidonic acid-intelligence’. Lastly, we applied least absolute shrinkage and selection operator (LASSO) regression to integrate HFS prediction score with SNP-based polygenic risk scores, which showed an improvement of 16.1–39.8% in cross-ancestry polygenic prediction. We concluded that HFS is a promising strategy for understanding the genetic basis of human complex traits.

    1. Genetics and Genomics
    2. Immunology and Inflammation
    Jean-David Larouche, Céline M Laumont ... Claude Perreault
    Research Article

    Transposable elements (TEs) are repetitive sequences representing ~45% of the human and mouse genomes and are highly expressed by medullary thymic epithelial cells (mTECs). In this study, we investigated the role of TEs on T-cell development in the thymus. We performed multiomic analyses of TEs in human and mouse thymic cells to elucidate their role in T-cell development. We report that TE expression in the human thymus is high and shows extensive age- and cell lineage-related variations. TE expression correlates with multiple transcription factors in all cell types of the human thymus. Two cell types express particularly broad TE repertoires: mTECs and plasmacytoid dendritic cells (pDCs). In mTECs, transcriptomic data suggest that TEs interact with transcription factors essential for mTEC development and function (e.g., PAX1 and REL), and immunopeptidomic data showed that TEs generate MHC-I-associated peptides implicated in thymocyte education. Notably, AIRE, FEZF2, and CHD4 regulate small yet non-redundant sets of TEs in murine mTECs. Human thymic pDCs homogenously express large numbers of TEs that likely form dsRNA, which can activate innate immune receptors, potentially explaining why thymic pDCs constitutively secrete IFN ɑ/β. This study highlights the diversity of interactions between TEs and the adaptive immune system. TEs are genetic parasites, and the two thymic cell types most affected by TEs (mTEcs and pDCs) are essential to establishing central T-cell tolerance. Therefore, we propose that orchestrating TE expression in thymic cells is critical to prevent autoimmunity in vertebrates.