An atlas of the binding specificities of transcription factors in Pseudomonas aeruginosa directs prediction of novel regulators in virulence
Abstract
A high-throughput systematic evolution of ligands by exponential enrichment assay was applied to 371 putative TFs in P. aeruginosa, which resulted in the robust enrichment of 199 sequence motifs describing the binding specificities of 182 TFs. By scanning the genome, we predicted in total 33,709 significant interactions between TFs and their target loci, which were more than 11-fold enriched in the intergenic regions but depleted in the gene body regions. To further explore and delineate the physiological and pathogenic roles of TFs in P. aeruginosa, we constructed regulatory networks for nine major virulence-associated pathways, and found that 51 TFs were potentially significantly associated with these virulence pathways, 32 of which had not been characterized before, and some were even involved in multiple pathways. These results will significantly facilitate future studies on transcriptional regulation in P. aeruginosa and other relevant pathogens, and accelerate to discover effective treatment and prevention strategies for the associated infectious diseases.
Data availability
Sequencing data have been deposited in GEO under accession code GSE151518.
Article and author information
Author details
Funding
National Natural Science Foundation of China (8187364)
- Jian Yan
City University of Hong Kong (7005314)
- Jian Yan
National Natural Science Foundation of China (31900443)
- Wenju Sun
National Natural Science Foundation of China (31870116)
- Xin Deng
Research Grants Council, University Grants Committee (21103018)
- Xin Deng
Research Grants Council, University Grants Committee (21100420)
- Jian Yan
Research Grants Council, University Grants Committee (11101619)
- Xin Deng
China Postdoctoral Science Foundation (2019M663799)
- Wenju Sun
China Postdoctoral Science Foundation (2019M663794)
- Ligang Fan
City University of Hong Kong (9667188)
- Jian Yan
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Copyright
© 2021, Wang et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 5,280
- views
-
- 629
- downloads
-
- 24
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Epidemiology and Global Health
- Genetics and Genomics
Alzheimer’s disease (AD) is a complex degenerative disease of the central nervous system, and elucidating its pathogenesis remains challenging. In this study, we used the inverse-variance weighted (IVW) model as the major analysis method to perform hypothesis-free Mendelian randomization (MR) analysis on the data from MRC IEU OpenGWAS (18,097 exposure traits and 16 AD outcome traits), and conducted sensitivity analysis with six models, to assess the robustness of the IVW results, to identify various classes of risk or protective factors for AD, early-onset AD, and late-onset AD. We generated 400,274 data entries in total, among which the major analysis method of the IVW model consists of 73,129 records with 4840 exposure traits, which fall into 10 categories: Disease, Medical laboratory science, Imaging, Anthropometric, Treatment, Molecular trait, Gut microbiota, Past history, Family history, and Lifestyle trait. More importantly, a freely accessed online platform called MRAD (https://gwasmrad.com/mrad/) has been developed using the Shiny package with MR analysis results. Additionally, novel potential AD therapeutic targets (CD33, TBCA, VPS29, GNAI3, PSME1) are identified, among which CD33 was positively associated with the main outcome traits of AD, as well as with both EOAD and LOAD. TBCA and VPS29 were negatively associated with the main outcome traits of AD, as well as with both EOAD and LOAD. GNAI3 and PSME1 were negatively associated with the main outcome traits of AD, as well as with LOAD, but had no significant causal association with EOAD. The findings of our research advance our understanding of the etiology of AD.
-
- Chromosomes and Gene Expression
- Genetics and Genomics
Annotation of newly sequenced genomes frequently includes genes, but rarely covers important non-coding genomic features such as the cis-regulatory modules—e.g., enhancers and silencers—that regulate gene expression. Here, we begin to remedy this situation by developing a workflow for rapid initial annotation of insect regulatory sequences, and provide a searchable database resource with enhancer predictions for 33 genomes. Using our previously developed SCRMshaw computational enhancer prediction method, we predict over 2.8 million regulatory sequences along with the tissues where they are expected to be active, in a set of insect species ranging over 360 million years of evolution. Extensive analysis and validation of the data provides several lines of evidence suggesting that we achieve a high true-positive rate for enhancer prediction. One, we show that our predictions target specific loci, rather than random genomic locations. Two, we predict enhancers in orthologous loci across a diverged set of species to a significantly higher degree than random expectation would allow. Three, we demonstrate that our predictions are highly enriched for regions of accessible chromatin. Four, we achieve a validation rate in excess of 70% using in vivo reporter gene assays. As we continue to annotate both new tissues and new species, our regulatory annotation resource will provide a rich source of data for the research community and will have utility for both small-scale (single gene, single species) and large-scale (many genes, many species) studies of gene regulation. In particular, the ability to search for functionally related regulatory elements in orthologous loci should greatly facilitate studies of enhancer evolution even among distantly related species.