Peer review process
Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, public reviews, and a provisional response from the authors.
Read more about eLife’s peer review process.Editors
- Reviewing EditorAlyssa WilsonIcahn School of Medicine at Mount Sinai, New York, United States of America
- Senior EditorKathryn CheahUniversity of Hong Kong, Hong Kong, Hong Kong
Reviewer #1 (Public review):
Summary:
In this manuscript, the authors present a pipeline for the identification of transcription factor (TF) co-occurrence in regulatory regions. This pipeline aims to generate a catalogue of combinations of TFs working together, and the authors apply this during human embryonic development. In particular, they identified co-occurrences of TFs starting from H3K27ac ChIP-seq and RNA-seq input data to select active enhancers and transcribed TFs. The pipeline is applied to explore TF motifs co-occurrence at tissue-specific developmental enhancers across 11 human embryonic tissues. The application of the pipeline suggests the presence of regulatory patterns in different human developmental tissue-specific enhancers in association with ubiquitous TFs. The authors further explore the role of TEAD1 (an ubiquitously expressed TF) as a repressor. They test the role of TEAD1 as a co-repressor using a luciferase assay and tissue-specific enhancers, either alone or combined with a YAP coactivator. Overall, this paper presents an important aspect in mammalian gene regulation, the cooperative binding of TFs, and provides an important resource for TF pairs.
Strengths:
I appreciated the number of datasets analysed and the validation of a subset of enhancers.
Weaknesses:
Not many, but probably validation at more enhancers could have made the paper stronger.
Reviewer #2 (Public review):
Summary:
Garcia-Mora et al. presented a two-step bioinformatics pipeline using H3K27ac ChIP-seq and RNA-seq data from 11 human embryonic tissues published by the same groups of senior authors. "First Search" identifies motifs for TFs that are both tissue-restricted in expression and enriched in tissue-specific enhancers. "Second Search" then looks for additional motifs that co-occur near each "First Search" motif. The authors here went further than previous motif co-occurrence/co-enrichment analyses by identifying TEAD motifs as (1) representing a ubiquitously expressed family and (2) showing high co-occurrence with tissue-specific motifs at tissue-specific enhancers. They then elaborate on this finding and speculate that "TEAD, in concert with cardiac-restricted transcriptional regulators, may contribute to the recruitment of CHD4 and may play a role in attenuating the activity of enhancers involved in cardiomyocyte differentiation." They also discussed validation experiments using the luciferase assay.
Strengths:
The manuscript is well-written and easy to follow for the most part.
Weaknesses:
My main concerns and criticisms are about the sensitivity of the method and the validation of experiment designs and conclusions. Some examples where validation could be improved are as follows:
(1) The authors propose a mechanism of a TF trio (TEAD - CHD4 - tissue-specific TFs). However, only one validation experiment checked CHD4. CHD4 binding was not mentioned at all in the other cases.
(2) The authors integrated E12.5 TEAD binding with E11.5 acetylation data, and it would be important to show that this experimental approach is valid or otherwise qualify its limitations.
(3) Motif co-occurrence analysis was extended to claiming TF interactions without further validation.
Reviewer #3 (Public review):
Summary:
Mora et al employ published ChIP-seq and RNA-seq from embryonic tissues to nominate transcription factors that work combinatorially during development. This manuscript addresses an important gap in knowledge regarding the complexities of gene regulation. However, as written, the manuscript is focused on confirming mostly known associations and does not unveil principles that can be broadly applied, given multiple technical caveats that are outlined below.
Strengths:
(1) Instead of focusing on a single transcription factor motif enriched within peaks, the authors search the flanking regions of enriched motifs to nominate additional transcription factors that may work cooperatively to provide organ specificity. This type of analysis is a crucial next step in the gene regulation field, as transcription factors rarely work independently.
(2) Figure 6 is a good demonstration of the preliminary experiments that can be done to test the activity of co-occurring motifs.
(3) This is a really nice resource of organ-specific motif associations that can be used to generate many testable hypotheses.
(4) The rationale and writing are very clear and easy to read.
Weaknesses:
(1) Much of this manuscript focuses on confirming transcription factor relationships that have been reported previously. For example, it is well known that GATA4 interacts with MEF2 in the ventricle. There are limited new or unexpected associations discussed and tested.
(2) Embryonic tissues are highly heterogeneous, limiting the utility of the bulk ChIP-seq employed in these analyses. Does the cellular heterogeneity explain the discrepancy between TEAD binding and histone acetylation? Similarly, how does conservation between species affect the TF predictions?
(3) Some of the interpretations should also be fleshed out a bit more to clarify the advantage of the analyses presented here. For example, if Gata4 and Foxa2 transcripts are expressed during different stages of development, then it's likely that (as stated by the authors) these motifs are not used during the same stage of development. But examining the flanking regions wasn't necessary to make that statement. This type of conclusion seems tangential to the benefit of this analysis, which is to understand which TFs work together in a single organ at a single time point.
(4) This manuscript hinges on luciferase assays whose results can be difficult to translate to complex gene regulation networks. Many motifs are often clustered together, which makes designing experiments at endogenous loci important in studies such as this one.