The Wg and Dpp morphogens regulate gene expression by modulating the frequency of transcriptional bursts

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Morphogen signaling contributes to the patterned spatiotemporal expression of genes during development. One mode of regulation of signaling-responsive genes is at the level of transcription. Single-cell quantitative studies of transcription have revealed that transcription occurs intermittently, in bursts. Although the effects of many gene regulatory mechanisms on transcriptional bursting have been studied, it remains unclear how morphogen gradients affect this dynamic property of downstream genes. Here we have adapted single molecule fluorescence in situ hybridization (smFISH) for use in the Drosophila wing imaginal disc in order to measure nascent and mature mRNA of genes downstream of the Wg and Dpp morphogen gradients. We compared our experimental results with predictions from stochastic models of transcription, which indicated that the transcription levels of these genes appear to share a common method of control via burst frequency modulation. Our data help further elucidate the link between developmental gene regulatory mechanisms and transcriptional bursting.

Introduction

Paracrine signaling is a highly conserved means for cells within a tissue to communicate with one another to regulate diverse activities including proliferation, differentiation, apoptosis, and movement. Many of these activities are mediated by changes in gene transcription that are brought about by reception of the signals. Paracrine factors acting as morphogens are a particularly important class of gene regulators. Morphogens form spatially-extended gradients from the source of their synthesis, and elicit different transcription outputs from target genes, depending on local concentration of the morphogen (Tabata and Takei, 2004). Many paracrine signals regulate gene transcription via control of the availability or activity of sequence-specific transcription factors. Some transcription factors regulate assembly of the preinitiation complex composed of Pol II and general factors at the transcription start site (Esnault et al., 2008). Other factors recruit coregulators that modify nucleosomes or remodel the chromatin architecture of the gene (Bannister and Kouzarides, 2011).

However, transcription is a dynamic process, and thus, molecular models of regulation via PIC assembly or chromatin structure, do not adequately capture what kinetic steps in transcription initiation are being regulated. Recently developed methods have uncovered greater complexity in the transcription initiation process than previously imagined. Genes that are constitutively expressed rarely show uniform and continuous mRNA synthesis. Rather, mRNA synthesis occurs in bursts that are interrupted by periods of dormant output. This phenomenon is known as transcriptional bursting (Chen et al., 2019; Chubb et al., 2006; Dey et al., 2015; Raj et al., 2006; Suter et al., 2011).

Various studies have explored how mechanisms of gene regulation affect the size and frequency of transcriptional bursts, and thereby affect transcription output. The availability of transcription factors has been shown to affect burst frequency (Ezer et al., 2016; Larson et al., 2013; Senecal et al., 2014). For example, the Drosophila transcription factors Bicoid and Dorsal have been studied in great detail with respect to their effects on transcription burst frequency in the embryo (Garcia et al., 2013; He et al., 2012; Holloway and Spirov, 2017; Little et al., 2013; Xu et al., 2015). Enhancer strength and enhancer-promoter contact correlate with burst frequency of genes (Bartman et al., 2016; Bothma et al., 2014; Chen et al., 2019; Fukaya et al., 2016; Larsson et al., 2019). These studies altogether suggest that bursting frequency is potentiated by enhancer-promoter contact and is mediated by transcription factors binding to DNA.

In this study, we have explored how the Wnt protein Wingless (Wg) and BMP protein Decapentaplegic (Dpp) regulate transcription dynamics of genes in the Drosophila wing imaginal disc. The Wnt and BMP families of proteins are two highly conserved paracrine factors that can act as morphogens. In canonical Wnt signaling, the binding of extracellular Wnt protein to its transmembrane receptor Frizzled causes β-catenin to be stabilized and free to enter the nucleus, where it relieves repression of Wnt-responsive genes by binding to the sequence-specific transcription factor TCF (Clevers and Nusse, 2012; Swarup and Verheyen, 2012). In canonical BMP signaling, ligand binding to receptor triggers phosphorylation of SMAD proteins, which translocate to the nucleus along with co-SMADs, bind to responsive genes, and activate their transcription (Hamaratoglu et al., 2014; Shi and Massagué, 2003).

To explore the effects of Dpp and Wg signaling on transcription dynamics, we have adapted single molecule fluorescent in situ hybridization (smFISH) for use in imaginal disc tissues. We use smFISH to quantify nascent and mature mRNAs for several genes expressed in highly diverse spatial patterns within the wing disc. Taken together, our data suggest that all of the genes investigated are regulated by modulation of their transcription burst frequency by Dpp and Wg even though their mean expression patterns are distinct from one another.

Results

In this study, we have explored how the Wg and Dpp morphogens regulate transcription dynamics in the wing disc. Each morphogen is synthesized in a narrow stripe of cells within the disc. Wg is produced in cells at the boundary between Dorsal and Ventral (DV) compartments of the wing pouch, while Dpp is produced in cells at the boundary between Anterior and Posterior (AP) compartments (Figure 1A). These factors form concentration gradients across the disc, and in the case of Dpp, it regulates gene expression in a concentration-dependent manner.

Figure 1 with 3 supplements see all

Download asset Open asset

smFISH analysis of sfGFP-sens mRNA levels in wing imaginal discs.

(A) Schematic of a wing disc outlining different regional domains, and the positions of boundaries between Dorsal (D) - Ventral (V) and Anterior (A) - Posterior (P) compartments of the disc. Each wing disc is composed of roughly 50,000 cells organized in a pseudostratified epithelium. (B) Schematized expression pattern for Sens inside the wing pouch centered around the DV boundary. Sens is also expressed in clusters of cells in the notum, which are not shown. (**C-E**) Confocal sections of wing discs expressing sfGFP-Sens. (C) sfGFP-Sens protein fluorescence. (D) sfGFP-Sens mRNAs as visualized by smFISH using sfGFP probes. Scale bar = 10 μm. (E) Higher magnification of sfGFP-Sens mRNAs as visualized by smFISH using sfGFP probes. Scale bar = 10 μm. (F) Distribution of wing disc cells as a function of the number of Sens mRNA molecules per cell. (G) Sens mRNA number as a function of cell distance from the DV boundary displays a bimodal expression pattern for Sens. Cells were binned according to the shortest path length from its centroid to the DV boundary, and whether they were dorsal or ventral compartment cells. Median mRNA number/cell for each bin is plotted with 95% bootstrapped confidence intervals.

smFISH detection of mRNA molecules in the wing disc

In order to assay gene expression in the wing imaginal disc, we quantified mRNA numbers using smFISH. With smFISH, a tandem array of fluorescently-labeled oligonucleotides complementary to a given mRNA are hybridized to fixed and permeabilized tissue. When a sufficient number of oligo probes anneal to one mRNA molecule, the aggregate fluorescence can be detected by standard confocal microscopy (Raj et al., 2008). This method has been developed and applied to many systems, including cell culture, C. elegans, and the Drosophila embryo (Ji and van Oudenaarden, 2012; Little and Gregor, 2018; Youk et al., 2010). We developed a robust smFISH method applicable for imaginal discs (see Materials and methods for details).

We first probed for expression of the senseless (sens) gene in the wing disc. Sens is required for cells to adopt a sensory organ fate, and the gene is expressed in two stripes of cells adjacent to and on either side of the DV boundary in the wing pouch (Figure 1B,C; Nolo et al., 2000). Sens expression in the wing pouch is induced by Wg, which is expressed by cells located at the DV boundary (Jafar-Nejad et al., 2006). We probed for sens mRNAs expressed from a transgenic version of the sens gene. We did so for a number of reasons. First, the genomic transgene rescues the endogenous gene based on function and expression (Cassidy et al., 2013). Second, the transgene is tagged such that the amino-terminal coding sequence corresponds to super-fold GFP (sfGFP). By using oligo probes directed against sfGFP, we could easily determine the specificity of detection.

Discs from sfGFP-sens animals were probed and imaged by confocal microscopy, revealing the expected pattern of fluorescence localized to two stripes adjacent to the DV midline in the wing pouch (Figure 1D). The fluorescence signal was specific for sfGFP-sens since wing discs from larvae not carrying the transgene gave a low background fluorescence pattern (Figure 1—figure supplement 1A,B). The fluorescence signal from sfGFP-sens discs was sufficiently bright that spots were readily detected in optical sections when imaged under higher magnification (Figure 1E). The size of each 2D spot was approximately the expected diffraction limit of ~600 nm for smFISH probes emitting at 633 nm wavelength (Lipson et al., 1995). A custom image-analysis pipeline was developed to segment and analyze all of the 3D fluorescent spots in an entire stack of optical sections (Figure 1—figure supplement 1C). Details of the segmentation and analysis are provided in the Materials and methods.

We tested the ability of the pipeline to correctly identify RNA spots by several means. First we expected sfGFP-sens mRNA molecules to generate fluorescence spots with a homogeneous composition since the mRNAs could equivalently anneal to the probes. The distribution of fluorescence intensity for the identified 3D spots was unimodal, suggesting that the spots had a homogeneous composition (Figure 1—figure supplement 1D). Second, we incubated wing discs in medium containing actinomycin-D, an inhibitor of mRNA synthesis. The number of fluorescence spots was greatly diminished, as would be expected if they were localized to mRNA molecules (Figure 1—figure supplement 1E). Third, if the method is accurate, almost all spots would correspond to sfGFP-sens mRNAs. We compared the number of identified spots in discs expressing the sfGFP-sens transgene versus discs lacking the transgene. From this, we estimated that 0.5% of identified spots are false-positive (Figure 1—figure supplement 2A). Finally, we estimated the number of sfGFP-sens mRNAs that fail to be identified as fluorescent spots. We simultaneously hybridized sfGFP-sens wing discs with two sets of non-overlapping probes - one set recognized sfGFP and the other set recognized sens sequences. Each probe-set was labeled with a different fluor. If a spot identified using the sfGFP probe set was not identified by the sens probe-set, we classified that spot as a false-negative. The analysis indicated that a maximum of 6% of mRNAs (232 out of 3842 spots scored) were not identified by both probe-sets (Figure 1—figure supplement 2B). This rate of false-negative identification is comparable to smFISH methods in other systems (Raj et al., 2008).

We next looked to partition identified mRNAs into the cells from which they were expressed. Since the smFISH method denatured the epitopes of all tested antibodies and it also denatured sfGFP, we were unable to segment cells using standard approaches. In the absence of a direct approach, we adopted a computational approach to resolving the smFISH signal at single-cell resolution. Using the fluorescent dye DAPI to visualize cell nuclei in the imaged samples, we segmented nuclei into 3D objects (Figure 1—figure supplement 2C–E), which are located throughout the apical-basal axis of the pseudostratified epithelium of the wing disc (Aldaz and Escudero, 2010). Based on segmented nuclei, we were able to construct effective cell boundaries by performing a 3D Voronoi tessellation (Figure 1—figure supplement 2F). RNAs were then partitioned into the distinct Voronoi cells (Figure 1—figure supplement 2G). Despite the local inaccuracies in our protocol for assigning transcripts to single cells, the Voronoi based tessellation of the three-dimensional tissue is a democratic prescription, lacking any hyperparameters, that is able to reveal the global quantitative trends in the data. The same democratic approach has been used by others in assigning mRNA transcripts to early embryonic nuclei when cell boundaries are unseen (Little et al., 2013). Details of tessellation are provided in the Materials and methods.

The abundance of sens mRNAs within the DV stripes varied from one to fifty molecules per cell (Figure 1F), reflecting the graded expression pattern of Sens protein induced by the Wg morphogen across the width of each stripe (Jafar-Nejad et al., 2006). Binning cells according to their distance from the DV boundary, we were able to observe peaks in mRNA number per cell as a function of distance from the boundary (Figure 1G).

We also used the sfGFP-sens gene to determine whether the smFISH method could detect mRNAs in other imaginal discs. In the eye disc, sens is expressed in a stripe of cells located within the morphogenetic furrow, and indeed we were able to detect smFISH signals in furrow cells of the eye disc (Figure 1—figure supplement 3). Thus, our method is broadly applicable to imaginal discs.

smFISH detection of gene expression regulated by Dpp

We extended the analysis to genes downstream of the BMP family protein Dpp. Dpp is expressed in a stripe of cells located at the AP boundary of the wing disc, orthogonal to the Wg stripe (Figure 2A). Dpp protein is transported bidirectionally to form gradients across the disc, and several genes are regulated by Dpp in a concentration-dependent manner. Spalt-major (salm), optomoter-blind (omb), daughters-against-dpp (dad), and brinker (brk) are expressed in symmetric domains within the anterior and posterior compartments of the wing pouch (Figure 2A,B). Salm is symmetrically expressed in a domain somewhat broader than the Dpp stripe, whereas omb and dad are expressed more broadly, and brk is expressed only near the wing pouch border (de Celis et al., 1996; Grimm and Pflugfelder, 1996; Tabata and Takei, 2004). When smFISH was used to detect mRNAs of these genes, it qualitatively recapitulated their known expression patterns (Figure 2C–F). We quantified the number of mRNAs per cell and attempted to map the distribution to cell position within the wing pouch. Since the only landmark we could reliably use was the border between the wing pouch and the rest of the disc, we measured cell position as a function of distance from the border (Figure 2G). When we did so, the distributions in mRNA number per cell displayed profiles that were consistent with previous qualitative descriptions of their expression patterns (Figure 2H). To ensure that these distributions were not an artifact of landmarking the border, we probed for mRNAs produced from the scalloped (sd) gene. The sd gene is expressed uniformly throughout the wing pouch (Campbell et al., 1992; Williams et al., 1993), and thus we anticipated a uniform distribution of mRNAs/cell if our method was accurate. Indeed, there was a fairly constant level of mRNAs/cell across the wing pouch as determined by our smFISH pipeline (Figure 2—figure supplement 1A).

Figure 2 with 1 supplement see all

Download asset Open asset

smFISH analysis of mRNA levels from Dpp-responsive genes.

(A) Schematic of wing discs highlighting the graded distribution of Dpp protein in the wing pouch, centered around the AP boundary, and the expression domain for *salm*, one of the targets of Dpp regulation. Not shown is Dpp localization in the notum domain of the disc. (B) Expression domains of four target genes of Dpp signaling. (**C-F**) Confocal sections of wing pouches probed for mRNAs synthesized from the *salm* (C), *omb* (D), *dad* (E), and *brk* (F) genes. Orange arrows mark the position of the AP boundary in each image. (**G, H**) mRNA number as a function of cell distance from the anterior-most border of the wing pouch. (G) A border-to-boundary axis, orthogonal to the AP boundary, is used to map cell position, along which distances are displayed in μm from the wing pouch border. (H) Cells were binned according to position along the border-to-boundary axis. Median mRNA number/cell for each bin is plotted with 95% bootstrapped confidence intervals.

smFISH detects sites of nascent transcription

A further benefit to smFISH is that it can detect and quantify RNA as it is being transcribed from a gene. We sought to identify and characterize these sites of nascent transcription in the wing disc. Quantification of pixel intensity of all fluorescent spots revealed two discrete populations: a large population of dim spots of uniform intensity, and a smaller population of brighter spots with more variable intensity (Figure 3A,B). The former population corresponded to those described earlier, and they were primarily located in the cytoplasm - these are the mature mRNAs. The latter population was primarily located inside nuclei, and thus we hypothesized that these were sites of nascent transcription. To confirm that these bright spots corresponded to transcription sites, we used probes complementary to an intron in the omb gene. These probes only detected the brighter population of spots localized to nuclei (Figure 3C). Since introns are not spliced out until after transcription, this result supports the conclusion that the brighter nuclear spots are sites of nascent transcription.

Figure 3 with 2 supplements see all

Download asset Open asset

Sites of nascent transcription are detected by smFISH.

(A) Sites of nascent transcription can fluoresce more brightly than single mRNA molecules due to multiple nascent transcripts localized to one gene locus. (B) Probes recognizing an *omb* exon generate many small dim spots and a few large bright spots. Right image shows the merge of probe and DAPI fluorescence. The bright spots are associated with nuclei whereas most dim spots are not. (C) Probes recognizing an *omb* intron only generate large bright spots that are associated with nuclei. Scale bars = 5 μm. (D) Frequency distribution of intensity for all spots identified in a wing disc probed for *sens* RNAs. Using a threshold of twice the median spot intensity, all single mRNA spots were filtered out, leaving only spots that are associated with transcription sites. The frequency distribution for this class of spots is shown. (E) Transcription sites are assigned to cells. For each cell that contains one or more mRNA molecules, it is scored for whether it also has one or more transcription sites. The average fraction of all such cells with a transcription site is shown for each gene. Error bars represent 95% confidence intervals. (F) The ratio of the variance of mRNAs/cell to its mean, as a function of the mean, for all genes. This ratio is larger than one, irrespective of the mRNA number for binned sub-populations of cells and the gene. Error bars represent 95% confidence intervals.

Although wing disc cells are diploid, fewer than 15% of nuclei contained more than one transcription site for a given gene. One explanation is that transcription is infrequent enough that 85% of the time only one allele is actively transcribing. Another explanation is that two alleles are physically co-localized, and their nascent transcripts cannot be resolved by confocal microscopy. Drosophila and other animals have extensive physical pairing of homologous chromosomes in somatic cells (McKee, 2004). Consequently, alleles on paired chromosomes are often spatially juxtaposed (Szabo et al., 2018). For genes such as omb that we probed far upstream of the transcription termination site, it is likely that we were observing transcription from both alleles at once, given that a detectable nascent RNA would stay at the transcription site for a long time (~50 min). Even for these very bright transcription spots, only one transcription site per nucleus was observed (Figure 3B,C). This observation is consistent with a single transcription spot in a nucleus representing transcription from both alleles.

Transcription occurs in bursts

Transcription sites were counted by applying a cutoff that only included spots with at least twice the intensity of a mature mRNA spot (Figure 3D, Figure 3—figure supplement 1). There was a broad distribution of transcription site intensities, suggesting a large range of nascent RNA numbers that were present on a gene at a given time.

Strikingly, many cells did not have a detectable transcription site even though the cells contained mature mRNAs (Figure 3E). Between 50–80% of all cells had this feature, and it was observed for all genes. This observation is not an artifact of segmentation erroneously assigning mature mRNAs to cells that do not express the genes. For all genes, the number of transcription sites strongly correlated with mRNA number when discs were binned but not segmented (Figure 3—figure supplement 2). Hence, although assignment errors occur at the local scale, they cannot account for the quantitative global trends where 2–5 fold more cells lack a transcription site than lack any mature mRNAs.

Why do cells with mature mRNAs lack detectable transcription sites? One explanation is that each gene's promoter is always open, but since transcription is stochastic, there would be times when zero or just a few Pol II molecules are presently transcribing the gene. In this scenario, the birth and death of mRNAs can be described as a Poisson process, where the ratio of the variance of the distribution of number of mRNAs to its mean is expected to be one (Munsky et al., 2012; Raj and van Oudenaarden, 2008). Since mRNA number per cell varied systematically across the wing disc because of Wg and Dpp signaling, we binned cells according to their position in the disc, as had been described earlier (Figures 1G and 2H), and empirically estimated the ratio of a bin's variance to its mean. The ratio of variance to mean mature mRNA number per cell was between 5 and 10 for all genes, and was fairly independent of mRNA output (Figure 3F). This indicated that a Poisson process could not explain why we failed to detect transcription sites in every cell expressing mRNA.

To determine if our observations were possibly caused by transcription bursting, we invoked a two-state model of transcription (Figure 4A). A promoter exists in one of two possible states - ON and OFF. The promoter switches between states at particular rates k_on and k_off. When the promoter is in the ON state, Pol II is permitted to initiate transcription that is subject to a rate constant k_ini. When the promoter is in the OFF state, Pol II is unable to initiate transcription. The model also includes a transcription elongation step, which is assumed to be 100% processive, and whose timescale depends on the gene length and the rate of elongation. The latter is assumed to be 1100 nucleotides/min, which is a value that has been experimentally determined in Drosophila (Ardehali et al., 2009).

Figure 4

Download asset Open asset

Modeling transcription sites using bursting dynamics.

(A) Model framework showing the three rate parameters affecting transcription initiation. Two parameters affect the promoter state, while the third parameter only affects how many initiation events occur when the promoter is ON. (B) Pol II molecules in elongation mode are distributed along the transcription unit. If Pol II is upstream of the probe binding sites, the nascent transcript will not be detected. If Pol II is downstream, the nascent transcript will be detected as 100% signal. If Pol II is transcribing within the binding sites, the nascent transcript will be detected as a partial signal. These three different scenarios are explicitly accounted for in our mathematical model. For example in the simulation result shown here, four Pol II's are situated such that a total of 12 virtual probe-binding sites are present. Since each mRNA has six binding sites, it means that this simulated transcription site has 12/6 or 2 units of normalized signal. Applying our filter cutoff for identifying a transcription site as two or more units, this simulated site would be scored as a positive. (C) The phase diagram of transcription site detection as a function of burst size and frequency in the model. Both burst size and frequency impact the likelihood of detecting a transcription site. When burst size increases at low burst frequency, the likelihood of detecting a transcription site remains fairly constant. When burst size increases at high burst frequency (horizontal red arrow), the likelihood of detecting a transcription site is ultrasensitive to burst size. Likewise, when burst frequency increases at low burst size, the likelihood of detecting a transcription site remains fairly constant. When burst frequency increases at high burst size (vertical red arrow), the likelihood of detecting a transcription site is ultrasensitive to burst size. The phase diagram makes manifest that a range of combinations of burst frequency and size could explain observed transcription site frequency data.

In the model, transcriptional bursts have a characteristic size (number of transcripts per burst) and frequency (rate at which bursts occur). The average burst size is defined as k_ini/k_off, whereas the average burst frequency is defined as (k_on^-1 + k_off^-1)^–1 (Dar et al., 2012). We systematically and independently varied the parameters k_on, k_off and k_ini to tune the frequency and size of virtual bursts. For each parameter set, we ran 1000 simulations of the model. To capture the stochastic nature of gene expression, reactions in the model were treated as probabilistic events, with the exception of the transcript elongation time.

To directly relate the results of model simulations to experimental data, we performed the following treatment of simulation data. First, we transformed output of each simulation to mimic the experimentally detected fluorescence at a single gene allele. Fluorescence intensity depends on how many probe-binding sites are present in nascent RNAs on a gene allele at a given time (Figure 4B). This varies with the number of elongating Pol II molecules on the allele, and the position of the probe-binding sites relative to the transcription start and stop sites. We normalized the output of simulated nascent RNAs by calculating the number of Pol II molecules upstream, within, and downstream of the binding region at the completion of a simulation. This normalization provided an approximation of fluorescence intensity from one gene allele. Second, we randomly paired two independent simulations to mimic the transcription site fluorescence of paired alleles within a nucleus. If simulated transcription site fluorescence fell below a cutoff of twice the fluorescence of a single RNA, we counted that simulation as having no ‘detectable’ transcription site. This mimicked the cutoff that was applied to experimental data for identifying a transcription site.

We then asked what combination of burst size and frequency could theoretically account for the observed frequency of finding cells with a transcription site (this ranged from 20% to 50% of cells). A phase diagram revealed that a broad range of burst size and frequency could explain our experimental observations (Figure 4C). Therefore, according to our model results, tuning burst frequency and/or size can produce the variable likelihood of detecting a transcription site by smFISH.

Burst frequency is regulated by Dpp and Wg

We quantified the frequency of detecting a transcription site as a function of cell position within the wing pouch (Figure 5A,B). This frequency varied across the disc in a manner that was gene-specific. Strikingly, the spatial distributions of transcription site frequency strongly paralleled the mRNA number per cell for all genes (compare Figure 5A,B and Figures 1G and 2H). To ensure that this was not an artifact of variable smFISH detection, we also quantified the frequency of detecting a transcription site for sd, which is uniformly expressed in the wing pouch. This frequency was constant across the disc and paralleled the sd mRNA number per cell (Figure 2—figure supplement 1A,B).

Figure 5

Download asset Open asset

Transcription site detection correlates with mRNA number.

(**A,B**) The probability of detecting a cell with a transcription site varies with the cell's location relative to the source of morphogen. Error bars are 95% bootstrapped confidence intervals. (A) Cells are binned according to their distance from the pouch border, and the fraction of cells in each bin with a transcription site are shown for each Dpp-responsive gene. (B) Cells are binned according to their distance from the DV boundary, and the fraction of cells in each bin with a transcription site is shown for the *sens* gene. (**C,D**) The probability of detecting a cell with a transcription site varies linearly with the number of mRNA molecules in the cell. Fitted lines are from linear regression. Error bars are 95% confidence intervals. (C) Cells are binned according to the number of mRNAs they contain, and the fraction of cells in each bin with a transcription site are shown for each Dpp-responsive gene. (D) Cells are binned according to the number of mRNAs they contain, and the fraction of cells in each bin with a transcription site is shown for the *sens* gene. (E) Linear regression analysis was performed on samples from C and D, shown is the slope with a parametric 95% confidence interval.

We further examined the relationship between mRNA number per cell and transcription site frequency (Figure 5C,D). Average mRNA number per cell and the likelihood of detecting a transcription site were linearly correlated with one another for all genes. The positive correlation confirms that Dpp and Wg regulate gene expression primarily through control of transcription initiation. Remarkably, the slopes of linear fits for three Dpp-responsive genes, brk, omb, and salm, were not significantly different from one another, and the slope for dad was similar to brk and omb but smaller than for salm (Figure 5E). This conserved linear relationship between gene transcription and mRNA number has several implications. It suggests that mRNA decay rates are not very different between these Dpp target genes since the slopes would be different from one another if decay rates varied. Moreover, since the slopes are constant over a broad range of mRNA output, it suggests that mRNA decay is not being actively regulated by Dpp.

The likelihood of detecting a transcription site increases because either the promoter is spending more total time in the ON state or more RNAs are being transcribed while in the ON state. These properties are affected by burst size and burst frequency in different ways. We sought to determine whether burst size or frequency was being regulated. We did so by estimating the number of nascent RNAs at each transcription site, which was quantified as a multiple of the median pixel intensity of mature RNA spots (Figure 3—figure supplement 1). The average number of nascent RNAs per transcription site did not significantly vary between cells that were receiving different levels of Dpp and Wg signal (Figure 6A,B). This was observed for all genes, including the uniformly expressed sd gene (Figure 2—figure supplement 1C). Moreover, the average number of nascent RNAs per transcription site was also independent of the likelihood that transcription was occurring in a cell (Figure 6C). Therefore, the propensity for a cell to generate nascent transcripts does not correlate with the number of nascent transcripts.

Figure 6 with 1 supplement see all

Download asset Open asset

Burst frequency is regulated by Dpp and Wg.

(**A,B**) The average number of nascent RNAs in a transcription site does not vary with the cell's location relative to the source of morphogen. Error bars are bootstrapped 95% confidence intervals. (A) Cells are binned according to their distance from the pouch border, and the average number of nascent RNAs per site in each bin are shown for each Dpp-responsive gene. (B) Cells are binned according to their distance from the DV boundary, and the average number of nascent RNAs per site in each bin is shown for the sens gene. (C) The average number of nascent RNAs in a transcription site does not vary with the probability of detecting a cell with a transcription site. Error bars are 95% confidence intervals. (**D,E**) Model predictions of the relationship between average number of nascent RNAs in a transcription site and the probability of detecting a site for the *dad* gene. (D) Simulations are performed where the rate parameter *k_ini* has been systematically varied so as to modulate burst size alone. Resulting values for nascent RNA number and fraction of cells with a site are shown. Each datapoint is the average of 1000 simulations. Simulations are repeated for three different values of *k_on* to specifically set the burst frequency to 0.04, 0.2 and 0.4 min⁻¹. (E) Simulations are performed where the rate parameter *k_on* has been systematically varied so that burst frequency alone is variable. Resulting values for nascent RNA number and fraction of cells with a site are shown. Each datapoint is the average of 1000 simulations. Simulations are repeated for three different values of *k_ini* to specifically set the burst size to 1, 4 and 20.

To understand the causes of the relationship between these observed features, we turned to our mathematical model. We first considered whether modulation of transcription burst size by Wg and Dpp could explain our observations. We modulated burst size by systematically varying the k_ini parameter, and from simulations, then calculated the number of nascent RNAs per transcription site and the transcription site detection frequency. There was a positive correlation between nascent RNA number in a transcription site and the probability of detecting a transcription site (Figure 6D and Figure 6—figure supplement 1A). This was observed across a wide range of fixed burst frequencies. When nascent RNA number was three or higher, the correlation with transcription site frequency was strongest. Moreover, when the probability of a transcription site was very low, nascent RNA number converged to a common value irrespective of burst frequency. None of these model predictions were observed in the experimental results with the target genes (Figure 6C). It suggests that transcription burst size is not strongly regulated by Dpp and Wg.

We then modulated burst frequency in the model by systematically varying k_on, and calculated the number of nascent RNAs per transcription site and the transcription site frequency. There was little change in nascent RNA number as transcription site frequency changed, even across a wide range of fixed burst sizes (Figure 6E and Figure 6—figure supplement 1B). The burst size appeared to determine what nascent RNA number value was held at a constant. Moreover, there was no convergence of nascent RNA number when the probability of a transcription site was very low, irrespective of burst size. All of these model predictions agree well with the experimental results (Figure 6C). This suggests that Dpp and Wg regulation of genes in the wing disc primarily occurs by modulation of transcriptional burst frequency.

Discussion

Morphogens elicit different transcriptional outputs from target genes, depending on local concentration of the morphogen. The targets of Dpp signaling in the wing offer a well-studied example of this concept. Transcription of the gene brk is directly regulated by the Dpp effector protein Mothers-against-dpp (Mad) (Minami et al., 1999; Moser and Campbell, 2005). Mad, in complex with Medea and Schnurri, represses brk transcription (Cai and Laughon, 2009). This generates a gradient of Brk protein expression that is inverted to the Dpp gradient. In turn, the level of Brk protein is instrumental in repressing the expression of omb and salm, which are induced by Dpp (Campbell and Tomlinson, 1999). Thus, opposing gradients of activation and repression determine the expression domains of omb and salm. Since omb is less sensitive to Brk repression than salm, its expression domain is broader. salm transcription is directly activated by Dpp without participation of Schnurri (Moser and Campbell, 2005). Curiously, omb transcription does not directly depend on Dpp signaling, and its transcriptional activation is brought about by unknown factors (Sivasankaran et al., 2000).

Given the diverse molecular mechanisms by which genes such as omb, brk, and salm are regulated, it is illuminating that regulation of transcription burst frequency occurs for all of them. In the two-state view of promoter kinetics, the on-rate, k_on, then is the most likely rate constant being regulated since it specifically affects burst frequency alone (Dar et al., 2012). It determines the average rate at which a promoter will switch from its OFF to its ON state. When a promoter is in the OFF state, the next burst will only occur when it switches ON, which is controlled by k_on and not k_off. When a promoter is in the ON state, the size of its burst depends on when it switches OFF, which is controlled by k_off and not k_on (Dar et al., 2012). However, k_off also affects burst frequency because the longer a promoter is ON, the longer the time it takes before a new burst can occur. If Dpp regulates k_off, then we would have seen modulation of both size and frequency of bursts. However, burst size appears to be independent of Dpp signaling.

If k_on is the kinetic rate constant under regulation for all of these genes, how does this occur given such diverse enhancer architectures and transcription factor inputs? It has been found that burst frequency correlates with enhancer strength and enhancer-promoter contact, suggesting that k_on is potentiated by enhancer-promoter contact and is mediated by transcription factor binding to DNA (Bartman et al., 2016; Bothma et al., 2014; Chen et al., 2019; Fukaya et al., 2016; Larsson et al., 2019). This suggests that occupancy of Dpp effectors on target enhancers varies the k_on rate for their linked promoters, and this modulation is negative for repressors such as Brk and positive for activators such as Mad.

Burst frequency regulation is also observed for developmental genes in the embryo (Bothma et al., 2014; Chen et al., 2019; Fukaya et al., 2016; Garcia et al., 2013; Holloway and Spirov, 2017; Little et al., 2013; Xu et al., 2015). Thus, a common mechanism to regulate patterned gene expression is by control of burst frequency. However, burst size can also be regulated by cell-cell signaling, as is the case for Notch target genes in the Drosophila embryo (Falo-Sanjuan et al., 2019). Moreover, eve gene expression in the embryo is regulated by transcription factors that modulate burst frequency, plus there is an orthogonal mechanism that controls the window of time over which a nucleus can transcribe the eve gene (Lammers et al., 2020). This distinct mechanism appears to be regulated by repressors, perhaps acting on nucleosome organization. Modeling of various embryonic genes suggests that they transition through several intermediate transcriptionally-silent states before their transcription can begin (Desponds et al., 2016; Dufourt et al., 2018; Eck et al., 2020). Chromatin remodeling factors appear to modulate these transitions (Eck et al., 2020). Although a two-state model explains much of our experimental results, likely there are other factors that also help determine the expression domains of Dpp-responsive genes.

Our results challenge the view that salm and omb expression domains have sharp boundaries due to transcription thresholds set by Brk and Dpp. We find that omb and salm mRNA numbers per cell drop gradually with distance from the source of Dpp (Figure 2H). As well, their gradients in mRNA number are inversely correlated with the gradient in brk mRNA number. Salm has relatively constant mRNA number in cells near the AP boundary, and those numbers gradually diminish in cells located more laterally. A similar pattern is seen with omb, except the domain with constant omb mRNA number is smaller than for salm. However, the salm and omb enhancer trap reporters as well as anti-Salm immunohistochemistry have reported expression domains with sharp boundaries (Mayer et al., 2013). Possibly, the discrepancy hints at some threshold of mRNA expression below which protein output drops sharply. It is also possible that the previously characterized expression domains for salm and omb were distorted by non-linear detection of antibodies that recognize Salm and the protein product of lacZ, β-galactosidase.

Materials and methods

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Gene (Drosophila melanogaster)	white¹¹¹⁸	BloomingtonDrosophilaStock Center	BDSC: 3605 Flybase: FBst0003605 RRID:BDSC_3605
Gene (Drosophila melanogaster)	sens^E1	Nolo et al., 2000	Flybase: FBal0098024	From Hugo Bellen
Genetic reagent (Drosophila melanogaster)	sfGFP-sens [VK37]	Venken et al., 2006. From Hugo Bellen		Pacman construct containing sens gene with N-terminal 3xFlag-TEV-StrepII-sfGFP-FlAsH fusion tag inserted at 22A3 (VK37)
Genetic reagent (Drosophila melanogaster)	dad-GFP [VK37]	BloomingtonDrosophilaStock Center	BDSC: 81273 Flybase: FBti0150281 RRID:BDSC_81273	y w; PBac{y[+mDint2] w[+mC]=Dad GFP.FLAG} inserted at 22A3 (VK37)
Genetic reagent (Drosophila melanogaster)	brk-GFP [VK33]	BloomingtonDrosophilaStock Center	BDSC: 38629 Flybase: FBti0147730 RRID:BDSC_38629	w¹¹¹⁸; PBac{y[+mDint2] w[+mC]=brk GFP.FPTB} inserted at 65B2 (VK33)
Sequence-based reagent	GFP hybridization oligo probes	Biosearch Technologies	Custom probe set	Set of oligos with 3' modification mdC(TEG-Amino). Sequence of all oligos is in Supplementary file 1
Sequence-based reagent	sens hybridization oligo probes	Biosearch Technologies	Custom probe set	Set of oligos with 3' modification mdC(TEG-Amino). Sequence of all oligos is in Supplementary file 1
Sequence-based reagent	salm hybridization oligo probes	IDT	Custom probe set	Set of oligos. Sequence of all oligos is in Supplementary file 1
Sequence-based reagent	omb hybridization oligo probes	IDT	Custom probe set	Set of oligos. Sequence of all oligos is in Supplementary file 1
Sequence-based reagent	sd hybridization oligo probes	IDT	Custom probe set	Set of oligos. Sequence of all oligos is in Supplementary file 1
Sequence-based reagent	omb intron hybridization oligo probes	IDT	Custom probe set	Set of oligos. Sequence of all oligos is in Supplementary file 1
Sequence-based reagent	omb 5' exon hybridization oligo probes	IDT	Custom probe set	Set of oligos. Sequence of all oligos is in Supplementary file 1
Chemical compound, drug	NHS-ester ATTO 633 dye	Sigma	#01464
Chemical compound, drug	NHS-ester ATTO 565 dye	Sigma	#72464
Chemical compound, drug	amino-11-ddUTP	Lumiprobe	A5040
Chemical compound, drug	Paraformaldehyde (powder)	Polysciences	00380–1
Chemical compound, drug	Triton X-100	Sigma Aldrich	T9284-500ML
Chemical compound, drug	VectaShield	Vector Labs	H-1000
Chemical compound, drug	4′,6-diamidino-2-phenylindole (DAPI)	Life Technologies	D1306
Chemical compound, drug	salmon sperm single stranded DNA	Invitrogen	#15632
Chemical compound, drug	vanadyl ribonucleoside	New England Biolabs	#S14025
Software, algorithm	MATLAB pipeline to process raw smFISH images with no prior preprocessing	This paper		https://github.com/elifesciences-publications/smfish_pipeline
Other	Graces’ Insect Medium	Sigma	#69771	Growth medium for organ culture

Gene	τ_elong (min)
brk	1.35
dad	2.05
sens	5.15
salm	5.30
omb	3.05

Share this article

Cite this article

smFISH analysis of sfGFP-sens mRNA levels in wing imaginal discs.

smFISH analysis of mRNA levels from Dpp-responsive genes.

Sites of nascent transcription are detected by smFISH.

Modeling transcription sites using bursting dynamics.

Transcription site detection correlates with mRNA number.

Burst frequency is regulated by Dpp and Wg.

Author details

Rachael Bakker

Contribution

Competing interests

Madhav Mani

Contribution

For correspondence

Competing interests

Richard W Carthew

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism