Natural changes in light interact with circadian regulation at promoters to control gene expression in cyanobacteria
Abstract
The circadian clock interacts with other regulatory pathways to tune physiology to predictable daily changes and unexpected environmental fluctuations. However, the complexity of circadian clocks in higher organisms has prevented a clear understanding of how natural environmental conditions affect circadian clocks and their physiological outputs. Here, we dissect the interaction between circadian regulation and responses to fluctuating light in the cyanobacterium Synechococcus elongatus. We demonstrate that natural changes in light intensity substantially affect the expression of hundreds of circadian-clock-controlled genes, many of which are involved in key steps of metabolism. These changes in expression arise from circadian and light-responsive control of RNA polymerase recruitment to promoters by a network of transcription factors including RpaA and RpaB. Using phenomenological modeling constrained by our data, we reveal simple principles that underlie the small number of stereotyped responses of dusk circadian genes to changes in light.
https://doi.org/10.7554/eLife.32032.001eLife digest
Living things face daily, predictable challenges due to the regular day and night cycle imposed by the Earth’s rotation. Many of them have evolved an internal ‘circadian’ clock to anticipate daily changes in the environment. However, nature can also change in unpredictable ways, and in order to survive, organisms must account for both the time of day stipulated by their clocks and changes in their present environment. For example, cyanobacteria depend on the sun for survival and must cope with light variations throughout the day and the absence of light at nighttime.
Circadian clocks are made up of specific genes and their proteins. Most of what we know about how these clocks control the behavior of an organism comes from experiments performed under constant conditions. Previous research has shown that under such circumstances, the circadian clock of cyanobacteria periodically turns on a set of genes every 24 hours via a protein called RpaA. However, to understand how cyanobacteria use this clock, we must know how it works in a fluctuating environment.
To test this, Piechura, Amarnath and O’Shea measured the activation of genes in cyanobacteria that had been exposed to changes in light mimicking those in nature. Compared to constant conditions, fluctuating light drastically changed the timing of activation of circadian genes. When light decreased – as it would in nature during sunset or if a cloud blocks the sun – the circadian genes were activated.
Changes in light did not change the ‘ticking’ of the clock, but did affect the ability of RpaA to turn on circadian genes. Moreover, the activity of a second protein called RpaB increased when light decreased and the genes were activated. Thus, cyanobacteria switch on circadian genes as the sun is setting or during unexpected shade, likely through RpaA and RpaB, to help them survive without light.
This study shows that circadian clocks activate genes differently in the real world compared to unnatural, constant conditions. This may prompt scientists to think carefully about how an organism’s natural environment can affect its inner workings. A next step will be to see how else light affects circadian gene levels. A deeper understanding of how cyanobacteria control their genes in a natural environment will be useful for scientists who engineer these organisms to produce biofuels from sunlight.
https://doi.org/10.7554/eLife.32032.002Introduction
Circadian clocks allow organisms from almost all branches of life to alter physiology in anticipation of diurnal changes in the environment. Circadian clocks are autonomous core oscillators that keep time even in the absence of environmental cues (Dunlap et al., 2004). Output pathways interpret timing information from the core oscillator to generate oscillating outputs, such as oscillations in the mRNA levels (expression) of genes and higher order behaviors (Dunlap et al., 2004; Wijnen and Young, 2006). Laboratory studies of the outputs of circadian clocks have been primarily performed under constant conditions to isolate circadian regulation from environmental responses. In nature, however, organisms with circadian clocks must also cope with unexpected fluctuations in the environment. Thus a major challenge in chronobiology is to understand circadian regulation in dynamic environments.
Previous studies suggest that circadian clock output pathways interact with responses to the environment to tailor physiology to both the time of day and the current state of the environment. For example, sleep/wake cycles in Drosophila melanogaster and photosynthesis in Arabidopsis thaliana are controlled by both the circadian clock and environmental variables like day length or light (Lamaze et al., 2017; Millar and Kay, 1996). Further, circadian clocks can modulate responses to the environment based on the time-of-day in a process called circadian gating (Hotta et al., 2007; Greenham and McClung, 2015). However, the complexity of higher organisms has prevented a detailed understanding of the interaction between circadian timing information and environmental responses. In contrast, the circadian clock in the cyanobacterium Synechococcus elongatus PCC7942, an obligate photoautotroph, has a simple architecture which controls gene expression oscillations (Figure 1A) to influence metabolism and growth. S. elongatus must carefully monitor its environment, as the sunlight required for photosynthesis fluctuates on the minute, day, and seasonal timescales (Figure 1B, [Petty and Weidner, 2017]). While it is well understood how the circadian clock in S. elongatus behaves under constant conditions, it is unclear how this system changes in natural, fluctuating light.
In S. elongatus grown under ‘Constant Light’ conditions (Figure 1A, dashed navy blue line), genes which show oscillatory expression (circadian genes) can be divided into two groups, the dawn and the dusk genes, which peak at subjective dawn and subjective dusk (Ito et al., 2009; Vijayan et al., 2009) (Figure 1A). Subjective dawn and subjective dusk refer to the times at which dark-to-light or light-to-dark transitions would occur in a 12 hr light-12 hr dark environmental cycle. The dawn genes consist of the core metabolic and growth genes for S. elongatus, including the photosystems, ATP synthase, carbon fixation/Calvin-Benson-Bassham cycle enzymes, and ribosomal proteins (Vijayan et al., 2009; Ito et al., 2009; Diamond et al., 2015). In the absence of regulation by the circadian clock under Constant Light, S. elongatus constantly expresses dawn genes (Markson et al., 2013). The clock primarily regulates the expression of dusk genes (Markson et al., 2013), which include the genes required to utilize glycogen as an energy source in the absence of sunlight, such as glycogen phosphorylase and cytochrome c oxidase. As such, the circadian clock serves a critical function in switching S. elongatus from a daytime state of photosynthesis to a nighttime state of carbon metabolism through glycogen breakdown (Diamond et al., 2015; Diamond et al., 2017; Pattanayak et al., 2014; Puszynska and O'Shea, 2017). In Constant Light conditions, the dusk and dawn genes show oscillatory expression with a 24 hr period, resulting in broad peaks of maximal expression (Figure 1A, solid green line and dashed maroon line) (Vijayan et al., 2009; Ito et al., 2009). Recent whole-cell modeling of metabolism, protein levels, and growth predict that this picture of circadian gene expression should change under the dynamic light conditions of a natural, clear day (Figure 1B, navy blue line) (Reimers et al., 2017). The modeling suggests that making and using glycogen is a major cost to cell growth and thus the expression of genes required to switch metabolism from photosynthesis to glycogen breakdown should be delayed until absolutely necessary (Reimers et al., 2017). However, gene expression in natural light conditions has not been measured in S. elongatus.
Consistent with predictions of light-dependent changes in circadian gene expression, current evidence suggests interaction between the circadian and light regulatory pathways. The cyanobacterial clock keeps track of the time of day using a core post-translational oscillator (PTO) that consists of three proteins, KaiA, KaiB, and KaiC, whose enzymatic activities result in 24 hr oscillations in the phosphorylation state of KaiC (Nakajima et al., 2005; Rust et al., 2007; Johnson et al., 2011). In vivo under Constant Light conditions the Kai PTO modulates circadian gene expression by controlling oscillations in phosphorylation state of the master OmpR-type transcription factor RpaA (Markson et al., 2013; Takai et al., 2006) to peak at subjective dusk (Figure 1A, dotted black line; Figure 1C) (Gutu and O'Shea, 2013; Takai et al., 2006). Phosphorylated RpaA (RpaAP) binds to the promoters of some dusk genes to activate their expression, leading to indirect activation of other dusk genes and repression of dawn genes (Figure 1C) (Markson et al., 2013). As kaiBC is a dusk gene target of RpaA, the Kai PTO directs its own expression, resulting in a transcription-translation feedback loop that stabilizes the phase of the clock (Qin et al., 2010; Teng et al., 2013; Zwicker et al., 2010). Exposure to complete darkness at specific times of day causes phase shifts in the PTO to align clock output with the external day/night cycle (Rust et al., 2011), in a process called entrainment. However, it is not understood whether any aspect of this model, such as the dynamics of RpaA activity or the transcription-translation feedback loop, changes in the presence of more subtle natural light changes during the day (Figure 1C).
Meanwhile the OmpR-type transcription factor RpaB binds to some circadian gene promoters (Hanaoka et al., 2012), and the phosphorylation state and DNA binding activity of this protein decreases in response to high light exposure (Figure 1C) (López-Redondo et al., 2010; Moronta-Barrios et al., 2012). However, it is not clear how natural light changes like sunset or shade pulses affect RpaB activity (Figure 1C). RpaB clearly plays some role in altering circadian gene expression in response to light (Espinosa et al., 2015), but it is unclear how (Figure 1C). While light likely exerts global, growth-rate-dependent regulation of protein levels (Scott et al., 2010; Du et al., 2016; Burnap, 2015), the interaction between circadian and light regulation to control the activities of RpaA and RpaB represents a particularly tractable scenario for dissecting the mechanisms underlying interaction between clock and environment to control circadian gene expression.
Here we measure and model circadian gene expression and several layers of regulation in cyanobacteria grown under the fluctuating light intensities typically experienced in nature. We find that fluctuations in light alter the expression patterns of almost all circadian genes. We identify key regulatory steps at which information about changes in light interact with clock output pathways to control gene expression, and reveal a complex regulatory network underlying circadian gene expression in natural conditions. Finally, we show that phenomenological models effectively describe the integration of the circadian clock with responses to environmental fluctuations.
Results
Sunlight on a clear day delays the timing of circadian gene expression relative to constant light conditions
To grow and assay cyanobacteria in natural light conditions, we custom-built a culturing setup with a light source that can be programmed to mimic natural fluctuations in sunlight. On a cloudless ‘Clear Day,’ light intensity varies in a parabolic manner due to the rotation of the Earth, ending with a gradual ramp down of light intensity prior to dusk (‘Sunset’, Figure 1B). Rapid changes in cloud cover cause abrupt increases (‘High Light pulse’) and decreases (‘Shade pulse’) in sunlight (Petty and Weidner, 2017) (Figure 1B). Using a set of programmable warm white LED arrays (Materials and methods, Construction of light apparatus and Calibrating light conditions) for illumination, in all experiments we grew cells for 12 hr in either a Clear Day condition that peaked at 600 mol photons m s or a continuous Low Light condition of 50 mol photons m s (Figure 2A, top panel) followed by 12 hr of darkness for at least two days to acclimate and synchronize the cells before measurement. Note that the Low Light condition used here differs from the Constant Light condition (often denoted as LL in the literature; Figure 1A, dashed navy blue line) in that the cells are exposed to more naturally-relevant 12 hr light-12 h dark days (LD). Cultures grown under the Clear Day condition adjusted their pigment content after two days of exposure to the Clear Day condition (Figure 2—figure supplement 1). Further, cells acclimated to the Clear Day conditions grew approximately twice as fast as Low Light acclimated cultures at midday, 6 hr after dawn (Figure 2—figure supplement 1). These data indicate that S. elongatus PCC7942 is capable of acclimating to the higher light intensities of the Clear Day condition and thus that the intensities used in our measurements are relevant for this strain.
-
Figure 2—source data 1
- https://doi.org/10.7554/eLife.32032.009
To determine whether a natural light profile affects circadian output, we compared genome-wide gene expression in Clear Day conditions versus Low Light conditions using RNA sequencing (Figure 2A, Setup, arrows indicate sampling). We acclimated cultures in their respective condition for 2 light/dark cycles, and sampled them (arrows) over the next (third) light period (Figure 2A, Setup). We focused our analysis on a set of high amplitude circadian genes that show oscillatory expression under Constant Light conditions (Figure 2—figure supplement 2; see Materials and methods, Definition of circadian genes). The Low Light condition (Figure 2B, upper panel) reproduces the expression profile previously observed under Constant Light conditions (Figure 2—figure supplement 2). However, in the Clear Day condition 159 of the 281 dusk genes were expressed at least two fold higher after midday compared to Low Light, demonstrating light-dependent expression. Dawn genes show the opposite behavior — they have higher expression at midday under Clear Day conditions, although this trend is less pronounced (Figure 2—figure supplement 3). Taken together, Clear Day conditions significantly influence the expression dynamics of almost all circadian genes, with the strongest effects on dusk genes.
To look more closely at how the Clear Day condition affects the dusk genes, which are the primary regulatory targets of the clock, we analyze the gene expression dynamics of the representative dusk gene Synpcc7942_1567. Under Low Light conditions, Synpcc7942_1567 exhibits an increase in expression from dawn to dusk, reaching a plateau by 8 hr after dawn (Figure 2C, solid black line). Under Clear Day conditions, however, the expression of this gene remains low through the midday peak of light intensity (Figure 2C, solid magenta line; 4–8 hr after dawn), and its expression sharply increases just prior to dusk as light intensity decreases, reaching maximal expression just as the dark period begins. This delayed pattern of gene expression can be seen in almost all dusk genes (Figure 2B; Synpcc7942_1567 indicated with arrows). Thus Clear Day conditions significantly alter the dynamics and amplitude of dusk gene expression to peak just before dusk.
The delay of dusk gene expression likely enables cyanobacteria to switch to glycogen breakdown only when absolutely necessary so that they can survive the extended darkness of night. The two glycogen breakdown genes, glgP and glgX, are both light-dependent dusk genes that strongly peak in Clear Day at dusk, while glgC, which codes for the rate limiting enzyme of glycogen synthesis, is a dawn gene whose expression is higher in Clear Day conditions compared to Low Light (Figure 2—figure supplement 4). These gene expression dynamics would favor both the maintenance of glycogen synthesis until the end of the day and a delay in the activation of glycogen breakdown until just before it is required at nighttime, in agreement with predictions from metabolic modeling during the same Clear Day conditions used here (Reimers et al., 2017). Thus, environmental conditions are integrated into the output of the circadian clock to potentially optimize resource allocation in naturally-relevant diurnal cycles, as recently suggested (Reimers et al., 2017).
Remarkably, though in both light conditions the cells experience 50 mol photons m s at the end of the day just before night, light-dependent dusk genes have substantially higher expression in the Clear Day conditions relative to the Low Light conditions (Figure 2B–C). Indeed, 95/281 dusk genes were expressed at least three fold higher in Clear Day relative to Low Light at 12 hr after dawn. This strong activation of dusk genes occurs concomitant with the decrease in light intensity during Clear Day that mimics Sunset, which hinted that changes in light intensity affect activation of dusk genes as opposed to absolute light intensity levels. Dusk gene expression could thus happen ‘just-in-time’ before the sustained darkness of nighttime regardless of the seasonal timing of Sunset.
Changes in light intensity control the transcription of circadian genes
To test whether changes in light intensity are a key factor controlling the expression of circadian genes, we exposed cells to a High Light pulse or a Shade pulse and measured genome-wide gene expression using RNA sequencing. We grew cultures in either Low Light or Clear Day conditions for three days (Figure 3A–B, Setup). On the fourth day at 8 hr after dawn, when RpaA is most active, we exposed the cells to a High Light pulse (Figure 3A) or a Shade pulse (Figure 3B) for 1 hr before returning to the original condition. We sampled the cells before, during, and after the perturbation (Figure 3A–B, Setup, arrows). The expression of dusk genes rapidly changed in a direction opposite to the change in light intensity (Figure 3C, all dusk genes; Figure 3E, example dusk gene; Figure 3D, all dusk genes; Figure 3F, example dusk gene), as expected from the effects of the decrease in light intensity at Sunset of the Clear Day condition on circadian gene expression (Figure 2B–C). A large subset of dusk genes were affected by the light pulses, with 105/281 repressed by at least three fold by the High Light condition, and 136/281 induced by at least three fold by the Shade condition. Further, many genes responded rapidly and changed in expression at least three fold after just 15 min into the pulse (75/281 repressed by High Light, 79/281 induced by Shade). When cultures were restored to their original condition (High Light to Low Light, Figure 3C,E; Shade to Clear Day, Figure 3D,F), dusk gene expression quickly reverted to a level comparable to that before the pulse. Thus, light-induced changes in dusk gene expression are reversible and responsive to successive shifts in light availability. Dawn gene expression showed the opposite behavior of dusk genes, albeit with less dramatic changes (Figure 3—figure supplement 1). Hence, decreases in light intensity favor the expression of dusk genes (Sunset in Clear Day, Figure 2; Clear Day to Shade and High Light to Low Light, Figure 3), while increases in light favor the expression of dawn genes (midday peak in Clear Day, Figure 2—figure supplement 3; Shade to Clear Day and Low Light to High Light, Figure 3—figure supplement 1). Given the more substantial effects of light on dusk gene expression, we focus on these genes for the remainder of the manuscript.
-
Figure 3—source data 1
- https://doi.org/10.7554/eLife.32032.013
-
Figure 3—source data 2
- https://doi.org/10.7554/eLife.32032.014
To cause these reversible changes in the mRNA levels of dusk genes, changes in light intensity must affect either the transcription and/or the degradation of dusk gene mRNAs. We reasoned that changes in transcription would manifest as differences in the amount of RNA polymerase (RNAP) localized to dusk genes. To determine whether changes in light intensity regulate the recruitment of RNAP to dusk gene promoters, we performed chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) of RNAP in cells immediately before the High Light or Shade pulse (8 hr after dawn in Low Light or Clear Day), and then again 15 or 60 min following the start of the pulse. Changes in RNAP enrichment upstream of dusk genes correlated with changes in downstream dusk gene expression (Figure 3G,H; Figure 3—figure supplement 2). Thus, changes in light affect RNAP recruitment to dusk gene promoters, suggesting that light conditions substantially affect the rates of transcription of dusk gene mRNAs. Because mRNAs in bacteria have very short steady state half lives (Chen et al., 2015; Hambraeus et al., 2003; Salem and van Waasbergen, 2004), we argue that changes in transcription rates of dusk gene mRNAs are sufficient to lead to the rapid changes in dusk gene mRNA levels given a fast basal degradation rate, though we cannot rule out that changes in light may affect the rates of degradation of some mRNAs. These results point to a potential interaction between sunlight and signaling pathways upstream of RNAP. We next explored how the observed changes in dusk gene expression in the presence of natural light fluctuations (Figures 2 and 3) could be achieved via gene regulatory mechanisms.
Regulation of dusk gene expression by RpaA and RpaB under dynamic light regimes
Given the strong dependence of dusk gene expression on RpaAP levels under Constant Light conditions (Figure 1A, [Markson et al., 2013]) and the drastic change in dusk gene expression dynamics under our dynamic light conditions (Figure 2B,C; Figure 3C–F), we hypothesized that light conditions alter RpaAP dynamics to alter dusk gene expression. However, levels of RpaAP increased from dawn to dusk similarly in cells grown in either Low Light or Clear Day conditions, and abrupt changes in light intensity did not affect these dynamics. (Figure 4A,B; Figure 4—figure supplement 1). Thus, these natural light fluctuations do not affect the phase of the Kai PTO nor the control of RpaAP levels by the Kai PTO (Figure 4E). These data demonstrate that the regulation of dusk genes is de-coupled from RpaAP levels under dynamic light conditions, and light must affect dusk gene expression downstream of RpaAP. Interestingly, ChIP-seq showed that light intensity fluctuations alter RpaAP binding upstream of dusk genes (Figure 4C; Figure 4—figure supplement 2) in conjunction with RNAP binding upstream of the same gene (Figure 4D; Figure 4—figure supplement 3). The binding of RpaAP and RNAP correlated with changes in downstream dusk gene expression (Figure 4C; Figure 4—figure supplement 2). Thus, light fluctuations control the binding of RpaAP and RNAP to promoters, suggesting that light-induced changes in the binding of these factors may modulate the activation of dusk gene expression (Figure 4E).
-
Figure 4—source data 1
- https://doi.org/10.7554/eLife.32032.020
-
Figure 4—source data 2
- https://doi.org/10.7554/eLife.32032.021
Interestingly, RpaA regulation at a small number (10) of promoters including that of kaiBC is not substantially affected by light intensity (Figure 4C,D - points around origin; Figure 4—figure supplement 4), demonstrating that the light-dependent regulation of RpaA binding is locus-specific. The KaiABC clock regulates RpaAP levels independent of changes in light intensity (Figure 4A,B), and kaiBC gene expression dynamics do not substantially change in the Clear Day conditions compared to the Low Light condition (Figure 4—figure supplement 4H–K). Hence, the stabilizing PTO/transcription-translation feedback loop circadian circuit is robust to natural fluctuations in sunlight. The circadian clock can thus control gene expression independent of environmental changes at select promoters. It is possible that RpaAP binding to some promoters is dependent on the association of RNAP with that promoter. As such, regulation that affects RNAP binding to a specific promoter, such as that by sigma factor activity (Gruber and Gross, 2003), could affect RpaAP binding to select promoters.
Our analysis so far has established that the previous model for the regulation and expression of circadian genes in Constant Light conditions (Figure 1A) becomes more complex in natural environmental conditions, suggesting the involvement of other pathways. Thus, we next asked whether RpaB plays a role in controlling light-dependent expression of circadian genes. We observed that levels of RpaBP changed rapidly in a direction opposite to the change in light (Figure 5A,B; Figure 5—figure supplement 1), suggesting that light affects RpaB activity through its phosphorylation state (Figure 5E). Levels of RpaBP decreased 3.1 fold after 15 min in the High Light pulse, and increased 1.9 fold after 15 min in the Shade pulse, concomitant with the rapid repression and induction of many dusk genes (Figure 3). Further, RpaBP levels increased 1.7 fold between 10 and 12 hr after dawn in the Clear Day condition concomitant with the decrease in light during Sunset and the strong induction of many dusk genes (Figure 2). This strong correlation between RpaBP levels and the expression of dusk genes under dynamic light conditions (also compare Figure 3E,F to Figure 5A,B) suggests that RpaBP acts as an activator of dusk gene expression. Indeed, using ChIP-seq we found that RpaB binds upstream of a large subset of dusk genes (42/281 dusk genes, Figure 5—source data 2). RpaB binding upstream of these genes shifts after rapid changes in light (Figure 5C; Figure 5—figure supplement 2), correlating with changes in RpaBP levels (Figure 5A,B), RNAP binding upstream of the same gene (Figure 5D; Figure 5—figure supplement 3), and downstream dusk gene expression (Figure 5C; Figure 5—figure supplement 2). These results suggest that RpaBP directly activates the expression of many dusk genes by binding to promoters with RNAP (Figure 5E). Thus, changes in sunlight can regulate dusk genes by adjusting RpaBP levels (Figure 5E).
-
Figure 5—source data 1
- https://doi.org/10.7554/eLife.32032.026
-
Figure 5—source data 2
- https://doi.org/10.7554/eLife.32032.027
Because RpaA and RpaB bind only a subset of light-responsive dusk genes (Figure 6A,B), additional factors must be involved in controlling light-responsive dusk gene expression. Sigma factors are sequence-specific RNAP subunits which regulate gene expression in bacteria (Gruber and Gross, 2003). Interestingly, RpaA, RpaB, and RNAP bind to the promoters of three sigma factor genes (Figure 6C; Figure 6—figure supplement 1A–C). The binding of RpaA, RpaB, and RNAP to these promoters shifts in conjunction after abrupt changes in light intensity, correlating with light-responsive changes in expression of these genes (Figure 6D; Figure 6—figure supplement 1D–F). These sigma factor genes show light-dependent dusk gene expression patterns (Figure 6—figure supplement 1G–L) that mirror those of the larger group of dusk genes (Figures 2 and 3), suggesting that these sigma factors could regulate the expression of other dusk genes. Thus, RpaA and RpaB may indirectly regulate the expression of non-target dusk genes by controlling the circadian and light-responsive expression of sigma factor genes (Hanaoka et al., 2012), similar to how RpaA drives all dusk gene expression in Constant Light conditions by binding to a subset of dusk genes (Markson et al., 2013). It is also possible that changes in light intensity affect dusk gene expression in a manner independent of RpaAP and RpaBP regulation. For instance, global growth-rate-dependent gene regulatory mechanisms such as the stringent response (Scott et al., 2010; Burnap, 2015; Ryals et al., 1982; Hood et al., 2016) likely cause some of the light-dependent changes in circadian gene expression due to unavoidable differences in the growth rate in different light conditions (Figure 2—figure supplement 1).
We have defined a regulatory picture in which changes in light intensity affect the activity of RpaA and RpaB to control the expression of dusk genes. However, light affects RpaA activity in complex and promoter-specific ways. Additionally, light-dependent regulation in addition to that mediated by RpaA and RpaB may control dusk gene expression in response to environmental perturbations. Still, despite the apparent complexity of regulation of dusk genes in response to light fluctuations, the expression of almost all dusk genes show strikingly regular dynamics (Figures 2 and 3). Furthermore, the activity of RpaA and RpaB at a subset of promoters (especially those of sigma factor genes) could lead to pervasive and coordinated changes in the expression of other dusk genes. Hence, we reasoned that mathematical models (Alon, 2006) of RpaA and RpaB activity might effectively describe the regulatory circuits underlying the dynamics of large groups of dusk genes. Such an approach would enable an understanding of the basic principles of interaction between circadian gene expression regulation with light-dependent regulation without needing to describe all underlying molecular mechanisms.
Phenomenological models suggest simple principles underlying the activation of clusters of light-responsive dusk genes
We find that dusk genes collectively display a small number of responses to changes in environmental light intensity. Using k-means clustering of the gene expression dynamics from our different light profiles (Figures 2 and 3), as well as from perturbations of RpaA (Figure 2—figure supplement 2 [Markson et al., 2013]), we identify three major groups of dusk genes (35–80 genes, see Figure 7—source data 1 for full lists) which show distinct and coordinated changes in gene expression over circadian time and in response to changes in light intensity (Figure 7; Figure 7—figure supplement 1). Under Constant Light conditions, all three clusters are activated by RpaAP but display distinct activation dynamics from dawn to dusk (Figure 7—figure supplement 1B) that are mirrored under our Low Light conditions (Figure 7—figure supplement 1A). We named the clusters the Early, Middle, and Late dusk genes based on the order of activation.
-
Figure 7—source data 1
- https://doi.org/10.7554/eLife.32032.032
The Shade pulse and Sunset in the Clear Day condition have differing effects on the expression of each of the major dusk gene clusters. Early dusk gene expression rapidly increases in response to Shade, but during Sunset plateaus at 1/2 of the maximal gene expression reached in Shade (Figure 7A). Conversely, the Late gene cluster responds most strongly to Sunset in Clear Day conditions but has a mild increase in expression in Shade relative to the Early and Middle dusk genes (Figure 7C). In contrast, the Middle gene cluster is induced to a similar magnitude by both Shade and Sunset (Figure 7B). Shade and Sunset represent similar light changes that occur at different times of day (afternoon and dusk, respectively). As such, the Early and Late dusk gene clusters are differentially induced by a decrease in light intensity depending on the time of day in which it occurs. This circadian effect on the intensity and dynamics of a response to environmental change is a signature of circadian gating (Hotta et al., 2007; Greenham and McClung, 2015). Though circadian gating has been observed (e.g., [Belbin et al., 2017]) and modeled without any knowledge of the transcriptional regulation (Dalchau et al., 2010) in plants, it remains unclear what gene regulatory circuits are sufficient to explain such behavior.
At present there is no mechanistic model to explain the differential response of these clusters to circadian regulation and changes in sunlight. Given that there are unknown regulators involved in circadian gene expression (Figure 6A,B), and because it is not possible to exhaustively test all possible models of regulation of dusk gene expression, we sought to construct the simplest models that can describe the expression dynamics of these clusters using a phenomenological modeling approach. Such models can be used to highlight regulatory architectures that are sufficient to recapitulate the observed gene expression dynamics, as well as direct further mechanistic studies to reveal the underlying molecular details of regulation.
Given the clear roles for RpaAP and RpaBP in activating dusk genes, we asked whether the dynamic expression of the major dusk gene clusters in naturally-relevant light conditions could be described by these variables. We constructed phenomenological models that describe the kinetics of the synthesis and breakdown of an average gene in each of the dusk gene clusters (Mangan and Alon, 2003) (see Materials and methods, Mathematical modeling). The rate of synthesis was the sum of a baseline rate of transcription and a maximal adjustable rate of transcription that could be modulated by the activity of one or more regulators. We described the the effects of a regulator such as RpaAP or RpaBP using a Hill function, whose shape is determined by the Hill coefficient and the coefficient of activation. We determined how well a model could describe the dynamics of a cluster by fitting it to the Clear Day and Shade pulse data and assuming all parameters could vary freely (see Materials and methods, Mathematical modeling; Table 1).
We began by asking whether levels of RpaAP or RpaBP (Figure 8A) can describe the gene expression dynamics of the major dusk clusters in natural light conditions. We first constructed models in which dusk cluster gene expression is solely dependent on RpaAP. Activation by RpaAP can recapitulate the ordered activation of the dusk gene clusters through differential coefficients of activation for RpaAP, but cannot describe the light-responsive expression of these genes (RpaA-only models, Figure 8—figure supplement 1A–C; Table 2). Further, activation by RpaBP alone cannot describe the dusk gene expression patterns of the clusters (RpaB-only models, Figure 8—figure supplement 1D–F; Table 2). However, models in which dusk gene expression is a function of BOTH RpaAP and RpaBP can recapitulate much of the time-of-day and light intensity dependent expression of the Early and Late clusters and nearly all of the expression dynamics of the Middle clusters (RpaA and RpaB models, Figure 8B–E; Table 2). This suggests that RpaBP is a variable which can capture the effects of dynamic light conditions on RpaAP activity. The fit parameters for simple joint activation can accommodate indirect activation through downstream regulators like sigma factors and thus do not require direct RpaA/B binding to all genes. Conceptually, our results suggest that transcription factors whose activity track the measured dynamics of both RpaAP and RpaBP can describe the circadian and light-responsive expression of dusk genes. However, joint activation by RpaAP and RpaBP predicts that the Early and Late clusters will respond similarly to Shade and Sunset in Clear Day conditions (Figure 8C,E), and thus cannot capture well the circadian gating of these clusters.
We reasoned that additional regulatory interactions downstream of RpaA and RpaB, or ‘network motifs’ (Alon, 2006), could account for the observed gating of the Early and Late clusters. Thus, we constructed models in which dusk cluster gene expression is positively or negatively dependent on the expression of another cluster alongside activation by RpaAP and RpaBP (Feedback models, Figure 8F–I; Figure 8—figure supplements 2–4; Table 2). Interestingly, the gating of the Early cluster is recapitulated by a model incorporating an incoherent feedforward loop in which the Late cluster represses Early cluster expression downstream of RpaAP and RpaBP activation (Figure 8G; Figure 8—figure supplement 2; Table 2). Further, the gating of the Late cluster is well described by a coherent feedforward loop in which Late cluster expression is dependent on RpaAP, RpaBP, AND Middle cluster expression levels (Figure 8I; Figure 8—figure supplement 4; Table 2). Thus, we highlight regulatory schemes downstream of RpaA and RpaB which can generate large time-of-day differences, or circadian gating, in the response to a decrease in light intensity.
Our results highlight that the measured dynamics of RpaAP and RpaBP can account for the dynamics of large groups of clock-controlled genes after environmental changes and suggest regulatory schemes that can diversify gene expression responses downstream of RpaA and RpaB. The models suggested here offer constraints and testable hypotheses to guide future studies of the molecular mechanisms underlying these responses.
Discussion
Changes in light adjust circadian gene expression to optimize metabolism in response to shifting ambient light intensity
We show that natural fluctuations in light intensity significantly affect the dynamics of circadian gene expression (Figures 2 and 3). While previous studies have measured genome-wide gene expression in a single natural light condition (Waldbauer et al., 2012), here we compare genome-wide circadian gene expression in several physiologically-relevant conditions, including Clear Day, High Light pulse, Shade pulse, and Low Light, to carefully dissect the effects of light on clock output. Natural light changes most greatly affected a large fraction of the dusk genes (Figures 2B and 3C,D), possibly because most of the direct targets of RpaA are dusk genes (Markson et al., 2013). We speculate that the opposing trends we observe in dawn gene expression (Figure 2—figure supplement 3 and Figure 3—figure supplement 1) may in part be due to competition for RNAP between the dusk and dawn genes (Gruber and Gross, 2003; Mauri and Klumpp, 2014) or by growth-rate-dependent mechanisms (Scott et al., 2010), as this group of genes contains the primary growth genes. A systematic exploration of the effects of light on circadian genes will be necessary to fully elaborate the contributions of light, clock, and growth rate on circadian gene dynamics.
We find that large groups of light-responsive dusk genes are activated by diminished light conditions to different extents depending on the time of day the stimulus is applied. These differences in activation may serve to optimally change metabolism for a given light condition and time of day. The light-responsive dusk genes grouped into three clusters - Early, Middle, and Late - with different activation dynamics during Sunset at the end of the Clear Day versus the Shade pulse in the afternoon (Figure 7, see Figure 7—source data 1 for full lists of genes in each cluster). Glycogen breakdown genes and the central carbon metabolism genes glyceraldehyde-3-phosphate dehydrogenase and oxalate decarboxylase belong to the Middle dusk genes, which are activated to similar levels by Shade and Sunset (Figure 7B). This suggests that cyanobacteria delay the activation of glycogen breakdown pathways (Reimers et al., 2017) until just before dusk when grown under Clear Day conditions, but can transiently activate these genes in response to Shade to access alternate energy reserves if necessary. Interestingly, genes encoding pyridine nucleotide transhydrogenase, which reversibly converts NADH to the NADPH required for electron transport, belong to the Late cluster and are strongly activated only by Sunset and not afternoon Shade (Figure 7C). Such a response might delay the adjustment of the relative levels of NADH/NADPH until only when absolutely needed at night, when NADPH is potentially important for defense against reactive oxygen species (Diamond et al., 2017). The cytochrome c oxidase genes belong to the Early cluster, which respond more intensely to Shade than to Sunset (Figure 7A). This enzyme is essential for preventing photodamage in response to rapid changes in light intensity (Lea-Smith et al., 2013); such changes are not expected to occur during the night, where it serves solely as the terminal electron acceptor for respiration. More generally, the genome-wide gene expression dynamics measured here qualitatively agree with predictions from a whole-cell model of S. elongatus that assumed optimization of growth (Reimers et al., 2017). To resolve how the circadian and light-dependent transcriptional changes effect these metabolic changes, future studies must measure enzyme levels and metabolic fluxes under fluctuating light conditions.
Mechanistic principles underlying the activation of light-responsive dusk genes
While light does not alter the post-translational oscillator/transcription-translation feedback loop circadian circuit, it regulates the activation of dusk genes via RpaAP promoter binding (Figure 4) and RpaB promoter binding through its phosphorylation state (Figure 5) at a subset of dusk genes. RpaA binding upstream of its target genes under dynamic light conditions (Figure 4C) correlates with the changes in expression of non-RpaA target genes (Figure 6B). Thus, RpaAP may remain the ‘master regulator’ of circadian gene expression whose promoter binding activity is altered by other molecular factors that encode information about the environment, such as RpaB. Previous work suggested that changes in RpaBP phosphorylation would alter RpaAP levels through competition with the enzymes that control RpaAP levels (Espinosa et al., 2015). However, we find that RpaAP levels remain constant (Figure 4A,B) under conditions in which RpaBP levels change substantially (Figure 5A,B), arguing that RpaBP does not influence RpaAP levels. RpaBP might influence RpaAP binding at promoters where both proteins bind (Figure 6—figure supplement 1) as previously suggested (Hanaoka et al., 2012), and joint control of sigma factors by RpaA and RpaB could feedback to affect RpaA binding at select promoters. Still, the question of how light changes RpaAP binding in a promoter-specific way remains unclear.
We define a clear role for the stress-responsive transcription factor RpaB as a transcriptional activator of a large subset of dusk genes (Figure 5E). Further, we demonstrate that decreases in light intensity like a Shade Pulse lead to increases in RpaBP levels to allow RpaB to activate the expression of genes. This result shows that RpaB acts in scenarios beyond its previously appreciated role in High Light stress (Kato et al., 2011; Seki et al., 2007; Hanaoka and Tanaka, 2008; López-Redondo et al., 2010). RpaAP and RpaBP might cooperate to indirectly regulate the expression of most light-responsive dusk genes by jointly controlling the expression levels of multiple sigma factors (Figure 6—figure supplement 1) (Hanaoka et al., 2012). However, our attempts to cleanly perturb RpaB activity to further explore its role as a regulator of dusk genes were unsuccessful, in part because the rpaB gene is essential (López-Redondo et al., 2010). The role of sigma factors in this network of regulation, while strongly implied, remains ambiguous and attempts to assess this role using genetic deletion of sigma factors yielded inconclusive results. More subtle approaches such as anchors away (Haruki et al., 2008) might allow perturbation experiments that clearly explicate the roles of the sigma factors and RpaB in mediating circadian gene expression.
Although complex molecular mechanisms underlie the light-responsive expression of dusk genes, we demonstrate that phenomenological models effectively describe the differential activation of large groups of dusk genes to afternoon Shade and Sunset. These models suggest that transcription factors with the dynamics of RpaAP and RpaBP (Figure 8A) are sufficient to reproduce much of the activation of the Early, Middle, and Late clusters in response to a Shade pulse in the afternoon or Sunset just before night (Figure 8C–E). Our models suggest that additional feedback from the other gene clusters may be necessary to achieve the extent of circadian gating observed for the Early and Late clusters (Figure 8G–I). Our models suggest that interactions between the major dusk clusters can diversify the responses of these clusters to signals from RpaA and RpaB. Regulatory interactions between the sigma factors RpoD6, RpoD5, and SigF2 (Figure 6—figure supplement 1), which belong to the Early, Middle, and Late clusters, respectively, could generate feedback downstream of RpaA and RpaB similar to that in our models (Figure 8—figure supplements 2–4) to generate the diverse responses of the dusk clusters to light conditions. However, feedback could also come from other sources with similar dynamics to the cluster expression levels. Indeed we could not simultaneously fit our models to all four light conditions, likely because of global growth-rate-dependent differences between the Low Light and Clear Day conditions. Thus, complete modeling of transcription dynamics of light-dependent dusk genes likely requires explicitly including the effects of metabolism and growth on gene expression (Reimers et al., 2017; Burnap, 2015; Scott et al., 2010).
Closing remarks
RpaB and its cognate upstream histidine kinase NblS (van Waasbergen et al., 2002) have been implicated in a variety of stress responses (Marin et al., 2003; Mikami et al., 2002; Shoumskaya et al., 2005), which suggests that the mechanisms and regulatory circuits defined here may apply to other environmental changes such as temperature or osmolarity. The requirement of RpaB for mediating the environmental response of circadian genes suggests that the circadian circuit coevolved with RpaB to optimize responses to predictable and unpredictable changes in the environment and motivates the further exploration of the interaction between light and circadian rhythms in S. elongatus. Resolution of this interaction and subsequent integration into whole cell models of cyanobacterial growth (Burnap, 2015; Westermark and Steuer, 2016) will help to explain the fitness benefits of the circadian clock (Johnson and Egli, 2014) and optimize synthetic biology efforts to engineer cyanobacteria to produce useful compounds (Ducat et al., 2011) from the constantly changing sunlight in nature.
Genomics data
All high throughput sequencing data is available from the Gene Expression Omnibus with the accession number GSE104204.
Materials and methods
The resources table includes the genetically modified organisms and strains, cell lines, reagents, and software that are essential to reproduce the results presented.
Cyanobacterial strains
Request a detailed protocolMost experiments were conducted in a pure wildtype background of Synechococcus elongatus PCC7942 (ATCC catalog number 33912, RRID:SCR_001672). For RNAP ChIP experiments, we used a strain in which the subunit of RNA polymerase (Synpcc7942_1524, gene info available through Cyanobase, RRID:SCR_007615) was C-terminally tagged with a 3x FLAG epitope (a gift from Ania Puszynska). To make this strain, wildtype S. elongatus was transformed with a plasmid encoding the Synpcc7942_1524 gene with sequence encoding a 3X GS linker and a 3X FLAG epitope inserted before the stop codon, targeted to insert at the native locus of the gene. A downstream kanamycin resistance cassette was used for selection. This plasmid is available through Addgene with the ID 102337. Two different clones of this strain, EOC398 and EOC399, were confirmed by sequencing colony PCR fragments that amplified the modified regions of the gene, and the presence of the tagged subunit was confirmed with Western blotting.
Construction of light apparatus
Request a detailed protocolTo grow the cyanobacteria in different light profiles, we constructed an apparatus to control the intensity of four high powered LED arrays (parts list in Table 3, p. 2). ‘Warm white’ LED arrays (1 in. x 1 in., Bridgelux) were chosen because of maximal overlap with the phycobilisome absorption spectrum. An LED array was mounted on a heatsink (Nuventix) and powered by a Flexblock LED driver (LEDdynamics) wired in the ‘boost only’ configuration (Table 4, p. 3). The intensity of the LEDs was controlled by varying the voltage input into the DIM line of the Flexblock between 0 and 10 V. We used a digital potentiometer (AD7376, Analog Devices) as a controllable 10 V source. The voltage output of the digipot was controlled via serial peripheral interface with an Arduino Uno board (Arduino) (see Table 5, p. 4). Each LED array was controlled separately, and a single array was sufficient to grow a single 750 mL culture of S. elongatus. All wires carrying substantial currents from the main power supply to the LED arrays were rated 18 AWG, and all other wires were rated 22 AWG. The relatively low voltage of the main power supply (18 V) is essential for being able to turn off the LED arrays completely.
Calibrating light conditions
Request a detailed protocolA single LED was mounted to shine perpendicular to the ground and isolated from other light sources. A single 750 mL cyanobacterial culture in a 150 cm BD Falcon Tissue culture flask (Fisher Scientific) was placed beneath the LED, tilted such that the broad face of the culture was almost perpendicular to the incoming light. Each LED was calibrated by passing a known voltage input to the LEDs and recording the intensity of the light in mol photons m s at the position of the surface of the culture directly beneath the LED using a LI-COR LI-250A light meter equipped a quantum sensor. To access a greater dynamic range of light intensity values, we calibrated the lights to give light intensity values at either of two distances from the light source — raised towards the lights to access higher light intensities, or lowered away from the lights to access lower light intensities.
To define the Clear Day conditions, we used light intensity values measured by the Ground-based Atmospheric Monitoring Instrument Suite, Rooftop Instrument Group on March 23rd, 2013 (Figure 1B, dark blue line, [Petty and Weidner, 2017]). We used this light intensity profile to define the rate of change of light intensity in our Clear Day condition, with a maximal light intensity of 600 mol photons m s. This intensity is consistent with measurements of light intensity in aquatic environments (Waldbauer et al., 2012), while also offering an order of magnitude difference in intensity compared to the Low Light condition, which was a constant 50 mol photons m s. The Shade pulse condition was defined by dividing the intensity value of our Clear Day profile by 10 fold between 8 and 9 hr after dawn. The High Light pulse was defined as the intensity of the Clear Day condition between 8 and 9 hr after dawn. Low Light cultures were grown continuously at 50 mol photons m s. We generated the dynamic changes in light intensity of our conditions by changing the intensity of the LED every three minutes by passing the calibrated voltage value corresponding to the appropriate light intensity of our defined profile. The light intensity values of the Low Light and Clear Day conditions are listed in Figure 2—source data 1, and the High Light and Shade pulse values are listed in Figure 3—source data 1. After the 12 hr light profile, the LEDs were turned off for 12 hr during the dark period. Cultures were grown semi-turbidostatically (OD maintained at 0.3) with periodic dilution in BG-11M media supplemented with 10 mM HEPES pH 8.0 at 30 C, continuously bubbled with 1% CO in air, and shaken at 25 rpm in an enclosure impermeable to room lighting. Cells were not grown with antibiotics during the course of the experiment.
Purification of anti-RpaA and anti-RpaB antibodies
Request a detailed protocolRecombinant RpaA was purified as previously described (Takai et al., 2006). To purify recombinant RpaB, we cloned the rpaB gene (Synpcc7942_1453, gene info available through Cyanobase, RRID:SCR_007615) into the pET48-b + plasmid (Novagen) and overexpressed Trx-His-tagged RpaB in Novagen Tuner (DE3) competent cells carrying this plasmid by adding 300 M IPTG to mid-log phase cultures. RpaB was purified from cell lysate using Ni-NTA chromatography as described previously (Gutu and O'Shea, 2013). The Trx-His tag was cleaved from RpaB and removed using a subsequent Ni-NTA step as described (Gutu and O'Shea, 2013). Purified, cleaved RpaB was dialyzed into a buffer containing 20 mM HEPES-KOH, pH 8.0, 150 mM KCl, 10% w/v glycerol, and 1 mM DTT. Protein concentration was measured with the Pierce BCA assay, and aliquots were flash frozen and stored at −80°C .
Anti-RpaB serum was generated by immunization of two rabbits with purified RpaB by Cocalico Biologicals (Reamstown, PA). RpaA- and RpaB-conjugated Affigel 10/15 resin (Bio-Rad) was prepared following manufacturer’s instructions as described previously (Gutu and O'Shea, 2013). Anti-RpaB serum was first passed over an RpaA-conjugated resin and the flowthrough collected to subtract cross-reacting antibodies. Anti-RpaB antibodies were then purified from the flowthrough using an RpaB-conjugated resin as described previously (Gutu and O'Shea, 2013). The same process was repeated to purify anti-RpaA antibodies using rabbit serum described previously (Markson et al., 2013), passing the serum over an RpaB-conjugated resin and purifying with an RpaA-conjugated resin. No cross reactivity of the purified anti-RpaA and anti-RpaB antibodies for the opposite regulator was detected via ELISA assay.
Measurement of RpaAP and RpaBP levels
Request a detailed protocolTen mL of cyanobacterial culture with OD were collected on cellulose acetate filters and flash frozen prior to storage at −80 C. Cell lysates for Western blotting were prepared from the collected cells as described previously (Markson et al., 2013). Equal amounts of cell lysate (10–15 g) were resolved on Phos-tag acrylamide gels (Wako Laboratory Chemicals) and transferred to nitrocellulose membranes as described previously (Gutu and O'Shea, 2013). Membranes were probed with 1/5000 dilution of purified anti-RpaA and anti-RpaB antibody. RpaA blots were then incubated with goat anti-rabbit HRP-conjugated secondary antibody and developed using the Pierce Femto chemiluminescence kit. The exposed blots were imaged with an Alpha Innotech Imaging station. RpaB blots were incubated with Goat anti-Rabbit Westerndot 585 antibody (RRID:AB_2556786) and imaged with a Typhoon Imager. The intensities of the bands corresponding to unphosphorylated and phosphorylated RpaA/B were quantified using Imagequant software (GE Healthcare Life Sciences, RRID:SCR_014246) using rubber band background subtraction. The percent of RpaA (or RpaB) phosphorylated was quantified as the intensity of the RpaAP band divided by the sum of the intensities of the RpaA and RpaAP bands, multiplied by 100. Values reported in Figures 4A,B and and 5A,B represent the average of two separate measurements from replicate Western blots, with error bars displaying the range of the measured values (See Figure 4—source data 1, and Figure 5—source data 1 for raw data from the replicate experiments). The trends seen were reproducibly observed between separate biological replicates of the light condition time courses.
RNA sequencing
Request a detailed protocolTwenty-five mL of cyanobacterial culture with OD were collected on cellulose acetate filters and flash frozen prior to storage at −80 C. Cells were resuspended in RNAprotect Bacteria reagent (Qiagen), and 1/3 of the cells were resuspended in a buffer containing 15 mg/mL lysozyme, 10 mM Tris-Cl, 1 mM EDTA pH 8, and 50 mM NaCl and incubated for 10 min. RNA was purified from the lysed cells using the Qiagen RNeasy Mini Kit. Ribosomal RNA was depleted from 1.25 g of purified RNA using the Ribo-Zero bacteria rRNA removal kit (Illumina). Strand-specific RNAseq libraries were prepared from the depleted RNA using the Truseq Stranded mRNA Sample prep kit (Illumina) and sequenced on an Illumina HiSeq 2500 machine by the Bauer Core Facility at the Harvard FAS Center for Systems Biology. Sequencing reads were aligned to the S. elongatus genome using Bowtie (RRID:SCR_005476) as described previously (Markson et al., 2013), with samples averaging 8 million aligned reads. We quantified expression of a gene by counting the number of aligned sequencing reads corresponding to the appropriate strand between the start and stop of each gene (gene info obtained from Cyanobase, RRID:SCR_007615), and normalized these values between all samples from the light conditions in this work using median normalization, followed by dividing the median normalized read count value by the length of the open reading frame of the gene, as described previously (Anders and Huber, 2010; Markson et al., 2013). The time course and RNA sequencing was repeated twice for two biological replicates (data available in Figure 2—source data 1 and Figure 3—source data 1). The data plotted in this work are from replicate 2, and the trends observed are reproduced in both biological replicates.
Definition of circadian genes
Request a detailed protocolWe defined a subset of previously identified circadian genes on which to focus our analysis. We began with a list of 856 previously described reproducibly circadian genes (Markson et al., 2013; Vijayan et al., 2009). We next required that these genes have a Cosiner amplitude (Kucho et al., 2005) of greater that 0.15 under Constant Light conditions (Vijayan et al., 2009). We also required that the gene display expression of at least one read per nucleotide in at least one time point of the RNA sequencing experiments in this study. These filters produce a list of 450 high confidence circadian genes.
We noted that genes classified as dawn (class 2) and dusk (class 1) genes under Constant Light conditions (Vijayan et al., 2009) showed maximal expression at a different time of day under our Low Light conditions, while the relative ordering of genes by Cosiner phase (Kucho et al., 2005) from Constant Light conditions (Vijayan et al., 2009) was preserved. As such, we redefined dawn genes as those genes with a phase of 40 to 189 under Constant Light conditions (Vijayan et al., 2009), and dusk genes as those with a phase of 190 to 360 and 0 to 39, as determined by the Cosiner algorithm (Kucho et al., 2005). These definitions produce a list of 169 high confidence dawn genes, and 281 high confidence dusk genes. The expression of our redefined circadian genes under Constant Light conditions is plotted in Figure 2—figure supplement 2. The list of high confidence circadian genes and high confidence class assignments is available in Figure 2—source data 1 and Figure 3—source data 1.
ChIP sequencing
Request a detailed protocolOne hundred and twenty mL of OD 0.3 cyanobacterial culture were removed and crosslinked with 1% formaldehyde at 30 C for 5 min in front of a light source. Crosslinking was quenched with 125 mM glycine. Crosslinked cells were washed twice with phosphate buffered saline, pelleted, and flash frozen prior to storage at −80 C.
Pellets were resuspended in 1 mL of BG-11M supplemented with 500 mM L-proline and 1 mg/mL lysozyme and incubated at 30 C for 1 hr to digest the cell wall. Cells were collected and resuspended in a Lysis buffer (50 mM HEPES pH 7.5, 140 mM NaCl, 1 mM EDTA, 1% Triton X-100, 0.1% sodium deoxycholate, and 1x Roche Complete EDTA-free Protease Inhibitor Cocktail) prior to shearing in a Covaris E220 Adaptive Focus System (Peak Incident Power = 175; Duty Factor = 10%; Cycles per burst = 200; Time = 160 s). The lysates were cleared via centrifugation, and concentration was determined via the Pierce BCA Assay.
For a given pulldown, 800 g of lysate was incubated overnight at 4 C in 500 L of lysis buffer with 8 g of anti-RpaA, anti-RpaB, or FLAG M2 mouse monoclonal antibody (Sigma-Aldrich) for RNAP pulldowns. A mock pulldown was carried out in which equal amounts of lysate from every time point of the time course (Shade 0, 15, 60 min, High Light 0, 15, 60 min) in a total of 800 g was incubated with 8 g of rabbit Igg. Next, 35 L of Dynabeads protein G (Thermo Fischer Scientific) equilibrated in lysis buffer were added and the sample was incubated with mixing for 2 hr at 4 C. The beads were washed and DNA was eluted and purified as described previously (Markson et al., 2013).
Sequencing libraries were prepared from the purified ChIP DNA using the NEBNext Ultra II DNA Library Prep Kit (New England Biolabs, Ipswich, MA). Libraries were sequenced on an Illumina HiSeq 2500 instrument by the by the Bauer Core Facility at the Harvard FAS Center for Systems Biology. We created sequencing libraries of ChIP experiments from two separate biological repeats of the time course experiment. Reads were aligned to the S. elongatus genome using Bowtie (RRID:SCR_005476) as described previously (Markson et al., 2013), resulting in an average of 3 million aligned reads for replicate 1, and 5 million aligned reads for replicate 2.
ChIP-seq analysis
Request a detailed protocolThe aligned read data per genomic position was smoothed with a Gaussian filter (window size = 400 base pairs, standard deviation = 50). Each data set was normalized to the Mock ChIP-seq experiment and peaks which were significantly enriched above the Mock were identified in each data set using a previously described (Markson et al., 2013) custom-coded form of the Peak-seq algorithm (Rozowsky et al., 2009). Within each replicate time course for a given protein, we compiled a list of peaks which were enriched at least 3.5 fold over the Mock experiment at the position of highest ChIP signal. Finally, we required that a peak be detected in both replicates for it to be considered. This analysis generated 114 RpaA peaks, 218 RpaB peaks, and 451 RNAP peaks. To calculate enrichment for a peak, we determined the ChIP signal at a given time point at the genomic position of the highest ChIP signal detected for that peak and divided this by the value of the Mock experiment at that position. The data plotted in this manuscript are from replicate 2, but all trends hold in replicate 1. We assigned a gene as a target of a peak if: (i) the start codon of the gene was within 500 bp of the position of maximal ChIP signal within a peak; (ii) the peak resided upstream of the gene; (iii) The gene was the closest gene to that peak on the same strand. Lists of RNAP, RpaA, and RpaB peaks and gene targets are found in Figure 3—source data 2, Figure 4—source data 2, and Figure 5—source data 2, respectively.
For Figures 3G, 4C and 5C, we identified all RNAP, RpaA, or RpaB peaks with dusk gene targets based on the above criteria, respectively. 82 dusk genes are targets of RNAP peaks, 56 dusk genes were targets of RpaA peaks, and 42 dusk genes are targets of RpaB peaks. Then, for each peak - dusk gene pair, we calculated the change in gene expression of the dusk gene after 60 min, and the change in ChIP enrichment of the upstream peak over the mock pulldown (described above) after 60 min in High light, each compared to their respective values at Low light at 8 hr since dawn. We plotted these data on the x- and y-axes, respectively, with orange triangles. We repeated this process, comparing gene expression and ChIP enrichment values after 60 min in Shade compared to 8 hr since dawn in Clear Day conditions, and plotted the data as gray circles. We calculated the correlation coefficient between the change in gene expression and the change in ChIP enrichment for all peak-gene pairs of the relevant factor in the High Light pulse, and then calculated the same correlation in Shade pulse conditions separately. We calculated the correlation coefficients comparing changes after 15 min in either the High Light or Shade pulse conditions, and list these values in the legends of Figure 3—figure supplement 2, Figure 4—figure supplement 2, and Figure 5—figure supplement 2. The data used for these plots for RNAP, RpaA, and RpaB are available in Figure 3—source data 2, Figure 4—source data 2, and Figure 5—source data 2, respectively. We plot data from replicate 2, and the trends are reproduced in replicate 1.
For Figure 3—figure supplement 2, Figure 4—figure supplement 2, and Figure 5—figure supplement 2, we took the lists of RNAP/RpaA/RpaB peaks with dusk gene targets from above. For each peak - gene pair, we calculated the log fold change in ChIP enrichment of the peak and the change in expression of the downstream gene in 15 or 60 min in the High Light pulse compared to the value at 8 hr since dawn in Low Light conditions. We repeated these calculations for each peak-gene pair in 15 or 60 min in Shade pulse compared to 8 hr since dawn in Clear Day conditions. We used hierarchical clustering on the collective ChIP and gene expression data from both conditions to determine the plotting order of the peak-gene pairs in the heat maps, and then plotted the log change in ChIP enrichment and dusk target gene expression in the two conditions in separate heat maps. The change in enrichment of a peak and the change in expression of its target dusk gene are aligned horizontally in their respective heat maps. The leftmost column of each heat map is white, because this column compares the time 0 data to itself and thus has a log value of 0. One RpaA peak resides upstream of two dusk genes, and two RpaB peaks reside upstream of two dusk genes each, and thus the listed number of RpaA and RpaB peaks is smaller than the number of RpaA and RpaB target dusk genes. The data used for these plots for RNAP, RpaA, and RpaB are available in Figure 3—source data 2, Figure 4—source data 2, and Figure 5—source data 2, respectively. We plot data from replicate 2, and the trends are reproduced in replicate 1.
For Figures 4D and 5D we identified all dusk genes that were targets of both RpaA and RNAP (for Figure 4D) or both RpaB and RNAP (for Figure 5D). 33 dusk genes are targets of both RpaA and RNAP peaks, and 27 dusk genes are targets of both RpaB and RNAP. Then, for each pair of RpaA/B - RNAP peaks, we calculated the change in ChIP enrichment of the RpaA/B peak after 60 min, and the change in ChIP enrichment of the RNAP peak upstream of the same dusk gene over the mock pulldown (described above) after 60 min in High light, each compared to their respective values at Low light at 8 hr since dawn. We plotted these data on the x- and y-axes, respectively, with orange triangles. We repeated this process, comparing RpaA/B ChIP enrichment and RNAP ChIP enrichment values after 60 min in Shade compared to 8 hr since dawn in Clear Day conditions, and plotted the data as gray circles. We calculated the correlation coefficient between the change in RpaA/B ChIP enrichment and the change in RNAP ChIP enrichment for all RpaA/B - RNAP peak pairs of the relevant factor in the High Light pulse, and then calculated the same correlation in Shade pulse conditions separately. We calculated the correlation coefficients comparing changes after 15 min in either the High Light or Shade pulse conditions, and list these values in the legends of Figure 4—figure supplement 3, and Figure 5—figure supplement 3. The RNAP, RpaA, and RpaB peaks associated with each dusk gene are listed in Figure 2—source data 1 and Figure 3—source data 1, and the enrichment values for these peaks are listed in Figure 3—source data 2, Figure 4—source data 2, and Figure 5—source data 2, respectively. The data plotted here are from replicate 2, and the trends are reproduced in replicate 1.
For Figure 4—figure supplement 3 and Figure 5—figure supplement 3, we took the lists of RpaA/RpaB - RNAP peaks pairs upstream of the same dusk gene from above. For each RpaA/B - RNAP peak, we calculated the log fold change in ChIP enrichment of the RpaA/B peak and the change in ChIP enrichment of the RNAP peak upstream of the same dusk gene in 15 or 60 min in the High Light pulse compared to the value at 8 hr since dawn in Low Light conditions. We repeated these calculations for each peak-gene pair in 15 or 60 min in Shade pulse compared to 8 hr since dawn in Clear Day conditions. We used hierarchical clustering on the collective RpaA/B and RNAP ChIP data from both conditions to determine the plotting order of the RpaA/RpaB - RNAP peak pairs in the heat maps, and then plotted the log change in RpaA/B ChIP enrichment and RNAP ChIP enrichment in the two conditions in separate heat maps. The change in enrichment of an RpaA/B peak and the change in enrichment of the RNAP peak upstream of the same dusk gene are aligned horizontally in their respective heat maps. The leftmost column of each heat map is white, because this column compares the time 0 data to itself and thus has a log value of 0. The RNAP, RpaA, and RpaB peaks associated with each dusk gene are listed in Figure 2—source data 1 and Figure 3—source data 1, and the enrichment values for these peaks are listed in Figure 3—source data 2, Figure 4—source data 2, and Figure 5—source data 2, respectively. The data plotted here are from replicate 2, and the trends are reproduced in replicate 1.
For Figure 4—figure supplement 4D–F, Figure 6D, and Figure 6—figure supplement 1D–F, we identified all RpaA, RpaB, and RNAP peaks that targeted the specified gene, as described above. Then, we calculated the log change in RpaA (dashed red line), RpaB (dotted blue line), RNAP (dashed green line) ChIP enrichment or expression of the downstream gene (solid black lines) in the High Light pulse compared to 8 hr since dawn in the Low Light condition, and plotted these values with downward triangles. We repeated these calculations, comparing enrichment and gene expression in the Shade pulse to the data at 8 hr since dawn in the Clear Day condition, and plotted these values with circles. The RNAP, RpaA, and RpaB peaks associated with each dusk gene are listed in Figure 2—source data 1 and Figure 3—source data 1, and the enrichment values for these peaks are listed in Figure 3—source data 2, Figure 4—source data 2, and Figure 5—source data 2, respectively. The data plotted here are from replicate 2, and the trends are reproduced in replicate 1.
K-means clustering
Request a detailed protocolWe calculated normalized expression values of high confidence dusk genes under our dynamic light conditions, as well as in previously described RpaA perturbations in Constant Light (Markson et al., 2013). We separately normalized the data from set of dynamic light conditions (Low Light, Clear Day, High Light pulse, Shade pulse) and the Constant Light data (Wildtype, OX-D53E cells — rpaA-, kaiBC-, Ptrc::rpaA(D53E) — without inducer, OX-D53E with inducer, [Markson et al., 2013]) using z-score normalization, and used this data to separate the dusk genes into eight groups with k-means clustering in MATLAB (RRID:SCR_001622) using Pearson correlation as the distance metric. We focused our analysis on the three largest clusters which accounted for most of the dusk genes (187/281 genes). The lists of genes belonging the three major clusters are found in Figure 7—source data 1.
Mathematical modeling
Request a detailed protocolWe observed very regular and systematic changes in the expression of large clusters of dusk genes in natural light conditions (Figures 2, 3 and 7) that correlated with recruitment of RNAP (Figures 4–6). Thus, our goal was to determine whether simple phenomenological models similar to that inspired by Alon (Alon, 2006) could reproduce these observations and offer some intuition into how they might arise. While most of the dusk genes underwent systematic changes, a small group of 20 genes including kaiBC was relatively insensitive to changes in light intensity (Figure 4—figure supplement 4), and we do not model those genes’ expression dynamics.
Our model treats the activation or repression of the expression of a dusk gene cluster by , or another cluster using effective Hill kinetics. We coarse-grained each of the three groups of circadian dusk genes (the Early, Middle, and Late clusters in Figure 7) to a single effective gene with the average dynamics of the group (Figure 7, solid lines). We modeled the dynamics of a gene cluster using a simple kinetic model of an AND gate at a promoter (Mangan and Alon, 2003),
where is the basal transcription rate; is a function of the interaction of with , or another cluster ; is the max transcription rate; and is the decay/dilution rate. Activating interactions were treated using a simple Hill function,
where is the concentration of the active transcription factor, is the Hill coefficient of interaction, and is the coefficient of activation. Bacteria can easily tune the interactions between proteins and between transcription factors and promoters to adjust and for different clusters (Buchler et al., 2003). and , were treated as activators, consistent with the results from Figures 4–6. Repressive interactions between clusters were treated using
where is now the coefficient of repression. In Equation 1, , and were measured experimentally; the remainder of the parameters were left free.
We determined the sufficiency of a model to describe the data by fitting the parameters using the range of values shown in Table 1. Time propagation of the differential Equation 1 was performed using the ode45 solver in MATLAB (RRID:SCR_001622), with set as the observed expression level at the beginning of the simulated time period. Model fitting was performed in MATLAB using the non-linear least squares solver lsqnonlin.
The Akaike Information Criterion (AIC) and the Chi-squared test are typically used to quantify whether a model with more parameters fits the data better than another with fewer parameters simply because it is more complex. However, both approaches are for statistical models in which little to no information is used to construct the model and are not strictly applicable to the model constructed here, which is based on our understanding of transcription. If we do use AIC to compare the models, the feedback models are predicted to be most probable.
In our model, and are effective constants that represent the overall ability of , or another gene cluster to affect gene expression. These constants include potential indirect activation through the sigma factors, which is may be why joint activation by and describe the dynamics of the Middle cluster reasonably well. However, circadian gating of the Early and Late dusk genes requires further interactions that cannot be described by Hill functions of measured and levels. Clearly there may be more complex networks at play than those we have considered here, and much more needs to be done to fully model gene expression in S. elongatus. Here we have constructed a first model to suggest simple principles underlying the interaction of circadian and light regulation of dusk genes and offer directions for further exploration.
Data availability
-
Natural changes in light interact with circadian regulation at promoters to control gene expression in cyanobacteriaPublicly available at the NCBI Gene Expression Omnibus (accession no: GSE104204).
-
Oscillations in supercoiling drive circadian gene expression in cyanobacteriaPublicly available at the NCBI Gene Expression Omnibus (accession no: GSE18902).
-
Circadian control of global gene expression by the cyanobacterial master regulator RpaAPublicly available at the NCBI Gene Expression Omnibus (accession no: GSE50922).
References
-
BookAn Introduction to Systems Biology: Design Principles of Biological CircuitsBoca Raton: CRC press.
-
Systems and photosystems: cellular limits of autotrophic productivity in cyanobacteriaFrontiers in Bioengineering and Biotechnology 3:1.https://doi.org/10.3389/fbioe.2015.00001
-
Genome-wide study of mRNA degradation and transcript elongation in Escherichia coliMolecular Systems Biology 11:781.https://doi.org/10.15252/msb.20145794
-
Engineering cyanobacteria to generate high-value productsTrends in Biotechnology 29:95–103.https://doi.org/10.1016/j.tibtech.2010.12.003
-
BookChronobiology: Biological TimekeepingSunderland, Massachusetts: Sinauer Associates, Inc.
-
Integrating circadian dynamics with physiological processes in plantsNature Reviews Genetics 16:598–610.https://doi.org/10.1038/nrg3976
-
Multiple sigma subunits and the partitioning of bacterial transcription spaceAnnual Review of Microbiology 57:441–466.https://doi.org/10.1146/annurev.micro.57.030502.090913
-
Genome-wide survey of mRNA half-lives in Bacillus subtilis identifies extremely stable mRNAsMolecular Genetics and Genomics 269:706–714.https://doi.org/10.1007/s00438-003-0883-6
-
RpaB, another response regulator operating circadian clock-dependent transcriptional regulation in Synechococcus elongatus PCC 7942Journal of Biological Chemistry 287:26321–26327.https://doi.org/10.1074/jbc.M111.338251
-
Modulation of environmental responses of plants by circadian clocksPlant, Cell & Environment 30:333–349.https://doi.org/10.1111/j.1365-3040.2006.01627.x
-
Metabolic compensation and circadian resilience in prokaryotic cyanobacteriaAnnual Review of Biochemistry 83:221–247.https://doi.org/10.1146/annurev-biochem-060713-035632
-
The cyanobacterial circadian system: from biophysics to bioevolutionAnnual Review of Biophysics 40:143–167.https://doi.org/10.1146/annurev-biophys-042910-155317
-
Global analysis of circadian expression in the cyanobacterium Synechocystis sp. strain PCC 6803Journal of Bacteriology 187:2190–2199.https://doi.org/10.1128/JB.187.6.2190-2199.2005
-
Environmental control of phosphorylation pathways in a branched two-component systemMolecular Microbiology 78:475–489.https://doi.org/10.1111/j.1365-2958.2010.07348.x
-
A model for sigma factor competition in bacterial cellsPLoS Computational Biology 10:e1003845.https://doi.org/10.1371/journal.pcbi.1003845
-
WebsiteGround-based atmospheric monitoring instrument suite, rooftop instrument group: solar radiation dataUniversity of Wisconsin-Madison, Space Science and Engineering Center and Department of Atmospheric and Oceanic Sciences.
-
PeakSeq enables systematic scoring of ChIP-seq experiments relative to controlsNature Biotechnology 27:66–75.https://doi.org/10.1038/nbt.1518
-
Control of rRNA and tRNA syntheses in Escherichia coli by guanosine tetraphosphateJournal of Bacteriology 151:1261–1268.
-
Induction of a group 2 sigma factor, RPOD3, by high light and the underlying mechanism in Synechococcus elongatus PCC 7942Journal of Biological Chemistry 282:36887–36894.https://doi.org/10.1074/jbc.M707582200
-
Toward multiscale models of cyanobacterial growth: a modular approachFrontiers in Bioengineering and Biotechnology 4:95.https://doi.org/10.3389/fbioe.2016.00095
-
Interplay of circadian clocks and metabolic rhythmsAnnual Review of Genetics 40:409–448.https://doi.org/10.1146/annurev.genet.40.110405.090603
Article and author information
Author details
Funding
Howard Hughes Medical Institute
- Erin K O'Shea
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
JRP and KA thank Phil Shiu, Bin He, Eddie Wang, Andrian Gutu, Chris Chidley, Luca Gerosa, Vadim Patsalo, Rohan Balakrishnan, Matteo Mori, and Doran Bennett for helpful comments and discussions on the manuscript. We thank Andrew Kennard for initial experiments on the role of RpaB in regulating gene expression under dynamic light conditions. JRP thanks Christian Daly, Claire Reardon, Jennifer Couget, and Patrick Dennett of the Harvard Bauer Core Facility for their help with high throughput sequencing and other experiments. KA thanks Al Takeda and Jim MacArthur from the Harvard Electronics Shop for their extensive help in building the controllable lights. JRP and KA were supported by the Howard Hughes Medical Institute through EKO.
Copyright
© 2017, Piechura et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 4,157
- views
-
- 531
- downloads
-
- 29
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
- Microbiology and Infectious Disease
Timely and effective use of antimicrobial drugs can improve patient outcomes, as well as help safeguard against resistance development. Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) is currently routinely used in clinical diagnostics for rapid species identification. Mining additional data from said spectra in the form of antimicrobial resistance (AMR) profiles is, therefore, highly promising. Such AMR profiles could serve as a drop-in solution for drastically improving treatment efficiency, effectiveness, and costs. This study endeavors to develop the first machine learning models capable of predicting AMR profiles for the whole repertoire of species and drugs encountered in clinical microbiology. The resulting models can be interpreted as drug recommender systems for infectious diseases. We find that our dual-branch method delivers considerably higher performance compared to previous approaches. In addition, experiments show that the models can be efficiently fine-tuned to data from other clinical laboratories. MALDI-TOF-based AMR recommender systems can, hence, greatly extend the value of MALDI-TOF MS for clinical diagnostics. All code supporting this study is distributed on PyPI and is packaged at https://github.com/gdewael/maldi-nn.
-
- Computational and Systems Biology
- Genetics and Genomics
Enhancers and promoters are classically considered to be bound by a small set of transcription factors (TFs) in a sequence-specific manner. This assumption has come under increasing skepticism as the datasets of ChIP-seq assays of TFs have expanded. In particular, high-occupancy target (HOT) loci attract hundreds of TFs with often no detectable correlation between ChIP-seq peaks and DNA-binding motif presence. Here, we used a set of 1003 TF ChIP-seq datasets (HepG2, K562, H1) to analyze the patterns of ChIP-seq peak co-occurrence in combination with functional genomics datasets. We identified 43,891 HOT loci forming at the promoter (53%) and enhancer (47%) regions. HOT promoters regulate housekeeping genes, whereas HOT enhancers are involved in tissue-specific process regulation. HOT loci form the foundation of human super-enhancers and evolve under strong negative selection, with some of these loci being located in ultraconserved regions. Sequence-based classification analysis of HOT loci suggested that their formation is driven by the sequence features, and the density of mapped ChIP-seq peaks across TF-bound loci correlates with sequence features and the expression level of flanking genes. Based on the affinities to bind to promoters and enhancers we detected five distinct clusters of TFs that form the core of the HOT loci. We report an abundance of HOT loci in the human genome and a commitment of 51% of all TF ChIP-seq binding events to HOT locus formation thus challenging the classical model of enhancer activity and propose a model of HOT locus formation based on the existence of large transcriptional condensates.