Evolutionary emergence of Hairless as a novel component of the Notch signaling pathway
Abstract
Suppressor of Hairless [Su(H)], the transcription factor at the end of the Notch pathway in Drosophila, utilizes the Hairless protein to recruit two co-repressors, Groucho (Gro) and C-terminal Binding Protein (CtBP), indirectly. Hairless is present only in the Pancrustacea, raising the question of how Su(H) in other protostomes gains repressive function. We show that Su(H) from a wide array of arthropods, molluscs, and annelids includes motifs that directly bind Gro and CtBP; thus, direct co-repressor recruitment is ancestral in the protostomes. How did Hairless come to replace this ancestral paradigm? Our discovery of a protein (S-CAP) in Myriapods and Chelicerates that contains a motif similar to the Su(H)-binding domain in Hairless has revealed a likely evolutionary connection between Hairless and Metastasis-associated (MTA) protein, a component of the NuRD complex. Sequence comparison and widely conserved microsynteny suggest that S-CAP and Hairless arose from a tandem duplication of an ancestral MTA gene.
https://doi.org/10.7554/eLife.48115.001
Introduction
A very common paradigm in the regulation of animal development is that DNA-binding transcriptional repressors bear defined amino acid sequence motifs that permit them to recruit, by direct interaction, one or more common co-repressor proteins that are responsible for conferring repressive activity. Two such universal co-repressors are Groucho (Gro) and C-terminal Binding Protein (CtBP).
The ancient and highly conserved transcription factor Suppressor of Hairless [Su(H)] functions at the terminus of the widely utilized Notch cell-cell signaling pathway. Su(H) is converted into an activator by signaling through the Notch receptor, but in the absence of signaling it functions as a repressor. Earlier studies have revealed that in many settings in Drosophila, Su(H)’s repressive activity depends on binding to the Hairless protein (Figure 1). Hairless includes separate Gro- and CtBP-binding motifs, which permit it to function as an adaptor to bring these two co-repressors to Su(H) (Figure 1B) (Barolo et al., 2002a). Thus, the Su(H)/H partnership in the fly represents a notable exception to the rule of direct co-repressor recruitment.
As genome and transcriptome sequences have become available for more and more insects and other arthropods, we have searched for possible Hairless orthologs in a wide variety of species, in an attempt to determine the protein’s phylogenetic distribution. We have found that Hairless is confined to the Pancrustacea (or Tetraconata), a clade of arthropods that includes the Crustacea and Hexapoda (Misof et al., 2014; Kjer et al., 2016). While this indicates that Hairless was gained at least 500 Mya, it also raises the question of how Su(H) in other protostomes acquires repressive activity.
Here we present evidence that direct co-repressor recruitment by Su(H) is likely to be ancestral in the protostomes. We show that Su(H) in a broad range of protostomes, including arthropods, molluscs, and annelids, bears both a short linear motif that mediates binding of CtBP and a novel motif for direct recruitment of Gro. Thus, the evolutionary appearance of Hairless has permitted the replacement of an ancient and predominant regulatory mechanism (direct co-repressor recruitment) with a novel one (indirect recruitment).
What can we learn about the evolutionary history of Hairless? While Hairless itself is found only in the Pancrustacea, we show that the genomes of Myriapods and Chelicerates encode a protein with clear sequence and functional similarities to Hairless. These proteins include a motif that strongly resembles the Su(H)-binding domain of Hairless, and we demonstrate that this motif from the house spider Parasteatoda tepidariorum does indeed bind Su(H). In addition, these Myriapod and Chelicerate proteins also include one or more canonical motifs for recruitment of CtBP. Accordingly, we designate these factors as ‘Su(H)-Co-repressor Adaptor Proteins’ (S-CAPs).
Finally, further sequence analyses, along with the discovery of conserved microsynteny, have provided substantial evidence that Hairless and the S-CAPs are likely to be homologous and that they arose from a duplication of the gene encoding Metastasis-associated (MTA) protein, a component of the nucleosome remodeling and deacetylase (NuRD) complex.
An intriguing question in evolutionary biology concerns the path by which a particular clade has escaped a strongly selected character that has been conserved for hundreds of millions of years. We believe that our study has yielded valuable insight into both the emergence of an evolutionary novelty and its replacement of an ancestral paradigm.
Results
Hairless is present only in the Pancrustacea
We have conducted extensive BLAST searches of genome and transcriptome sequence data for a wide variety of metazoa in an attempt to define the phylogenetic distribution of Hairless. We find that Hairless as originally described (Bang and Posakony, 1992; Maier et al., 1992; Maier et al., 2008) is confined to the Pancrustacea (or Tetraconata), and occurs widely within this clade, including the Hexapoda, Vericrustacea, and Oligostraca (Figure 2A). By contrast, no evidence for a true Hairless gene has been detected in either Myriapods or Chelicerates, even in cases where substantially complete genome sequence assemblies are available.
The enormous variation in the size of the Hairless protein in various Pancrustacean clades is worthy of note (Figure 1A). The known extremes are represented by the Diplostracan (shrimp) Eulimnadia texana (343 aa) (Baldwin-Brown et al., 2018) and the Dipteran (fly) Protophormia terraenovae (1614 aa) (Hase et al., 2017), a 4.7-fold difference. There is a broad tendency for the size of the protein to be relatively stable within an order (Supplementary file 1). Thus, as noted previously (Maier et al., 2008), the Hymenoptera generally have a small Hairless (of the order of 400 aa; see Figure 1A), while the Diptera typically have a much larger version (of the order of 1000 aa or more). Notable exceptions to this pattern of uniformity are aphids, where Hairless is typically ~900 aa compared to ~400 aa in other Hemiptera, and chalcid wasps, where the protein is over 500 aa instead of the Hymenoptera-typical ~400 aa noted above (Supplementary file 1). Smaller Hairless proteins typically retain all five conserved motifs/domains characteristic of this factor (Maier et al., 2008), while the regions that flank and lie between these sequences are reduced in size (Figure 1A; Supplementary file 2).
A known CtBP-binding motif is present in the non-conserved N-terminal region of Su(H) in a wide variety of protostomes
The apparent confinement of the Hairless co-repressor adaptor protein to the Pancrustacea raises the question of the mechanism(s) by which Su(H) in other protostomes might recruit co-repressor proteins to mediate its repressor function. Of course, other protostomes need not utilize the Gro and CtBP co-repressors for this purpose; different co-repressors might substitute. Nevertheless, we first sought to identify known binding motifs for Gro and CtBP in Su(H) from arthropods lacking Hairless. As shown in Table 1, we found a canonical CtBP recruitment motif of the form PϕDϕS (where ϕ = I, L, M, or V) in predicted Su(H) proteins from a variety of Myriapods and Chelicerates, including the centipede Strigamia maritima, the tick Ixodes scapularis, the spider Parasteatoda tepidariorum, the horseshoe crab Limulus polyphemus, and the scorpion Centruroides sculpturatus. These motifs are all located in the non-conserved N-terminal region of Su(H) (Supplementary file 3).
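A scan for the PϕDϕS pattern defined above can be sketched as a regular-expression search. This is only an illustration under stated assumptions, not necessarily the procedure used in this study: the function name is hypothetical, and the test peptides are short illustrative strings built around the PVDLS motif of Strigamia Su(H) and an alanine-substituted counterpart.

```python
import re

# PϕDϕS, where ϕ is I, L, M, or V (motif definition taken from the text).
CTBP_MOTIF = re.compile(r"P[ILMV]D[ILMV]S")


def find_ctbp_motifs(protein):
    """Return (1-based start position, matched peptide) for each PϕDϕS hit.

    Hypothetical helper for illustration; input is a one-letter amino acid string.
    """
    return [(m.start() + 1, m.group(0)) for m in CTBP_MOTIF.finditer(protein.upper())]


# Illustrative peptides (assumptions): one carrying PVDLS, one with the motif
# replaced by five alanines.
print(find_ctbp_motifs("NHPVDLSNSHR"))
print(find_ctbp_motifs("NHAAAAANSHR"))
```

A pattern match of this kind only nominates candidate sites; whether a given site actually binds CtBP has to be established experimentally.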
Extending this sequence analysis to other protostome phyla led to the finding that a similar PϕDϕS motif occurs in the N-terminal region of Su(H) from a large number of molluscs and annelids, as well as from multiple Nemertea, Brachiopoda, Phoronida, and monogonont rotifers, and also from some flatworms (Table 1). It is notable, by contrast, that we do not find CtBP-binding motifs present in Su(H) from nematodes. Nevertheless, given the broad phylogenetic distribution of the PϕDϕS motif in Su(H) from both Ecdysozoa and Lophotrochozoa, our observations strongly suggest that direct recruitment of CtBP by Su(H) is ancestral in the protostomes.
To verify that the shared PϕDϕS motif in protostome Su(H) proteins can indeed mediate direct recruitment of CtBP, we carried out an in vitro pulldown assay using GST-tagged Drosophila CtBP (bound to Glutathione Sepharose beads) and a His-tagged fragment of Strigamia maritima Su(H) (Figure 3A). We found that the two proteins do interact directly and robustly, in a manner that is dependent on the integrity of the PVDLS motif in Strigamia Su(H).
A novel conserved motif in protostome Su(H) binds the Gro co-repressor
In addition to a PϕDϕS CtBP-binding motif, we have found that Su(H) from a wide variety of protostomes includes a novel motif similar to GSLTPPDKV (Table 1). Where present, this sequence typically lies a short (but variable) distance C-terminal to the PϕDϕS motif, also within the non-conserved N-terminal region of the protein (Supplementary file 3). The GSLTPPDKV motif is particularly prevalent in Su(H) from the Trochozoa, which includes annelids, sipunculans, molluscs, nemerteans, brachiopods, and phoronids (Kocot et al., 2017). Among the Ecdysozoa, it appears consistently in Su(H) from Crustacea and Myriapoda, and in small subsets of both Hexapoda (Ephemeroptera, Odonata, Zygentoma, Archaeognatha, Diplura, and Collembola) and Chelicerata [harvestmen (Opiliones) and Scorpiones]. The motif is absent from Su(H) in all other insect orders, and we have not found it so far in Su(H) from nematodes, flatworms, rotifers, or tardigrades; it is, however, found in the onychophoran Euperipatoides kanangrensis (Table 1). Perhaps surprisingly, the motif is present in Su(H) from the acorn worms Saccoglossus kowalevskii and Ptychodera flava (Simakov et al., 2015), which are hemichordates (deuterostomes).
Using an in vitro pulldown assay, we tested the possibility that the GSLTPPDKV motif mediates binding of the Gro co-repressor (Figure 3B). Indeed, we find that GST-tagged Gro protein interacts strongly with a His-tagged protein bearing this motif at its C-terminus, and that this binding is abolished when the motif is replaced by alanine residues. We conclude that Su(H) from a broad range of protostomes is capable of directly recruiting both CtBP and Gro (Table 1), and that this capacity is hence very likely to be ancestral in this clade.
Retention of the hybrid state: Species that have both Hairless and the co-repressor-binding motifs in Su(H)
The evolutionary emergence of Hairless as an adaptor protein capable of mediating the indirect recruitment of both Gro and CtBP to Su(H) might be expected to relieve a selective pressure to retain the ancestral Gro- and CtBP-binding motifs in Su(H) itself. And indeed, we find that Su(H) from multiple insect orders comprising the Neoptera lacks both of these sequences (Figure 2B). Strikingly, however, we have observed that Crustacea and a small group of Hexapoda retain both traits (Figure 2B). Thus, multiple representatives of the Branchiopoda, Malacostraca, and Copepoda, along with Ephemeroptera, Odonata, Zygentoma, Archaeognatha, Diplura, and Collembola, have both a canonical Hairless protein (including its Gro- and CtBP-binding motifs) and Gro- and CtBP-binding motifs within Su(H). These clades, then, appear to have retained a ‘hybrid intermediate’ state (Baker et al., 2012) characterized by the presence of both co-repressor recruitment mechanisms.
Myriapods and Chelicerates encode a protein with similarity to Hairless
While canonical Hairless proteins are confined to the Pancrustacea, we have discovered that the genomes of Myriapods and Chelicerates nevertheless encode a protein with intriguing similarities to Hairless. Most notable is the presence of a motif that strongly resembles the ‘Su(H)-binding domain’ (SBD) of Hairless, which mediates its high-affinity direct interaction with Su(H) (Figure 1; Figure 4A). We will refer to these proteins as ‘S-CAPs’; the basis for this designation will be made clear in forthcoming figures. We note that the occurrence of this protein in the centipede Strigamia maritima has also recently been reported by Maier (2019). In the Pancrustacea, the N-terminal and C-terminal halves of the Hairless SBD are encoded by separate exons (Figure 4B). Strikingly, the related motif in Myriapod and Chelicerate S-CAPs is likewise encoded by separate exons, with exactly the same splice junction as in Hairless (Figure 4B). We believe that this is highly unlikely to be coincidental, and is instead strongly suggestive of an evolutionary relationship between Hairless and S-CAPs.
A recent structural analysis of the Su(H)-Hairless protein complex identified several residues in the Hairless SBD that are involved in binding to the C-terminal domain (CTD) of Su(H) (Yuan et al., 2016) (Figure 4A). These include four hydrophobic amino acids in the main body of the SBD (L235, F237, L245, and L247; these are highlighted in red in Figure 4A). Note that the Myriapod and Chelicerate S-CAP motifs share these same residues. In addition, a tryptophan (W258) C-terminal to the main body of the Hairless SBD also participates in binding to Su(H) (Figure 4A). Myriapod and Chelicerate S-CAPs all include a tryptophan residue at a similar position C-terminal to the main SBD-like domain (Figure 4A). Moreover, this particular W residue in both Hairless and the S-CAPs is followed by a hydrophobic residue, typically V or I. These sequence features, we suggest, are further strong evidence of a common ancestry for the respective segments of Hairless and S-CAPs.
A third structural similarity between Hairless and S-CAPs is the presence in the latter of one or more short linear motifs capable of binding the CtBP co-repressor (Figure 5A). These motifs typically reside in the C-terminal half of the S-CAPs, superficially resembling the C-terminal location of Hairless's CtBP recruitment motif.
A table listing representative examples of Myriapod and Chelicerate S-CAPs is provided as Supplementary file 4, and an annotated FASTA file of their amino acid sequences is included as Supplementary file 5. It is important to note that we have not found non-Hairless S-CAPs in the Pancrustacea.
Spider S-CAP binds to Drosophila Su(H)
Given the clear sequence similarity between the Hairless SBD and the SBD-like motif in Myriapod and Chelicerate S-CAPs, we investigated whether the latter motif is likewise capable of mediating direct binding to Su(H). As noted above, the Hairless SBD interacts specifically with the CTD of Su(H). Since this domain in Su(H) is very highly conserved throughout the Bilateria and Cnidaria, we thought it reasonable to utilize Drosophila Su(H) for this binding assay. As shown in Figure 4C, we find that a 200-amino-acid segment of S-CAP from the spider Parasteatoda tepidariorum binds directly to Drosophila Su(H) in vitro. This interaction depends strictly on the integrity of the five residues that in Hairless have been shown to contact the Su(H) CTD (highlighted in red in Figure 4A).
Given the presence of one or more CtBP recruitment motifs in the Myriapod and Chelicerate S-CAP proteins (Figure 5A), along with the ability of their SBD-like domains to bind Su(H) (Figure 4C), we have designated these as ‘Su(H)-Co-repressor Adaptor Proteins’ (S-CAPs).
Chelicerate S-CAP proteins are related to Metastasis-associated (MTA) proteins
In addition to their similarities to Hairless, the S-CAP proteins of Chelicerates include two regions with strong sequence homology to the Metastasis-associated (MTA) protein family, which is highly conserved among Metazoa. The MTA proteins play an important role in transcriptional regulation via their function as core components of the nucleosome remodeling and deacetylase (NuRD) complex (Allen et al., 2013; Burgold et al., 2019). The N-terminal half of MTAs includes four well-defined functional domains: BAH (Bromo-Adjacent Homology), ELM2 (Egl-27 and MTA1 homology), SANT (Swi3, Ada2, N-CoR, and TFIIIB), and GATA-like zinc finger (Millard et al., 2014) (Figure 5B). Of these, the ELM2 and SANT domains are retained at the N-terminal end of Chelicerate S-CAPs (Figure 5B; Figure 5—figure supplement 1A). This is highly likely to have functional significance, as the ELM2 and SANT domains of MTA proteins work together to recruit and activate the histone deacetylases HDAC1 and HDAC2 (Millard et al., 2013). Further suggesting homology between Chelicerate S-CAPs and MTAs is the observation that their shared ELM2 and SANT domains are each encoded by two exons with exactly the same splice junction (Figure 5C).
It is noteworthy that, despite sharing the SBD-like and CtBP recruitment motifs of Chelicerate S-CAPs, the available Myriapod S-CAP protein sequences lack the N-terminal ELM2 and SANT homologies with MTA proteins (Figure 5B). Consistent with this, the SBD motif in Myriapod S-CAPs lies much closer to the protein’s N terminus than the SBD motif in Chelicerate S-CAPs, suggesting that simple loss of the ELM2/SANT-encoding exons might underlie this difference between the two S-CAP clades. Likewise, Hairless proteins are devoid of clear similarities to MTAs.
In addition to their SBD and ELM2/SANT domains, Chelicerate S-CAPs share a third region of homology that lies between the ELM2 and SANT sequences (Figure 5—figure supplement 1A). This region is absent from both Hairless and the Myriapod S-CAPs. Conversely, Myriapod S-CAPs include a segment of sequence similarity that is not found in either Hairless or Chelicerate S-CAPs (Figure 5—figure supplement 1B).
Conserved microsynteny between MTA and S-CAP/Hairless genes
Our analysis of the genomic locations of genes encoding MTA proteins in Arthropoda, Hairless in Pancrustacea, and S-CAPs in Myriapods and Chelicerates has yielded the surprising finding that proximate or near-proximate linkage between MTA and Hairless genes or between MTA and S-CAP genes is broadly conserved among arthropods (Figure 6; Supplementary file 1; Supplementary file 4). Thus, in the centipede Strigamia maritima, the gene encoding S-CAP lies immediately upstream of that encoding MTA, in the same orientation (Figure 6; Supplementary file 4). A similar linkage relationship between S-CAP and MTA genes is seen in many arachnids, including the spiders Nephila clavipes (Supplementary file 4) and Parasteatoda tepidariorum (Figure 6; Supplementary file 4) and the mites Achipteria coleoptrata and Sarcoptes scabiei (Supplementary file 4). Likely due at least in part to its history of whole-genome duplication (Nossa et al., 2014; Kenny et al., 2016), the horseshoe crab Limulus polyphemus (representing the Merostomata/Xiphosura) has three paralogous copies of this same S-CAP-MTA linkage pairing (Supplementary file 4). Some exceptions to this pattern do exist. In the genomes of the mites Metaseiulus occidentalis (Supplementary file 4) and Varroa destructor (Techer et al., 2019), for example, the genes encoding S-CAP and MTA are far separated from each other.
Close, typically adjacent, linkage between Hairless and MTA genes is likewise widely observed in the genomes of Pancrustacea. Among the Hexapoda, this pattern can be found in many different orders (Supplementary file 1), including Diptera, Lepidoptera, Coleoptera (Figure 6), Hymenoptera (Figure 6), Psocodea, Hemiptera (Figure 6), Thysanoptera, Blattodea, Orthoptera, Odonata, and Collembola. Among the Vericrustacea, adjacent linkage of Hairless and MTA is seen in the shrimp Triops cancriformis (Notostraca) (Supplementary file 1). Nevertheless, exceptions are readily found, even within the same orders as above (Supplementary file 1). Examples include Drosophila melanogaster, Ceratitis capitata, and Lucilia cuprina (Diptera; Supplementary file 1), Bicyclus anynana (Lepidoptera), Anoplophora glabripennis, Dendroctonus ponderosae, and Nicrophorus vespilloides (Coleoptera), and Cimex lectularius (Hemiptera; Supplementary file 1).
Interestingly, in some instances Hairless/MTA microsynteny is preserved, but the genes’ relative orientation is different (Figure 6; Supplementary file 1). Thus, in the aphids — in contrast to other Hemiptera — MTA lies downstream of Hairless, but in the opposite orientation (Figure 6). In the beetle Harmonia axyridis (Coleoptera), MTA lies upstream of Hairless (Figure 6).
Despite the multiple instances in which it has been lost, we believe that the most parsimonious interpretation of our analysis is that close linkage between MTA and S-CAP/Hairless genes is ancestral in the respective taxa (Myriapods/Chelicerates and Pancrustacea). We leave for the Discussion our proposed interpretation of the evolutionary significance of this adjacency.
Discussion
The evolution of Hairless represents a shift from the ancestral and dominant paradigm of direct co-repressor recruitment by Su(H)
Our analysis of sequences from a broad range of protostomes strongly suggests that direct recruitment of the CtBP and Gro co-repressors by Su(H) is ancestral in this clade. This is consonant with the fact that direct co-repressor recruitment by DNA-binding repressor proteins in general is a dominant paradigm among Metazoa. This raises the intriguing question of what might have led to the loss of direct recruitment by Su(H) in the Neoptera (see Figure 2B) and its replacement by Hairless-mediated indirect recruitment. Does Hairless provide some advantageous functional capacity? Note that this is not intended to suggest that Hairless must be an evolutionary adaptation per se (Lynch, 2007); rather, we are asking: What capability might it have conferred that would lead to its retention and the subsequent loss of the recruitment motifs in Su(H)?
One appealing (but of course speculative) possibility is that Hairless may have permitted Su(H) for the first time to recruit both CtBP and Gro simultaneously to the same target genes. As we have noted, the apparently ancestral PϕDϕS and GSLTPPDKV motifs in protostome Su(H) typically lie quite close to each other in the protein’s linear sequence (Supplementary file 3). CtBP (~400 aa) and Gro (~700 aa) are both large proteins that engage in oligomerization as part of their functional mechanism (Song et al., 2004; Bhambhani et al., 2011). It is very unlikely that both could bind stably to DNA-bound Su(H) at the same time. In contrast, the Gro and CtBP recruitment motifs in Hairless are far apart in the linear sequence (Figure 1A) and are separated by a region predicted to be largely disordered (Figure 1—figure supplement 1). We suggest that this might be compatible with simultaneous recruitment of the two co-repressors.
Whatever may have been the selective forces that led to the loss of direct co-repressor recruitment by Su(H) in the Neoptera and its replacement by Hairless-mediated indirect recruitment, Hairless is a notable evolutionary novelty for having permitted the unusual abandonment of an ancestral and highly conserved paradigm. We suggest that this represents a striking example of ‘developmental system drift’ (True and Haag, 2001), in which a common output (widespread ‘default repression’ of Notch pathway target genes) is achieved via distinct molecular mechanisms in different species.
A possible evolutionary pathway for the appearance of Hairless
We have described here several findings that we believe have important implications for an attempt to reconstruct the history of Hairless as an evolutionary novelty. First, we observe that Hairless is apparently confined to the Pancrustacea, wherein it is widely distributed among diverse taxa (Figure 2A; Supplementary file 1). Second, we have discovered in the sister groups Myriapoda and Chelicerata a protein (S-CAP) with clear sequence homology to the Su(H)-binding domain (SBD) of Hairless (Figure 4A). Significantly, in both Hairless and the S-CAPs these motifs are encoded by contributions from two exons, with the associated splice junction in precisely the same location (Figure 4B; Supplementary file 4). Third, we find that S-CAPs in the Chelicerata include in their N-terminal region strong homology to the ELM2 and SANT domains of MTAs, which themselves are highly conserved among Metazoa, and therefore would have been present in the arthropod common ancestor (Figure 5B,C). Finally, our analysis indicates that close, usually adjacent, linkage between Hairless and MTA genes (in the Pancrustacea) and between S-CAP and MTA genes (in the Myriapoda and Chelicerata) is widespread (Figure 6; Supplementary file 1; Supplementary file 4), and hence very likely to be ancestral, in these taxa.
While any attempt to infer the sequence of evolutionary events that led to the appearance of Hairless is necessarily speculative, we believe that the above findings offer substantial support for the following hypothetical pathway. We propose that in a deep arthropod ancestor a tandem duplication of the MTA gene occurred. One copy retained the strong sequence conservation (and presumably ancestral function) of metazoan MTA genes, while the second copy diverged very substantially, eventually encoding a protein that had lost all but the ELM2 and SANT domains of the MTA ancestor. The extensive reconfiguration of this paralog also included the eventual acquisition of the SBD motif and the addition of one or more CtBP recruitment motifs (see Figure 7 for some possible sources of these components). In the Myriapod lineage, even the ELM2 and SANT domains were eventually lost. In the Pancrustacea, we suggest that this same divergent MTA paralog evolved to become Hairless. Beyond the alterations described for the Myriapoda, this would have involved the acquisition of sequences encoding additional now-conserved domains and motifs, including the Gro recruitment motif (Supplementary file 2). This radical evolutionary transformation resulted in a protein with little or no remaining homology to its MTA ancestor, and with an entirely novel regulatory function (Holland et al., 2017).
In this context, it is of interest that the Drosophila Mi-2/NuRD complex, which includes the MTA protein, has recently been shown to engage in direct repression of multiple Notch pathway target genes, independent of both Su(H) and Hairless (Zacharioudaki et al., 2019). Whether this activity preceded the emergence of Hairless is unknown, but the possibility that it is in some way connected to Hairless’s evolutionary history is indeed intriguing.
Materials and methods
Sequence searches, analysis, and annotation
Genome and transcriptome sequences encoding Hairless, Suppressor of Hairless, S-CAP, and MTA proteins from a wide variety of species were recovered via BLAST searches, using either the online version at the NCBI website (Boratyn et al., 2013) or the version implemented by the BlastStation-Local64 desktop application (TM Software, Inc). Sequences were analyzed and annotated using the GenePalette (Rebeiz and Posakony, 2004; Smith et al., 2017) and DNA Strider (Marck, 1988; Douglas, 1995) desktop software tools. Analysis of predicted disordered regions in Hairless was conducted using DISOPRED3 on the PSIPRED server (Buchan et al., 2013; Jones and Cozzetto, 2015).
Generation of constructs for GST pulldown experiments
Strigamia maritima Su(H) protein constructs to test CtBP binding
A codon-optimized fragment corresponding to exons 2 and 3 from S. maritima Su(H) mRNA was synthesized by Genewiz, Inc, and cloned into pRSET-C using Acc65I and BamHI restriction sites. The CtBP-motif mutant was subsequently generated by overlap extension PCR using the primers HISsmarSUH-f (CGCTGGATCCGCGGCCAGTATGAC), HISsmarSUH-r (CCATGGTACCAGTTATGCGTGGTG), HISsmarSUHctbpm-f (AACCACgCCGcaGcTGcGgCTAACAGCCATCGCGGTGAAGGCGGCCAC), HISsmarSUHctbpm-r (GCTGTTAGcCgCAgCtgCGGcGTGGTTGTCGGCGAAGTGAGGGGTCAG). After sequence confirmation, this fragment was also cloned into pRSET-C using the same enzymes. Binding of these constructs to Drosophila melanogaster CtBP was assayed using GST alone and a GST-CtBP fusion protein (Nibu et al., 1998).
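The effect of the mutagenic forward primer can be checked by translating it alongside the corresponding wild-type stretch of the synthesized fragment. The sketch below assumes the standard genetic code and that the reading frame begins at the first base of HISsmarSUHctbpm-f (an inference from the placement of the lowercase substitutions, not something stated in the text); the helper and constant names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1; trailing bases that do not fill a codon are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Wild-type stretch of the synthesized Smar Su(H) fragment that the primer covers.
WT_REGION = "AACCACCCGGTGGACCTGAGCAACAGCCATCGCGGTGAAGGCGGCCAC"
# HISsmarSUHctbpm-f as listed above (lowercase = substituted bases).
MUT_PRIMER = "AACCACgCCGcaGcTGcGgCTAACAGCCATCGCGGTGAAGGCGGCCAC"

print(translate(WT_REGION))   # wild type: contains PVDLS
print(translate(MUT_PRIMER))  # mutant: PVDLS replaced by five alanines
```

Under these assumptions the seven substituted bases convert the PVDLS codons into five alanine codons while leaving the flanking residues unchanged.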
Constructs to test potential Gro-binding motif in Strigamia maritima Su(H)
A truncated version of HLHmβ (HLHmβ-WRPWtrunc) was amplified from a pRSET-HLHmβ-WT construct using the primers HISmbeta-f (cgatggatccgaATGGTTCTGGAAATGGAGATGTCCAAG) and HISmbetatrunc-r (ccatggtaccagTCACATGGGGCCagaggtggagctggcctcgctgggcgc); a version of HLHmβ with the WRPW motif replaced with the amino acids GSLTPPDKV (HLHmβ+Smar-motifWT) was amplified from the WT construct with HISmbeta-f and mbetaSmarSuH-r (ccatggtaccagTCACACTTTATCAGGTGGAGTGAGAGAACCCATGGGGCCagaggtggagctggcc); and a version of HLHmβ with the WRPW motif replaced with a stretch of 9 alanine residues (HLHmβ+Smar-motifMUT) was amplified using HISmbeta-f and mbetaSmarSuHmut-r (ccatggtaccagTCAggctgccgctgcggctgccgctgctgcCATGGGGCCagaggtggagctggcc). Each construct was then cloned into pRSET-C using the restriction enzymes BamHI and Acc65I and sequence verified. Binding of these constructs to Drosophila melanogaster Gro was assayed using GST alone and a GST-Gro fusion protein. The latter construct was made by cloning the full-length Gro coding sequence into the pGEX-KG expression vector at the EcoRI and SalI restriction sites: gtggcgaccatcctccaaaatcggatctggttccgcgtggatccccgggaatttccggtggtggtggtggaattctaATG...TAAATCCACAAAACCATGCAGTTTTTTCATTTTGTAATAAGCTCGTATAGTTTTTATTACAACATGTTCGAAATCATGCAcccgggctgcaggaattcgatatcaagcttatcgataccgtcgactcgagctcaagcttaattcatcgtgactgactgacgatctg (underlined = pGEX-KG vector; uppercase = gro cDNA; bold = gro start and stop codons; italic = linker).
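Because the three HLHmβ variants are specified by reverse primers, the C-terminal residues each one encodes can be read by reverse-complementing the primer and translating. The following sketch assumes the standard genetic code. It also assumes that the first 12 nt of each primer (ccatggtaccag, which contains the Acc65I site) form a non-coding 5' tail and that the 3' lowercase run is the HLHmβ annealing region, so only the stretch in between is analyzed; this partition is inferred from the letter case and is not stated in the text. All names in the code are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (case-insensitive)."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate DNA in frame 1; incomplete trailing codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def encoded_c_terminus(primer_excerpt):
    """Peptide encoded on the coding strand by a reverse-primer excerpt."""
    return translate(reverse_complement(primer_excerpt))


# Excerpts between the 5' tail and the 3' annealing region of each reverse primer.
TRUNC = "TCACATGGGGCC"                             # HISmbetatrunc-r
MOTIF_WT = "TCACACTTTATCAGGTGGAGTGAGAGAACCCATGGGGCC"   # mbetaSmarSuH-r
MOTIF_MUT = "TCAggctgccgctgcggctgccgctgctgcCATGGGGCC"  # mbetaSmarSuHmut-r

for name, excerpt in [("trunc", TRUNC), ("WT", MOTIF_WT), ("MUT", MOTIF_MUT)]:
    print(name, encoded_c_terminus(excerpt))
```

Read this way, the wild-type primer appends GSLTPPDKV followed by a stop codon, the mutant primer appends nine alanines, and the truncation primer places the stop codon directly after the shared upstream residues.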
S-CAP/Hairless constructs for Su(H) interaction analysis
Codon-optimized fragments from Drosophila melanogaster Hairless (residues 192–389) and Parasteatoda tepidariorum cS-CAP (residues 233–432), as well as 5-alanine mutant substitutions (Dmel: GGRLQFFKDGKFILELARSKDGDKSGW → GGRAQAFKDGKFIAEAARSKDGDKSGA; Ptep: VGSLKFFLGGRLVLKLNAQQDGGSGNKCQW → VGSAKAFLGGRLVAKANAQQDGGSGNKCQA), were synthesized by Genewiz, Inc. Inserts were subsequently cloned into pRSET-C using the restriction enzymes BamHI and Acc65I. Binding of these constructs to Drosophila melanogaster Su(H) was assayed using GST alone and a GST-Su(H) fusion protein (Bailey and Posakony, 1995).
GST pulldowns using each of the above constructs were performed as previously described (Fontana and Posakony, 2009).
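The five alanine substitutions in the Hairless and S-CAP SBD constructs can be listed by comparing the wild-type and mutant peptides given above position by position. In this sketch the function name is hypothetical, and the numbering offset for Hairless (first residue of the quoted peptide taken as position 232) is an assumption chosen so that the first substituted leucine corresponds to L235 as named in the Results; the spider peptide is numbered from 1 because no residue numbers are given for it.

```python
def substitutions(wild_type, mutant, first_position=1):
    """List point substitutions as strings such as 'L235A'.

    Hypothetical helper: both peptides must have the same length, and
    first_position is the residue number assigned to the first character.
    """
    if len(wild_type) != len(mutant):
        raise ValueError("peptides must be the same length")
    return [
        f"{wt}{pos}{mut}"
        for pos, (wt, mut) in enumerate(zip(wild_type, mutant), start=first_position)
        if wt != mut
    ]


# Peptides as listed in the construct description above.
DMEL_WT = "GGRLQFFKDGKFILELARSKDGDKSGW"
DMEL_MUT = "GGRAQAFKDGKFIAEAARSKDGDKSGA"
PTEP_WT = "VGSLKFFLGGRLVLKLNAQQDGGSGNKCQW"
PTEP_MUT = "VGSAKAFLGGRLVAKANAQQDGGSGNKCQA"

# Assumed offset: first residue of the Dmel peptide = Hairless position 232.
print(substitutions(DMEL_WT, DMEL_MUT, first_position=232))
# Spider peptide numbered relative to its own first residue.
print(substitutions(PTEP_WT, PTEP_MUT))
```

With that offset, the Drosophila comparison yields exactly the five Su(H)-contacting residues named in the Results (L235, F237, L245, L247, and W258), and the spider peptide shows substitutions of the same residue types in the same order.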
Synthesized, codon-optimized sequences
>Smar Su(H)ex2-3 WT (116 aa)
CGCTGGATCCGCGGCCAGTATGACTACCCGCCGCCGTTAGCCAGCACATACAGCCGCGAGGCCGACCTGTGGAACGTGAACCTGGCCACCTACAGCAGCGCACCGACCACATGCACCGGTGCAACCCCGGCACCTAGCGTTACCGGTTTCTACGCCCAGGCCACCGGCAGCAACAGCGTTAGCCCGAGTAGCGTGAGCCTGACCACCCTGACCCCTCACTTCGCCGACAACCACCCGGTGGACCTGAGCAACAGCCATCGCGGTGAAGGCGGCCACCTGGATCTGGTGCGCTTCCAGAGCGACCGCGTGGATGCCTACAAGCACGCCAACGGCCTGAGCGTGCATATCCCGGACCACCACGCATAACTGGTACCATGG
>Smar Su(H)ex2-3 mut
CGCTGGATCCGCGGCCAGTATGACTACCCGCCGCCGTTAGCCAGCACATACAGCCGCGAGGCCGACCTGTGGAACGTGAACCTGGCCACCTACAGCAGCGCACCGACCACATGCACCGGTGCAACCCCGGCACCTAGCGTTACCGGTTTCTACGCCCAGGCCACCGGCAGCAACAGCGTTAGCCCGAGTAGCGTGAGCCTGACCACCCTGACCCCTCACTTCGCCGACAACCACgCCGcaGcTGcGgCTAACAGCCATCGCGGTGAAGGCGGCCACCTGGATCTGGTGCGCTTCCAGAGCGACCGCGTGGATGCCTACAAGCACGCCAACGGCCTGAGCGTGCATATCCCGGACCACCACGCATAACTGGTACCATGG
>Dmel Hairless192-389 WT
CGATGGATCCGAGCAGTGGTTGCAGCAGCAGCTGGCACTGCCAAAATTGGTAAAGGCAGCAACAGCGGTGGCAGTTTTGATATGGGCCGCACACCGATCAGCACCCACGGCAACAATAGTTGGGGTGGCTATGGCGGCCGTTTACAGTTCTTTAAAGATGGCAAGTTTATTTTAGAACTGGCCCGCAGCAAAGATGGCGATAAAAGCGGCTGGGTGAGTGTGACCCGCAAAACCTTTCGCCCGCCGAGTGCAGCAACCAGCGCAACCGTGACCCCTACCAGTGCCGTGACCACCGCCTACCCGAAGAATGAAAACAGCACCTCTTTAAGCTTCAGCGACGACAATAGCAGCATTCAGAGCAGCCCGTGGCAGCGTGATCAGCCGTGGAAACAGAGTCGTCCGCGCCGTGGCATCAGCAAAGAACTGTCTTTATTTTTCCACCGCCCGCGCAATAGTACACTGGGTCGTGCAGCCTTACGTACCGCAGCCCGCAAACGTCGTCGTCCGCATGAACCGCTGACCACCAGCGAAGATCAGCAGCCGATCTTTGCCACCGCAATCAAAGCCGAGAACGGTGATGATACTTTAAAAGCCGAAGCAGCCGAATAACTGGTACCATGG
>Dmel Hairless192-389 5Amut
CGATGGATCCGAGCCGTTGTGGCAGCAGCAGCTGGCACTGCCAAAATCGGCAAAGGCAGCAATAGCGGTGGTAGCTTTGACATGGGCCGCACCCCGATTAGCACCCATGGCAACAACAGCTGGGGTGGTTATGGTGGTCGTGCCCAAGCTTTTAAAGACGGCAAGTTCATCGCCGAAGCCGCACGCAGCAAAGATGGCGACAAAAGCGGTGCCGTGAGCGTGACCCGCAAAACCTTTCGTCCGCCGAGTGCAGCAACCAGCGCAACCGTTACCCCGACCAGCGCAGTTACCACCGCCTACCCGAAAAACGAAAACAGCACCTCTTTAAGCTTTAGCGACGACAACAGCAGCATTCAGAGCAGCCCGTGGCAGCGCGATCAGCCGTGGAAACAGAGCCGTCCTCGTCGCGGCATCAGCAAAGAGCTGTCTTTATTCTTTCATCGCCCGCGCAATAGCACTTTAGGTCGTGCAGCACTGCGCACAGCAGCACGTAAACGTCGTCGCCCGCATGAACCGCTGACCACCAGCGAAGACCAGCAGCCGATTTTTGCCACCGCAATCAAAGCCGAGAACGGCGATGATACTTTAAAAGCAGAAGCAGCCGAATAACTGGTACCATGG
>Ptep s-CAP233-432 WT
CGATGGATCCGAACCGTGAATACCGAAGATCCGCCGAAGGATAGCATCAACTTTCTGGACCACAGCCGCGTGACCGATCCGTGTAGTGCCGCAAGCGAAACCAGCCTGCCGCAGGATGTGCCGGCAACAAGCACCGTGGGCAGCCTGAAATTTTTTCTGGGCGGTCGCCTGGTGCTGAAATTAAACGCCCAGCAGGATGGCGGCAGCGGCAATAAATGCCAGTGGGTGCAGAGCAACGATCTGCCGAAACATAGCAACCATAACAAAAAAGATAAACATAAGAAAAAATTTGCACCGTATAGCTATAGCAGCAGCGGCACTCAGAAACCGCTGAAGAAAGGCGACGATACCAGTGCCGTGCCGGACTGTGATCCGAGCGGCATCAAAAAGCCGCGCCTGAAAGAGTACGAGACCAGCGAGAATAGCGCCCTGGGTCTGCTGCTGTGCAGCAGCAGTTGGACCCCGCCGGTTGCAGATGGTCAGGAGAGCATTGACGTGGACGATACCAGCAGCAAAACCAGCGAGGGCTATATTAGCCCGATCCTGAGCAACAATAGCCGCACCAGCAAAATCGACACCATCAAGCACGATTTTGCCAGCAACCCGAACACCTAACTGGTACCATGG
>Ptep s-CAP233-432 5Amut
CGATGGATCCGAACCGTGAACACCGAAGACCCGCCGAAAGATAGCATCAACTTTTTAGACCATAGCCGCGTGACAGACCCGTGCAGTGCCGCAAGTGAAACCTCTTTACCGCAAGATGTGCCGGCAACCAGCACCGTGGGTAGCGCCAAAGCCTTTCTGGGCGGTCGTCTGGTGGCCAAAGCCAATGCCCAGCAAGATGGTGGTAGTGGTAACAAATGCCAAGCTGTGCAGAGCAACGATCTGCCGAAACACAGCAATCACAATAAGAAAGACAAACACAAGAAAAAATTTGCCCCGTATAGCTATAGCAGCAGCGGCACCCAGAAACCGCTGAAAAAAGGCGATGACACCAGCGCAGTGCCGGATTGCGATCCGAGCGGCATTAAGAAACCGCGTTTAAAGGAGTACGAGACCAGCGAAAACAGTGCTTTAGGTTTACTGCTGTGCAGCAGCAGTTGGACACCGCCGGTGGCCGATGGTCAAGAAAGTATCGATGTGGACGACACCAGCAGCAAAACCAGCGAAGGCTACATCAGCCCGATTCTGAGCAACAATAGCCGCACCAGCAAAATTGATACCATTAAACATGATTTTGCAAGCAATCCGAATACCTAACTGGTACCATGG
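The two Smar Su(H)ex2-3 entries above differ only in a short stretch that contains lowercase letters in the mutant sequence. The sketch below translates that stretch from both entries with the standard genetic code to show the resulting amino acid change. The 21-nucleotide windows are copied from the sequences above; the reading frame (starting at the CAC codon) is an assumption made for this illustration, and the names `CODON_TABLE`, `translate`, `WT_WINDOW` and `MUT_WINDOW` are hypothetical.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string (case-insensitive) to one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Windows copied from the Smar Su(H)ex2-3 WT and mut entries.
# ASSUMPTION: the reading frame begins at the first base of each window.
WT_WINDOW = "CACCCGGTGGACCTGAGCAAC"
MUT_WINDOW = "CACgCCGcaGcTGcGgCTAAC"

print(translate(WT_WINDOW))   # HPVDLSN
print(translate(MUT_WINDOW))  # HAAAAAN
```

Under that frame assumption, the wild-type window encodes PVDLS between the flanking H and N residues, and the mutant window replaces all five residues with alanine.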
Data availability
All data generated or analysed during this study are included in the manuscript and supporting files.
References
- The NuRD architecture. Cellular and Molecular Life Sciences 70:3513–3524. https://doi.org/10.1007/s00018-012-1256-2
- Articulating "archiannelids": phylogenomics and annelid relationships, with emphasis on meiofaunal taxa. Molecular Biology and Evolution 32:2860–2875. https://doi.org/10.1093/molbev/msv157
- No accumulation of transposable elements in asexual arthropods. Molecular Biology and Evolution 33:697–706. https://doi.org/10.1093/molbev/msv261
- BLAST: a more efficient report with usability improvements. Nucleic Acids Research 41:W29–W33. https://doi.org/10.1093/nar/gkt282
- Xenacoelomorpha survey reveals that all 11 animal homeobox gene classes were present in the first bilaterians. Genome Biology and Evolution 10:2205–2217. https://doi.org/10.1093/gbe/evy170
- Scalable web services for the PSIPRED protein analysis workbench. Nucleic Acids Research 41:W349–W357. https://doi.org/10.1093/nar/gkt381
- Spider transcriptomes identify ancient large-scale gene duplication event potentially important in silk gland evolution. Genome Biology and Evolution 7:1856–1870. https://doi.org/10.1093/gbe/evv110
- DNA strider. An inexpensive sequence analysis package for the Macintosh. Molecular Biotechnology 3:37–45. https://doi.org/10.1007/BF02821333
- Genomic insights into the Ixodes scapularis tick vector of lyme disease. Nature Communications 7:10507. https://doi.org/10.1038/ncomms10507
- New genes from old: asymmetric divergence of gene duplicates and the evolution of development. Philosophical Transactions of the Royal Society B: Biological Sciences 372:20150480. https://doi.org/10.1098/rstb.2015.0480
- Genomic features of the damselfly Calopteryx splendens representing a sister clade to most insect orders. Genome Biology and Evolution 9:415–430. https://doi.org/10.1093/gbe/evx006
- Progress, pitfalls and parallel universes: a history of insect phylogenetics. Journal of the Royal Society Interface 13:20160363. https://doi.org/10.1098/rsif.2016.0363
- Phylogenomics of Lophotrochozoa with consideration of systematic error. Systematic Biology 66:256–282. https://doi.org/10.1093/sysbio/syw079
- Whole transcriptome analysis of the monogonont rotifer Brachionus koreanus provides molecular resources for developing biomarkers of carbohydrate metabolism. Comparative Biochemistry and Physiology Part D: Genomics and Proteomics 14:33–41. https://doi.org/10.1016/j.cbd.2015.02.003
- Re-evaluating the phylogeny of Sipuncula through transcriptomics. Molecular Phylogenetics and Evolution 83:174–183. https://doi.org/10.1016/j.ympev.2014.10.019
- Nemertean and phoronid genomes reveal lophotrochozoan evolution and the origin of bilaterian heads. Nature Ecology & Evolution 2:141–151. https://doi.org/10.1038/s41559-017-0389-y
- Hairless, a Drosophila gene involved in neural development, encodes a novel, serine rich protein. Mechanisms of Development 38:143–156. https://doi.org/10.1016/0925-4773(92)90006-6
- Towards an understanding of the structure and function of MTA1. Cancer and Metastasis Reviews 33:857–867. https://doi.org/10.1007/s10555-014-9513-5
- GenePalette: a universal software tool for genome sequence visualization and analysis. Developmental Biology 271:431–438. https://doi.org/10.1016/j.ydbio.2004.04.011
- Reanalyzing the Palaeoptera problem - The origin of insect flight remains obscure. Arthropod Structure & Development 47:328–338. https://doi.org/10.1016/j.asd.2018.05.002
- Groucho oligomerization is required for repression in vivo. Molecular and Cellular Biology 24:4341–4350. https://doi.org/10.1128/MCB.24.10.4341-4350.2004
- Developmental system drift and flexibility in evolutionary trajectories. Evolution and Development 3:109–119. https://doi.org/10.1046/j.1525-142x.2001.003002109.x
- Nemertean toxin genes revealed through transcriptome sequencing. Genome Biology and Evolution 6:3314–3325. https://doi.org/10.1093/gbe/evu258
Article and author information
Funding
National Institute of General Medical Sciences (R01GM046993)
- James W Posakony
National Institute of General Medical Sciences (R01GM120377)
- James W Posakony
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We are especially grateful to our many colleagues who made their genome and transcriptome sequence assemblies freely available; without their generosity, this study would not have been possible. We thank Scott Rifkin and Mark Rebeiz for helpful discussion and input during the preparation of the manuscript. We also thank the following artists for making available the illustrations shown in Figure 2A: crab, by Firkin (https://openclipart.org/detail/270221/crab-silhouette); monarch butterfly, by carolemagnet (https://openclipart.org/detail/263384/monarch-butterfly); centipede, by Firkin (https://openclipart.org/detail/261126/centipede-3); spider, by liftarn (https://openclipart.org/detail/179190/spider); horseshoe crab, by Gosc (https://openclipart.org/detail/174556/horseshoe-crab); tick, by Juhele (https://openclipart.org/detail/279073/simple-tick-ixodes-ricinus-silhouette). This work was supported by NIH Grants R01GM046993 and R01GM120377 (to JWP).
Copyright
© 2019, Miller et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.