Telomeres are nucleoprotein complexes at the ends of chromosomes and are indispensable for the protection and lengthening of terminal DNA. Despite the evolutionarily conserved roles of telomeres, the telomeric double-strand DNA (dsDNA)-binding proteins have evolved rapidly. Here, we identified double-strand telomeric DNA-binding proteins (DTN-1 and DTN-2) in Caenorhabditis elegans as non-canonical telomeric dsDNA-binding proteins. DTN-1 and DTN-2 are paralogous proteins that have three putative MYB-like DNA-binding domains and bind to telomeric dsDNA in a sequence-specific manner. DTN-1 and DTN-2 form complexes with the single-strand telomeric DNA-binding proteins POT-1 and POT-2 and constitutively localize to telomeres. The dtn-1 and dtn-2 genes function redundantly, and their simultaneous deletion results in progressive germline mortality, which accompanies telomere hyper-elongation and chromosomal bridges. Our study suggests that DTN-1 and DTN-2 are core shelterin components in C. elegans telomeres that act as negative regulators of telomere length and are essential for germline immortality.
Telomeres at the ends of linear eukaryotic chromosomes are composed of tandem repeats of short G-rich DNA sequences – (TTAGGG)n in vertebrates – and sequence-specific DNA-binding proteins (Palm and de Lange, 2008). The telomere nucleoprotein complex has various pivotal functions such as protection of chromosome ends, lengthening of the terminal DNA, and promotion of meiotic homolog pairing (Dilley and Greenberg, 2015; Shay, 2016; Shibuya et al., 2015; Shibuya et al., 2014; Zhang et al., 2017). Although telomeres have ancient evolutionarily conserved roles and conserved DNA sequences, the telomeric double-strand DNA (dsDNA)-binding proteins have evolved rapidly, a phenomenon referred to as the telomere paradox (Saint-Leandre and Levine, 2020). Despite their low-sequence conservation, telomeric dsDNA-binding proteins in a wide variety of species – such as the fission yeast protein Taz1, the plant protein RTBP1, the protist (Trypanosoma) protein tbTRF, and the mammalian proteins TRF1 and TRF2 – typically have a single MYB-like DNA-binding domain (MYB) at their C-termini (known as a telobox) that directly recognizes the telomeric dsDNA in a sequence-specific manner (Broccoli et al., 1997; Li et al., 2005; Spink et al., 2000; Yu et al., 2000).
A deviation is found in budding yeast, where telomeric dsDNA is bound by the Rap1 protein with two tandem MYB domains (Konig et al., 1996; Krauskopf and Blackburn, 1996), and this deviation has been attributed to the atypical telomeric dsDNA sequence (heterogeneous and not GC-rich) in this organism (Červenák et al., 2017). Another deviation is found in Caenorhabditis elegans, which has a typical telomeric dsDNA sequence (TTAGGC)n, but its genome does not have any TRF-like single MYB domain proteins or RAP1-like telomeric proteins (Wicky et al., 1996). There have been several reports showing that some transcription factors and chromatin remodelers, such as CEH-37 and HMG-5, bind to telomeric dsDNA in C. elegans, but deletions of these factors did not show any chromosomal defects, thus leaving the true regulator of telomeric dsDNA unidentified (Im and Lee, 2003; Kim et al., 2003; Lanjuin et al., 2003). It is curious why C. elegans lost the typical telomeric dsDNA-binding proteins while retaining the typical telomeric DNA sequence and how they maintain telomeric functions without these telomeric proteins. The identification of telomeric dsDNA-binding proteins in this organism will provide information on how general telomeric function is ensured by different telomeric dsDNA-binding proteins.
The recognition of telomeric dsDNA via dsDNA-binding proteins leads to assemblies of downstream telomere-associating proteins, thus forming the shelterin complexes (Palm and de Lange, 2008). An evolutionarily conserved component of the shelterin complex is the protection of telomere (POT) proteins, which directly recognize the telomeric single-strand DNA (ssDNA) through their conserved OB-fold domains. POT proteins generally act as negative regulators of telomerases through competitive binding to the telomeric ssDNA (Kelleher et al., 2005). Different from the dsDNA-recognition proteins, the POT proteins are well conserved, including in C. elegans, and it is reported that POT-1 and POT-2 (also known as CeOB2 and CeOB1) in C. elegans function as negative regulators of telomerase (Raices et al., 2008; Shtessel et al., 2013).
In this study, we screened for proteins that bind to POT-1 and identified two uncharacterized proteins, double-strand telomeric DNA-binding protein 1 and 2 (DTN-1 and DTN-2), in C. elegans. DTN-1 and DTN-2 are paralogous proteins, sharing 70% amino acid identity with each other. We performed secondary structure predictions and identified three tandem MYB domains at their N-termini, and we found that they exhibited sequence-specific dsDNA-binding activity toward telomeric sequences. DTN-1 and DTN-2 localized to telomeres both in somatic cells and germ cells from embryo to adulthood, suggesting that they are constitutive telomere-binding proteins in vivo. Notably, the double knockout worms showed synthetic fertility defects that were transgenerationally progressive and were accompanied by various chromosomal defects, including chromosomal non-disjunction in meiosis, chromosomal bridges, and abnormal extensions of telomeric DNAs. Our findings suggest that DTN-1 and DTN-2 are bona fide telomeric dsDNA-binding proteins in C. elegans and that they are indispensable for the maintenance of germline immortality and telomere length homeostasis.
To identify novel telomeric proteins in C. elegans, we used a yeast two-hybrid (Y2H) approach to screen for POT-1-binding proteins from the C. elegans mixed-stage cDNA library, and we identified two functionally uncharacterized proteins encoded by the R06A4.2 and T12E12.3 genes (Figure 1A and Figure 1—figure supplement 1). These proteins, hereafter referred to as double-strand telomeric DNA-binding proteins 1 and 2 (DTN-1 and DTN-2), respectively, have three putative MYB domains tandemly aligned in their N-terminal regions followed by a cluster of acidic amino acids in the middle (Figure 1B and Figure 1—figure supplement 2), which is similar to the domain configuration of the canonical c-MYB transcription factor. The POT-1-binding region (PBR) identified by the Y2H screening is located at the C-termini of these proteins (Figure 1B and Figure 1—figure supplement 1), where the amino acid sequences are highly conserved between DTN-1 and DTN-2 (80% identity). We confirmed by the Y2H analysis that both DTN-1 and DTN-2 bind to POT-1, but not POT-2 (Figure 1C), in a manner dependent on the C-terminal PBR (Figure 1D). To verify their in vivo interactions, we integrated three tandem FLAG tags followed by a GFP tag onto the endogenous dtn-1 and dtn-2 loci using CRISPR-Cas9 gene editing, and we purified the endogenous protein complex by FLAG immunoprecipitation (IP). Western blotting showed the specific enrichment of the DTN-1-FLAG-GFP and DTN-2-FLAG-GFP proteins in the knock-in strain extracts, but not in wild type (N2) (Figure 1E). Western blotting with polyclonal antibodies against POT-1 and POT-2 showed that both endogenous POT-1 and POT-2 proteins were co-precipitated with both DTN-1-FLAG-GFP and DTN-2-FLAG-GFP, indicating that they form stable complexes in vivo (Figure 1E).
Quantitative mass spectrometry analysis, which is an antibody-independent approach and is more comprehensive, also confirmed the presence of POT-1 and POT-2 in the FLAG immunoprecipitates (Figure 1F). The reciprocal IP experiments of GFP-POT-1 and POT-2-GFP from endogenously tagged strains also showed that both GFP-POT-1 and POT-2-GFP co-precipitated endogenous DTN-1 and DTN-2 (Figure 1—figure supplement 3). Together these results suggest that DTN-1 and DTN-2 are telomeric proteins in C. elegans that directly bind to POT-1 and indirectly bind to POT-2 in vivo.
The three putative MYB domains in DTN-1 and DTN-2 are each composed of three alpha helices, which is characteristic of other MYB domains (Figure 2A). However, the sequence alignment of their MYB domains showed that their amino acids are highly divergent from those of known telomeric dsDNA-binding proteins found in other eukaryotes. The tryptophan residues in helices 1 and 2 (shown in the yellow rectangle in Figure 2A), which are known to be important for maintaining the helix-turn-helix structure and thus for the DNA-binding activity (Zargarian et al., 1999), are conserved in the second and third MYB domains in both DTN-1 and DTN-2. However, the basic amino acids in helix 3 (shown in the red rectangle in Figure 2A), which are known to make direct contact with the telomeric dsDNA (Nishikawa et al., 2001), are poorly conserved in DTN-1 and DTN-2. Phylogenetic analysis further confirmed that the MYB domains of DTN-1 and DTN-2 form a unique cluster that is branched from the canonical single MYB domain telomeric factors (i.e. the telobox found in TRF1/2 in mammals, RTBP1 in plants, and Taz1 in fission yeast) and the Rap1 protein in budding yeast, suggesting that DTN-1 and DTN-2 have distinct evolutionary origins from the known telomeric factors (Figure 2B).
To test whether DTN-1 and DTN-2 have direct DNA-binding activity, we purified recombinant proteins fused with the MBP tag (Figure 2C) and performed an in vitro electrophoretic mobility shift assay (EMSA). Notably, both MBP-DTN-1 and MBP-DTN-2 showed robust dsDNA-binding activity toward the C. elegans telomeric sequence (TTAGGC)15 but not to the scrambled sequence (GCTGTA)15 (Figure 2D). The quantification of the EMSA suggested that DTN-2 (Kd = 0.54 ± 0.047 μM) binds to telomeric dsDNA 1.7 times more strongly than DTN-1 (Kd = 0.93 ± 0.023 μM). Both MBP-DTN-1 and MBP-DTN-2 bound only very weakly to the shorter telomeric DNA containing 1, 2, or 3 telomeric repeats, suggesting that the robust binding requires longer (more than three repeats) dsDNA (Figure 2—figure supplement 1) and that these proteins preferentially bind to the terminal telomere repeats rather than to the interstitial telomeric sequences under physiological conditions.
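The relationship between the two reported Kd values can be checked with a short calculation. The following is only a minimal sketch: it assumes a simple single-site binding model in which the labeled probe is far below Kd, which is not necessarily the model used for the quantification in Figure 2D, and the titration concentrations are hypothetical values chosen for illustration. Only the two Kd values are taken from the text.

```python
# Dissociation constants reported in the text (micromolar).
KD_UM = {"DTN-1": 0.93, "DTN-2": 0.54}


def fraction_bound(protein_um, kd_um):
    """Fraction of probe bound under a single-site model (an assumption)."""
    if protein_um < 0 or kd_um <= 0:
        raise ValueError("protein concentration must be >= 0 and Kd must be > 0")
    return protein_um / (kd_um + protein_um)


def fold_difference(kd_weaker_um, kd_stronger_um):
    """Ratio of two Kd values; >1 means the second protein binds more tightly."""
    return kd_weaker_um / kd_stronger_um


ratio = fold_difference(KD_UM["DTN-1"], KD_UM["DTN-2"])
print(f"DTN-2 binds {ratio:.1f} times more strongly than DTN-1")

# Hypothetical titration points (micromolar), for illustration only.
for conc in (0.1, 0.5, 1.0, 2.0):
    row = ", ".join(
        f"{name}: {fraction_bound(conc, kd):.2f}" for name, kd in KD_UM.items()
    )
    print(f"{conc:.1f} uM protein -> fraction bound {row}")
```

Under this model, half of the probe is bound when the protein concentration equals Kd, so the lower Kd of DTN-2 corresponds to a binding curve shifted toward lower protein concentrations.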
To determine the in vivo localization of DTN-1 and DTN-2, we analyzed the GFP signals in the knock-in worms expressing FLAG-GFP-tagged DTN-1 and DTN-2 under the control of their native promoters. Their embryos showed 18–29 punctate GFP foci specifically localized within the cell nuclei (Figure 3A). The average numbers of these foci per nucleus were 23 and 22 for DTN-1-FLAG-GFP and DTN-2-FLAG-GFP, respectively, which approximately corresponded to the number of telomeres in C. elegans (12 chromosomes and 24 telomeres). Further, the observation of pachytene oocytes in the knock-in worms’ germlines showed approximately half the number of foci (Figure 3A, average of 12 foci per nucleus for both DTN-1 and DTN-2), which was likely due to the occurrence of meiotic homologous synapsis that reduces the apparent numbers of telomeres by half. The close observation of condensed bivalent chromosomes at the later diakinesis stage of meiosis showed eight distinct foci located at the ends of condensed chromosomes, corresponding to the telomeres of the individual chromatids (Figure 3B). To further confirm that these foci represent the telomeres, we performed immunostaining of the knock-in worms with a GFP antibody followed by fluorescent in situ hybridization (FISH) with a C. elegans telomeric DNA probe (TTAGGC)3. In the embryonic nuclei, the observed GFP foci were almost completely colocalized with the telomeric FISH signals, indicating that these GFP foci were bona fide telomeric signals (Figure 3C). In addition to embryos and germline cells, we also observed punctate GFP foci in all post-mitotic somatic nuclei in adult worms, including epidermal cells and intestinal cells (Figure 3—figure supplement 1). Intestinal cells in C. elegans become polyploid during post-embryonic development after undergoing several rounds of endomitosis (Hedgecock and White, 1985), and accordingly we observed numerous GFP foci in the adult intestinal nuclei (Figure 3—figure supplement 1), consistent with their larger telomere number compared to other somatic cells. Together, our data suggest that DTN-1 and DTN-2 localize to telomeres in both somatic and germ cells from embryo to adulthood and thus function as constitutive telomere-binding proteins in C. elegans.
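The focus counts expected at each stage follow from simple chromosome arithmetic, which can be tallied as a quick check. This is a minimal sketch, assuming a haploid number of six (consistent with the 12 chromosomes stated above), that synapsed homologs appear as a single focus per chromosome end, and that sister chromatid ends are not resolved in embryonic nuclei; the function names are illustrative only.

```python
HAPLOID_NUMBER = 6   # C. elegans haploid chromosome number (diploid = 12)
ENDS_PER_CHROMOSOME = 2


def foci_in_diploid_nucleus(haploid_n=HAPLOID_NUMBER):
    """One focus per chromosome end when homologs are unpaired."""
    return 2 * haploid_n * ENDS_PER_CHROMOSOME


def foci_in_pachytene_nucleus(haploid_n=HAPLOID_NUMBER):
    """Synapsed homologs share a focus at each end, halving the apparent count."""
    return haploid_n * ENDS_PER_CHROMOSOME


def foci_per_diakinesis_bivalent():
    """Two homologs x two sister chromatids x two ends, each resolved."""
    homologs, sisters = 2, 2
    return homologs * sisters * ENDS_PER_CHROMOSOME


print("embryonic nucleus:", foci_in_diploid_nucleus())         # observed averages: 23 and 22
print("pachytene nucleus:", foci_in_pachytene_nucleus())       # observed average: 12
print("diakinesis bivalent:", foci_per_diakinesis_bivalent())  # observed: eight
```

The observed averages sit at or slightly below these upper bounds, as would be expected if a few foci overlap or fall outside the imaged volume.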
To gain insights into the functions of DTN-1 and DTN-2, we made knockout (KO) worms by deleting almost the entire coding regions of the dtn-1 and dtn-2 genes using CRISPR-Cas9 gene editing (Figure 4A and B). Western blotting using polyclonal antibodies against DTN-1 and DTN-2 confirmed that the specific bands appeared between 100 kDa and 150 kDa (close to the expected molecular weights of 95 kDa for DTN-1 and 93 kDa for DTN-2) in wild type (N2) worm extracts and that these bands completely disappeared in extracts from both corresponding KO worms (Figure 4C), suggesting that the protein expression was abolished in these KO worms. The western blot showed that the expression level of DTN-1 in the dtn-2 KO worm was comparable to wild type (N2) and vice versa, suggesting that the protein stability of DTN-1 and DTN-2 is mutually independent (Figure 4C).
Notably, we observed no abnormalities in either of the single KO worms – they looked healthy, were maintained almost perpetually through self-fertilization under normal laboratory conditions, and had numbers of progeny comparable to those of wild type (N2) worms (Figure 4D, lane 3 and lane 7 in the graph). To investigate the possible redundancy in their functions, we crossed single KO worms to obtain double heterozygous hermaphrodites (dtn-1+⁄−; dtn-2+⁄−), which also appeared healthy and normal. From this parental strain, we isolated individual F1 progeny and performed the fertility assay. After confirming the cessation of egg laying, the genotypes of individual F1 worms were determined by single worm genotyping (Figure 4D). The double KO worms (dtn-1−⁄−; dtn-2−⁄−) appeared at the expected Mendelian ratio among the F1 progeny, suggesting that dtn-1 and dtn-2 are not essential for embryonic development (Figure 4D, lane 9 in the graph). However, counting of brood size showed that the double KO worms exhibited severe fertility defects or were completely sterile (42% of the worms were completely sterile in the first generation). Intriguingly, retention of one intact allele of the dtn-1 or dtn-2 gene was sufficient to rescue the fertility defects, as shown by the normal brood sizes of dtn-1+⁄−; dtn-2−⁄− and dtn-1−⁄−; dtn-2+⁄− worms (Figure 4D, lane 6 and lane 8 in the graph), suggesting that these genes have redundant functions in the maintenance of fertility. We confirmed that the telomeric localization of DTN-1 and DTN-2 was mutually independent (Figure 4—figure supplement 1), further supporting their redundant roles at telomeres.
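The expected Mendelian ratio for the progeny of a self-fertilizing double heterozygote can be enumerated directly. The sketch below assumes that the two loci assort independently (that is, that they are unlinked, which is an assumption made here for illustration) and that all gametes are equally viable; the genotype labels are written in plain ASCII.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def selfing_genotype_frequencies():
    """Expected progeny genotype frequencies from dtn-1(+/-); dtn-2(+/-) selfing."""
    alleles = ("+", "-")
    # Each gamete carries one dtn-1 allele and one dtn-2 allele (4 equally likely types).
    gametes = list(product(alleles, alleles))
    counts = Counter()
    for egg, sperm in product(gametes, repeat=2):
        dtn1 = "".join(sorted((egg[0], sperm[0])))  # "++", "+-", or "--"
        dtn2 = "".join(sorted((egg[1], sperm[1])))
        counts[(dtn1, dtn2)] += 1
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


frequencies = selfing_genotype_frequencies()
for (dtn1, dtn2), freq in sorted(frequencies.items()):
    print(f"dtn-1({dtn1}); dtn-2({dtn2}): {freq}")
```

Under these assumptions there are nine genotype classes, and the double KO class is expected in 1 of 16 progeny, so a large number of individually genotyped worms is needed to obtain a useful sample of double KO animals.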
Even though there were only a few offspring born from the double KO worms, the continuous self-fertilization of the double KO hermaphrodites in successive generations resulted in complete sterility within a few generations, suggesting that the defect is transgenerationally progressive (Figure 4E). In addition to the fertility defects, we could also see a variety of morphological defects in late-generation double KO worms, such as dumpy phenotype or larval arrest (Figure 4F), suggesting that some somatic defects had accumulated in these worms.
C. elegans hermaphrodites have two X chromosomes (XX), which are stably maintained during self-fertilization. Spontaneous X chromosome non-disjunction during meiosis produces male (XO) progeny, which rarely appear (~0.2%) under normal conditions (Hodgkin et al., 1979). During the course of our experiments, we noticed that the double KO (dtn-1; dtn-2) hermaphrodites produced male progeny at an abnormally high frequency (10%), suggesting that chromosomal non-disjunction occurred more frequently in meiosis in the double KO worms (Figure 5A). Furthermore, close inspection of the somatic nuclei revealed the prevalence of chromosomal bridges, especially in large intestinal nuclei, in the double KO worms (Figure 5B and C). The chromosomal bridge is a characteristic phenotype that has also been reported in mutant worms lacking genes encoding the telomerase catalytic subunit TRT-1 and in mutant worms lacking genes required for DNA replication and thus is an indication of aberrant chromosomal fusion or catenation (Korzelius et al., 2011; Meier et al., 2006). Interestingly, single or multiple telomeric FISH signals always coincided with the stretched DNAs between the bridging nuclei (Figure 5D), suggesting that these bridges likely occurred due to fusion or replication defects in their telomeric DNAs.
Because of the severe fertility defects, we could not collect large amounts of DNA samples from the double KO worms, and thus the biochemical characterization of their telomeric DNA was experimentally unfeasible. As an alternative, we performed quantitative fluorescent in situ hybridization (Q-FISH) using the telomeric probe. To eliminate artifacts caused by differences in cell cycle stage, we focused on post-mitotic somatic nuclei found in adult worms. Notably, the double KO worms had stronger telomeric FISH signals than wild type (N2) worms, and quantification in epidermal nuclei showed that the signal intensities in double KO worms were 5.7 times stronger than in wild type (N2) worms, suggesting that telomeric DNAs were abnormally elongated in the double KO worms (Figure 5E). We confirmed that the number of telomeric FISH foci in each epidermal nucleus was comparable between wild type (N2) and double KO worms, suggesting that the stronger telomere FISH signal in the double KO worms was not due to telomere fusion (Figure 5—figure supplement 1). Southern blot experiments showed that dtn-1 single KO worms had abnormally elongated telomeres, while dtn-2 single KO worms had similar or even slightly shorter telomeres compared to wild type (N2) (Figure 5F and Figure 5—figure supplement 2), which was also confirmed by Q-FISH (Figure 5G), suggesting that DTN-1 but not DTN-2 is responsible for the negative regulation of telomere length. Collectively, we conclude that DTN-1 and DTN-2 are redundantly required for germline immortality while having distinct roles in the maintenance of telomere length, and that when they are deleted simultaneously the worms exhibit mortal germlines accompanied by multiple chromosomal defects, including X chromosome non-disjunction in meiosis, chromosomal bridges, and hyper-elongation of their telomeric DNAs (Figure 5H).
Canonical telomeric dsDNA-binding proteins have a single MYB domain at their C-termini and are found in a number of eukaryotic species, including fission yeast, protists, plants, and mammals (Bilaud et al., 1996; Broccoli et al., 1997; Červenák et al., 2017; Li et al., 2005; Spink et al., 2000; Yu et al., 2000). Despite extensive efforts, corresponding telomeric dsDNA-binding proteins have not been identified in the nematode C. elegans. Conventional genetic screening for mutant worms with mortal germlines successfully identified several key factors, such as MRT-2 and MRT-1, which are indispensable for telomeric DNA replication and telomerase activity, respectively, but failed to identify the key telomeric dsDNA-binding proteins (Ahmed and Hodgkin, 2000; Meier et al., 2009). Using protein-interaction screening, the present study identified two non-canonical telomeric dsDNA-binding proteins, DTN-1 and DTN-2, in C. elegans. DTN-1 and DTN-2 are paralogous proteins and have redundant roles in the maintenance of germline immortality. Notably, even a single allele of either the dtn-1 or dtn-2 gene is sufficient to sustain germline immortality, which is likely to be the reason why these factors have evaded identification by conventional genetic screens.
Structural modeling of DTN-1 and DTN-2 identified three putative MYB domains at their N-terminal regions followed by an acidic domain, which is similar to the domain configuration of the c-MYB transcription factor. The MYB domains of DTN-1 and DTN-2 are highly divergent from those of other telomeric proteins, and thus they seem to have distinct evolutionary origins. It is still unclear why these distinct telomeric proteins evolved while the telomeric DNA sequence has remained rather static (Bilaud et al., 1996).
For the recognition of dsDNA, two MYB domains function as a unit to hold the dsDNA. The canonical telomeric proteins, with single MYB domains, such as TRF1, TRF2, Taz1, and RTBP1, achieve this by forming a homodimer through their N-terminal domains (Bianchi et al., 1997; Fairall et al., 2001; Spink et al., 2000; Yu et al., 2000). In the case of the c-MYB transcription factor, two successive MYB domains (MYB2 and MYB3) within a single molecule are responsible for direct dsDNA recognition (Sakura et al., 1989). In the present study, we have shown that both DTN-1 and DTN-2 have robust sequence-specific dsDNA-binding activity toward the C. elegans telomeric sequence. It will be interesting to investigate how the three tandem MYB domains in DTN-1 and DTN-2 orchestrate substrate binding by determining their crystal structure. Structural comparison with the known telomeric proteins or with the c-MYB transcription factor might also provide insight into the evolutionary origin of these non-canonical dsDNA-binding proteins.
Telomeres are characterized by the 3′ G-rich ssDNA overhang found in almost all eukaryote species, and these overhangs are bound by the POT proteins (POT1 in human) (Palm and de Lange, 2008). In humans, the telomeric dsDNA-binding proteins TRF1 and TRF2 are responsible for the telomeric localization of POT1 (Sfeir and de Lange, 2012). TRF1 and TRF2 (as well as their accessory protein RAP1) indirectly bind to and recruit POT1 through the bridging proteins TIN2 and TPP1, and thus they form the hetero-hexameric shelterin complex TRF1-TRF2-RAP1-TIN2-TPP1-POT1 (de Lange, 2018). In C. elegans, there are both 3′ G-rich and 5′ C-rich ssDNA overhangs at terminal DNAs, which are bound by POT-2 and POT-1, respectively (Raices et al., 2008). We found that the C-termini of DTN-1 and DTN-2 directly bind to POT-1, and thus there seem to be no bridging proteins analogous to mammalian TIN2 and TPP1 in C. elegans. In this sense, the C. elegans shelterin complex seems to be a simpler system, where the dsDNA recognition module is directly connected to the ssDNA recognition module. It is noteworthy that mammalian TPP1 is not merely the bridging protein required for the localization of POT1; it also functions as a regulator of telomerase activity through direct binding to the telomerase catalytic subunit TERT (Nandakumar et al., 2012; Wang et al., 2007). DTN-1 and DTN-2 are much bigger proteins than mammalian TRF1 and TRF2, and it is possible that DTN-1 and DTN-2 have some additional functions – such as telomerase regulation – that are carried out by mammalian TPP1. Given that we could not detect any direct interaction between DTN-1/2 and POT-2, it is possible that there are uncharacterized bridging proteins linking DTN-1/2 and POT-2 in C. elegans, and such proteins might have a function analogous to that of mammalian TIN2 and TPP1. The analysis of the epistatic relationships between these proteins, as well as the screening of additional factors that bind to DTN-1 and DTN-2, should help to fully uncover the function of the C. elegans shelterin complex and show how telomerase is regulated in this organism.
The primary role of DTN-1 and DTN-2 in the maintenance of telomere homeostasis remains enigmatic. We have shown that the double KO worms showed signs of chromosomal abnormalities such as chromosomal non-disjunction in meiosis I (as indicated by a high incidence of male progeny) and chromosomal fusions in intestinal cells. Together with the progressive sterility phenotypes found in the double KO worms, these observations lead us to speculate that these mutants are defective in the homeostasis of telomeric dsDNA, ssDNA, or both. Indeed, our data showed that the double KO and dtn-1 single KO worms, but not dtn-2 single KO worms, had extremely long telomeres compared to wild type (N2). These findings suggest that DTN-1 and DTN-2 have distinct roles in the maintenance of telomere length, and the role of DTN-1 seems similar to that of fission yeast Taz1, budding yeast Rap1, and mammalian TRF1, the deletions or mutations of which result in the hyper-elongation of telomeric DNA (Cooper et al., 1997; Krauskopf and Blackburn, 1996; van Steensel and de Lange, 1997). Notably, the deletion of pot-1, pot-2, or both in C. elegans results in the hyper-elongation of telomeres in a manner dependent on telomerase, but these worms do not exhibit chromosomal fusion or a high incidence of male progeny and are completely fertile (Raices et al., 2008; Shtessel et al., 2013). This suggests that the hyper-elongated telomeres in the dtn-1 and dtn-2 double KO worms are less likely to be the primary reason for the observed chromosomal defects and mortal germlines, and thus there must be some additional defects such as deprotection of telomeres and subsequent activation of the DNA-damage response pathway in the double KO worms. Further analyses will show how DTN-1 and DTN-2 protect telomeric DNAs and how evolutionarily conserved telomeric function is ensured by these distinct telomeric proteins.
Worms were grown at 20°C and maintained as described (Brenner, 1974). The following strains were used in this study: Bristol N2 wild-type strain, dtn-1(syb1925), dtn-2(syb1886), dtn-1::flag::gfp (syb2016), dtn-2::flag::gfp (syb1995), HS001 (syb1925/syb1925; syb1995/syb1995), HS002 (syb2016/syb2016; syb1886/syb1886), PHX2217 (nT1[qIs51]/syb1886; syb1925/syb1925), gfp::flag::pot-1 (syb3002), pot-2::gfp (syb889). dtn-1(syb1925) was crossed with dtn-2(syb1886) to isolate double heterozygous worms. The double KO worms (syb1925/syb1925; syb1886/syb1886) generated from the balancer strain (PHX2217) were used in all experiments shown in Figure 5. All mutant alleles were generated by CRISPR-Cas9 and verified by PCR and sequencing. PCR primers used for the genotyping are listed in the key resources table.
The homology search for MYB domains and the subsequent phylogenetic analysis were performed using the CLUSTALW program (https://www.genome.jp/tools-bin/clustalw). The presence of MYB domains in DTN-1 and DTN-2 was predicted by structural modeling using Phyre2 (http://www.sbg.bio.ic.ac.uk/~phyre2/html/page.cgi?id=index) and manual alignment. The prediction of protein secondary structure was performed using Jpred 4 (http://www.compbio.dundee.ac.uk/jpred/).
Y2H screening was performed by Hybrigenics Services, Paris, France. The coding sequence for pot-1 was cloned into pB27 as a C-terminal fusion to LexA (LexA-pot-1). The construct was used as a bait to screen a random-primed C. elegans mixed-stage cDNA library constructed in pP6. Using a mating approach with YHGX13 and L40ΔGal4 yeast strains, 176 million clones were screened. A total of 82 positive colonies were selected on selective plates. The prey fragments of the positive clones were amplified by PCR and sequenced at their 5′ and 3′ junctions. The resulting sequences were used to identify the corresponding interacting proteins in the GenBank database (NCBI) using a fully automated procedure. For the yeast two-hybrid assay, dtn-1, dtn-2, dtn1ΔPBR (a.a. 1–736), and dtn2ΔPBR (a.a. 1–715) cDNAs were cloned into the pGBKT7 vector. pot-1 and pot-2 cDNAs were cloned into the pGADT7 vector. These bait and prey were co-transformed into the yeast strain AH109, and the positive transformants were selected on nutrition-restricted plates (SD-tryptophan-leucine-histidine-adenine).
dtn-1 and dtn-2 cDNAs were cloned into pMAL-c5X (New England Biolabs) for expression with an N-terminal MBP tag. Constructs were expressed in BL21 (DE3) cells (Thermo Fisher Scientific) and induced with 0.4 mM IPTG for 16 hr at 15°C. Cell disruption was achieved by sonication in extraction buffer (50 mM Tris-HCl (pH 7.5), 150 mM NaCl, 0.1% Triton X-100, and 1 mM β-mercaptoethanol), and cellular debris was removed by centrifugation at 40,000 × g. Fusion proteins were purified through amylose beads (NEB).
To prepare the DNA probes, NotI/NdeI fragments containing 15 telomere repeats (TTAGGC) or scrambled repeats (GCTGTA) were radiolabeled with [γ-32P] ATP by T4 polynucleotide kinase (New England Biolabs). For preparation of shorter DNA probes with one, two, and three repeats of telomeric DNA, the complementary oligonucleotides were annealed and radiolabeled with [γ-32P] ATP by T4 polynucleotide kinase. Proteins were mixed with 0.2 nM of labeled probe per reaction in binding buffer (10 mM Tris-HCl (pH 7.5), 50 mM NaCl, 4% glycerol, 0.5 mM EDTA, 1 mM MgCl2, 0.5 μg poly[dI-dC], 0.5 mM DL-dithiothreitol) and electrophoresed in a 0.8% agarose gel in 0.5× Tris-borate-EDTA at room temperature.
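The repeat tracts of the two 15-repeat probes can be written out and compared programmatically. This is a minimal sketch that models only the repeat tracts; the flanking vector sequence of the NotI/NdeI fragments is not given in the text and is not represented, and the helper names are illustrative.

```python
from collections import Counter

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def repeat_tract(unit, copies):
    """Tandem array of a repeat unit, written 5' to 3'."""
    return unit * copies


def reverse_complement(sequence):
    """Sequence of the complementary strand, written 5' to 3'."""
    return sequence.translate(COMPLEMENT)[::-1]


telomeric = repeat_tract("TTAGGC", 15)
scrambled = repeat_tract("GCTGTA", 15)

print("tract length (bp):", len(telomeric))
print("C-rich strand unit:", reverse_complement("TTAGGC"))
print("same base composition:", Counter(telomeric) == Counter(scrambled))
print("telomeric unit present in scrambled tract:",
      "TTAGGC" in scrambled or reverse_complement("TTAGGC") in scrambled)
```

Because the scrambled unit has the same base composition as the telomeric unit but contains no telomeric repeat on either strand, a difference in binding between the two probes can be attributed to sequence rather than to GC content.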
The following antibodies were used: rabbit antibodies against DTN-1 (this study) 1:1000, DTN-2 (this study) 1:1000, POT-1 (this study) 1:1000, POT-2 (this study) 1:1000, and GFP (Invitrogen; A11122) 1:1000, and mouse antibodies against β-ACTIN (Sigma; A2228-100UL) 1:1000 and GFP (Roche; 11814460001).
cDNAs encoding the C-terminus of dtn-1 (a.a. 441–837), the C-terminus of dtn-2 (a.a. 434–818), and full-length pot-2 were cloned into the pET28c+ vector (Millipore). cDNA encoding the C-terminus of pot-1 (a.a. 100–300) was cloned into the pGEX-6P-1 vector (Addgene). The HIS- or GST-tagged recombinant proteins were expressed in BL21 (DE3) cells, solubilized in extraction buffer (600 mM NaCl and 50 mM Tris-HCl (pH 7.5)), and purified with Ni-nitrilotriacetic acid (QIAGEN) for the HIS tag or with glutathione agarose (Thermo Fisher Scientific) for the GST tag. The recombinant proteins were dialyzed into PBS and used to immunize the animals. The polyclonal antibodies were affinity purified on antigen-coupled Sepharose beads (GE Healthcare).
Images were obtained on a microscope (Olympus IL-X71 Delta Vision; Applied Precision) equipped with 100× NA 1.40 and 60× NA 1.42 objectives, a camera (CoolSNAP HQ; Photometrics), and softWoRx 5.5.5 acquisition software (Delta Vision). The acquired images were processed with deconvolution (softWoRx 5.5.5) and Photoshop (Adobe).
Age-matched hermaphrodites, 16–20 hr post-L4 larval stage, were dissected on coverslips in 20 μl of 1× egg buffer (containing 0.1% Tween-20 and 15 mM NaAzide). A SuperFrost Plus slide (Fisher) was immediately applied to the sample followed by freezing on dry ice. The coverslips were removed, and the slides were immediately placed in cold methanol for 1 min. The slides were post-fixed with 4% formaldehyde (diluted from fresh 37% formaldehyde). After washing with PBST, the slides were stained with DAPI. For immuno-FISH, the fixed slides were stained with GFP antibody and FITC-labeled secondary antibodies and fixed with 4% formaldehyde again (this step was skipped for FISH). After dehydration, the PNA-(TTAGGC)3 probe was added to the slide. The slides were denatured at 85°C for 10 min and hybridized at 37°C for 4 hr. After sequential washing in 50% formamide/0.5×SSC (twice) and 1×SSC (twice) at 42°C for 5 min each time, the slides were stained with DAPI.
Asynchronously cultured C. elegans samples were collected for the genomic DNA extraction. A total of 15 µg of C. elegans genomic DNA was digested with HinfI (New England Biolabs) and RsaI (New England Biolabs) and separated on a 0.6% agarose gel at 8 V/cm for 3 hr. After transfer to the membrane, Southern blotting was performed using a DNA probe with four repeats of telomeric DNA (TTAGGC)4 radiolabeled with [γ-32P] ATP by T4 polynucleotide kinase.
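Digestion with frequently cutting enzymes is informative for telomere length only if the enzymes do not cut within the telomeric repeats themselves. The sketch below checks this for a pure (TTAGGC)n tract, using the recognition sequences of HinfI (GANTC) and RsaI (GTAC); the tract length of 50 repeats is arbitrary, and the function name is illustrative.

```python
import re

# Recognition sequences (N = any base). Both sites are palindromic,
# so scanning one strand is sufficient.
RECOGNITION_SITES = {"HinfI": "GANTC", "RsaI": "GTAC"}


def count_sites(sequence, site):
    """Count occurrences of a recognition site, including overlapping ones."""
    pattern = "".join("[ACGT]" if base == "N" else base for base in site)
    return len(re.findall(f"(?=({pattern}))", sequence.upper()))


telomeric_tract = "TTAGGC" * 50  # arbitrary tract length for illustration

for enzyme, site in RECOGNITION_SITES.items():
    print(f"{enzyme} ({site}) sites in telomeric tract:",
          count_sites(telomeric_tract, site))
```

Neither site occurs in the repeat array, so the terminal restriction fragments detected by the (TTAGGC)4 probe retain the full telomeric tract plus a short stretch of subtelomeric sequence up to the nearest cut site.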
Mixed-stage worms were collected and suspended in IP buffer (20 mM HEPES (pH 7.0), 200 mM KCl, 5 mM MgCl2, 10% glycerol, 0.1% Triton X-100, and 1 mM β-mercaptoethanol) supplemented with cOmplete Protease Inhibitor (Roche) and Phosphatase Inhibitor (Roche). After sonication, the cell extract was centrifuged at 50,000 × g for 30 min at 4°C and the supernatant was isolated. The extract was supplemented with Dynabeads protein A (Thermo Fisher Scientific) conjugated with 80 μg of antibodies or IgG as the negative control and incubated for 6 hr at 4°C. The beads were washed with high-salt buffer (20 mM HEPES (pH 7.0), 400 mM KCl, 5 mM MgCl2, 10% glycerol, 0.1% Triton X-100, and 1 mM β-mercaptoethanol) supplemented with cOmplete Protease Inhibitor (Roche) and Phosphatase Inhibitor (Roche). The samples were eluted with 0.1 M glycine (pH 2.5).
Individual worms at the L4 larval stage were isolated and grown at 20°C. After reaching adulthood, the worms were transferred to a new plate every day until no eggs were laid, and viable progeny were counted approximately 24 hr after removing the parent. The parental strains, after the cessation of egg laying, were genotyped by PCR.
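The brood size of each worm is the sum of the viable progeny counted on its successive daily plates. A minimal sketch of that tally follows. The worm labels and counts are made up for illustration and are not data from this study.

```python
from statistics import mean


def brood_size(daily_counts):
    """Total viable progeny of one worm, summed over its daily plates."""
    return sum(daily_counts)


# Made-up daily counts for illustration only (one list per worm,
# one entry per daily transfer until egg laying stopped).
example_counts = {
    "worm_1": [40, 110, 90, 30, 5, 0],
    "worm_2": [35, 120, 80, 25, 0],
}

broods = {worm: brood_size(counts) for worm, counts in example_counts.items()}
mean_brood = mean(broods.values())
print(broods, mean_brood)
```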
The MS protocol was largely similar to the method described in our earlier publication (Zhang et al., 2020). The eluted samples were reduced with DL-dithiothreitol at a final concentration of 100 mM at 60°C for 30 min and supplemented with sodium dodecyl sulfate to a 1.5% final concentration. The samples were then processed according to the filter-aided sample preparation method modified from Wiśniewski et al., 2009. In short, reduced samples were diluted with 500 µl of 8 M urea and 50 mM triethylammonium bicarbonate (TEAB) solution, transferred onto Nanosep 30 k Omega filters (Pall Life Sciences), and washed once with 500 µl and twice with 200 µl of 8 M urea and 50 mM TEAB solution and twice with the digestion buffer (0.5% sodium deoxycholate and 50 mM TEAB). The reduced cysteine side chains were alkylated with 10 mM methyl methanethiosulfonate diluted in digestion buffer for 30 min at room temperature, and the samples were then repeatedly washed with digestion buffer. Trypsin in digestion buffer was added (300 ng) and the sample was incubated at 37°C for 4 hr, then another 300 ng portion of trypsin was added and the mixture was incubated overnight. Digested peptides were collected by centrifugation, followed by a wash with 20 µl of the digestion buffer and further centrifugation. The peptide samples were treated using the HiPPR detergent removal resin kit (PN 88305, Thermo Fisher Scientific, Waltham, MA, USA) according to the manufacturer’s instructions with 25 mM TEAB solution as the equilibration buffer. Sodium deoxycholate was precipitated and removed by acidification with 10% TFA and subsequent centrifugation. The supernatants were purified using Pierce peptide desalting spin columns (PN 89851, Thermo Fisher Scientific) according to the manufacturer’s instructions. The purified peptide samples were dried in a SpeedVac and reconstituted in 15 μl of 3% acetonitrile and 0.2% formic acid for the liquid chromatography-mass spectrometry (LC-MS) analysis.
LC-MS experiments were performed on an Orbitrap Fusion Lumos mass spectrometer interfaced with an Easy-nLC1200 nanoflow liquid chromatography system (both from Thermo Fisher Scientific). From each 15 μl peptide sample, 8 µl was trapped on an Acclaim Pepmap 100 C18 trap column (100 μm × 2 cm, particle size 5 μm, Thermo Fisher Scientific) and separated on an analytical column (75 μm × 35 cm) packed in-house with Reprosil-Pur C18 material (particle size 3 μm, Dr. Maisch, Germany) using a gradient with 0.2% formic acid in water as solvent A and 80% acetonitrile with 0.2% formic acid as solvent B at a flow rate of 300 nL/min. The elution profile was as follows: 5% to 33% B in 77 min, 33% to 100% B in 3 min, and 100% B for 10 min. Precursor ion scans were performed at 120,000 target resolution with an m/z range of 375–1500 and an AGC target of 4e5. The most abundant precursors with charges 2–7 were selected for fragmentation with a maximum duty cycle of 3 s and a dynamic exclusion duration of 45 s. Precursors were isolated with a 1.0 Da window and fragmented by higher-energy collision-induced dissociation at 30% collision energy with a maximum injection time of 150 ms and an AGC target of 5e4, and the MS2 spectra were recorded at 30,000 resolution.
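The elution profile can be written as a piecewise function of time, which makes the total run length (77 + 3 + 10 = 90 min) and the solvent composition at any time point explicit. The sketch below assumes that each ramp is linear and that the three segments run back to back from t = 0. Neither assumption is stated in the method, and the function names are hypothetical.

```python
# (start_min, end_min, start_%B, end_%B) for each segment of the elution profile.
# Assumption: segments are contiguous from t = 0 and each ramp is linear.
GRADIENT = [(0, 77, 5, 33), (77, 80, 33, 100), (80, 90, 100, 100)]

ACN_FRACTION_IN_B = 0.80  # solvent B contains 80% acetonitrile
TOTAL_RUN_MIN = GRADIENT[-1][1]


def percent_b(t_min: float) -> float:
    """Percentage of solvent B at time t_min (minutes) by linear interpolation."""
    for t0, t1, b0, b1 in GRADIENT:
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    raise ValueError("time is outside the 0-90 min gradient")


def percent_acetonitrile(t_min: float) -> float:
    """Acetonitrile percentage in the mobile phase (solvent A contains none)."""
    return ACN_FRACTION_IN_B * percent_b(t_min)


for t in (0, 38.5, 77, 80, 90):
    print(t, percent_b(t), percent_acetonitrile(t))
```

Under these assumptions the shallow ramp ends at 33% B, which corresponds to about 26.4% acetonitrile in the mobile phase.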
Peptide and protein identification and quantification were performed using Proteome Discoverer version 2.4 (Thermo Fisher Scientific). The LC-MS files were matched against the C. elegans reference UniProt database (May 2020) supplemented with common proteomic contaminants (26,924 proteins in total) using Mascot 2.5.1 (Matrix Science, London, United Kingdom) as the database search engine, with trypsin as the enzyme rule and one missed cleavage allowed, a precursor tolerance of 10 ppm, and a fragment tolerance of 0.03 Da; methionine oxidation was set as a variable modification, and methylthiolation on cysteine was set as a fixed modification. The Fixed Value PSM validator was used to assess the quality of peptide matches. Precursor ion quantification was accomplished via the Minora feature detection node in Proteome Discoverer 2.4, with the maximum peak intensity values used for quantification. Transfer of identifications between the runs was disabled. Abundance values for all unique peptides were used to calculate the protein abundances, and the intensity normalization was disabled.
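The two search tolerances are on different scales: the precursor tolerance is relative (parts per million), whereas the fragment tolerance is absolute (Da). The sketch below shows the arithmetic only. The function names are hypothetical, and the way the search engine applies these tolerances internally may differ.

```python
def ppm_window(mz: float, tol_ppm: float = 10.0):
    """Return the (low, high) m/z bounds for a relative tolerance in ppm."""
    delta = mz * tol_ppm * 1e-6
    return mz - delta, mz + delta


def within_fragment_tolerance(observed: float, theoretical: float,
                              tol_da: float = 0.03) -> bool:
    """True if a fragment matches within an absolute tolerance in Da."""
    return abs(observed - theoretical) <= tol_da


# Example values chosen for illustration, not taken from the data set.
low, high = ppm_window(500.0)  # 10 ppm of 500 is 0.005
print(low, high)
print(within_fragment_tolerance(200.02, 200.0))
print(within_fragment_tolerance(200.05, 200.0))
```

At m/z 500, a 10 ppm tolerance corresponds to ±0.005, so the precursor window is considerably tighter than the 0.03 Da allowed for fragments.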
No statistical method was used to predetermine sample size. The experiments were not randomized, and the investigators were not blinded to allocation during the experiments or to outcome assessment. Each conclusion in the manuscript was based on results that were reproduced in at least three independent experiments. Sample sizes, statistical tests, and p-values are indicated in the text, figures, and figure legends.
All data generated or analysed during this study are included in the manuscript and supporting files.
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
We thank Marc Pilon and his lab members (University of Gothenburg) for valuable discussions and generous help with daily experiments. We thank the Proteomics Core Facility of the University of Gothenburg, in particular Maria Segeda, for preparing the IP-MS experiment. We thank Owen R Davies (Newcastle University) for valuable discussion. This work was supported by Assar Gabrielsson’s Foundation FB 17–10 (HS) and O E och Edla Johanssons vetenskapliga stiftelse 253550102 (HS).
© 2021, Yamamoto et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Views, downloads and citations are aggregated across all versions of this paper published by eLife.