Research Article

Drosophila SUMM4 complex couples insulator function and DNA replication control

Department of Cell Biology, Albert Einstein College of Medicine, United States
UNC-SPIRE, University of North Carolina, United States
EpiCypher, United States
Integrative Program for Biological and Genome Sciences, University of North Carolina at Chapel Hill, United States
Lineberger Comprehensive Cancer Center, University of North Carolina, United States
Department of Biology, University of North Carolina, United States
Department of Genetics, University of North Carolina, United States

Dec 2, 2022

Open access

Abstract

Asynchronous replication of chromosome domains during S phase is essential for eukaryotic genome function, but the mechanisms establishing which domains replicate early versus late in different cell types remain incompletely understood. Intercalary heterochromatin domains replicate very late in both diploid chromosomes of dividing cells and in endoreplicating polytene chromosomes where they are also underreplicated. Drosophila SNF2-related factor SUUR imparts locus-specific underreplication of polytene chromosomes. SUUR negatively regulates DNA replication fork progression; however, its mechanism of action remains obscure. Here, we developed a novel method termed MS-Enabled Rapid protein Complex Identification (MERCI) to isolate a stable stoichiometric native complex SUMM4 that comprises SUUR and a chromatin boundary protein Mod(Mdg4)-67.2. Mod(Mdg4) stimulates SUUR ATPase activity and is required for a normal spatiotemporal distribution of SUUR in vivo. SUUR and Mod(Mdg4)-67.2 together mediate the activities of gypsy insulator that prevent certain enhancer–promoter interactions and establish euchromatin–heterochromatin barriers in the genome. Furthermore, SuUR or mod(mdg4) mutations reverse underreplication of intercalary heterochromatin. Thus, SUMM4 can impart late replication of intercalary heterochromatin by attenuating the progression of replication forks through euchromatin/heterochromatin boundaries. Our findings implicate a SNF2 family ATP-dependent motor protein SUUR in the insulator function, reveal that DNA replication can be delayed by a chromatin barrier, and uncover a critical role for architectural proteins in replication control. They suggest a mechanism for the establishment of late replication that does not depend on an asynchronous firing of late replication origins.

Editor's evaluation

This important paper will be of interest to those studying DNA replication in the context of chromatin and development and to those interested in higher-order chromatin organization. It uncovers a new interaction partner for SuUR and reports how this complex (SUMM4; Suppressor of Underreplication – Modifier of Mdg4) functions to control under-replication. The results are convincing and support the conclusions.

https://doi.org/10.7554/eLife.81828.sa0

eLife digest

Inside cells, molecules of DNA provide the instructions needed to make proteins. Cells carefully maintain and repair their DNA, and typically make a complete copy of the genome before they divide to ensure that after division, each daughter cell has a full set.

Within human, fly and other eukaryotic nuclei, DNA is packaged into structures known as chromosomes. Cells follow precisely controlled programs to replicate distinct regions of chromosomes at different times. To start copying a particular region, the cell machinery that replicates DNA binds to a sequence known as the origin of replication. It is thought that as-yet unknown cues from the cell may lead the replication machinery to bind to different origins of replication at different times.

In some circumstances, cells make extra copies of their DNA without dividing. For example, many cells in the larvae of fruit flies contain hundreds of extra DNA copies to sustain their increased sizes. However, the entire genome is not copied during this process, so cells end up with more copies of some regions of the genome than others. A protein called SUUR is required for hindering the replication of the ‘underrepresented’ regions, but it is not clear how it works.

To address this question, Andreyeva, Emelyanov et al. developed a new approach based on liquid chromatography and quantitative proteomics to identify the native form of SUUR in fruit flies. This revealed that SUUR exists as a stable complex with a protein called Mod(Mdg4), which is needed to recruit SUUR to the chromosomes. Further experiments suggested that SUUR and Mod(Mdg4) work together to bind to regions of DNA known as gypsy insulator elements, creating a physical barrier that hinders the replication machinery from accessing some parts of the genome.

The findings of Andreyeva, Emelyanov et al. provide an alternative explanation for how individual cells may stagger the process of copying their DNA without relying on the replication machinery binding to various replication origins at different times. Rather, late replication timing may be instructed by an insulator-born delay of the progression of replication over particular genomic regions. This mechanism adds to the list of nuclear processes (chromosome partitioning, transcriptional regulation, etc.) that are known to be directed by insulators and associated architectural proteins.

Introduction

Replication of metazoan genomes occurs according to a highly coordinated spatiotemporal program, where discrete chromosomal regions replicate at distinct times during S phase (Rhind and Gilbert, 2013). The replication program follows the spatial organization of the genome in Megabase-long constant timing regions interspersed by timing transition regions (Marchal et al., 2019). The spatiotemporal replication program exhibits correlations with genetic activity, epigenetic marks, and features of 3D genome architecture and subnuclear localization. Yet the reasons for these correlations remain obscure. Interestingly, the timing of firing for any individual origin of replication is established during G1 before pre-replicative complexes (pre-RC) are assembled at origins (Dimitrova and Gilbert, 1999), suggesting a mechanism that involves factors other than the core replication machinery.

Most larval tissues of Drosophila melanogaster grow via G-S endoreplication cycles that duplicate DNA without cell division, resulting in polyploidy (Zielke et al., 2013). Endoreplicated DNA molecules frequently align in register to form giant polytene chromosomes (Zhimulev et al., 2004). Importantly, in some cell types, genomic domains corresponding to the latest replicated regions of dividing cells, specifically pericentric (PH) and intercalary (IH) heterochromatin, fail to fully replicate during each endocycle resulting in underreplication (UR). These regions are depleted of sites for binding the Origin of Replication Complex (ORC), and thus, their replication primarily relies on forks progressing from external origins (Sher et al., 2012) in both dividing and endoreplicating cells, which suggests that both cell types utilize related mechanisms of regulation of late replication. Although cell cycle programs are dissimilar between endoreplicating and mitotically dividing cells (Zielke et al., 2013), they likely share the components of core biochemical machinery for DNA replication. Thus, underreplication provides a facile readout for late replication initiation and delayed fork progression.

The Suppressor of UnderReplication (SuUR) gene is essential for polytene chromosome underreplication in intercalary and pericentric heterochromatin (Belyaeva et al., 1998). In SuUR mutants, the DNA copy number in underreplicated regions is partially restored to almost reach those for fully polyploidized regions of the genome. SuUR encodes a protein (SUUR) containing a helicase domain with homology to that of the SNF2/SWI2 family. The occupancy of ORC in intercalary and pericentric heterochromatin is not increased in SuUR mutants (Sher et al., 2012), and, thus, the increased replication of underreplicated regions is likely not due to the firing of additional origins. Rather, SUUR negatively regulates the rate of replication fork progression (Nordman et al., 2014) by an unknown mechanism. It has been proposed (Posukh et al., 2015) that retardation of the replisome by SUUR takes place via simultaneous physical association with the components of the fork (e.g., CDC45 and PCNA) (Kolesnikova et al., 2013; Nordman et al., 2014) and repressive chromatin proteins, such as HP1a (Pindyurin et al., 2008).

Using a newly developed proteomics approach, we discovered that SUUR forms a stable stoichiometric complex with a chromatin boundary protein Mod(Mdg4)-67.2. We demonstrate that SUUR and Mod(Mdg4)-67.2 together are required for maximal underreplication of intercalary heterochromatin and full activity of the gypsy insulator, thereby implicating insulators in obstructing replisome progression and the control of late DNA replication.

Results

Identification of SUMM4, the native form of SUUR in Drosophila embryos

To determine how SUUR functions in replication control, we sought to identify its native complex. Previous attempts to characterize the native form of SUUR by co-IP or tag-affinity purification gave rise to multiple putative binding partners (Kolesnikova et al., 2013; Munden et al., 2018; Nordman et al., 2014; Pindyurin et al., 2008). However, evaluating whether any of these proteins are present in a native SUUR complex is problematic because of the low abundance of SUUR, which also precludes its purification by conventional chromatography. Therefore, we developed a novel biochemical approach using embryonic extracts (which can be obtained in large quantities) that relies on partial purification by multistep FPLC (fast protein liquid chromatography) (Figure 1A) and shotgun proteomics of chromatographic fractions by quantitative LCMS. We term this technology MERCI for MS-Enabled Rapid protein Complex Identification (‘Materials and methods’).

FPLC fractionation and MS-Enabled Rapid protein Complex Identification (MERCI) quantification of native SUUR.

(A) Schematic of FPLC purification of the native form of SUUR using the MERCI approach. ILR, ion library obtained by information-dependent acquisitions (IDA) of recombinant FLAG-SUUR; IL1-5, ion libraries obtained by IDA of FPLC fractions from chromatographic steps 1–5. KPi, potassium phosphate, pH 7.6. (B) Representation of SUUR in ion libraries ILR and IL1-5 (Supplementary file 1). Total number of identified proteins and the confidence rank of SUUR among them as well as the total number of detected peptides (95% confidence) and the number of SUUR-specific peptides are shown. (C) Recombinant FLAG-SUUR expressed in Sf9 cells. Identities of the eight most prominent bands were determined by mass spectroscopy. p130 and p65 correspond to full-length and C-terminally truncated FLAG-SUUR, respectively (red arrows). Other bands represent common Sf9-specific contaminants purified by FLAG chromatography (blue dashed lines), cf. purified EGG-F (green arrow). Molecular mass marker bands are indicated (kDa). (D–H) SWATH quantitation profiles of SUUR fractionation across individual FPLC steps. Ion libraries (IL) used for SWATH quantitation are shown at the bottom of each panel. Z-scores across indicated column fractions are plotted; error bars, standard deviations (N = 3). Gray rectangles, fraction ranges used for the next FPLC step; in (G), black arrows, expected peaks of globular proteins with indicated molecular masses in kDa. (I) SWATH quantitation profiles of SUUR fractionation across five FPLC steps. IL5 ion library was used for SWATH quantification.

Figure 1—source data 1 FPLC column parameters (Figure 1A). The following FPLC column parameters were used for partial purification of native SUMM4. HEG: 25 mM HEPES, pH 7.6, 0.1 mM EDTA, 10% glycerol, 0.02% NP-40, 1 mM DTT, 1 mM benzamidine, 0.4 mM PMSF; 10 mM KPi: 10 mM potassium phosphate, pH 7.6, 10% glycerol, 1 mM DTT, 1 mM benzamidine, 0.4 mM PMSF; 0.8 M KPi: 800 mM potassium phosphate, pH 7.6, 10% glycerol, 1 mM DTT, 1 mM benzamidine, 0.4 mM PMSF; cv, column volume. https://cdn.elifesciences.org/articles/81828/elife-81828-fig1-data1-v2.docx
Figure 1—source data 2 Recombinant proteins expressed in Sf9 cells and purified by FLAG affinity chromatography. Lane 1, protein size marker; lane 2, FLAG-SUUR, 72 hr infection of Sf9 cells; lane 3, FLAG-SUUR, 60 hr infection of Sf9 cells; lane 4, XNP-FLAG (Emelyanov et al., 2010), 72 hr infection of Sf9 cells; lane 5, XNP-FLAG, 60 hr infection of Sf9 cells; lane 6, EGG-FLAG, 72 hr infection of Sf9 cells; lane 7, EGG-FLAG, 60 hr infection of Sf9 cells. Prep amounts equivalent to ~20 ml Sf9 culture were loaded in each lane. Cropped images encompassing lanes 1–2 and 6 (open boxes, dashed red line) were used for Figure 1C. https://cdn.elifesciences.org/articles/81828/elife-81828-fig1-data2-v2.zip

Shotgun quantification of complex mixtures of polypeptides by LCMS is performed in two steps. First, the composition of the mixture is examined by information-dependent acquisitions (IDA) that establish protein identities based on MS1 and MS2 spectra of detected tryptic peptides. This information is used to compile a so-called ‘ion library’ (IL), which is then utilized to quantify spectral information obtained from the same samples by unbiased, data-independent acquisitions (DIA), sometimes termed sequential window acquisitions of all theoretical mass spectra (SWATH-MS/SWATH). Importantly, the depth of proteomic quantification is limited by the range of peptides in the IL originally built by IDA.

SUUR-specific peptides could not be found in ILs obtained from acquisitions of crude nuclear extracts or any fractions from the first (phosphocellulose) step (IL1, Figure 1B, Supplementary file 1). Therefore, SUUR could not be quantified in SWATH acquisitions of phosphocellulose fractions when IL1 alone was used as a reference. To measure the relative abundance of SUUR in phosphocellulose fractions, we augmented IL1 with the IL obtained by IDA of recombinant SUUR (ILR, Figure 1B and C). In ion libraries from subsequent chromatographic steps (IL2–IL5), peptides derived from native SUUR were detected (Figure 1B, Supplementary file 1) and used for quantification of cognate DIA/SWATH acquisitions (Figure 1D–H).

The final aspect of the MERCI algorithm calls for re-quantification of FPLC fraction SWATH acquisitions with an IL from the last step (IL5) that is enriched for peptides derived from SUUR and co-purifying polypeptides (Figure 1A) and includes only 140 proteins (Figure 1B, Supplementary file 1). In this fashion, scarce polypeptides (including SUUR and, potentially, SUUR-binding partners) that may not be detectable in earlier steps will not evade quantification. Purification profiles of the 132 proteins quantified in all five FPLC steps were then artificially stitched into 83-point arrays of Z-scores (Figure 1I, Supplementary file 2). These profiles were Pearson-correlated with that of SUUR and ranked in descending order of the Pearson correlation coefficient, PCC (Figure 2A). Whereas the PCC values for the bottom 130 proteins lay on a smooth curve, the top two proteins, SUUR (PCC = 1.000) and Mod(Mdg4) (PCC = 0.939), fell above the curve extrapolated by polynomial regression (Figure 2B). Consistently, SUUR and Mod(Mdg4) exhibited nearly identical purification profiles in all five FPLC steps (Figure 2C), unlike the next two top-scoring proteins, EGG (PCC = 0.881) and CG6700 (PCC = 0.874) (Figure 2—figure supplement 1A and B). Also, HP1a (PCC = 0.503), which had been proposed to form a complex with SUUR (Pindyurin et al., 2008), did not co-purify with SUUR in any FPLC step (Figure 2—figure supplement 1C).
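
The correlation ranking described above can be illustrated with a short sketch. It is not the pipeline used in this study: the function names (`zscore`, `stitch`, `pearson`, `rank_by_correlation`) and the toy intensities are hypothetical, and it assumes that Z-scores are computed separately for each FPLC step (with the sample standard deviation) before the per-step profiles are concatenated. The polynomial-regression step used to flag outliers is not reproduced here.

```python
import math


def zscore(values):
    """Z-score one FPLC step's fraction intensities (sample SD is an assumption)."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / (n - 1))
    if sd == 0:
        return [0.0] * n
    return [(v - mean) / sd for v in values]


def stitch(steps):
    """Concatenate per-step Z-score profiles into a single array."""
    profile = []
    for step in steps:
        profile.extend(zscore(step))
    return profile


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def rank_by_correlation(profiles, bait):
    """Rank all proteins by PCC against the bait profile, highest first."""
    ref = profiles[bait]
    scores = {name: pearson(ref, prof) for name, prof in profiles.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy intensities for two chromatographic steps (hypothetical values).
raw = {
    "SUUR": [[1, 5, 9, 4, 1], [2, 8, 3, 1]],
    "partner": [[1, 6, 10, 5, 2], [2, 7, 3, 1]],
    "unrelated": [[9, 5, 2, 1, 1], [1, 2, 6, 9]],
}
profiles = {name: stitch(steps) for name, steps in raw.items()}
ranking = rank_by_correlation(profiles, "SUUR")
for name, pcc in ranking:
    print(f"{name}\t{pcc:.3f}")
```

In this toy set, the bait ranks first by construction, and a protein whose elution profile tracks the bait in every step ranks above one that elutes elsewhere.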

Identification of the SUMM4 complex by MS-Enabled Rapid protein Complex Identification (MERCI).

(A) Pearson correlation of fractionation profiles for 132 individual proteins to that of SUUR, sorted from largest to smallest. Red box, the graph portion shown in (B). (B) Top 10 candidate proteins with the highest Pearson correlation to SUUR. Red dashed line, trend line extrapolated by polynomial regression (n = 5) from the bottom 130 proteins. (C) SWATH quantitation profiles of SUUR (red) and Mod(Mdg4) (cyan) fractionation across five FPLC steps, Figure 1I. IL5 ion library was used for SWATH quantification. (D) Western blot analyses of Superdex 200 fractions with SUUR and ModT antibodies, Figure 1G. Molecular mass markers are shown on the left (kDa). (E) Co-IP experiments. SUUR (red arrowhead) co-purifies from nuclear extracts with Mod(Mdg4)-67.2 (cyan arrowheads) but not HP1a (green arrowhead). Anti-XNP co-IPs HP1a but not SUUR or Mod(Mdg4)-67.2. Asterisks, IgG heavy and light chains detected due to antibody cross-reactivity. Mod(Mdg4)-67.2(FL) antibody recognizes all splice forms of Mod(Mdg4).

Figure 2—source data 1 Western blots of chromatographic fractions. Left panels, 700 nm channel (Odyssey Fc), rabbit anti-SUUR antibody and protein size marker; right panels, 800 nm channel (Odyssey Fc), guinea pig ModT antibody; top panels, hydroxylapatite fractions: starting material, flow-through, marker, fractions 1–12 (Figure 1H); bottom panels, Superdex 200 increase fractions: starting material, marker, fractions 5–15 (Figure 1G). Cropped images from bottom panels (open boxes, dashed red line) were used for Figure 2D. https://cdn.elifesciences.org/articles/81828/elife-81828-fig2-data1-v2.zip
Figure 2—source data 2

Co-IP of SUMM4 subunits.

(A, E) Westerns, 700 nm channel (Odyssey Fc), mouse anti-HP1a and protein size marker; (B) Western, 800 nm channel (Odyssey Fc), rabbit anti-Mod(Mdg4)-FL; (C, G) Westerns, 700 nm channel (Odyssey Fc), protein size marker only; (D) Western, 800 nm channel (Odyssey Fc), rabbit anti-SUUR; (F) Western, 800 nm channel (Odyssey Fc), guinea pig ModT; (H) Western, 800 nm channel (Odyssey Fc), guinea pig anti-SUUR. Lanes 1, 5, 9, 12, 15, and 18, protein size marker; lanes 2, 6, 10, 13, 16, and 19, input (nuclear extract), 5 or 10%; lanes 3 and 7, IP with guinea pig ModT antibody #1; lanes 4 and 8, IP with guinea pig ModT antibody #2; lanes 11 and 17, IP with rabbit preimmune serum; lanes 14 and 20, IP with rabbit anti-XNP. Cropped images encompassing lanes 1–3, 5–7, 12–14, and 18–20 (open boxes, dashed red line) were used for Figure 2E. https://cdn.elifesciences.org/articles/81828/elife-81828-fig2-data2-v2.zip

Mod(Mdg4) is a BTB/POZ domain protein that functions as an adapter for architectural proteins that promote various aspects of genome organization (Georgiev and Gerasimova, 1989; Gerasimova et al., 1995). It is expressed as 26 distinct polypeptides generated by splicing in trans of a common 5′-end precursor RNA with 26 unique 3′-end precursors (Büchner et al., 2000). IL5 contained seven peptides derived from Mod(Mdg4) (99% confidence). Whereas four of them mapped to the common N-terminal 402 residues, three were specific to the C-terminus of a particular form, Mod(Mdg4)-67.2 (Figure 2—figure supplement 2). Peptides specific to other splice forms were not detected. We raised an antibody to the C-terminus of Mod(Mdg4)-67.2, designated ModT antibody, and analyzed size-exclusion column fractions by immunoblotting. Consistent with SWATH analyses (Figures 1G and 2C), SUUR and Mod(Mdg4)-67.2 polypeptides copurified as a complex with an apparent molecular mass of ~250 kDa (Figure 2D). Finally, we confirmed that SUUR specifically co-immunoprecipitated with Mod(Mdg4)-67.2 from embryonic nuclear extracts (Figure 2E). As a control, XNP co-immunoprecipitated with HP1a as shown previously (Emelyanov et al., 2010), but did not with SUUR or Mod(Mdg4) (Figure 2E). We conclude that SUUR and Mod(Mdg4) form a stable stoichiometric complex that we term SUMM4 (Suppressor of Underreplication – Modifier of Mdg4).

Biochemical activities of recombinant SUMM4 in vitro

We reconstituted recombinant SUMM4 complex by co-expressing FLAG-SUUR with Mod(Mdg4)-67.2-His₆ in Sf9 cells and purified it by FLAG affinity chromatography (Figure 3A). Mod(Mdg4)-67.2 is the predominant form of Mod(Mdg4) expressed in embryos (e.g., Figure 2E, left panel). Thus, minor Mod(Mdg4) forms may have failed to be identified by IDA in IL5 (Figure 2—figure supplement 2A). We discovered that FLAG-SUUR did not co-purify with another splice form, Mod(Mdg4)-59.1 (Figure 3A, Figure 2—figure supplement 2C). Whereas the identity of an ~100 kDa Mod(Mdg4)-67.2-His₆ band co-purifying with FLAG-SUUR was confirmed by mass-spec sequencing, the FLAG-purified material from Sf9 cells expressing FLAG-SUUR and Mod(Mdg4)-59.1 did not contain Mod(Mdg4)-specific peptides. Therefore, the shared N-terminus of Mod(Mdg4) (1–402) is not sufficient for interactions with SUUR. However, this result does not exclude a possibility that SUUR may form complex(es) with some of the other, low-abundance 24 splice forms of Mod(Mdg4). The SUUR-Mod(Mdg4)-67.2 interaction is specific as the second-best candidate from our correlation analyses (Drosophila SetDB1 ortholog EGG; Figure 2B) did not form a complex with FLAG-SUUR (Figure 3—figure supplement 1A), although it is associated with its known partner WDE, an ortholog of hATF7IP/mAM (Wang et al., 2003).

Biochemical activities of recombinant SUMM4.

(A) Recombinant SUMM4. Mod(Mdg4)-His₆, 67.2 (p100, cyan arrowhead) and 59.1 (p75, green arrowhead) splice forms were co-expressed with FLAG-SUUR (red arrowheads, p130 and p65) or separately in Sf9 cells and purified by FLAG or Ni-NTA affinity chromatography. Mod(Mdg4)-67.2 forms a specific complex with SUUR. Identities of the 130, 100, 75, and 65 kDa protein bands from FLAG- and Ni-NTA-purified material were determined by mass spectroscopy. (B) ATPase activities of recombinant ISWI (brown bars), FLAG-SUUR (red bars), and SUMM4 (FLAG-SUUR + Mod(Mdg4)-67.2-His₆, purple bars). Equimolar amounts of proteins were analyzed in reactions in the absence or presence of plasmid DNA or equivalent amounts of reconstituted oligonucleosomes, ±H1. SUUR(KA) and MMD4, ATPase activities of the K59A mutant of SUUR (gray bars) and Mod(Mdg4)-67.2-His₆ (cyan bars). Hydrolysis rates were converted to moles ATP per mole protein per minute. All reactions were performed in triplicate (N = 3); error bars represent standard deviations. p-Values for statistically significant differences are indicated (Mann–Whitney test). (C) DNA- and nucleosome-dependent stimulation or inhibition of ATPase activity. The activities were analyzed as in (B). Statistically significant differences are shown (Mann–Whitney test). (D) Nucleosome sliding activities by EpiDyne-PicoGreen assay (see ‘Materials and methods’) with 5 nM of recombinant ISWI, SUUR, or SUMM4. Reaction time courses are shown for terminally (6-N-66) and centrally (50-N-66) positioned mononucleosomes (Figure 3—figure supplement 2B–E). RFU, relative fluorescence units produced by PicoGreen fluorescence.

Figure 3—source data 1 Recombinant proteins expressed in Sf9 cells and purified by FLAG or Ni-NTA affinity chromatography. Lanes 1 and 7, protein size marker; lane 2, FLAG-SUUR, FLAG-purified; lane 3, FLAG-SUUR + Mod(Mdg4)-67.2-His6, FLAG-purified; lane 4, FLAG-SUUR + Mod(Mdg4)-59.1-His6, FLAG-purified; lane 5, Mod(Mdg4)-67.2-His6, Ni-NTA-purified; lane 6, Mod(Mdg4)-59.1-His6, Ni-NTA-purified. All proteins were purified 72 hr post-infection. Prep amounts equivalent to ~20 ml (FLAG-purified, lanes 2–4) or ~1 ml (Ni-NTA-purified, lanes 5 and 6) Sf9 cultures were loaded in each lane. Cropped image encompassing all lanes (open box, dashed red line) was used for Figure 3A. https://cdn.elifesciences.org/articles/81828/elife-81828-fig3-data1-v2.zip

The N-terminus of SUUR contains a region homologous with SNF2-like DEAD/H helicase domains. Although SUUR requires its N-terminal domain to function in vivo (Munden et al., 2018), it has been hypothesized to be inactive as an ATPase (Nordman and Orr-Weaver, 2015). We analyzed the ability of recombinant SUUR and SUMM4 (Figure 3A) to hydrolyze ATP in vitro in comparison to recombinant Drosophila ISWI (Figure 3—figure supplement 1B). Purified recombinant Mod(Mdg4)-67.2 (Figure 3A) and a variant SUUR protein with a point mutation in the putative Walker A motif (K59A) were used as negative controls (Figure 3A, Figure 3—figure supplement 1B). Contrary to the prediction, both SUUR and SUMM4 exhibited strong ATPase activities (Figure 3B). SUMM4 was 1.4- to 2-fold more active than SUUR alone, indicating that Mod(Mdg4)-67.2 stimulates SUUR enzymatic activity. We then examined whether DNA and nucleosomes can stimulate the activity of SUUR. To this end, we reconstituted oligonucleosomes on plasmid DNA (Figure 3—figure supplement 1C–E). Linker histone H1-containing chromatin was also used as a substrate/cofactor because SUUR has been demonstrated to physically interact with H1 (Andreyeva et al., 2017). In contrast to ISWI, SUUR was not stimulated by addition of DNA or nucleosomes and was only moderately (by about 70%) activated by H1-containing oligonucleosomes (Figure 3C), consistent with its reported direct physical interaction with H1 (Andreyeva et al., 2017).

We examined the nucleosome remodeling activities of SUUR and SUMM4; specifically, their ability to expose a positioned DNA motif in the EpiDyne-PicoGreen assay (‘Materials and methods’ and Figure 3—figure supplement 2A). Centrally or terminally positioned mononucleosomes were efficiently mobilized by ISWI and human BRG1 in a concentration- and time-dependent manner (Figure 3—figure supplement 2B–E). In contrast, SUUR and SUMM4 did not reposition either nucleosome (Figure 3D). Thus, SUUR and SUMM4 do not possess a detectable remodeling activity and may resemble certain other SNF2-like enzymes (e.g., RAD54) that utilize the energy of ATP hydrolysis to mediate alternate DNA translocation reactions (Jaskelioff et al., 2003).

The distribution of SUMM4 complex in vivo

We examined the positions of SUUR and Mod(Mdg4)-67.2 within polytene chromosomes by indirect immunofluorescence (IF) and discovered that they overlap at numerous locations (Figure 4A, Figure 4—figure supplement 1A and B). In late endo-S phase, when SUUR exhibited a characteristic distribution, it co-localized with Mod(Mdg4)-67.2 at hundreds of loci along the chromosome arms (Figure 4—figure supplement 1B). Mod(Mdg4)-67.2 was present at classical regions of SUUR enrichment, such as underreplicated domains in 75C and 89E (Figure 4—figure supplement 1A). The chromocenter, which consists of underreplicated pericentric heterochromatin, contained SUUR but did not show occupancy by Mod(Mdg4)-67.2 (Figure 4—figure supplement 1A). Conversely, there were multiple sites of Mod(Mdg4)-67.2 localization that were free of SUUR (Figure 4—figure supplement 1A and B). Individual pixel intensities of IF signals for SUUR and Mod(Mdg4)-67.2 were plotted as a 2D scatter plot (Figure 4—figure supplement 1C) and were found to exhibit a weak positive correlation (R² = 0.278). Consistent with the possible multi-phasic relative distribution of SUUR and Mod(Mdg4)-67.2 (Figure 4—figure supplement 1B), the 2D plot encompassed four distinct areas, where SUUR and Mod(Mdg4)-67.2 were co-localized, enriched separately, or absent (Figure 4—figure supplement 1D). When regions of SUUR-alone and Mod(Mdg4)-67.2-alone enrichment were excluded, and only the regions of their apparent colocalization were considered, the anti-SUUR and anti-ModT signals exhibited a strong positive correlation (R² = 0.568, Figure 4—figure supplement 1D).
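
The effect of restricting a pixel-intensity correlation to regions where both signals are present can be illustrated with a toy calculation. This is only a sketch under stated assumptions: how the four areas of the scatter plot were delimited is not specified in this section, so the simple intensity cutoffs used here (`red_min`, `green_min`), the function names, and the pixel values are all hypothetical.

```python
def r_squared(x, y):
    """R^2 of a simple linear fit, computed as the squared Pearson coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    if sxx == 0 or syy == 0:
        return 0.0
    return (sxy * sxy) / (sxx * syy)


def colocalized(pixels, red_min, green_min):
    """Keep pixels where both channels reach their (hypothetical) cutoffs."""
    return [(r, g) for r, g in pixels if r >= red_min and g >= green_min]


# Toy (SUUR, ModT) intensity pairs: four co-localized pixels, two SUUR-only,
# two ModT-only, and two background pixels. Values are invented for illustration.
pixels = [
    (10, 12), (20, 19), (30, 33), (40, 38),
    (50, 2), (45, 3),
    (2, 48), (3, 55),
    (1, 1), (2, 2),
]

all_r2 = r_squared([p[0] for p in pixels], [p[1] for p in pixels])
subset = colocalized(pixels, red_min=5, green_min=5)
subset_r2 = r_squared([p[0] for p in subset], [p[1] for p in subset])
print(f"all pixels: R^2 = {all_r2:.3f}; co-localized only: R^2 = {subset_r2:.3f}")
```

In the toy data, pixels enriched for only one signal pull the overall R² down, and the value rises once they are excluded, which is the qualitative behavior reported for the SUUR and ModT signals.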

Spatiotemporal distribution of SUMM4 in vivo.

(A) Colocalization of SUUR and Mod(Mdg4)-67.2 in wild-type polytene chromosomes. Localization patterns of Mod(Mdg4)-67.2 and SUUR in L3 polytene chromosomes were analyzed by indirect immunofluorescence (IF) staining. The polytene spread fragment (3L and 3R arms) corresponds to a nucleus in late endo-S phase, according to PCNA staining (Figure 4—figure supplement 1A). Left panel, DAPI staining shows the overall chromosome morphology. Middle panel, ModT (green) and SUUR (red) signals overlap extensively in euchromatic arms. Right panel, a colocalization image with swapped red (ModT) and green (SUUR) channels is shown for comparison. Note the additional strong ModT IF loci that are SUUR-free as well as Mod(Mdg4)-67.2-free SUUR in pericentric 3LR. (B) SUUR loading into chromosomes during early endo-S phase is compromised in mod(mdg4) mutants. SuUR mutation does not appreciably change the distribution of Mod(Mdg4)-67.2. Endo-S timing was established by PCNA staining (Figure 4—figure supplement 3B). (C) Abnormal subcellular distribution of SUMM4 subunits in mod(mdg4) and SuUR mutants. L3 salivary glands were fixed and whole-mount-stained with DAPI, ModT, and SUUR antibodies. Whereas both polypeptides are mostly nuclear in wild-type, they are partially mis-localized to the cytoplasm in the mod(mdg4)^u1 mutant.

The existence of chromosome loci heavily enriched for Mod(Mdg4)-67.2 but devoid of SUUR suggests that there are additional native form(s) of Mod(Mdg4)-67.2, either as an individual polypeptide or in complex(es) other than SUMM4. When we fractionated Drosophila nuclear extract using a different progression of FPLC steps (Figure 4—figure supplement 2A), we found that Mod(Mdg4)-67.2 can form a megadalton-sized complex that did not contain SUUR (Figure 4—figure supplement 2B–D). Therefore, a more intricate pattern of Mod(Mdg4)-67.2 distribution likely reflects loading of both SUMM4 and an alternative Mod(Mdg4)-67.2-containing complex.

We tested whether SUUR and Mod(Mdg4) loading into polytene chromosomes were mutually dependent using mutant alleles of SuUR and mod(mdg4). SuUR^ES is a null allele of SuUR (Makunin et al., 2002). mod(mdg4)^m9 is a null allele with a deficiency that removes gene regions of the shared 5′-end precursor and eight specific 3′-precursors (Savitsky et al., 2016). mod(mdg4)^u1 contains an insertion of a Stalker element in the last coding exon of Mod(Mdg4)-67.2 3′-precursor (Gerasimova et al., 1995), and thus is predicted only to disrupt expression of this isoform. SuUR^ES and mod(mdg4)^u1 are homozygous viable, and mod(mdg4)^m9 is recessive adult pharate lethal. Although homozygous mod(mdg4)^m9 animals die after the pupal stage, they survive until late third-instar larvae (L3). Therefore, this allele cannot be used to study adult phenotypes, but it is possible to analyze its effects in L3, such as on polytene chromosome structure. Importantly, however, since the homozygous progeny is produced by heterozygous parents, the recessive phenotypes would not reveal themselves until the maternally loaded protein and RNA are exhausted (diluted and/or degraded) by late larval stages, as frequently occurs for other Drosophila mutants.

We could not detect Mod(Mdg4)-67.2 expression in homozygous mod(mdg4)^m9 L3 salivary glands by immunoblotting, whereas mod(mdg4)^u1 expressed a truncated polypeptide (cf., ~70 kDa and ~100 kDa, Figure 4—figure supplement 3A). The truncated 70 kDa polypeptide failed to load into polytene chromosomes (Figure 4B, Figure 4—figure supplement 3B). As shown previously, SUUR could not be detected in SuUR^ES chromosomes. Since homozygous mod(mdg4)^m9 L3 larvae were produced by inter se crosses of heterozygous parents, the very low amounts of Mod(Mdg4)-67.2 in mod(mdg4)^m9 polytene chromosomes (barely above the detection limit) were presumably maternally contributed.

The absence (or drastic decrease) of Mod(Mdg4)-67.2 also strongly reduced the loading of SUUR (Figure 4B, Figure 4—figure supplement 3B). The normal distribution pattern of SUUR in polytene chromosomes is highly dynamic (Andreyeva et al., 2017; Kolesnikova et al., 2013). SUUR is initially loaded in chromosomes at the onset of endo-S phase and then redistributes through very late endo-S, when it accumulates in underreplicated domains and pericentric heterochromatin. In both mod(mdg4) mutants, we observed a striking absence of SUUR in euchromatic arms of polytene chromosomes during early endo-S (Figure 4B, Figure 4—figure supplement 3B), which indicates that the initial deposition of SUUR is dependent on its interactions with Mod(Mdg4). Although SUUR deposition slightly recovered by late endo-S, it was still several fold weaker than that in wild-type control. Potentially, in the absence of Mod(Mdg4), SUUR may be tethered to intercalary and pericentric heterochromatin loci by direct binding with linker histone H1 as shown previously (Andreyeva et al., 2017). Finally, the gross subcellular distribution of SUUR also strongly correlated with that of Mod(Mdg4): a mis-localization of truncated Mod(Mdg4)-67.2 from nuclear to partially cytoplasmic was accompanied by a similar mis-localization of SUUR (Figure 4C). This result indicates that the truncation of Mod(Mdg4) in mod(mdg4)^u1 may have an antimorphic effect by mis-localization and deficient chromatin loading of interacting polypeptides, including SUUR (Figure 4C) and others (Figure 4—figure supplement 2B–D).

The role of SUMM4 as an effector of the insulator/chromatin barrier function

Mod(Mdg4)-67.2 does not directly bind DNA but instead is tethered by a physical association with zinc finger factor Suppressor of Hairy Wing, Su(Hw) (Gause et al., 2001). Su(Hw) directly binds to consensus sequences that are present in gypsy transposable elements and are also widely distributed across the Drosophila genome in thousands of copies (Adryan et al., 2007). Mod(Mdg4)-67.2 was previously shown to be essential for the insulator activity of gypsy (Gerasimova et al., 1995), which functions in vivo to prevent enhancer–promoter interactions and establish a barrier to the propagation of chromatin forms (Cai and Levine, 1995; Roseman et al., 1993). We therefore tested whether SUMM4 contributes to the gypsy insulator functions.

The ct⁶ allele of Drosophila contains a gypsy element inserted between the wing enhancer and promoter of the gene cut. The insertion inactivates cut expression and results in abnormal wing development (Figure 5A). We discovered that both mod(mdg4)^u1 and SuUR^ES mutations partially suppressed this phenotype (Figure 5A) and significantly increased the wing size compared to ct⁶ allele alone (Figure 5B). Thus, both subunits of SUMM4 are required to mediate the full enhancer-blocking activity of gypsy. Interestingly, the double, SuUR^ES and mod(mdg4)^u1, mutant produced an additional suppression of the ct⁶ phenotype compared to that by mod(mdg4)^u1 alone (Figure 5A, red arrowhead), which suggests that SUUR may contribute to the insulator function in the absence of Mod(Mdg4)-67.2.

Figure 5


Biological functions of SUMM4 in the regulation of gene expression.

(A) SUMM4 subunits are required for the enhancer-blocking activity in *ct⁶*. Top: schematic diagram of the *ct⁶* reporter system; the *gypsy* retrotransposon is inserted between the wing enhancer and promoter of *cut* (Bag et al., 2019). Bottom left: the appearance of wild-type adult wing; bottom right: the appearance of *ct⁶* adult wing in the wild-type background. *SuUR^ES* and *mod(mdg4)^u1* alleles are recessive suppressors of the *ct⁶* phenotype. Red and black arrowheads point to distinct anatomical features of the wing upon *SuUR* mutation. (B) Relative sizes (areas) of wings in adult male flies of the indicated phenotypes were measured (N=17) as described in ‘Materials and methods.’ p-Values for statistically significant differences are indicated (t-test). (C) SUMM4 subunits are required for the chromatin barrier activity of Su(Hw) binding sites. Top: schematic diagram of the *P{SUPor-P}* reporter system (Bellen et al., 2004); 12 clustered copies of *gypsy* Su(Hw) binding sites flank the transcription unit of *white*. *KV00015* and *KV00138* are *P{SUPor-P}* insertions in pericentric heterochromatin of 2L. *SuUR^ES* and *mod(mdg4)^u1* alleles are recessive suppressors of the boundary that insulates *white* from heterochromatin encroachment.

Another insulator assay makes use of a collection of P{SUPor-P} insertions that contain the white reporter flanked by 12 copies of gypsy Su(Hw)-binding sites (Figure 5C, top). When P{SUPor-P} is inserted in heterochromatin, white is protected from silencing, resulting in red eyes (Roseman et al., 1995). Both mod(mdg4)^u1 and SuUR^ES relieved the chromatin barrier function of Su(Hw) sites, causing repression of white (Figure 5C). We conclude that SUMM4 is an insulator complex that contributes to the enhancer-blocking and chromatin boundary functions of gypsy by a mechanism schematized in Figure 6A and B.

Figure 6


Schematic models for the biological functions of SUMM4 in the regulation of gene expression and DNA replication.

(A) Schematic model for the function of SUMM4 in blocking enhancer–promoter interactions in the *ct⁶* locus. A *gypsy* mobile element inserted between wing enhancer and gene *cut* encompasses multiple Su(Hw) binding sites. (B) Schematic model for the function of SUMM4 in establishing a chromatin barrier in heterochromatin-inserted *P{SUPor-P}* elements. The reporter gene *white* is flanked on both sides by 12 copies of *gypsy* insulator element. (C) Schematic model for a putative function of SUMM4 in blocking/retardation of replication fork progression in intercalary heterochromatin domains. Black oval, Su(Hw) protein bound to a *gypsy* insulator element(s); cyan oval, Mod(Mdg4)-67.2 protein tethered to Su(Hw); red oval, SUUR protein associated with Mod(Mdg4)-67.2 in SUMM4 complex; brown ovals represent heterochromatin components; gray rectangles, gene *cut* and its upstream wing enhancer; orange rectangle, gene *white*.

The role of SUMM4 in the regulation of DNA replication in polytene chromosomes

A similar, chromatin partitioning-related mechanism may direct the function of SUUR in the establishment of underreplication in late-replicating intercalary heterochromatin domains of polytene chromosomes (Figure 6C). It has long been known that 3D chromosome partitioning maps show an ‘uncanny alignment’ with replication timing maps (Rhind and Gilbert, 2013). To examine the possible roles of SUMM4 in underreplication, we measured DNA copy number genome-wide in salivary glands of L3 larvae by next-generation sequencing (NGS). In w¹¹¹⁸ control salivary glands, the DNA copy profile revealed large (>100 kbp) domains of reduced ploidy (Figure 7A), similar to previous reports (Andreyeva et al., 2017; Sher et al., 2012; Yarosh and Spradling, 2014). Excluding pericentric and sub-telomeric heterochromatin, we called 70 underreplicated regions (Table 1) in euchromatic arms, as described in ‘Materials and methods’.

Figure 7 (with 1 supplement)


Biological functions of SUMM4 in the regulation of DNA replication.

(A) Genome-wide analyses of DNA copy numbers in *Drosophila* salivary gland cells (*w¹¹¹⁸* control). DNA from L3 salivary glands was subjected to high-throughput sequencing. DNA copy numbers (normalized to diploid embryonic DNA) are shown for chromosomes X, II, and III. Chromosome arms are indicated in white. Brown- and green-shaded boxes, mapped pericentric and telomeric heterochromatin regions (Hoskins et al., 2015), respectively. Asterisks, positions of underreplicated domains (Table 1). Genomic coordinates in Megabase pairs are indicated at the bottom. (B) Analyses of DNA copy numbers in *Drosophila* salivary gland cells from wild-type and mutant alleles. Normalized DNA copy numbers are shown across the X chromosome. The control trace (*w¹¹¹⁸* allele) is shown as semitransparent light gray in the foreground; *SuUR^ES* (homozygous null) and *mod(mdg4)^m9* (zygotic null from crosses of heterozygous parents) traces are shown in the background in red and green, respectively; their overlaps with *w¹¹¹⁸* traces appear as lighter shades of colors. Black box, 4C9-E3 cytological region. (C) Close-up view of DNA copy numbers in region 4C9-E3 from high-throughput sequencing data, presented as in (B). DNA copy numbers were also measured independently by real-time qPCR. The numbers were calculated relative to embryonic DNA and normalized to a control intergenic region. The X-axis shows chromosome positions (in Megabase pairs) of target amplicons. Black, *w¹¹¹⁸*; red, *SuUR^ES* (homozygous null); green, *mod(mdg4)^m9* (zygotic null from crosses of heterozygous parents); purple, *SuUR^ES* (zygotic null from crosses of heterozygous parents). Error bars represent the confidence interval (N=9, see ‘Materials and methods’). Black arrowheads, positions of mapped Su(Hw) binding sites (Nègre et al., 2010). Yellow boxes show approximate boundaries of cytogenetic bands.
(D) Close-up view of DNA copy numbers by high-throughput sequencing and by qPCR for region 75B11-C2 and DAPI-stained polytene chromosome segments around cytological regions 75B-75C. Yellow lines or brackets in DAPI images indicate positions of 75C1 and 75C2 bands (*w¹¹¹⁸* control) or fused 75C1-2 band (mutants); cyan, *mod(mdg4)^u1* (homozygous null); for other designations see (C).

Figure 7—source data 1. Primer sequences used for qPCR. Genomic coordinates indicate full amplicons, including the length of each primer. Coordinates refer to the BDGP R6/dm3 assembly. https://cdn.elifesciences.org/articles/81828/elife-81828-fig7-data1-v2.docx

Table 1

Underreplicated domains and suppression of underreplication (UR) in SUMM4 subunit mutant alleles.

Domains of UR in euchromatic arms of polytene chromosomes were called in w¹¹¹⁸ as described in ‘Materials and methods.’ Their genomic coordinates, approximate cytological location (‘Cyto band’), and average DNA copy numbers (‘<CN>’) in homozygous w¹¹¹⁸, SuUR^ES, and mod(mdg4)^m9 L3 larvae are shown. <CN> numbers were normalized to the average DNA copy numbers across the euchromatic genome. UR percent recovery levels were calculated as (<CN>_mut – <CN>_w1118) / (1 – <CN>_w1118), expressed as a percentage; negative numbers indicate increased UR. UR p-values were calculated using the DESeq2 package by averaging the Wald test p-values of each 5 kbp bin significantly different from the w¹¹¹⁸ signal. UR was called as suppressible by a mutant if p<0.01; p-values for regions that exhibit a statistically significant recovery of UR are shown in bold blue. Averages of <CN> across all called underreplicated domains and averages of percent Recovery across all suppressible underreplicated domains (‘<Recovery>’, bottom row) were adjusted for each underreplicated domain length; calculation errors = standard deviations.

N	Chromosome coordinates				Length	UR, w¹¹¹⁸	UR, SuUR^ES			UR, mod(mdg4)^m9
N	Arm	Left	Right	Cyto band	Length	<CN>	<CN>	Recovery (%)	p-Value	<CN>	Recovery (%)	p-Value
1	X	2,950,001	3,140,000	3C3-C7	190,000	0.51	0.93	86	7.3E-05	0.58	14	1.1E-02
2	X	4,710,001	4,900,000	4C15-D5	190,000	0.56	0.96	92	3.9E-04	0.81	57	6.9E-05
3	X	4,965,001	5,070,000	4E1-E2	105,000	0.72	0.86	50	5.6E-04	0.80	28	1.4E-02
4	X	6,415,001	6,525,000	6A1-B1	110,000	0.71	0.90	65	1.4E-03	0.80	29	7.3E-03
5	X	7,335,001	7,560,000	7B1-B4	225,000	0.65	0.98	95	1.2E-03	0.79	40	2.8E-03
6	X	7,750,001	7,865,000	7B7-C1	115,000	0.64	0.94	84	3.0E-09	0.84	55	5.2E-07
7	X	8,880,001	9,005,000	8B5-C2	125,000	0.73	0.86	50	5.5E-03	0.76	9	4.6E-03
8	X	9,405,001	9,555,000	8D12-E7	150,000	0.72	0.91	67	3.6E-04	0.85	47	3.6E-03
9	X	11,170,001	11,325,000	10A10-B3	155,000	0.67	0.84	53	3.2E-03	0.78	35	2.6E-03
10	X	12,040,001	12,430,000	11A2-A10	390,000	0.38	0.97	94	1.4E-08	0.42	6	6.8E-03
11	X	13,950,001	14,100,000	12D1-E1	150,000	0.69	0.72	10	1.0E-02	0.73	14	1.4E-02
12	X	14,290,001	14,565,000	12E7-F1	275,000	0.51	0.94	87	4.1E-04	0.69	36	8.1E-04
13	X	17,925,001	18,030,000	16F3-F5	105,000	0.67	0.99	98	1.7E-15	0.90	68	3.4E-05
14	X	20,000,001	20,105,000	19A4-B1	105,000	0.79	1.12	157	1.4E-13	0.82	12	6.1E-03
15	X	20,525,001	21,020,000	19D2-E7	495,000	0.50	0.97	93	1.3E-07	0.51	2	4.9E-03
16	X	21,630,001	22,450,000	20A5-C1	820,000	0.04	0.32	29	1.8E-03	0.06	2	6.4E-03
17	X	22,550,001	22,995,000	20C2-F3	445,000	0.48	0.81	64	7.8E-05	0.74	51	3.5E-04
18	2L	3,920,001	4,025,000	24D1-D4	105,000	0.63	0.93	81	7.9E-07	0.80	46	5.9E-05
19	2L	4,585,001	4,790,000	25A2-A5	205,000	0.66	0.99	98	1.9E-08	0.78	36	1.3E-03
20	2L	5,400,001	5,510,000	25E1-E4	110,000	0.82	0.99	95	4.0E-08	0.90	45	8.3E-03
21	2L	6,155,001	6,320,000	26B9-C2	165,000	0.74	1.08	130	7.3E-14	0.88	54	4.7E-04
22	2L	9,030,001	9,150,000	29F8-30A2	120,000	0.76	0.98	93	1.5E-04	0.95	79	3.3E-03
23	2L	11,535,001	11,795,000	32F2-33A1	260,000	0.44	0.90	83	2.9E-04	0.57	24	1.5E-03
24	2L	12,215,001	12,340,000	33D3-E1	125,000	0.58	0.86	66	3.6E-11	0.75	40	1.1E-04
25	2L	12,765,001	12,970,000	33F5-34A3	205,000	0.55	0.91	79	8.8E-04	0.73	40	7.0E-05
26	2L	14,685,001	15,010,000	35B4-B8	325,000	0.41	0.88	80	5.7E-04	0.54	23	7.2E-04
27	2L	15,295,001	15,735,000	35D1-D4	440,000	0.49	0.76	53	2.3E-05	0.54	9	4.0E-03
28	2L	15,770,001	15,900,000	35D4-D6	130,000	0.54	0.87	71	4.5E-08	0.68	31	6.7E-04
29	2L	15,925,001	16,240,000	35D6-F1	315,000	0.29	0.90	87	6.7E-07	0.38	12	1.4E-05
30	2L	16,925,001	17,375,000	36B4-C7	450,000	0.23	0.89	85	1.4E-04	0.26	4	4.3E-03
31	2L	17,515,001	18,100,000	36C10-E4	585,000	0.34	0.87	80	5.0E-06	0.36	2	3.7E-03
32	2L	18,160,001	18,300,000	36E6-F2	140,000	0.67	0.99	97	3.3E-06	0.90	69	3.1E-06
33	2L	20,110,001	20,290,000	38C1-C4	180,000	0.48	0.69	41	8.9E-04	0.46	–5	1.8E-03
34	2L	20,485,001	20,620,000	38C8-D1	135,000	0.77	0.98	93	1.0E-06	0.99	97	2.1E-05
35	2L	21,400,001	21,550,000	39D3-E2	150,000	0.10	0.15	5	3.2E-03	0.14	3	4.4E-03
36	2L	21,805,001	22,125,000	40A4-E4	320,000	0.53	0.94	87	6.9E-05	0.54	1	9.5E-03
37	2R	4,875,001	5,050,000	41C4-D1	175,000	0.35	0.86	78	2.3E-10	0.34	–1	4.0E-03
38	2R	5,410,001	5,535,000	41F1-F3	125,000	0.58	0.79	50	1.1E-03	0.52	–13	2.2E-03
39	2R	6,290,001	6,505,000	42A14-B1	215,000	0.13	0.50	42	9.3E-04	0.14	1	2.7E-03
40	2R	13,620,001	13,760,000	50B6-C3	140,000	0.63	0.95	88	4.1E-18	0.78	41	1.3E-05
41	2R	20,355,001	20,540,000	56F17-57A5	185,000	0.56	0.92	83	2.0E-06	0.71	35	8.2E-04
42	2R	21,830,001	21,945,000	58A2-A4	115,000	0.72	0.95	83	1.1E-05	0.71	–3	2.2E-02
43	2R	23,145,001	23,320,000	59D1-D6	175,000	0.62	1.04	110	1.3E-22	0.67	13	7.7E-03
44	3L	4,840,001	5,100,000	64C1-C5	260,000	0.38	0.92	87	3.5E-08	0.40	3	6.6E-03
45	3L	5,385,001	5,510,000	64C15-D3	125,000	0.51	0.88	76	1.9E-22	0.73	45	6.0E-09
46	3L	6,290,001	6,485,000	65A11-B3	195,000	0.52	0.89	77	4.9E-05	0.71	38	1.2E-04
47	3L	9,180,001	9,300,000	67A1-A7	120,000	0.67	0.97	90	6.5E-09	0.73	20	1.0E-02
48	3L	10,000,001	10,195,000	67D3-D10	195,000	0.62	0.97	93	4.4E-13	0.79	44	5.7E-06
49	3L	13,085,001	13,220,000	70A1-A2	135,000	0.66	1.01	104	3.6E-09	0.89	66	2.9E-06
50	3L	13,550,001	13,855,000	70B6-C4	305,000	0.26	0.95	94	1.8E-06	0.39	18	7.3E-04
51	3L	15,175,001	15,500,000	71B7-D3	325,000	0.39	0.94	89	5.6E-04	0.46	10	3.7E-03
52	3L	17,115,001	17,240,000	73F1-74A1	125,000	0.71	1.02	106	4.3E-05	0.84	45	2.7E-03
53	3L	18,175,001	18,525,000	75B11-75D2	350,000	0.45	0.87	76	6.8E-05	0.47	4	4.6E-03
54	3L	20,555,001	20,695,000	77D1-77E3	140,000	0.60	1.02	106	2.2E-22	0.84	61	3.6E-11
55	3R	6,060,001	6,310,000	83D2-E4	250,000	0.70	0.92	72	7.6E-04	0.63	–22	1.0E-02
56	3R	6,495,001	6,635,000	83F1-84A1	140,000	0.53	0.96	91	7.8E-08	0.71	39	2.2E-04
57	3R	6,915,001	7,055,000	84B1-B2	140,000	0.64	0.93	80	3.9E-04	0.82	49	1.9E-05
58	3R	7,550,001	7,785,000	84D9-84E2	235,000	0.44	0.80	65	8.0E-06	0.51	12	4.2E-03
59	3R	10,450,001	10,660,000	86B6-C4	210,000	0.55	0.98	97	8.1E-11	0.66	25	7.6E-04
60	3R	10,910,001	11,140,000	86C15-86D4	230,000	0.45	0.94	89	2.3E-10	0.46	2	2.3E-03
61	3R	12,050,001	12,165,000	87A5-B1	115,000	0.63	0.96	88	9.9E-24	0.81	49	5.9E-09
62	3R	12,745,001	12,935,000	87C8-D4	190,000	0.67	0.89	68	7.5E-05	0.60	–21	1.1E-02
63	3R	14,935,001	15,055,000	88D8-D10	120,000	0.70	0.88	61	7.6E-06	0.84	47	1.0E-04
64	3R	16,670,001	16,970,000	89D6-E5	300,000	0.40	0.92	87	2.7E-09	0.47	10	3.2E-03
65	3R	17,160,001	17,355,000	89F1-90A2	195,000	0.62	0.94	84	1.0E-03	0.86	64	2.8E-04
66	3R	20,085,001	20,290,000	92C4-E1	205,000	0.61	0.81	53	1.5E-03	0.71	26	3.6E-03
67	3R	20,340,001	20,525,000	92E4-E12	185,000	0.58	0.96	91	5.0E-05	0.79	50	7.2E-04
68	3R	22,110,001	22,295,000	94A2-A4	185,000	0.61	0.93	83	3.4E-11	0.76	39	3.0E-04
69	3R	28,005,001	28,295,000	98B7-C3	290,000	0.40	0.91	85	2.5E-05	0.60	32	6.9E-04
70	3R	28,370,001	28,480,000	98C5-D2	110,000	0.73	0.98	94	1.2E-09	0.91	66	4.3E-07
UR domains: 70 <Length> : 216 ± 64 kbp Average <CN> across all UR domains: 0.49 ± 0.08							Suppressed UR domains: 69 <Length> : 217 ± 64 kbp <Recovery> : 78 ± 11%			Suppressed UR domains: 60 <Length> : 225 ± 67 kbp <Recovery> : 26 ± 9%
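
The percent recovery formula in the Table 1 legend can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' analysis code: the function names `percent_recovery` and `length_weighted_mean` are hypothetical, and reading "adjusted for each underreplicated domain length" as a length-weighted mean is an assumption. Because the <CN> values printed in the table are rounded to two decimals, results recomputed from them can differ from the tabulated Recovery by a few percentage points.

```python
from typing import Sequence


def percent_recovery(cn_mut: float, cn_ctrl: float) -> float:
    """Percent recovery of UR: 0 = same as control, 100 = full ploidy (<CN> = 1)."""
    if cn_ctrl >= 1.0:
        raise ValueError("control <CN> must be below 1 for an underreplicated domain")
    return 100.0 * (cn_mut - cn_ctrl) / (1.0 - cn_ctrl)


def length_weighted_mean(values: Sequence[float], lengths: Sequence[float]) -> float:
    """Average of per-domain values, weighting each domain by its length (assumed reading)."""
    return sum(v * w for v, w in zip(values, lengths)) / sum(lengths)


# (cyto band, length in bp, <CN> w1118, <CN> SuUR^ES, <CN> mod(mdg4)^m9), copied from Table 1
rows = [
    ("3C3-C7", 190_000, 0.51, 0.93, 0.58),
    ("11A2-A10", 390_000, 0.38, 0.97, 0.42),
    ("83D2-E4", 250_000, 0.70, 0.92, 0.63),
]

for band, length, ctrl, suur, mod in rows:
    print(f"{band}: SuUR^ES {percent_recovery(suur, ctrl):.0f}%, "
          f"mod(mdg4)^m9 {percent_recovery(mod, ctrl):.0f}%")

# Length-weighted control <CN> over the three example domains only
print(length_weighted_mean([r[2] for r in rows], [r[1] for r in rows]))
```

A negative return value, as for mod(mdg4)^m9 in 83D2-E4, corresponds to increased underreplication relative to the control.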

In both SuUR and mod(mdg4)^m9 null larvae, we observed statistically significant suppression of underreplication in intercalary heterochromatin (Figure 7B, Figure 7—figure supplement 1A, Table 1). In line with its lack of accumulation within the chromocenter of polytene chromosomes (Figure 4A), Mod(Mdg4) was largely dispensable for underreplication in pericentric heterochromatin. The NGS data strongly correlated with qPCR measurements of DNA copy numbers (Figure 7C and D). Furthermore, cytological evidence in the 75C region supported the molecular analyses in that both mutants exhibited a brighter DAPI staining of the 75C1-2 band than that in w¹¹¹⁸, indicative of higher DNA content (Figure 7D). Importantly, consistent with the role of Mod(Mdg4)-dependent insulators in the establishment of underreplication, the boundaries of underreplicated domains frequently encompass multiple clustered Su(Hw) binding sites (Figure 7C and D).
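
The qPCR copy numbers in Figure 7C and D are described as calculated relative to embryonic DNA and normalized to a control intergenic region. One common way to express such a calculation is the comparative Ct approach, sketched below. The source does not state which formula was used, so the function `relative_copy_number`, the default amplification efficiency of 2 (perfect doubling per cycle), and the Ct values in the usage lines are all assumptions made for illustration.

```python
def relative_copy_number(
    ct_target_sample: float,
    ct_ref_sample: float,
    ct_target_embryo: float,
    ct_ref_embryo: float,
    efficiency: float = 2.0,
) -> float:
    """Copy number of a target amplicon in a sample relative to diploid embryonic DNA.

    Each Ct is first normalized to the control (reference) amplicon measured in the
    same DNA, then the sample is compared with the embryonic calibrator.
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_embryo = ct_target_embryo - ct_ref_embryo
    return efficiency ** -(delta_sample - delta_embryo)


# Hypothetical Ct values: the target amplifies one cycle later than the control
# region in salivary gland DNA, but at the same cycle in embryonic DNA.
cn = relative_copy_number(25.0, 24.0, 22.0, 22.0)
print(cn)  # one cycle of delay corresponds to half the relative copy number
```

Under these assumptions, a fully replicated locus gives a value near 1 and an underreplicated locus gives a value below 1, which is the scale used for the normalized copy numbers in Table 1.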

Uniformly, SuUR mutation gave rise to a stronger relief of underreplication than that produced by the mod(mdg4)^m9 null allele (Table 1). This result can be explained by embryonic deposition of functional Mod(Mdg4) proteins and RNA by heterozygous mothers, unlike the complete absence of SUUR throughout the life cycle of the homozygous viable and fertile SuUR^ES animals. Although third-instar larvae are >1000-fold larger, volume-wise, than the embryos, persistent Mod(Mdg4)-67.2 can still be detected in polytene chromosomes of these larvae by IF despite its dilution and degradation (Figure 4B, Figure 4—figure supplement 3B). In contrast, unlike L3, first-instar larvae (L1) are nearly identical in size to the embryos. Therefore, since the endoreplication cycles initiate in embryos and L1, in mod(mdg4)^m9 animals the first few out of 10–11 rounds of chromosome polytenization take place with an almost normal amount of Mod(Mdg4) present, which may substantially limit the effect of mod(mdg4)^m9 mutation on underreplication as measured in L3.

Seemingly, there is a contradiction between a strong effect that mod(mdg4) null mutation has on the loading of SUUR in polytene chromosomes (Figure 4B) and a weaker effect on underreplication (Figure 7B–D, Figure 7—figure supplement 1A and B, Table 1). However, the SUUR occupancy is examined in L3 after the maternal mod(mdg4) product is nearly eliminated (Figure 4B). On the other hand, the DNA copy number, although also measured in L3 (Figure 7B–D, Figure 7—figure supplement 1A and B, Table 1), is a product of multiple rounds of endoreplication that initiate before Mod(Mdg4) is exhausted. To validate the putative effect of maternally contributed SUMM4 on the establishment of underreplication, we performed qPCR measurements of DNA copy numbers in salivary glands of homozygous SuUR animals produced by inter se crosses of heterozygous SuUR^ES/+ parents (Figure 7C and D, zygotic SuUR^ES). Similar to the maternal Mod(Mdg4), the initial maternal contribution of SUUR partially limited the reversal of underreplication in cytological regions 4D and 75C. Thus, when the SuUR and mod(mdg4) null mutant animals are similarly derived from heterozygous mothers that deposit wild-type gene product into their progeny, the mutant underreplication phenotypes in the third-instar larval salivary gland are essentially indistinguishable. Finally, we analyzed the effect of homozygous mod(mdg4)^u1 mutation, which is viable and fertile, on DNA copy numbers in the 75C underreplicated domain by qPCR and cytologically (Figure 7D). We observed a substantially stronger suppression of underreplication than that in mod(mdg4)^m9, presumably due to the absence of maternal contribution of full-length Mod(Mdg4)-67.2.

We conclude that SUUR and Mod(Mdg4)-67.2 act together as subunits of the stable SUMM4 complex, which is required for the establishment of underreplication in the intercalary heterochromatin domains of Drosophila polytene chromosomes.

Discussion

MERCI is a powerful new approach to characterize stable stoichiometric protein complexes

We present here a facile method, termed MERCI, to rapidly identify subunits of stable native complexes by only partial chromatographic purification. It allows one to circumvent the conventional, rate-limiting approach to purify proteins to apparent homogeneity. Since a multistep FPLC scheme invariably leads to an exponential loss of material, reducing the number of purification steps in the MERCI protocol allows identification of rare complexes, such as SUMM4, which may be present in trace amounts in native sources. On the other hand, MERCI obviates introduction of false-positives frequently associated with tag purification of ectopically expressed targets that render results less reliable. Notably, MERCI is not limited to analyses of known polypeptides since it is readily amenable to fractionation of native factors based on a correlation with their biochemical activities in vitro.

The dissection of the protein interactome by extract fractionation on orthogonal FPLC columns and MS-based approaches has been previously attempted (Havugimana et al., 2012; Shatsky et al., 2016). However, unlike the newly developed MERCI approach, these studies were aimed at comprehensive, proteome-wide analyses, which yielded data only for the most abundant complexes. The major distinction of the MERCI protocol is that it is targeted toward a particular protein (SUUR in this study). The crucial final stage of the MERCI algorithm is re-quantification of all acquired SWATH data using a library acquired from fractions of the last column (IL5, Figure 1A, B, and I). The target protein and co-purifying polypeptides are substantially enriched after several chromatographic steps and, thus, yield a greater number of detected peptides, which enables more precise quantification. Although SWATH allows reliable measurement of picogram amounts of proteins (Figure 1—figure supplement 1A and B), the range of quantified polypeptides is always limited to those present in IDA (ion libraries). For low-abundance proteins, such as SUUR and Mod(Mdg4), specific peptides are not detectable by IDA in earlier chromatographic steps (Supplementary file 1). Consequently, SWATH quantification using only the cognate ion libraries would not discern the near-perfect co-fractionation of SUUR and Mod(Mdg4) in all five steps (Figure 2C), precluding identification of the SUUR-Mod(Mdg4) complex (Figure 2B and C).

One limitation of the MERCI protocol is its failure to measure the absolute amounts of identified polypeptides. For instance, quantification of SWATH data (Figure 1D–H) measures the relative (to reference proteins and each other) amounts of SUUR across fractions. To measure the absolute levels of SUUR, a semi-quantitative approach was used by building a titration curve from SWATH acquisitions of known amounts of recombinant SUUR (Figure 1—figure supplement 1A and B). We estimated the amount of SUUR in the nuclear extract (~140 pg in 25 µg total protein, Figure 1—figure supplement 1B) and in individual fractions from all chromatographic steps (Figure 1—figure supplement 1C). Although in five FPLC steps we achieved >3000-fold purification of SUUR, it remained only ~2% pure (Figure 1—figure supplement 1D). A progressive loss of material precludes further purification (300 ng of SUUR in 16 µg total protein). Thus, the SUMM4 complex would be nearly impossible to purify to homogeneity from a substantial amount of starting material (~1 kg Drosophila embryos, ~2.5 g protein), suggesting that SUMM4 could not be identified by the classical FPLC approach.
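
The purification figures quoted above can be cross-checked with simple arithmetic. The sketch below uses only the amounts stated in the text; the variable names are illustrative, and the recovery estimate additionally assumes that the ~2.5 g of starting protein contains SUUR at the same mass fraction as the assayed nuclear extract, which the source does not state explicitly.

```python
def mass_fraction(target_g: float, total_g: float) -> float:
    """Mass of the target protein divided by total protein mass."""
    return target_g / total_g


# Amounts quoted in the text, converted to grams
extract_purity = mass_fraction(140e-12, 25e-6)  # ~140 pg SUUR in 25 ug nuclear extract
final_purity = mass_fraction(300e-9, 16e-6)     # 300 ng SUUR in 16 ug after five FPLC steps

fold_purification = final_purity / extract_purity

# Assumption: the ~2.5 g starting protein has the same SUUR fraction as the extract
suur_in_start_g = 2.5 * extract_purity
recovery = 300e-9 / suur_in_start_g

print(f"purity after five steps: {final_purity:.1%}")
print(f"fold purification: {fold_purification:.0f}")
print(f"estimated SUUR recovery: {recovery:.1%}")
```

The computed purity (just under 2%) and fold purification (above 3000) agree with the values reported in the text. Under the stated assumption, only about 2% of the SUUR present in the starting material is recovered after five steps, which illustrates the progressive loss of material that precludes further purification.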

SUMM4 regulates the function of gypsy insulator elements

Both subunits of SUMM4 contribute to the known functions of gypsy insulator (Figure 5A–C). Although a SuUR mutation decreased the insulator activity, the suppression was universally weaker than that by mod(mdg4)^u1. It is possible that SUUR is not absolutely required for the establishment of the insulator. For instance, the loss of SUMM4 may be compensated by the alternative complex of Mod(Mdg4)-67.2 (Figure 4—figure supplement 2). Furthermore, the mod(mdg4)^u1 allele is expected to have an antimorphic function since it can mis-localize interacting partner proteins, including SUUR itself (Figure 4C). Interestingly, SuUR has been previously characterized as a weak suppressor of variegation of the white^m4h X chromosome inversion allele, which places the white gene near pericentric heterochromatin (Belyaeva et al., 2003). In contrast, SuUR mutation enhances variegation in the context of insulated, heterochromatin-positioned white (Figure 5C). Therefore, this phenotype is unrelated to the putative Su(var) function of SuUR but, rather, is insulator-dependent.

ATP-dependent motor proteins are required for the establishment of chromatin barrier and chromosome partitioning

Our discovery and analyses of SUMM4 provide a biochemical link between ATP-dependent motor factors and the activity of insulators in the regulation of gene expression and chromatin partitioning. Insulator elements organize the genome into chromatin loops (Gerasimova et al., 1995) that are involved in the formation of topologically associating domains [TADs] (Peterson et al., 2021; Rowley et al., 2017; Szabo et al., 2019). In mammals, CTCF-dependent loop formation requires ATP-driven motor activity of SMC complex cohesin (Davidson et al., 2019). In contrast, CTCF and cohesin are thought to be dispensable for chromatin 3D partitioning in Drosophila (Matthews and White, 2019). Instead, the larger, transcriptionally inactive domains (canonical TADs) are interspersed with smaller active compartmental domains, which themselves represent TAD boundaries (Rowley et al., 2017). It has been proposed that in Drosophila, domain organization does not rely on architectural proteins but is established by transcription-dependent, A-A compartmental (gene-to-gene) interactions (Rowley et al., 2017). However, Drosophila TAD boundaries are enriched for architectural proteins other than CTCF (Van Bortle et al., 2014), and their roles have not been tested in loss-of-function models. Thus, it is possible that in Drosophila, instead of CTCF, the 3D partitioning of the genome is facilitated by another group of insulator proteins, such as Su(Hw) and SUMM4, that together associate with class 3 insulators (Schwartz et al., 2012).

Moreover, SUUR may provide the DNA motor function to promote a physical separation of active and inactive loci and help establish chromosome contact domains (Figure 6A–C). We propose that within the SUMM4 complex, SUUR utilizes its putative ATP-dependent motor activity to translocate along chromatin strands, thus facilitating the establishment of higher-order structures that isolate promoters from enhancers (Figure 6A) and stabilize DNA loops/domains to prevent unrestricted heterochromatin encroachment (Figure 6B) and penetration of replication forks (Figure 6C). The translocation model is consistent with observations of an asymmetric, selective occupancy of SUUR away from its initial sites of deposition via Su(Hw)-Mod(Mdg4) binding toward inside of intercalary heterochromatin regions but not outside (Figure 7—figure supplement 1C; Filion et al., 2010), which may be facilitated by physical interactions between SUUR and linker histone H1 enriched in intercalary heterochromatin (Andreyeva et al., 2017). It has been reported that another Drosophila BTB/POZ domain insulator protein CP190 forms a complex with a DEAD-box helicase Rm62 that contributes to the insulator activity (Lei and Corces, 2006). Thus, ATP-dependent motor proteins may represent an obligatory component of the insulator complex machinery.

SUMM4 mediates known biological functions of SUUR

Our discovery explains previous observations about biological functions of SUUR. For instance, the initial deposition of SUUR and its colocalization with PCNA has been proposed to depend on direct physical interaction with components of the replisome (Kolesnikova et al., 2013). Our model indicates that, instead, the apparent colocalization of SUUR with PCNA throughout endo-S phase (Figure 4—figure supplement 3B) may be caused by a replication fork retardation at insulator sites. SUUR is deposited in chromosomes as a subunit of SUMM4 complex at thousands of loci by tethering via Mod(Mdg4)-Su(Hw) interactions. As replication forks progress through the genome, they encounter insulator complexes where replication machinery pauses for various periods of time before resolving the obstacle. Thus, the increased co-residence time of PCNA and SUUR manifests cytologically as their partial colocalization. With the progression of endo-S phase, some of the SUMM4 insulator complexes are evicted and, thus, the number of SUUR-positive loci is decreased, until eventually the replication fork encounters nearly completely impenetrable insulators demarcating the underreplicated domain boundaries.

This mechanism is especially plausible given that boundaries of intercalary heterochromatin loci very frequently encompass multiple, densely clustered Su(Hw) binding sites (e.g., Figure 7C and D). We examined the data from genome-wide proteomic analyses for Su(Hw) and SUUR performed by DamID in Kc167 cells (Filion et al., 2010). Strikingly, Su(Hw) DamID-measured occupancy does not exhibit a discrete pattern expected of a DNA-binding factor. Instead, it appears broadly dispersed, together with SUUR, up to tens of kbp away from mapped Su(Hw) binding sites (Figure 7—figure supplement 1C). Interestingly, when hidden Markov modeling was applied to the DamID data, Su(Hw), Mod(Mdg4)-67.2, and SUUR occupancies were found to strongly correlate genome-wide in a novel chromatin form (‘malachite’) that frequently demarcates the boundaries of intercalary heterochromatin (Khoroshko et al., 2016). These observations strongly corroborate the translocation model for the mechanism of action of SUMM4. According to this model, upon tethering to DNA-bound Su(Hw), SUMM4 traverses the underreplicated region, which helps to separate it in a contact domain. As DNA within the underreplicated region is tracked by SUUR (Figure 6C), it is brought into a transient close proximity with both SUMM4 and the associated Su(Hw) protein, which is detected by DamID (or ChIP) as an expanded occupancy pattern.

The deceleration of SUUR-bound replication forks was also invoked as an explanation for the apparent role of SUUR in the establishment of epigenetic marking of intercalary heterochromatin (Posukh et al., 2015). We propose that global epigenetic modifications observed in the SuUR mutant likely do not directly arise from derepression of the replisome as suggested but, rather, result from the coordinate insulator-dependent regulatory functions of SUUR in both the establishment of a chromatin barrier and DNA replication control (Figure 6B and C).

Architectural proteins can attenuate replication forks and regulate replication timing

Our work demonstrates for the first time that insulator complexes assembled on chromatin can attenuate the extent of replication in discrete regions of the salivary gland polyploid genome. Despite distinct cell cycle programs in dividing and endoreplicating cells (Zielke et al., 2013), the core biochemical composition of replisomes in both cell types is likely similar. Although the putative relationship is limited by a paucity of comparative biochemical analyses of replication factors in different cell types, related insulator-driven control mechanisms for DNA replication may be conserved in endoreplicating and mitotically dividing diploid cells. Our data thus implicate insulator/chromatin boundary elements as a critical attribute of DNA replication control. Our model suggests that delayed replication of repressed chromatin (e.g., intercalary heterochromatin) during very late S phase can be imposed by a simple, two-component mechanism (Figure 6C). First, it requires that an extended genomic domain be completely devoid of functional origins of replication. The assembly and licensing of proximal pre-RC complexes can be repressed epigenetically or at the level of DNA sequence. Second, this domain is separated from flanking chromatin by a barrier element associated with an insulator complex, such as SUMM4. This structural organization is capable of preventing or delaying the entry of external forks fired from distal origins.

An important and frequent feature of the partially suppressed underreplication in mod(mdg4) animals is its asymmetry (Figure 7D, Figure 7—figure supplement 1B), which is consistent with a unidirectional penetration of the underreplicated domain by a replication fork originating from the nearest external origin (Figure 6C). The SUMM4-dependent barrier may be created as a direct physical obstacle to MCM2-7 DNA-unwinding helicase or other enzymatic activities of the replisome. Alternatively, SUMM4 may inhibit the replication machinery indirectly by assembling at the insulator a DNA/chromatin structure that is incompatible with replisome translocation. This putative inhibitory structure may involve epigenetic modifications of chromatin as proposed earlier (Gaszner and Felsenfeld, 2006) and linker histone H1 as shown previously (Andreyeva et al., 2017), and may also be dependent on Rif1, a negative DNA replication regulator that acts downstream of SUUR (Munden et al., 2018).
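The two-component model and the asymmetry it predicts can be illustrated with a toy calculation. The sketch below is purely illustrative and is not part of the analyses reported here: it assumes a single origin-free domain flanked by one early-firing origin on each side, a constant fork speed, and a fixed pause at each boundary. All names and numbers (coordinates, speed, pause lengths, S phase duration) are hypothetical and were chosen only to make the arithmetic easy to follow.

```python
def replication_time(x, domain, origins, pauses, speed):
    """Time at which position x inside the origin-free domain is replicated.

    Hypothetical toy model: both flanking origins fire at time 0, forks move
    at a constant speed, and each fork waits for a fixed pause at the
    boundary before entering the domain.
    """
    start, end = domain
    left_origin, right_origin = origins
    pause_left, pause_right = pauses
    # Fork arriving from the left flank: travel to the boundary, pause, enter.
    from_left = (start - left_origin) / speed + pause_left + (x - start) / speed
    # Fork arriving from the right flank, computed the same way.
    from_right = (right_origin - end) / speed + pause_right + (end - x) / speed
    return min(from_left, from_right)


def underreplicated(domain, origins, pauses, speed, s_phase_length):
    """Integer positions in the domain not reached before S phase ends."""
    start, end = domain
    return [
        x
        for x in range(start, end + 1)
        if replication_time(x, domain, origins, pauses, speed) > s_phase_length
    ]


# Assumed values: coordinates in kb, speed in kb/min, times in min.
DOMAIN = (100, 400)   # origin-free domain
ORIGINS = (80, 600)   # nearest external origins; the left one is closer
SPEED = 1.0
S_PHASE = 200

# Strong barriers on both sides (long pauses): no fork enters in time.
wild_type = underreplicated(DOMAIN, ORIGINS, (500, 500), SPEED, S_PHASE)

# Barriers removed (no pauses): only the fork from the nearer origin
# penetrates the domain, so the rescue is one-sided.
mutant = underreplicated(DOMAIN, ORIGINS, (0, 0), SPEED, S_PHASE)

print(len(wild_type), "positions underreplicated with barriers")
print(len(mutant), "positions underreplicated without barriers")
if mutant:
    print("remaining underreplicated interval:", mutant[0], "to", mutant[-1])
```

Under these assumptions, late replication of the domain arises without any late-firing origin: it depends only on the absence of internal origins and on the pause imposed at the boundaries. When the pauses are set to zero, the fork from the nearer origin replicates the adjacent part of the domain while the distal part remains underreplicated, which is the kind of asymmetry described above.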

In conclusion, we used a newly developed MERCI approach to identify a stable stoichiometric complex termed SUMM4 that comprises SUUR, a previously known negative effector of replication, and Mod(Mdg4), an insulator protein. SUMM4 subunits cooperate to mediate transcriptional repression and chromatin boundary functions of gypsy-like (class 3) insulators (Schwartz et al., 2012) and inhibit DNA replication likely by slowing down replication fork progression through the boundary element. Thus, SUMM4 is required for coordinate regulation of gene expression, chromatin partitioning, and DNA replication timing. The insulator-dependent regulation of DNA replication offers a novel mechanism for the establishment of replication timing in addition to the currently accepted paradigm of variable timing of replication origin firing.

FPLC fractionation and MS-Enabled Rapid protein Complex Identification (MERCI) quantification of native SUUR.

Figure 1—source data 1

Figure 1—source data 2

Identification of the SUMM4 complex by MS-Enabled Rapid protein Complex Identification (MERCI).

Figure 2—source data 1

Figure 2—source data 2

Biochemical activities of recombinant SUMM4.

Figure 3—source data 1

Spatiotemporal distribution of SUMM4 in vivo.

Biological functions of SUMM4 in the regulation of gene expression.

Schematic models for the biological functions of SUMM4 in the regulation of gene expression and DNA replication.

Biological functions of SUMM4 in the regulation of DNA replication.

Figure 7—source data 1

Underreplicated domains and suppression of underreplication (UR) in SUMM4 subunit mutant alleles.

Author details

Evgeniya N Andreyeva (contributed equally)
Alexander V Emelyanov (contributed equally)
Markus Nevil
Lu Sun
Elena Vershilova
Christina A Hill
Michael-C Keogh
Robert J Duronio
Arthur I Skoultchi
Dmitry V Fyodorov (for correspondence)
