Repair of naturally occurring mismatches can induce mutations in flanking DNA
Abstract
‘Normal’ genomic DNA contains hundreds of mismatches that are generated daily by the spontaneous deamination of C (U/G) and methyl-C (T/G). Thus, a mutagenic effect of their repair could constitute a serious genetic burden. We show here that while mismatches introduced into human cells on an SV40-based episome were invariably repaired, this process induced mutations in flanking DNA at a significantly higher rate than no mismatch controls. Most mutations involved the C of TpC, the substrate of some single strand-specific APOBEC cytidine deaminases, similar to the mutations that can typify the ‘mutator phenotype’ of numerous tumors. siRNA knockdowns and chromatin immunoprecipitation showed that TpC preferring APOBECs mediate the mutagenesis, and siRNA knockdowns showed that both the base excision and mismatch repair pathways are involved. That naturally occurring mispairs can be converted to mutators, represents an heretofore unsuspected source of genetic changes that could underlie disease, aging, and evolutionary change.
https://doi.org/10.7554/eLife.02001.001eLife digest
The inherent chemical instability of the four bases that are found in DNA leads to our genetic material being damaged on a daily basis. The sequence of these bases codes the genetic instructions necessary for all cellular functions, so damaged bases must be efficiently recognized and accurately repaired. The base excision repair pathway carries out these functions.
However, there are some circumstances in which random changes to the genetic code can be beneficial. In immune cells, for example, these changes enhance the diversity of antibodies generated to fight bacteria and viruses. In immune cells, a second repair pathway—the mismatch repair pathway—hijacks the base excision repair pathway. This gives enzymes belonging to the APOBEC family access to the DNA that is undergoing repair, and these enzymes change cytosine bases to uracil bases. Subsequent processing steps can lead to different bases substituted for the original cytosine. The recent discovery that APOBEC enzymes are abundant in other types of cells raised the possibility that these enzymes could be significant source of mutations in the DNA of cells where such mutations are not welcome.
To explore this possibility Chen et al. deliberately introduced a number of mutations (that are normally repaired by the base excision repair pathway) into non-immune human cells and observed what happened. The mutations were repaired, but the number of mutations in neighboring bases increased by a statistically significant amount. In particular, most of these mutations involved a cytosine base that was preceded by a thymine base. Chen et al. also showed that both APOBEC and the mismatch repair pathway are involved, as is the case for the mutations caused by APOBEC enzymes in immune cells. Similar APOBEC mutations are known to be involved in cancer.
The model system developed by Chen et al. not only shows that normally error-free DNA repair can be involved in generating these mutations, but also used to obtain a better understanding of these processes and thereby provide new insights in cancer biology.
https://doi.org/10.7554/eLife.02001.002Introduction
Species survival depends on the faithful replication of genetic information which is monitored and maintained by a number of complex and interacting DNA repair pathways (Modrich, 2006; Cannavo et al., 2007; Cortázar et al., 2007; Hsieh and Yamane, 2008; Kunz et al., 2009; Robertson, 2009; Liu et al., 2010; Jacobs and Schar, 2012; Jiricny, 2013). Continual DNA repair is required to correct the thousands of genetic lesions that occur daily due to just the inherent chemical lability of DNA (Atamna et al., 2000; Barnes and Lindahl, 2004). For example, the susceptibility of C (and its methylated derivative) to spontaneous hydrolytic deamination daily generates hundreds of U/G and T/G mismatches respectively, (Barnes and Lindahl, 2004) and could explain why C is the most frequent source of single nucleotide substitutions in mammals (Hwang and Green, 2004).
Error-free base excision repair (BER) can correct the naturally occurring U/G and T/G mismatches (reviewed in Cortázar et al., 2007; Hegde et al., 2008; Robertson, 2009; Jacobs and Schar, 2012). The basic reaction involves removal of the base that is paired with G by anyone of several glycosylases to generate an abasic site that is cleaved on its 5′ side by the APE1 endonuclease. The resulting 3′ end is extended by insertion of dCMP by the high fidelity polymerase β coincident with its hydrolysis of the 5′-phosphodeoxyribose that had been generated by the glycosylase. This step is followed by sealing the resulting single stranded break (SSB) by DNA ligase III with the assistance of the scaffolding protein XRCC1.
However, in lymphoid (B) cells the U/Gs that are generated by activation-induced cytidine deaminase (AID, a member of the AID/apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like (APOBEC) family of cytidine deaminases, Conticello, 2008) on transient single-stranded DNA regions produced during transcription, are prone to several mutagenic processes that enhance diversification of immunoglobulins (somatic hypermutation, SHM, Martomo and Gearhart, 2006; Teng and Papavasiliou, 2007; Peled et al., 2008). One of these involves processing U/G mismatches, or BER products thereof, by a non-canonical application of the mismatch repair (MMR) pathway. Normally, MMR (Modrich, 2006; Hsieh and Yamane, 2008; Jiricny, 2013) is a high fidelity process that operates post-replicatively on the nascent DNA strand to remove mismatches that have escaped the proof-reading activity of high fidelity replicative DNA polymerases. Essential components of this pathway include the heterodimer MutSα (MSH2 and MSH6), which recognizes mismatches, the heterodimer MutLα (MLH1 and PMS2), which accesses the mismatch-containing nascent strand in a reaction mediated by proliferating cell nuclear antigen (PCNA, a multipurpose replication clamp, e.g., Moldovan et al., 2007; Lee and Myung, 2008). PCNA also activates a latent endonuclease in MutLα (Kadyrov et al., 2006; Pluciennik et al., 2010), which provides entry points for the EXO1 nuclease that excises the mismatch-containing nascent strand to expose the replication template for re-copying by a high fidelity DNA polymerase, such as polymerase δ.
In B cells, non-canonical MMR can expose single stranded regions at U/G-containing sites unrelated to DNA replication that can serve as a template for DNA repair. However, the high fidelity DNA polymerase is replaced by the error-prone polymerase η (mediated by mono-ubiquitylated PCNA). Thus, MMR is subverted to an error-prone process that contributes to SHM. Recently, elements of non-canonical MMR have been recapitulated in vitro (Schanz et al., 2009; Peña-Diaz et al., 2012), and the latter study also showed that extracts of non-lymphoid mammalian cells can also process U/G mismatches by a non-canonical MMR process. In addition, these non-lymphoid cells when stressed in vivo by the alkylating agent, N-methyl-N′-nitro-N-nitrosoguanidine (MNNG), generated more mutations in MMR proficient than deficient cells, thereby implicating MMR in the mutagenic process but presumably using an AID-independent mechanism (Hsieh, 2012).
As DNA homeostasis would seem to require a continual state of DNA repair, its involvement in error-prone processes even at a low frequency would have important implications for the mutational mechanisms that could underlie evolution, aging, and disease. Of interest in this regard is the mechanism that produces the enhanced mutation rate that characterizes certain tumors, which has been termed the ‘mutator phenotype’ (Bielas et al., 2006; Venkatesan et al., 2006). Recent examples of such a mutator phenotype are the high mutation rates of the C of TpCpN (or its complement, G of NpGpA) that can accompany the progression of some cancers (Nik-Zainal et al., 2012; Roberts et al., 2012). TpC-preferring members of the AID/APOBEC family of C deaminases, particularly APOBEC3B (A3B), mediate these mutations (Burns et al., 2013; Leonard et al., 2013; Roberts et al., 2013; Taylor et al., 2013), which can occur in strand-coordinated clusters. These deaminases prefer single stranded DNA and an important issue is how the single stranded APOBEC substrate is generated. Experiments using yeast as a model system showed that the transitory single stranded regions that arise at replication forks or during double strand break repair can accumulate such mutations upon chronic alkylation of DNA (Roberts et al., 2012). These mutations were also observed at experimentally introduced double strand breaks coincident with overexpression of AID/APOBEC deaminases (Taylor et al., 2013).
Here, we determined directly whether repair of the naturally occurring T/G and U/G mismatches would be mutagenic to flanking DNA in mammalian cells that were not stressed by genotoxic agents. We introduced these mismatches (and other mispairs and lesions) into an SV40-episome that can replicate in human cells. Numerous studies have shown that processing lesions harbored by such episomes (as well as those introduced in the SV40 virion) faithfully captured the DNA repair repertoire and mutational environment of its host (e.g., Hare and Taylor, 1985; Seidman et al., 1985; Brown and Jiricny, 1987; Choi and Pfeifer, 2005; Terai et al., 2010; Pathania et al., 2011; Qi et al., 2012). We found that the introduced mispairs were invariably corrected. However, their repair generated mutations in normal flanking DNA at statistically higher rates than the no mismatch control, but only if the episomes were passed through mammalian cells.
Regardless of the lesion, most of the mutations involved the C of TpCpN, and shared other features of the aforementioned mutations that occur in some cancers. siRNA knockdowns showed that the TpC-preferring deaminases, particularly A3B mediated the mutagenic effect, and chromatin immunoprecipitation (ChIP) showed that A3B could access the episome in a mismatch dependent way. siRNA knockdowns also showed that components of both the BER and MMR pathways are involved in generating the single-stranded APOBEC substrate and that two factors that have diverse roles in various aspects of DNA metabolism, ataxia telangiectasia and Rad3-related protein (ATR) and PCNA (Moldovan et al., 2007; Flynn and Zou, 2011) modulate the mutagenic effect.
The APOBEC deaminases are relatively ubiquitous in various tissues (Refsland et al., 2010), as is the potential for generating single-stranded templates from BER processed lesions. These would not only include the mispairs generated from C and methyl-C by their spontaneous deamination as we found here, but potentially also those involving other normal metabolites of methyl-C (e.g., Guo et al., 2011; Wu and Zhang, 2014) and the thousands of BER substrates that arise daily from other naturally occurring degradative processes that affect DNA (Atamna et al., 2000; Barnes and Lindahl, 2004). The implications of our findings in light of these issues are discussed.
Results
DNA repair can induce mutations in flanking DNA
Figure 1A shows the SV40-based episome (shuttle vector) (Parris and Seidman, 1992) into which we inserted a mismatch region (MM1). Both strands of MM1 contain two sites for a different pair of single strand restriction enzymes, which facilitate the exchange of either strand with an exact complement to generate a no mismatch (0 MM) control or with an oligonucleotide that would generate a mismatch or other lesions (Figure 1B, ‘Materials and methods’, Figure 1—figure supplements 1 and 2). Supplementary file 1 lists the oligonucleotides used and their corresponding numbers are given in the figures. After passage in mammalian cells we screened the episomes for mutations by blue/white selection in E. coli (Figure 1B) and determined the DNA sequence of the reporter cassette of the episomes from all the white colonies.
Figure 2A shows that the repair of different types of mismatches or lesions - T/G, 5-hydroxymethyl-U (hmU)/G, U/G, or an abasic site opposite a G, (ab)/G – induced significantly more mutations in the reporter region than we found with the 0 MM control. HmU is a byproduct of the enzymatic demethylation of methyl-C (Bhutani et al., 2011; Guo et al., 2011; Wu and Zhang, 2014) and abasic sites are generated during BER (e.g., Robertson, 2009; Jacobs and Schar, 2012). For convenience, we refer to both mismatches and ab/G sites as lesions. Mutation frequency is the percent (%) white colonies (per total screened) that contained undeleted episomes. We did not consider deletions because most were missing all or part of the reporter region. These deletions had resulted from our initial method of vector preparation and were essentially eliminated by its subsequent modification (Figure 2—figure supplement 1 and ‘Materials and methods-Vector preparation’). The few percent that persisted were unrelated to either the type or even presence of an introduced DNA lesion (see ‘Materials and methods-Data acquisition and analysis’). Finally, no mutated episomes were obtained if they were passed directly into E. coli (3 deleted episomes /42,000 colonies screened, results not shown).
The relative mutagenic effect of repairing a given type of lesion on a given strand (top or bottom) was more or less indifferent to its number (up to three), context (i.e., CpG or non-CpG) or position(s) in the MM region (Figure 2—figure supplement 2; Supplementary file 1). Thus, the mutagenic effects of repairing the various configurations of T/G, hmU/G and U/G shown in Figure 2A can be recapitulated by a given set of these lesions (i.e., 2 T/Gs, 2 hmU/Gs or 2 U/Gs at the same positions in the MM region, the uppermost bar graph of Figure 2—figure supplement 2).
The top or bottom strand location of the lesions differentially affected their mutagenic effect: repair of top strand T/Gs generated ∼fourfold more mutated episomes than the 0 MM control (respective mean percent mutation frequency 0.107 vs 0.025). HmU/G repair induced ∼sevenfold more mutations (0.182%), and U/G or ab/G repair induced ∼30-fold more mutations (0.726% and 0.78% respectively) than the 0 MM control. These differences were statistically significant (legend, Figure 2A). The bottom strand location reduced the mutagenic effect of repairing U/G or ab/G, but not of T/G or hmU/G. Repair of bottom strand T/G was ∼twofold more mutagenic than on the top strand: 0.23% ± 0.013 vs 0.11% ± 0.011 (mean ± standard error, p<10−7, Fisher exact test).
The differences in mutagenesis were not due to differences in repair efficiency. Figure 2B shows that, except for T/G, essentially all of the top and bottom strand lesions were restored to C/G. T/G was converted to T/A about 25% of the time. This would result from using the T-containing strand as the repair template (see section, ‘The mutational spectrum, sequence context, and fate of the bases mutated in response to DNA repair are consistent with APOBEC-mediated mutagenesis’). The mutated plasmids represented repair of 426 T/G, 116 hmU/G, 293 U/G and 90 ab/G mismatches (combined top and bottom lesions, Figure 2B), and the percent restoration to the C/G base pair was: 72.3, 99, 100 and 100 respectively. The values for the restoration of lesions in the non-mutated episomes (isolated from blue colonies), were 75% of 111 T/G, 95.8% of 24 hmU/G, 99% of 105 U/G, and 100% of 7 ab/G mismatches. Thus, the ratio of restoring T/G to C/G or converting it to T/A was independent of repair-induced mutagenesis. Conversion of T/G mismatches to T/A had also been observed with T/G-containing SV40 virion DNA (e.g., Hare and Taylor, 1985; Brown and Jiricny, 1987). The mutagenic effect was also independent of the G+C content of the mismatch region; 69% G+C for FM1, 39% for FM2, and 53% for FM3 (Figure 2—figure supplement 3). Thus, mutagenesis was not affected by the presumed stability of the MM region helix.
The distance between the lesion and the cis 3′ end of the reporter region affects mutagenesis
The distance between the cis 3′ end of the reporter region and the lesion is <50 bp for top-strand, but ∼5 kb for bottom-strand lesions in FM1 (top panel Figure 2C). This difference accounts for their different mutagenic effects because relocating the MM region upstream of the reporter region (M1F) switches these distances (now <50 bp for bottom—but ∼5 kb for top-strand lesions respectively), and it also switches the extent of their respective mutagenic effects (cf. top and middle panels of Figure 2C). Reversing the orientation of the entire reporter cassette (the DNA between the EcoRI and NheI sites [Figure 1A] to produce FM1_R [Figure 2C, lower panel]) has the same effect. These results also show that strand-specific mutagenesis is neither due to transcription nor DNA replication effects (i.e., transcribed vs non-transcribed strand, and leading vs lagging strand, e.g., Fijalkowska et al., 1998; Hanawalt and Spivak, 2008) imposed on the reporter region by the episome backbone.
BER would be recruited to T/G, hmU/G and U/G mismatches (Robertson, 2009; Jacobs and Schar, 2012). Removal of the mismatched T, hmU, or U by a glycosylase and cleavage of the DNA 5′ of the ensuing (or introduced) abasic site (ab), and modification of the 3′ ends ultimately generates a single strand break (SSB, arrow immediately 5′of the green triangle Figure 2A, also see Figure 4A) that could be extended in the 5′ to 3′ direction by either a single (short patch) or several (long patch) nucleotides. Neither is error prone nor would BER remove the DNA strand on the 5′ side of the lesion (green triangle, Figure 2A), which would expose its complement as the template for the error-prone process that would generate mutations in the reporter region. However, components of the MMR pathway can hijack U/G-BER intermediates and generate such gapped substrates (e.g., Kadyrov et al., 2006; Schanz et al., 2009; Pluciennik et al., 2010; Peña-Diaz et al., 2012) and we show later in the paper that components of both BER and MMR are involved in the mutagenesis.
The most right hand bar graphs in the upper and middle panels in Figure 2C also show that a preformed SSB (i.e., one did not result from BER activity on the introduced mismatches) can also induce mutagenesis in flanking DNA. Here, we replaced respectively the top or bottom strands of FM1, and bottom or top strands of M1F with non-5′ phosphorylated versions of the 0 MM control oligonucleotide. The preformed SSB <50 bp from the cis 3′ end of the reporter region in the top strand of FM1 or bottom strand of M1F generates the same mutagenic effect as U/G-ab/G at these positions. We also obtained this result when using non-5′ phosphorylated versions of T/G-mismatch producing oligonucleotides (Figure 2—figure supplement 4). Therefore, processing this nick bypasses T/G repair, and generates a substrate that is susceptible to mutagenesis. However, unlike the other lesions, a preformed SSB at ∼5 kb from the cis 3′ end of the reporter region (bottom and top strand respectively for FM1 and M1F) has little if any mutagenic effect. This difference could reflect the fact that these SSBs would be substrates for a SSB repair (SSBR) pathway that would recruit proteins different from those at the SSB generated during BER (Caldecott, 2008).
The mutational spectrum, sequence context, and fate of the bases mutated in response to DNA repair are consistent with APOBEC-mediated mutagenesis
We determined the fraction of total mutations contributed by each base of the reporter and mismatch regions, and the identity of the base to which it was mutated (i.e., its mutational fate), and report these results in terms of the top strand sequence. (Supplementary file 2 lists all the mutations for each mutated episome.) The pooled data for top strand T/G or U/G that were restored to C/G are shown in Figure 3A,B respectively. The mutations are distributed throughout the cassette, mostly on G, irrespective of the type of lesion (summarized in Figure 3C). Mutations of either G or C would equally affect the integrity of the supF tRNA stems and potentially produce a non-functional tRNA. Thus, the skewed mutational pattern toward G does not likely reflect an ascertainment bias. In addition, the sequence context of ∼80% of the G mutations is GpA, but with less specificity for the 5′ base (Figure 3D). Figure 3—figure supplement 1 shows that essentially the same mutational spectrum was induced by restoring each type of top strand lesion—that is, T/G, hmU/G, U/G or ab/G to C/G. Thus a similar mutagenic process is recruited during repair of any of these lesions.
The restoration of top strand lesions to C/G would employ the bottom strand as the repair template. Thus, the preponderance of mutations on the G of GpA indicates that its complement on the repair template, the C of TpC, is targeted by the mutagenic process that is recruited during repair. This preference implicates the TpC-preferring APOBEC family members of single strand-specific cytidine deaminases. The left panel of Figure 4A outlines how this could occur during the restoration of top strand lesions to C/G after the repair process was initiated by BER at the lesion (green triangle), and subsequent resection of the lesion-containing strand by MMR as described in the previous section. Deamination of the C of TpC would generate TpU, and as discussed by others (Lange et al., 2011; Nik-Zainal et al., 2012; Peña-Diaz et al., 2012; Roberts et al., 2012; Burns et al., 2013; Leonard et al., 2013; Roberts et al., 2013; Taylor et al., 2013) further processing of the U could generate a number mutational outcomes of the original C (and its complement G). Faithful copying of the U in the repair template would produce an A/U pair. Subsequent replication would result in a G to A transition (complementary C to T transition). On the other hand, the A/U base pair could also be a substrate for a U glycosylase, which would remove the U and generate an abasic site. Subsequent replication of this strand by a high fidelity DNA polymerase would generally insert an A opposite the abasic site–that is, the ‘A rule’, (Strauss, 2002). The fact that the most frequent mutational outcome at GpA was a G to A transition is consistent with the above two mechanism of generating an A opposite the original C of TpC (Figure 4B, upper left quadrant). On the other hand, replication across the abasic site by various other DNA polymerases could insert a T or C opposite the abasic site and generate G to T or C transversions (Lange et al., 2011), which we also found (Figure 4B, upper left quadrant). As a consequence approximately equal amounts of transitions and transversions result from repair-induced mutagenesis, an outcome that was also observed for APOBEC-mediated mutations in some tumors (Burns et al., 2013; Leonard et al., 2013). All the data shown in Figure 4B are the pooled mutational fates induced by repair of all the lesions, and similar results were found for each type of lesion (results not shown).
The right hand side of Figure 4A illustrates the steps when bottom strand lesions are restored to C/G. In this case, the top strand serves as the repair template, but because we show all the mutations in terms of the top strand sequence the final mutational outcome is basically the complement of the results generated from the repair of top strand lesions. Figure 3—figure supplement 2 shows the fate and mutational spectra (summarized in Figure 3—figure supplement 3) induced by repair of bottom strand lesions. Now, most of the mutations are on C of TpC when the lesions are restored to C/G (Figure 3—figure supplement 4), and, as the lower left quadrant of Figure 4B shows, the most common mutational outcome involves C to T transitions and with about an equal amount of transversions, to A and G.
The right side of Figure 4B (gray box) shows the mutational outcome of converting top or bottom strand T/Gs to T/As. In these cases, the top or bottom strand respectively serves as the repair templates (indicated in red font, right side Figure 4B), and the mutational outcomes parallel those that found for the restoration of bottom or top strand lesions to C/G wherein the top or bottom strands respectively were used as the repair template. Thus, when a top strand T paired with a bottom strand G is converted to T/A, most of the mutations are on C (Figure 3C, T/G_con), ∼80% of which involve the C of TpC with almost no specificity for the 3′ base (top panel, Figure 3—figure supplement 5). However, when a bottom strand T/G is converted to T/A, most mutations are on the G of GpA (Figure 3—figure supplement 3–T/G_con, Figure 3—figure supplement 5, bottom panel).
Approximately, equal amounts of G and C mutations are generated from the 0 MM controls (most left hand bar graphs, Figure 3C, Figure 3—figure supplement 3). These mutational spectra and their dinucleotide sequence contexts resemble what would result from repair of a mixture of top and bottom SSBs (Figure 3—figure supplement 6, Figure 3—figure supplement 7), and are thus consistent with that expected from repairing trace amount of random nicks on both strands.
APOBEC-mediated mutations in some cancer cells showed a strong preference for the C of TpCpW (W is T or A) (Roberts et al., 2013). The trinucleotide context for opposite strand G mutations would be ApGpA and TpGpA. Although the trinucleotide context of about half of our G mutations was ApGpA, the others were mostly CpGpA and not TpGpA, which not surprising as the latter trinucleotide is represented only once in the reporter region (Figure 3—figure supplement 8). These results indicate that the APOBECs do not strongly discriminate between the nucleotides that are 3′ to TpC (readily apparent in Figure 3—figure supplement 4). A lack of discrimination on 3′ nucleotides was also shown in vitro for APOBEC3B (Burns et al., 2013), and reflected in the mutational effect of APOBECs on HIV (Bishop et al., 2004; Doehle et al., 2005), and in this case the very low preference toward TpCpG (opposite strand CpGpA), reflected the strong bias against CpG in the HIV genome (van der Kuyl and Berkhout, 2012).
Repair-induced mutagenesis can cause clustered mutations
Clustered mutations can be characteristic of APOBEC-mediated TpC (GpA) mutations in tumors (Nik-Zainal et al., 2012; Roberts et al., 2012, 2013; Burns et al., 2013; Taylor et al., 2013) and also for AID-induced mutations in lymphoid cells (e.g., Peled et al., 2008). Figure 3 and Supplementary file 2 showed that the reporter/mismatch region (∼300 bp) of some episomes contained more than one mutation. Thus, mutations can occur in a concerted fashion, which is the case for 25–30% of the episomes (Figure 5A,B). This figure also shows that the mutational spectra and the dinucleotide contexts of the clustered and unclustered mutations induced by repair of top strand lesions did not differ, and all exhibited the mutational preference for the complement of the C of TpC, that is, the G of GpA. However, the spectra and dinucleotide context of clustered and unclustered mutations induced by repair of bottom strand lesions differed. In contrast to the clustered mutations, which had the same mutational APOBEC signature as the top strand lesions (Figure 5B), the unclustered mutations show less of a preference towards mutating the C of TpC, most evident for those induced by restoration of T/G to C/G (cf. lower 2 panels, Figure 5A and the lower panels labeled unclustered, Figure 5B). Perhaps, the generation of unclustered mutations with a non-strictly TpC mutational signature in the reporter region is facilitated by the ∼5 kb distance between its cis 3′ end and the lesion (top panel, Figure 2C).
TpC preferring APOBEC C deaminases mediate repair induced mutagenesis
The results in the foregoing two sections strongly implicated the involvement of APOBEC deaminases in repair-mediated mutagenesis. We examined this possibility in two ways: first, after determining the repertoire of APOBEC deaminases in the HeLa cells used for the foregoing experiments, we examined the effect of their knockdown by siRNA on repair-induced mutagenesis. Then, we used ChIP to determine if APOBEC could access the episome in response to an introduced lesion.
siRNA knockdowns of APOBEC deaminases
qRT-PCR (Refsland et al., 2010; Burns et al., 2013) showed that several APOBEC deaminases, including A3B, A3F and A3C (Conticello et al., 2007; Prochnow et al., 2009) are expressed in the HeLa cells (HeLa_JM) that was used for all the experiments presented here unless otherwise stated (Figure 7A). A3B, A3F, and A3C exhibit >40% preference for TpC (Bishop et al., 2004; Langlois et al., 2005), and Figure 6A shows that siRNA directed at them reduced their expression. While knockdown of any one of them marginally affected the mutational frequency, knockdown of two (A3B and A3F) inhibited mutagenesis associated with T/G and U/G repair (middle panel, Figure 6A). Although knocking down all three did not further reduce the mutagenic effect, the lower part of Figure 6A shows that only knocking down all three deaminases (siA3(B,F,C)) eliminates the mutational signature associated with deamination of the C of TpC. However, even though the single knockdowns did not substantially inhibit the mutational frequency, they do change the mutational signature with respect of the dinucleotide context (hereafter referred to as the dinucleotide signature) consistent with the extent of their preferences for TpC (Bishop et al., 2004; Langlois et al., 2005). Thus, knocking down A3C (46% preference for TpC) hardly alters the dinucleotide signature, whereas knocking down A3B (91% preference) had a more profound effect, and the knockdown of A3F (intermediate at 77%) produced an intermediate effect on this signature. Although the numbers of mutations analyzed are relatively low, these data suggest that while the three deaminases compete for substrates generated during repair, A3B is the most important contributor to repair-induced mutagenesis.
The results in Figure 6B support a predominant role for A3B. Expression of hemagglutinin (HA)-tagged A3B, which is resistant to siA3B (top panel, see ‘Materials and methods’) reversed the mutagenic effect of the triple knockdown of A3B, A3F and A3C (siA3(B,F,C), middle panel), and restored the TpC mutational signature, but not if it lacked deaminase activity, A3B_Cat (bottom panel, Figure 6B and bottom two bar graphs, Figure 6—figure supplement 1). This result corroborates the requirement for the deaminase activity. Although we only recovered five mutants in the latter experiment—due to the greatly reduced mutagenic activity, the results agree with those of the triple knockdown shown for the ten mutations shown for siA3(B,F,C) in Figure 6A.
APOBEC3B accesses the SV40 episome in a lesion-specific way
We carried out ChIP to directly determine if the A3B deaminase can access the lesion-containing episome. Figure 6C shows that A3B binds T/G- or U/G-containing FM1 episomal DNA in parallel with their relative mutagenic effects and their top or bottom strand location. Thus, more deaminase is bound to the supF reporter of top strand U/G- than bottom strand U/G-containing episomes (cf. mutational frequencies in Figure 2, panel C). Also, as predicted from Figure 2C, the extent of deaminase binding to the supF reporter of T/G-containing episomes is indifferent to the top or bottom strand location of the mismatch. Figure 6C also shows that the deaminase binds to other episomal regions, for example, the T antigen-containing region for both mismatches. This would be expected as the mutagenic effect can extend over 5 kb for both lesions, which is undiminished for T/G but not for U/G. Consistent with this difference, the deaminase binds both supF- and T antigen-containing regions to about the same extent for either the top or bottom strand location of T/G. However, considerably more deaminase binds to the supF reporter gene than T antigen gene for top strand U/G wherein the U is closer to the cis 3′ end of the supF reporter gene than to the T antigen gene (Figure 6C, top panel). On the other hand, somewhat more deaminase binds to the T antigen gene than supF for bottom strand U/G wherein the U is closer to T antigen gene than to supF gene (Figure 6C, bottom panel).
Although these results show that A3B, A3F and A3C are necessary for repair-induced mutagenesis, their presence might not be sufficient. We also screened other cell lines, including a second HeLa cell isolate, HeLa_KU, for their contents of APOBEC family members as well as the extent to which T/G and U/G repair are mutagenic (Figure 7). Two of the cell lines (HeLa_KU and 2102ep) exhibited distinct dinucleotide mutational signatures (Figure 7C). Also the repertoire of APOBEC enzymes in these cell lines differed, even quite dramatically between HeLa_JM and HeLa_KU (Figure 7A). Nonetheless, all the cell lines exhibited roughly similar levels of T/G-induced mutagenesis: about the same for HeLa_JM, 143b, and HeLa_KU, and about one-half as much for the other three cell lines (Figure 7B). In contrast, differences between the extent of U/G repair-associated mutagenesis were more marked. Thus, U/G repair was about fivefold more mutagenic in HeLa_JM than in HEK293 although their APOBEC repertoires were comparable. Therefore, factors other than just the availability of these deaminases are required for repair-induced mutagenesis, in agreement with observations on some cancer cells (Roberts et al., 2013).
siRNA knockdowns implicate the involvement of DNA repair pathways in generating the APOBEC deaminase substrate
As APOBEC-mediated mutagenesis occurred in response to DNA repair, we used siRNA to knock down selected components of different DNA repair pathways and two proteins that would be expected to process or respond to the lesions that we introduced. For these experiments, we treated the reconstituted episomes briefly with the 5′–3′ exonuclease− Klenow polymerase just before transfection into the mammalian cells to remove traces of gapped plasmids (see Figure 8—figure supplement 1, the legends to Figures 8 and 9, ‘Materials and methods’).
BER and MMR
T/G and U/G mispairs are substrates for BER, and BER-repair intermediates generated from U/G were shown to be hijacked by MMR. This process could expose a single stranded template 5′ of the lesion (Figures 4A and 10; Kadyrov et al., 2006; Schanz et al., 2009; Pluciennik et al., 2010; Peña-Diaz et al., 2012), which could be accessed by the deaminase. Thus, both pathways would be expected to be involved in repair-induced mutagenesis.
The first step in BER involves a glycosylase (Figure 10). However, U/G is a substrate for four glycosylases (UNG, SMUG, TDG, MBD4), and T/G for two (TDG, MBD4) (e.g., Cortázar et al., 2007; Robertson, 2009; Jacobs and Schar, 2012). Therefore, we knocked down two BER proteins (APE1 and XRRC1, Figure 10) that act downstream of the glycosylases. We also knocked down a component of the heterodimers (MutSα, MutLα) that initiate MMR (MSH2 and MLH1, Figure 10). Figure 8 shows that the knockdown of any one of the above proteins was sufficient to reduce mutagenesis induced by repair of T/G or U/G. While the maximal effect of the knockdowns of U/G-induced mutagenesis was about the same for the BER or MMR, the maximal inhibition of T/G-induced mutation by MMR knockdown was greater than that attained by the BER knockdown. This result is consistent with a model whereby MMR can not only highjack BER intermediates generated from a T/G mismatch but also directly access the mismatch, that is, independently of BER. Direct access of T/G by MMR has been demonstrated in vitro (Pluciennik et al., 2010; Peña-Diaz et al., 2012). The knockdown of neither BER nor MMR affected the mutational signature of induced mutagenesis (Figure 8—figure supplement 2). Thus, these pathways act upstream of the APOBEC deaminases.
PCNA and ATR
PCNA and ATR have various roles in DNA metabolism. They can interact with both BER and MMR proteins and the DNA products generated by these pathways (Moldovan et al., 2007; Schanz et al., 2009; Pluciennik et al., 2010). PCNA acts as a DNA replication clamp that is involved in numerous aspects of DNA replication and can also be involved in both generating gapped substrates and assembling DNA replication complexes on them (Andersen et al., 2008; Caldecott, 2008). ATR binds to single-stranded gaps whereupon its kinase activity is activated and phosphorylates various cellular factors (Liu et al., 2010; Flynn and Zou, 2011; Nam and Cortez, 2011; Pabla et al., 2011).
Figure 9A shows that the knockdowns of PCNA and ATR have different effects on T/G and U/G repair-induced mutagenesis. Inhibiting PCNA synthesis somewhat reduced T/G-induced mutagenesis, but ATR knockdown had no effect. In contrast, PCNA knockdown stimulated the mutagenic effect of U/G repair, but ATR knockdown inhibited its effect. Simultaneous knockdown of both proteins returns the mutagenic effect of U/G repair to control levels, a result that suggests the effects of both proteins converge on a common pathway. However, the double knockdowns did not alter the inhibition of T/G-induced mutagenesis by PCNA knockdown, not unexpected as T/G repair-induced mutagenesis is indifferent to ATR. Panels B and C show that siRNA-resistant versions of these genes, PCNA_R or ATR_R rescued the effects of the knockdowns. For comparison, the PCNA and ATR results from Figure 9A are shown respectively in panels B and C as cross-hatched bar graphs. Figure 9C shows that the kinase activity of ATR is essential for its effect on U/G-repair-induced mutagenesis and further corroborates the siATR knockdown, because ATR deficient in kinase activity does not rescue it.
The difference between the effects of the PCNA and ATR knockdowns on the mutagenic effect of T/G and U/G repair further differentiates their repair-intermediates as was revealed by the results shown in Figures 2, 6C, 7B and 8A. Neither PCNA nor ATR knockdown altered the mutational signature of repair-induced mutagenesis (Figure 9—figure supplement 1). Thus, similar to the BER and MMR proteins, PCNA and ATR act upstream of the APOBEC deaminases. Although our results do not provide any mechanistic insights on the roles of PCNA and ATR on APOBEC-mediated mutagenesis, they do indicate the levels of, or access to, APOBEC substrates are sensitive to the activities of these proteins.
Discussion
Here, we showed that repair of normally occurring mismatches (i.e., those that arise from the inherent chemical instability of DNA) can be mutagenic to flanking DNA. Figure 10 summarizes our results, which suggest that APOBEC-mediated mutagenesis induced by DNA repair acts downstream of the DNA repair pathways that generate the obligatory single-stranded substrates for TpC preferring APOBEC deaminases (Conticello et al., 2007). siRNA knockdowns (bold font depicts the factors that we tested) indicate that components of both BER and MMR are required to generate the deaminase substrates from both T/G and U/G, and that PCNA and ATR could modulate the availability of these substrates to the deaminases. With respect to U/G and T/G, we depict the interaction of the BER and MMR pathways in the framework of that proposed from elegant biochemical studies that showed that MMR can hijack the 3′ end of the nick that would result from a U/G-BER intermediate and generate a single stranded region 5′ of the lesion that could serve as an APOBEC substrate (Kadyrov et al., 2006; Schanz et al., 2009; Pluciennik et al., 2010; Peña-Diaz et al., 2012).
The cited in vitro studies from the Modrich group also showed that MMR could directly access T/G mismatches in a PCNA-dependent reaction, which we illustrate on the right hand side of Figure 10. In such cases, MMR could load on either strand of the T/G containing episome and lead to resection of either the bottom or top strand of the duplex. However, we can only currently score mutagenic events that occur on the 5′ side of the T/G wherein resection of the top strand would expose the supF reporter region. Thus, our results suggest an apparent 3′ bias for the MMR-mediated resection. We are now implementing experiments to examine the induction of mutations 3′ of the T/G.
Although the sensitivity of U/G-induced mutagenesis to the proximity of this lesion to the cis 3′ end of the reporter region is consistent with the foregoing biochemical results, some of our findings are at odds with the biochemical model—for example, the apparent inhibitory effect of PCNA on U/G-repair induced mutagenesis (knockdown increases its mutagenic effect, Figure 9) vs the requirement for PCNA in the MMR hijacking of BER (Kadyrov et al., 2006; Pluciennik et al., 2010). While these paradoxical results could be reconciled by considering differences between threshold concentrations of PCNA required for its various functions (e.g., Pluciennik et al., 2010), our results do not directly address the mechanistic details of the interaction of PCNA and the U/G-repair generated APOBEC substrate. Furthermore, while we depict PCNA and ATR converging on a common pathway downstream of the hijacked BER intermediate, our only rationale for doing so are the results in Figure 9A, which show that their simultaneous knockdown cancels the effects of their separate knockdowns.
Likewise, while the effects of knocking down various BER and MMR components, as well as the ATR and PCNA knockdowns distinguish the T/G- and U/G-repair intermediates, the mechanistic relationships between these pathways or factors, and the properties of the T/G repair intermediate that distinguish it from the U/G intermediate are not clear. The fact that the extent of T/G-induced mutagenesis is more sensitive to the knockdown of MMR than BER suggests that MMR can generate an APOBEC-sensitive substrate independent of (i.e., in addition to) that mediated by BER (right hand side of Figure 10). Also, that an APOBEC-vulnerable substrate can be generated from either strand of a T/G mismatch—either restoration of T/G to C/G or conversion to T/A can induce mutagenesis—and that it is sensitive to PCNA knockdown (Figure 9) are consistent with studies in vitro that showed that MMR components can access either strand of T/G containing DNA in a PCNA-dependent reaction (Pluciennik et al., 2010). Given the relative insensitivity of the mutagenic effect of T/G to its distance from a cis 3′ end of reporter region, we tentatively propose that a migrating D-loop can be mounted on either strand of the episome to provide a top or bottom strand deaminase substrate (Figure 10).
Numerous recent studies support a role for the TpC preferring APOBEC C deaminases, especially APOBEC3B, in generating large numbers of mutations (i.e., the ‘mutator phenotype’ [Bielas et al., 2006; Venkatesan et al., 2006]) that characterize the progression, and perhaps initiation, of cancers (Nik-Zainal et al., 2012; Roberts et al., 2012; Stephens et al., 2012; Burns et al., 2013; Leonard et al., 2013; Roberts et al., 2013; Taylor et al., 2013). Less certain is the source of the single-stranded DNAs that are the preferred substrates for the APOBEC deaminases. Here, we directly showed that A3B can access the T/G and U/G repair intermediates generated in our mammalian model system (Figure 6C) and generate the mutational signatures induced in some cancers (Figure 6B, Figure 6—figure supplement 1). The hundreds of T/Gs and U/Gs that are generated daily by spontaneous hydrolytic deamination respectively of methyl-C and C would provide a continual source of potential APOBEC substrates. This source could also be potentially augmented by the thousands of other BER-processed lesions that arise daily due to additional spontaneous degradative processes (Atamna et al., 2000; Barnes and Lindahl, 2004).
Although our findings showed that DNA repair of T/G, U/G, and other lesions can generate APOBEC deaminase substrates, the composition of the repair intermediates that both renders them susceptible to APOBEC-mediated deamination and accounts for their distinct mutagenic properties are unknown. Components of both the BER and MMR pathways that are normally recruited to these lesions can also engage in complex interactions with factors involved in both DNA metabolism and other cellular functions (e.g.,Cortázar et al., 2007; Kovtun and McMurray, 2007; Jacobs and Schar, 2012; Peña-Diaz and Jiricny, 2012; Peña-Diaz et al., 2012). Additionally, the availability of APOBEC deaminases per se may not be sufficient to support repair-induced mutagenesis (Figure 7, also observed in certain cancers, Roberts et al., 2013). Thus some of the factors required to either generate APOBEC substrates from repair-intermediates, or recruit the deaminases to them, might not be present in all cells.
Given the fairly ubiquitous distribution of ABOPBEC deaminases in various cell types and tissues (Refsland et al., 2010), determining such factors will be important for evaluating the extent to which DNA repair intermediates can be accessed by APOBEC-mediated mutators, and whether DNA repair could be vulnerable to other error-prone processes as well. Such alternate processes might contribute to the distinctive mutational signatures of the unclustered mutations induced by repair of bottom strand lesions (Figure 5A,B). Therefore, an important next step is to characterize the composition of the mutagenic and non-mutagenic repair intermediates that assemble on the various mismatches. An advantage of generating these repair intermediates using preformed mispairs on a defined DNA as done here, is that they could be isolated from cells by selection for either the nucleic acid or a presumed protein component (e.g., Dèjardin and Kingston, 2009; Wu et al., 2011), or by the sequential application of each. The latter approach should enhance the chances of isolating genuine mutagenic intermediates, a prerequisite for more definitive analysis of their functional and biochemical properties.
Repair-associated mutagenesis might not only contribute to the genetic changes that underlie various disease states (e.g., cancer, as discussed above) but also aging and evolutionary change (as has been earlier suggested, e.g., Huttley et al., 2000; Walser and Furano, 2010), which in some instances might result from normal physiological processes. For example, we recently found that the repair of 5-carboxy-C/G, an intermediate of the physiological demethylation of methyl-C (Bhutani et al., 2011; Wu and Zhang, 2014), can also be mutagenic. Thus, the normal cycling of the epigenetic methyl-C mark could contribute to the high mutation rate of regulatory sequences thought to contribute both disease processes (Maurano et al., 2012) and evolutionary novelty (Wittkopp and Kalay, 2012).
Materials and methods
Vector preparation
Request a detailed protocolWe constructed various versions of a previously described SV40-based shuttle vector (Seidman et al., 1985) as described in detail in the next section. Our major modification was the introduction of a mismatch (MM) region immediately 3′ of the bacterial supF tRNA gene to generate the prototypical FM1 vector (Figure 1A). SupF tRNA suppresses an amber mutation in lacZamb bacteria (ß-galactosidase minus), which permits blue/white screening to assess the activity of the supF gene (loss of activity due to mutations in either its promoter region or coding sequence produces a white colony). We also introduced a bar code of eight random nucleotides to uniquely identify each clone (previously named the signature element, Parris and Seidman, 1992). In our initial experiments some episomal plasmids suffered large deletions that included all or part of the reporter cassette, which we found was due to over zealous attempts to remove episomes that lack the bar code by digestion with BstZ17I, which is unique to this region. However, we found that this step was not necessary and eliminating it largely eliminated the deletions (Figure 2—figure supplement 1). Nonetheless, all of the mutational frequencies reported here are the percentage of white colonies containing undeleted plasmids—‘Materials and methods—Data acquisition and analysis’. The design of the MM region follows the method of Hou et al. (2007), which facilitates removal and replacement of the top or bottom strand of a DNA duplex and is illustrated for the top strand in Figure 1B and Figure 1—figure supplement 1.
After nicking the MM region with the New England Biolabs (NEB, Ipswich, MA) enzymes, Nt.BbvCI (top strand) or Nb.BbvCI (bottom strand), the nicked strand was removed by incubation with its biotinylated complement followed by removal of the hybrid with streptavidin-magnetic beads. The gapped vectors were purified by extraction with phenol/chloroform and EtOH precipitation. A 100-fold excess of 5′-phosphorylated control (perfect complement, 0 MM) or oligonucleotides that would generate T/G, hmU/G, U/G or ab/G mismatches (lesions) when paired to the gapped vector were added, and the mixture was heated for 5 min at 95°C, then for 4 hr at 40°C, and after slow cooling to room temperature incubated overnight with T4 DNA ligase (NEB). Supplementary file 1 lists the oligonucleotides and their numbers are indicated in the figures. We monitored the efficiency of gapping and the reconstitution of the gapped plasmids by following the loss and restoration of the KpnI site between the nicking sites (Figure 1—figure supplement 1). For those mismatches that would eliminate an AatII restriction site in the mismatch region, we also monitored the reconstitution of the mismatch-containing episomes by their resistance to digestion by this enzyme (Figure 1—figure supplement 2, as has been done by others, e.g., Peña-Diaz et al., 2012). Using the T4 DNA ligase from NEB, but not from other sources, eliminated the need for purifying closed circular plasmids prior to their use (Walser, J-C and Aschrafi, A, unpublished observations).
However, the reconstituted episomes contained traces of gapped plasmids (Figure 1—figure supplement 1), which were inconsequential except when we knocked down various proteins involved in DNA repair (i.e., selected components of the BER and MMR pathways, or PCNA and ATR; see Figures 8–10). Most of these knockdowns increased the mutagenic effect of the 0 MM control (and presumably also that of the lesion-containing plasmids, results not shown). We reasoned that perhaps the trace amounts of gapped episomes are not readily repaired when various repair enzymes are depleted and thereby provide ready-made substrates for the APOBEC deaminases. This idea was supported by our finding that knockdowns of any of the tested repair proteins increased the mutagenesis induced by non-reconstituted gapped plasmids (results not shown). A brief treatment of the reconstituted vectors in the ligation reaction with the 5′–3′ exo− Klenow nuclease just prior to their transfection into mammalian cells reduced the amount of gapped plasmids to undetectable amounts (Figure 8—figure supplement 1), eliminated the increase of the mutagenic effect of the 0 MM control during siRNA knockdowns of repair proteins, but did not materially affect the extent of the mutagenesis induced by the 0 MM control or the lesion-containing episomes in experiments with the siRNA controls—cf. the siCtrl for 0 MM, T/G or U/G in Figures 8 and 9 with those in Figures 2, 6 and 7 (for HeLa_JM). The Klenow treatment consisted of a 10-min incubation at 37°C of 6 μl of the ligation reaction with 3 μl of a solution that contained 0.2 unit of Klenow polymerase (NEB, M0212), 3 × NEBuffer 2 and 100 μM of dNTP.
Shuttle vectors
FM1, 2, and 3
Request a detailed protocolpSP189 was digested with BamHI and MluI, and ligated to mismatch (MM) regions 1, 2 or 3, each of which contains a pair of the aforementioned single strand nicking enzyme sites on the top and bottom strands but they differ in G+C content. These were annealed from complementary oligonucleotides: MM1, InsA_BM_t, 5′GATCCCTCCTCAGCTGACGTCGGACGGAGGCGGCGGAACCTAGGTACCTCAGCCCGA3′/InsA_BM_b, 5′CGCGTCGGGCTGAGGTACCTAGGTTCCGCCGCCTCCGTCCGACGTCAGCTGAGGAGG3′; MM2, InsB_BM_t, 5′GATCCATCCTCAGCTGACGTCTAATACGATTATCGATATATAGGTACCTCAGCTTAA3′/InsB_BM_b, 5′CGCGTTAAGCTGAGGTACCTATATATCGATAATCGTATTAGACGTCAGCTGAGGATG3′; MM3, InsC_BM_t, 5′GATCCAGCCTCAGCTGACGTCTCGTACGATGATCGATCGATAGGTACCTCAGCTGAA3′/InsC_BM_b, 5′CGCGTTCAGCTGAGGTACCTATCGATCGATCATCGTACGAGACGTCAGCTGAGGCTG3′, to generate respectively the shuttle vectors: pSP189-FM1, pSP189-FM2 and pSP189-FM3.
We then introduced the bar code into these vectors by digesting them with SacI and NheI endonucleases and ligating each to a SacI/NheI-digested PCR fragment that was generated from the template: Sig_T, 5′TAGTACGCGTGAGCTCTANNNNNNNNTACGTACGGCTAGCAAGCTCAATT3′, with the following primers: Sig_F, 5′AGCTGAAAGGATGACTAGTACGCGTGAGCTCT3′ and Sig_R, 5′CCCGACCTCGACCCGAATTGAGCTTGCTAGCC3′. The random sequence of 8 bp can uniquely identify up to 48 possible members of a plasmid population. These shuttle vectors were called FM1, FM2, and FM3 respectively. Same site mutations that have different bar codes in a given experiment represent independent events. Thus these vectors were never propagated from a single colony but generated anew by inserting the bar code into their parental pSP189-FM1, pSP189-FM2 and pSP189-FM3 vectors respectively.
M1F
Request a detailed protocolThe supF gene was amplified from pSP189 with primers (SupF_BM_F, 5′AAAGGATCCTGTTGACAATTAATCATC3′/SupF_BM_R, 5′AAAACGCGTGGGTATTGAATTTCGGCC3′). The ensuing PCR product was restricted with BamHI and MluI, and ligated to the BamHI/MluI-digested pSP189 to generate pSP189_FF. pSP189_FF was digested with EcoRI and BamHI, and ligated to an MM1 that had been annealed from the complementary oligonucleotides (InsA_EB_t, 5′AATTCCTCCTCAGCTGACGTCGGACGGAGGCGGCGGAACCTAGGTACCTCAGCCCGG3′/InsA_EB_b, 5′GATCCCGGGCTGAGGTACCTAGGTTCCGCCGCCTCCGTCCGACGTCAGCTGAGGAGG3′) to generate the shuttle vector pSP189_M1F. pSP189_M1F was digested with SacI and NheI and ligated to the bar code sequence as we did above for pSP189_FM1 to generate the shuttle vector M1F.
FM1_R
Request a detailed protocolThe entire reporter cassette (Figure 1A) was amplified from FM1 with primers (SupF_NheI_F, 5′AAAGCTAGCTGTTGACAATTAATCATC3′ and Sig_EcoRI_R, 5′AGAATTCCTAGCCGTACGTA3′), restricted with EcoRI and NheI and ligated to the EcoRI/NheI-digested pSP189 to generate shuttle vector FM1_R.
The structures of all the vectors were verified by DNA sequencing.
Transfection of mammalian cells, blue/white colony screening, and siRNA knockdown
Request a detailed protocolCells were seeded in a six-well plate at a density of 3 × 105 per well. After 24 hr, we added 100 µl of serum-free DMEM that contained 6 µl each of the overnight ligation mixture (1 µg of reconstituted plasmid) and 6 µl of Fugene 6. After 48 hr, the plasmids were extracted from the cells with Wizard Plus SV Miniprep kit (Promega, Madison, WI), digested with DpnI (removes un-replicated input plasmid) and electroporated into E. coli MBM7070 (lacZuag_amber), which were grown on LB plates containing 100 µg/ml ampicillin, 1 mM IPTG and 0.03% Bluo-gal (Invitrogen/Life Technologies, Grand Island, NY). After incubation at 37°C overnight and at room temperature for another day (for maximal color development), the plasmids from white colonies were collected and analyzed for deletions by PCR (primer: F4914, 5′CCAGCGTTTCTGGGTGAGCA3′/R250, 5′TTTTTGTGATGCTCGTCAGG3′, which respectively flank the EcoRI and NheI sites that encompass the reporter cassette, Figure 1A). The mutation frequency is the number of white colonies that contained undeleted plasmids/total number of colonies. Directly electroporating reconstituted plasmids into E. coli generated no white colonies with undeleted vectors (3 deleted vectors /42,000 screened, results not shown). Thus, the mutagenic effect of DNA repair requires passage through mammalian cells.
For siRNA knockdown, we added 20 pmol siRNA (in a total volume of 400 µl serum-free DMEM that contained 6 µl Lipofectamine RNAiMAX, Invitrogen/Life Technologies) to cells immediately after they were plated. To rescue the siRNA knockdowns, we added 1 µg of the siRNA-resistant plasmid in 400 µl serum-free DMEM that contained 3 µl Lipofectamine LTX, Invitrogen/Life Technologies 24 hr after the siRNA was added, and these cells were transfected 24 hr later with the reconstituted plasmids. We changed the media before each addition.
RNA extraction and quantitative RT-PCR (qRT-PCR)
Request a detailed protocolWe extracted total RNA with the PureLink RNA Mini Kit, Ambion/LifeTechnologies according to the provided instructions and synthesized cDNA with ProtoScript II Reverse Transcriptase (NEB). RNA (2.5 μl @ 400 ng/μl) and 1.5 μl random hexamer DNA primers (100 μM) were heated at 65°C for 10 min and immediately put on ice. The reverse transcription mixture containing 4 μl 5 × RT buffer, 2 μl 10 × DTT, 1 μl 10 mM dNTP, 1 μl RNAse inhibitor (SUPERaseIn, 20 U/μl, Ambion/LifeTechnologies), 1 μl ProtoScript II Reverse Transcriptase (200 U/μl) and 7 μl nuclease-free water. The reactions were incubated at 25°C for 5 min, 42°C for 1 hr and then 80°C for 5 min to inactivate the enzyme. The reactions were diluted with 80 μl of nuclease-free water and qPCR was performed as described in Refsland et al. (2010) and Burns et al. (2013).
Preparation of APOBEC3B, PCNA, and ATR expression plasmids
Request a detailed protocolThe target site of the human apobec3b siRNA is located in the 3′ UTR of the native mRNA. Thus, the APOBEC3B expression plasmid, A3B, which contains only the apobec3b protein coding sequence, is resistant to apobec3b siRNA. A catalytic deficient version of the plasmid, A3B_Cat, was made by introducing the mutations E68A and E255Q (Burns et al., 2013). The PCNA gene was amplified from Mammalian Gene Collection—Human (ATCC, Manassas, VA MGC-8367) with primers (PCNA_BAMHI_F, 5′TTCAAAGGATCCCGTTCGCCCGCTCGCTCTGAGGCT3′/OTB7_R, TTTTTGTTTGCAAGCAGCAGATTAC), digested with BamHI and XhoI and then ligated to BamHI/XhoI digested pcDNA3.1 (+) (Invitrogen) to generate the PCNA-expressing plasmid PCNA_WT. The siRNA target site in PCNA_WT was changed to 5′AATAGAAGACGAGGAGGGT3′ to generate the RNAi-resistant plasmid, PCNA_R.
The ATR expression plasmids (ATR_WT and ATR_KD) were gifts from Dr Stephen J Elledge (Cortez et al., 2001) and the siRNA target site was changed to 5′TACCCGTCTTCTCAGAATAGCTGCA3′ to generate RNAi-resistant plasmids ATR_R and ATR_KDR. All site-directed mutagenesis were performed with Agilent, Santa Clara, CA, QuikChange II Kits.
siRNA
Request a detailed protocolControl siRNA (no target in mammalian cells, R0017) was purchased from Abnova, Walnut, CA. siRNA against atr (5′AACGAGACTTCTGCGGATTGCAGCA3′) and msh2 (5′AATCTGCAGAGTGTTGTGCTTAGTA3′) were supplied by Invitrogen (Stealth RNAi siRNA duplex). SiRNA against xrcc1 (M-009394-01-0005), ape1 (M-010237-00-0005), mlh1 (J-003906-10-0005), pcna (D-003289-10-0005, M-003289-02-0005 for rescue), apobec3b (J-017322-08-0005), apobec3c (J-013711-08-0002) and apobec3f (J-019039-17-0005) were purchased from Dharmacon RNAi Technologies / GE Healthcare, Fairfield, CT.
Antibodies
Antibodies were purchased from the following sources: antibodies to ATR (ab10312), MSH2 (ab52266)–Abcam, Cambridge, MA; to PCNA (#2586), XRCC1 (#2735), APE1 (#4128), MLH1 (#3515)–Cell Signaling Technology, Boston, MA; to HA (HA-7, H3663); FLAG (Ctrl IgG for ChIP, F7425)–Sigma, St. Louis, MO.
Chromatin immunoprecipitation (ChIP)
Request a detailed protocolChIP assays were performed according to a published protocol (Carey et al., 2009). HeLa cells were seeded in a 6-cm dish plate at a density of 8 × 105 and after 24 hr, 1 µg of the A3B-3HA expressing plasmid in 400 µl serum-free DMEM that contained 3 µl Lipofectamine LTX was added. After another 48 hr, 200 µl of solution of serum-free DMEM that contained 12 µl each of the overnight ligation mixture (2 µg of mismatch/DNA lesion-containing plasmids) and 12 µl of Fugene 6 was added to the cells and after 4 hr cells were harvested for ChIP assay. The qPCR primers for detecting the SupF region are: SupF_F, 5′GGGGCGAAAACTCTCAAGGATCTTACCGCTG3′ and SupF_R, 5′GGGATCCGGGTATTGAATTTCGGCCGTG3′; for the T antigen region are: 189LT_F, 5′CCAGCCATCCATTCTTCTATGTCAGCAGAGCC3′ and 189LT_R, AAGAACAGCCCAGCCACTATAAGTACCATGAA.
Cell lines
Request a detailed protocolHeLa_JM was provided by Dr John Moran (University of Michigan), and HeLa_KU by Dr Karen Usdin (NIH), 2102ep (a human embryonal carcinoma cell line) was provided by Dr Tom Fanning, HEK293 was provided by Dr Roland Owens (NIH), and PA-1 (an ovarian teratocarcinoma cell line) and 143B (an osteosarcoma cell line) were purchased from ATCC. Unless otherwise indicated all the experiments were carried out with HeLa_JM cells.
Data acquisition and analysis
Request a detailed protocolEach reconstituted vector was introduced into various cells in two or more independent transfections for a total of ∼500 distinct transfections. The replicated plasmids isolated from each transfection were subject to one or more independent trials of blue/white screening. The plasmids from all the white clones were screened for deletions by PCR and all those with undeleted plasmids from a given transfection were sequenced. More than 99% of the undeleted plasmids contained at least one mutation in the reporter region (data not shown). The ratio of white colonies with mutated undeleted plasmids to the total number of colonies examined in the blue/white trials for a given transfection is taken as the mutation frequency.
We aligned the sequences from each transfection to its relevant starting sequence using SEAVIEW (Galtier et al., 1996). Overall, we determined the DNA sequences of ∼2600 undeleted plasmids, about 5% of which contained small indels in the reporter region that could not be detected by PCR screening. We excluded these sequences from further analysis because their frequency was not correlated with either the type of introduced lesion or even the presence of one. We grouped the rest (∼2500) into 155 distinct alignments. We also determined the DNA sequences of ∼250 blue clones, grouped into 38 alignments. We parsed these alignments and computed mutational data using custom Unix, Perl, and R scripts (R Foundation for Statistical Computing, http://www.R-project.org). We also used R for statistical analysis as indicated in the legends to the Figures or text. We determined the fate of the introduced DNA lesions, and the frequency with which each base of the reporter and mismatch regions was mutated, and to which base it was mutated (i.e., its mutational fate). We determined the mutagenic effect caused by the repair of each type of lesion (T/G, hmU/G, U/G, ab/G) located on the top or bottom strand from the pooled results of each lesion class irrespective of their location, number, or context within the mismatch region of a given vector as these variables did not materially affect the mutagenic effect of their repair (Figure 2—figure supplement 2).
References
-
A method for detecting abasic sites in living cells: Age-dependent changes in base excision repairProceedings of the National Academy of Sciences of the United States of America 97:686–691.https://doi.org/10.1073/pnas.97.2.686
-
Repair and genetic consequences of endogenous DNA base damage in mammalian cellsAnnual Review of Genetics 38:445–476.https://doi.org/10.1146/annurev.genet.38.072902.092448
-
Human cancers express a mutator phenotypeProceedings of the National Academy of Sciences of the United States of America 103:18238–18242.https://doi.org/10.1073/pnas.0607057103
-
Cytidine deamination of retroviral DNA by diverse APOBEC proteinsCurrent Biology: CB 14:1392–1396.https://doi.org/10.1016/j.cub.2004.06.057
-
Evidence for APOBEC3B mutagenesis in multiple human cancersNature Genetics 45:977–983.https://doi.org/10.1038/ng.2701
-
Single-strand break repair and genetic diseaseNature Reviews Genetics 9:619–631.https://doi.org/10.1038/nrg2380
-
Characterization of the interactome of the human MutL homologues MLH1, PMS1, and PMS2The Journal of Biological Chemistry 282:2976–2986.https://doi.org/10.1074/jbc.M609989200
-
Chromatin immunoprecipitation (ChIP)Cold Spring Harbor Protocols 9:.https://doi.org/10.1101/pdb.prot5279
-
DNA deamination in immunity: AID in the context of its APOBEC relativesAdvances in Immunology 94:37–73.https://doi.org/10.1016/S0065-2776(06)94002-4
-
The enigmatic thymine DNA glycosylaseDNA Repair 6:489–504.https://doi.org/10.1016/j.dnarep.2006.10.013
-
ATR and ATRIP: partners in checkpoint signalingScience 294:1713–1716.https://doi.org/10.1126/science.1065521
-
Unequal fidelity of leading strand and lagging strand DNA replication on the Escherichia coli chromosomeProceedings of the National Academy of Sciences of the United States of America 95:10020–10025.https://doi.org/10.1073/pnas.95.17.10020
-
ATR: a master conductor of cellular responses to DNA replication stressTrends in Biochemical Sciences 36:133–140.https://doi.org/10.1016/j.tibs.2010.09.005
-
SEAVIEW and PHYLO_WIN: two graphic tools for sequence alignment and molecular phylogenyComputer Applications in the Biosciences 12:543–548.
-
Transcription-coupled DNA repair: two decades of progress and surprisesNature Reviews Molecular Cell Biology 9:958–970.https://doi.org/10.1038/nrm2549
-
One role for DNA methylation in vertebrate cells is strand discrimination in mismatch repairProceedings of the National Academy of Sciences of the United States of America 82:7350–7354.https://doi.org/10.1073/pnas.82.21.7350
-
DNA mismatch repair: Dr. Jekyll and Mr. Hyde?Molecular Cell 47:665–666.https://doi.org/10.1016/j.molcel.2012.08.020
-
DNA mismatch repair: molecular mechanism, cancer, and ageingMechanisms of Ageing and Development 129:391–407.https://doi.org/10.1016/j.mad.2008.02.012
-
How important is DNA replication for mutagenesis?Molecular Biology and Evolution 17:929–937.https://doi.org/10.1093/oxfordjournals.molbev.a026373
-
Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolutionProceedings of the National Academy of Sciences of the United States of America 101:13994–14001.https://doi.org/10.1073/pnas.0404142101
-
Postreplicative mismatch repairCold Spring Harbor Perspectives in Biology 5:a012633.https://doi.org/10.1101/cshperspect.a012633
-
DNA Repair in mammalian cells: mismatched repair: variations on a themeCellular and Molecular Life Sciences: CMLS 66:1021–1038.https://doi.org/10.1007/s00018-009-8739-9
-
PCNA modifications for regulation of post-replication repair pathwaysMolecules and Cells 26:5–11.
-
Interactions of human mismatch repair proteins MutSalpha and MutLalpha with proteins of the ATR-Chk1 pathwayThe Journal of Biological Chemistry 285:5974–5982.https://doi.org/10.1074/jbc.M109.076109
-
Somatic hypermutation: subverted DNA repairCurrent Opinion in Immunology 18:243–248.https://doi.org/10.1016/j.coi.2006.03.007
-
Mechanisms in eukaryotic mismatch repairThe Journal of Biological Chemistry 281:30305–30309.https://doi.org/10.1074/jbc.R600022200
-
ATR signalling: more than meeting at the forkThe Biochemical Journal 436:527–536.https://doi.org/10.1042/BJ20102162
-
hMSH2 recruits ATR to DNA damage sites for activation during DNA damage-induced apoptosisThe Journal of Biological Chemistry 286:10411–10418.https://doi.org/10.1074/jbc.M110.210989
-
The biochemistry of somatic hypermutationAnnual Review of Immunology 26:481–511.https://doi.org/10.1146/annurev.immunol.26.021607.090236
-
Mammalian mismatch repair: error-free or error-prone?Trends in Biochemical Sciences 37:206–214.https://doi.org/10.1016/j.tibs.2012.03.001
-
PCNA function in the activation and strand direction of MutLalpha endonuclease in mismatch repairProceedings of the National Academy of Sciences of the United States of America 107:16066–16071.https://doi.org/10.1073/pnas.1010662107
-
APOBEC deaminases-mutases with defensive roles for immunityScience in China Series C, Life Sciences/Chinese Academy of Sciences 52:893–902.https://doi.org/10.1007/s11427-009-0133-1
-
Interferon regulatory factor 1 transactivates expression of human DNA polymerase eta in response to carcinogen N-methyl-N'-nitro-N-nitrosoguanidineThe Journal of Biological Chemistry 287:12622–12633.https://doi.org/10.1074/jbc.M111.313429
-
Base excision repair: the long and short of itCellular and Molecular Life Sciences 66:981–993.https://doi.org/10.1007/s00018-009-8736-z
-
Interference of mismatch and base excision repair during the processing of adjacent U/G mispairs may play a key role in somatic hypermutationProceedings of the National Academy of Sciences of the United States of America 106:5593–5598.https://doi.org/10.1073/pnas.0901726106
-
Immunoglobulin somatic hypermutationAnnual Review of Genetics 41:107–120.https://doi.org/10.1146/annurev.genet.41.110306.130340
-
The mutational spectrum of non-CpG DNA varies with CpG contentGenome Research 20:875–882.https://doi.org/10.1101/gr.103283.109
-
Cis-regulatory elements: molecular mechanisms and evolutionary processes underlying divergenceNature Reviews Genetics 13:59–69.https://doi.org/10.1038/nrg3095
Article and author information
Author details
Funding
Intramural Research Program of the NIH
- Anthony V Furano
The funder had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Ms Pritha Bhattacharyya for participating in some of the experiments, Dr Stephen J Elledge for providing the ATR-expressing plasmid, and Drs Jean-Claude Walser and Angelique Aschrafi for their participation in the preliminary implementation of the shuttle vector assay. We also thank Dr Michael Seidman for providing us the pS189 shuttle vector and are most appreciative of his sustained interest and illuminating discussions throughout the course of this work. We also thank Drs Peggy Hsieh and Deborah Hinton for their comments and suggestions regarding this paper. The following reagent was obtained through the NIH AIDS Research and Reference Reagent Program, Division of AIDS, NIAID, NIH: phAPOBEC3B-HA plasmid, Cat #, 11090 from Dr Bryan R Cullen. Financial Disclosure: This research was supported by the Intramural Research Program of the NIH, The National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Copyright
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Metrics
-
- 2,783
- views
-
- 257
- downloads
-
- 81
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Biochemistry and Chemical Biology
- Genetics and Genomics
Yerba mate (YM, Ilex paraguariensis) is an economically important crop marketed for the elaboration of mate, the third-most widely consumed caffeine-containing infusion worldwide. Here, we report the first genome assembly of this species, which has a total length of 1.06 Gb and contains 53,390 protein-coding genes. Comparative analyses revealed that the large YM genome size is partly due to a whole-genome duplication (Ip-α) during the early evolutionary history of Ilex, in addition to the hexaploidization event (γ) shared by core eudicots. Characterization of the genome allowed us to clone the genes encoding methyltransferase enzymes that catalyse multiple reactions required for caffeine production. To our surprise, this species has converged upon a different biochemical pathway compared to that of coffee and tea. In order to gain insight into the structural basis for the convergent enzyme activities, we obtained a crystal structure for the terminal enzyme in the pathway that forms caffeine. The structure reveals that convergent solutions have evolved for substrate positioning because different amino acid residues facilitate a different substrate orientation such that efficient methylation occurs in the independently evolved enzymes in YM and coffee. While our results show phylogenomic constraint limits the genes coopted for convergence of caffeine biosynthesis, the X-ray diffraction data suggest structural constraints are minimal for the convergent evolution of individual reactions.
-
- Biochemistry and Chemical Biology
- Structural Biology and Molecular Biophysics
The SARS-CoV-2 main protease (Mpro or Nsp5) is critical for production of viral proteins during infection and, like many viral proteases, also targets host proteins to subvert their cellular functions. Here, we show that the human tRNA methyltransferase TRMT1 is recognized and cleaved by SARS-CoV-2 Mpro. TRMT1 installs the N2,N2-dimethylguanosine (m2,2G) modification on mammalian tRNAs, which promotes cellular protein synthesis and redox homeostasis. We find that Mpro can cleave endogenous TRMT1 in human cell lysate, resulting in removal of the TRMT1 zinc finger domain. Evolutionary analysis shows the TRMT1 cleavage site is highly conserved in mammals, except in Muroidea, where TRMT1 is likely resistant to cleavage. TRMT1 proteolysis results in reduced tRNA binding and elimination of tRNA methyltransferase activity. We also determined the structure of an Mpro-TRMT1 peptide complex that shows how TRMT1 engages the Mpro active site in an uncommon substrate binding conformation. Finally, enzymology and molecular dynamics simulations indicate that kinetic discrimination occurs during a later step of Mpro-mediated proteolysis following substrate binding. Together, these data provide new insights into substrate recognition by SARS-CoV-2 Mpro that could help guide future antiviral therapeutic development and show how proteolysis of TRMT1 during SARS-CoV-2 infection impairs both TRMT1 tRNA binding and tRNA modification activity to disrupt host translation and potentially impact COVID-19 pathogenesis or phenotypes.