Abstract
5-Methylcytosine (m5C) is one of the major post-transcriptional modifications in mRNA and is highly involved in the pathogenesis of various diseases. However, the capacity of existing assays for accurately and comprehensively transcriptome-wide m5C mapping still needs improvement. Here, we develop a detection method named DRAM (deaminase and reader protein assisted RNA methylation analysis), in which deaminases (APOBEC1 and TadA-8e) are fused with m5C reader proteins (ALYREF and YBX1) to identify the m5C sites through deamination events neighboring the methylation sites. This antibody-free and bisulfite-free approach provides transcriptome-wide editing regions which are highly overlapped with the publicly available BS-seq datasets and allows for a more stable and comprehensive identification of the m5C loci. In addition, DRAM system even supports ultra-low input RNA (10ng) and monitors the dynamic accumulation of cellular m5C. We anticipate that the DRAM system could pave the way for uncovering further biological functions of m5C modifications.
Introduction
Epigenetics refers to stable inheritance without changing the basic sequence of DNA, involving various forms such as DNA methylation, histone modification and RNA modification. In recent years, RNA sequencing technology has boosted research on RNA epigenetics. More than 170 RNA modifications have been identified, mainly including m6A, m5C, m1A, m7G and others 1,2. Notably, RNA m5C methylation represents a crucial post-transcriptional modification observed across different RNA types, such as tRNA, mRNA, rRNA, vault RNA, microRNA, long non-coding RNA and enhancer RNA3–8. Numerous studies have revealed multiple molecular functions of m5C in numerous key stages of RNA metabolism, such as mRNA stability, translation, and nuclear export5,9–13. The dynamic alterations of m5C play integral roles in many physiological and pathological processes, such as early embryonic development14, neurodevelopmental disorder15,16 and multifarious tumorigenesis and migration17–20. Moreover, this modification significantly contributes to the regulation of gene expression5,9–13,17. Therefore, the detection of m5C sites appears to be essential for understanding their underlying effects on cellular function and disease states.
With the recent advances in sequencing techniques, several high-throughput assays have been developed for qualitative or quantitative analysis of m5C. To date, bisulfite-sequencing (BS-seq) has been proven to be the gold standard method for RNA m5C methylation analysis5,21,22. This approach chemically deaminates unmethylated cytosine to uracil, while keeping methylated cytosine unchanged. The m5C methylation sites can be identified by subsequent library construction and sequencing. However, bisulfite treatment of BS-seq is extremely detrimental to RNA, thus resulting in unstable detection of m5C in low abundance RNA or highly structured RNA, which directly affects the confidence of results23,24. Another major type of global m5C analysis depends on antibody-assisted immunoprecipitation of m5C methylated RNAs, such as m5C-RIP-seq25–27, AZA-IP-seq28 or miCLIP-seq7. These methods are unable to recognize methylation on mRNAs with low abundance and secondary structure. Moreover, these methods are highly dependent on antibody specificity, which usually leads to unspecific binding of RNA and a low amount of m5C-modified regions. Moreover, TAWO-seq, originally developed for the identification of hm5C, is also capable of m5C analysis, but it highly depends on the oxidation efficiency of perovskite, which usually causes false positives and unstable conversion 29,30. Furthermore, the emerging third-generation sequencing, such as Nanopore-seq, can directly map m5C by tracking the characteristic changes of bases, but it still faces challenges of a high error rate31–33. These together largely hamper its wide application on transcriptome profiling of m5C (Supplementary Table 1). Hence, there is an urgent need for a simple, efficient, sensitive, and antibody-independent method for global m5C detection.
The RNA-binding protein ALYREF is the initially recognized nuclear m5C reader that binds directly to m5C sites in mRNA and plays key roles in promoting mRNA nuclear export or tumor progression5. Another well-known m5C reader, YBX1, can also recognize m5C-modified mRNA through its cold-shock domain and participates in a variety of RNA-dependent events such as mRNA packaging, mRNA stabilization and translational regulation9,18. RNA affinity chromatography and mass spectrometry analyses using biotin-labelled oligonucleotides with or without m5C were performed in previous reports, which indicated that ALYREF and YBX1 had a more prominent binding ability to m5C-modified oligonucleotides5,18. YBX1 can preferentially recognize mRNAs with m5C modifications via key amino acids W65-N70 (WFNVRN)18, while K171 is essential for the specific binding of ALYREF to m5C sites 5. Previous studies have shown that mutations in key amino acids responsible for recognising m5C binding in ALYREF and YBX1 lead to a significant reduction in their binding levels to m5C-containing oligonucleotides5,18. Nucleic acid deaminases, primarily categorized as cytosine deaminases and adenine deaminases, are zinc-dependent enzymes which facilitate the deamination of cytosine or adenine within DNA or RNA substrates34. APOBEC1, an evolutionarily conserved family member of APOBEC proteins, can specifically catalyze the deamination of cytosine in single-stranded RNA (ssRNA) or DNA (ssDNA) to uracil35–37. TadA8e is an adenine deaminase optimized through re-engineering of TadA and it induces conversion of adenine to inosine (eventually read as guanine by transcriptases) in ssRNA or ssDNA38,39. APOBEC1 and TadA8e, with their prominent deamination efficiency, have been employed for the development of precise and efficient base editors such as CBE and ABE8e, which find widespread application in studies related to genome editing37,38.
Here we aim to establish a deaminase and m5C reader-assisted RNA methylation sequencing approach (DRAM-seq), which identifies the m5C sites through reader-mediated recognitions and deaminase-mediated point mutations neighboring the m5C methylation sites. This bisulfite-free and antibody-free method is anticipated to provide more comprehensive and cost-effective transcriptome-wide detection of m5C methylation, which may better assist on exploring its further regulatory mechanisms.
Results
Development of DRAM system for m5C detection
Our sequencing platform is inspired by the concept of the m6A DART-seq assay, in which C near the m6A site is converted into U without affecting sequences near non-m6A sites40. Therefore, we hypothesized that, by utilizing the targeted binding of m5C readers, deaminase can be recruited to achieve deamination of cytosine or adenine in the vicinity of the m5C sites on single-stranded RNA, thereby facilitating the detection of the m5C site. This approach was named DRAM (deaminase and m5C reader-assisted RNA methylation sequencing). As RNA-binding proteins, ALYREF and YBX1 also could bind to RNAs without m5C modification5,18. To exclude the false-positive detection of DRAM due to the non-m5C specific binding of ALYREF and YBX1, knockout of W65-N70 (WFNVRN) amino acids in YBX1and K171A mutation in ALYREF were introduced seperately, resulting in the DRAMmut system (Fig. S1A-S1D). DRAMmut exhibits substantially reduced binding to m5C in RNA pulldown assays (Fig. S1D-S1F). To confirm the recognition of m5C site by DRAM system, DRAM, DRAMmut and Deaminase system were transfected into the human HEK293T cells, respectively. Finally, we considered the presence of m5C modification in the vicinity only if the deamination changes produced under DRAM induction were significantly different from those produced under DRAMmut or Deaminase induction (Fig. 1A).
Previous studies have indicated that there is no uniform intrinsic signature motif sequence that can characterize all m5C sites5,26,41,42. To comprehensively detect the m5C loci, the readers of m5C (ALYREF and YBX1) were separately fused to the C-terminus of the deaminases (APOBEC1 and TadA-8e), namely DRAM-ABE and DRAM-CBE system (Fig.1B).
DRAM detection system is assayed in an m5C-dependent form
To confirm the recognition of m5C site by DRAM system, DRAM, DRAMmut and Deaminase were transfected into the human HEK293T cells, respectively. To evaluate candidate DRAM constructs within a cellular environment, we performed fluorescence microscopy to analyze the expression of DRAM. The results showed that DRAM-ABE and DRAM-CBE were properly expressed in HEK293T cells (Fig. S2A-S2B). In addition, flow cytometry displayed ∼60% of cells were GFP-positive (Fig. S2C). Two previously reported m5C sites in RPSA and AP5Z1 were selected for the analysis 5,21, and their methylation status was verified by bisulfite sequencing PCR. The deep sequencing results showed that the m[C fraction of RPSA and SZRD1 was 75.5% and 27.25%, respectively (Fig.2A and B). Sanger sequencing following RT-PCR was then performed to determine the editing of neighbouring m5C sites by DRAM system in these two mRNA. Notably, adenine close to the m5C site in RPSA mRNA was mutated into guanine, resulting in an A-to-G editing rate of 14.7% by DRAM-ABE, whereas this was rarely observed with TadA-8e or DRAMmut-ABE (Fig.2C). DRAM-CBE induced C to U editing in the vicinity of the m5C site in AP5Z1 mRNA, with 13.6% C-to-U editing, while this effect was significantly reduced with APOBEC1 or DRAMmut-CBE (Fig.2D). Subsequently, in order to investigate whether the DRAM system can detect other types of RNA, such as tRNA, 28S rRNA, or others, we performed PCR amplification of the flanking sequences of the m5C sites 3782 and 4447 on 28S rRNA and several m5C sites on tRNA, such as the m5C48 and m5C49 sites of tRNAVal, the m5C48 and m5C49 sites of tRNAAsp, and the m5C48 site of tRNALys. But Sanger sequencing showed that there was no valid A-to-G/C-to-U mutation detected, which is most likely due to the fact that ALYTEF and YBX1 are mainly responsible for the mRNA m5C binding proteins, and thus the DRAM system is more suitable for the mRNA m5C detection (Fig. S3). Taken together, the fusion of m5C reader and deaminase can effectively and selectively deaminate cytosine/adenine in the vicinity of the mRNA m5C sites.
NSUN243 and NSUN644, two family members of NOL1/NSUN protein, were both identified as m5C methyltransferase of mRNA45. To verify that the detection of DRAM occurs in the presence of m5C, we individually depleted NSUN2 and NSUN6 in HEK293T cells and performed DRAM transfection. The knockout efficiency has been confirmed by western blotting (Fig.2E, 2F and Fig. S4A,4B). It has been previously demonstrated that m5C methylation of AP5Z1 and RPSA is catalyzed by NSUN2 and NSUN6, respectively21,46. In line with this, sanger sequencing following RT-PCT showed a significant reduction in C-to-U or A-to-G mutations near the m5C sites in methyltransferase-deficient cells compared with WT cells (Fig. 2G and H). Overall, these findings suggest that the DRAM detection system is assayed in an m5C-dependent form.
DRAM enables transcriptome-wide analysis of m5C methylation
Subsequently, we performed RNA-seq analysis after DRAM transfection by detecting C-to-U/A-to-G editing events to accomplish transcriptome-wide detection of m5C (Fig.3A). To serve as positive controls, two previously published BS-seq datasets were also integrated5,21. Mutations were detected near the m5C site in RPSA as A-to-G by DRAM-ABE (Fig.3B), and DRAM-CBE detected the presence of C-to-U mutations near the AP5Z1 m5C site (Fig.3C). However, the DRAMmut and Deaminase systems induced few effective mutations close to these sites. Examination of multiple reported high-confidence RNA m5C sites showed that DRAM-seq editing events were also enriched in the vicinity of the BS-seq sites (Fig.3B, 3C and Fig. S5).
DRAM-seq analysis further confirmed that DRAM was detected in an m5C-dependent manner and that mutations in AP5Z1 and RPSA mRNA were reduced in methyltransferase knockout cells compared to wild-type cells (Fig. 3D, 3E). Moreover, the knockout cells exhibited overall rare DRAM-seq editing events close to m5C sites in other mRNAs (Fig. S6). Unfortunately however due to the inability of the DRAM detection system to achieve m5C detection with single base resolution, we did not find sequence preferences for the m5C sites responsible for catalysis by each of the two methyltransferases NSUN2 and NSUN6(Fig. S4D).
A comparison of three biological replicates from each experimental group revealed a strong reproducibility of A-to-G/C-to-U mutations in HEK293T cells expressing DRAM-ABE and DRAM-CBE (Fig. S7). Moreover, the integration and analysis of the experimental data revealed a high degree of overlap between the three biological replicates (Fig. S4C). And a recent study by Wang et al. showed that ALYREF deletion affects the expression of 94 mRNAs47, and only 55.32% of these ALYREF-regulated mRNAs can be detected by the DRAM system (Fig. S4E). These findings suggest that DRAM selectively targets specific RNAs for editing, exhibiting a high degree of consistency across samples.
To obtain information on a set of high-confidence DRAM-seq data, we filtered the list of sites transfected with deaminase alone and screened the sequencing results with methyltransferase depleted, pooled editing events occurring in at least 10% of reads across multiple samples to obtain a set of high-confidence editing sites (Fig. 3F and Supplementary Table 2), and integrated genes with editing sites occurring in DRAM-ABE and DRAM-CBE (Fig. 3F and Supplementary Table 3).
Previous studies have indicated that m5C sites are predominantly distributed in the coding sequences (CDS) and notably enriched near the initiation codon5,25,26,48–50. To further delineate the characteristics of the DRAM-seq data, we compared the distribution of DRAM-seq editing sites within the gene structure, specifically examining their occurrences in the 5’untranslated region (5’UTR), 3’ untranslated region (3’UTR), CDS and Intergenic/Intron region. Our analysis revealed that DRAM-seq editing events in cells expressing DRAM-ABE and DRAM-CBE were primarily located in the CDS and 3’UTR, indicating a non-random distribution of m5C (Fig.3G, Fig. S8A and 8B). Moreover, plotting the distribution of DRAM-seq editing sites in mRNA segments (5’UTR, CDS, and 3’UTR) highlighted a significant enrichment in the CDS (Fig.3H). In contrast, cells expressing the deaminase exhibited a distinct distribution pattern of editing sites, characterized by a prevalence throughout the 3’UTR (Fig.3H). This finding reaffirms that the specific editing pattern observed in DRAM-seq across the transcriptome depends on its capacity to bind m5C.
Comparative analysis of the DRAM-seq editing sites with the previously published BS-seq m5C sites indicated that the likelihood of editing was notably higher in closer proximity to the m5C sites (Fig.3I). Furthermore, the editing window of DRAM exhibited enrichment approximately 20bp before and after the m5C site (Fig.3I). Investigation into the sequences surrounding the editing window revealed that AC motifs were the most significantly enriched in DRAM-CBE, whereas (U/C) A motifs were most notably enriched in DRAM-ABE. In contrast, the APOBEC1 and TadA-8e samples displayed no significantly enriched motifs, with mutations being more randomly orientated (Fig.3J, 3K).
DRAM-seq provides stable and comprehensive identification of m5C loci
Subsequently, we then evaluated the ability of DRAM-seq to detect m5C across the entire transcriptome and compared its performance to that of the previously reported BS-seq. Although both previous studies employed bisulfite treatment, the resulting data obtained significant discrepancies due to variations in their treatment and analysis methodologies. Our results indicated that DRAM-seq identified the presence of m5C modifications covering 79.6% of the genes detected by Yang et al.5 and 91.9% of the genes detected by Zhang et al.21 (Fig.4A and C). Remarkably, certain pivotal regulators with diverse biological functions, such as ATG16L1(coordinats autophagy pathway)51 and ARHGEF25 (plays an important role in actin cytoskeleton reorganisation)52, were identified by Zhang et al. and DRAM-seq, but not by Yang et al. (Fig.4B). Conversely, FANCD2 (Maintains chromosome stability)53 and RPL15(components of the large ribosomal subunit)54,55, were discovered by Yang et al. and DRAM-seq, but not by Zhang et al. (Fig.4D). Hence, DRAM-seq appears to offer a more stable and comprehensive identification of the m5C loci.
To provide functional insights into m5C RNA-modified genes in HEK293T cells, we conducted Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses. These results highlighted the involvement of these genes in the regulation of diverse key biological processes, such as cell division, cell cycle, mRNA splicing, protein processing in the endoplasmic reticulum, nucleocytoplasmic transport, translation, DNA repair and others (Fig.4E, 4F, Fig. S8C and S8D).
DRAM monitors m5C dynamics in cellular RNA
Next, we aim to determine whether changes in m5C can be detected in real time by DRAM. Previous studies have established that oxidative stress induced by 200 μM sodium arsenite (NaAsO2) treatment inhibited the expression of NSUN2 and NSUN6, resulting in a gradual decrease in m5C methylation56,57. As cells exposed to NaAsO2 typically exhibited slowed cell cycle progression and NaAsO2-induced apoptosis, we performed a low concentration screen58,59. After gradient treatment of NaAsO2 (50μM, 100μM, 150μM, and 200μM, separately) in cells, 200μM group was identified as the optimal concentration, leading to highly significant decreases of NSUN2 and NSUN6 expression (Fig.5A). Subsequently, HEK293T cells expressing DRAM with NaAsO2 for 3 hours revealed a decreasing trend in the mRNA expression of NSUN2 and NSUN6 (Fig.5B). Bisulfite sequencing PCR also verified a decreasing trend in m5C levels in selected mRNAs after NaAsO2 treatment (Fig. 5C and D).
Next, we performed DRAM system assays, and Sanger sequencing results showed that induction of the DRAM system after NaAsO2 treatment resulted in a highly significant reduction of A-to-G/C-to-U mutations on both RPSA and AP5Z1 mRNAs compared to cells without NaAsO2 treatment (Fig.5E and F). Taken together, with our previous evidence that the methyltransferase-deleted cells exhibited overall rare DRAM editing events close to m5C sites, we suggest that the DRAM could effectively monitor the accumulation of m5C in individual RNAs in response to the alteration of cellular conditions.
DRAM enables low-input m5C profiling
A significant challenge in m5C detection lies in the specificity of antibodies and the substantial amount of input RNA required for sequencing. RNA is susceptible to degradation during denaturation, sodium bisulfite treatment and desulfurization steps in the BS-seq assay60. Immunoprecipitation-based m5C assays and LC-MS/MS also impose high demand for sample input7,25,61. Several experiments have highlighted the requirement of 100-500 ng of RNA for m5C-RIP-seq, while BS-seq necessitates an even more demanding 750-1000 ng of RNA21,25,62. To assess the detection limits of DRAM-Sanger, we attempted to amplify two representative m5C-containing sites in the RPSA and AP5Z1 transcripts from diluted RNA samples.
Remarkably, we successfully generated PCR products of these two mRNAs from cDNAs corresponding to 250 ng, 50 ng, and 10 ng of total RNA. Quantitative analysis by Sanger sequencing demonstrated nearly identical Sanger traces across these dilutions (Fig.5G and H). This finding underscores that the specificity of DRAM editing depended on its ability to bind m5C, and DRAM is proficient in low-input m5C analyses. Furthermore, cell viability was determined by CCK8 assay on HEK293T cells transfected with DRAM (Fig.5I). Importantly, there was no significant difference in the relative proliferative capacity of the cells compared to untransfected cells (NC), indicating that DRAM expression did not adversely affect cell viability (Fig.5J).
Transfection of the DRAM system in cells results in the transient overexpression of fusion proteins. To investigate how varying expression levels of these proteins influence A-to-G and C-to-U editing within the same m5C region, we conducted a gradient transfection using plasmid concentrations of 1500 ng, 750 ng and 300 ng. This approach allowed us to progressively reduce the expression levels of the fusion proteins (Fig. S9A and S9B). Sanger sequencing revealed that the editing efficiency of A-to-G and C-to-U within the m5C region significantly decreased as fusion protein expression diminished (Fig. S9C and S9D). These findings suggest that the transfection efficiency of the DRAM system is concentration-dependent and that the ratio of editing efficiency to transfection efficiency may assist in the quantitative analysis of m5C using the DRAM system.
Discussion
In recent years, m5C methylation modifications have received increasing attention, with multiple reports detailing the distribution of RNA m5C methylation modifications across various species and tissues, elucidating their characteristics. Despite the relatively low abundance of m5C, its highly dynamic changes hold significant implications for the regulation of physiological and pathological processes5,21,44. However, due to the limitations of sequencing methods and the variability of data processing, there remains ample room for progress in the study of m5C detection methods.
In this study, we developed a site-specific, depth-sequencing-free m5C detection method using DRAM-Sanger. This workflow relies on conventional molecular biology assays such as RT-PCR and Sanger sequencing, eliminating the need for specialized techniques and thereby simplifying the process of m5C detection.
DRAM-seq introduces a novel strategy for transcriptome-wide m5C detection, overcoming inherent limitations in existing methods. Notably, DRAM-seq covered around 80% of the high-confidence m5C-modified genes detected by BS-seq and identified more potential m5C sites. This can be attributed to the avoidance of bisulfite treatment by DRAM-seq, preventing RNA damage and ensuring a more comprehensive representation of RNA samples. This feature also likely contributes to the observed stability of DRAM-seq in comparison to BS-seq. Additionally, DRAM-seq is not limited by antibody specificity and is resistant to chemical-induced damage.
A prominent challenge in existing m5C profiling methods is their reliance on substantial amounts of input RNA samples. In contrast, DRAM operates through the deamination activity of deamination activity of deaminase, preserving RNA integrity and preventing degradation. The notable advantage of DRAM lies in its capacity for low-input m5C detection. Our analysis demonstrates that DRAM requires as low as 10ng of total RNA for m5C detection. While DRAM is currently well-suited for detecting m5C on a transcriptome-wide scale, the potential for future applications involving third-generation sequencing could extend its utility to individual mRNAs, particularly m5C heterogeneity on mRNA splicing variants. In addition, the DRAM system depends on the specific recognition of m5C modifications on ssRNA by the reader protein, theoretically avoiding the false-positive effects of 5-hydroxymethylation modifications in other assays, such as BS-seq21–23. This potential feature could enhance the accuracy of the DRAM assay, albeit it still requires careful validation.
In our study, m5C detection was performed following the transient transfection of the DRAM detection system into mammalian cells, which might result in a lower mutation rate at the corresponding site. Therefore, employing lentiviral-mediated transfection into cell lines of interest could potentially enhance the efficiency of m5C detection. In the present study, the editing efficiency of the DRAM system, which is believed to detect dynamic changes in m5C, was significantly reduced following sodium arsenite treatment. However, sodium arsenite can adversely affect all proteins containing thiol groups, impacting not only NSUN proteins but potentially also the function of reader proteins and deaminases. As a result, this validation cannot entirely rule out the possibility that the observed decrease in editing frequency is due to the inactivation of fusion proteins by sodium arsenite. The optimization of this validation may require the development of more specific m5C inhibitors in the future. Further elucidation of the key amino acids directing ALYREF and YBX1’s binding to m5C methylation sites should enable more accurate and sensitive m5C detection by DRAM-seq. Due to the lack of a fixed base composition for characterizing all m5C modification sites, DRAM has an apparent limitation in achieving single-base resolution for detecting m5C. However, our present study proved that the measuring resolution of DRAM is around 40nt, which facilitates higher precision than that of m5C-RIP-seq (∼100nt). In the future, with more in-depth analyses of m5C reader structures and the identification of new potential m5C readers, we expect to achieve more precise m5C localization and more comprehensive m5C modification detection. Moreover, the substitution of deaminases, such as A3A and A3G (the family members of APOBEC), could also potentially enhance the efficiency of the DRAM detection 63–65.
Following the purification of DRAM fusion protein, achieving in vitro detection of RNA m5C methylation could broaden the applicability to a more diverse range of samples. Another potential application for DRAM-seq could be the expression of drug-inducible DRAM systems in vivo using various animal models for m5C analysis. These will together provide novel insights into m5C modifications for biological and clinical research.
Conclusions
In summary, we developed a novel deaminase and reader protein-assisted RNA m5C methylation approach that detects the m5C region by deaminating As or Cs in close proximity to the m5C sites, which does not rely on antibodies or bisulfite, thus leading to unprecedently comprehensive transcriptome-wide RNA m5C methylation profiling. We anticipated that this system could pave the way for uncovering further biological functions of m5C modifications and facilitate the development of therapeutic interventions for associated diseases.
Materials and methods
Plasmid construction
ALYREF and YBX1 expression plasmids were purchased from MIAOLING BIOLOGY (http://www.miaolingbio.com/), and the ALYREF and YBX1 fractions were then amplified using specific primer. The ALYREF and YBX1 portions were amplified using pCMV-APOBEC1-YTH (Addgene plasmid no. 131636; https://www.addgene.org/131636/) and ABE8e (Addgene plasmid no. 138489; https://www.addgene.org/ 138489/) to amplify the deaminase portion and the essential plasmid construct proxies, and finally the fragments were recombined by the ClonExpress Ultra One Step Cloning Kit to complete the plasmid vector construction. Both DRAMmut-ABE and DRAMmut-CBE related vectors were obtained by introducing the corresponding key amino acid mutations using Fast Site-Directed Mutagenesis Kit (TIANGEN Biotech). The primer sequences used are listed in Supplementary Table 4.
Cell culture and plasmid transfection
HEK293T cell line (ATCC) was cultured in Dulbecco’s Modified Eagle Medium (DMEM) supplemented with 10% fetal bovine serum (CLARK BIOSCIENCE) and 1% penicillin (100 U/ml)-streptomycin (100μg/ml). The cells were seeded in 12-well plates and transfected using Hieff Trans™ Liposomal Transfection Reagent (Yeasen).
NSUN2-depleted cell lines were generated by cloning NSUN2-targeting single guide RNA sequences into the pSpCas9(BB)−2A-Puro (PX459) V2.0 plasmid (Addgene plasmid no. 62988; http://n2t.net/addgene:62988). Plasmids were then transfected into HEK293T cells and Puromycin (Meilunbio) was added at a final concentration of 3 μg/ml to enrich the positively transfected cells 24 h after transfection. After 72 h, the cells were collected and used for genotyping by Sanger sequencing. NSUN6-depleted cell lines were generated in the same way. The primers used for genotyping and single guide RNA sequences are listed in Supplemental Table 4.
Cell viability measurements
HEK293T cells were transfected with DRAM plasmid and cultured at 37°C for 24 h. Subsequently, 1000 cells were seeded in 96-well plates. After waiting for the cells to attach to the wall, the cell activity was detected by Cell Counting Kit-8 (Meilunbio). Cell Counting Kit-8 contains WST-8, which in the presence of the electronically coupled reagent 1-Methoxy PMS can be reduced by mitochondrial dehydrogenase to the orange-colored metazan product Formazan, the absorbance of which is measured at 450 nm to analyze cellular activity.
Western blotting
For protein blotting, samples were lysed in RIPA Lysis Buffer (Meilunbio) with Phenylmethanesulfonyl fluoride (PMSF) and the BCA protein assay kit (Beyotime Biotechnology) was used to Protein concentration was measured. Total protein extracts were separated by SDS-PAGE on a 10% gel and then transferred to 0.22 nm polyvinylidene fluoride membranes (Boster). Subsequently, the proteins were probed with specific antibodies after the blot was blocked with 5% non-fat milk (Boster). Images were quantified using ImageJ software and all data are expressed as mean ± SEM.
The following antibodies and concentrations were used: NSUN2 Polyclonal antibody (Proteintech; Cat No.20854-1-AP; 1:7500), NSUN6 Polyclonal antibody (Proteintech; Cat No. 17240-1-AP; 1:2000), RabbitAnti-GAPDH antibody (Bioss; bs-41373R; 1:2000), Alpha Tubulin Polyclonal antibody (Proteintech; Cat No. 11224-1-AP; 1:2000), HRP-labeled Goat Anti-Rabbit IgG(H+L) (Beyotime Biotechnology; A0208; 1:2000).
cDNA synthesis and Sanger sequencing
Total cellular RNA was extracted with TRIzol reagent (TIANGEN Biotech) and cDNA was synthesized using PrimeScript™ II 1st Strand cDNA Synthesis Kit (Takara Bio) according to the manufacturer’s recommendations. PCR was then performed using 2×Taq PCR MasterMix II (TIANGEN Biotech) and primers flanking m5C target sites, and the purified PCR products were directly sequenced by Sanger sequencing. The Sanger sequencing results were analyzed using EditR 1.0.10 to calculate the mutation frequency66. The primers used in this study are shown in Supplemental Table 4.
Real-time quantitative PCR
cDNA was synthesized using FastKing RT kit (with gDNase) (TIANGEN Biotech) according to the manufacturer’s recommendations. RT-qPCR assay was performed using SuperReal PreMix Plus (SYBR Green) (TIANGEN Biotech). GAPDH was used as an endogenous control, and the expression levels were normalized to the control and calculated by the 2-ΔΔCt formula. All samples were analyzed in triplicate and each mRNA quantification represents the average of at least three measurements. All data are expressed as mean ± SEM. The primers used in this study are shown in Supplemental Table 4.
Protein structure modelling
Protein structure simulations were performed using the SWISS-MODEL online website (https://swissmodel.expasy.org/interactive)67. The SWISS-MODEL database is able to provide up-to-date annotated 3D protein models, which are generated from automated homology modelling of related model organisms and experimental structural information for all sequences in UniProtKB, with reliable structural information, and subsequently protein structure observations were performed using PyMOL68.
Bisulfite sequencing PCR
We referenced bisulfite sequencing PCR, an assay established by Matthias Schaefer et al. We chemically deaminated cytosine in RNA using the EZ RNA methylation kit (50) (ZYMO RESEARCH) and then quantified m5C methylation levels based on PCR amplification of cDNA combined with deep sequencing23.
RNA Conversion Reagent was premixed with prepared RNA samples, and the RNA was denatured at 70°C for 5 minutes, followed by a reaction period of 45 minutes at 54°C. Finally, the purified RNA samples were recovered after desulfurization by RNA Desulphonation Buffe. cDNA was synthesized using PrimeScript™ II 1st Strand cDNA Synthesis Kit (Takara Bio) according to the manufacturer’s recommendations. PCR was then performed using 2× EpiArt HS Taq Master Mix (Dye Plus) (Vazyme) and m5C target site-specific Bisulfite Primer (primer sequences were designed at https://zymoresearch.eu/pages/bisulfite-primer-seeker), the products were purified by TIANgel Midi Purification Kit (TIANGEN Biotech), and the connectors for second-generation sequencing were attached at both ends of the products for sequencing. Finally, deep sequencing was performed by HiTOM analysis to detect the methylation level (The number of reads >1000 in deep sequencing) 69. The primers used in this study are shown in Supplemental Table 4.
Library construction and next-generation sequencing
1μg of total cellular RNA was used for sequencing library generation by NEBNext Ultra RNA Library Prep Kit for Illumina (NEB, USA, Catalog #: E7530L) following manufacturer’s recommendations and index codes were added to attribute sequences to each sample. Briefly, mRNA was purified from total RNA using poly-T oligo-attached magnetic beads. Fragmentation was carried out using divalent cations under elevated temperature in NEB Next First Strand Synthesis Reaction Buffer(5X). First-strand cDNA was synthesized using random hexamer primer and M-MuLV Reverse Transcriptase (RNase H). Second-strand cDNA synthesis was subsequently performed using DNA Polymerase I and RNase H. Remaining overhangs was converted into blunt ends via exonuclease/polymerase activities. After adenylation of 3’ ends of DNA fragments, NEB Next Adaptor with hairpin loop structure was ligated to prepare for hybridization. To select cDNA fragments of preferentially 370∼420 bp in length, the library fragments were purified with AMPure XP system (Beverly, USA). Then 3 µL USER Enzyme (NEB, USA) was used with size selected, adaptor-ligated cDNA at 37°C for 15 min followed by 5 min at 95 °C before PCR. Then PCR was performed with Phusion High-Fidelity DNA polymerase, Universal PCR primers and Index (X) Primer. At last, PCR products were purified (AMPure XP system) and library quality was assessed on the Agilent 5400 system(Agilent, USA)and quantified by QPCR (library concentration ≥ 1.5 nM). The qualified libraries were pooled and sequenced on Illumina platforms with PE150 strategy in Novogene Bioinformatics Technology Co., Ltd (Beijing, China), according to effective library concentration and data amount required.
DRAM-seq analysis and calling of edited sites
The raw fastq sequencing data were cleaned by trimming the adapter sequences using Fastp (v0.23.1) and were aligned to the human genome (hg19) using STAR (v2.7.7) in paired-end mode. The aligned BAM files were sorted and PCR duplicates were removed using Samtools (v1.12). The cite calling pf DRAM-seq was performed using Bullseye, a previously customized pipeline to look for C-to-U or A-to-G edited sites throughout the transcriptome40. Briefly, the sorted and deduplicated BAM files were initially parsed by parseBAM.pl script.
Then, Find_edit_site.pl script was employed to find C-to-U or A-to-G editing events by DRAM-seq with at least 10 reads of coverage, an edit ratio of 5%-95%, an edit ratio at least 1.5-fold higher than NSUN2 or NSUN6-knockout samples, and at least 2 editing events at a given site. Sites that were only found in one replicate of each DRAM protein variant were removed. Editing events appeared in cells expressing merely APOBEC1 or TadA8e were also removed. For high confidence filtering, we further adjusted the Find_edit_site.pl parameters to the edit ratio of 10%-60%, an edit ratio of control samples at least 2-fold higher than NSUN2 or NSUN6-knockout samples, and at least 4 editing events at a given site.
Metagene and motif analyses
Metagene analysis was performed using hg19 annotations according to previously reported tool, MetaplotR70. For motif analysis, the 20bp flanking sequence of each DRAM-seq editing site was extracted by Bedtools (v2.30.0)71. The motif logos were then plotted by WebLogo (v3.7.12)72.
Replicates analysis
Independent biological replicates of DRAM-ABE or DRAM-CBE in DRAM-seq analysis were separately compared by computing the Pearson correlation coefficient between the number of C-to-U mutations per mRNA between any two replicate experiments.
GO and KEGG analysis
GO and KEGG analysis of DRAM-seq edited mRNAs was performed using the DAVID bioinformatic database 73. GO terms with a P value of less than 0.05 were considered statistically significant.
RNA pulldown assays
RNAs containing C or m5C residues and proteins expressing DRAM or DRAMmut were prepared in advance, processed using the Pierce™ Magnetic RNA-Protein Pull-Down Kit (Thermo) and finally analysed by Western blotting.
Statistical analysis
All data are expressed as mean ± S.E.M of three independent determinations. Data were analyzed through a two-tailed t-test. A probability of P < 0.05 was considered statistically significant; [, P < 0.05, [[, P < 0.01, *, P < 0.05, **, P <0.01, ***, P < 0.001 and ****, P < 0.0001 denote the significance thresholds; ns denotes not significant.
Data and Materials Availability
The data supporting the findings of this study are available within the article and its Supplementary Information. Other data and reagents are available from the corresponding authors upon reasonable request.
Additional information
Author contributions
Conceptualization: JZ, YH, LL, ZL
Methodology: JZ, DZ, JL
Investigation: JZ, DZ, JL, DK, XL, RZ, YL
Visualization: XG, YQ, DW, JC
Supervision: DK, XL, RZ, YL, XG, YQ, DW, JC, YH
Funding acquisition: YH, LL, ZL
Data curation: JZ, YH
Writing—original draft: JZ, DZ, JL
Writing—review & editing: JZ, YH, LL, ZL
Competing Interests
All other authors declare they have no competing interests.
Funding
This work was supported by the National Natural Science Foundation of China (Nos.32200466).
Acknowledgements
We thank Yuning Song, Yuanyuan Xu and Tingting Sui for critical feedbacks on the work and manuscript.
References
- 1The epitranscriptome beyond m(6)ANat Rev Genet 22:119–131https://doi.org/10.1038/s41576-020-00295-8
- 2The pivotal regulatory landscape of RNA modificationsAnnu Rev Genomics Hum Genet 15:127–150https://doi.org/10.1146/annurev-genom-090413-025405
- 3NSUN2 introduces 5-methylcytosines in mammalian mitochondrial tRNAsNucleic Acids Res 47:8720–8733https://doi.org/10.1093/nar/gkz559
- 4Neuronal Nsun2 deficiency produces tRNA epitranscriptomic alterations and proteomic shifts impacting synaptic signaling and behaviorNat Commun 12https://doi.org/10.1038/s41467-021-24969-x
- 55-methylcytosine promotes mRNA export -NSUN2 as the methyltransferase and ALYREF as an m(5)C readerCell Res 27:606–625https://doi.org/10.1038/cr.2017.55
- 6Yeast Nop2 and Rcm1 methylate C2870 and C2278 of the 25S rRNA, respectivelyNucleic Acids Res 41:9062–9076https://doi.org/10.1093/nar/gkt679
- 7NSun2-mediated cytosine-5 methylation of vault noncoding RNA determines its processing into regulatory small RNAsCell Rep 4:255–261https://doi.org/10.1016/j.celrep.2013.06.029
- 8Deposition of 5-Methylcytosine on Enhancer RNAs Enables the Coactivator Function of PGC-1αCell Rep 14:479–492https://doi.org/10.1016/j.celrep.2015.12.043
- 9RNA 5-Methylcytosine Facilitates the Maternal-to-Zygotic Transition by Preventing Maternal mRNA DecayMol Cell 75:1188–1202https://doi.org/10.1016/j.molcel.2019.06.033
- 10Post-transcriptional regulation by cytosine-5 methylation of RNABiochim Biophys Acta Gene Regul Mech 1862:240–252https://doi.org/10.1016/j.bbagrm.2018.12.003
- 11The dynamic RNA modification 5-methylcytosine and its emerging role as an epitranscriptomic markWiley Interdiscip Rev RNA 10https://doi.org/10.1002/wrna.1510
- 12The emerging role of RNA modifications in the regulation of mRNA stabilityExp Mol Med 52:400–408https://doi.org/10.1038/s12276-020-0407-z
- 13Epitranscriptomic Addition of m(5)C to HIV-1 Transcripts Regulates Viral Gene ExpressionCell Host Microbe 26:217–227https://doi.org/10.1016/j.chom.2019.07.005
- 14Developmental mRNA m(5)C landscape and regulatory innovations of massive m(5)C modification of maternal mRNAs in animalsNat Commun 13https://doi.org/10.1038/s41467-022-30210-0
- 15Expression of the RNA methyltransferase Nsun5 is essential for developing cerebral cortexMol Brain 12https://doi.org/10.1186/s13041-019-0496-6
- 16Cytosine-5 RNA Methylation Regulates Neural Stem Cell Differentiation and MotilityStem Cell Reports 8:112–124https://doi.org/10.1016/j.stemcr.2016.11.014
- 17RNA bisulfite sequencing reveals NSUN2-mediated suppression of epithelial differentiation in pancreatic cancerOncogene 41:3162–3176https://doi.org/10.1038/s41388-022-02325-7
- 185-methylcytosine promotes pathogenesis of bladder cancer through stabilizing mRNAsNat Cell Biol 21:978–990https://doi.org/10.1038/s41556-019-0361-y
- 19The RNA methyltransferase NSUN6 suppresses pancreatic cancer development by regulating cell proliferationEBioMedicine 63https://doi.org/10.1016/j.ebiom.2020.103195
- 20Long noncoding RNA DIAPH2-AS1 promotes neural invasion of gastric cancer via stabilizing NSUN2 to enhance the m5C modification of NTN1Cell Death Dis 14https://doi.org/10.1038/s41419-023-05781-5
- 21Genome-wide identification of mRNA 5-methylcytosine in mammalsNat Struct Mol Biol 26:380–388https://doi.org/10.1038/s41594-019-0218-x
- 22Bisulfite Sequencing of RNA for Transcriptome-Wide Detection of 5-MethylcytosineMethods Mol Biol 1870:1–21https://doi.org/10.1007/978-1-4939-8808-2_1
- 23RNA cytosine methylation analysis by bisulfite sequencingNucleic Acids Res 37https://doi.org/10.1093/nar/gkn954
- 24Detection of 5-Methylcytosine in Specific Poly(A) RNAs by Bisulfite SequencingMethods Mol Biol 1562:107–121https://doi.org/10.1007/978-1-4939-6807-7_8
- 255-Methylcytosine RNA Methylation in Arabidopsis ThalianaMol Plant 10:1387–1399https://doi.org/10.1016/j.molp.2017.09.013
- 26Transcriptome-wide mapping of 5-methylcytidine RNA modifications in bacteria, archaea, and yeast reveals m5C within archaeal mRNAsPLoS Genet 9https://doi.org/10.1371/journal.pgen.1003602
- 27Methylated RNA Immunoprecipitation Assay to Study m5C Modification in ArabidopsisJ Vis Exp https://doi.org/10.3791/61231
- 28Identification of direct targets and modified bases of RNA cytosine methyltransferasesNat Biotechnol 31:458–464https://doi.org/10.1038/nbt.2566
- 29Bisulfite-free and base-resolution analysis of 5-methylcytidine and 5-hydroxymethylcytidine in RNA with peroxotungstateChem Commun (Camb 55:2328–2331https://doi.org/10.1039/c9cc00274j
- 30Base-Resolution Analysis of 5-Hydroxymethylcytosine by One-Pot Bisulfite-Free Chemical Conversion with PeroxotungstateJ Am Chem Soc 138:14178–14181https://doi.org/10.1021/jacs.6b06428
- 31Direct sequencing of RNA with MinION Nanopore: detecting mutations based on associationsNucleic Acids Res 47https://doi.org/10.1093/nar/gkz907
- 32Transcriptional and epi-transcriptional dynamics of SARS-CoV-2 during cellular infectionCell Rep 35https://doi.org/10.1016/j.celrep.2021.109108
- 33Beyond sequencing: machine learning algorithms extract biology hidden in Nanopore signal dataTrends Genet 38:246–257https://doi.org/10.1016/j.tig.2021.09.001
- 34Engineered deaminases as a key component of DNA and RNA editing toolsMol Ther Nucleic Acids 34https://doi.org/10.1016/j.omtn.2023.102062
- 35Transcriptome-wide sequencing reveals numerous APOBEC1 mRNA-editing targets in transcript 3’ UTRsNat Struct Mol Biol 18:230–236https://doi.org/10.1038/nsmb.1975
- 36Low expression of the apolipoprotein B mRNA-editing transgene in mice reduces LDL levels but does not cause liver dysplasia or tumorsArterioscler Thromb Vasc Biol 18:1013–1020https://doi.org/10.1161/01.atv.18.6.1013
- 37Functions and Malfunctions of Mammalian DNA-Cytosine DeaminasesChem Rev 116:12688–12710https://doi.org/10.1021/acs.chemrev.6b00296
- 38Phage-assisted evolution of an adenine base editor with improved Cas domain compatibility and activityNat Biotechnol 38:883–891https://doi.org/10.1038/s41587-020-0453-z
- 39Directed evolution of adenine base editors with increased activity and therapeutic applicationNat Biotechnol 38:892–900https://doi.org/10.1038/s41587-020-0491-6
- 40DART-seq: an antibody-free method for global m(6)A detectionNat Methods 16:1275–1280https://doi.org/10.1038/s41592-019-0570-0
- 41Effects of NSUN2 deficiency on the mRNA 5-methylcytosine modification and gene expression profile in HEK293 cellsEpigenomics 11:439–453https://doi.org/10.2217/epi-2018-0169
- 42Overview of distinct 5-methylcytosine profiles of messenger RNA in human hepatocellular carcinoma and paired adjacent non-tumor tissuesJ Transl Med 18https://doi.org/10.1186/s12967-020-02417-6
- 43The role of RNA m(5)C modification in cancer metastasisInt J Biol Sci 17:3369–3380https://doi.org/10.7150/ijbs.61439
- 44Sequence-and structure-selective mRNA m(5)C methylation by NSUN6 in animalsNatl Sci Rev 8https://doi.org/10.1093/nsr/nwaa273
- 45Multiple links between 5-methylcytosine content of mRNA and translationBMC Biol 18https://doi.org/10.1186/s12915-020-00769-5
- 46CIGAR-seq, a CRISPR/Cas-based method for unbiased screening of novel mRNA modification regulatorsMol Syst Biol 16https://doi.org/10.15252/msb.202010025
- 47m(5)C-dependent cross-regulation between nuclear reader ALYREF and writer NSUN2 promotes urothelial bladder cancer malignancy through facilitating RABL6/TK1 mRNAs splicing and stabilizationCell Death Dis 14https://doi.org/10.1038/s41419-023-05661-y
- 48OsNSUN2-Mediated 5-Methylcytosine mRNA Modification Enhances Rice Adaptation to High TemperatureDev Cell 53:272–286https://doi.org/10.1016/j.devcel.2020.03.009
- 49Widespread occurrence of 5-methylcytosine in human coding and non-coding RNANucleic Acids Res 40:5023–5033https://doi.org/10.1093/nar/gks144
- 50RNA 5-Methylcytosine Controls Plant Development and Environmental AdaptationTrends Plant Sci 25:954–958https://doi.org/10.1016/j.tplants.2020.07.004
- 51The WD40 domain of ATG16L1 is required for its non-canonical role in lipidation of LC3 at single membranesEMBO J 37https://doi.org/10.15252/embj.201797840
- 52A Rac/Cdc42-specific exchange factor, GEFT, induces cell proliferation, transformation, and migrationJ Biol Chem 278:13207–13215https://doi.org/10.1074/jbc.M208896200
- 53The FANC pathway and BLM collaborate during mitosis to prevent micro-nucleation and chromosome abnormalitiesNat Cell Biol 11:761–768https://doi.org/10.1038/ncb1883
- 54Structures of the human and Drosophila 80S ribosomeNature 497:80–85https://doi.org/10.1038/nature12104
- 55Structural snapshots of human pre-60S ribosomal particles before and after nuclear exportNat Commun 11https://doi.org/10.1038/s41467-020-17237-x
- 56Cytosine-5 RNA methylation links protein synthesis to cell metabolismPLoS Biol 17https://doi.org/10.1371/journal.pbio.3000297
- 57SIRT1/PGC-1α is involved in arsenic-induced male reproductive damage through mitochondrial dysfunction, which is blocked by the antioxidative effect of zincEnviron Pollut 320https://doi.org/10.1016/j.envpol.2023.121084
- 58Arsenite delays progression through each cell cycle phase and induces apoptosis following G2/M arrest in U937 myeloid leukemia cellsJ Pharmacol Exp Ther 313:877–887https://doi.org/10.1124/jpet.104.080713
- 59NaAsO(2) decreases GSH synthesis by inhibiting GCLC and induces apoptosis through Hela cell mitochondrial damage, mediating the activation of the NF-κB/miR-21 signaling pathwayEcotoxicol Environ Saf 234https://doi.org/10.1016/j.ecoenv.2022.113380
- 60RNA 5-Methylcytosine Analysis by Bisulfite SequencingMethods Enzymol 560:297–329https://doi.org/10.1016/bs.mie.2015.03.007
- 61Eukaryotic rRNA Modification by Yeast 5-Methylcytosine-Methyltransferases and Human Proliferation-Associated Antigen p120PLoS ONE 10https://doi.org/10.1371/journal.pone.0133321
- 62Transcriptome-Wide Mapping 5-Methylcytosine by m(5)C RNA Immunoprecipitation Followed by Deep Sequencing in PlantMethods Mol Biol 1933:389–394https://doi.org/10.1007/978-1-4939-9045-0_24
- 63Unraveling the Enzyme-Substrate Properties for APOBEC3A-Mediated RNA EditingJ Mol Biol 435https://doi.org/10.1016/j.jmb.2023.168198
- 64The Base-Editing Enzyme APOBEC3A Catalyzes Cytosine Deamination in RNA with Low Proficiency and High SelectivityACS Chem Biol 17:629–636https://doi.org/10.1021/acschembio.1c00919
- 65Nanoscale Characterization of Interaction of APOBEC3G with RNABiochemistry 56:1473–1481https://doi.org/10.1021/acs.biochem.6b01189
- 66EditR: A Method to Quantify Base Editing from Sanger SequencingCrispr j 1:239–250https://doi.org/10.1089/crispr.2018.0014
- 67The SWISS-MODEL Repository-new features and functionalityNucleic Acids Res 45:D313–d319https://doi.org/10.1093/nar/gkw1132
- 68The PyMol Molecular Graphics SystemProteins Structure Function and Bioinformatics 30:442–454
- 69Hi-TOM: a platform for high-throughput tracking of mutations induced by CRISPR/Cas systemsSci China Life Sci 62:1–7https://doi.org/10.1007/s11427-018-9402-9
- 70MetaPlotR: a Perl/R pipeline for plotting metagenes of nucleotide modifications and other transcriptomic sitesBioinformatics 33:1563–1564https://doi.org/10.1093/bioinformatics/btx002
- 71BEDTools: a flexible suite of utilities for comparing genomic featuresBioinformatics 26:841–842https://doi.org/10.1093/bioinformatics/btq033
- 72WebLogo: a sequence logo generatorGenome Res 14:1188–1190https://doi.org/10.1101/gr.849004
- 73DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update)Nucleic Acids Res 50:W216–w221https://doi.org/10.1093/nar/gkac194
Article and author information
Author information
Version history
- Sent for peer review:
- Preprint posted:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
Copyright
© 2024, Zhou et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 253
- downloads
- 14
- citations
- 0
Views, downloads and citations are aggregated across all versions of this paper published by eLife.