Exploiting functional regions in the viral RNA genome as druggable entities

  1. Dehua Luo
  2. Yingge Zheng
  3. Zhiyuan Huang
  4. Zi Wen
  5. Lijun Guo
  6. Yingxiang Deng
  7. Qingling Li
  8. Yuqing Bai
  9. Shozeb Haider
  10. Dengguo Wei  Is a corresponding author
  1. National Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, China
  2. Hubei Hongshan Laboratory, Interdisciplinary Sciences Institute, Huazhong Agricultural University, China
  3. National Reference Laboratory of Veterinary Drug Residues (HZAU) and National Safety Laboratory of Veterinary Drug (HZAU), MOA Key Laboratory for Detection of Veterinary Drug Residues, MOA Laboratory for Risk Assessment of Quality and Safety of Livestock and Poultry Products, China
  4. College of Life Science and Technology, Huazhong Agricultural University, China
  5. Frontiers Science Center for Animal Breeding and Sustainable Production, China
  6. College of Informatics, Huazhong Agricultural University, China
  7. University of Tabuk (PFSCBR), Saudi Arabia
  8. UCL School of Pharmacy, University College London, United Kingdom
  9. UCL Centre for Advanced Research Computing, University College London, United Kingdom
  10. Shenzhen Institute of Nutrition and Health, Huazhong Agricultural University, China
  11. Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, China
6 figures and 9 additional files

Figures

Figure 1 with 1 supplement
Four regions with different characteristics in the porcine epidemic diarrhea virus (PEDV) genome.

(A) Schematic of SHAPE-MaP for probing the RNA structure of the PEDV genome in situ. (B) Well-folded regions (low SHAPE reactivity and low Shannon entropy; 26.40% of genome). These regions represent stably folded RNA structures with minimal conformational flexibility, serving as structural scaffolds or functional elements in viral replication. (C) Dynamic structured regions (low SHAPE reactivity and high Shannon entropy; 11.70% of genome). These conformationally plastic domains mediate regulatory switches between alternative secondary structures during infection. (D) Dynamic unpaired regions (high SHAPE reactivity and high Shannon entropy; 26.90% of genome). These regions are prone to form non-canonical nucleic acid structures (e.g., G-quadruplexes), which can be stabilized by small-molecule ligands to inhibit viral replication. (E) Persistent unpaired regions (high SHAPE reactivity and low Shannon entropy; 9.67% of genome). These regions are more accessible for siRNA binding, facilitating recruitment of Argonaute proteins and Dicer to form the RNA-induced silencing complex (RISC) for targeted cleavage.

Figure 1—figure supplement 1
Genome-wide SHAPE-MaP analysis of porcine epidemic diarrhea virus (PEDV).

(A) Heat scatter plot of the SHAPE reactivities across two biological replicates for the ex vivo datasets. (B) Reverse transcription mutation rates (median indicated by line, average indicated by ‘.’) for two ex vivo biological replicates: NAI-treated (modified) vs. the DMSO-treated (unmodified). A significant increase in the reverse transcription mutation rate was observed in the NAI-treated group compared to the DMSO-treated group, suggesting that NAIs are effective at modifying viral genomic RNA ex vivo. (C) SHAPE secondary structure modeling of the PEDV 5’ UTR. ***p<0.001 by equal variance unpaired Student’s t-test.

Figure 2 with 14 supplements
SHAPE structural map of the porcine epidemic diarrhea virus (PEDV) genome in infected cells.

From top to bottom: SHAPE reactivity, Shannon entropy, base-pairing probabilities, translation reading frames are indicated by arrows, and genomic coordinates are marked at the bottom. Well-folded regions with low SHAPE reactivity and low Shannon entropy are shaded in blue. Potential G4 forming sequences (PQSs) are marked with a black arrow.

Figure 2—figure supplement 1
Folding characteristics of known functional regions and coding areas in the porcine epidemic diarrhea virus (PEDV) genome.

The length of ORF1ab is 20,345 nt, accounting for 72.5% of the total length of the PEDV genome (28,044 nt) (a major contributor to the global median). The blue dashed line and the orange dashed line represent the global median SHAPE reactivity and the global median Shannon entropy, respectively.

Figure 2—figure supplement 2
Well-folded structures 1–8 in porcine epidemic diarrhea virus (PEDV) genome.

Secondary structure predicted by SHAPE reactivity as a constraint. Green shading indicates high accessibility windows.

Figure 2—figure supplement 3
Well-folded structures 9–15 in porcine epidemic diarrhea virus (PEDV) genome.

Secondary structure predicted by SHAPE reactivity as a constraint. Green shading indicates high accessibility windows.

Figure 2—figure supplement 4
Well-folded structures 10–21 in porcine epidemic diarrhea virus (PEDV) genome.

Secondary structure predicted by SHAPE reactivity as a constraint. Green shading indicates high accessibility windows.

Figure 2—figure supplement 5
Well-folded structures 22–29 in porcine epidemic diarrhea virus (PEDV) genome.

Secondary structure predicted by SHAPE reactivity as a constraint. Green shading indicates high accessibility windows.

Figure 2—figure supplement 6
Well-folded structures 30–37 in porcine epidemic diarrhea virus (PEDV) genome.

Secondary structure predicted by SHAPE reactivity as a constraint. Green shading indicates high accessibility windows.

Figure 2—figure supplement 7
Well-folded structures 38–42 in porcine epidemic diarrhea virus (PEDV) genome.

Secondary structure predicted by SHAPE reactivity as a constraint. Green shading indicates high accessibility windows.

Figure 2—figure supplement 8
Well-folded structures 43–50 in porcine epidemic diarrhea virus (PEDV) genome.

Secondary structure predicted by SHAPE reactivity as a constraint.Green shading indicates high accessibility windows.

Figure 2—figure supplement 9
Well-folded structures 51–54 in porcine epidemic diarrhea virus (PEDV) genome.

Secondary structure predicted by SHAPE reactivity as a constraint. Green shading indicates high accessibility windows.

Figure 2—figure supplement 10
Well-folded structures 55–57 in porcine epidemic diarrhea virus (PEDV) genome.

Secondary structure predicted by SHAPE reactivity as a constraint. Green shading indicates high accessibility windows.

Figure 2—figure supplement 11
Well-folded structures 58–60 in porcine epidemic diarrhea virus (PEDV) genome.

Secondary structure predicted by SHAPE reactivity as a constraint. Green shading indicates high accessibility windows.

Figure 2—figure supplement 12
Structures of 5’UTR-SL5 in the genomes of porcine epidemic diarrhea virus (PEDV) and SARS-CoV-2.

(A) Secondary structure of 5’UTR-SL5 in the PEDV genome predicted by SHAPE reactivity as a constraint. (B) Secondary structure of 5’UTR-SL5 in the SARS-CoV-2 genome (Huston et al., 2021). (C) The three-dimensional structural model of PEDV 5’UTR-SL5 based on secondary structure was predicted using FARFAR2. (D) Three-dimensional of 5’UTR-SL5 in the SARS-CoV-2 genome (PDB id: 8UYS).

Figure 2—figure supplement 13
ITC binding profile of (A) compound 1 (1-(2-methylpyrimidin-4-yl) piperidin-4-amine; 250 µM) and (B) compound 4 (4-(4-chlorophenyl) pyrimidin-2-amine; 250 µM) titrated into the 10 µM 5’UTR-SL5 of porcine epidemic diarrhea virus (PEDV).

Buffer solution: 100 mM HEPES (pH 8.0), 100 mM NaCl, and 10 mM MgCl2. Temperature: 25°C.

Figure 2—figure supplement 14
Antiviral activity of compounds interacting with the porcine epidemic diarrhea virus (PEDV) 5’UTR-SL5 in vitro.

Lycorine is a PEDV-positive inhibitor. Confocal fluorescence microphotographs of the same cells with DAPI (left), N protein of PEDV (middle) or Merge (right) were demonstrated (scale bar = 50 µm).

Figure 3 with 7 supplements
The PQS1 within high SHAPE-high Shannon regions has the potential to form G4 structure in the porcine epidemic diarrhea virus (PEDV) genome.

(A) Distribution of PQSs in the PEDV genome. (B) SHAPE reactivity and Shannon entropy for the regions containing PQS1. (C) Local secondary structure of the region containing PQS1 predicted with SHAPE reactivity constraints. PQS1 is marked with a blue dashed box. (D) Fluorescence turn-on assays of ThT (1 µM) in the presence of PQS1 (0.5 µM), PQS1-long chain (0.5 µM), and PQS1mut-long chain (0.5 µM). (E) Fluorescence turn-on assays of ThT (1 µM) in the presence of PQS3 (0.5 µM) and PQS3-long chain (0.5 µM). Excitation wavelength was 442 nm. PRRSV-G4 RNA was used as control.

Figure 3—figure supplement 1
Prediction of potential G-quadruplex forming sequences (PQSs) in the porcine epidemic diarrhea virus (PEDV) genome by different prediction tools.

Edwards’ Venn diagram showing the overlap of the putative PQSs predicted by G4Catchall, QGRS mapper, ImGQfinder and pqsfinder.

Figure 3—figure supplement 2
Secondary structure of the local region containing PQS2 predicted by SHAPE reactivity as a constraint.
Figure 3—figure supplement 3
Secondary structure of the local region containing PQS3 predicted by SHAPE reactivity as a constraint.
Figure 3—figure supplement 4
Secondary structure of the local region containing PQS4 predicted by SHAPE reactivity as a constraint.
Figure 3—figure supplement 5
Secondary structure of the local region containing PQS5 predicted by SHAPE reactivity as a constraint.
Figure 3—figure supplement 6
PQSs in well-folded regions are resistant to G4 formation.

(A) Fluorescence turn-on assays of ThT (1 µM) in the presence of PQS2 (0.5 µM) and PQS2-long chain (0.5 µM). (B) Fluorescence turn-on assays of ThT (1 µM) in the presence of PQS4 (0.5 µM), PQS4-long chain (0.5 µM), and PQS4mut-long chain (0.5 µM). (C) Fluorescence turn-on assays of ThT (1 µM) in the presence of PQS6 (0.5 µM) and PQS6-long chain (0.5 µM). Excitation wavelength was 442 nm. PRRSV-G4 RNA was used as control.

Figure 3—figure supplement 7
G-quadruplex structure folded by PQS1.

(A) WebLogo representation nucleotide conservation of PQS1 among different porcine epidemic diarrhea virus (PEDV) strains. The height of the letters indicates the relative frequency of occurrence of a particular nucleotide at that position. (B) Native-PAGE analysis of the migration of G4 RNAs (PQS1) and G4 mutant RNAs (PQS1m) was performed under different KCl concentrations. Lanes 1 and 5, RNA ladder; lanes 2, 3, 4, PQS1; lanes 6, 7, 8, PQS1m. (C) CD spectroscopy of PQS1 in the presence of different concentrations of KCl.

Figure 3—figure supplement 7—source data 1

Original native-PAGE gel for Figure 3—figure supplement 7B, indicating the relevant bands and treatments.

https://cdn.elifesciences.org/articles/103923/elife-103923-fig3-figsupp7-data1-v2.zip
Figure 3—figure supplement 7—source data 2

Original files for native-PAGE gel displayed in Figure 3—figure supplement 7B.

https://cdn.elifesciences.org/articles/103923/elife-103923-fig3-figsupp7-data2-v2.zip
Figure 4 with 4 supplements
The G-quadruplex structure, biological functions of PQS1, and antiviral effects of Braco-19.

(A) 1H nuclear magnetic resonance (NMR) analysis of PQS1. (B) Circular dichroism (CD) melting profiles of PQS1. (C) Quantitative fluorescence signal using the corrected total cell fluorescence method for EGFP in cells transfected with plasmids containing the empty vector, PQS1 and PQS1mut. (D) Proliferation curve of the PEDV wild type (WT) strain and PQS1 mutant strain. (E) The relative inhibition rates of Braco-19 against AJ1102-WT and AJ1102-PQS1mut. (F) Western blot analysis of the effects of Braco-19 on the viral N protein expression of AJ1102-WT and AJ1102-PQS1mut.

Figure 4—source data 1

Original western blots for Figure 4F, indicating the relevant bands and treatments.

https://cdn.elifesciences.org/articles/103923/elife-103923-fig4-data1-v2.zip
Figure 4—source data 2

Original files for western blot analysis displayed in Figure 4F.

https://cdn.elifesciences.org/articles/103923/elife-103923-fig4-data2-v2.zip
Figure 4—figure supplement 1
G4 structure of PQS1 inhibits RNA-dependent RNA synthesis.

(A) Schematic diagram of the RNA stop assay. The PQS1 sequence (orange part) was integrated into the 5’ end of the RNA template, and a 5’ FAM-tagged RNA primer P15 (green and blue parts of the RNA primer) was designed to target the 3’ end of the template (blue part of the RNA blue part of the template) to initiate RNA synthesis. When G4 blocks RNA synthesis, there are fewer full-length elongated bands and truncated bands at the site of G4 formation. (B) Denaturing-PAGE analysis. Black arrows indicate the positions of the full-length product, G4 paused product, and free primer. The fully extended product and the template RNA and the polymerase form a stable complex that moves much more slowly than the ssRNA. Partial full-length extended products and stopped products (due to G-quadruplex fold) were observed along template PQS1 (lane 2), but only fully extended products were observed along G4-mutated template PQS1mut (lane 5). When increasing amounts of compound Braco-19 were incubated with template PQS1, a gradual decrease in fully extended products was observed (lanes 2–4). On the contrary, the fully extended products of template PQS1mut were not affected by the addition of Braco-19, and the G4-specific termination event was not characterized (lanes 5–7). Lane 1, RNA ladder (p15, m17, m20, and m42 in Supplementary file 1); lanes 2, no Braco19 control; lanes 3 and 4, 0.5 and 1 µM compounds inhibit RNA extension; lanes 5, no Braco19 control; lanes 6 and 7, compounds do not inhibit RNA extension.

Figure 4—figure supplement 1—source data 1

Original denaturing-PAGE gel for Figure 4—figure supplement 1B, indicating the relevant bands and treatments.

https://cdn.elifesciences.org/articles/103923/elife-103923-fig4-figsupp1-data1-v2.zip
Figure 4—figure supplement 1—source data 2

Original files for denaturing-PAGE gel displayed in Figure 4—figure supplement 1B.

https://cdn.elifesciences.org/articles/103923/elife-103923-fig4-figsupp1-data2-v2.zip
Figure 4—figure supplement 2
PQS1 inhibits green fluorescent protein (GFP) expression.

(A) Structure of compound Braco-19. (B) Schematic representation of the pEGFP-C1 plasmids. The empty vector carries the wild-type EGFP sequence, while the engineered plasmids contain the porcine epidemic diarrhea virus (PEDV) PQS1 sequence and the PQS1mut sequence. (C) Plasmids containing wild-type EGFP, PQS1, and PQS1mut were transfected into Vero cells, which were then treated with 0, 5 or 10 µM Braco-19 for 24 h. EGFP expression levels were measured via confocal fluorescence microscopy. Compared with DMSO treatment, the addition of Braco-19 led to a significant decrease in PQS1 EGFP fluorescence intensity. In contrast, no difference was observed in the EGFP expression of the PQS1mut vector and the empty vector. Scale bar = 50 µm.

Figure 4—figure supplement 3
The PQS3 in the well-folded region lacks biological function.

(A) Schematic diagram of the process used to obtain the PQSs mutant recombinant strains. (B) Proliferation curve of the PQS3 mutant strain. (C) The relative inhibition rates of Braco-19 against AJ1102-WT and AJ1102-PQS3mut.

Figure 4—figure supplement 4
Antiviral activity and cytotoxicity of Braco-19.

(A) The median effective inhibitory concentration (EC50) of Braco-19 against PEDV AJ1102 in Vero cells and (B) the cytotoxicity of Braco-19 for Vero cells.

Figure 5 with 2 supplements
Secondary structure of target regions and antiviral effects of siRNAs.

(A–D) Local secondary structures of the stable single-stranded regions targeted by siRNAs predicted by SHAPE reactivity as a constraint; (A) ss-1; (B) ss-2; (C) ss-3; (D) ss-4. (E) qPCR showing the relative abundance of the porcine epidemic diarrhea virus (PEDV) RNA genome in infected Vero cells. The four siRNAs targeting the high SHAPE-low Shannon regions and the four siRNAs targeting the duplex regions are labeled ss-1 to ss-4 and ds-1 to ds-4, respectively. ss (single-stranded targeting siRNAs); ds (dual-stranded targeting siRNAs). si-NC was a control siRNA that did not target any viral or host sequences, and the mock group was not inoculated with virus. (F) TCID50 assays for detecting virus titers. The presented results represent the means and standard deviations of data from three independent experiments. ns: no significant difference. *p<0.05; **p<0.01; ***p<0.001, Duncan’s multiple comparison test.

Figure 5—figure supplement 1
Local secondary structures of the duplex regions targeted by siRNAs.

Secondary structure of the (A) ds-1; (B) ds-2; (C) ds-3; (D) ds-4 target regions predicted by SHAPE reactivity as a constraint. Ds: dual-stranded targeting siRNAs.

Figure 5—figure supplement 2
Cytopathic effects (CPE) of siRNA targeting regions of duplex or single-strand on porcine epidemic diarrhea virus (PEDV)-infected Vero cells.

Scale bar = 50 µm.

Single-stranded RNA regions in viral genomes as the targets for antiviral therapy.

(A) PQSs in the single strands are easier to be induced into G4s by ligands than those in the paired regions. (B) Influence of the secondary structures on the binding efficiency of siRNAs.

Additional files

MDAR checklist
https://cdn.elifesciences.org/articles/103923/elife-103923-mdarchecklist1-v2.docx
Supplementary file 1

Oligos used in this study.

https://cdn.elifesciences.org/articles/103923/elife-103923-supp1-v2.docx
Supplementary file 2

Characteristics of regions with different SHAPE reactivity and Shannon entropy.

https://cdn.elifesciences.org/articles/103923/elife-103923-supp2-v2.docx
Supplementary file 3

Location of regions with different SHAPE reactivity and Shannon entropy.

https://cdn.elifesciences.org/articles/103923/elife-103923-supp3-v2.docx
Supplementary file 4

Binding affinities of Compounds with 5'UTR-SL5 in the PEDV genome.

https://cdn.elifesciences.org/articles/103923/elife-103923-supp4-v2.docx
Supplementary file 5

PQSs with high conservation in the PEDV genome.

https://cdn.elifesciences.org/articles/103923/elife-103923-supp5-v2.docx
Supplementary file 6

Genomic locations and structural features of PQS-long chain regions.

https://cdn.elifesciences.org/articles/103923/elife-103923-supp6-v2.docx
Supplementary file 7

Sequence and characterization of target regions of siRNAs.

https://cdn.elifesciences.org/articles/103923/elife-103923-supp7-v2.docx
Supplementary file 8

Structural features of anti-SARS-CoV-2 siRNA target regions.

https://cdn.elifesciences.org/articles/103923/elife-103923-supp8-v2.docx

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Dehua Luo
  2. Yingge Zheng
  3. Zhiyuan Huang
  4. Zi Wen
  5. Lijun Guo
  6. Yingxiang Deng
  7. Qingling Li
  8. Yuqing Bai
  9. Shozeb Haider
  10. Dengguo Wei
(2025)
Exploiting functional regions in the viral RNA genome as druggable entities
eLife 13:RP103923.
https://doi.org/10.7554/eLife.103923.3