RNA Splicing: An intimate view of a spliceosome component
Gene expression is a carefully regulated process. Within the nucleus of a eukaryotic cell, multiple molecules work in concert to control whether or not a gene is transcribed to produce a molecule of pre-messenger RNA. Other molecules then direct how this molecule is processed to form a messenger RNA, and still more molecules determine if the messenger RNA is, in turn, translated to form a protein.
Molecular complexes containing short RNA molecules and proteins also play several essential roles in regulating gene expression in eukaryotes. The RNA molecules in these so-called small nuclear ribonucleoprotein particles (or snRNPs) are rich in uridine bases, so they are often known as ‘U snRNPs’. Five of these particles are required for a process called pre-messenger RNA splicing: this involves the removal of introns—regions of RNA that do not code for proteins—from the pre-messenger RNA. Because the vast majority of genes in higher eukaryotes contain introns, this process must take place for almost all messenger RNAs.
Splicing is carried out by a large ribonucleoprotein complex known as the spliceosome. Unlike ribosomes, the particles that translate messenger RNA, spliceosomes are not pre-formed but are assembled anew on each intron. Thirty-five years ago it was proposed that U1 snRNP recognizes the start, or 5′ end, of an intron (Lerner et al., 1980; Rogers and Wall, 1980); this was confirmed by experiments six years later (Zhuang and Weiner, 1986). The recognition of the 5′ splice site by U1 snRNP is now known to be the molecular event that initiates the assembly of the spliceosome.
Now, in eLife, Kiyoshi Nagai and colleagues at the MRC Laboratory of Molecular Biology—including Yasushi Kondo and Chris Oubridge as joint first authors—report a high-resolution crystallographic analysis of the structure of U1 snRNP. In doing so, they finally reveal in detail how this ribonucleoprotein particle engages 5′ splice sites (Kondo et al., 2015).
The U1 snRNP is comprised of one short RNA molecule, a proteinaceous ring (called the Sm ring) made of seven Sm proteins, and three more U1 snRNP-specific proteins (named U1-70k, U1-A, and U1-C). When viewed in two dimensions, most of the RNA molecule resembles a cloverleaf (because it folds back on itself to form three loops). The first loop is the binding site for the U1-70k protein, the second is the binding site for the U1-A protein, and the third makes extensive contacts with the Sm ring. The U1 RNA molecule also contains a fourth stem loop, but this is far removed from the business end of the snRNP.
Nagai and colleagues had previously determined the structure of U1 snRNP using X-ray crystallography to a moderate resolution of 5.5 Å (Pomeranz Krummel et al., 2009). Attempts to obtain a more detailed picture of the structure were unsuccessful, largely because the complex was too flexible to form the highly ordered crystals needed for higher resolution. However, guided by the existing structure, Nagai, Kondo, Oubridge and colleagues were able to cleverly split the snRNP into two smaller substructures, each of which produced more ordered crystals that diffracted to high resolution. One substructure contained the Sm ring, the U1-C protein, a fragment of the U1-70K protein, and a shortened version of the U1 RNA molecule. The fragment of U1-70K was included because it was known to make multiple protein–protein interactions with the Sm ring and the U1-C protein (which also makes many contacts with the Sm ring).
As expected a section near the beginning of the U1 RNA molecule bound, via base pairing, to a complementary sequence in a short RNA molecule that had been designed to mimic a 5′ splice site. Of more interest were contacts made between this double-stranded RNA structure (or duplex) and the U1-C protein. It had previously been reported that U1 snRNP lacking its starting sequence, and thus unable to base pair with the 5′ splice site, still selected 5′ splice site sequences from a pool of RNA molecules of random sequence (Du and Rosbash, 2002). This result was interpreted to mean that the U1-C was a sequence specific RNA-binding protein.
Nevertheless, in the high-resolution crystal structure, U1-C does not make any contacts with the bases of either the U1 RNA molecule or the 5′ splice site RNA. Instead, it makes multiple contacts with the sugar phosphate backbones of both RNA strands (Figure 1). These observations simultaneously rule out the notion that the U1-C protein is a sequence-specific RNA-binding protein and reveal the true role of U1-C in splice site recognition. That is to say that, via backbone interactions, U1-C stabilizes the duplex between the U1 RNA molecule and 5′ splice sties. This stabilization function explains why many 5′ splice site sequences, some of which are not completely complementary to the sequence in the U1 RNA, can still interact in a functionally significant way with the U1 snRNP.
In closing, recent studies have revealed three other functions for the U1 snRNP beyond splice site recognition. First, it prevents the poly(A) tail—which marks the end of a mature messenger RNA—from being added too early or at the wrong sites in a new pre-messenger RNA (Kaida et al., 2010). This activity likely results from the U1-70K protein antagonizing the enzyme that builds the poly(A) tail onto the messenger RNA (Gunderson et al., 1998). The second function, which may well be related to the first, is that U1 snRNP regulates precisely where a poly(A) tail is added to pre-messenger RNAs with more than one useable site (Berg et al., 2012). As such, this latter function determines how long the mature messenger RNA will be. Third, it is now known that most promoters in higher cells initiate RNA synthesis in both directions. U1 snRNP has a central role in ensuring that only RNA synthesis in the right direction is productive (Almada et al., 2013). As the mechanisms behind these activities are analyzed, the crystal structure of U1 snRNP will undoubtedly aid in the design and interpretation of future biochemical experiments.
References
-
A mechanism for RNA splicingProceedings of the National Academy of Sciences of USA 77:1877–1879.https://doi.org/10.1073/pnas.77.4.1877
Article and author information
Author details
Publication history
Copyright
© 2015, Nilsen
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,577
- views
-
- 116
- downloads
-
- 2
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
- Structural Biology and Molecular Biophysics
Viral adhesion to host cells is a critical step in infection for many viruses, including monkeypox virus (MPXV). In MPXV, the H3 protein mediates viral adhesion through its interaction with heparan sulfate (HS), yet the structural details of this interaction have remained elusive. Using AI-based structural prediction tools and molecular dynamics (MD) simulations, we identified a novel, positively charged α-helical domain in H3 that is essential for HS binding. This conserved domain, found across orthopoxviruses, was experimentally validated and shown to be critical for viral adhesion, making it an ideal target for antiviral drug development. Targeting this domain, we designed a protein inhibitor, which disrupted the H3-HS interaction, inhibited viral infection in vitro and viral replication in vivo, offering a promising antiviral candidate. Our findings reveal a novel therapeutic target of MPXV, demonstrating the potential of combination of AI-driven methods and MD simulations to accelerate antiviral drug discovery.
-
- Chromosomes and Gene Expression
- Structural Biology and Molecular Biophysics
Type II nuclear receptors (T2NRs) require heterodimerization with a common partner, the retinoid X receptor (RXR), to bind cognate DNA recognition sites in chromatin. Based on previous biochemical and overexpression studies, binding of T2NRs to chromatin is proposed to be regulated by competition for a limiting pool of the core RXR subunit. However, this mechanism has not yet been tested for endogenous proteins in live cells. Using single-molecule tracking (SMT) and proximity-assisted photoactivation (PAPA), we monitored interactions between endogenously tagged RXR and retinoic acid receptor (RAR) in live cells. Unexpectedly, we find that higher expression of RAR, but not RXR, increases heterodimerization and chromatin binding in U2OS cells. This surprising finding indicates the limiting factor is not RXR but likely its cadre of obligate dimer binding partners. SMT and PAPA thus provide a direct way to probe which components are functionally limiting within a complex TF interaction network providing new insights into mechanisms of gene regulation in vivo with implications for drug development targeting nuclear receptors.