Structural determinants of nuclear export signal orientation in binding to exportin CRM1

Abstract
eLife digest
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

The Chromosome Region of Maintenance 1 (CRM1) protein mediates nuclear export of hundreds of proteins through recognition of their nuclear export signals (NESs), which are highly variable in sequence and structure. The plasticity of the CRM1-NES interaction is not well understood, as there are many NES sequences that seem incompatible with structures of the NES-bound CRM1 groove. Crystal structures of CRM1 bound to two different NESs with unusual sequences showed the NES peptides binding the CRM1 groove in the opposite orientation (minus) to that of previously studied NESs (plus). Comparison of minus and plus NESs identified structural and sequence determinants for NES orientation. The binding of NESs to CRM1 in both orientations results in a large expansion in NES consensus patterns and therefore a corresponding expansion of potential NESs in the proteome.

https://doi.org/10.7554/eLife.10034.001

eLife digest

Many organisms keep their DNA within a structure inside their cells called the nucleus. Two layers of membrane surround the nucleus and keep the DNA separate from the rest of the cell's contents. Yet, proteins and other molecules can move in and out of the nucleus by passing through small pores in this nuclear membrane.

To travel through these pores, larger molecules such as proteins rely on the assistance of transport receptors, including one called CRM1. This transport receptor helps to export hundreds of different proteins from the nucleus by recognizing a part of their structure called the ‘nuclear export signal’. Earlier work has shown that three different nuclear export signals interact with CRM1 in a similar ways by binding to a groove on its outer surface. But, there are several different types of nuclear export signal, and many are predicted to have three-dimensional structures that would seem to prevent them from binding to CRM1 in this way. As yet, it remains unknown how these diverse signals interact with this important transporter receptor.

Protein crystallization is a technique that is used to visualize a protein's three-dimensional structure. Fung et al. have now used this approach to investigate how a particular class of nuclear export signals (called ‘class 3’) bind to CRM1. First, a modified form of CRM1 was crystallized once it had bound to a small fragment of protein that contains a class 3 nuclear export signal. The protein's molecular structure was then revealed by performing X-ray diffraction on the crystals.

The results show, unexpectedly, that two different nuclear export signals in class 3 bind to the groove of CRM1 in the opposite direction to that reported previously. Additional biochemical and structural experiments then identified a particular feature or motif in the nuclear export signals that determines which way round they bind to CRM1.

This discovery advances our understanding of how these signals work, which will allow us to more accurately identify new nuclear export signals from genome sequences. As more CRM1-binding nuclear export signals are discovered in the future, the experimental data sets used to train the computational programs that are currently used to locate these signals in genomic sequences will be diversified and improved.

https://doi.org/10.7554/eLife.10034.002

Introduction

The exportin CRM1 (Chromosome Region Maintenance 1 protein; also known as exportin 1 or XPO1) is the most prominent nuclear export receptor in the cell. CRM1 maintains the cellular localization of hundreds of diverse-functioning protein cargos, including many tumor suppressor, cell cycle proteins, and viral proteins (Fornerod et al., 1997; Fukuda et al., 1997; Ossareh-Nazari et al., 1997). CRM1 is also a promising cancer drug target, and a small molecule inhibitor of CRM1 named Selinexor is currently in more than 40 clinical trials for a variety of cancers (clinicaltrials.gov) (Lapalombella et al., 2012; Etchin et al., 2013; Sun et al., 2013; Fung and Chook, 2014; Xu et al., 2015). CRM1 recognizes its protein cargos through 8–15 residue long nuclear export signals (NESs) in the proteins (la Cour et al., 2004; Kosugi et al., 2008; Xu et al., 2010). NES sequences are highly diverse, and the peptides bind CRM1 with a large affinity range, with dissociation constants (K_Ds) ranging from low nanomolar to tens of micromolar (Kutay and Guttinger, 2005). Sequence, peptide-library, and bioinformatic analyses have found that NESs are best described by a set of six consensus sequences, which differ in the spacings between four key hydrophobic residues Φ1, Φ2, Φ3, and Φ4 (Figure 1A) (la Cour et al., 2004; Kosugi et al., 2008; Xu, Farmer et al., 2012a). While sequence patterns are available to describe many NESs, there is limited structural information on diverse NESs and how they bind CRM1.

Figure 1 with 1 supplement see all

Download asset Open asset

hRio2^NES and CPEB4^NES bind CRM1 in orientation opposite to the PKI^NES.

(A) Six nuclear export signal (NES) consensus patterns (Φ is Leu, Val, Ile, Phe or Met; X is any amino acid). (B) Structure of PKI^NES (yellow cartoon) bound to Chromosome Region of Maintenance (CRM1) (gray surface) (3NBY) on the right and PKI^NES was removed to show hydrophobic pockets P0–P4 in the CRM1 groove on the left panel. (C) Overall structure of the CRM1* (gray)-RanGppNHp (orange)-RanBP1 (light purple)-hRio2^NES (blue) complex. (D) Structures of hRio2^NES (blue) and CPEB4^NES (purple) bound to the CRM1 groove (gray surfaces). All NES peptides are in cartoon and their hydrophobic Φ residues shown as sticks. Their Φ residues and the corresponding P0–P4 CRM1 pockets that they bind are shown below. (E) Kick OMIT map meshes contoured at the 3.0σ level overlaid on the final, refined coordinates for hRio2^NES and CPEB4^NES. Kicked OMIT maps were generated by PHENIX by omitting the NES peptides.

https://doi.org/10.7554/eLife.10034.003

Structures are available for only three different NESs, from the cargos protein kinase A inhibitor (PKI), Snurportin-1 (SNUPN), and the HIV1-Rev protein, bound to CRM1. The NESs bind in a hydrophobic groove, which is located on the outer/convex surface of the ring-shaped CRM1 (Dong et al., 2009; Monecke et al., 2009; Güttler et al., 2010). The NESs use almost exclusively their side chains, especially their hydrophobic Φ side chains, to bind CRM1. The NES-binding groove of CRM1, which is wide at one end and narrow at the other end, consists of 5 hydrophobic pockets P0–P4 and is virtually identical in all CRM1-NES structures (Figure 1B). NESs from PKI and SNUPN (PKI^NES and SNUPN^NES) share a similar structure when bound to CRM1—an N-terminal 3-turn α-helix followed by a short C-terminal β-strand-like extension (Figure 1B) (Dong et al., 2009; Monecke et al., 2009; Güttler et al., 2010; Koyama et al., 2014). The NES helix binds the wide part of the CRM1 groove, while the β-strand binds the narrow end of the groove. The Rev^NES peptide binds the CRM1 groove in a different manner by adopting an entirely extended conformation (Güttler et al., 2010). All three NES peptides bind in the same direction, with their N-termini at the wide part of the groove.

The vastly different conformations of the extended Rev^NES compared to the helix-strand PKI^NES and SNUPN^NES suggest that CRM1 may recognize divergent signal sequences in part by binding different peptide structures. The repertoire of conformations for CRM1-bound NESs remains unclear, but the asymmetric and seemingly structurally invariant NES-bound CRM1 groove presents physical constraints on structures of bound NESs. For example, the class 3 NES consensus of Φ1X_(2,3)Φ2X_(2,3)Φ3X₂Φ4 with two intervening residues between Φ3 and Φ4 suggests a single long NES helix. The substitution of a narrow strand or extended chain at the C-terminus of an NES with a helix presents a steric problem as the thicker helix is unlikely to fit into the tapering CRM1 groove. In current NES databases, class 3 NES sequences are as prevalent as NESs of classes 1b, 1c, 1d, and 2, but information of how they are able to bind CRM1 is missing (Xu et al., 2015).

We have developed a general strategy to crystallize CRM1 bound to NES peptides in order to study how diverse sequences, including the enigmatic class 3 NESs, bind the exportin. Crystal structures of two different class 3 NESs bound to CRM1 revealed a novel NES binding mode where polypeptide direction of the NES is reversed. We show that NES peptides can bind the CRM1 groove bidirectionally (in both plus and minus directions), and biochemical and structural analyses identified determinants for one direction of binding vs the other. Bidirectional exportin-signal interactions suggest a significant expansion of the current NES consensus patterns that will enable new, previously unknown NESs to be identified.

Results

A general strategy for structure determination of CRM1 bound to NES peptides

Crystallization of CRM1-Ran-NES peptide complexes has generally not been successful, possibly due to conformational flexibility and low affinities for the NESs (Kutay and Guttinger, 2005). Crystal structures of NESs bound to CRM1 were instead determined using the CRM1-Ran-SNUPN complexes, including ones where the SNUPN^NES was replaced with the PKI^NES and Rev^NES (Güttler et al., 2010). This strategy was limited because of severe mosaicity of the CRM1-Ran-SNUPN crystals (Güttler et al., 2010). On the other hand, the ternary complex of Saccharomyces cerevisiae CRM1 (^ScCRM1) with RanGTP (human or yeast RanGTP, Gsp1p) and RanBP1 (Yrb1p) reliably yields crystals that diffract to high resolution and has been used to determine structures of several CRM1-inhibitor complexes (Koyama and Matsuura, 2010; Lapalombella et al., 2012; Etchin et al., 2013; Sun et al., 2013; Haines et al., 2015). We therefore used the CRM1-Ran-RanBP1 complex to determine structures of the exportin bound to the enigmatic class 3 NES peptides.

RanBP1 binding normally stimulates NES release by closing the NES-binding groove of ^ScCRM1 (Koyama and Matsuura, 2010). We engineered CRM1 to shift the open-closed groove equilibrium toward the open state, in order for CRM1-Ran-RanBP1 to bind NESs. We started with a ^ScCRM1 construct (residues 1–1058, Δ377–413, ⁵³⁷DLTVK⁵⁴¹ to GLCEQ) that is known to crystallize easily and has an NES groove that is virtually identical to that of human CRM1 (Lapalombella et al., 2012; Etchin et al., 2013; Sun et al., 2013; Haines et al., 2015). Koyama and Matsuura showed that mutation of the H9 loop of CRM1, which packs against the back of a closed NES groove, stabilizes the open CRM1 groove (Koyama and Matsuura, 2010). Thus, we mutated the H9 loop (Val441Asp) to detach it from the back of the NES groove and to open the groove even when CRM1 is complexed with Ran and RanBP1. The resulting CRM1* construct, with ^ScCRM1 residues 1–1058, Δ377–413, V441D and groove residues ⁵³⁷DLTVK⁵⁴¹ mutated to GLCEQ (to mimic human CRM1, see methods), binds NES peptides in the presence of RanGTP and RanBP1. We generated quaternary CRM1*-RanGppNHp-RanBP1-NES complexes with two class 3 NES peptides, the hRio2^NES (³⁸⁹RSFEMTEFNQALEEI⁴⁰³) and the CPEB4^NES (³⁷⁹RTFDMHSLESSLIDI³⁹³; predicted Φ1–Φ4 positions are underlined) and determined their structures to 2.3 Å and 2.1 Å resolution (Figure 1C,D; crystallographic statistics in Table 1). The crystals are isomorphous to previously crystallized inhibitor-bound CRM1-Ran-RanBP1 complexes, with one CRM1*-RanGppNHp-RanBP1-NES complex in the asymmetric unit (Sun et al., 2013). Residues modeled in the three proteins are CRM1* residues 1–440 and 460–1053, Ran residues 9–216, and RanBP1 residues 63–69 and 78–200. hRio2^NES residues 391–403 and CPEB4^NES residues 379–393 were modeled in the respective structures.

Table 1

Data collection and refinement statistics

https://doi.org/10.7554/eLife.10034.005

	^ScXPO1-RanGppNHp-Yrb1p bound to NES of:
	Selenomethione-hRio2	CPEB4	Selenomethione-hRio2-R	CPEB4-R	PKI-Flip3
Data collection
Space group	P4₃2₁2
Cell dimensions
a, b, c (Å)	106.48, 106.48, 303.73	105.96, 105.96, 304.00	106.69, 106.69, 304.50	106.48, 106.48, 303.73	105.96, 105.96, 304.00
a, b, g (°)	90, 90, 90	90, 90, 90	90, 90, 90	90, 90, 90	90, 90, 90
Resolution (Å)	50.00–2.28 (2.32–2.28)*	50.00–2.10 (2.14–2.10)	50.00–2.28 (2.32–2.28)	50.00–2.94 (3.00–2.94)	50.00–2.55 (2.59–2.55)
R_pim	2.9 (37.7)	3.5 (43.4)	3.5 (38.6)	4.9 (40.6)	4.1 (46.5)
I/sI	24.3 (2.17)	19.5 (1.70)	22.5 (2.72)	13.3 (1.87)	19.0 (1.92)
Completeness (%)	98.6 (99.8)	99.5(100)	98.0 (99.2)	94.6 (96.0)	99.6 (100)
Redundancy	7.0 (5.9)	6.0 (6.1)	7.0 (7.0)	6.2 (5.7)	5.5 (5.5)
Refinement
Resolution (Å)	45.7–2.28 (2.32–2.28)	40.2–2.09 (2.12–2.09)	37.7–2.28 (2.31–2.28)	47.5–2.94 (3.02–2.94)	47.5–2.54 (2.60–2.54)
No. reflections	77,245 (2833)	98,659 (1793)	79,492 (3267)	34,265 (2013)	56862 (3361)
R_work/R_free	17.8 (25.8)/21.9 (27.3)	17.0 (23.8)/20.8 (27.0)	16.8 (24.7)/21.2 (27.6)	18.1 (25.2)/24.0 (31.3)	18.6 (25.0)/22.6 (30.6)
No. atoms
Protein	10,859	11,114	10,823	10,708	10797
Ligand/ion	60	76	59	51	51
Water	271	660	358	8	253
NES Peptide/Φ	111/46	122/43	130/46	112/43	105/43
B-factors
Protein	42.0	39.3	43.9	53.8	46.5
Ligand/ion	44.3	51.7	46.9	41.8	41.6
Water	33.4	34.8	35.4	23.3	35.3
NES peptide/Φ	80.5/77.3	77.6/70.4	67.5/61.7	81.2/80.5	98.6/96.0
R.m.s deviations
Bond lengths (Å)	0.003	0.003	0.006	0.003	0.004
Bond angles (°)	0.617	0.689	0.835	0.578	0.673
PDB code	5DHF	5DIF	5DI9	5DHA	5DH9

*

Highest resolution shell is shown in parenthesis.
One crystal was used for each structure.

Overall structures of the CRM1*-Ran-RanBP1-NES complexes are highly similar to previously determined CRM1-Ran-RanBP1 structures (all residue Cα rmsds 0.2–0.5 Å when compared to unliganded CRM1-Ran-RanBP1 (PDB code: 3M1I, 4HB2) (Koyama and Matsuura, 2010; Sun et al., 2013) and to inhibitor-bound CRM1-Ran-RanBP1 complexes (PDB code: 4HAT, 4HAU, 4HAV, 4GMX, 4GPT) (Lapalombella et al., 2012; Etchin et al., 2013; Sun et al., 2013; Haines et al., 2015). Structures of the NES peptides were verified by kick-OMIT maps (Praznikar et al., 2009) generated without the peptide (Figure 1E, stereo views in Figure 1—figure supplement 1). Selenomethionine hRio2^NES peptide was also generated and anomalous data were collected to confirm correct placement of its methionine, and unambiguously confirm the direction of the NES polypeptide chain (Figure 1—figure supplement 1).

hRio2^NES and CPEB4^NES bind CRM1 in the opposite or minus direction

The CRM1-bound hRio2^NES and CPEB4^NES structures unexpectedly revealed that both NESs bound the CRM1 groove in opposite orientation (termed the minus direction) compared to previous NES structures (PKI^NES, SNUPN^NES, and Rev^NES bind in the plus direction) (Figure 1B,D). The NES groove of CRM1 is nearly invariant when bound to plus or minus NESs (Cα rmsds 0.3–0.5 Å; all atom rmsds 1.0–1.3 Å, for CRM1 residues 521–605 in all available CRM1-NES structures [Dong et al., 2009; Monecke et al., 2009; Güttler et al., 2010]). Although the polypeptide directions are reversed, local structures of the hRio2^NES and CPEB4^NES are similar to those of the PKI^NES and SNUPN^NES. All four NES peptides are combinations of 3-turn α-helices and 2-residue β-strand-like extensions.

Helices of the minus NESs are now at the C-termini of the peptides and their strands at the N-termini (Figure 1D). Both plus and minus NES helices bind the same part of the CRM1 groove, with hydrophobic residues from one helix face occupying hydrophobic pockets P0–P3 of CRM1 (Figure 1B,D). The structures show that the two minus NESs clearly match the consensus pattern Φ1XΦ2XXXΦ3XXΦ4XXΦ5, which is the reverse of the class 1a pattern, Φ0XXΦ1XXΦ2XXXΦ3XΦ4. The five hydrophobic residues of the hRio2^NES and CPEB4^NES, designated Φ1–Φ5, bind the same P0–P4 pockets as the plus NESs, but in reverse order, with Φ1 in P4 and Φ5 in P0 (Figure 1B,D). The narrow part of the CRM1 groove is still occupied by an extended strand motif, which is formed by Φ1 and Φ2 of the minus NESs as they occupy the CRM1 P4 and P3 pockets, respectively. The hRio2^NES and CPEB4^NES are in fact not class 3 NESs in the traditional sense, as the four previously designated Φ residues in hRio2^NES (³⁸⁹RSFEMTEFNQALEEI⁴⁰³) and the CPEB4^NES (³⁷⁹RTFDMHSLESSLIDI³⁹³; predicted Φ1–Φ4 positions are underlined) do not occupy the P1–P4 hydrophobic pockets as predicted. The four hydrophobic residues that match the class 3 NES consensus, in fact form only a portion of an inverted class 1a pattern Φ2XXXΦ3XXΦ4XXΦ5, with M393 of hRio2^NES and M383 of CPEB4^NES in the Φ2 positions. F391 of hRio2^NES and F381 of CPEB4^NES, which we had previously missed as consensus residues, are the Φ1 positions of the N-terminal ΦXΦ motif of an inverted class 1a pattern.

Comparative structural and biochemical analyses of minus and plus NESs

Comparison of plus and minus NESs showed translational offsets of the helices along their axes (Figure 2). Cαs of minus NESs are shifted 1.3–3.5 Å from equivalent Cαs in the plus NESs, with the largest shifts observed for residues that occupy the P0 and P1 CRM1 pockets. In an α-helix, amino acid side chains are angled toward the N-terminus of the helix. Thus, since plus and minus helices progress from N- to C-terminus in opposite directions, side chains that emanate from the helices also project in opposite directions. The Φ0, Φ1, Φ2, and Φ3 side chains of the plus NES helix project toward the wide end of the CRM1 groove near P0 to occupy the P0–P4 CRM1 pockets (Figure 2). In contrast, the equivalent Φ5, Φ4, Φ3, and Φ2 residues of the minus NES helix project toward the narrow end of the CRM1 groove, thus necessitating a shift of the entire helix in the opposite direction to allow the hydrophobic side chains to reach the P0, P1, P2, and P3 CRM1 pockets.

Figure 2

Download asset Open asset

Comparison of plus and minus NESs.

Pairwise comparison of hRio2^NES (blue) or CPEB4^NES (purple) with PKI^NES (yellow; 3NBY) upon superposition of NES-bound CRM1 grooves. Hydrophobic NES residues (Φs) are shown as sticks and orientation of the CRM1 grooves is indicated by positions of the P0–P4 pockets.

https://doi.org/10.7554/eLife.10034.006

Because the entire minus NES helix shifts relative to a plus helix, it is not surprising that hydrophobic side chain preferences of the minus and plus helices are similar. We generated single amino acid mutants by replacing each of positions Φ3, Φ4, and Φ5 in hRio2^NES with other hydrophobic residues, and tested binding of the mutants to CRM1. Results of in vitro pull-down assays using immobilized GST-hRio2^NES mutants, purified human CRM1, and yeast RanGTP show that medium-sized hydrophobic side chains such as isoleucine and leucine are preferred at Φ4 and Φ5 for CRM1 interaction. Medium and larger hydrophobic side chains such as isoleucine, leucine, and methionine are preferred at Φ3 for binding CRM1 (Figure 3A). These results are similar to ones previously shown in the mutagenesis study of the PKI^NES (Güttler et al., 2010).

Figure 3

Download asset Open asset

Hydrophobic side chain preferences for hRio2^NES binding to CRM1.

In vitro pull-down assay (Coomassie-stained SDS/PAGE) of purified human CRM1 binding to immobilized GST-hRio2^NES mutants (A) Φ3, Φ4, or Φ5 or (B) Φ1 or Φ2 position mutated in the presence of excess ^ScRanGTP. Relative band intensities of triplicate experiments are plotted in histograms.

https://doi.org/10.7554/eLife.10034.007

The shift of the minus NES helix relative to the plus NES helix results in a corresponding translation of the preceding strand/loop segment that places the Φ1 and Φ2 side chains farther from the P3 and P4 pockets. In both the hRio2^NES and CPEB4^NES, large hydrophobic residues in the Φ1 (phenylalanines) and the Φ2 (methionines) positions within the extended segments enable a longer reach into the comparatively distal P3 and P4 CRM1 pockets. Mutagenesis of the Φ1 and Φ2 positions of hRio2^NES and pull-down assays with CRM1 and Ran show that large hydrophobic side chains such as leucine, methionine, phenylalanine, and tryptophan are preferred in both positions (Figure 3B). Smaller hydrophobic side chains like alanine, valine, and isoleucine in these positions are disfavored as the mutants do not bind CRM1 efficiently (Figure 3B). The preference for large hydrophobic side chains is consistent with the need for side chains in the extended portions of the minus NESs to reach farther into their CRM1-binding sites. In contrast, the large aromatic residues phenylalanine and tryptophan are disfavored in equivalent Φ3 and Φ4 positions in the plus direction PKI^NES (Güttler et al., 2010).

Structural determinants of the plus vs minus NES

Structural and mutagenesis data to compare plus and minus NESs suggest that placement of the strand-like ΦXΦ motif at the N-terminus of an NES generates a signal peptide that binds CRM1 in the minus direction, whereas a C-terminal ΦXΦ results in a plus direction NES. To first investigate whether features of the sequence such as spacings between hydrophobic residues and placement of the ΦXΦ motif are critical in determining directionality of NES binding, we reversed the sequence of the hRio2^NES (FEMTEFNQALEEI) to generate hRio2^NES-R (IEELAQNFETMEF). We also reversed CPEB4^NES (FDMHSLESSLIDI) to generate CPEB4^NES-R (IDILSSELSHMDF). Both reversed peptides match the class 1a NES pattern and were predicted to bind CRM1 like the PKI^NES, which is another class 1a NES. Binding affinities of NES peptides to CRM1 were measured in competition differential bleaching experiments using FITC-PKI^NES as a fluorescent probe, MBP-NESs as competitors and monitored with a microscale thermophoresis instrument (Figure 4, Figure 4—figure supplement 1). The competition differential bleaching approach is explained in methods and representative titration data are shown in Figure 4—figure supplement 2. Wild-type NESs MBP-hRio2^NES and MBP-CPEB4^NES bind CRM1 with K_Ds of 2200 nM [1600,2900] and 590 nM [400,840], respectively (Figure 4B). The ranges in brackets represent the 68.3% confidence intervals as calculated using F-statistics and error-surface projection method (Bevington and Robinson, 1992). When NES sequences are reversed, MBP-hRio2^NES-R and MBP-CPEB4^NES-R still bind CRM1 with similar affinities, K_Ds of 2400 nM [2100,2800] and 780 nM [610,980], respectively (Figure 4A,B). All of the NES peptides were also cloned into EYFP-NLS-NES fusions and tested for nuclear export activity in HeLa cells. They were all found to direct nuclear export in a Leptomycin B sensitive manner, suggesting that the reverse peptides function as active NESs in cells (Figure 4C).

Figure 4 with 2 supplements see all

Download asset Open asset

hRio2^NES, CPEB4^NES, and their reverse counterparts bind CRM1 with similar binding affinities.

(A) Sequences of NESs used. (B) Binding of FITC-PKI^NES and various MBP-NESs to CRM1 measured by differential bleaching, monitored by a microscale thermophoresis instrument. MBP-NESs compete with FITC-PKI^NES for CRM1 in competition titrations. Fitted binding curves are overlaid onto data points with error bars representing the mean and standard deviation of triplicate titrations. Dissociation constants (K_Ds) of the NESs are reported below the graphs with ranges in brackets representing the 68.3% confidence intervals. Binding of MBP-NPMmutA^NES (a moderate CRM1 binder) and MBP-SNUPN^NES (a weak binder) is shown on the rightmost panel for reference. *Experiments performed on separate days were fitted with a new triplicate set of direct bind titrations. (C) Leptomycin B (LMB) sensitive nuclear export activity of EYFP-NLS-NES fusions in HeLa cells. YFP (pseudocolored in yellow), Hoechst (pseudocolored in blue), and merged images were captured using spinning disk confocal microscope (60×). Images are maximum intensity projection of five confocal Z stacks spaced 0.3 µm apart.

https://doi.org/10.7554/eLife.10034.008

Crystal structures of CRM1-bound hRio2^NES-R and CPEB4^NES-R peptides were solved at 2.3 Å and 2.9 Å resolution, respectively (Figure 5, Figure 5—figure supplement 1 and Table 1). These structures show the peptides binding in the plus direction, that is, opposite that of their wild-type counterparts (Figure 5A). The CRM1 grooves in the hRio2^NES-R and CPEB4^NES-R complexes are almost identical to those in the wild-type hRio2^NES and CPEB4^NES complexes (Cα rmsds 0.2–0.3 Å). The N-terminal helices of hRio2^NES-R and CPEB4^NES-R that span Φ0–Φ3 bind CRM1 much like the helix of the plus direction PKI^NES. Their C-terminal strand-like ΦXΦ segments bind in the narrow part of the CRM1 groove, but are placed slightly outward toward solvent, perhaps to better accommodate the large Phe and Met side chains in the P3 and P4 CRM1 pockets (Figure 5B). Pull-down assays with single amino acid mutants of hRio2^NES-R reveal that smaller hydrophobic residues such as leucine in the Φ3 position and isoleucine, leucine, and valine in the Φ4 position are preferred for binding to CRM1 (Figure 5C). The preference for smaller hydrophobic side chains in ΦXΦ segment can possibly be explained by the relief of steric constraints caused by the bulky phenylalanine in the native hRio2^NES-R sequence. The structures of CRM1-bound hRio2^NES-R and CPEB4^NES-R peptides support the idea that the spacing between the hydrophobic residues is critical for determining the orientation the NES binds. When the sequence and the hydrophobic spacing pattern of an NES are reversed, the direction of the peptide binding CRM1 is also reversed. However, binding affinities of the NESs are similar regardless of binding orientation, consistent with the observation that hydrophobic interactions between the CRM1 groove and side chains in the Φ positions of hRio2^NES and CPEB4^NES, which likely govern CRM1-NES affinity, are preserved in hRio2^NES-R and CPEB4^NES-R.

Figure 5 with 1 supplement see all

Download asset Open asset

hRio2^NES-R and CPEB4^NES-R are plus NESs.

(A) Structures of hRio2^NES-R (light blue) and CPEB4^NES-R (light pink) bound to CRM1 (gray surfaces). (B) Pairwise comparisons of PKI^NES (yellow), hRio2^NES-R (light blue), and hRio2^NES (blue) when bound to CRM1. (C) In vitro pull-down assay of purified human CRM1 binding to immobilized GST-hRio2^NES-R mutants (Φ3 or Φ4 mutated) in the presence of excess ^ScRanGTP. Relative band intensities of triplicate experiments are plotted in histograms.

https://doi.org/10.7554/eLife.10034.011

To more rigorously test the idea that position of the ΦXΦ motif is critical for determining NES orientation, we flipped the C-terminal ΦXΦ (LDI) of PKI^NES (SNELALKLAGLDI) to the N-terminus of the peptide while preserving the sequence of the NES helix of wild-type PKI^NES (Figure 6A). We named the new peptides PKI^NES-Flip and three variations were designed. PKI^NES-Flip1 has the inverted wild-type LDI at the N-terminus, giving sequence IDLNELALKLAGL. The two hydrophobic side chains in the N-terminal Φ-X-Φ were incrementally made larger to generate PKI^NES-Flip2 (FDLNELALKLAGL) and PKI^NES-Flip3 (FDMNELALKLAGL) mutants. PKI^NES-Flip1 does not interact with CRM1 in pull-down assays, while PKI^NES-Flip2 and PKI^NES-Flip3 show graded increases in CRM1 binding (Figure 6B). We solved the structure of PKI^NES-Flip3 bound to CRM1 at 2.5 Å resolution and it indeed binds in the minus direction (Figure 6C, Figure 6—figure supplement 1 and Table 1). These results show that NES binding in the minus vs plus direction is determined by placement of the ΦXΦ pattern at the N- or C-terminal end of the NES peptide. Secondary to this positioning, hydrophobic side chains of the N-terminal ΦXΦ segment of a minus NES should be long enough to reach into binding pockets and pack with the CRM1 groove favorably.

Figure 6 with 1 supplement see all

Download asset Open asset

N-terminal ΦXΦ motif generates a minus orientation PKI^NES-Flip mutant.

(A) Sequence alignment of PKI^NES and PKI^NES-Flip peptides with their hydrophobic residues of ΦXΦ motifs in red and the NES helix shown as a cylinder. (B) Pull-down assay of immobilized GST-PKI^NES mutants, purified CRM1 and RanGTP (Coomassie-stained SDS/PAGE). (C) Structure of the PKI^NES-Flip3 peptide (red, in cartoon with Φ residues in sticks) bound to CRM1 (gray surface) with its sequence and CRM1 pockets for each Φ residue shown below.

https://doi.org/10.7554/eLife.10034.013

Bioinformatics analysis of minus NESs in an NES database

The discovery that CRM1 binds NESs in both the plus and minus directions almost doubles the number of possible NES consensus sequences. Of the six NES patterns in Figure 1A, class 1a, 1b, 1c, and 1d patterns are asymmetric, whereas class 2 and class 3 patterns are symmetric. Each of the asymmetric class 1a, 1b, 1c, and 1d patterns, which represent plus NESs, could be reversed to give class 1a-R, 1b-R, 1c-R, and 1d-R patterns that represent minus NESs (Figure 7A). In principle, symmetric class 2 and 3 patterns can also bind CRM1 in both the plus and minus directions. For example, the class 2 Rev^NES binds CRM1 in the plus direction as an entirely extended chain, but it is also possible that hydrophobic side chains of another class 2 NES can be presented from a similar extended peptide in the minus direction. However, it remains to be determined whether any of the currently known class 2 and true class 3 peptides can indeed bind in the minus direction. Expansion of NES consensus by reversing class 1 NES consensus patterns to generate class 1-R patterns further suggests a corresponding increase of potential NESs in the proteome.

Figure 7

Download asset Open asset

Prevalence of putative minus NESs in the Dbase data set.

(A) Consensus patterns for minus NESs in new NES classes 1a-R to 1d-R (reverse of class 1 patterns). (B) The number of sequences in the 246 proteins in Dbase that match the class 1 (+) and class 1-R (−) consensus patterns. NES regions are defined according to original literature that experimentally identified CRM1 cargos and their NES regions. (C) The numbers of NES regions in Dbase divided into four categories according to the consensus matches they overlap with.

https://doi.org/10.7554/eLife.10034.015

We searched for sequences that match class 1-R (minus) patterns in the Dbase data set, which compiled 246 NES-containing CRM1 cargos from previously published literature (Xu et al., 2015) . Each CRM1 cargo contains multiple sequences that match NES consensus patterns but most of these sequences are not functional export signals. Dbase reports a total of 290 experimentally identified NES regions for the 246 CRM1 cargos in the database. Matches for both class 1 (plus) and class 1-R (minus) patterns appear to be similarly prevalent in the 246 CRM1 cargos (1849 minus vs 1950 plus matches) (Figure 7B). However, plus patterns seem to be somewhat enriched within NES regions (340 plus vs 230 minus matches; Chi-square test, p-value = 1.378e⁻⁰⁵) (Figure 7B). The bias for plus patterns in these previously reported NES regions may be a consequence of NES searches that were guided solely by the plus consensus patterns, since the minus patterns were unknown. The Dbase data set is further complicated by a lack of validation of direct CRM1-NES interactions. Only 60% of previously reported class 1 NESs that were tested recently were found to actually bind CRM1 (Xu et al., 2012a). Of the 290 NES regions in Dbase, 40% (116) contain sequences that match both class 1 (plus) and class 1-R (minus) patterns (Figure 7C). 89 NES regions match class 1 pattern exclusively and 24 match class 1-R patterns exclusively (Figure 7C), suggesting that there are still a significant population of putative minus NES even though the current NES annotation is biased and imperfect.

We further investigated the 24 NES regions that contained only class 1-R matches, filtering out four because of overlap with the class 2 consensus, which was previously not considered in the analysis. The remaining 20 NES regions contain 22 sequences that match class 1-R patterns, which were tested for CRM1 binding in pull-down assays. Of the 22 sequences tested, one degraded and another aggregated during purification resulting in only 20 relevant NES sequences. 10 out of the 20 putative minus NESs, or 50%, bind CRM1 (Figure 8). This percentage is similar to the proportion of tested class 1 (plus) NESs that bind CRM1 (Xu et al., 2012a). These results suggest that there are a substantial number of functional NESs that likely bind CRM1 in the minus direction, even in an NES dataset like Dbase where many NESs were previously identified using the only plus NES consensus patterns. These possibly include cases where NES patterns have been mistakenly annotated or annotated as previously non-canonical patterns. This new expansion of NES consensus provides a means to identify previously unrecognizable NESs in previously identified and new CRM1 cargos.

Figure 8

Download asset Open asset

Putative minus NESs in the Dbase data set.

(A) Summary of putative minus NESs (in the Dbase data set that match class 1-R patterns exclusively) tested for CRM1 binding. Nap1p (*) was previously shown to direct nuclear export in cells even though no CRM1 binding was observed (Xu et al., 2015). (B) Putative minus NESs that bind CRM1 in pull-down assays with CRM1 and RanGTP. (C) Putative minus NESs that show no observable CRM1 binding.

https://doi.org/10.7554/eLife.10034.016

Discussion

The NES appears to be the only nuclear-targeting signal, and perhaps the only organelle-targeting signal, that has been shown thus far to bind its receptor in both polypeptide directions. This is in contrast to several modular-domain signaling systems, which are known to bind their linear motifs in both polypeptide chain directions (Feng et al., 1994; Lim et al., 1994; Osawa et al., 1999; Swanson et al., 2004; Song et al., 2005; Lorenz et al., 2008; Ng et al., 2008; Neufeld et al., 2009). Are protein systems that recognize linear motifs in opposite orientations unique or will most linear motifs bind their receptors in both orientations even though this phenomenon has not yet been observed for many? Alternatively, do linear motifs that bind only in a single orientation do so because of spatial constraints inherent to their cellular functions?

Regularly spaced hydrophobic pockets in the CRM1-NES groove interact with similarly spaced NES side chains that often project from one face of amphipathic α-helices. Interestingly, several other linear motifs that bind in opposite orientations (Paxillin LD motifs, HBP1/Mad1 Sin2-interaction domains, SH3-binding polyproline peptides, and various calmodulin targets [Osawa et al., 1999; Swanson et al., 2004; Lorenz et al., 2008; Neufeld et al., 2009]) also present side chains from helices for recognition. Side chain to side chain distances within secondary structural elements of motifs are preserved regardless of polypeptide orientation, thus producing a feature that may be conducive for binding in opposite orientations. The required shift of the backbone to put the plus and minus side chains in the same position is more likely for linear motifs than for extensive interfaces between two folded proteins, since the latter are constrained by many additional contacts outside of the helix-binding groove. Thus, bidirectional recognition is probably more prevalent in recognition of linear helical motifs than in recognition of larger structured elements.

Extended linear motifs such as phosphotyrosine peptides that bind SH2 domains also bind in opposite orientations (Ng et al., 2008). Here, side chains, mainly the phosphotyrosine side chain, contribute the majority of binding energy, which can still be preserved when peptide orientation is reversed. Linear motifs that use mostly side chains for binding may be amenable to interactions in opposite orientations but those that make extensive contacts using their backbones may be limited to a particular orientation. For example, the IBB region of Importin-α, which is the nuclear localization signal or NLS that binds directly to Importin-β, is a long 28-residue helix that is preceded by a loop (Cingolani et al., 1999). The IBB uses mostly charged and polar side chains to interact with Importin-β, and perhaps these side chain interactions could be preserved when polypeptide direction of the NLS peptide is flipped. Similarly, PY-NLS binding to Karyopherin-β2 (also known as Transportin-1) involves mostly the NLS side chains, and we may observe these NLSs binding to Karyopherin-β2 in the opposite orientation in the future (Lee et al., 2006; Soniat et al., 2013). In contrast, the classical-NLS recognition by Importin-α and the Kap121-specific lysine-rich NLS (also called the IK-NLS) recognition by Kap121 involve extensive interactions with the NLS main chains and are therefore less likely to bind bidirectionally to their importins (Conti et al., 1998; Fontes et al., 2000; Kobayashi et al., 2015; Soniat and Chook, 2015). Further studies will inform on orientation requirements for NLSs binding to their respective importins. We suggest that bidirectional recognition may, in fact, be widely present, but simply not widely observed.

Components of modular-domain signaling and nuclear-targeting systems consist of mostly soluble proteins that bind linear motifs found within intrinsically disordered regions. These protein–peptide interaction systems are relatively free of spatial constraints compared to systems that bind organelle-targeting signals for delivery into membrane compartments. An example of the latter is the binding of ER signal sequences by the signal recognition particle SRP, which is likely constrained spatially by the nascent chain emerging from the ribosome and by subsequent delivery into the lumen of the translocon (Janda et al., 2010; Akopian et al., 2013). In principle, linear motifs that could bind in both orientations are sometimes constrained by other factors that limit them to only one. The CRM1-NES interaction is free from such spatial constraints as the entire exportin–cargo complex enters the nuclear pore complex for transport to the cytoplasm, thereby allowing some NESs to bind CRM1 in the plus orientation and others in the minus orientation.

Finally, accurate prediction of NESs has been difficult because of the breadth and simultaneously, the insufficient coverage of the NES consensus. Many functional NES-containing regions of proteins contain multiple NES consensus matches and sometimes no NES match, suggesting that the set of NES consensus does not provide sufficient coverage for NES identification. Our study shows that structures of NESs bound to CRM1 can accurately define consensus patterns and sometimes identify new consensus patterns. Expansion of the NES consensus upon discovery of minus NESs leads to improved coverage of potential NESs, thus allowing identification of previously unrecognized NESs in known and new CRM1 cargos. However, the improved coverage afforded by the knowledge of bidirectional NES-binding is largely orthogonal to the problems in NES prediction that arise from false positive NESs (Fu et al., 2011; Xu et al., 2012a). The majority of the NES patterns describe the ubiquitous 2-turn amphipathic helix, which are found in most helix-containing proteins, and many of these consensus-matching sequences are part of hydrophobic cores that are not accessible for CRM1 binding. In the development of NES predictors (NESsential by the Horton Lab and LocNES by the Chook lab, Fu et al., 2011; Xu et al., 2015), we found that prediction accuracy was improved by using both sequence and structural/biophysical properties (such as disorder propensity and/or solvent accessibility) as features for machine-learning methods. The latter features allowed consensus-matching sequences in the interior of folded domains to be flagged. A set of consensus sequences with high coverage rate such as the expanded set in Figure 7A is desirable when employed as a pre-filter in NES prediction as the machine-learning process that follows serves to reduce false positive matches. Future identification of minus NESs will also increase the size, diversity, and accuracy of experimental NES databases, which are the training/testing data sets for the development of our next generation NES predictors.

In summary, we have found that NES peptides can bind the narrow CRM1-NES groove in two opposite orientations, which we now describe as the plus and minus orientations. Whether an NES binds CRM1 in the plus or minus orientation is determined by the location of its ΦXΦ strand motif. A C-terminal ΦXΦ motif that follows a helix dictates a plus NES, while an N-terminal ΦXΦ followed by a helix results in a minus NES. The five hydrophobic pockets in the CRM1-NES groove interact with hydrophobic side chains that are presented in many different ways on NES peptides, by different secondary structural elements and in both polypeptide chain directions, to enable specific recognition of diverse NES sequences.

Materials and methods

Protein expression, purification, and complex formation

Request a detailed protocol

^ScCRM1 (1–1058, Δ377–413, ⁵³⁷DLTVK⁵⁴¹ to GLCEQ, V441D) was cloned into the previously described pGEX-TEV vector (Chook and Blobel, 1999). As previously described in Sun et al. (2013) polypeptide segments that make up the ^ScCRM1 and human CRM1-NES grooves are 81% identical in sequence, with complete conservation in residues lining the groove that contact NESs and inhibitors (Sun et al., 2013). In order to maximize similarity to the human CRM1-NES groove, we mutated the only stretch of ^ScCRM1 groove residues that has more than 2 non-conserved residues, ⁵³⁷DLTVK⁵⁴¹ in the NES-binding groove of ^ScCRM1 to the human CRM1 sequence GLCEQ (Sun et al., 2013). Yrb1p (residues 62–201; or RanBP1) was cloned into pGEX-TEV and human Ran (full-length) was cloned into the pET-15b vector. Various NESs were cloned into the pMal-TEV vector. Sequences of NES peptides used for crystallization after TEV cleavage are hRio2^NES: GGSY³⁸⁹RSFEMTEFNQALEEI⁴⁰³; hRio2^NES-R: GGSYGKIEELAQNFETMEFSR; CPEB4^NES: GGSY³⁷⁹RTFDMHSLESSLIDI³⁹³: CPEB4^NES-R: GGSYRMIDILSSELSHMDFTR; PKI^NES-Flip3: GGSYRSFDMNELALKLAGLD. Sequences of NESs used for binding affinity measurements are listed in Figure 4. All proteins were expressed separately in E. coli BL-21(DE3) by induction with 0.5 mM isopropyl β-D-1-thiogalactopyranoside for 10 hr at 25°C. GST-^ScCRM1 and GST-RanBP1 cells were lyzed in buffer containing 40 mM HEPES (pH 7.5), 2 mM MgOAc, 200 mM NaCl, 10 mM dithiothreitol (DTT) and protease inhibitors, purified by affinity chromatography using glutathione Sepharose 4B beads (GE Healthcare Life Sciences, PA), followed by cleavage with TEV protease and finally size-exclusion chromatography in GF buffer (20 mM HEPES pH 7.5, 100 mM NaCl, 5 mM MgOAc, and 2 mM DTT). Cells expressing His-Ran were lyzed in buffer containing 50 mM HEPES (pH 8.0), 2 mM MgOAc, 200 mM NaCl, 10% (vol/vol) glycerol, 5 mM imidazole (pH 7.8), 2 mM DTT and protease inhibitors, purified by affinity chromatography with Ni-NTA Agarose (Qiagen, Hilden, Germany) and further purified by gel filtration chromatography in TB buffer (20 mM HEPES pH 7.5, 110 mM KOAc, 2 mM MgOAc, 10% glycerol, and 2 mM DTT). Ran was loaded with non-hydrolyzable GTP analog GppNHp by nucleotide exchange. Cells expressing MBP-NESs were lyzed in buffer containing 50 mM HEPES pH 7.5, 100 mM NaCl, 10% glycerol, 2 mM DTT and protease inhibitors, purified by affinity chromatography using amylose resin (New England Biolabs, MA) and ion exchange chromatography using (HiTrap Q, GE Healthcare Life Sciences) with a salt gradient from 50 mM to 1 M NaCl. Purified MBP-NES proteins were concentrated, cleaved with TEV protease and NES peptides were then isolated by gel filtration chromatography in GF buffer. To assemble the CRM1-Ran-RanBP1-NES complex, the RanGppNHp-RanBP1 heterodimer was first purified by gel filtration chromatography. ^ScCRM1*, Ran-RanBP1 and NES peptides were then assembled in 1:3:10 molar ratio and the quaternery complexes were purified by gel filtration chromatography in GF buffer. Purified ^ScCRM1*-Ran-RanBP1-NES complexes were concentrated to ∼10 mg/ml and excess NES peptides were added to stabilize the complex during concentration.

Crystallization, data collection, and structure determination

Request a detailed protocol

^ScCRM1-Ran-RanBP1-NES complexes were crystallized in 17% (wt/vol) PEG3350, 100 mM Bis-Tris (pH 6.4), 200 mM ammonium nitrate, and 10 mM Spermine HCl. Crystals were cryoprotected with the same crystallization condition supplemented with up to 23% PEG3350 and 12% glycerol and flash cooled in liquid nitrogen. X-ray diffraction data were collected at 0.9795 Å at the Advanced Photon Source 19ID beamline in the Structural Biology Center at Argonne National Laboratory. Data were indexed, integrated, and scaled using HKL-3000 (Minor et al., 2006). All crystals in this study were isomorphous to crystals of previously solved inhibitor-bound and unliganded ^ScCRM1-Ran-RanBP1 complexes and has space group P4₃2₁2. Therefore, structures were determined by multiple rounds of refinement of unliganded complex (4HB2) against collected data using PHENIX (Adams et al., 2010; Afonine et al., 2012) and manual modeling in Coot (Emsley et al., 2010). X-ray/stereochemistry and X-ray/ADP weights were optimized in phenix.refine in final stages of refinement. Structure validation was guided by Molprobity suite in PHENIX (Chen et al., 2010). Ramachandran plots of the five structures showed that 97.3–97.9% of residues are in favored regions and 0.0–0.1% are in disallowed regions. Structure figures were generated with PyMOL (Schrodinger, 2010). NESs in Figures 2 and 5 were compared by superimposing H12A helices of their respective CRM1s.

In vitro CRM1-NES pull-down binding assays

Request a detailed protocol

Full-length human CRM1 (^HsCRM1) was purified in the same manner as ^ScCRM1* with buffers supplemented with 10% glycerol. ^ScRan (Gsp1p) was expressed using pET21d-GSP1 (GSP1 residues 1–179, Q71L) (gift from Dr. Takuya Yoshizawa) and purified as described above for human Ran (buffers in HEPES pH 7.4 instead of pH 8.0). After affinity purification, ^ScRan was loaded with GTP (incubated with molar excess of ethylenediaminetetraacetic acid (EDTA) for 30 min on ice followed by incubation with excess GTP and MgOAC for 30 min at room temperature) and then purified by ion exchange chromatography (HiTrap SP, GE Healthcare Life Sciences). NESs were cloned into the pGEX-TEV vector (Chook and Blobel, 1999), purified, and immobilized on glutathione Sepharose beads (GE Healthcare Life Sciences) in TB buffer described above containing 15% glycerol. 2.5 μM ^HsCRM1 and 7.5 μM ^ScRanGTP were added to ∼10 μg of immobilized GST-NESs in TB buffer in total volumes of 200 μl for 30 min at 4°C. Unbound proteins were washed extensively with TB buffer and bound proteins on the Sepharose beads were separated by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS/PAGE) and visualized with Coomassie Blue staining. All binding assays were performed in triplicates. To compare the relative intensities of CRM1 bands to yield an estimate of binding activities of various NES mutants, SDS/PAGE gels were dried and scanned with an Epson V300 scanner and the images analyzed with ImageJ software. CRM1 band intensities were corrected for differences in GST-NESs band intensities and normalized to wild-type control intensity in each set of mutations. Corrected relative CRM1 band intensities were plotted as histograms with standard errors with GraphPad Prism.

To test putative minus NESs identified from the Dbase data set, 5 μM ^HsCRM1 with or without 15 μM ^ScRanGTP were used instead. Putative NESs that show no CRM1 binding were expressed in larger scale and purified by size-exclusion chromatography to assess their aggregation states. They were also subjected to intact mass determination by mass spectroscopy to ensure that the GST-NES proteins were not degraded.

Nuclear-cytoplasmic localization assays

Request a detailed protocol

Cellular localization of EYFP2-NLS-NES fusion proteins overexpressed in HeLa cells was observed using procedures as previously described (Xu et al., 2015). Expression constructs for EYFP2-NLS-NES fusion proteins were cloned similarly into pEYFP2-SV40^NLS vectors. Live cell images were collected using a spinning disk confocal microscope system, Nikon-Andor (Nikon, NY), and MetaMorph software. Image analysis was performed similarly with ImageJ. CRM1 dependence was demonstrated by the nuclear accumulation of EYFP fusion proteins after treatment with 2 nM Leptomycin B for 16 hr at 37°C. Experiments were performed in at least duplicates with over a total of 150 transfected cells.

Competition differential bleaching assay monitored by microscale thermophoresis to measure CRM1-NES affinities

Request a detailed protocol

Differential bleaching was used as a parameter to monitor binding of MBP-NES proteins to CRM1. In short, we observed that the fluorophore reporter attached to PKI^NES (FITC) underwent a reproducible time-dependent bleaching when exposed to excitation light (Figure 4—figure supplement 2). Furthermore, this phenomenon was concentration-dependent, that is, the bleaching was accelerated when the FITC-PKI^NES probe was exposed to increasing concentrations of CRM1. However, this phenomenon was saturable at high concentrations of CRM1, indicating that it was a function of CRM1 binding and not simply the presence of the protein that was causing the change. The differential bleaching can be counteracted by titrating of mixture of FITC-PKI^NES and CRM1 with a known competitor, MBP-PKI^NES, which competes directly with the fluorescent probe for the NES binding groove in CRM1. A sigmoidal appearance of the binding and competition isotherms is observed when differential bleaching, quantified as the average fluorescence at a time after bleaching normalized by the averaged fluorescence just after the beginning of bleaching, is plotted vs titrant concentration in a semilog graph (see Figure 4).This illustrates that this bleaching behavior can be described as a two-state system where unbound probe and CRM1-bound probe bleach at different but specific rates, and that these quantities report on the populations of bound and unbound FITC-PKI^NES. A detailed description of the data-fitting procedures will be described in manuscript in preparation by C.A.B. For error reporting, we used F-statistics and error-surface projection method to calculate the 68.3% confidence intervals of the fitted data (Bevington and Robinson, 1992). While error reporting using the error surface projection method is relatively uncommon, the ranges more accurately represent the true confidence intervals given the observed noise in the performed set of experiments because they explicitly account for the ability of the fitting algorithm to compensate for fitting defects by modifying correlated parameters. Thus, they provide better evaluation of the fitted data than other, more commonly used methods (e.g., error estimations from the parametric variance-covariance matrix).

All proteins used were subjected to an extra gel filtration step and dialysis overnight in TB buffer with 15% glycerol to remove possible aggregation and ensures buffer matching. The FITC-PKI^NES peptide (FITC-SGNSNELALKLAGLDINKT) was chemically synthesized by GenScript, NJ and dissolved in the TB buffer with 15% glycerol. For the direct titration, ^HsCRM1 was serially diluted from 40 μM to 1.2 nM and incubated with mixture of 120 μM ^ScRan-GTP and 40 nM FITC-PKI^NES in 1:1 vol to a total volume of 20 μl, and incubated for 1 hr in the dark at room temperature. For competition experiments, MBP-NESs were serially diluted from 100 μM to 3 nM in presence of 40 nM of FITC-PKI^NES and incubated with mixture of 300 nM ^HsCRM1 and 120 μM ^ScRan-GTP in 1:1 vol to a total of 20 μl, and incubated for 1.5 hr in the dark at room temperature. All reactions mixtures were supplemented with 0.05% Tween-20. Following incubations, reactions were loaded into NanoTemper's ‘Standard’ treated capillaries and fluorescence signals were monitored by NanoTemper Monolith NT.115 equipment (NanoTemper Technologies, München, Germany) with 60% LED power for 10 s. Titrations for parallel comparisons were performed in triplicates on the same day. Data collected were then analyzed with PALMIST (manuscript in preparation) and imported to GUSSI for generating figures (Brautigam, 2015).

Bioinformatic studies of plus and minus NESs in the Dbase data set

Request a detailed protocol

Protein sequences in the Dbase data set, a non-redundant compilation of CRM1 cargos from two of the most recent NES databases, ValidNESs (Fu et al., 2013), and NESdb (Xu, Grishin et al., 2012b) (http://prodata.swmed.edu/LRNes), were used for analyses of plus and minus NESs. All protein sequences in the Dbase data set were first compiled along with their annotated NES regions into an in-house database implemented by MySQL (version 5.5.43) on Linux (Ubuntu 12.04). NES regions are defined according to original reports in the published literature that identified the CRM1 cargos. PHP (version 5.3) regular expression with look-ahead assertions was used to capture all sequences (including overlapping sequences) that match the eight different class 1 (plus) and class 1-R (minus) NES consensus patterns. Duplicate matches (such as 10-mer sequences that simultaneously match both class 1a and 1d patterns, or match both 1a-R and 1d-R patterns) were removed using Linux command line tools, and the resulting numbers of consensus matches (see Figure 7) were used for the enrichment test of plus consensus patterns within NES regions by Chi-square test using R (version 2.14.1). The same MySQL database was used to search for putative minus NESs, and 24 NES regions that match the 1-R patterns exclusively were identified. Four of these NES regions were removed because of overlap with class 2 patterns, resulting in 20 NES regions (containing 22 1-R consensus matches) for examination of CRM1 interactions by pull-down binding assays.

Accession codes

Request a detailed protocol

Structures and crystallographic data have been deposited at the PDB: 5DHF (CRM1-hRio2^NES complex), 5DIF (CRM1-CPEB4^NES complex), 5DI9 (CRM1-hRio2^NES-R complex), 5DHA (CRM1-CPEB4^NES-R complex), 5DH9 (CRM1-PKI^NES-Flip3 complex).

Data availability

The following previously published data sets were used

1. Xu D
2. Marquis K
3. Pei J
4. Fu SC
5. Cağatay T
6. Grishin NV
7. Chook YM
(2014) LocNES: a computational tool for locating classical NESs in CRM1 cargo proteins
Publicly available in Supplementary Data of the paper at Bioinformatics.

http://dx.doi.org/10.1093/bioinformatics/btu826

References

1. Adams PD
2. Afonine PV
3. Bunkoczi G
4. Chen VB
5. Davis IW
6. Echols N
7. Headd JJ
8. Hung LW
9. Kapral GJ
10. Grosse-Kunstleve RW
11. McCoy AJ
12. Moriarty NW
13. Oeffner R
14. Read RJ
15. Richardson DC
16. Richardson JS
17. Terwilliger TC
18. Zwart PH
(2010) PHENIX: a comprehensive Python-based system for macromolecular structure solution
Acta Crystallographica. Section D, Biological Crystallography 66:213–221.

https://doi.org/10.1107/S0907444909052925
- Google Scholar
(2012) Towards automated crystallographic structure refinement with phenix.refine
Acta Crystallographica. Section D, Biological Crystallography 68:352–367.

https://doi.org/10.1107/S0907444912001308
- Google Scholar
1. Akopian D
2. Shen K
3. Zhang X
4. Shan SO
(2013) Signal recognition particle: an essential protein-targeting machine
Annual Review of Biochemistry 82:693–721.

https://doi.org/10.1146/annurev-biochem-072711-164732
- Google Scholar
Book
1. Bevington PR
2. Robinson DK
(1992)
Data reduction and error analysis for the physical sciences

New York: Mc-Graw-Hill.
- Google Scholar
1. Brautigam CA
(2015) Calculations and publication-quality illustrations for analytical ultracentrifugation data
Methods in Enzymology 562:109–133.

https://doi.org/10.1016/bs.mie.2015.05.001
- Google Scholar
(2010) MolProbity: all-atom structure validation for macromolecular crystallography
Acta Crystallographica. Section D, Biological Crystallography 66:12–21.

https://doi.org/10.1107/S0907444909042073
- Google Scholar
1. Chook YM
2. Blobel G
(1999) Structure of the nuclear transport complex karyopherin-beta2-Ran x GppNHp
Nature 399:230–237.

https://doi.org/10.1038/20375
- Google Scholar
1. Cingolani G
2. Petosa C
3. Weis K
4. Muller CW
(1999) Structure of importin-beta bound to the IBB domain of importin-alpha
Nature 399:221–229.

https://doi.org/10.1038/20367
- Google Scholar
1. Conti E
2. Uy M
3. Leighton L
4. Blobel G
5. Kuriyan J
(1998) Crystallographic analysis of the recognition of a nuclear localization signal by the nuclear import factor karyopherin alpha
Cell 94:193–204.

https://doi.org/10.1016/S0092-8674(00)81419-1
- Google Scholar
1. Dong X
2. Biswas A
3. Suel KE
4. Jackson LK
5. Martinez R
6. Gu H
7. Chook YM
(2009) Structural basis for leucine-rich nuclear export signal recognition by CRM1
Nature 458:1136–1141.

https://doi.org/10.1038/nature07975
- Google Scholar
1. Emsley P
2. Lohkamp B
3. Scott WG
4. Cowtan K
(2010) Features and development of Coot
Acta Crystallographica. Section D, Biological Crystallography 66:486–501.

https://doi.org/10.1107/S0907444910007493
- Google Scholar
1. Etchin J
2. Sun Q
3. Kentsis A
4. Farmer A
5. Zhang ZC
6. Sanda T
7. Mansour MR
8. Barcelo C
9. McCauley D
10. Kauffman M
11. Shacham S
12. Christie AL
13. Kung AL
14. Rodig SJ
15. Chook YM
16. Look AT
(2013) Antileukemic activity of nuclear export inhibitors that spare normal hematopoietic cells
Leukemia 27:66–74.

https://doi.org/10.1038/leu.2012.219
- Google Scholar
1. Feng S
2. Chen JK
3. Yu H
4. Simon JA
5. Schreiber SL
(1994) Two binding orientations for peptides to the Src SH3 domain: development of a general model for SH3-ligand interactions
Science 266:1241–1247.

https://doi.org/10.1126/science.7526465
- Google Scholar
1. Fontes MR
2. Teh T
3. Kobe B
(2000) Structural basis of recognition of monopartite and bipartite nuclear localization sequences by mammalian importin-alpha
Journal of Molecular Biology 297:1183–1194.

https://doi.org/10.1006/jmbi.2000.3642
- Google Scholar
1. Fornerod M
2. Ohno M
3. Yoshida M
4. Mattaj IW
(1997) CRM1 is an export receptor for leucine-rich nuclear export signals
Cell 90:1051–1060.

https://doi.org/10.1016/S0092-8674(00)80371-2
- Google Scholar
1. Fu SC
2. Huang HC
3. Horton P
4. Juan HF
(2013) ValidNESs: a database of validated leucine-rich nuclear export signals
Nucleic Acids Research 41:D338–D343.

https://doi.org/10.1093/nar/gks936
- Google Scholar
1. Fu SC
2. Imai K
3. Horton P
(2011) Prediction of leucine-rich nuclear export signal containing proteins with NESsential
Nucleic Acids Research 39:e111.

https://doi.org/10.1093/nar/gkr493
- Google Scholar
1. Fukuda M
2. Asano S
3. Nakamura T
4. Adachi M
5. Yoshida M
6. Yanagida M
7. Nishida E
(1997) CRM1 is responsible for intracellular transport mediated by the nuclear export signal
Nature 390:308–311.

https://doi.org/10.1038/36894
- Google Scholar
1. Fung HY
2. Chook YM
(2014) Atomic basis of CRM1-cargo recognition, release and inhibition
Seminars in Cancer Biology 27:52–61.

https://doi.org/10.1016/j.semcancer.2014.03.002
- Google Scholar
1. Güttler T
2. Madl T
3. Neumann P
4. Deichsel D
5. Corsini L
6. Monecke T
7. Ficner R
8. Sattler M
9. Görlich D
(2010) NES consensus redefined by structures of PKI-type and Rev-type nuclear export signals bound to CRM1
Nature Structural & Molecular Biology 17:1367–1376.

https://doi.org/10.1038/nsmb.1931
- Google Scholar
1. Haines JD
2. Herbin O
3. de la Hera B
4. Vidaurre OG
5. Moy GA
6. Sun Q
7. Fung HY
8. Albrecht S
9. Alexandropoulos K
10. McCauley D
11. Chook YM
12. Kuhlmann T
13. Kidd GJ
14. Shacham S
15. Casaccia P
(2015) Nuclear export inhibitors avert progression in preclinical models of inflammatory demyelination
Nature neuroscience 18:511–520.

https://doi.org/10.1038/nn.3953
- Google Scholar
1. Janda CY
2. Li J
3. Oubridge C
4. Hernandez H
5. Robinson CV
6. Nagai K
(2010) Recognition of a signal peptide by the signal recognition particle
Nature 465:507–510.

https://doi.org/10.1038/nature08870
- Google Scholar
(2015) Crystal structure of the karyopherin Kap121p bound to the extreme C-terminus of the protein phosphatase Cdc14p
Biochemical and Biophysical Research Communications 463:309–314.

https://doi.org/10.1016/j.bbrc.2015.05.060
- Google Scholar
1. Kosugi S
2. Hasebe M
3. Tomita M
4. Yanagawa H
(2008) Nuclear export signal consensus sequences defined using a localization-based yeast selection system
Traffic 9:2053–2062.

https://doi.org/10.1111/j.1600-0854.2008.00825.x
- Google Scholar
1. Koyama M
2. Matsuura Y
(2010) An allosteric mechanism to displace nuclear export cargo from CRM1 and RanGTP by RanBP1
The EMBO Journal 29:2002–2013.

https://doi.org/10.1038/emboj.2010.89
- Google Scholar
(2014) Structural insights into how Yrb2p accelerates the assembly of the Xpo1p nuclear export complex
Cell Reports 9:983–995.

https://doi.org/10.1016/j.celrep.2014.09.052
- Google Scholar
1. Kutay U
2. Guttinger S
(2005) Leucine-rich nuclear-export signals: born to be weak
Trends in Cell Biology 15:121–124.

https://doi.org/10.1016/j.tcb.2005.01.005
- Google Scholar
1. la Cour T
2. Kiemer L
3. Molgaard A
4. Gupta R
5. Skriver K
6. Brunak S
(2004) Analysis and prediction of leucine-rich nuclear export signals
Protein Engineering, Design & Selection 17:527–536.

https://doi.org/10.1093/protein/gzh062
- Google Scholar
1. Lapalombella R
2. Sun Q
3. Williams K
4. Tangeman L
5. Jha S
6. Zhong Y
7. Goettl V
8. Mahoney E
9. Berglund C
10. Gupta S
11. Farmer A
12. Mani R
13. Johnson AJ
14. Lucas D
15. Mo X
16. Daelemans D
17. Sandanayaka V
18. Shechter S
19. McCauley D
20. Shacham S
21. Kauffman M
22. Chook YM
23. Byrd JC
(2012) Selective inhibitors of nuclear export show that CRM1/XPO1 is a target in chronic lymphocytic leukemia
Blood 120:4621–4634.

https://doi.org/10.1182/blood-2012-05-429506
- Google Scholar
1. Lee BJ
2. Cansizoglu AE
3. Suel KE
4. Louis TH
5. Zhang Z
6. Chook YM
(2006) Rules for nuclear localization sequence recognition by karyopherin beta 2
Cell 126:543–558.

https://doi.org/10.1016/j.cell.2006.05.049
- Google Scholar
(1994) Structural determinants of peptide-binding orientation and of sequence specificity in SH3 domains
Nature 372:375–379.

https://doi.org/10.1038/372375a0
- Google Scholar
(2008) Structural analysis of the interactions between paxillin LD motifs and alpha-parvin
Structure 16:1521–1531.

https://doi.org/10.1016/j.str.2008.08.007
- Google Scholar
(2006) HKL-3000: the integration of data reduction and structure solution–from diffraction images to an initial model in minutes
Acta Crystallographica. Section D, Biological Crystallography 62:859–866.

https://doi.org/10.1107/S0907444906019949
- Google Scholar
1. Monecke T
2. Guttler T
3. Neumann P
4. Dickmanns A
5. Gorlich D
6. Ficner R
(2009) Crystal structure of the nuclear export receptor CRM1 in complex with Snurportin1 and RanGTP
Science 324:1087–1091.

https://doi.org/10.1126/science.1173388
- Google Scholar
1. Neufeld C
2. Filipp FV
3. Simon B
4. Neuhaus A
5. Schuller N
6. David C
7. Kooshapur H
8. Madl T
9. Erdmann R
10. Schliebs W
11. Wilmanns M
12. Sattler M
(2009) Structural basis for competitive interactions of Pex14 with the import receptors Pex5 and Pex19
The EMBO Journal 28:745–754.

https://doi.org/10.1038/emboj.2009.7
- Google Scholar
1. Ng C
2. Jackson RA
3. Buschdorf JP
4. Sun Q
5. Guy GR
6. Sivaraman J
(2008) Structural basis for a novel intrapeptidyl H-bond and reverse binding of c-Cbl-TKB domain substrates
The EMBO Journal 27:804–816.

https://doi.org/10.1038/emboj.2008.18
- Google Scholar
1. Osawa M
2. Tokumitsu H
3. Swindells MB
4. Kurihara H
5. Orita M
6. Shibanuma T
7. Furuya T
8. Ikura M
(1999) A novel target recognition revealed by calmodulin in complex with Ca2+-calmodulin-dependent kinase kinase
Nature structural biology 6:819–824.

https://doi.org/10.1038/12271
- Google Scholar
(1997) Evidence for a role of CRM1 in signal-mediated nuclear protein export
Science 278:141–144.

https://doi.org/10.1126/science.278.5335.141
- Google Scholar
1. Praznikar J
2. Afonine PV
3. Guncar G
4. Adams PD
5. Turk D
(2009) Averaged kick maps: less noise, more signal... and probably less bias
Acta Crystallographica. Section D, Biological Crystallography 65:921–931.

https://doi.org/10.1107/S0907444909021933
- Google Scholar
1. Schrodinger LLC
(2010)
The PyMOL molecular graphics system, version 1.7.0.1

The PyMOL molecular graphics system, version 1.7.0.1.
- Google Scholar
1. Song J
2. Zhang Z
3. Hu W
4. Chen Y
(2005) Small ubiquitin-like modifier (SUMO) recognition of a SUMO binding motif: a reversal of the bound orientation
The Journal of Biological Chemistry 280:40122–40129.

https://doi.org/10.1074/jbc.M507059200
- Google Scholar
1. Soniat M
2. Chook YM
(2015) Nuclear localization signals for four distinct karyopherin-beta nuclear import systems
The Biochemical Journal 468:353–362.

https://doi.org/10.1042/BJ20150368
- Google Scholar
1. Soniat M
2. Sampathkumar P
3. Collett G
4. Gizzi AS
5. Banu RN
6. Bhosle RC
7. Chamala S
8. Chowdhury S
9. Fiser A
10. Glenn AS
11. Hammonds J
12. Hillerich B
13. Khafizov K
14. Love JD
15. Matikainen B
16. Seidel RD
17. Toro R
18. Rajesh Kumar P
19. Bonanno JB
20. Chook YM
21. Almo SC
(2013) Crystal structure of human Karyopherin beta2 bound to the PY-NLS of Saccharomyces cerevisiae Nab2
Journal of Structural and Functional Genomics 14:31–35.

https://doi.org/10.1007/s10969-013-9150-1
- Google Scholar
1. Sun Q
2. Carrasco YP
3. Hu Y
4. Guo X
5. Mirzaei H
6. Macmillan J
7. Chook YM
(2013) Nuclear export inhibition through covalent conjugation and hydrolysis of Leptomycin B by CRM1
Proceedings of the National Academy of Sciences of USA 110:1303–1308.

https://doi.org/10.1073/pnas.1217203110
- Google Scholar
(2004) HBP1 and Mad1 repressors bind the Sin3 corepressor PAH2 domain with opposite helical orientations
Nature Structural & Molecular Biology 11:738–746.

https://doi.org/10.1038/nsmb798
- Google Scholar
1. Xu D
2. Farmer A
3. Chook YM
(2010) Recognition of nuclear targeting signals by Karyopherin-beta proteins
Current Opinion in Structural Biology 20:782–790.

https://doi.org/10.1016/j.sbi.2010.09.008
- Google Scholar
1. Xu D
2. Farmer A
3. Collett G
4. Grishin NV
5. Chook YM
(2012a) Sequence and structural analyses of nuclear export signals in the NESdb database
Molecular Biology of the Cell 23:3677–3693.

https://doi.org/10.1091/mbc.E12-01-0046
- Google Scholar
1. Xu D
2. Grishin NV
3. Chook YM
(2012b) NESdb: a database of NES-containing CRM1 cargoes
Molecular Biology of the Cell 23:3673–3676.

https://doi.org/10.1091/mbc.E12-01-0045
- Google Scholar
1. Xu D
2. Marquis K
3. Pei J
4. Fu SC
5. Cagatay T
6. Grishin NV
7. Chook YM
(2015) LocNES: a computational tool for locating classical NESs in CRM1 cargo proteins
Bioinformatics 31:1357–1365.

https://doi.org/10.1093/bioinformatics/btu826
- Google Scholar

Article and author information

Author details

Ho Yee Joyce Fung

Department of Pharmacology, University of Texas Southwestern Medical Center, Dallas, United States

Contribution
HYJF, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article

Competing interests
The authors declare that no competing interests exist.

"This ORCID iD identifies the author of this article:" 0000-0002-0502-1957
Szu-Chin Fu

Department of Pharmacology, University of Texas Southwestern Medical Center, Dallas, United States

Contribution
S-F, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article

Competing interests
The authors declare that no competing interests exist.
Chad A Brautigam

Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, United States

Contribution
CAB, Analysis and interpretation of data, Drafting or revising the article

Competing interests
The authors declare that no competing interests exist.
Yuh Min Chook

Department of Pharmacology, University of Texas Southwestern Medical Center, Dallas, United States

Contribution
YMC, Conception and design, Analysis and interpretation of data, Drafting or revising the article

For correspondence
yuhmin.chook@utsouthwestern.edu

Competing interests
The authors declare that no competing interests exist.

Funding

Cancer Prevention and Research Institute of Texas (CPRIT) (RP120352, RP150053)

Yuh Min Chook

National Institutes of Health (NIH) (GM069909)

Yuh Min Chook

University of Texas Southwestern Medical Center (UT Southwestern) (Endowed Scholars Program)

Yuh Min Chook

Welch Foundation (Robert A. Welch Foundation) (I-1532)

Yuh Min Chook

Leukemia and Lymphoma Society (LLS) (Scholar Award)

Yuh Min Chook

Croucher Foundation (Graduate Student Scholarship)

Ho Yee Joyce Fung

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We thank members of the Structural Biology Laboratory and Macromolecular Biophysics Resource at UTSW for crystallographic and biochemical data collection assistance, M Soniat, M Rosen, and N Grishin for comments. The use of SBC 19ID beamline at Advanced Photon Source is supported by U.S. Department of Energy contract DE-AC02-06CH11357. This work is funded by Cancer Prevention Research Institute of Texas (CPRIT) Grants RP120352 and RP150053 (YMC), R01 GM069909 (YMC), the University of Texas Southwestern Endowed Scholars Program (YMC), Welch Foundation Grant I-1532 (YMC), Leukemia and Lymphoma Society Scholar Award (YMC), and a Croucher Foundation Scholarship (HYJF).

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.