Membrane Protein Topology: The messy process of guiding proteins into membranes

Stephen H White (corresponding author)
University of California, Irvine, United States

One of the keys to predicting the three-dimensional structure of a membrane protein from its sequence of amino acid residues is to understand how structures called translocons guide the protein to its final folded state. Translocons are generally thought of as channels that allow proteins to cross cell membranes. In eukaryotes, it is thought that newly-formed secreted proteins pass through the Sec61 translocon as they emerge from the ribosome. New membrane proteins are thought to follow a similar path, except that the hydrophobic transmembrane helices in these proteins are diverted sideways so that they become embedded in the cell membrane. This ‘sequential-insertion’ scheme seems logical in the context of what we know about the structure of translocons (Rapoport et al., 2004; Cymer et al., 2015), but is it correct?

We cannot answer this question because we do not have experimental methods that can follow, residue-by-residue, the insertion and folding of the protein chains as they pass from the ribosome into the membrane. The alternative is to simulate the process. However, a newly formed protein chain elongates at a rate of about one residue every 50–100 milliseconds, which is orders of magnitude slower than can be modeled using standard molecular dynamics simulation methods. Now, in eLife, Reid Van Lehn, Bin Zhang and Thomas Miller of the California Institute of Technology report a simplified approach that allows insertion and folding to be simulated on biological time scales (Van Lehn et al., 2015). Their results suggest that the membrane protein insertion/folding process is more complicated than commonly depicted in the sequential-insertion scheme.
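The size of the gap between chain elongation and atomistic simulation can be estimated with simple arithmetic. The sketch below is only a back-of-the-envelope calculation: the 100-residue chain length and the 2-femtosecond integration step are illustrative assumptions, and only the 50–100 millisecond elongation interval comes from the text.

```python
# Rough estimate of how many atomistic MD steps would span translation.
FEMTOSECONDS_PER_SECOND = 1e15

def md_steps_required(n_residues, seconds_per_residue, timestep_fs=2.0):
    """Number of integration steps needed to cover the synthesis of a chain."""
    total_seconds = n_residues * seconds_per_residue
    return total_seconds * FEMTOSECONDS_PER_SECOND / timestep_fs

# Assumed values: a 100-residue chain and a 2 fs time step.
low = md_steps_required(100, 0.050)   # one residue every 50 ms
high = md_steps_required(100, 0.100)  # one residue every 100 ms
print(f"{low:.1e} to {high:.1e} integration steps")
```

Under these assumptions the chain takes seconds to synthesize, which corresponds to thousands of trillions of integration steps, so some form of simplification is unavoidable.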

Van Lehn et al. modeled a protein called EmrE that sits in the inner membrane of Escherichia coli bacteria and is able to transport a wide range of antibiotic drugs out of the cell. This helps to make the bacteria resistant to these treatments. EmrE is a homodimer, and each monomer has four transmembrane helices (Chen et al., 2007). EmrE is unusual in that the two monomers are oriented in opposite directions (Figure 1A): this is known as dual topology.

Simulations suggest that membrane proteins take on their final structure after they have been inserted into the membrane.

(A) The topologies of the EmrE monomers first inserted into the cytoplasmic membrane (blue band) at the end of translation (left) do not necessarily reflect the final topologies, which are subsequently achieved through thermodynamics-driven annealing. The interhelical loops in red represent the loops that flip most slowly, and thereby have a major influence on the kinetics of folding. EmrE can take on two different, antiparallel topologies; each row in the figure shows how one of these topologies may develop. (B) Van Lehn et al. used a coarse-grained model to simulate the insertion and folding of the EmrE dual-topology membrane protein (Zhang and Miller, 2012). Coarse-grained beads are assigned approximate hydrophobicity values (indicated by the shadings of the beads). The ribosome (brown) and translocon (green) are also represented as coarse-grained beads. The translocon is negatively charged at the cytoplasmic end and positively charged at the periplasmic end to represent the known charge distribution of the Sec61 translocon (Goder et al., 2004). The simulation proceeds by adding a bead at the C-terminus of the nascent chain every 125 milliseconds; the panel on the right shows the chain on the left at a later point in time. Figure adapted from Figures 1 and 4 of Van Lehn et al. (2015).

The topology (orientation) of membrane proteins is largely determined by the positive-inside rule (von Heijne, 1986). This rule suggests that if the connecting loops that join the transmembrane regions of the protein are rich in lysine and arginine residues, then these loops tend to orient inward, toward the cytoplasm of the cell. This is known as the K+R bias. EmrE, which is encoded in a single gene, has a weak K+R bias, and this means that the monomers can be inserted into the membrane in one of two opposite orientations (Rapp et al., 2006, 2007).
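The K+R bias can be made concrete with a small counting sketch. The loop sequences below are invented placeholders rather than the real EmrE loops, and the function names `kr_bias` and `predicted_n_terminus` are hypothetical; the sketch simply counts lysines and arginines in the non-membrane segments on alternating sides of the membrane.

```python
def count_kr(segment):
    """Count lysine (K) and arginine (R) residues in a one-letter sequence."""
    return sum(1 for aa in segment.upper() if aa in "KR")

def kr_bias(loops):
    """K+R count in even-indexed segments minus that in odd-indexed segments.

    `loops` lists the non-membrane segments in order from the N-terminus;
    consecutive segments lie on opposite sides of the membrane.
    """
    even = sum(count_kr(s) for s in loops[0::2])
    odd = sum(count_kr(s) for s in loops[1::2])
    return even - odd

def predicted_n_terminus(loops):
    """Apply the positive-inside rule; a bias of 0 gives no clear preference."""
    bias = kr_bias(loops)
    if bias > 0:
        return "in"    # the N-terminal side is richer in K+R, so it faces the cytoplasm
    if bias < 0:
        return "out"
    return "ambiguous"

# Invented segments for a four-helix protein (two tails plus three loops).
strong = ["MKKR", "GS", "RKN", "TD", "KR"]
weak = ["MK", "SR", "GN", "TK", "R"]
print(kr_bias(strong), predicted_n_terminus(strong))
print(kr_bias(weak), predicted_n_terminus(weak))
```

A protein like the invented `weak` example, whose positive charges are evenly split between the two sides, has no preferred orientation under this rule, which is the situation described for EmrE.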

In 2010, researchers at Stockholm University reported, based on extensive mutation studies, that a single positively charged residue placed in different positions throughout the protein can control the topology of EmrE monomers and affect whether parallel or antiparallel dimers form (Seppälä et al., 2010). Given the positive-inside rule and the sequential-insertion scheme, one would expect positive charges in the C-terminal region of a membrane protein to have a smaller influence on topology than charges in the N-terminal region. However, Seppälä et al. discovered that a single positive charge at the C-terminus itself could determine the orientation of EmrE!

Because the positive-inside rule was robustly verified in the Stockholm experiments, a logical conclusion is that the sequential-insertion scheme does not describe accurately how EmrE, and perhaps other membrane proteins, fold inside cells. The simulations now performed by Van Lehn et al. divulge the missing ingredients of membrane protein folding: stochastic insertion and post-insertion annealing. By stochastic insertion, I mean that protein chains can have various topologies after they have been made, creating what Van Lehn et al. refer to as an ‘end-of-translation ensemble’ (Figure 1A). After being inserted into the membrane, the members of the ensemble that are not initially in their lowest thermodynamic free energy state subsequently relax to their preferred topology through a process called annealing. In the case of EmrE, antiparallel dimers can form because there are two final topologies that have similar free energies.
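The link between similar free energies and dual topology can be illustrated with Boltzmann weights: if annealing brings the ensemble to equilibrium, the populations of the two topologies depend only on their free-energy difference. The free-energy values and the temperature below are arbitrary illustrative numbers, not results from Van Lehn et al., and `topology_populations` is a hypothetical helper.

```python
import math

K_B = 0.0019872  # Boltzmann constant in kcal/(mol*K)

def topology_populations(free_energies, temperature=310.0):
    """Equilibrium fraction of each topology from free energies in kcal/mol."""
    beta = 1.0 / (K_B * temperature)
    ref = min(free_energies.values())  # shift so the largest weight is 1
    weights = {k: math.exp(-beta * (g - ref)) for k, g in free_energies.items()}
    total = sum(weights.values())
    return {k: w / total for k, w in weights.items()}

# Hypothetical case 1: equal free energies give an even mix of orientations.
equal = topology_populations({"N_in": 0.0, "N_out": 0.0})
# Hypothetical case 2: a 2 kcal/mol penalty strongly favours one orientation.
biased = topology_populations({"N_in": 0.0, "N_out": 2.0})
print(equal)
print(biased)
```

In this picture a single added charge matters because it shifts the free-energy difference, and with it the equilibrium populations, regardless of where in the sequence the charge sits.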

Van Lehn et al. increased the speed of the simulations by treating the nascent protein chain as a sequence of coarse-grained beads, with each bead representing several amino acids (Figure 1B). Four beads were used to represent the transmembrane helices and five beads were used to represent the loops that connect these helices. Certain properties of the amino acid residues that are known to affect the topology of a protein were also incorporated into the simulation: for example, hydrophobicities were assigned to the beads using an experimentally determined hydrophobicity scale (Wimley et al., 1996). Particularly important was the assignment of positive charges in the connecting loops between the transmembrane helices to mimic the mutation experiments of Seppälä et al. (2010). The ribosome and translocon were also represented by simple two-dimensional structures composed of coarse-grained beads (Zhang and Miller, 2012; Figure 1B). Crucially, the model translocon used in the simulations had two negative charges on its cytoplasmic side and two positive charges on its periplasmic side to mimic the known net charge distribution of the translocon (Goder et al., 2004).
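The idea of collapsing residues into beads that carry a hydrophobicity and a charge can be sketched as follows. This is not the authors' code: the three-residues-per-bead grouping is an assumption, the sequence is invented, and the hydrophobicity numbers are made-up placeholders rather than values from the experimental scale cited above.

```python
from dataclasses import dataclass

# Placeholder per-residue values for illustration only; these are NOT the
# experimental scale cited in the text. Unlisted residues default to 0.0.
TOY_HYDROPHOBICITY = {"L": 1.0, "I": 1.0, "F": 1.0, "V": 0.5, "A": 0.2,
                      "K": -1.0, "R": -1.0, "D": -1.0, "E": -1.0}
# Formal side-chain charges near neutral pH (histidine ignored for simplicity).
CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}

@dataclass
class Bead:
    residues: str
    hydrophobicity: float
    charge: int

def coarse_grain(sequence, residues_per_bead=3):
    """Group consecutive residues into beads, summing their properties."""
    beads = []
    for i in range(0, len(sequence), residues_per_bead):
        chunk = sequence[i:i + residues_per_bead]
        beads.append(Bead(
            residues=chunk,
            hydrophobicity=sum(TOY_HYDROPHOBICITY.get(aa, 0.0) for aa in chunk),
            charge=sum(CHARGE.get(aa, 0) for aa in chunk),
        ))
    return beads

# Invented 12-residue sequence, giving four beads.
beads = coarse_grain("MKRLLIFVAGSD")
for bead in beads:
    print(bead)
```

In a representation of this kind, a point mutation that adds a lysine or arginine changes the charge of a single bead, which is how the mutation experiments could be mirrored in a coarse-grained model.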

The simulations were performed by adding a new bead at the C-terminus of the nascent chain every 125 milliseconds. In this way, Van Lehn et al. simulated the insertion and folding of the many mutant EmrE proteins studied by Seppälä et al. (2010) and found remarkable agreement with the experimentally determined topologies.
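The growth schedule described here can be sketched as a loop that extends the chain by one bead per interval. The sketch covers only the bookkeeping of chain growth: the dynamics that move the beads between additions are omitted, and the bead labels and the function name `growth_schedule` are hypothetical.

```python
BEAD_INTERVAL_S = 0.125  # one bead per 125 ms, as stated in the text

def growth_schedule(beads):
    """Yield (elapsed seconds, nascent chain so far) as beads are appended."""
    chain = []
    for step, bead in enumerate(beads, start=1):
        chain.append(bead)  # the new bead joins at the C-terminal end
        # A real simulation would propagate bead dynamics here until the next addition.
        yield step * BEAD_INTERVAL_S, list(chain)

# Hypothetical bead labels: "H" for a helix bead, "L" for a loop bead.
schedule = list(growth_schedule(["L", "H", "H", "L"]))
total_time, final_chain = schedule[-1]
print(total_time, final_chain)
```

Because the translation time grows linearly with the number of beads, the simulated time per chain stays on the scale of seconds, comparable to translation in the cell.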

The simulations of Van Lehn et al. show that the stochastic insertion of newly formed protein chains into the membrane, followed by thermodynamics-driven annealing, is a viable alternative to the current sequential-insertion view. What is needed now is direct experimental verification of how transmembrane proteins are inserted into the membrane. This will require new methods that can directly follow insertion and folding on the biological time scale.

References

    1. von Heijne G
    (1986)
    The distribution of positively charged residues in bacterial inner membrane proteins correlates with the trans-membrane topology
    EMBO Journal 5:3021–3027.

Article and author information

Author details

  1. Stephen H White

    Department of Physiology & Biophysics, University of California, Irvine, Irvine, United States
    For correspondence
    shwhite@uci.edu
    Competing interests
    The author declares that no competing interests exist.

Copyright

© 2015, White

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

  1. Stephen H White
(2015)
Membrane Protein Topology: The messy process of guiding proteins into membranes
eLife 4:e12100.
https://doi.org/10.7554/eLife.12100
