1. Biochemistry and Chemical Biology
  2. Structural Biology and Molecular Biophysics
Download icon

Membrane Protein Topology: The messy process of guiding proteins into membranes

  1. Stephen H White  Is a corresponding author
  1. University of California, Irvine, United States
  • Cited 1
  • Views 2,475
  • Annotations
Cite this article as: eLife 2015;4:e12100 doi: 10.7554/eLife.12100


A new simulation protocol has revealed unexpected complexity in the folding of membrane proteins.

Main text

One of the keys to predicting the three-dimensional structure of a membrane protein from its sequence of amino acid residues is to understand how structures called translocons guide the protein to its final folded state. Translocons are generally thought of as channels that allow proteins to cross cell membranes. In eukaryotes, it is thought that newly-formed secreted proteins pass through the Sec61 translocon as they emerge from the ribosome. New membrane proteins are thought to follow a similar path, except that the hydrophobic transmembrane helices in these proteins are diverted sideways so that they become embedded in the cell membrane. This ‘sequential-insertion’ scheme seems logical in the context of what we know about the structure of translocons (Rapoport et al., 2004; Cymer et al., 2015), but is it correct?

We cannot answer this question because we do not have experimental methods that can follow, residue-by-residue, the insertion and folding of the protein chains as they pass from the ribosome and into the membrane. The alternative is to simulate the process. However, a newly-formed protein chain elongates at a rate of about one residue every 50–100 milliseconds, which is orders of magnitude faster than can be modeled using standard molecular dynamics simulation methods. Now, in eLife, Reid van Lehn, Bin Zhang and Thomas Miller of the California Institute of Technology report a simplified approach that allows insertion and folding to be simulated on biological time scales (Van Lehn et al., 2015). Their results suggest that the membrane protein insertion/folding process is more complicated than commonly depicted in the sequential-insertion scheme.

Van Lehn et al. modeled a protein called EmrE that sits in the inner membrane of Escherichia coli bacteria and is able to transport a wide range of antibiotic drugs out of the cell. This helps to make the bacteria resistant to these treatments. EmrE is a homodimer, and each monomer has four transmembrane helices (Chen et al., 2007). EmrE is unusual in that the two monomers are oriented in opposite directions (Figure 1A): this is known as dual topology.

Simulations suggest that membrane proteins take on their final structure after they have been inserted into the membrane.

(A) The topologies of the EmrE monomers first inserted into the cytoplasmic membrane (blue band) at the end of translation (left) do not necessarily reflect the final topologies, which are subsequently achieved through thermodynamics-driven annealing. The interhelical loops in red represent the loops that flip most slowly, and thereby have a major influence on the kinetics of folding. EmrE can take on two different, antiparallel topologies; each row in the figure shows how one of these topologies may develop. (B) Van Lehn et al. used a coarse-grained model to simulate the insertion and folding of the EmrE dual-topology membrane protein (Zhang and Miller, 2012). Coarse-grained beads are assigned approximate hydrophobicity values (indicated by the shadings of the beads). The ribosome (brown) and translocon (green) are also represented as coarse-grained beads. The translocon is negatively charged on the cytoplasmic end and positively charged at the periplasmic end to represent the known charge distribution of the Sec 61 translocon (Goder et al., 2004). The simulation proceeds by adding a bead at the C-terminus of the nascent chain every 125 milliseconds; the panel on the right shows the chain on the left at a later point in time. Figure adapted from Figures 1 and 4 of Van Lehn et al. (2015).

The topology (orientation) of membrane proteins is largely determined by the positive-inside rule (von Heijne, 1986). This rule suggests that if the connecting loops that join the transmembrane regions of the protein are rich in lysine and arginine residues, then these loops tend to orient inward, toward the cytoplasm of the cell. This is known as the K+R bias. EmrE, which is encoded in a single gene, has a weak K+R bias, and this means that the monomers can be inserted into the membrane in one of two opposite orientations (Rapp et al., 2006, 2007).

In 2010, researchers at Stockholm University reported, based on extensive mutation studies, that a single positively charged residue placed in different positions throughout the protein can control the topology of EmrE monomers and affect whether parallel or anti-parallel dimers form (Seppälä et al., 2010). Given the positive-inside rule and the sequential-insertion scheme, one would expect positive charges in the C-terminal region of a membrane protein to have a smaller influence on topology than charges in the N-terminal region. However, Seppälä et al. discovered that a single positive charge at the C-terminus itself could determine the orientation of EmrE!

Because the positive-inside rule was robustly verified in the Stockholm experiments, a logical conclusion is that the sequential-insertion scheme does not describe accurately how EmrE, and perhaps other membrane proteins, fold inside cells. The simulations now performed by Van Lehn et al. divulge the missing ingredients of membrane protein folding: stochastic insertion and post-insertion annealing. By stochastic insertion, I mean that protein chains can have various topologies after they have been made, creating what Van Lehn et al. refer to as an ‘end-of-translation ensemble’ (Figure 1A). After being inserted into the membrane, the members of the ensemble that are not initially in their lowest thermodynamic free energy state subsequently relax to their preferred topology through a process called annealing. In the case of EmrE, antiparallel dimers can form because there are two final topologies that have similar free energies.

Van Lehn et al. increased the speed of the simulations by treating the nascent protein chain as a sequence of coarse-grained beads, with each bead representing several amino acids (Figure 1B). Four beads were used to represent the transmembrane helices and five beads were to used represent the loops that connect these helices. Certain properties of the amino acid residues that are known to affect the topology of a protein were also incorporated into the simulation: for example, hydrophobicities were assigned to the beads using an experimentally-determined hydrophobicity scale (Wimley et al., 1996). Particularly important was the assignment of positive charges in the connecting loops between the transmembrane helices to mimic the mutation experiments of Seppälä et al. (2010). The ribosome and translocon were also represented by simple two-dimensional structures composed of coarse-grained beads (Zhang and Miller, 2012; Figure 1B). Crucially, the model translocon used in the simulations had two negative charges on its cytoplasmic side and two positive charges on its periplasmic side to mimic the known net charge distribution of the translocon (Goder et al., 2004).

The simulations were performed by adding a new bead at the C-terminal of the nascent chain every 125 milliseconds. In this way, van Lehn et al. simulated the insertion and folding of the many mutant EmrE proteins studied by Seppälä et al. (2010) and found remarkable agreement with the experimentally determined topologies.

The simulations of van Lehn et al. show that the stochastic insertion of newly-formed protein chains into the membrane, followed by thermodynamics-driven annealing, is a viable alternative to the current sequential-insertion view. What is needed now is direct experimental verification of how transmembrane proteins are inserted into the membrane. This will require new methods that can directly follow insertion and folding on the biological time scale.


    1. von Heijne G
    The distribution of positively charged residues in bacterial inner membrane proteins correllates with the trans-membrane topology
    EMBO Journal 5:3021–3027.

Article and author information

Author details

  1. Stephen H White

    Department of Physiology & Biophysics, University of California, Irvine, Irvine, United States
    For correspondence
    Competing interests
    The author declares that no competing interests exist.

Publication history

  1. Version of Record published: November 6, 2015 (version 1)


© 2015, White

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.


  • 2,475
    Page views
  • 354
  • 1

Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Download citations (links to download the citations from this article in formats compatible with various reference manager tools)

Open citations (links to open the citations from this article in various online reference manager services)

Further reading

    1. Biochemistry and Chemical Biology
    2. Microbiology and Infectious Disease
    Sydney P Thomas, John M Denu
    Research Article

    Short-chain fatty acids (SCFAs) acetate, propionate, and butyrate are produced in large quantities by the gut microbiome and contribute to a wide array of physiological processes. While the underlying mechanisms are largely unknown, many effects of SCFAs have been traced to changes in the cell’s epigenetic state. Here, we systematically investigate how SCFAs alter the epigenome. Using quantitative proteomics of histone modification states, we identified rapid and sustained increases in histone acetylation after addition of butyrate or propionate, but not acetate. While decades of prior observations would have suggested that hyperacetylation induced by SCFAs are attributed to inhibition of histone deacetylases (HDACs), we found that propionate and butyrate instead activate the acetyltransferase p300. Propionate and butyrate are rapidly converted to the corresponding acyl-CoAs which are then used by p300 to catalyze auto-acylation of the autoinhibitory loop, activating the enzyme for histone/protein acetylation. This data challenges the long-held belief that SCFAs mainly regulate chromatin by inhibiting HDACs, and instead reveals a previously unknown mechanism of HAT activation that can explain how an influx of low levels of SCFAs alters global chromatin states.

    1. Biochemistry and Chemical Biology
    2. Genetics and Genomics
    Krishna S Ghanta et al.
    Research Article

    Nuclease-directed genome editing is a powerful tool for investigating physiology and has great promise as a therapeutic approach to correct mutations that cause disease. In its most precise form, genome editing can use cellular homology-directed repair (HDR) pathways to insert information from an exogenously supplied DNA repair template (donor) directly into a targeted genomic location. Unfortunately, particularly for long insertions, toxicity and delivery considerations associated with repair template DNA can limit HDR efficacy. Here, we explore chemical modifications to both double-stranded and single-stranded DNA-repair templates. We describe 5′-terminal modifications, including in its simplest form the incorporation of triethylene glycol (TEG) moieties, that consistently increase the frequency of precision editing in the germlines of three animal models (Caenorhabditis elegans, zebrafish, mice) and in cultured human cells.