Protein Overexpression: Reaching the limit
Cells can be pictured as factories that build proteins, the molecules essential for nearly all of life’s processes. The body tightly controls production levels, because creating too many proteins – also known as protein overexpression – can be harmful to the cell. Yet, it is difficult to know how much of any given protein will be harmful, or why.
Indeed, high concentrations of enzymes and other proteins can harm cells in several ways, for example by activating or overloading specific biological pathways, disrupting regulation, or by aggregating together (Vavouri et al., 2009; Tang and Amon, 2013; Makanae et al., 2013). They can also upset the balance in protein complexes or make the different liquid phases separate in the cell (Birchler and Veitia, 2012; Bolognesi et al., 2016). Ultimately, overexpressing any protein will be destructive because it exhausts the resources of the cell to make and transport proteins (Stoebel et al., 2008). However, we did not know how much of a specific protein must be produced to cause this ‘protein burden’ and hinder cell growth.
Now, in eLife, Hisao Moriya and colleagues at the universities of Okayama, Kobe and Meiji – including Yuichi Eguchi as first author – report that many members of a group of enzymes can be overexpressed until they form 15% of the total proteins in a yeast cell (Eguchi et al., 2018). Only then do they start to cause damage because of protein burden. This matches the results of previous experiments from the same laboratory, which only focused on a single fluorescent protein that did not interfere with any components of the cell (Kintaka et al., 2016).
To discover this limit, Eguchi et al. relied on a method the lab developed in 2006. The technique involves inserting a small portion of DNA, called a plasmid, into the yeast cells. The plasmid carries two genes: the first is essential for growth, and the other codes for one of the enzymes studied. The cell increasingly needs to make new plasmids in order to grow, but this also creates more enzymes. In this ‘tug-of-war’ system, the yeast generates more and more plasmids until the expression of the enzyme of interest becomes harmful; at this point, plasmid production decreases. The number of plasmids in the cell thus reflects the quantity of protein that can be made before it turns toxic.
The experiments focused on a set of 29 glycolytic enzymes, which break down sugar in yeast. These enzymes are normally highly expressed in a cell, and their roles are well understood.
Out of the 29 proteins, three were not harmful in the experiment and could not be produced in high enough amounts to reach the burden limit. This was because the genes that encoded these enzymes contained sequences that were not optimal for protein production.
Another 19 enzymes could be expressed until they formed close to 15% of the total protein content of the cell, which suggests that protein burden is the cause of their toxicity. The fact that even large essential yeast enzymes could be produced up to this limit is unexpected, and it suggests that in many cases the toxicity created by protein overexpression does not depend on the specific characteristics of the proteins.
The cost of overexpression may come from the burden it puts on the machinery that assembles proteins in the cell, which requires particularly high levels of energy (Shah et al., 2013). Putting this apparatus under pressure could impair or slow it down; in turn, this may hinder the creation of other proteins and decrease the fitness of the cell. The other steps of protein production, such as ‘reading’ the genes, helping the protein to mature, bringing it to its right location in the cell, and degrading it, also use significant amounts of energy (Rice and McLysaght, 2017).
Seven proteins caused harm at concentrations far below the 15% limit, which means that they must damage the cell in other ways than by causing a protein burden. Eguchi et al. identified three mechanisms for this toxicity: the proteins aggregated together, they overloaded a transport system that takes them to a specific cell compartment, or the overexpressed enzymes produced too much catalytic activity (Figure 1). One might have expected this last process to drive the toxic effects of this group of proteins. Yet, killing catalytic activity in the enzymes (by introducing specific mutations) only relieved the toxicity caused by overexpression for two of the 18 proteins that were tested.
In many cases, removing one mechanism of toxicity increased the level to which an enzyme could be overexpressed, but it still did not allow expression up to the 15% limit. Proteins could therefore be damaging through a range of mechanisms, each of which gets triggered when the concentration in the cell reaches a particular level.
While the glycolytic enzymes belong to the same pathway and share extremely similar roles, their overexpression affects cell growth via diverse mechanisms. In other words, the biological role of a protein cannot be used to predict how it will harm the cell. Altogether, these results stimulate important lines of enquiry, such as looking into which of the above mechanisms damage cells when gene expression changes during disease. They also encourage further research so that we could predict at which concentration the expression of every human gene will be harmful in any tissue. And finally, they raise the question: is protein burden what has stopped increased gene expression during evolution?
References
Article and author information
Author details
Publication history
Copyright
© 2018, Bolognesi et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 21,421
- views
-
- 944
- downloads
-
- 44
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
To help maximize the impact of scientific journal articles, authors must ensure that article figures are accessible to people with color-vision deficiencies (CVDs), which affect up to 8% of males and 0.5% of females. We evaluated images published in biology- and medicine-oriented research articles between 2012 and 2022. Most included at least one color contrast that could be problematic for people with deuteranopia (‘deuteranopes’), the most common form of CVD. However, spatial distances and within-image labels frequently mitigated potential problems. Initially, we reviewed 4964 images from eLife, comparing each against a simulated version that approximated how it might appear to deuteranopes. We identified 636 (12.8%) images that we determined would be difficult for deuteranopes to interpret. Our findings suggest that the frequency of this problem has decreased over time and that articles from cell-oriented disciplines were most often problematic. We used machine learning to automate the identification of problematic images. For a hold-out test set from eLife (n=879), a convolutional neural network classified the images with an area under the precision-recall curve of 0.75. The same network classified images from PubMed Central (n=1191) with an area under the precision-recall curve of 0.39. We created a Web application (https://bioapps.byu.edu/colorblind_image_tester); users can upload images, view simulated versions, and obtain predictions. Our findings shed new light on the frequency and nature of scientific images that may be problematic for deuteranopes and motivate additional efforts to increase accessibility.
-
- Computational and Systems Biology
The force developed by actively lengthened muscle depends on different structures across different scales of lengthening. For small perturbations, the active response of muscle is well captured by a linear-time-invariant (LTI) system: a stiff spring in parallel with a light damper. The force response of muscle to longer stretches is better represented by a compliant spring that can fix its end when activated. Experimental work has shown that the stiffness and damping (impedance) of muscle in response to small perturbations is of fundamental importance to motor learning and mechanical stability, while the huge forces developed during long active stretches are critical for simulating and predicting injury. Outside of motor learning and injury, muscle is actively lengthened as a part of nearly all terrestrial locomotion. Despite the functional importance of impedance and active lengthening, no single muscle model has all these mechanical properties. In this work, we present the viscoelastic-crossbridge active-titin (VEXAT) model that can replicate the response of muscle to length changes great and small. To evaluate the VEXAT model, we compare its response to biological muscle by simulating experiments that measure the impedance of muscle, and the forces developed during long active stretches. In addition, we have also compared the responses of the VEXAT model to a popular Hill-type muscle model. The VEXAT model more accurately captures the impedance of biological muscle and its responses to long active stretches than a Hill-type model and can still reproduce the force-velocity and force-length relations of muscle. While the comparison between the VEXAT model and biological muscle is favorable, there are some phenomena that can be improved: the low frequency phase response of the model, and a mechanism to support passive force enhancement.