Organisms with alternative genetic codes resolve unassigned codons via mistranslation and ribosomal rescue

Abstract

Organisms possessing genetic codes with unassigned codons raise the question of how cellular machinery resolves such codons and how this could impact horizontal gene transfer. Here, we use a genomically recoded Escherichia coli to examine how organisms address translation at unassigned UAG codons, which obstruct propagation of UAG-containing viruses and plasmids. Using mass spectrometry, we show that recoded organisms resolve translation at unassigned UAG codons via near-cognate suppression, dramatic frameshifting from at least −3 to +19 nucleotides, and rescue by ssrA-encoded tmRNA, ArfA, and ArfB. We then demonstrate that deleting tmRNA restores expression of UAG-ending proteins and propagation of UAG-containing viruses and plasmids in the recoded strain, indicating that tmRNA rescue and nascent peptide degradation is the cause of impaired virus and plasmid propagation. The ubiquity of tmRNA homologs suggests that genomic recoding is a promising path for impairing horizontal gene transfer and conferring genetic isolation in diverse organisms.

https://doi.org/10.7554/eLife.34878.001

eLife digest

Usually, DNA passes from parent to offspring, vertically down the generations. But not always. In some cases, it can move directly from one organism to another by a process called horizontal gene transfer. In bacteria, this happens when DNA segments pass through one bacterium’s cell wall and are then picked up by another bacterium. Because the vast majority of organisms share the same genetic code, the bacteria can read this DNA with ease, as it is in the same biological language.

Horizontal gene transfer helps bacteria adapt and evolve to their surroundings, letting them swap and share genetic information that could be useful. The process also poses a threat to human health because the DNA that bacteria share can help spread antibiotic resistance. However, some organisms use an alternative genetic code, which obstructs horizontal gene transfer. They cannot read the DNA transmitted to them, because it is in a different ‘biological language’. The mechanism of how this language barrier works has been poorly understood until now.

Ma, Hemez, Barber et al. investigated this using Escherichia coli bacteria with an artificially altered genetic code. In this E. coli, one of the three-letter DNA ‘words’ in the sequence is a blank – it does not exist in the bacterium’s biological language. This three-letter DNA word normally corresponds to a particular protein building block. Using a technique called mass spectrometry, Ma et al. analyzed the proteins this E. coli forms. The results showed that it has several strategies to deal with DNA transmitted horizontally into the bacterium. One method is destroying the proteins that are half-created from the DNA, using molecules called tmRNAs. These are part of a rescue system that intervenes when protein translation stalls on the blank word. The tmRNAs help to add a tag to half-formed proteins, marking them for destruction.

This mechanism creates a ‘genetic firewall’ that prevents horizontal gene transfer. In organisms engineered to work from an altered genetic code, this helps to isolate them from outside interference. The findings could have applications in creating engineered bacteria that are safer for use in fields such as medicine and biofuel production.

https://doi.org/10.7554/eLife.34878.002

Introduction

The standard genetic code allows faithful translation of proteins across nearly all living organisms and enables horizontally transferred genetic elements (HTGEs), such as conjugative plasmids and viruses, to exploit a host’s translational machinery (Krakauer and Jansen, 2002). Since naturally occurring exceptions to the standard genetic code exist (Ambrogelly et al., 2007; Knight et al., 2001), researchers have hypothesized that such alternative genetic codes might arise to escape viral predation (Shackelton and Holmes, 2008). Recent research supports this hypothesis, with modification to codon usage or the genetic code reducing the ability of viruses and conjugative plasmids to exploit their hosts (Coleman et al., 2008; Lajoie et al., 2013b; Ma and Isaacs, 2016). Given the medical, technological, and evolutionary importance of HTGE-mediated horizontal gene transfer (HGT) (Davies, 1994; Gogarten and Townsend, 2005; Moe-Behrens et al., 2013; Ochman et al., 2000), understanding the molecular basis for how alternative genetic codes impede HTGEs is vital.

At the molecular level, an alternative genetic code arises from reassignment of one or more codons in the genetic code, which stems from a change in the ability of an aminoacyl-tRNA or release factor (RF) to recognize codon(s) during translation. One possible alteration of the genetic code is the loss of a codon assignment through the deletion or modification of an aminoacyl-tRNA or release factor, removing the cell’s ability to decode that codon (Figure 1A). Such unassigned codons are found in alternative genetic codes in nature (Knight et al., 2001) and have been engineered into genomically recoded organisms (GROs) derived from Escherichia coli (Isaacs et al., 2011; Lajoie et al., 2013b). We recently demonstrated that a GRO with an unassigned UAG codon (i.e. lacking all instances of the UAG codon and release factor 1, RF1) impaired the propagation of HTGEs carrying UAG-ending genes, illustrating that alternative genetic codes can obstruct HGT (Ma and Isaacs, 2016) and establishing the GRO as an ideal model to study the molecular mechanisms that act at unassigned codons to impair HTGEs.

Figure 1

A UAG-ending transcript in the genomically recoded organism (GRO) may produce proteins with multiple differing C-termini.

(A) Unassigned codons arise when either the cognate tRNA or the release factor recognizing a codon is removed. (B) Since the GRO lacks Release Factor 1 (RF1), ribosomal stalling at the UAG codons results in three possible fates for the nascent protein (blue): (1) suppression of the codon by a near-cognate or suppressor tRNA (yellow) and continued translation, (2) frameshifting of bases along the mRNA transcript into a new reading frame and continued translation (purple), or (3) ribosomal rescue by the *ssrA*-encoded tmRNA, ArfA, or ArfB proteins. If ribosomal rescue occurs via tmRNA, the resulting protein is tagged with a peptide sequence (red) for degradation, while rescue via ArfA or ArfB results in release of peptide without C-terminal modification.

https://doi.org/10.7554/eLife.34878.003

Encountering an unassigned codon during translation leads to ribosomal stalling, and without resolution, to cell death (Keiler and Feaga, 2014). However, the survival of organisms engineered to lack RF1 but retaining some UAG codons in their protein-coding sequences (Heinemann et al., 2012; Mukai et al., 2010) and the ability of GROs to resist exploitation by and continue growth in the presence of HTGEs (Ma and Isaacs, 2016) indicates that E. coli can resolve translation at unassigned UAG codons. We hypothesize that three mechanisms could resolve translation at prokaryotic ribosomes encountering these unassigned codons, each resulting in peptides with different C-terminal sequences (Figure 1B): (1) suppression of the codon by a near-cognate or mutated tRNA (e.g. amber suppressor) and continued translation, (2) frameshifting of bases along the mRNA transcript into a new reading frame and continued translation, or (3) stalling that elicits one of three ribosomal rescue pathways (tmRNA-SmpB, ArfA, or ArfB) in the cell (Keiler, 2015). The tmRNA-SmpB system acts as the primary rescue mechanism in prokaryotes, resolving ribosomal stalling that arises from the translation of mRNAs lacking a stop codon due to mRNA degradation, frameshifting, and stop codon read-through (Keiler, 2015). tmRNA-SmpB can also rescue ribosomes stalled on intact mRNAs for structural reasons (Cruz-Vera et al., 2011; Keiler, 2015; Li et al., 2012). The ssrA-encoded tmRNA associates with SmpB to form the tmRNA-SmpB complex, which adds a C-terminal degradation tag to peptides on stalled ribosomes (Tu et al., 1995). ArfA and ArfB, the secondary ribosomal rescue systems, alleviate stalling and release the stalled ribosome’s nascent peptide without modification (Chadani et al., 2012; Shimizu, 2012). 
tmRNA, ArfA, and ArfB all act on nonstop ribosomal complexes, which are stalled ribosomes that have reached the 3’ end of an mRNA because of stop-codon readthrough or because of the loss of a stop codon due to 3’ exonuclease degradation (Keiler, 2015). A possible fourth outcome identified from in vitro studies is loss of translational fidelity after the ribosome encounters rare or unassigned codons (Gingold and Pilpel, 2011), followed by untemplated termination by release factor 2 (RF2) (Zaher and Green, 2009).

Studies of ribosomal stalling arising at rare codons (Hayes et al., 2002) or in contexts of depleted or inefficient cognate decoding elements (George et al., 2016; Li et al., 2007; Roche and Sauer, 1999) suggest that a number of these mechanisms could resolve translation at unassigned codons, but a lack of well-characterized model organisms with an unassigned codon has precluded direct study of this question. Here, we use the GRO as a model to demonstrate that unassigned UAG codons in mRNA transcripts (1) elicit suppression, ribosomal frameshifting, and ribosomal rescue, (2) can induce ribosomal frameshifting from at least −3 to +19 nucleotides, and (3) lead to total loss of translational fidelity. By selectively deleting ribosomal rescue pathways in the GRO, we show that the tmRNA system is primarily responsible for rescuing ribosomes stalled at unassigned codons, with deletion of the tmRNA restoring expression of UAG-ending genes and re-enabling propagation of UAG-containing plasmids and viruses in the GRO. Our work reveals mechanistic details into how cells rescue ribosomes stalled at unassigned stop codons, providing insight into how alternative genetic codes act as barriers to HTGEs and demonstrating genomic recoding as a broadly applicable strategy to obstruct HGT in engineered organisms.

Results

Suppression, ribosomal frameshifting, and tmRNA-mediated peptide tagging occur at unassigned codons

In prior work, we constructed an Escherichia coli strain in which all UAG codons were mutated to UAA, permitting the deletion of release factor 1 (RF1) and resulting in an organism that lacks a codon assignment of UAG. This genomically recoded organism (GRO) (Isaacs et al., 2011; Lajoie et al., 2013b) exhibited resistance to multiple viruses and failure to propagate conjugative plasmids (Lajoie et al., 2013b; Ma and Isaacs, 2016) attributable to the unassigned UAG codon, but the molecular mechanisms that resolve unassigned UAG codons during translation remained unknown. In this study, we conducted two main experiments to uncover these mechanisms: (1) analysis of proteins translated from UAG-ending transcripts via mass spectrometry and western blots and (2) phenotypic assays to assess whether gene deletions of specific rescue factors restored the ability of conjugative plasmids and viruses to exploit the GRO. Since we hypothesized that the tmRNA-mediated response may resolve ribosomal stalling at the UAG codon, we also mutated the degradation tag encoded by the tmRNA from AANDENYALAA (AA-tag) to AANDENYALDD (DD-tag) for protein expression for mass spectrometry experiments. This mutation increases the half-life of protein products released by tmRNA (Keiler et al., 1996; Roche and Sauer, 1999), enabling their detection via mass spectrometry.

We assembled plasmids (pUAG-GFP and pUAA-GFP) encoding GFP genes with C-terminal 6x-His tags positioned immediately upstream of a UAG or UAA stop codon. We then expressed GFP from pUAG-GFP and pUAA-GFP in GRO cells containing the RF1-encoding prfA gene (GRO.DD.prfA+) or in GRO cells lacking prfA and consequently without UAG assignment (GRO.DD) (Figure 2A; Table 1; see also Key Resources Table for a list of plasmids used in this study). We then purified proteins by nickel affinity chromatography, performed trypsin digest, and used tandem mass spectrometry to collect peptide mass data as described previously (Aerni et al., 2015; Amiram et al., 2015). To distinguish between mechanisms of ribosomal rescue and mistranslation at the UAG codon, we searched mass spectrometry data with theoretical peptide libraries detailed in Table 2 (see also Supplementary file 3 and 4) to identify evidence for suppression, ribosomal frameshifting, rescue via tmRNA tagging, and loss of translational fidelity.

Figure 2

UAG codons in the genomically recoded organism elicit suppression, frameshifting, and tagging for degradation by the tmRNA.

(A) Schematic of the GFP construct with a C-terminal 6x-His tag and a UAG stop codon, showing 102 nucleotides downstream of the UAG codon and the positions of other stop codons in the downstream tail. (B) Peptides identified from the C-terminus of a UAG-ending GFP construct expressed in the GRO (using libraries detailed in Supplementary file 3 and 4). Purified GFP protein was digested with trypsin, processed via MS/MS, and the resulting data were computationally searched using libraries encoding all possible suppressors and all possible subsequent reading frames. Peptides are mapped to the C-terminus of the original GFP construct and grouped by reading frame, with the number of bases skipped listed in the left column. Green text represents GFP, blue text represents the C-terminal 6xHis tag and unframeshifted readthrough, orange text represents the position of a UAG stop codon, purple text represents frameshifted readthrough, and red text represents the tmRNA tag. Black dashes represent ribosomal frameshifts (Figure 2—source datas 1 and 2). (C) MS/MS spectra for two peptides: the C-terminus of GFP with the appended degradation tag (LEHHHHHHAANDENYALDD) and the C-terminus of GFP demonstrating a +10 base skip in translation (LEHHHHHHGDPMVR). The other spectra validated from UAG-GFP expressing GRO.AA are shown in Supplementary file 2.

https://doi.org/10.7554/eLife.34878.004

Figure 2—source data 1 Raw data and analysis of peptides detected in mass spectrometry datasets using a library generated to search for frameshifting, near-cognate suppression, and ribosomal rescue events (Supplementary file 3).: https://doi.org/10.7554/eLife.34878.005
Download elife-34878-fig2-data1-v1.xlsx
Figure 2—source data 2 Raw data and analysis of peptides detected in mass spectrometry datasets using a library generated to search for loss of translational fidelity (Supplementary file 4).: https://doi.org/10.7554/eLife.34878.006
Download elife-34878-fig2-data2-v1.xlsx

Table 1

Strains used in this study.

https://doi.org/10.7554/eLife.34878.007

Strain Abbreviation*	Ancestor (source)^†	Genotype	# UAG Codons^‡	RF1 Status^§	Ribosomal rescue gene deletion	ssrA tag Status^#	Investigated in
GRO.DD.prfA+	GRO.AA (this study)	ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla]	0	+RF1	n/a	DD	GFP expression for mass spectrometry (Figure 2)
GRO.DD	GRO.AA (this study)	ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla], ΔprfA, ΔtolC	0	∆RF1	n/a	DD	GFP expression for mass spectrometry (Figure 2)
ECNR2.AA	E. coli MG1655 (Wang et al., 2009)	MG1655 ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla]	321	+RF1	n/a	AA	Fitness, conjugation, and viral infection (Figures 3 and 4)
GRO.AA	ECNR2.AA (Lajoie et al., 2013b)	ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla], ΔprfA, ΔtolC	0	∆RF1	n/a	AA	Fitness, conjugation, and viral infection (Figures 3 and 4)
GRO.AA.∆ssrA	GRO.AA (this study)	ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla], ΔprfA, ΔtolC	0	∆RF1	∆ssrA	AA	Fitness, conjugation, and viral infection (Figures 3 and 4)
GRO.AA.∆arfA	GRO.AA (this study)	ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla], ΔprfA, ΔtolC	0	∆RF1	∆arfA	AA	Fitness, conjugation, and viral infection (Figures 3 and 4)
GRO.AA.∆arfB	GRO.AA (this study)	ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla], ΔprfA, ΔtolC	0	∆RF1	∆arfB	AA	Fitness, conjugation, and viral infection (Figures 3 and 4)
GRO.AA.∆ssrA.∆arfB	GRO.AA (this study)	ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla], ΔprfA, ΔtolC	0	∆RF1	∆ssrA, ∆arfB	AA	Fitness, conjugation, and viral infection (Figures 3 and 4)
GRO.AA.∆arfA.∆arfB	GRO.AA (this study)	ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla], ΔprfA, ΔtolC	0	∆RF1	∆arfA, ∆arfB	AA	Fitness, conjugation, and viral infection (Figures 3 and 4)

*All strains derived from ECNR2, as described in Wang et al. (2009).

†See Key Resources Table for additional information on strains and sources. The GenBank accession number for E. coli MG1655 is U00096, and the GenBank accession number for GRO.AA is CP006698.
‡ Out of a total of 321 in the original ECNR2 strain.

§RF1 terminates translation at UAG and UAA. Deletion of RF1 eliminates recognition of UAG during translation; translational termination continues through RF2, which recognizes UAA and UGA.
#The ssrA gene encodes the tmRNA, which appends the ssrA degradation tag to nascent peptides on stalled ribosomes. The wild-type sequence is AANDENYALAA; mutation of the C-terminus to AANDENYALDD slows degradation of peptides to enable detection by mass spectrometry.

Table 2

Components of peptide library constructed to search and analyze tandem mass spectrometry data.

The LEHHHHHHXXX library was separate from the library that contained the entries of the first three rows of the table (see Supplementary file 3 and 4).

https://doi.org/10.7554/eLife.34878.008

Library component	Example peptides (from Figure 2A)	Enables detection of…	Complete peptide list
Any one of 20 canonical amino acids inserted at the UAG codon	LEHHHHHHQGAR	Near-cognate suppression	Supplementary file 3
Any length of C-tail following UAG codon to the next non-UAG stop codon or to 38 amino acids downstream of the UAG codon, whichever came first	ALGDPMVR	Readthrough, frameshifting, and rescue by ArfA or ArfB	Supplementary file 3
AANDENYALDD degradation tag	LEHHHHHHGDAANDENYALDD	Rescue by tmRNA-SmpB	Supplementary file 3
All peptides of form LEHHHHHHXXX, where X is any amino acid	LEHHHHHHQLD	Loss of translational fidelity	Supplementary file 4
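
The first and last rows of Table 2 describe libraries that are simple to enumerate. The following is a minimal sketch of that enumeration, not the authors' search pipeline (the actual peptide lists are in Supplementary files 3 and 4). The variable names are illustrative, and the in-frame tail `GAR` is an assumption read off the single example peptide LEHHHHHHQGAR in the table.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 canonical residues, one-letter codes
HIS_TAGGED_CTERM = "LEHHHHHH"          # tryptic C-terminal peptide of the GFP construct
IN_FRAME_TAIL = "GAR"                  # assumption: taken from the Table 2 example LEHHHHHHQGAR

# Row 1: any one canonical amino acid inserted at the UAG codon,
# followed by in-frame readthrough up to the next tryptic site.
suppression_library = [HIS_TAGGED_CTERM + aa + IN_FRAME_TAIL for aa in AMINO_ACIDS]

# Row 4: every LEHHHHHHXXX peptide, X being any canonical amino acid.
fidelity_library = [
    HIS_TAGGED_CTERM + "".join(xxx) for xxx in product(AMINO_ACIDS, repeat=3)
]

print(len(suppression_library))  # 20
print(len(fidelity_library))     # 8000
```

The size of the LEHHHHHHXXX library (20³ = 8,000 entries) gives a sense of why it was searched separately from the other components.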

In the GRO lacking UAG assignment, the UAG codon elicited a combination of ribosomal rescue mechanisms and mistranslation events, including tmRNA-mediated tagging, near-cognate suppression, and frameshifting. The mutated ssrA DD-tag appended directly to the C-terminus of GFP (LEHHHHHHAANDENYALDD) appeared in both UAG- and UAA-ending transcripts in GRO.DD and GRO.DD.prfA⁺ (Figure 2, Supplementary file 1 – Table S1), consistent with previous reports that overexpressed proteins are targeted for degradation by the tmRNA (Baneyx and Mujacic, 2004; Li et al., 2007; Moore and Sauer, 2005; Tu et al., 1995). Both samples also contained the unmodified C-terminus of GFP (LEHHHHHH). In GRO.DD.prfA⁺, this is likely due to translational termination via RF1, while in GRO.DD this may represent rescue of nonstop ribosomes by ArfA/ArfB, release of nascent peptides undergoing translation at the time of cell lysis, or spontaneous dissociation of the ribosome, although this last event is estimated to occur fewer than once per 100,000 codon decoding events (Keiler and Feaga, 2014). While these were the only C-terminal fragments detected in GRO.DD expressing UAA-GFP and in GRO.DD.prfA⁺ expressing UAG-GFP, GRO.DD [pUAG-GFP] contained greater than 30 unique C-terminal sequences (Supplementary file 2).

The peptide fragments detected from GRO.DD [pUAG-GFP] demonstrate a combination of near-cognate suppression, ribosomal frameshifting, and tmRNA tagging (Figure 2B). We identified two previously known suppression events, glutamine (Q) and tyrosine (Y) (Aerni et al., 2015; Lajoie et al., 2013b), and observed two new suppressors, aspartic acid (D) and valine (V). We detected ribosomal frameshifting of up to −3 (LEHHHHHHH) and +19 nucleotides (LEHHHHHHMVR), as determined by the presence of fragments from all three reading frames appended to the C-terminal peptide of LEHHHHHH. Additionally, the LEHHHHHHHH peptide may indicate a −6 frameshift, although it is impossible to determine whether this peptide arises from a −6 frameshift or two −3 frameshifts between histidine incorporation. We also detected peptides encoded as far downstream as +82 nucleotides after the UAG codon, illustrating that the ribosome can continue translation after encountering the unassigned UAG codon provided that stalling at the UAG codon is resolved. Lastly, we identified the modified ssrA DD-tag at both the site of the UAG codon and downstream on multiple peptides.
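
To make the "bases skipped" bookkeeping concrete, the sketch below translates an mRNA from a position offset relative to the first base of the UAG codon. It is only an illustration under stated assumptions: the sequence `TOY_MRNA` is invented (it is NOT the pUAG-GFP sequence) and was chosen so that skips of +10, +19, and −3 reproduce the tails GDPMVR, MVR, and H reported above. The function name `tail_after_skip` is hypothetical, and translation is simply halted at UAA, UGA, or another UAG.

```python
from itertools import product

BASES = "UCAG"
# Standard genetic code in UCAG order (first, second, third base); '*' marks stop codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Invented toy transcript: L E H H H H H H, then UAG, then a made-up downstream tail.
UPSTREAM = "CUCGAG" + "CAC" * 6
TOY_MRNA = UPSTREAM + "UAG" + "GGCGCUC" + "GGUGAUCCAAUGGUUCGUUAA"
UAG_START = len(UPSTREAM)  # index of the U in the UAG codon


def tail_after_skip(mrna: str, uag_start: int, skip: int) -> str:
    """Residues added after LEHHHHHH if decoding resumes `skip` bases from the UAG."""
    tail = []
    for i in range(uag_start + skip, len(mrna) - 2, 3):
        aa = CODON_TABLE[mrna[i:i + 3]]
        if aa == "*":  # UAA/UGA terminate; a second UAG would stall again
            break
        tail.append(aa)
    return "".join(tail)


for skip in (-3, 10, 19):
    print(skip, "LEHHHHHH" + tail_after_skip(TOY_MRNA, UAG_START, skip))
```

In this convention a skip that is a multiple of three stays in the original frame, while +10 and +19 both land in the same shifted frame, which is why the +19 tail (MVR) is a suffix of the +10 tail (GDPMVR).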

Prior research in vitro revealed that a mistranslation event increases the likelihood of subsequent mistranslation events and termination by release factor 2 (RF2) (Zaher and Green, 2009), and we investigated whether we could detect peptides representing such mistranslation events. Given the difficulty of distinguishing such peptides from suppression or frameshifting with one or two amino acids, we created a hypothetical peptide library (Supplementary file 1 – Table S2) containing all combinations of LEHHHHHHXXX, wherein X is any amino acid incorporated at the three residue positions directly downstream of the UAG codon (Supplementary file 4). The search with this library returned 23 unique peptides, 14 of which met our scoring threshold of 15 (Aerni et al., 2015). Five of these peptides (LEHHHHHHEKP, LEHHHHHHQLD, LEHHHHHHQQR, LEHHHHHHSLK, and LEHHHHHHYQR) could only arise from the mRNA transcript through two or more frameshift events after stalling at the UAG codon had already resolved (Supplementary file 1 – Table S2), suggesting they instead arise from loss of translational fidelity and spontaneous termination of translation following mistranslation at the UAG codon. We also had enough resolution in the data to manually verify the amino acid sequences of LEHHHHHHQQR and LEHHHHHHYQR, noting a 35 Da shift in mass between the Q and Y in the third position from the C-terminus.
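
The 35 Da shift between Q and Y can be checked from the elemental composition of the two residues. The sketch below is a worked calculation, not part of the authors' analysis; it uses standard monoisotopic atomic masses and the residue formulas for glutamine (C5H8N2O2) and tyrosine (C9H9NO2), and the helper name `residue_mass` is illustrative.

```python
# Monoisotopic atomic masses in daltons.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.00307401, "O": 15.99491462}

# Elemental composition of each amino acid residue (free amino acid minus H2O).
RESIDUE_FORMULA = {
    "Q": {"C": 5, "H": 8, "N": 2, "O": 2},  # glutamine
    "Y": {"C": 9, "H": 9, "N": 1, "O": 2},  # tyrosine
}


def residue_mass(aa: str) -> float:
    """Monoisotopic mass of a single residue within a peptide chain."""
    return sum(MONOISOTOPIC[el] * n for el, n in RESIDUE_FORMULA[aa].items())


shift = residue_mass("Y") - residue_mass("Q")
print(round(residue_mass("Q"), 3))  # 128.059
print(round(residue_mass("Y"), 3))  # 163.063
print(round(shift, 3))              # 35.005
```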

Although several alternative hypotheses may explain these random tripeptides, these explanations are either incomplete or unlikely given our current understanding of prokaryotic translation. First, it is improbable that these fragments arose from routine errors in mRNA transcription because this would require at least two transcriptional errors in a nine-nucleotide span. The transcription error rate in E. coli is estimated to be ~1 in 10,000 bases (Blank et al., 1986; Rosenberger and Hilton, 1983) and our strains have no known mutations that would lead to greater error rates in transcription. Second, it is possible that ArfA or ArfB may have terminated translation in these peptides due to 3’ exonuclease shortening of the mRNA transcript as the ribosome is stalled at the UAG codon (Keiler and Feaga, 2014; Yamamoto et al., 2003). However, this does not explain the non-encoded tripeptides appended to the LEHHHHHH peptide. Lastly, the peptides LEHHHHHHQQR, LEHHHHHHSLK, and LEHHHHHHYQR may have been part of longer peptides that were cleaved off during trypsin digest. In this case, translation may have continued past the C-terminal R or K observed in these peptides, but this consideration would not apply to LEHHHHHHEKP and LEHHHHHHQLD and again does not explain the non-encoded tripeptide sequence observed appended to LEHHHHHH. Given this, we hypothesize that these five peptides result from loss of translational fidelity after stalling at the UAG codon that may lead to (1) spontaneous termination of translation due to the untemplated action of RF2 following mistranslation or (2) ArfA- or ArfB-mediated release predicated on 3’ exonuclease degradation of the mRNA. The rare event of spontaneous hydrolysis of the peptide from the ribosome is also possible.
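
The trypsin argument above comes down to whether a peptide ends in K or R, the residues after which trypsin cleaves. A minimal sketch of that check over the five peptides follows; the names `ends_at_tryptic_site`, `possibly_truncated`, and `not_explained_by_trypsin` are illustrative only.

```python
CANDIDATES = [
    "LEHHHHHHEKP",
    "LEHHHHHHQLD",
    "LEHHHHHHQQR",
    "LEHHHHHHSLK",
    "LEHHHHHHYQR",
]


def ends_at_tryptic_site(peptide: str) -> bool:
    """True if the C-terminal residue is K or R, i.e. trypsin could have cut here."""
    return peptide[-1] in "KR"


# Peptides that might be fragments of longer translation products.
possibly_truncated = [p for p in CANDIDATES if ends_at_tryptic_site(p)]
# Peptides whose C-terminus cannot be attributed to trypsin cleavage.
not_explained_by_trypsin = [p for p in CANDIDATES if not ends_at_tryptic_site(p)]

print(possibly_truncated)        # ['LEHHHHHHQQR', 'LEHHHHHHSLK', 'LEHHHHHHYQR']
print(not_explained_by_trypsin)  # ['LEHHHHHHEKP', 'LEHHHHHHQLD']
```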

ssrA and arfB mediate degradation of proteins containing unassigned UAG codons

Since mass spectrometry data indicated that a combination of mechanisms could resolve stalled translation at the unassigned UAG codon, we generated targeted deletions of the ribosomal rescue systems (ssrA, arfA, and arfB) in strains with wild-type ssrA sequence (GRO.AA) to determine whether protein production from UAG-ending transcripts in ΔRF1 cells could be restored to levels seen in +RF1 cells. Using recombineering (Sharan et al., 2009), we produced single and double deletions of the ssrA, arfA, and arfB genes that encode the ribosomal rescue systems. Efforts to generate a double deletion of ssrA and arfA failed (data not shown) because the resulting phenotype is synthetic lethal (Chadani et al., 2010). We transformed each deletion strain with the UAG-GFP construct under a highly expressing, inducible pLtetO promoter (Lutz and Bujard, 1997) and induced GFP expression for 20 hr, measuring the effect of protein expression on cellular growth through doubling time and maximum optical density at 600 nm (OD₆₀₀) (Figure 3A and B, Supplementary file 1 – Table S3). To quantify protein expression, we then assayed whole-cell lysate from equal cell numbers, as determined by OD₆₀₀, for abundance of protein via anti-GFP western blot alongside GFP standards of known concentration as described previously (Figure 3C, Figure 3—source data 6) (Pirman et al., 2015). We also included as positive controls (1) a wild-type strain (ECNR2) expressing the UAG-GFP construct and (2) GRO.AA expressing UAA-GFP.

Figure 3

Deletion of both *ssrA* and *arfB* restores protein production in the genomically recoded organism.

(A) Comparison of doubling times for WT and GRO strains carrying listed deletions with and without GFP induction. Error bars show standard deviation centered at mean, n = 3; data were analyzed using Source code 1 (Figure 3—source datas 1 and 2). (B) Change in maximum optical density at 600 nm (OD₆₀₀) due to expression of UAG-GFP or UAA-GFP in wild-type (WT) and GRO strains carrying listed deletions. Error bars show standard deviation centered at mean, n = 3 (Figure 3—source datas 1 and 2). (C) Quantification of GFP abundance per 1 mL of cells at OD₆₀₀ of 2.5 via western blot from biological replicates of indicated strains (Figure 3—source datas 3–6). Error bars show standard deviation centered at mean, n = 3 (Figure 3—source datas 3–5). See Figure 3—figure supplement 1 for linear calibration curves used to quantify GFP abundance for each replicate experiment. Image of representative western blot is below the graph. p-values are calculated in relation to the GRO containing the UAG-ending GFP (GRO – UAG) and are as follows: * is p≤0.05, ** is p≤0.01, *** is p≤0.001, and **** is p≤0.0001.

https://doi.org/10.7554/eLife.34878.009

Figure 3—source data 1 Growth curve data from 96-well plate assay analyzed using Source code 1 (one of three plate replicates), used for data represented in Figure 3A and B.: https://doi.org/10.7554/eLife.34878.011
Download elife-34878-fig3-data1-v1.xlsx
Figure 3—source data 2 Analysis of doubling times and maximum OD₆₀₀’s of indicated strains. File contains doubling times and maximum OD₆₀₀’s for three separate experiments conducted on different plate reader machines. Each experiment tested each sample in biological triplicate. Only the biological triplicate data from Plate 3 is represented in Figure 3A and B.: https://doi.org/10.7554/eLife.34878.012
Download elife-34878-fig3-data2-v1.xlsx
Figure 3—source data 3 Anti-GFP western blot image used for quantification of GFP yields; replicate 1.: https://doi.org/10.7554/eLife.34878.013
Download elife-34878-fig3-data3-v1.zip
Figure 3—source data 4 Anti-GFP western blot image used for quantification of GFP yields; replicate 2.: https://doi.org/10.7554/eLife.34878.014
Download elife-34878-fig3-data4-v1.zip
Figure 3—source data 5 Anti-GFP western blot image used for quantification of GFP yields; replicate 3.: https://doi.org/10.7554/eLife.34878.015
Download elife-34878-fig3-data5-v1.zip
Figure 3—source data 6 Analysis of western blot data represented in Figure 3C.: https://doi.org/10.7554/eLife.34878.016
Download elife-34878-fig3-data6-v1.xlsx

Expression of UAG-GFP impaired GRO growth rate and cell density, generating a 54% increase in doubling time and 8% reduction in maximum OD₆₀₀ compared to cells not expressing UAG-GFP, and a 25% greater doubling time and 14% lower maximum OD₆₀₀ compared to cells expressing UAA-GFP. In contrast, ECNR2 exhibited only a 7% increase in doubling time and a 5% reduction in maximum OD₆₀₀ when expressing UAG-GFP. Although deletion strains experienced reduced growth rate as measured by doubling time compared to the GRO.AA, they exhibited a less pronounced increase in doubling time when expressing UAG-GFP (increases in doubling time between 12% and 50%) as compared to the GRO.AA (54% increase in doubling time) (Figure 3A). However, deletion of ssrA reduced fitness during protein expression as measured by maximum OD₆₀₀, with GRO.AA.∆ssrA demonstrating a 34% reduction in max OD₆₀₀ and GRO.AA.∆ssrA.∆arfB demonstrating a 61% decrease in max OD₆₀₀. This is potentially due to increased presence of misfolded or prematurely truncated peptides that are ordinarily tagged and degraded by the tmRNA. Interestingly, deletion of arfB produces a 50% increase in doubling time during protein expression, suggesting ArfB may play a role in ribosomal rescue during high levels of ribosomal stalling.

We then investigated the impact of unassigned codons on protein production using western blot densitometry, and found that the GRO expressing UAG-GFP produced less than one-fourth as much protein as ECNR2 expressing UAG-GFP (Figure 3C, 8.0 µg/ml for the GRO versus 35 µg/ml for ECNR2, p=0.0014). GRO.AA expressing UAA-GFP produced roughly 8.5 times as much protein as GRO.AA expressing UAG-GFP (68 µg/ml for GRO.AA [pUAA-GFP] versus 8.0 µg/ml for GRO.AA [pUAG-GFP], p<0.0001), indicating that the UAG codon in pUAG-GFP is the cause of reduced protein expression in the GRO. Deletion of ssrA in the UAG-GFP-expressing GRO partially restored protein production toward the level seen in its UAA-GFP-expressing counterpart with no knockouts (31 µg/ml for GRO.AA.∆ssrA [pUAG-GFP] versus 68 µg/ml for GRO.AA [pUAA-GFP]), and deletion of both ssrA and arfB fully restored protein production (70. µg/ml). These ssrA deletion strains likely demonstrate increased GFP expression and reduced growth rate (Figure 3A) and cell density (Figure 3B) because translation of GFP transcripts sequesters cellular resources at the expense of cellular replication, producing GFP peptides that are freed from nonstop ribosomes via ArfA or ArfB without addition of a degradation tag.
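
The fold changes quoted in this paragraph follow directly from the reported GFP yields. The short calculation below is only a worked check of that arithmetic; the dictionary keys are shorthand labels introduced here, not strain names from the study.

```python
# Reported GFP yields in µg/ml (Figure 3C); keys are shorthand labels.
gfp_ug_per_ml = {
    "ECNR2_UAG": 35.0,           # ECNR2 [pUAG-GFP]
    "GRO_UAG": 8.0,              # GRO.AA [pUAG-GFP]
    "GRO_UAA": 68.0,             # GRO.AA [pUAA-GFP]
    "GRO_dssrA_UAG": 31.0,       # GRO.AA.ΔssrA [pUAG-GFP]
    "GRO_dssrA_darfB_UAG": 70.0, # GRO.AA.ΔssrA.ΔarfB [pUAG-GFP]
}

gro_vs_wt = gfp_ug_per_ml["GRO_UAG"] / gfp_ug_per_ml["ECNR2_UAG"]
uaa_vs_uag = gfp_ug_per_ml["GRO_UAA"] / gfp_ug_per_ml["GRO_UAG"]
ssrA_recovery = gfp_ug_per_ml["GRO_dssrA_UAG"] / gfp_ug_per_ml["GRO_UAA"]
double_recovery = gfp_ug_per_ml["GRO_dssrA_darfB_UAG"] / gfp_ug_per_ml["GRO_UAA"]

print(f"GRO vs ECNR2 on UAG-GFP: {gro_vs_wt:.2f}")          # 0.23, below one-fourth
print(f"UAA-GFP vs UAG-GFP in GRO: {uaa_vs_uag:.1f}x")      # 8.5x
print(f"ssrA deletion recovery: {ssrA_recovery:.0%}")       # 46%
print(f"ssrA + arfB deletion recovery: {double_recovery:.0%}")  # 103%
```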

A deletion of arfB leads to strikingly low protein abundances from UAG-GFP transcripts that approach the lower limit of detection of our assay, although this apparent reduction in protein production was not statistically significant in comparison to protein production by GRO.AA [pUAG-GFP]. These ArfB deletion data, together with the fitness reduction observed in the GRO, suggest that ArfB is constitutively expressed and relieving low levels of ribosomal stalling in E. coli. These data also suggest that while deletion of ssrA partially recovers protein production from UAG-ending transcripts in the GRO, deletion of both ssrA and arfB is necessary to fully recover protein expression from UAG-ending transcripts to levels seen from the translation of UAA-ending transcripts in the GRO.

Deletion of ssrA restores conjugative plasmid propagation and viral infection in the GRO

To determine whether deletions of ssrA or arfB could restore propagation of horizontally-transferred genetic elements in the GRO, we assessed conjugation efficiency and growth rate from plasmids RK2 and F on GRO strains with single and double deletions of ssrA, arfA, and arfB. Previous research indicates that the UAG stop codon in the trfA gene on RK2 leads to impaired conjugation efficiency and replication in the GRO (Ma and Isaacs, 2016), likely because the TrfA protein is required to initiate plasmid replication (Pansegrau et al., 1994). Phenotypically, this manifests as reduced efficiency of plasmid transfer in conjugation experiments and increased doubling times for RK2⁺ strains in media selecting for plasmid maintenance due to loss of plasmid and concomitant antibiotic resistance genes. We found that deletion of ssrA increased the ability of the GRO to both receive (Figure 4A, Supplementary file 1 – Table S4) and replicate RK2 (Figure 4B, Supplementary file 1 – Table S5). RK2 conjugation efficiency in GRO.AA.∆ssrA improved to 99% (compared to 87% in GRO.AA), and the strain showed an increase in doubling time of only 6% compared to a 28% increase for GRO.AA (p<0.0001). We observed similar results for GRO.AA.∆ssrA.∆arfB. However, single deletion of arfB halved RK2 conjugative efficiency (Figure 4A, p=0.0002). This strain also exhibited a 38% increase in doubling time when bearing RK2, compared to the 28% increase in doubling time seen in the GRO with no ribosomal rescue gene deletions (Figure 4B, p<0.0001).

Figure 4

Deleting ssrA restores propagation of both viruses and conjugative plasmids in the genomically recoded organism.

(A) Percent transfer of conjugative plasmid RK2 from a wild-type donor into wild-type (WT), GRO, or GRO with designated deletions (KO) as recipients (Figure 4—source data 1). Data are obtained from technical triplicates generated from a single biological sample. (B) Percent increase in doubling time for strains carrying plasmid RK2 compared to strains lacking RK2 (Figure 4—source data 2 and 3). (C) Number of conjugation events for conjugative plasmid F from wild-type, GRO, or GRO with designated gene deletions as donors to a wild-type recipient (Figure 4—source data 4). Data are obtained from technical triplicates generated from a single biological sample. (D) Relative titer on wild-type, GRO, and GRO with designated deletions of phage λ (Figure 4—source data 5). Error bars show standard deviation centered at mean, n = 3. p-values are calculated in relation to the GRO condition and are as follows: * is p≤0.05, ** is p≤0.01, *** is p≤0.001, and **** is p≤0.0001. (E) Effects of sequential deletions of ribosomal rescue mechanisms on conjugative plasmid transfer efficiency. (F) Effects of sequential deletions of ribosomal rescue mechanisms on viral susceptibility.

https://doi.org/10.7554/eLife.34878.017

Figure 4—source data 1. Analysis of RK2 plasmid conjugation data represented in Figure 4A. Note: These data represent technical triplicates generated from the same biological sample. https://doi.org/10.7554/eLife.34878.018 (file: elife-34878-fig4-data1-v1.xlsx)
Figure 4—source data 2. Growth curve data from 96-well plate assay analyzed using Source code 1, used for data represented in Figure 4B. https://doi.org/10.7554/eLife.34878.019 (file: elife-34878-fig4-data2-v1.xlsx)
Figure 4—source data 3. Analysis of doubling times represented in Figure 4B. https://doi.org/10.7554/eLife.34878.020 (file: elife-34878-fig4-data3-v1.xlsx)
Figure 4—source data 4. Analysis of F plasmid conjugation data represented in Figure 4C. Note: These data represent technical triplicates generated from the same biological sample. https://doi.org/10.7554/eLife.34878.021 (file: elife-34878-fig4-data4-v1.xlsx)
Figure 4—source data 5. Analysis of lambda phage infection data represented in Figure 4D. https://doi.org/10.7554/eLife.34878.022 (file: elife-34878-fig4-data5-v1.xlsx)

For plasmid F (Figure 4C, Supplementary file 1 – Table S6), which contains UAG-ending genes traY and traL that are essential for conjugation between cells (Ma and Isaacs, 2016), we found that deletion of ssrA increased conjugation events from the GRO donor 1,000-fold to 3.56 × 10⁷ (p=0.0015) compared to GRO.AA (3.30 × 10⁴ events), arfA deletion (3.41 × 10⁴ events), and arfB deletion (3.47 × 10⁴ events). GRO.AA.∆ssrA.∆arfB and GRO.AA.∆arfA.∆arfB exhibited 5.2- and 2.3-fold decreases in conjugative efficiency when compared to GRO.AA.∆ssrA and GRO.AA.∆arfA single deletion strains, respectively (p<0.01 for each, Figure 4C). These reductions in RK2 and F conjugative efficiency attributable to arfB deletion indicate that ArfB likely contributes to relief of nonstop ribosomes when encoded in its native ribosomal context, supporting evidence of ArfB’s ribosomal rescue activity previously validated in vitro (Handa et al., 2011) and when over-expressed in the absence of ssrA and arfA in vivo (Chadani et al., 2010). However, deletion of ssrA is sufficient to restore both conjugation and propagation of RK2 and F in the GRO. We next attempted infection with phage λ on our suite of deletion strains (Figure 4D, Supplementary file 1 – Table S7). Although deletion of arfA or arfB does not recover viral infection, deletion of the ssrA gene, either alone (p=0.0016) or alongside deletion of arfB (p<0.0001), recovers λ infection of the GRO to levels similar to wild-type, with about 10⁸ plaque forming units per mL (PFU/mL) (Figure 4D). These results demonstrate that removal of ssrA has the greatest influence in restoring conjugative plasmid transfer efficiency and viral susceptibility in the GRO (Figure 4E and F).
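As a quick arithmetic check on the fold change quoted for plasmid F, the ratio can be computed directly from the two event counts given in the text. This is only an illustrative sketch; the variable names are ours and do not come from the study's analysis files.

```python
# Conjugation events reported in the text for plasmid F (Figure 4C).
events_gro_aa = 3.30e4         # GRO.AA donor
events_gro_aa_dssra = 3.56e7   # GRO.AA with ssrA deleted, as donor

# Fold change in conjugation events attributable to ssrA deletion.
fold_change = events_gro_aa_dssra / events_gro_aa
print(f"fold change: {fold_change:.0f}")
```

The ratio comes out slightly above 1,000, consistent with the rounded "1,000-fold" figure in the text.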

Discussion

In this study, we use a genomically recoded organism (GRO) containing an unassigned UAG codon as a model to investigate the molecular mechanisms that obstruct the propagation of HTGEs in organisms with alternative genetic codes. We demonstrate that unassigned stop codons elicit near-cognate suppression, frameshifting, and the action of ribosomal rescue mechanisms (Figure 2). tmRNA-mediated ribosomal rescue prompted by the unassigned codon results in the degradation of nascent peptides translated from UAG-ending transcripts and obstructs the propagation of HTGEs (Figure 3, Figure 4). Additionally, ssrA deletion strains exhibit both significantly increased UAG-GFP yields (Figure 3C) and recovered propagation of HTGEs (Figure 4), consistent with evidence that deletion of ssrA removes inhibition of ArfA production and releases nascent peptides from stalled ribosomes without degradation (Chadani et al., 2011; Garza-Sánchez et al., 2011; Schaub et al., 2012). Our GRO model thus sheds light on the functional significance of previously described regulatory relationships while elucidating the unique mechanistic contributions of different ribosomal rescue systems in resolving translation at unassigned stop codons. These mechanistic outcomes that occur as a consequence of ribosomal stalling could be further investigated via ribosomal profiling in future work.

The mass spectrometry data collected from our GRO model demonstrate the striking proclivity of the ribosome to undergo unprogrammed frameshifting at unassigned stop codons and represent, to our knowledge, the first in vivo examination of such frameshifting. Prior studies have revealed programmed ribosomal frameshifting from −4 to +50 nucleotides (Atkins et al., 2016; Baranov et al., 2015; Huang et al., 1988; Yan et al., 2015), but these studies focused on frameshifts programmed into mRNA transcripts through combinations of four mechanisms: (1) use of rare codons to slow translation speed at the skip site, (2) weak base pairing of the P-site tRNA anticodon and mRNA codon, (3) strong base pairing of the P-site tRNA anticodon to the location where the ribosome will re-bind the mRNA, and (4) a region six bases upstream of the re-binding site that mimics a Shine-Dalgarno sequence and offsets the energetic cost of frameshifting (Pech et al., 2010). Although the UAG codon in our GFP transcript slows translation, the P-site codon-anticodon pair for the codon immediately upstream of UAG is exact (CAC codon and GUG anticodon of His-tRNA) (Hsu et al., 1984) and any frameshift except backward would incur greater mispairing between the P-site codon and anticodon. Additionally, no Shine-Dalgarno-like sequence (AGGAGG) (Shine and Dalgarno, 1974; Vimberg et al., 2007) exists upstream, suggesting that the GFP construct we use contains only one of the four elements required for programmed ribosomal frameshifting (Supplementary file 1). From our construct, we observed frameshifts of potentially up to −6 and +19 nucleotides in response to the unassigned UAG codon (Figure 2, Supplementary file 1 – Tables S1 and S2). Collectively, our work uncovers a wide variety of frameshifting events that can occur in response to ribosomal stalling in vivo, highlighting the capacity of the ribosome to continue translation despite missing an essential translational component.
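Two points in the paragraph above can be checked with a few lines of code: that the CAC codon and a GUG anticodon pair exactly, and which reading frame results from a shift of a given number of nucleotides. The sketch below is a simplification under stated assumptions: it considers only Watson-Crick pairs (no wobble pairing or modified bases), and the function names are hypothetical.

```python
# Watson-Crick complements for RNA bases (wobble pairs are deliberately ignored).
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def anticodon_for(codon: str) -> str:
    """Return the exactly matching anticodon (5'->3') for a codon written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(codon))


def mismatches(codon: str, anticodon: str) -> int:
    """Count positions at which the anticodon is not the Watson-Crick partner of the codon."""
    return sum(a != b for a, b in zip(anticodon_for(codon), anticodon))


def resulting_frame(shift_nt: int) -> int:
    """Reading frame (0, 1 or 2) relative to the original frame after a shift in nucleotides."""
    return shift_nt % 3


print(anticodon_for("CAC"), mismatches("CAC", "GUG"))
for shift in (-6, -3, 19):
    print(f"shift {shift:+d} nt -> frame +{resulting_frame(shift)}")
```

Under this simplification, shifts that are multiples of three nucleotides (such as −3 or −6) return the ribosome to the original frame, whereas a +19 shift lands in the +1 frame.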

Mass spectrometry analysis also revealed truncated mistranslation products that possibly represent loss of translational fidelity and termination by RF2 downstream of an initial mistranslation event at the UAG codon, known as post-peptidyl transfer quality control (Petropoulos et al., 2014; Zaher and Green, 2009), a result previously only observed in vitro. Although prior studies decades ago revealed premature truncation products in vivo (Manley, 1978), they lacked the technical capability to determine whether these peptides arose from a single mistranslation event or demonstrated loss of translational fidelity after the ribosome encounters a rare or unassigned codon. The mistranslation products we detect show repeated mistranslation events that could not have been produced by suppression, ribosomal rescue, or frameshifting, unless the ribosome frameshifted multiple times after resolving stalling at the UAG codon (Figure 2B, Supplementary file 1). These events may be followed by ribosomal rescue via ArfA or ArfB, spontaneous ribosomal dissociation, or termination via release factor 2, though our technique was not capable of distinguishing between these fates. Previous in vitro studies using purified ribosome complexes determined that a mistranslation event destabilized the P-site helix, reducing the ability of the A-site to discriminate between anticodons and resulting in further mistranslation events and rapid termination by RF2 with the assistance of release factor 3 (Zaher and Green, 2009; Zaher and Green, 2010). The researchers predicted that a single mistranslation event would also lead to prematurely truncated peptides with two or three miscoded C-terminal amino acids appended in vivo (Zaher and Green, 2009). These findings, together with our results, motivate future work to investigate the possibility of loss of translational fidelity after an initial translation error and highlight the GRO as a model for elucidating translational fidelity in vivo.

The GRO demonstrates that general ribosomal rescue mechanisms resolve ribosomal stalling at unassigned stop codons. As most sequenced bacterial species contain a homolog of the tmRNA, ArfA, or ArfB ribosomal rescue systems (Hudson et al., 2014; Keiler, 2015) and eukaryotic cells contain analogous pathways that rescue stalled ribosomes (Graille and Séraphin, 2012), we anticipate that translational stalling at unassigned codons can be resolved similarly in these organisms. Accordingly, we hypothesize that organisms beyond E. coli should tolerate unassigned codons as intermediates toward codon reassignments in genomic recoding, efforts for which are underway in numerous prokaryotic and eukaryotic species (Lau et al., 2017; Napolitano et al., 2016; Ostrov et al., 2016; Richardson et al., 2017). Additional barriers to codon reassignment exist, such as regulatory roles of codons in gene expression (Lajoie et al., 2013a), but our findings indicate that unassigned codons are tolerable in the absence of specialized translational machinery to address them, both as intermediate steps towards codon reassignment and as permanent parts of the genetic code.

Our findings suggest that we can use unassigned codons to engineer organisms with broad resistance to HTGEs and impart genetic isolation, increasing engineered organisms’ stability in biotechnology applications. Since tmRNA homologs are found in >99% of all sequenced bacterial genomes (Hudson et al., 2014; Keiler, 2015), we would expect other organisms engineered to contain unassigned codons to exhibit immunity to horizontally transferred genetic elements. As researchers pursue further efforts in whole genome recoding (Boeke et al., 2016; Lau et al., 2017; Napolitano et al., 2016; Ostrov et al., 2016; Richardson et al., 2017) and engineer organisms for use in open environments, we require strategies to genetically isolate such organisms from their surrounding environment to ensure robust function, both individually (Moe-Behrens et al., 2013) and as members of microbial communities (Grosskopf and Soyer, 2014; Hillesland and Stahl, 2010). Genomically recoded organisms with unassigned codons would possess reduced susceptibility to exploitation by HTGEs, increasing their stability in open environments. Although this work demonstrates that an unassigned stop codon acts as a barrier to HGT, this current barrier can be breached by mutation or deletion of the tmRNA to produce a functional protein. In contrast, we expect that an organism with an unassigned sense codon would have even greater barriers to HGT, as premature termination at an unassigned sense codon would likely produce a nonfunctional, truncated peptide. We thus anticipate that further genomic recoding to engineer additional unassigned sense and nonsense codons may be a broadly applicable strategy to confer genetic isolation in living systems, facilitating the safe use of engineered organisms in complex open environments.

Materials and methods

Key resources table

Genetic reagents, bacterial strains, antibodies, and software used in this study.

https://doi.org/10.7554/eLife.34878.023

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information	Isaacs Lab Reference #	Full genotype of strains	# UAG Codons	RF1 status	Ribosomal rescue gene knockout	ssrA tag status
Gene (Escherichia coli)	pUAG-GFP	this paper	eGFP-6xHis-UAG; Plasmid NJM88; Strain NJM1242	eGFP protein with a C-terminal 6-His tag for protein purification, terminating translation in a UAG codon.	Plasmid NJM88; Strain NJM1242	N/A	N/A	N/A	N/A	N/A
Gene (E. coli)	pUAA-GFP	this paper	eGFP-6xHis-UAA; Plasmid NJM89; Strain NJM1249	eGFP protein with a C-terminal 6-His tag for protein purification, terminating translation in a UAA codon.	Plasmid NJM89; Strain NJM1249	N/A	N/A	N/A	N/A	N/A
Genetic reagent (E. coli)	RK24	10.1126/science.1205822; 10.1016/j.cels.2016.06.009	pRK24; Strain NJM699	Conjugative RK2 plasmid (10.1006/jmbi.1994.1404), but lacks functional AmpR gene.	Strain NJM699	N/A	N/A	N/A	N/A	N/A
Genetic reagent (E. coli)	F	Yale University Coli Genetic Stock Center (CGSC), Strain #4401	pF; Strain EMG2; Strain CGSC#4401; Strain NJM426; Strain NJM473	Conjugative F plasmid, as described by PMID: 4568763. Obtained from the Yale CGSC.	Strain NJM426; Strain NJM473	N/A	N/A	N/A	N/A	N/A
Genetic reagent (E. coli)	pZE21_UAG-GFP	this paper	pZEtR-eGFP-cHis-TAG-v02; Plasmid NJM88; Strain NJM1242	pZE21 plasmid with pLtetO promoter driving inducible expression of eGFP with a C-terminal 6-His tag and terminating in UAG codon. Inducible with anhydro-tetracycline.	Plasmid NJM88; Strain NJM1242	N/A	N/A	N/A	N/A	N/A
Genetic reagent (E. coli)	pZE21_UAA-GFP	this paper	pZEtR-eGFP-cHis-TAA-v02; Plasmid NJM89; Strain NJM1249	pZE21 plasmid with pLtetO promoter driving inducible expression of eGFP with a C-terminal 6-His tag and terminating in UAA codon. Inducible with anhydro-tetracycline.	Plasmid NJM89; Strain NJM1249	N/A	N/A	N/A	N/A	N/A
Genetic reagent (Enterobacteria phage λ)	λ.CI857	Coli Genetic Stock Center (CGSC), Yale University (contact John Wertz directly)	λ.CI857; λ phage; Phage NJM102	Phage λ with temperature-sensitive CI repressor gene; when incubated at 37°C, phage becomes obligate lytic	Phage NJM102	N/A	N/A	N/A	N/A	N/A
Cell line (E. coli)	GRO.DD	this paper	C31GIB.tmRNA-DD; Strain #987	MG1655-derived strain with all 321 UAG codons mutated to UAA, deletion of RF1, and tmRNA tag C-terminal amino acids mutated from AA to DD. Retains lambda red cassette for recombineering. Investigated in Figure 2.	Strain #987	ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla].ΔprfA.ΔtolC.tmRNA_DD	0	∆RF1	n/a	DD
Cell line (E. coli)	GRO.DD.prfA+	this paper	C31GIB.prfA+.tmRNA-DD; Strain #996	MG1655-derived strain with all 321 UAG codons mutated to UAA, retains RF1, and tmRNA tag C-terminal amino acids mutated from AA to DD. Retains lambda red cassette for recombineering. Investigated in Figure 2.	Strain #996	ΔmutS:zeo.Δ(ybhB-bioAB):[λcI857.Δ(cro-ea59):tetR-bla].ΔtolC.tmRNA_DD	0	+RF1	n/a	DD
Cell line (E. coli)	ECNR2	10.1016/j.cels.2016.06.009	ECNR2.ΔmutS:zeocin.ΔλRed; Strain #795	MG1655-derived strain that contains 321 UAG codons and retains RF1. Investigated in Figures 3 and 4.	Strain #795	ΔmutS:zeo	321	+RF1	n/a	AA
Cell line (E. coli)	GRO.AA	10.1016/j.cels.2016.06.009	C31.final.ΔmutS:zeocin.ΔprfA.ΔλRed; Strain #796	MG1655-derived strain with all 321 UAG codons mutated to UAA, deletion of RF1. Investigated in Figures 3 and 4.	Strain #796	ΔmutS:zeo.ΔprfA (GenBank ID: CP006698)	0	∆RF1	n/a	AA
Cell line (E. coli)	GRO.AA.∆arfB	this paper	C31GIB.arfB:tolCorf.ΔλRed; Strain #1230	MG1655-derived strain with all 321 UAG codons mutated to UAA, deletion of RF1, and deletion of arfB. Investigated in Figures 3 and 4.	Strain #1230	ΔmutS:zeo.ΔprfA.arfB:tolC	0	∆RF1	∆arfB	AA
Cell line (E. coli)	GRO.AA.∆ssrA	this paper	C31GIB.ssrA:tolC.ΔλRed; Strain #1231	MG1655-derived strain with all 321 UAG codons mutated to UAA, deletion of RF1, and deletion of ssrA. Investigated in Figures 3 and 4.	Strain #1231	ΔmutS:zeo.ΔprfA.ssrA:tolC	0	∆RF1	∆ssrA	AA
Cell line (E. coli)	GRO.AA.∆arfA	this paper	C31GIB.arfA:tolC.ΔλRed; Strain #1232	MG1655-derived strain with all 321 UAG codons mutated to UAA, deletion of RF1, and deletion of arfA. Investigated in Figures 3 and 4.	Strain #1232	ΔmutS:zeo.ΔprfA.arfA:tolC	0	∆RF1	∆arfA	AA
Cell line (E. coli)	GRO.AA.∆ssrA.∆arfB	this paper	C31GIB.ΔarfB.ssrA:tolC.ΔλRed; Strain #1233	MG1655-derived strain with all 321 UAG codons mutated to UAA, deletion of RF1, and deletion of ssrA and arfB. Investigated in Figures 3 and 4.	Strain #1233	ΔmutS:zeo.ΔprfA.ΔarfB.ssrA:tolC	0	∆RF1	∆ssrA.∆arfB	AA
Cell line (E. coli)	GRO.AA.∆arfA.∆arfB	this paper	C31GIB.ΔarfB.arfA:tolC.ΔλRed; Strain #1234	MG1655-derived strain with all 321 UAG codons mutated to UAA, deletion of RF1, and deletion of arfA and arfB. Investigated in Figures 3 and 4.	Strain #1234	ΔmutS:zeo.ΔprfA.ΔarfB.arfA:tolC	0	∆RF1	∆arfA.∆arfB	AA
Antibody	mouse anti-GFP antibody	other	Invitrogen (Ref#: 332600, Lot#: 1513862A)	Invitrogen (Ref#: 332600, Lot#: 1513862A); (5.5 μL antibody in 3 mL Milk + TBST)	N/A	N/A	N/A	N/A	N/A	N/A
Antibody	goat anti-mouse antibody	other	AbCam (Ref#: ab7023, Lot#: GR157827-1)	AbCam (Ref#: ab7023, Lot#: GR157827-1); (2.2 μL antibody in 10 mL Milk + TBST)	N/A	N/A	N/A	N/A	N/A	N/A
Recombinant DNA reagent	ssrA:tolC	this paper; for use, see tolC positive/negative selection in 10.1038/nprot.2014.081	dsDNA NJM111	The E. coli native tolC gene used to delete ssrA gene via recombineering (10.1038/nprot.2008.227).	dsDNA NJM111	N/A	N/A	N/A	N/A	N/A
Recombinant DNA reagent	arfA:tolC	this paper; for use, see tolC positive/negative selection in 10.1038/nprot.2014.081	dsDNA NJM112	The E. coli native tolC gene used to delete arfA gene via recombineering (10.1038/nprot.2008.227).	dsDNA NJM112	N/A	N/A	N/A	N/A	N/A
Recombinant DNA reagent	arfB:tolC	this paper; for use, see tolC positive/negative selection in 10.1038/nprot.2014.081	dsDNA NJM113	The E. coli native tolC gene used to delete arfB gene via recombineering (10.1038/nprot.2008.227).	dsDNA NJM113	N/A	N/A	N/A	N/A	N/A
Software, algorithm	Doubling time algorithm	10.1126/science.1241459	Growth_Analyze_GK.m	Doubling time used in 10.1126/science.1241459, written by Gleb Kuznetsov in the lab of Dr. George Church.	N/A	N/A	N/A	N/A	N/A	N/A
Software, algorithm	MaxQuant v1.5.1.2	other	N/A	Commercial software for mass spectrometry analysis.	N/A	N/A	N/A	N/A	N/A	N/A
Software, algorithm	Graphpad Prism 7	other	N/A	Commercial software for statistical analysis and graphing, provided through Yale University.	N/A	N/A	N/A	N/A	N/A	N/A

Strains and media

All bacteria used in this study are derived from E. coli ECNR2, which is in turn derived from E. coli MG1655 (GenBank ID: U00096) in which mutS is replaced by a zeocin resistance cassette (Wang et al., 2009; Lajoie et al., 2013b). Additionally, the native bioAB genes found in MG1655 are replaced by the lambda red cassette in ECNR2. This strain is designated ECNR2.AA (see Table 1 for full genotype). For experiments expressing UAG-GFP and UAA-GFP for mass spectrometry, strains with all 321 UAG codons changed to UAA (designated ‘GRO’ strains) were used to control for potential differences in protein expression arising from these mutations (GenBank ID for GRO.AA: CP006698). For all other experiments, control strains labeled wild-type (WT) are MG1655 derivatives retaining all 321 UAG codons. All deletions of ssrA, arfA, and arfB were generated with a tolC resistance cassette via recombineering (Sharan et al., 2009). Modification of the ssrA tag from AANDENYALAA to AANDENYALDD (AA->DD) to increase stability of tagged proteins was performed with MAGE as described previously (Gallagher et al., 2014; Wang et al., 2009). All modifications to strains made in this study were validated through Sanger sequencing (GeneWiz; South Plainfield, NJ).

We performed all protein expression assays and conjugation assays in LB Lennox at pH 7.5. We performed all phage assays in Tryptone-KCl (TK) media as described previously (Jaschke et al., 2012; Ma and Isaacs, 2016; Valentine et al., 2002).

Phages and plasmids

For viral relative titers, we used phage λ cI857 obtained from Dr. John Wertz at the Yale Coli Genetic Stock Center (CGSC) because it is obligately lytic at 37°C, preventing possible confounding factors from lysogeny. We used the conjugative plasmid RK2 described in Isaacs et al. (2011), which is a derivative of the RK2 plasmid described in Pansegrau et al. (1994) carrying bla^R instead of kan^R. The complete nucleotide sequence for the plasmid is available in NCBI database, Accession L27758.1 and GI 508311. We obtained the F plasmid from the Yale CGSC (NCBI Accession AP001918.1, GI: 8918823) and added Kan^R from plasmid pZE21 for antibiotic selection.

To create the UAG-GFP and UAA-GFP constructs for protein expression, we cloned an eGFP construct with a C-terminal 6xHis tag downstream of pLtetO into a modified pZE21 vector with kanamycin resistance (kan^R) carrying a copy of the tet repressor gene (tetR) to prevent leaked gene expression. We then modified the stop codon of the eGFP construct to end in either a UAG or UAA stop codon.

Protein expression and purification

To obtain GFP for analysis via mass spectrometry, we transformed UAG-GFP and UAA-GFP constructs into wild-type and GRO strains carrying the AA->DD modification in the ssrA tag to prolong the half-life of tagged peptides. Experiments in the absence of the AA->DD modification yielded no peptides with ssrA degradation tags (data not shown). We then grew 50 mL cultures of each strain at 33°C in LB Lennox with 30 μg/mL kanamycin to an OD₆₀₀ of 1.0 and induced protein expression with the addition of 30 ng/μL anhydrotetracycline (aTC). After incubation overnight, we pelleted cells and resuspended them in sterile phosphate buffer solution, then lysed cells via sonication. Cell debris was then pelleted by centrifugation and GFP was purified from the supernatant via a nickel resin affinity column. To concentrate protein and exchange buffer for subsequent trypsin digest, we then concentrated GFP via Millipore Amicon spin columns.

For western blots on whole-cell lysates, we transformed UAG-GFP and UAA-GFP constructs into wild-type, GRO, and GRO strains with deletions of the ribosomal rescue systems. We then grew 5 mL cultures of each strain at 33°C in LB Lennox with kanamycin overnight, then diluted all cultures to an OD₆₀₀ of 0.15 in fresh media containing 30 μg/mL kanamycin and 30 ng/μL aTC and grew them for 20 hr. To quantify protein expression and compare across strains, we normalized the OD₆₀₀ of all cultures to 2.5 and pelleted 1 mL of this culture, which we placed at −80°C for 2 hr. We then re-suspended cell pellets in lysis buffer described previously (Aerni et al., 2015), incubated for 10 min on ice, centrifuged lysate, and ran 1:10 dilutions of resulting supernatant on gels for western blot analysis. Overnight starter cultures were diluted to an OD₆₀₀ of 0.15 into three separate culture tubes, and cells within each tube were induced in parallel for GFP expression. GFP was purified from each of these cultures in parallel.

Mass spectrometry and proteomic analysis

Trypsin digest, sample preparation for mass spectrometry, and liquid chromatography elution gradients were performed as described previously (Aerni et al., 2015). Desalted peptides were injected onto a 75 μm ID PicoFrit column (New Objective) packed to 50 cm in length with 1.9 μm ReproSil-Pur 120 Å C18-AQ (Dr. Maisch). Samples were eluted over a 90 min gradient using an EASY-nLC 1000 UPLC (Thermo) paired with a Q Exactive Plus (Thermo), using the following parameters: (MS1) 70,000 resolution, 3 × 10⁶ AGC target, 300–1700 m/z scan range; (MS2) 17,500 resolution, 1 × 10⁶ AGC target, top 10 mode, 1.6 m/z isolation window, 27 normalized collision energy, 90 s dynamic exclusion, unassigned and +1 charge exclusion. Peptide identification from collected spectra was performed using MaxQuant v1.5.1.2 (Cox and Mann, 2008). Samples were searched using custom databases representing potential translational outcomes in response to the UAG codon within the GFP reporter construct (Supplementary files 3 and 4), as well as the E. coli proteome (EcoCyc K-12 MG1655 v17). The searches considered carbamidomethyl (Cys) as a fixed modification and the following variable modifications: acetyl (N-terminal), oxidation (Met), deamidation (Asn, Gln), and phosphorylation (Ser/Thr/Tyr). Discovered peptides had a minimum length of five amino acids and could contain up to three trypsin miscleavage events. A 1% false discovery rate was used. The mass spectrometry proteomics data and the custom search databases have been deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository (Vizcaíno et al., 2014) with the dataset identifier PXD009643. Mass spectrometry spectra were manually validated by identifying all spectra with an MS/MS score over 15 and verifying the presence of sufficient b- and/or y-ion series.
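The peptide search space implied by the settings above (minimum length of five amino acids, up to three missed trypsin cleavages) can be illustrated with a small in silico digest. This is a sketch under stated assumptions and is not MaxQuant's implementation: it uses the common rule that trypsin cleaves after K or R unless the next residue is P, the function name is hypothetical, and the example sequence is invented for illustration. It relies on `re.split` accepting zero-width matches, which requires Python 3.7 or later.

```python
import re


def tryptic_peptides(protein: str, max_missed: int = 3, min_length: int = 5) -> set:
    """Enumerate tryptic peptides allowing up to `max_missed` missed cleavages."""
    # Cleave after K or R, except when the following residue is P.
    fragments = [f for f in re.split(r"(?<=[KR])(?!P)", protein) if f]
    peptides = set()
    for start in range(len(fragments)):
        last = min(start + max_missed, len(fragments) - 1)
        for end in range(start, last + 1):
            # Joining (end - start + 1) fragments corresponds to (end - start) missed cleavages.
            peptide = "".join(fragments[start:end + 1])
            if len(peptide) >= min_length:
                peptides.add(peptide)
    return peptides


# Invented toy sequence ending in a 6xHis-like tail (not the reporter sequence).
toy_sequence = "MAGKPLRSTEEKLHHHHHH"
for peptide in sorted(tryptic_peptides(toy_sequence)):
    print(peptide)
```

Allowing missed cleavages enlarges the set of candidate peptides, which matters here because the C-terminal products of interest may include sequences that trypsin cuts incompletely.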

Western blot experiments and analysis

Western blots were run as described previously using SDS-PAGE gels (Pirman et al., 2015). We ran GFP-6xHis standards of known amount (1, 10, 50, and 100 ng) alongside experimental samples and used these standards to generate linear-range calibration curves to quantify protein abundance in experimental samples (Figure 3—figure supplement 1). Because the antibody signal appeared sublinear in the 0–10 ng regime when we performed linear regression using all standards, we generated separate linear fits using the 1–10 ng standards and the 10–100 ng standards. We then determined experimental sample concentrations using these linear approximations. 20 of the 24 experimental samples quantified fell within or slightly above the 10–100 ng range (with the highest-intensity sample quantified as 136 ng), and 3 of the 24 samples fell within the 1–10 ng range. The one remaining sample, which had a weaker intensity than that of the 1 ng standard, was quantified through a linear approximation between the intensity of the 1 ng sample and of a blank lane with an assumed intensity of zero.
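The piecewise calibration described above can be sketched as follows. The standard amounts (1, 10, 50, and 100 ng) come from the text, but the band intensities are hypothetical values chosen for illustration, and the helper names are ours. The sketch fits intensity to amount separately for the 1–10 ng and 10–100 ng standards and, for bands weaker than the 1 ng standard, interpolates between that standard and a blank with an assumed intensity of zero.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares line y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Standard amounts from the text; intensities are HYPOTHETICAL (arbitrary units).
STANDARDS_NG = [1.0, 10.0, 50.0, 100.0]
INTENSITIES = [200.0, 1100.0, 9100.0, 19100.0]

# Separate fits (intensity -> ng) for the 1-10 ng and 10-100 ng standards.
LOW_FIT = linear_fit(INTENSITIES[:2], STANDARDS_NG[:2])
HIGH_FIT = linear_fit(INTENSITIES[1:], STANDARDS_NG[1:])


def quantify(intensity):
    """Estimate protein amount (ng) from a band intensity using the piecewise fits."""
    if intensity < INTENSITIES[0]:
        # Weaker than the 1 ng standard: line from a blank (0 intensity, 0 ng) to 1 ng.
        return STANDARDS_NG[0] * intensity / INTENSITIES[0]
    slope, intercept = LOW_FIT if intensity <= INTENSITIES[1] else HIGH_FIT
    return slope * intensity + intercept


for band in (100.0, 650.0, 5100.0):
    print(f"intensity {band:7.1f} -> {quantify(band):.1f} ng")
```

Bands above the highest standard are extrapolated with the 10–100 ng fit, mirroring how samples slightly above that range are treated in the text.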

We expressed GFP-6xHis as described above, normalized cell cultures to an OD₆₀₀ of 2.5, and lysed cells using BugBuster protein extraction reagent (Merck, Darmstadt, Germany). We then ran 10 µl of 1/150 diluted lysate per lane of the SDS-PAGE gel. We obtained primary mouse anti-GFP antibody from Invitrogen (Ref#: 332600, Lot#: 1513862A; RRID:AB_2234927) and goat anti-mouse antibody from AbCam (Ref#: ab7023, Lot#: GR157827-1; RRID:AB_955413). Western blots were developed using Bio-Rad Clarity Western ECL Blotting Substrate and imaged on a GE Amersham Imager 600. We performed quantification of western blot bands as described previously (Pirman et al., 2015). We repeated three western blots in parallel for each strain induced in separate culture tubes (i.e. biological triplicates, see Protein expression and purification).

Viral relative titers

To quantify relative titers, we mixed 100-fold dilutions of phage with 300 µL of mid-log (OD₆₀₀ = 0.5) cells in 3 mL of TK soft agar and poured onto TK solid agar plates. Starter cultures of cells were diluted to an OD₆₀₀ of 0.5 into three separate culture tubes, and cells within each tube were infected with phage lambda in parallel (i.e. biological triplicate). Each tube was plated on a separate TK solid agar plate. We incubated plates overnight at 37°C, and counted plaques the next day.
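Plaque counts from such plates are conventionally converted to a titer in PFU/mL and then expressed relative to a reference strain. The sketch below shows that standard calculation only; the plaque counts, overall dilution, and plated phage volume are hypothetical, since the text does not state them, and the function names are ours.

```python
def pfu_per_ml(plaques: int, dilution_fold: float, volume_plated_ml: float) -> float:
    """Titer of the undiluted stock, given plaques counted at a known overall dilution."""
    return plaques * dilution_fold / volume_plated_ml


def relative_titer(titer_on_strain: float, titer_on_reference: float) -> float:
    """Titer on a test strain expressed as a fraction of the titer on a reference strain."""
    return titer_on_strain / titer_on_reference


# HYPOTHETICAL example: three serial 100-fold dilutions (10^6-fold overall), 0.1 mL plated.
titer_wt = pfu_per_ml(plaques=100, dilution_fold=1e6, volume_plated_ml=0.1)
titer_test = pfu_per_ml(plaques=25, dilution_fold=1e6, volume_plated_ml=0.1)
print(f"reference titer: {titer_wt:.2e} PFU/mL")
print(f"relative titer:  {relative_titer(titer_test, titer_wt):.2f}")
```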

Quantifying conjugation

We used conjugation conditions described previously (Ma and Isaacs, 2016; Ma et al., 2014). Briefly, we grew cultures of donor and recipient cells to late log in antibiotics selecting for plasmid or recipient and then rinsed and re-suspended in media to remove antibiotics. After concentrating cells to an OD₆₀₀ of 20, we mixed donors and recipients in a 1:1 ratio and spotted onto pre-warmed LB Lennox agar plates in a pattern of 2 × 20 μL and 6 × 10 μL spots. For F, we incubated plates at 37°C for 2 hr, then rinsed cells off plate, diluted serially 10-fold, and plated serial dilutions on plates containing antibiotic selecting for conjugants and incubated overnight at 37°C. For RK2, we incubated plates at 37°C for 1 hr, then plated on agar plates selecting for the recipient. To quantify the rate of transfer, we then picked 86 colonies from plates selecting for the recipient strain and patched them onto plates selecting for both recipient and conjugative plasmid, incubated plates overnight at 37°C, and counted the number of patched colonies that grew. After the conjugation, colonies were plated three times to generate technical triplicates.
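For RK2, the percent transfer follows directly from the patching step: the number of patched colonies that grow on double selection divided by the 86 colonies picked. A minimal sketch, with a hypothetical function name and hypothetical colony counts for the three technical replicates:

```python
def percent_transfer(colonies_grown: int, colonies_patched: int = 86) -> float:
    """Percentage of patched recipient colonies that also carry the conjugative plasmid."""
    if not 0 <= colonies_grown <= colonies_patched:
        raise ValueError("colonies_grown must be between 0 and colonies_patched")
    return 100.0 * colonies_grown / colonies_patched


# HYPOTHETICAL counts from three technical replicates of one conjugation.
replicate_counts = [85, 84, 86]
replicate_percents = [percent_transfer(count) for count in replicate_counts]
mean_percent = sum(replicate_percents) / len(replicate_percents)
print(f"mean percent transfer: {mean_percent:.1f}%")
```

With 86 colonies per replicate, each colony contributes a little over one percentage point, which sets the resolution of this measurement.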

Statistical and data analysis

We performed all t-tests and one-way ANOVA tests for statistical significance in GraphPad Prism 7. We calculated doubling times and maximum OD₆₀₀ values from growth curve data using MATLAB (Newton, MA) code that we generated (Source code 1).
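The doubling-time analysis itself was done with the MATLAB code in Source code 1, which is not reproduced here. As a hedged illustration of the general approach, the Python sketch below fits a line to ln(OD₆₀₀) against time over an exponential-phase window and converts the slope to a doubling time, then computes the percent increase plotted in Figure 4B. The function names and the synthetic growth data are assumptions, and selection of the exponential window is omitted.

```python
import math


def growth_rate(times, od_values):
    """Least-squares slope of ln(OD) versus time (per unit time)."""
    logs = [math.log(od) for od in od_values]
    n = len(times)
    mean_t = sum(times) / n
    mean_l = sum(logs) / n
    sxy = sum((t - mean_t) * (l - mean_l) for t, l in zip(times, logs))
    sxx = sum((t - mean_t) ** 2 for t in times)
    return sxy / sxx


def doubling_time(times, od_values):
    """Doubling time in the same units as `times`, assuming exponential growth."""
    return math.log(2) / growth_rate(times, od_values)


def percent_increase(with_plasmid, without_plasmid):
    """Percent increase in doubling time for a plasmid-bearing strain."""
    return 100.0 * (with_plasmid - without_plasmid) / without_plasmid


# SYNTHETIC exponential-phase data (minutes, OD600) with known doubling times.
times_min = [0, 10, 20, 30, 40, 50, 60]
od_without = [0.05 * 2 ** (t / 30.0) for t in times_min]
od_with = [0.05 * 2 ** (t / 38.4) for t in times_min]

dt_without = doubling_time(times_min, od_without)
dt_with = doubling_time(times_min, od_with)
print(f"doubling times: {dt_without:.1f} and {dt_with:.1f} min")
print(f"percent increase: {percent_increase(dt_with, dt_without):.0f}%")
```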

Experimental replicates

We used the definitions for biological and technical replicates outlined in Blainey et al. (2014). Biological replicates consist of parallel measurements of different biological samples subjected to the same experiment, and technical replicates are parallel measurements of a single biological sample subjected to experimentation. Data represented in Figures 3, 4B, and 4D are biological replicates; data represented in Figures 4A and 4C are technical replicates. Data for all 96-well plate assays (Figures 3A, 3B, and 4B) were obtained as biological replicates: one well of each sample was grown overnight as a starter culture in a 96-well plate, and starter cultures were then inoculated into three separate wells in a separate 96-well plate.

Data availability

Sequences of strains used have been previously published with the appropriate citations. Modifications (e.g., gene deletions) to those strains are described in full in the Tables, Key Resource Guide, methods and supplementary material. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository (Vizcaíno et al., 2014) with the dataset identifier PXD009643.

The following data sets were generated

1. Jing Ma N
2. Hemez CF
3. Barber KW
4. Rinehart J
5. Isaacs F
(2018) Mass spectrometry proteomics data from "Organisms with alternative genetic codes resolve unassigned codons via mistranslation and ribosomal rescue"
Publicly available at ProteomeXchange (accession no: PXD009643).

http://proteomecentral.proteomexchange.org/cgi/GetDataset?ID=PXD009643

References

(2015) Revealing the amino acid composition of proteins within an expanded genetic code
Nucleic Acids Research 43:e8.

https://doi.org/10.1093/nar/gku1087

(2007) Natural expansion of the genetic code
Nature Chemical Biology 3:29–35.

https://doi.org/10.1038/nchembio847

1. Amiram M
2. Haimovich AD
3. Fan C
4. Wang YS
5. Aerni HR
6. Ntai I
7. Moonan DW
8. Ma NJ
9. Rovner AJ
10. Hong SH
11. Kelleher NL
12. Goodman AL
13. Jewett MC
14. Söll D
15. Rinehart J
16. Isaacs FJ
(2015) Evolution of translation machinery in recoded bacteria enables multi-site incorporation of nonstandard amino acids
Nature Biotechnology 33:1272–1279.

https://doi.org/10.1038/nbt.3372
1. Atkins JF
2. Loughran G
3. Bhatt PR
4. Firth AE
5. Baranov PV
(2016) Ribosomal frameshifting and transcriptional slippage: from genetic steganography and cryptography to adventitious use
Nucleic Acids Research 44:7007–7078.

https://doi.org/10.1093/nar/gkw530
1. Baneyx F
2. Mujacic M
(2004) Recombinant protein folding and misfolding in Escherichia coli
Nature Biotechnology 22:1399–1408.

https://doi.org/10.1038/nbt1029
(2015) Augmented genetic decoding: global, local and temporal alterations of decoding processes and codon meaning
Nature Reviews Genetics 16:517–529.

https://doi.org/10.1038/nrg3963
1. Blainey P
2. Krzywinski M
3. Altman N
(2014) Points of significance: replication
Nature Methods 11:879–880.

https://doi.org/10.1038/nmeth.3091
1. Blank A
2. Gallant JA
3. Burgess RR
4. Loeb LA
(1986) An RNA polymerase mutant with reduced accuracy of chain elongation
Biochemistry 25:5920–5928.

https://doi.org/10.1021/bi00368a013
1. Boeke JD
2. Church G
3. Hessel A
4. Kelley NJ
5. Arkin A
6. Cai Y
7. Carlson R
8. Chakravarti A
9. Cornish VW
10. Holt L
11. Isaacs FJ
12. Kuiken T
13. Lajoie M
14. Lessor T
15. Lunshof J
16. Maurano MT
17. Mitchell LA
18. Rine J
19. Rosser S
20. Sanjana NE
21. Silver PA
22. Valle D
23. Wang H
24. Way JC
25. Yang L
(2016) Genome engineering. The Genome Project-Write
Science 353:126–127.

https://doi.org/10.1126/science.aaf6850
1. Chadani Y
2. Ono K
3. Ozawa S
4. Takahashi Y
5. Takai K
6. Nanamiya H
7. Tozawa Y
8. Kutsukake K
9. Abo T
(2010) Ribosome rescue by Escherichia coli ArfA (YhdL) in the absence of trans-translation system
Molecular Microbiology 78:796–808.

https://doi.org/10.1111/j.1365-2958.2010.07375.x
1. Chadani Y
2. Matsumoto E
3. Aso H
4. Wada T
5. Kutsukake K
6. Sutou S
7. Abo T
(2011) trans-translation-mediated tight regulation of the expression of the alternative ribosome-rescue factor ArfA in Escherichia coli
Genes & Genetic Systems 86:151–163.

https://doi.org/10.1266/ggs.86.151
1. Chadani Y
2. Ito K
3. Kutsukake K
4. Abo T
(2012) ArfA recruits release factor 2 to rescue stalled ribosomes by peptidyl-tRNA hydrolysis in Escherichia coli
Molecular Microbiology 86:37–50.

https://doi.org/10.1111/j.1365-2958.2012.08190.x
(2008) Virus attenuation by genome-scale changes in codon pair bias
Science 320:1784–1787.

https://doi.org/10.1126/science.1155761
1. Cox J
2. Mann M
(2008) MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification
Nature Biotechnology 26:1367–1372.

https://doi.org/10.1038/nbt.1511
(2011) Nascent polypeptide sequences that influence ribosome function
Current Opinion in Microbiology 14:160–166.

https://doi.org/10.1016/j.mib.2011.01.011
1. Davies J
(1994) Inactivation of antibiotics and the dissemination of resistance genes
Science 264:375–382.

https://doi.org/10.1126/science.8153624
1. Gallagher RR
2. Li Z
3. Lewis AO
4. Isaacs FJ
(2014) Rapid editing and evolution of bacterial genomes using libraries of synthetic DNA
Nature Protocols 9:2301–2316.

https://doi.org/10.1038/nprot.2014.082
(2011) tmRNA regulates synthesis of the ArfA ribosome rescue factor
Molecular Microbiology 80:1204–1219.

https://doi.org/10.1111/j.1365-2958.2011.07638.x
1. George S
2. Aguirre JD
3. Spratt DE
4. Bi Y
5. Jeffery M
6. Shaw GS
7. O'Donoghue P
(2016) Generation of phospho-ubiquitin variants by orthogonal translation reveals codon skipping
FEBS Letters 590:1530–1542.

https://doi.org/10.1002/1873-3468.12182
1. Gingold H
2. Pilpel Y
(2011) Determinants of translation efficiency and accuracy
Molecular Systems Biology 7:481.

https://doi.org/10.1038/msb.2011.14
1. Gogarten JP
2. Townsend JP
(2005) Horizontal gene transfer, genome innovation and evolution
Nature Reviews Microbiology 3:679–687.

https://doi.org/10.1038/nrmicro1204
1. Graille M
2. Séraphin B
(2012) Surveillance pathways rescuing eukaryotic ribosomes lost in translation
Nature Reviews Molecular Cell Biology 13:727–735.

https://doi.org/10.1038/nrm3457
1. Grosskopf T
2. Soyer OS
(2014) Synthetic microbial communities
Current Opinion in Microbiology 18:72–77.

https://doi.org/10.1016/j.mib.2014.02.002
1. Handa Y
2. Inaho N
3. Nameki N
(2011) YaeJ is a novel ribosome-associated protein in Escherichia coli that can hydrolyze peptidyl-tRNA on stalled ribosomes
Nucleic Acids Research 39:1739–1748.

https://doi.org/10.1093/nar/gkq1097
1. Hayes CS
2. Bose B
3. Sauer RT
(2002) Stop codons preceded by rare arginine codons are efficient determinants of SsrA tagging in Escherichia coli
PNAS 99:3440–3445.

https://doi.org/10.1073/pnas.052707199
1. Heinemann IU
2. Rovner AJ
3. Aerni HR
4. Rogulina S
5. Cheng L
6. Olds W
7. Fischer JT
8. Söll D
9. Isaacs FJ
10. Rinehart J
(2012) Enhanced phosphoserine insertion during Escherichia coli protein synthesis via partial UAG codon reassignment and release factor 1 deletion
FEBS Letters 586:3716–3722.

https://doi.org/10.1016/j.febslet.2012.08.031
1. Hillesland KL
2. Stahl DA
(2010) Rapid evolution of stability and productivity at the origin of a microbial mutualism
PNAS 107:2124–2129.

https://doi.org/10.1073/pnas.0908456107
1. Hsu LM
2. Klee HJ
3. Zagorski J
4. Fournier MJ
(1984) Structure of an Escherichia coli tRNA operon containing linked genes for arginine, histidine, leucine, and proline tRNAs
Journal of Bacteriology 158:934–942.
1. Huang WM
2. Ao SZ
3. Casjens S
4. Orlandi R
5. Zeikus R
6. Weiss R
7. Winge D
8. Fang M
(1988) A persistent untranslated sequence within bacteriophage T4 DNA topoisomerase gene 60
Science 239:1005–1012.

https://doi.org/10.1126/science.2830666
(2014) Ends of the line for tmRNA-SmpB
Frontiers in Microbiology 5:421.

https://doi.org/10.3389/fmicb.2014.00421
1. Isaacs FJ
2. Carr PA
3. Wang HH
4. Lajoie MJ
5. Sterling B
6. Kraal L
7. Tolonen AC
8. Gianoulis TA
9. Goodman DB
10. Reppas NB
11. Emig CJ
12. Bang D
13. Hwang SJ
14. Jewett MC
15. Jacobson JM
16. Church GM
(2011) Precise manipulation of chromosomes in vivo enables genome-wide codon replacement
Science 333:348–353.

https://doi.org/10.1126/science.1205822
(2012) A fully decompressed synthetic bacteriophage øX174 genome assembled and archived in yeast
Virology 434:278–284.

https://doi.org/10.1016/j.virol.2012.09.020
(1996) Role of a peptide tagging system in degradation of proteins synthesized from damaged messenger RNA
Science 271:990–993.

https://doi.org/10.1126/science.271.5251.990
1. Keiler KC
2. Feaga HA
(2014) Resolving nonstop translation complexes is a matter of life or death
Journal of Bacteriology 196:2123–2130.

https://doi.org/10.1128/JB.01490-14
1. Keiler KC
(2015) Mechanisms of ribosome rescue in bacteria
Nature Reviews Microbiology 13:285–297.

https://doi.org/10.1038/nrmicro3438
(2001) Rewiring the keyboard: evolvability of the genetic code
Nature Reviews Genetics 2:49–58.

https://doi.org/10.1038/35047500
1. Krakauer DC
2. Jansen VA
(2002) Red queen dynamics of protein translation
Journal of Theoretical Biology 218:97–109.

https://doi.org/10.1006/jtbi.2002.3054
1. Lajoie MJ
2. Kosuri S
3. Mosberg JA
4. Gregg CJ
5. Zhang D
6. Church GM
(2013a) Probing the limits of genetic recoding in essential genes
Science 342:361–363.

https://doi.org/10.1126/science.1241460
1. Lajoie MJ
2. Rovner AJ
3. Goodman DB
4. Aerni HR
5. Haimovich AD
6. Kuznetsov G
7. Mercer JA
8. Wang HH
9. Carr PA
10. Mosberg JA
11. Rohland N
12. Schultz PG
13. Jacobson JM
14. Rinehart J
15. Church GM
16. Isaacs FJ
(2013b) Genomically recoded organisms expand biological functions
Science 342:357–360.

https://doi.org/10.1126/science.1241459
1. Lau YH
2. Stirling F
3. Kuo J
4. Karrenbelt MAP
5. Chan YA
6. Riesselman A
7. Horton CA
8. Schäfer E
9. Lips D
10. Weinstock MT
11. Gibson DG
12. Way JC
13. Silver PA
(2017) Large-scale recoding of a bacterial genome by iterative recombineering of synthetic DNA
Nucleic Acids Research 45:6971–6980.

https://doi.org/10.1093/nar/gkx415
1. Li X
2. Yokota T
3. Ito K
4. Nakamura Y
5. Aiba H
(2007) Reduced action of polypeptide release factors induces mRNA cleavage and tmRNA tagging at stop codons in Escherichia coli
Molecular Microbiology 63:116–126.

https://doi.org/10.1111/j.1365-2958.2006.05498.x
1. Li GW
2. Oh E
3. Weissman JS
(2012) The anti-Shine-Dalgarno sequence drives translational pausing and codon choice in bacteria
Nature 484:538–541.

https://doi.org/10.1038/nature10965
1. Lutz R
2. Bujard H
(1997) Independent and tight regulation of transcriptional units in Escherichia coli via the LacR/O, the TetR/O and AraC/I1-I2 regulatory elements
Nucleic Acids Research 25:1203–1210.

https://doi.org/10.1093/nar/25.6.1203
(2014) Precise manipulation of bacterial chromosomes by conjugative assembly genome engineering
Nature Protocols 9:2285–2300.

https://doi.org/10.1038/nprot.2014.081
1. Ma NJ
2. Isaacs FJ
(2016) Genomic recoding broadly obstructs the propagation of horizontally transferred genetic elements
Cell Systems 3:199–207.

https://doi.org/10.1016/j.cels.2016.06.009
1. Manley JL
(1978) Synthesis and degradation of termination and premature-termination fragments of beta-galactosidase in vitro and in vivo
Journal of Molecular Biology 125:407–432.

https://doi.org/10.1016/0022-2836(78)90308-X
(2013) Preparing synthetic biology for the world
Frontiers in Microbiology 4:5.

https://doi.org/10.3389/fmicb.2013.00005
1. Moore SD
2. Sauer RT
(2005) Ribosome rescue: tmRNA tagging activity and capacity in Escherichia coli
Molecular Microbiology 58:456–466.

https://doi.org/10.1111/j.1365-2958.2005.04832.x
1. Mukai T
2. Hayashi A
3. Iraha F
4. Sato A
5. Ohtake K
6. Yokoyama S
7. Sakamoto K
(2010) Codon reassignment in the Escherichia coli genetic code
Nucleic Acids Research 38:8188–8195.

https://doi.org/10.1093/nar/gkq707
(2016) Emergent rules for codon choice elucidated by editing rare arginine codons in Escherichia coli
PNAS 113:E5588–E5597.

https://doi.org/10.1073/pnas.1605856113
(2000) Lateral gene transfer and the nature of bacterial innovation
Nature 405:299–304.

https://doi.org/10.1038/35012500
1. Ostrov N
2. Landon M
3. Guell M
4. Kuznetsov G
5. Teramoto J
6. Cervantes N
7. Zhou M
8. Singh K
9. Napolitano MG
10. Moosburner M
11. Shrock E
12. Pruitt BW
13. Conway N
14. Goodman DB
15. Gardner CL
16. Tyree G
17. Gonzales A
18. Wanner BL
19. Norville JE
20. Lajoie MJ
21. Church GM
(2016) Design, synthesis, and testing toward a 57-codon genome
Science 353:819–822.

https://doi.org/10.1126/science.aaf3639
1. Pansegrau W
2. Lanka E
3. Barth PT
4. Figurski DH
5. Guiney DG
6. Haas D
7. Helinski DR
8. Schwab H
9. Stanisich VA
10. Thomas CM
(1994) Complete nucleotide sequence of Birmingham IncP alpha plasmids. Compilation and comparative analysis
Journal of Molecular Biology 239:623–663.

https://doi.org/10.1006/jmbi.1994.1404
Book
1. Pech M
2. Vesper O
3. Yamamoto H
4. Wilson DN
5. Nierhaus KH
(2010) The E Site and Its Importance for Improving Accuracy and Preventing Frameshifts
In: Atkins JF, Gesteland RF, editors. Recoding: Expansion of Decoding Rules Enriches Gene Expression. New York: Springer. pp. 345–362.

https://doi.org/10.1007/978-0-387-89382-2_16
(2014) Distinct roles for release factor 1 and release factor 2 in translational quality control
Journal of Biological Chemistry 289:17589–17596.

https://doi.org/10.1074/jbc.M114.564989
1. Pirman NL
2. Barber KW
3. Aerni HR
4. Ma NJ
5. Haimovich AD
6. Rogulina S
7. Isaacs FJ
8. Rinehart J
(2015) A flexible codon in genomically recoded Escherichia coli permits programmable protein phosphorylation
Nature Communications 6:8130.

https://doi.org/10.1038/ncomms9130
1. Richardson SM
2. Mitchell LA
3. Stracquadanio G
4. Yang K
5. Dymond JS
6. DiCarlo JE
7. Lee D
8. Huang CL
9. Chandrasegaran S
10. Cai Y
11. Boeke JD
12. Bader JS
(2017) Design of a synthetic yeast genome
Science 355:1040–1044.

https://doi.org/10.1126/science.aaf4557
1. Roche ED
2. Sauer RT
(1999) SsrA-mediated peptide tagging caused by rare codons and tRNA scarcity
The EMBO Journal 18:4579–4589.

https://doi.org/10.1093/emboj/18.16.4579
1. Rosenberger RF
2. Hilton J
(1983) The frequency of transcriptional and translational errors at nonsense codons in the lacZ gene of Escherichia coli
MGG Molecular & General Genetics 191:207–212.

https://doi.org/10.1007/BF00334815
(2012) Proteobacterial ArfA peptides are synthesized from non-stop messenger RNAs
Journal of Biological Chemistry 287:29765–29775.

https://doi.org/10.1074/jbc.M112.374074
1. Shackelton LA
2. Holmes EC
(2008) The role of alternative genetic codes in viral evolution and emergence
Journal of Theoretical Biology 254:128–134.

https://doi.org/10.1016/j.jtbi.2008.05.024
(2009) Recombineering: a homologous recombination-based method of genetic engineering
Nature Protocols 4:206–223.

https://doi.org/10.1038/nprot.2008.227
1. Shimizu Y
(2012) ArfA recruits RF2 into stalled ribosomes
Journal of Molecular Biology 423:624–631.

https://doi.org/10.1016/j.jmb.2012.08.007
1. Shine J
2. Dalgarno L
(1974) The 3'-terminal sequence of Escherichia coli 16S ribosomal RNA: complementarity to nonsense triplets and ribosome binding sites
PNAS 71:1342–1346.

https://doi.org/10.1073/pnas.71.4.1342
1. Tu GF
2. Reid GE
3. Zhang JG
4. Moritz RL
5. Simpson RJ
(1995) C-terminal extension of truncated recombinant proteins in Escherichia coli with a 10Sa RNA decapeptide
Journal of Biological Chemistry 270:9322–9326.

https://doi.org/10.1074/jbc.270.16.9322
(2002) Characterization of mutant spectra generated by a forward mutational assay for gene A of Phi X174 from ENU-treated transgenic mouse embryonic cell line PX-2
Environmental and Molecular Mutagenesis 39:55–68.

https://doi.org/10.1002/em.10043
1. Vimberg V
2. Tats A
3. Remm M
4. Tenson T
(2007) Translation initiation region sequence preferences in Escherichia coli
BMC Molecular Biology 8:100.

https://doi.org/10.1186/1471-2199-8-100
1. Vizcaíno JA
2. Deutsch EW
3. Wang R
4. Csordas A
5. Reisinger F
6. Ríos D
7. Dianes JA
8. Sun Z
9. Farrah T
10. Bandeira N
11. Binz PA
12. Xenarios I
13. Eisenacher M
14. Mayer G
15. Gatto L
16. Campos A
17. Chalkley RJ
18. Kraus HJ
19. Albar JP
20. Martinez-Bartolomé S
21. Apweiler R
22. Omenn GS
23. Martens L
24. Jones AR
25. Hermjakob H
(2014) ProteomeXchange provides globally coordinated proteomics data submission and dissemination
Nature Biotechnology 32:223–226.

https://doi.org/10.1038/nbt.2839
1. Wang HH
2. Isaacs FJ
3. Carr PA
4. Sun ZZ
5. Xu G
6. Forest CR
7. Church GM
(2009) Programming cells by multiplex genome engineering and accelerated evolution
Nature 460:894–898.

https://doi.org/10.1038/nature08187
1. Yamamoto Y
2. Sunohara T
3. Jojima K
4. Inada T
5. Aiba H
(2003) SsrA-mediated trans-translation plays a role in mRNA quality control by facilitating degradation of truncated mRNAs
RNA 9:408–418.

https://doi.org/10.1261/rna.2174803
1. Yan S
2. Wen JD
3. Bustamante C
4. Tinoco I
(2015) Ribosome excursions during mRNA translocation mediate broad branching of frameshift pathways
Cell 160:870–881.

https://doi.org/10.1016/j.cell.2015.02.003
1. Zaher HS
2. Green R
(2009) Quality control by the ribosome following peptide bond formation
Nature 457:161–166.

https://doi.org/10.1038/nature07582
1. Zaher HS
2. Green R
(2010) Kinetic basis for global loss of fidelity arising from mismatches in the P-site codon:anticodon helix
RNA 16:1980–1989.

https://doi.org/10.1261/rna.2241810

Article and author information

Author details

Natalie Jing Ma
1. Department of Molecular, Cellular & Developmental Biology, Yale University, New Haven, United States
2. Systems Biology Institute, Yale University, West Haven, United States
Contribution
Conceptualization, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing—original draft, Writing—review and editing

Contributed equally with
Colin F Hemez and Karl W Barber

Competing interests
No competing interests declared

ORCID iD: 0000-0001-7452-9482

Colin F Hemez
1. Systems Biology Institute, Yale University, West Haven, United States
2. Department of Biomedical Engineering, Yale University, New Haven, United States
Contribution
Investigation, Visualization, Writing—original draft, Writing—review and editing

Contributed equally with
Natalie Jing Ma and Karl W Barber

Competing interests
No competing interests declared

ORCID iD: 0000-0003-3445-7706

Karl W Barber
1. Systems Biology Institute, Yale University, West Haven, United States
2. Department of Cellular & Molecular Physiology, Yale University School of Medicine, New Haven, United States
Contribution
Resources, Investigation, Visualization, Writing—review and editing

Contributed equally with
Natalie Jing Ma and Colin F Hemez

Competing interests
No competing interests declared

ORCID iD: 0000-0003-0672-8409

Jesse Rinehart
1. Systems Biology Institute, Yale University, West Haven, United States
2. Department of Cellular & Molecular Physiology, Yale University School of Medicine, New Haven, United States
Contribution
Resources, Supervision, Funding acquisition, Validation, Methodology

For correspondence
jesse.rinehart@yale.edu

Competing interests
No competing interests declared
Farren J Isaacs
1. Department of Molecular, Cellular & Developmental Biology, Yale University, New Haven, United States
2. Systems Biology Institute, Yale University, West Haven, United States
Contribution
Conceptualization, Supervision, Funding acquisition, Methodology, Writing—original draft, Writing—review and editing

For correspondence
farren.isaacs@yale.edu

Competing interests
No competing interests declared

ORCID iD: 0000-0001-8615-8236

Funding

Defense Advanced Research Projects Agency (N66001-12-C-4211)

Farren Isaacs

U.S. Department of Energy (152339.5055249.100)

Farren Isaacs

National Institutes of Health (R01GM117230)

Jesse Rinehart
Farren Isaacs

National Institutes of Health (R01GM125951)

Farren J Isaacs
Jesse Rinehart

National Science Foundation (Graduate Research Fellowship DGE-1122492)

Karl W Barber

Gruber Foundation (Graduate Research Fellowship)

Natalie Jing Ma

Arnold and Mabel Beckman Foundation (Young Investigator Award)

Farren Isaacs

DuPont (Young Professor Award)

Farren Isaacs

National Institutes of Health (Graduate Training Grants T32GM007499, T32GM007223)

Natalie Jing Ma

Defense Advanced Research Projects Agency (HR0011-15-C-0091)

Farren Isaacs

Arnold and Mabel Beckman Foundation (Beckman Scholar Award)

Colin F Hemez

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We thank the members of the Isaacs and Rinehart labs for helpful feedback on this study. We also thank Paul Turner for valuable discussions and experimental advice. The authors gratefully acknowledge support from DARPA (N66001-12-C-4211, HR0011-15-C-0091 to FJI), DOE (152339.5055249.100 to FJI), NIH (R01GM117230, R01GM125951 to FJI and JR, T32GM007499 and T32GM007223 to NJM), and NSF (DGE-1122492 to KWB). The authors also thank the Gruber Foundation (NJM), the Arnold and Mabel Beckman Foundation (FJI and CFH), and DuPont Inc. (FJI) for funding.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.