Codon-level information improves predictions of inter-residue contacts in proteins by correlated mutation analysis
Abstract
Methods for analysing correlated mutations in proteins are becoming an increasingly powerful tool for predicting contacts within and between proteins. Nevertheless, limitations remain due to the requirement for large multiple sequence alignments (MSA) and the fact that, in general, only the relatively small number of top-ranking predictions are reliable. To date, methods for analysing correlated mutations have relied exclusively on amino acid MSAs as inputs. Here, we describe a new approach for analysing correlated mutations that is based on combined analysis of amino acid and codon MSAs. We show that a direct contact is more likely to be present when the correlation between the positions is strong at the amino acid level but weak at the codon level. The performance of different methods for analysing correlated mutations in predicting contacts is shown to be enhanced significantly when amino acid and codon data are combined.
Article and author information
Author details
Copyright
© 2015, Jacob et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 3,147
- views
-
- 605
- downloads
-
- 8
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Biochemistry and Chemical Biology
- Structural Biology and Molecular Biophysics
The phase separation of intrinsically disordered proteins is emerging as an important mechanism for cellular organization. However, efforts to connect protein sequences to the physical properties of condensates, that is, the molecular grammar, are hampered by a lack of effective approaches for probing high-resolution structural details. Using a combination of multiscale simulations and fluorescence lifetime imaging microscopy experiments, we systematically explored a series of systems consisting of diblock elastin-like polypeptides (ELPs). The simulations succeeded in reproducing the variation of condensate stability upon amino acid substitution and revealed different microenvironments within a single condensate, which we verified with environmentally sensitive fluorophores. The interspersion of hydrophilic and hydrophobic residues and a lack of secondary structure formation result in an interfacial environment, which explains both the strong correlation between ELP condensate stability and interfacial hydrophobicity scales, as well as the prevalence of protein-water hydrogen bonds. Our study uncovers new mechanisms for condensate stability and organization that may be broadly applicable.
-
- Microbiology and Infectious Disease
- Structural Biology and Molecular Biophysics
Giant viruses of protists are a diverse and likely ubiquitous group of organisms. Here, we describe Jyvaskylavirus, the first giant virus isolated from Finland. This clade B marseillevirus was found in Acanthamoeba castellanii from a composting soil sample in Jyväskylä, Central Finland. Its genome shares similarities with other marseilleviruses. Helium ion microscopy and electron microscopy of infected cells unraveled stages of the Jyvaskylavirus life cycle. We reconstructed the Jyvaskylavirus particle to 6.3 Å resolution using cryo-electron microscopy. The ~2500 Å diameter virion displays structural similarities to other Marseilleviridae giant viruses. The capsid comprises of 9240 copies of the major capsid protein, encoded by open reading frame (ORF) 184, which possesses a double jellyroll fold arranged in trimers forming pseudo-hexameric capsomers. Below the capsid shell, the internal membrane vesicle encloses the genome. Through cross-structural and -sequence comparisons with other Marseilleviridae using AI-based software in model building and prediction, we elucidated ORF142 as the penton protein, which plugs the 12 vertices of the capsid. Five additional ORFs were identified, with models predicted and fitted into densities that either cap the capsomers externally or stabilize them internally. The isolation of Jyvaskylavirus suggests that these viruses may be widespread in the boreal environment and provide structural insights extendable to other marseilleviruses.