Universal and taxon-specific trends in protein sequences as a function of age
Abstract
Extant protein-coding sequences span a huge range of ages, from those that emerged only recently, to those present in the last universal common ancestor. Because evolution has had less time to act on young sequences, there might be 'phylostratigraphy' trends in any properties that evolve slowly with age. A long-term reduction in hydrophobicity and hydrophobic clustering was found in previous, taxonomically restricted studies. Here we perform integrated phylostratigraphy across 435 fully sequenced species, using sensitive HMM methods to detect protein domain homology. We find that the reduction in hydrophobic clustering is universal across lineages. However, only young animal domains have a tendency to have higher structural disorder. Among ancient domains, trends in amino acid composition reflect the order of recruitment into the genetic code, suggesting that the composition of the contemporary descendants of ancient sequences reflects amino acid availability during the earliest stages of life, when these sequences first emerged.
Data availability
All scripts used in this work can be accessed at: https://github.com/MaselLab/ProteinEvolution. Our raw data, and homology files used in our analyses, are available at https://doi.org/10.6084/m9.figshare.12037281.
Article and author information
Author details
Funding
John Templeton Foundation (60814)
- Joanna Masel
National Institutes of Health (GM-104040)
- Joanna Masel
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Reviewing Editor
- Christian R Landry, Université Laval, Canada
Version history
- Received: March 28, 2020
- Accepted: January 5, 2021
- Accepted Manuscript published: January 8, 2021 (version 1)
- Version of Record published: January 21, 2021 (version 2)
Copyright
© 2021, James et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,685
- Page views
-
- 182
- Downloads
-
- 10
- Citations
Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Evolutionary Biology
- Microbiology and Infectious Disease
Drug resistance remains a major obstacle to malaria control and eradication efforts, necessitating the development of novel therapeutic strategies to treat this disease. Drug combinations based on collateral sensitivity, wherein resistance to one drug causes increased sensitivity to the partner drug, have been proposed as an evolutionary strategy to suppress the emergence of resistance in pathogen populations. In this study, we explore collateral sensitivity between compounds targeting the Plasmodium dihydroorotate dehydrogenase (DHODH). We profiled the cross-resistance and collateral sensitivity phenotypes of several DHODH mutant lines to a diverse panel of DHODH inhibitors. We focus on one compound, TCMDC-125334, which was active against all mutant lines tested, including the DHODH C276Y line, which arose in selections with the clinical candidate DSM265. In six selections with TCMDC-125334, the most common mechanism of resistance to this compound was copy number variation of the dhodh locus, although we did identify one mutation, DHODH I263S, which conferred resistance to TCMDC-125334 but not DSM265. We found that selection of the DHODH C276Y mutant with TCMDC-125334 yielded additional genetic changes in the dhodh locus. These double mutant parasites exhibited decreased sensitivity to TCMDC-125334 and were highly resistant to DSM265. Finally, we tested whether collateral sensitivity could be exploited to suppress the emergence of resistance in the context of combination treatment by exposing wildtype parasites to both DSM265 and TCMDC-125334 simultaneously. This selected for parasites with a DHODH V532A mutation which were cross-resistant to both compounds and were as fit as the wildtype parent in vitro. The emergence of these cross-resistant, evolutionarily fit parasites highlights the mutational flexibility of the DHODH enzyme.
-
- Evolutionary Biology
- Genetics and Genomics
Cultural and socioeconomic differences stratify human societies and shape their genetic structure beyond the sole effect of geography. Despite mating being limited by sociocultural stratification, most demographic models in population genetics often assume random mating. Taking advantage of the correlation between sociocultural stratification and the proportion of genetic ancestry in admixed populations, we sought to infer the former process in the Americas. To this aim, we define a mating model where the individual proportions of the genome inherited from Native American, European and sub-Saharan African ancestral populations constrain the mating probabilities through ancestry-related assortative mating and sex bias parameters. We simulate a wide range of admixture scenarios under this model. Then, we train a deep neural network and retrieve good performance in predicting mating parameters from genomic data. Our results show how population stratification shaped by socially constructed racial and gender hierarchies have constrained the admixture processes in the Americas since the European colonisation and the subsequent Atlantic slave trade.