Universal and taxon-specific trends in protein sequences as a function of age
Abstract
Extant protein-coding sequences span a huge range of ages, from those that emerged only recently, to those present in the last universal common ancestor. Because evolution has had less time to act on young sequences, there might be 'phylostratigraphy' trends in any properties that evolve slowly with age. A long-term reduction in hydrophobicity and hydrophobic clustering was found in previous, taxonomically restricted studies. Here we perform integrated phylostratigraphy across 435 fully sequenced species, using sensitive HMM methods to detect protein domain homology. We find that the reduction in hydrophobic clustering is universal across lineages. However, only young animal domains have a tendency to have higher structural disorder. Among ancient domains, trends in amino acid composition reflect the order of recruitment into the genetic code, suggesting that the composition of the contemporary descendants of ancient sequences reflects amino acid availability during the earliest stages of life, when these sequences first emerged.
Data availability
All scripts used in this work can be accessed at: https://github.com/MaselLab/ProteinEvolution. Our raw data, and homology files used in our analyses, are available at https://doi.org/10.6084/m9.figshare.12037281.
Article and author information
Author details
Funding
John Templeton Foundation (60814)
- Joanna Masel
National Institutes of Health (GM-104040)
- Joanna Masel
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Reviewing Editor
- Christian R Landry, Université Laval, Canada
Publication history
- Received: March 28, 2020
- Accepted: January 5, 2021
- Accepted Manuscript published: January 8, 2021 (version 1)
- Version of Record published: January 21, 2021 (version 2)
Copyright
© 2021, James et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,530
- Page views
-
- 170
- Downloads
-
- 9
- Citations
Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Developmental Biology
- Evolutionary Biology
The gill skeleton of cartilaginous fishes (sharks, skates, rays, and holocephalans) exhibits a striking anterior–posterior polarity, with a series of fine appendages called branchial rays projecting from the posterior margin of the gill arch cartilages. We previously demonstrated in the skate (Leucoraja erinacea) that branchial rays derive from a posterior domain of pharyngeal arch mesenchyme that is responsive to Sonic hedgehog (Shh) signaling from a distal gill arch epithelial ridge (GAER) signaling centre. However, how branchial ray progenitors are specified exclusively within posterior gill arch mesenchyme is not known. Here, we show that genes encoding several Wnt ligands are expressed in the ectoderm immediately adjacent to the skate GAER, and that these Wnt signals are transduced largely in the anterior arch environment. Using pharmacological manipulation, we show that inhibition of Wnt signalling results in an anterior expansion of Shh signal transduction in developing skate gill arches, and in the formation of ectopic anterior branchial ray cartilages. Our findings demonstrate that ectodermal Wnt signalling contributes to gill arch skeletal polarity in skate by restricting Shh signal transduction and chondrogenesis to the posterior arch environment and highlights the importance of signalling interactions at embryonic tissue boundaries for cell fate determination in vertebrate pharyngeal arches.
-
- Ecology
- Evolutionary Biology
Spider venoms are a complex concoction of enzymes, polyamines, inorganic salts, and disulfide-rich peptides (DRPs). Although DRPs are widely distributed and abundant, their bevolutionary origin has remained elusive. This knowledge gap stems from the extensive molecular divergence of DRPs and a lack of sequence and structural data from diverse lineages. By evaluating DRPs under a comprehensive phylogenetic, structural and evolutionary framework, we have not only identified 78 novel spider toxin superfamilies but also provided the first evidence for their common origin. We trace the origin of these toxin superfamilies to a primordial knot – which we name ‘Adi Shakti’, after the creator of the Universe according to Hindu mythology – 375 MYA in the common ancestor of Araneomorphae and Mygalomorphae. As the lineages under evaluation constitute nearly 60% of extant spiders, our findings provide fascinating insights into the early evolution and diversification of the spider venom arsenal. Reliance on a single molecular toxin scaffold by nearly all spiders is in complete contrast to most other venomous animals that have recruited into their venoms diverse toxins with independent origins. By comparatively evaluating the molecular evolutionary histories of araneomorph and mygalomorph spider venom toxins, we highlight their contrasting evolutionary diversification rates. Our results also suggest that venom deployment (e.g. prey capture or self-defense) influences evolutionary diversification of DRP toxin superfamilies.