Proteome-wide signatures of function in highly diverged intrinsically disordered regions

  1. Taraneh Zarin
  2. Bob Strome
  3. Alex N Nguyen Ba
  4. Simon Alberti
  5. Julie Deborah Forman-Kay
  6. Alan M Moses  Is a corresponding author
  1. University of Toronto, Canada
  2. Harvard University, United States
  3. Max Planck Institute of Molecular Cell Biology and Genetics, Germany
  4. Hospital for Sick Children, Canada

Abstract

Intrinsically disordered regions make up a large part of the proteome, but the sequence-to-function relationship in these regions is poorly understood, in part because the primary amino acid sequences of these regions are poorly conserved in alignments. Here we use an evolutionary approach to detect molecular features that are preserved in the amino acid sequences of orthologous intrinsically disordered regions. We find that most disordered regions contain multiple molecular features that are preserved, and we define these as 'evolutionary signatures' of disordered regions. We demonstrate that intrinsically disordered regions with similar evolutionary signatures can rescue function in vivo, and that groups of intrinsically disordered regions with similar evolutionary signatures are strongly enriched for functional annotations and phenotypes. We propose that evolutionary signatures can be used to predict function for many disordered regions from their amino acid sequences.

Data availability

The analysis is based on publically available sequence data from YGOB. Source data has been included as supplementary data.

Article and author information

Author details

  1. Taraneh Zarin

    Department of Cell and Systems Biology, University of Toronto, Toronto, Canada
    Competing interests
    The authors declare that no competing interests exist.
  2. Bob Strome

    Department of Cell and Systems Biology, University of Toronto, Toronto, Canada
    Competing interests
    The authors declare that no competing interests exist.
  3. Alex N Nguyen Ba

    Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Simon Alberti

    Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-4017-6505
  5. Julie Deborah Forman-Kay

    Program in Molecular Medicine, Hospital for Sick Children, Toronto, Canada
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-8265-972X
  6. Alan M Moses

    Department of Cell and Systems Biology, University of Toronto, Toronto, Canada
    For correspondence
    alan.moses@utoronto.ca
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-3118-3121

Funding

National Sciences and Engineering Research Council (Alexander Graham Bell Scholarship)

  • Taraneh Zarin

National Sciences and Engineering Research Council (Discovery Grant)

  • Alan M Moses

Canadian Institutes of Health Research (PJT-148532)

  • Julie Deborah Forman-Kay
  • Alan M Moses

Canadian Institutes of Health Research (FDN-148375)

  • Julie Deborah Forman-Kay

Canada Research Chairs

  • Julie Deborah Forman-Kay

Canadian Foundation for Innovation

  • Alan M Moses

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2019, Zarin et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 7,629
    views
  • 1,231
    downloads
  • 141
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Taraneh Zarin
  2. Bob Strome
  3. Alex N Nguyen Ba
  4. Simon Alberti
  5. Julie Deborah Forman-Kay
  6. Alan M Moses
(2019)
Proteome-wide signatures of function in highly diverged intrinsically disordered regions
eLife 8:e46883.
https://doi.org/10.7554/eLife.46883

Share this article

https://doi.org/10.7554/eLife.46883

Further reading

    1. Computational and Systems Biology
    2. Neuroscience
    Brian DePasquale, Carlos D Brody, Jonathan W Pillow
    Research Article

    Accumulating evidence to make decisions is a core cognitive function. Previous studies have tended to estimate accumulation using either neural or behavioral data alone. Here we develop a unified framework for modeling stimulus-driven behavior and multi-neuron activity simultaneously. We applied our method to choices and neural recordings from three rat brain regions - the posterior parietal cortex (PPC), the frontal orienting fields (FOF), and the anterior-dorsal striatum (ADS) - while subjects performed a pulse-based accumulation task. Each region was best described by a distinct accumulation model, which all differed from the model that best described the animal's choices. FOF activity was consistent with an accumulator where early evidence was favored while the ADS reflected near perfect accumulation. Neural responses within an accumulation framework unveiled a distinct association between each brain region and choice. Choices were better predicted from all regions using a comprehensive, accumulation-based framework and different brain regions were found to differentially reflect choice-related accumulation signals: FOF and ADS both reflected choice but ADS showed more instances of decision vacillation. Previous studies relating neural data to behaviorally-inferred accumulation dynamics have implicitly assumed that individual brain regions reflect the whole-animal level accumulator. Our results suggest that different brain regions represent accumulated evidence in dramatically different ways and that accumulation at the whole-animal level may be constructed from a variety of neural-level accumulators.

    1. Computational and Systems Biology
    2. Ecology
    Lenore Pipes, Rasmus Nielsen
    Tools and Resources

    Environmental DNA (eDNA) is becoming an increasingly important tool in diverse scientific fields from ecological biomonitoring to wastewater surveillance of viruses. The fundamental challenge in eDNA analyses has been the bioinformatical assignment of reads to taxonomic groups. It has long been known that full probabilistic methods for phylogenetic assignment are preferable, but unfortunately, such methods are computationally intensive and are typically inapplicable to modern Next-Generation Sequencing data. We here present a fast approximate likelihood method for phylogenetic assignment of DNA sequences. Applying the new method to several mock communities and simulated datasets, we show that it identifies more reads at both high and low taxonomic levels more accurately than other leading methods. The advantage of the method is particularly apparent in the presence of polymorphisms and/or sequencing errors and when the true species is not represented in the reference database.