High-throughput profiling of sequence recognition by tyrosine kinases and SH2 domains using bacterial peptide display
Abstract
Tyrosine kinases and SH2 (phosphotyrosine recognition) domains have binding specificities that depend on the amino acid sequence surrounding the target (phospho)tyrosine residue. Although the preferred recognition motifs of many kinases and SH2 domains are known, we lack a quantitative description of sequence specificity that could guide predictions about signaling pathways or be used to design sequences for biomedical applications. Here, we present a platform that combines genetically encoded peptide libraries and deep sequencing to profile sequence recognition by tyrosine kinases and SH2 domains. We screened several tyrosine kinases against a million-peptide random library and used the resulting profiles to design high-activity sequences. We also screened several kinases against a library containing thousands of human proteome-derived peptides and their naturally occurring variants. These screens recapitulated independently measured phosphorylation rates and revealed hundreds of phosphosite-proximal mutations that impact phosphosite recognition by tyrosine kinases. We extended this platform to the analysis of SH2 domains and showed that screens could predict relative binding affinities. Finally, we expanded our method to assess the impact of non-canonical and post-translationally modified amino acids on sequence recognition. This specificity profiling platform will shed new light on phosphotyrosine signaling and could readily be adapted to other protein modification/recognition domains.
Data availability
All of the processed data from the high-throughput specificity screens are provided as source data files. The raw fastq and fasta sequencing files are available as a Dryad repository (DOI: 10.5061/dryad.0zpc86727). Custom code used to process/analyze screening data can be found in a GitHub repository, as specified in the manuscript.
- Data from: High-throughput profiling of sequence recognition by tyrosine kinases and SH2 domains using bacterial peptide display. Dryad Digital Repository, doi:10.5061/dryad.0zpc86727.
Article and author information
Funding
National Institute of General Medical Sciences (R35GM138014)
- Neel H Shah
Damon Runyon Cancer Research Foundation (DFS 31-18)
- Neel H Shah
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Reviewing Editor
- Tony Hunter, Salk Institute for Biological Studies, United States
Publication history
- Received: August 1, 2022
- Accepted: March 15, 2023
- Accepted Manuscript published: March 16, 2023 (version 1)
Copyright
© 2023, Li et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.