A deep learning algorithm to translate and classify cardiac electrophysiology

  1. Parya Aghasafari Ph.D.
  2. Pei-Chi Yang Ph.D.
  3. Divya C Kernik Ph.D.
  4. Kazuho Sakamoto Ph.D.
  5. Yasunari Kanda Ph.D.
  6. Junko Kurokawa Ph.D
  7. Igor Vorobyov
  8. Colleen E Clancy Ph.D.  Is a corresponding author
  1. University of California Davis, United States
  2. Washington University in St. Louis, United States
  3. University of Shizuoka, Japan
  4. National Institute of Health Sciences, Japan
  5. University California Davis, United States

Abstract

The development of induced pluripotent stem cell-derived cardiomyocytes (iPSC-CMs) has been a critical in vitro advance in the study of patient-specific physiology, pathophysiology and pharmacology. We designed a new deep learning multitask network approach intended to address the low throughput, high variability and immature phenotype of the iPSC-CM platform. The rationale for combining translation and classification tasks is because the most likely application of the deep learning technology we describe here is to translate iPSC-CMs following application of a perturbation. The deep learning network was trained using simulated action potential (AP) data and applied to classify cells into the drug-free and drugged categories and to predict the impact of electrophysiological perturbation across the continuum of aging from the immature iPSC-CMs to the adult ventricular myocytes. The phase of the AP extremely sensitive to perturbation due to a steep rise of the membrane resistance was found to contain the key information required for successful network multitasking. We also demonstrated successful translation of both experimental and simulated iPSC-CM AP data validating our network by prediction of experimental drug-induced effects on adult cardiomyocyte APs by the latter.

Data availability

Since we used simulated data, we have made all drugged and drug-free iPSC-CM and adult-CM AP data used for training and testing the multitask network publicly available at Clancy lab Github.(https://github.com/ClancyLabUCD/Multitask_network/tree/master/data). In addition, we have illustrated training and test dataset in Figure1 and Figure5.We have also shared the jupyter notebook for preparing clean and organized data for training the network at Clancy lab Github (https://github.com/ClancyLabUCD/Multitask_network/tree/master/jupyter).We also made experimental data used for the model validation publicly available at Clancy lab Github.(https://github.com/ClancyLabUCD/Multitask_network/blob/master/data/clean_data/experiments.csv ). Figure 7 illustrates the experimental data we used to validate the network.

Article and author information

Author details

  1. Parya Aghasafari Ph.D.

    Physiology and Membrane Biology, University of California Davis, Davis, United States
    Competing interests
    The authors declare that no competing interests exist.
  2. Pei-Chi Yang Ph.D.

    Physiology and Membrane Biology, University of California Davis, Davis, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. Divya C Kernik Ph.D.

    Biomedical Engineering, Washington University in St. Louis, St. Louis, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Kazuho Sakamoto Ph.D.

    Bio-Informational Pharmacology, University of Shizuoka, Shizuoka, Japan
    Competing interests
    The authors declare that no competing interests exist.
  5. Yasunari Kanda Ph.D.

    Division of Pharmacology, National Institute of Health Sciences, Kanagawa, Japan
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-2527-3526
  6. Junko Kurokawa Ph.D

    Bio-Informational Pharmacology, University of Shizuoka, Shizuoka, Japan
    Competing interests
    The authors declare that no competing interests exist.
  7. Igor Vorobyov

    University California Davis, Davis, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-4767-5297
  8. Colleen E Clancy Ph.D.

    Physiology and Membrane Biology, University of California Davis, Davis, United States
    For correspondence
    ceclancy@ucdavis.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-6849-4885

Funding

Common Fund (OT2OD026580)

  • Igor Vorobyov
  • Colleen E Clancy Ph.D.

Texas Advanced Computing Center Leadership Resource Allocation (MCB20010)

  • Igor Vorobyov
  • Colleen E Clancy Ph.D.

Oracle cloud for research allocation

  • Igor Vorobyov
  • Colleen E Clancy Ph.D.

Common Fund (OT2OD025308‐01S2)

  • Parya Aghasafari Ph.D.

American Heart Association (19CDA34770101)

  • Igor Vorobyov

National Heart, Lung, and Blood Institute (R01HL152681)

  • Igor Vorobyov
  • Colleen E Clancy Ph.D.

National Heart, Lung, and Blood Institute (R01HL128170)

  • Colleen E Clancy Ph.D.

National Heart, Lung, and Blood Institute (U01HL126273)

  • Colleen E Clancy Ph.D.

Department of Physiology and Membrane Biology Research Partnership Fund

  • Igor Vorobyov
  • Colleen E Clancy Ph.D.

Extreme Science and Engineering Discovery Environment (MCB170095)

  • Igor Vorobyov
  • Colleen E Clancy Ph.D.

National Center for Supercomputing Applications Blue Waters Broadening Participation Allocation

  • Igor Vorobyov
  • Colleen E Clancy Ph.D.

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Thomas Hund, The Ohio State University, United States

Version history

  1. Received: March 11, 2021
  2. Accepted: June 29, 2021
  3. Accepted Manuscript published: July 2, 2021 (version 1)
  4. Version of Record published: July 15, 2021 (version 2)

Copyright

© 2021, Aghasafari et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 1,853
    views
  • 265
    downloads
  • 18
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Parya Aghasafari Ph.D.
  2. Pei-Chi Yang Ph.D.
  3. Divya C Kernik Ph.D.
  4. Kazuho Sakamoto Ph.D.
  5. Yasunari Kanda Ph.D.
  6. Junko Kurokawa Ph.D
  7. Igor Vorobyov
  8. Colleen E Clancy Ph.D.
(2021)
A deep learning algorithm to translate and classify cardiac electrophysiology
eLife 10:e68335.
https://doi.org/10.7554/eLife.68335

Share this article

https://doi.org/10.7554/eLife.68335

Further reading

    1. Computational and Systems Biology
    2. Genetics and Genomics
    Weichen Song, Yongyong Shi, Guan Ning Lin
    Tools and Resources

    We propose a new framework for human genetic association studies: at each locus, a deep learning model (in this study, Sei) is used to calculate the functional genomic activity score for two haplotypes per individual. This score, defined as the Haplotype Function Score (HFS), replaces the original genotype in association studies. Applying the HFS framework to 14 complex traits in the UK Biobank, we identified 3619 independent HFS–trait associations with a significance of p < 5 × 10−8. Fine-mapping revealed 2699 causal associations, corresponding to a median increase of 63 causal findings per trait compared with single-nucleotide polymorphism (SNP)-based analysis. HFS-based enrichment analysis uncovered 727 pathway–trait associations and 153 tissue–trait associations with strong biological interpretability, including ‘circadian pathway-chronotype’ and ‘arachidonic acid-intelligence’. Lastly, we applied least absolute shrinkage and selection operator (LASSO) regression to integrate HFS prediction score with SNP-based polygenic risk scores, which showed an improvement of 16.1–39.8% in cross-ancestry polygenic prediction. We concluded that HFS is a promising strategy for understanding the genetic basis of human complex traits.

    1. Computational and Systems Biology
    Qianmu Yuan, Chong Tian, Yuedong Yang
    Tools and Resources

    Revealing protein binding sites with other molecules, such as nucleic acids, peptides, or small ligands, sheds light on disease mechanism elucidation and novel drug design. With the explosive growth of proteins in sequence databases, how to accurately and efficiently identify these binding sites from sequences becomes essential. However, current methods mostly rely on expensive multiple sequence alignments or experimental protein structures, limiting their genome-scale applications. Besides, these methods haven’t fully explored the geometry of the protein structures. Here, we propose GPSite, a multi-task network for simultaneously predicting binding residues of DNA, RNA, peptide, protein, ATP, HEM, and metal ions on proteins. GPSite was trained on informative sequence embeddings and predicted structures from protein language models, while comprehensively extracting residual and relational geometric contexts in an end-to-end manner. Experiments demonstrate that GPSite substantially surpasses state-of-the-art sequence-based and structure-based approaches on various benchmark datasets, even when the structures are not well-predicted. The low computational cost of GPSite enables rapid genome-scale binding residue annotations for over 568,000 sequences, providing opportunities to unveil unexplored associations of binding sites with molecular functions, biological processes, and genetic variants. The GPSite webserver and annotation database can be freely accessed at https://bio-web1.nscc-gz.cn/app/GPSite.