Identification and classification of ion channels across the tree of life provide functional insights into understudied CALHM channels
Figures
Distribution of human ion channels (ICs) across different families.
Each circle represents a human IC family, with the symbol at the center indicating its Group. The size of the circles is proportional to the number of sequences in that family, and the colored pies indicate the proportion of their Target Development Level (TDL) status as designated by Illuminating Druggable Genome (IDG). The placement of the bubbles is based on the average coordinates of all the members within that family in a distribution of uniform manifold approximation and projection (UMAP) embeddings generated using protein embedding-based pairwise sequence alignments. An embedding-based sequence alignment approach was used to overcome the vast divergence of IC sequences with minimal sequence similarity. A family-level abstraction was done to provide an intuitive view undeterred by the relationships of individual ICs across families. A detailed view of the full UMAP plot showing the placement of individual ICs is provided in Figure 1—figure supplement 3 for reference. Auxiliary IC families at the bottom were not part of the embedding-based analysis due to the lack of a pore-containing domain, and thus placed arbitrarily at the bottom of the figure.
Retrieval augmented generation (RAG) annotation pipeline used for validating the annotation of ion specificity and gating mechanism.
Upset plot showing the overlap of ion channel (IC) sequences based on their UniProt IDs across the current study, the Kyoto Encyclopedia of Genes and Genomes (KEGG) database, Guide to Pharmacology (GtoP), and Pharos.
Uniform manifold approximation and projection (UMAP) embeddings plot of all human ion channel (IC) sequences.
The UMAP embeddings were generated based on the pairwise similarity scores of the protein embedding alignment generated for all pairs of human ICs. Shapes of the markers indicate the different IC groups (circle: voltage-gated ion channels [VGIC], square: ligand-gated ion channels [LGIC], diamond: Chloride channel, triangle: Other, plus: Unclassified), and the colors indicate different IC families within each group as shown in the legend above.
Orthology profiling of human ion channels (ICs).
(A) Heatmap showing the percent of orthologs detected for each IC family within a given taxonomic lineage. The taxonomic groups are shown in the vertical axis with a tree on the left. Darker color represents a higher percentage of orthologs detected. Percentages were calculated as (total number of orthologs found for all ICs in a family)/(total number of organisms queried in the taxonomic lineage * number of sequences in the family). (B) Clustergram depicting the presence/absence of orthologous sequences of ICs across eukaryotic taxonomic lineages. ICs are clustered along the horizontal axis into nine distinct clusters. Taxonomic groups are shown on the vertical axis. Each square in the heatmap is colored based on the orthology relationship found for a specific IC in a specific organism (black: one-to-one ortholog present, red: co-ortholog detected, brown: no orthology detected). (C) Results from the enrichment analysis performed on human ICs of each cluster. The x-axis shows the number of ICs in the cluster enriched for the Gene Ontology (GO) term shown on the y-axis. The bars are colored based on their FDR values for the enriched term. For a full list of enriched terms, please refer to Supplementary file 1D.
Evolutionary analysis of calcium homeostasis modulator (CALHM) reveals conserved pattern positions.
(A) A phylogenetic tree depicting the evolutionary relationships across all six CALHM members with orthologs across different taxa. (B) The conserved pattern positions conserved across all six CALHM members are shown as a weblogo with the red bars indicating the significance (longer bars indicate higher significance) of conservation and are mapped into a representative structure of human CALHM2. The four transmembrane helices are labeled S1-S4. Conserved residues that are targeted for mutation are highlighted using an asterisk symbol in the labels. (C) Schematic representing the location of transmembrane regions and identified conserved pattern positions in a representative human CALHM2 sequence. The residues targeted for mutations are labeled with their corresponding positions for CALHM2, 1, and 6 shown in the labels. (D) Cartoon representation of CALHM2 structure (PDB ID: 6uiv and 6uiw) in open and closed conformation, respectively. (E) List of disease variants and mutations performed in the conserved pattern positions for functional studies.
Conserved pattern positions identified within the clade for calcium homeostasis modulator (CALHM)2, 4, 5, and 6.
(A) Phylogenetic tree of CALHM sequences where the orange star indicates the clade for CALHM2, 4, 5, and 6. (B) The identified pattern positions are mapped into a representative structure of human CALHM2 (PDB: 6uiw). (C) Weblogo showing the conserved pattern positions. The red bar indicates the significance of conservation, where a taller bar indicates higher significance.
Electrophysiological studies of human calcium homeostasis modulator CALHM1 and CALHM6.
Whole-cell voltage-clamp recordings were performed in tsA cells overexpressing wild-type CALHM1 (A–D) and wild-type CALHM6 (E–H), as well as in non-transfected cells (I–L). Currents were measured under three conditions sequentially from the same cell: 5 mM Ca2+ at 22°C (A, E, I; black), 0 mM Ca2+ at 22°C (B, F, J; blue), and 0 mM Ca2+ at 37°C (C, G, K; red). Voltage steps ranged from −100 mV to +140 mV, followed by a final tail pulse at −100 mV, with a holding potential of 0 mV (protocol illustrated in the box on the right). Current-voltage (I–V) relationships were plotted in D, H, and L using mean current amplitudes (averaged across independent cells) measured at the end of the voltage steps. The arrow indicates the time point at which current amplitudes were measured. Error bars represent SEM (D, n=5; H, n=5; L, n=5).
Expression analysis of wild-type calcium homeostasis modulator CALHM1, wild-type CALHM6, and their mutants.
(a, b) Representative gels showing CALHM1 and its mutants (a), and CALHM6 and its mutants (b). CALHM1 and CALHM6 signals were detected using in-gel fluorescence of the C-terminal GFP tag, while β-actin was detected by western blotting as a loading control. (c, d) Quantification of total protein expression levels of wild-type CALHM1 and its mutants (c), and wild-type CALHM6 and its mutants (d). Each dot represents an independent measurement (transfection), and error bars represent SEM (c, n=3; d, n=3). Statistical analysis was performed using one-way ANOVA with Bonferroni’s post hoc test, comparing each mutant to wild type (*p<0.05; **p<0.01; ***p<0.001). (e, f) Surface biotinylation assays of wild-type CALHM1 and its mutants (e), and wild-type CALHM6 and its mutants (f), detected using the C-terminal GFP tag by in-gel fluorescence.
-
Figure 4—figure supplement 1—source data 1
PDF file containing original western blots for Figure 4—figure supplement 1, indicating the relevant bands and treatments.
- https://cdn.elifesciences.org/articles/106134/elife-106134-fig4-figsupp1-data1-v1.zip
-
Figure 4—figure supplement 1—source data 2
Original files for western blot analysis displayed in Figure 4—figure supplement 1.
- https://cdn.elifesciences.org/articles/106134/elife-106134-fig4-figsupp1-data2-v1.zip
Functional characterization of calcium homeostasis modulator CALHM1 and CALHM6 mutants at conserved residues at 37°C.
Whole-cell voltage-clamp recordings were performed in tsA cells overexpressing wild-type CALHM1 (A–C), CALHM1 mutant I109W (D–F), wild-type CALHM6 (G–I), and CALHM6 mutants (Y51A (J–L) and W113A (M–O)). Currents were measured under two conditions: 0 mM Ca²+ at 37°C (A, D, G, J, M; red) and 0 mM Ca²+ at 37°C plus 100 µM Gd3+ (B, E, H, K, N; blue), with both conditions recorded sequentially from the same cell. Voltage steps ranged from −80 mV to +120 mV, followed by a final tail pulse at −80 mV, with a holding potential of 0 mV (protocol shown in the box on the right). Current-voltage (I–V) relationships were plotted in C, F, I, L, and O using mean current amplitudes (averaged across independent cells) measured at the end of the voltage steps. The arrow indicates the time point at which the current was measured. The number of independent measurements (cells) were: n=5 (C); n=5 (F); n=5 (I); n=5 (L); n=5 (O). Error bars represent SEM. (P, Q) Current amplitudes obtained using a two-step voltage protocol (from +120 mV to −80 mV; protocol shown in the box on the right) are compared between wild-type CALHM1 and its mutants (P), and between wild-type CALHM6 and its mutants (Q). Each dot represents an independent measurement (cell), and bar represents the mean current amplitude across cells. The number of independent measurements (cells) for each bar in P and Q are shown from left to right: 5, 8, 5, 8, 6, 7, 5, 5, 5, 6, 7, 7 (P); 5, 6, 5, 7, 7, 5, 6, 6 (Q). Statistical analysis was performed using one-way ANOVA with Bonferroni’s post hoc test, comparing each mutant to wild type (*p<0.05; **p<0.01; ***p<0.001).
Functional characterization of calcium homeostasis modulator (CALHM)1 and CALHM6 mutants at conserved residues at 22°C.
Current amplitudes obtained using a two-step voltage protocol (from +120 mV to −80 mV; protocol shown in the box on the right) are compared between wild-type CALHM1 and its mutants (a, b) and between wild-type CALHM6 and its mutants (c, d). The cells analyzed here are the same as those in Figure 4P and Q. Briefly, for each cell, currents were measured sequentially under three conditions: 5 mM Ca2+ at 22°C, 0 mM Ca2+ at 22°C, and 0 mM Ca2+ at 37°C. The currents from the first two conditions are plotted here, while currents at 0 mM Ca2+ at 37°C are shown in Figure 4R and S. Each dot represents an independent measurement (cell), and bar represents the mean current amplitude across cells. The number of independent measurements (cells) for each bar in a–d are shown from left to right: 5, 8, 5, 8, 6, 7, 5, 5, 5, 6, 7, 7 (a, b); 5, 6, 5, 7, 7, 6, 6, 5 (c, d). Statistical analysis was performed using one-way ANOVA with Bonferroni’s post hoc test, comparing each mutant to wild type (*p<0.05; **p<0.01; ***p<0.001).
UMAP embeddings of the human ICs alongside orthologs from 12 model organisms.
Structure based sequence alignment and phylogenetic analysis of available crystal structures of members from the CALHM, Pannexin and Connexin families.
Top: The resulting sequence alignment is very sparse and does not show conservation of residues in the TM regions. The CPC motif with conserved cysteines in CALHM family is shown. Bottom: Phylogenetic tree based on the alignment has low support values making it difficult to interpret.
Tables
List of features annotated for the collected ion channel (IC) sequences.
The labels are grouped by their annotation category. Labels for Aquaporin-1 are shown as examples of each annotation label.
| Identifier labels | |
|---|---|
| UniProt | P29972 |
| Name | Aquaporin-1 |
| Symbol | AQP1 (CHIP28) |
| Target Development Level (Pharos) | Tbio |
| Length | 269 |
| Classification labels | |
| Family designation | Aquaporin |
| Group | Other |
| Class | |
| Family | Aquaporin |
| Subfamily | |
| Functional labels | |
| Unit | Pore-containing |
| Ion | Water |
| Gate mechanism | Ligand-gated (cGMP) |
| Lit Resource | PMID:26365508, PMID:16962972 |
| UniProt function | Form water-specific channel/plasma membranes of red cells and kidney proximal tubules |
| Structure-related labels | |
| PDB ID | 8CT2 |
| AlphaFold ID | |
| Complex/Interaction-related labels | |
| Auxiliary | No |
| Characterized domains | MIP |
| Auxiliary domain | no |
| Auxiliary protein | |
| Notable interactors | EPHB2 |
| TM and pore domain-related labels | |
| Pore domain start | 8 |
| Pore domain end | 228 |
| Domain length | 220 |
| Does it pass through membrane at least once | Yes |
| # of TM domains (predicted by TMHMM) | 6 |
| # of TM domain (predicted by Phobius) | 6 |
| TM organization | 1|2|3|4|5|6 |
| MOLE pore residue first | 35 |
| MOLE pore residue last | 185 |
| # of TMs | 6 |
| # of Transmembranes (TMs)+Intramembranes (IMs) | 10 |
| TMstart (UniProt) | 8 |
| TMend (UniProt) | 228 |
| TMsList | T:8–36,T:49–66,I:71–76,I:77–84,T:95–115,T:137–155,T:167–183,I:187–192,I:193–200,T:208–228 |
| TM Lit Resource | PMID:10644652 |
| Lit based TMstart_1 | 8 |
| Lit based TMend _1 | 36 |
| Lit based TMstart_2 | 49 |
| Lit based TMend_2 | 66 |
| Lit based TMstart_3 | 95 |
| Lit based TMend_3 | 115 |
| Lit based TMstart_4 | 137 |
| Lit based TMend_4 | 155 |
| Lit based TMstart_5 | 167 |
| Lit based TMend_5 | 183 |
| Lit based TMstart_6 | 208 |
| Lit based TMend_6 | 228 |
Additional files
-
Supplementary file 1
Gene Ontology (GO) annotation of IC clusters.
(A) Table showing the enriched GO terms for each IC cluster. An * in the ion and gate mechanism columns indicates annotations not verified by the retrieval augmented generation (RAG) system. (B) Table showing the presence/absence of curated human IC channels in other ion channel databases listed in Kyoto Encyclopedia of Genes and Genomes (KEGG), Guide to Pharmacology (GtoP), and Pharos. (C) List of human IC orthologs detected across the tree of life. (D) Table showing the enriched Gene Ontology (GO) terms for each IC cluster.
- https://cdn.elifesciences.org/articles/106134/elife-106134-supp1-v1.xlsx
-
MDAR checklist
- https://cdn.elifesciences.org/articles/106134/elife-106134-mdarchecklist1-v1.docx