A) Schematic representation of domain organization for the known isoforms of the three human DCLK paralogs. Domain boundaries are annotated according to the representative amino acid sequences derived from UniProt. B) DCLK1 isoforms visualized as cartoons, showing key structural differences between the four human DCLK1 isoforms and a DCLK1 catalytic domain with artificially short linker regions (DCLK1cat).

Names of DCLK isoforms discussed in this paper, along with their respective isoform number, UniProt identification, and alternate names that have been used.

Evolution of the DCLK family.

A) Phylogenetic tree showing the divergence and grouping of DCLK sub-families in different taxonomic groups. Bootstrap values are provided for each clade. B) Domain annotations for the sequences included in the phylogenetic tree. The length of the C-terminal tail segment for each sequence is shown as a histogram (green). The original tree generated using IQ-TREE is provided in Figure 2-source data 1.

A) Cartoon cladogram of mammalian species showing the domain organization of each DCLK1 isoform from representative annotated sequences from UniProt. UniProt IDs for each sequence are provided in Figure 3-Source File 1. B) SDS-PAGE of 6His-GST-3C-DCLK1.1 (351-689, top) or a D533A mutant in which the DFG Asp is mutated to Ala (middle). Proteins were separated by size exclusion chromatography, and high-purity fractions were pooled. The affinity tag was removed prior to analysis by incubation with 3C protease, leading to a demonstrable shift in mobility (bottom). C) Evaluation of catalytic activity towards DCLK1 peptide. DCLK1.1 351-689 possesses a Km[ATP] of ∼20 µM in vitro (left), and real-time substrate phosphorylation was inhibited by prior incubation with the small molecule DCLK1-IN-1 (right). D) Thermal shift assay demonstrating a 2.1°C increase in the stability of DCLK1 351-689 in the presence of Mg:ATP (left), which was absent in the D533A protein (right). Raw data are provided in Figure 3-Source File 2.

A) Gene and intron-exon organization of human DCLK1 isoforms in the C-terminal tail. The DCLK1 gene is located at 13q13.3, and isoforms 1 and 3 contain an additional exon (exon 16) in the C-terminal tail that is absent in DCLK1.2. B) A phase 2 intron results in the alternative transcript of exon 17 in isoform 1, translating a different open reading frame with an early stop codon and resulting in the shorter sequence. C) Cartoon organization of the C-tail exons (exons 15, 16, and 17) of the DCLK1

A) Cartoons of the DCLK1 constructs used in our assays, portraying the locations of the Inhibitory Binding Segment (IBS) and the Intrinsically Disordered Segment (IDS). B-E) DSF thermal denaturation profiling of the purified DCLK1 core catalytic domain, or tail-matched DCLK1.1 and DCLK1.2 proteins. Unfolding curves and changes in Tm values (ΔTm) for each protein relative to WT DCLK1cat are indicated. F-H) B-factor structural representations of the DCLK1short proteins shown in A). The width of each region indicates the extent of flexibility based on RMSF data averaged over three one-microsecond MD replicates. I) DSSP analysis of three replicates of one-microsecond MD simulations showing the residues surrounding the IBS in the C-tail of DCLK1.1short and DCLK1.2short. Blue indicates the presence of β-sheet or β-bridge secondary structure and red indicates the presence of α-helical structure.

Identification of DCLK-specific constraints.

A) Cartoon of DCLK1.2 and the intrinsically disordered segment (IDS) with evolutionary constraints mapped to the kinase domain and C-tail. B) Sequence constraints that distinguish DCLK1/2/3 sequences from closely related CAMK sequences are shown in a contrast hierarchical alignment (CHA). The CHA shows DCLK1/2/3 sequences from diverse organisms as the display alignment. The foreground consists of DCLK sequences while the background alignment contains related CAMK sequences. The foreground and background alignments are shown as residue frequencies below the display alignment in integer tenths (19). The histogram (red) indicates the extent to which distinguishing residues in the foreground diverge from the corresponding position in the background alignment. Black dots indicate the alignment positions used by the BPPS (Neuwald, 2014) procedure when distinguishing DCLK sequences from related CAMK sequences. Alignment numbering is based on the human DCLK1.2 sequence (UniProt ID: O15075-2). C) Sequence alignment of human DCLK1 isoforms.

The DCLK1 C-tail ‘completes’ the regulatory C-spine (green).

A) PKA crystal structure (PDB: 1ATP) with bound ATP in red and Mg2+ in purple. The C-spine is completed by the adenine ring of ATP. The gamma phosphate of ATP hydrogen bonds with the second glycine of the G-loop. B) DCLK1.2 crystal structure (PDB: 6KYQ) showing how the C-tail (red) docks underneath the pocket and mimics the ATP structure. The C-spine is completed by V682 and V684 in the C-tail, and helical segments defined using DSSP are shown. T687 is also depicted making multiple hydrogen bonds with the backbone of V684 and I685 (dashed lines). C) DCLK1.1 AlphaFold2 model showing an unstructured loop in the C-tail docking into the ATP binding pocket, where V684 and I685 are predicted to complete the C-spine. The average per-residue confidence of the C-tail is 49%. D-F) Zoomed-out views of A-C, demonstrating how the DCLK1 C-tail docks into the ATP binding cleft, akin to ATP in PKA.

A) Structural depiction of DCLK1.2 (PDB: 6KYQ) showing the location of modified DCLK1 amino acids on the G-loop (purple) or C-tail (red). B-C) Differential Scanning Fluorimetry assays depicting thermal denaturation profiles of each protein along with the calculated Tm value. D) Kinase assays. DCLK1-dependent phosphate incorporation (pmol/min) into the DCLK1 peptide substrate was calculated for DCLK1cat, long and short DCLK1.1, and the indicated DCLK1.2 variants. E) Thermal stability analysis in the presence of ATP or DCLK1-IN-1 for DCLK1 proteins. For DCLK1.2, all proteins were generated in the DCLK1.2 short background.

A DCLK1 C-tail can act as a multi-functional Swiss Army Knife, using six distinct segments for a variety of regulatory functions including mimicking ATP binding/association, stabilizing the G-loop, occluding the substrate binding pocket, and packing against the kinase activation loop.

Structural cartoon depicting each DCLK1 isoform, categorizing them by presence of DCX domain and enzymatic activity.

Full-length DCLK paralogs, showing how the tail of each packs against the substrate binding pocket of the kinase domain.

Fluorescence for DCLK1cat in the presence of DMSO and different inhibitor conditions.

SDS-PAGE and Coomassie blue staining of each DCLK1 construct.

A) Minimum distance plots showing that K692 in the DCLK1.2 C-tail forms stable interactions with the DFG and HRD aspartates across microsecond MD replicates. B) By comparison, H689 in the DCLK1.1 C-tail fails to interact with the DFG and HRD aspartates.

CAMK-specific insert (green) consistently making structural contacts (shown in surface representation) with the C-tail (red) across multiple CAMK families.

Identification of DCLK family-specific constraints.

A) Sequence alignment of human DCLK paralogs including long and short isoforms of DCLK1. B) Sequence constraints that distinguish DCLK1/2/3 sequences from closely related CAMK sequences are shown in a contrast hierarchical alignment (CHA). The CHA shows DCLK1/2/3 sequences from diverse organisms as the display alignment. The foreground consists of 3,564 DCLK sequences while the background alignment contains 27,299 related CAMK sequences. The foreground and background alignments are shown as residue frequencies below the display alignment in integer tenths (19). The histogram (red) indicates the extent to which distinguishing residues in the foreground diverge from the corresponding position in the background alignment. Black dots indicate the alignment positions used by the BPPS (Neuwald, 2014) procedure when distinguishing DCLK sequences from related CAMK sequences. Alignment numbering is based on the human DCLK1 sequence (UniProt ID: O15075-1).
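
As an illustration of the "integer tenths" notation used for the residue frequencies in panel B, the following is a minimal sketch rather than the procedure used to generate the CHA. It assumes that a digit d denotes a frequency of at least d×10% and below (d+1)×10%, that 100% is capped at 9, and that gap characters are ignored. The function name and the example column are hypothetical.

```python
from collections import Counter

def integer_tenths(column):
    """Map each residue in one alignment column to a single digit (0-9).

    Assumed convention (not taken from the legend): digit d means the residue
    occurs in at least d*10% and fewer than (d+1)*10% of the non-gap sequences;
    100% is capped at 9 so every value fits in one character.
    """
    residues = [r for r in column if r != "-"]  # ignore gaps (assumption)
    total = len(residues)
    if total == 0:
        return {}
    counts = Counter(residues)
    # Integer arithmetic avoids floating-point rounding at bin edges.
    return {res: min((10 * n) // total, 9) for res, n in counts.items()}

# Hypothetical alignment column: 6 Val, 2 Ile, 1 Leu, 1 gap.
tenths = integer_tenths("VVVVVVIIL-")
```

In this hypothetical column, Val occurs in 6 of 9 non-gap sequences and is therefore reported as 6.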

Molecular Dynamics of DCLK1 isoforms. A-B) Microsecond MD replicates from DCLK1.1 and DCLK1.2, showing the DSSP output plotted for the C-tail, where red lines represent alpha helices and blue lines represent beta sheets. C) Distance plots from MD replicates of the phosphorylated threonine highlighting the contact distance between pT688 phosphate and G399 of the G-loop.

A) Quantitative LC-MS/MS data showing tryptic phosphopeptides identified from DCLK1.1 and DCLK1.2. Detailed are the peptide sequences, the identified sites of phosphorylation (red), the position of each site within the protein, the ptmRS score indicating confidence in phosphosite localisation, and the Mascot score for peptide identification. Fold change in the relative abundance of the two phosphopeptides in DCLK1.2 is computed with reference to these same two phosphopeptides in DCLK1.1, normalising against three non-modified peptides to account for potential differences in the amount analysed. B) DCLK1 substrate phosphorylation (reported as % total phosphopeptide in reaction) was quantified as a function of time for each of the indicated purified DCLK proteins. Assays were performed side-by-side in quadruplicate. C) As described in A, except that fold change in abundance could not be calculated for the peptide containing pThr395, given the presence of the inserted mutations and the potential differences in relative ionisation efficiency for the resulting tryptic peptide.
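
The normalisation described in panel A can be sketched as follows. This is an illustrative calculation only: it assumes that each phosphopeptide intensity is divided by the mean intensity of the three non-modified reference peptides from the same sample before the DCLK1.2/DCLK1.1 ratio is taken. The exact normalisation used for the figure is not specified in the legend, and the function names and intensity values below are hypothetical.

```python
from statistics import mean

def normalised_abundance(phospho_intensity, reference_intensities):
    """Scale a phosphopeptide intensity by the mean of the non-modified
    reference peptides from the same sample (assumed normalisation)."""
    return phospho_intensity / mean(reference_intensities)

def fold_change(phospho_test, refs_test, phospho_control, refs_control):
    """Ratio of normalised abundance in the test sample (e.g. DCLK1.2)
    to that in the control sample (e.g. DCLK1.1)."""
    return (normalised_abundance(phospho_test, refs_test)
            / normalised_abundance(phospho_control, refs_control))

# Hypothetical intensities, not data from the paper.
fc = fold_change(
    phospho_test=8.0e6, refs_test=[2.0e6, 4.0e6, 6.0e6],
    phospho_control=2.0e6, refs_control=[1.0e6, 2.0e6, 3.0e6],
)
```

With these hypothetical values the test sample carries twice as much reference peptide as the control, so a four-fold difference in raw phosphopeptide intensity reduces to a two-fold change after normalisation.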

Intrinsic disorder prediction for the DCLK1.2 C-tail using IUPred3.