The tree was constructed using Neighbor-Joining method based on nucleotide sequences. Shrew was used as the out-group. Gerbil was collected for metabolite data only and mouse was included as …
Species and samples used in the current study.
(A) Species and traits information. Life history traits of adult weight (AW, in grams), maximum lifespan (ML, in years), and female time to maturity (FTM, in days) of these species were obtained from Anage database (Tacutu et al., 2013). Since the life history data were not available for meadow vole, the data of a related species Microtus arvalis were used instead. Maximum lifespan residual (MLres) and female time to maturity residuals (FTMres) were computed based on the allometric equations MLres = ML/(4.88×AW0.153) and FTMres = FTM/(78.1×AW0.217), respectively. (B) RNA sequencing and read mapping to ortholog sets. Read mapping statistics are based on STAR. De novo assembly was performed by Trinity. (C) Read mapping to publically available genomes. For the species with publicly available genomes, the reads were also aligned to the full genomes for mapping rate comparison. The numbers of annotated genes were based on the published annotations. (D) Metabolite profiling. Metabolite profiling was performed on selected species only.
(A) Pipeline to obtain the species-specific ortholog sets and expression values. See Materials and methods or a more detailed description of the methodology. (B) Sequence identity of ortholog sets …
(A) Percentage of ortholog sets filled up using consensus. Horizontal axis indicates the percentage of sequence length filled up by consensus. For example, 74% of the ortholog sets did not require …
(A) Projection of the first three Principal Components (PCs) in Principal Component Analysis. Values in parenthesis indicate percentage of variance explained by each of the PCs. Points are colored …
The lines represent interaction based on STRING database (mouse genes). Selected gene names are colored based on the enriched pathways (see Table 1). Only the connected nodes are shown.
(A) Pnkp and (B) Nadsyn1 show positive correlation with the longevity traits. (C) Trp53, (D) Bax, (E) Mapk1, and (F) Jund show negative correlation with the longevity traits. In each plot, the gene …
(A) Projection of the first three Principal Components (PCs) in Principal Component Analysis. Values in parenthesis indicate percent of variance explained by each of the PCs. Points are colored by …
Each point represents a different species of bird (green triangles) or non-human primate (orange circles), with linear regression lines shown separately for each group of species. Data for human …
Pathway enrichment analysis of genes with significant correlation with the longevity traits.
The genes were supported by at least two longevity traits (p value.robust < 0.01 and p value.max < 0.05). Pathway enrichment was performed using DAVID. The percentages of positive or negative correlating genes belonging to each pathway were indicated in parentheses. Only selected pathways are shown here. GO (BP): Gene Ontology (Biological Process). GO (BP): Gene Ontology (Molecular Functions). SP/PIR: SwissProt and Protein Information Resource. See Table 1—source data 1 for more details.
Annotation cluster | Enriched terms and genes | No. of genes | p Value |
---|---|---|---|
Positive Correlation Cluster No. 1 (15%) | GO (MF): adenyl nucleotide binding | 50 | 5.25 × 10−3 |
GO (MF): nucleotide binding | 64 | 1.21 × 10−2 | |
Acly, Atad2, Atp2b4, Cdk2, Cdk20, Chd7, Chek1, Chkb, Cpsf7, D2hgdh, Dgkq, Dhx58, Dock6, Ero1lb, Etnk1, Fastkd5, Fn3krp, Gnai1, Guk1, Hk1, Hmgcr, Hnrnpd, Hyou1, Insr, Madd, Map4k5, Mastl, Mlkl, Mov10, Msh6, Mx2, Nadsyn1, Oplah, Pdk1, Pfkp, Phka2, Phkg2, Pkmyt1, Pms2, Pnkp, Ppp2r4, Prkar1b, Qrsl1, Rbm10, Rbm15b, Rbm38, Rhot2, Rnasel, Rps6ka2, Sacs, Sirt3, Slirp, Smarca1, Smarca5, Srsf9, Stk19, Stk36, Tbrg4, Tesk2, Thnsl1, Tia1, Top3a, Trpm4, Ttf2, Tyk2, Vps4a, Ythdc2 | |||
Positive Correlation Cluster No. 2 (4%) | SP/PIR: DNA damage | 14 | 1.16 × 10−3 |
SP/PIR: DNA repair | 12 | 4.25 × 10−3 | |
GO (BP): cellular response to stress | 16 | 1.01 × 10−1 | |
Bnip3, C17orf70, Chek1, Dtx3l, Ercc1, Errfi1, Fancg, Hif1a, Mapkbp1, Msh6, Myd88, Pms2, Pnkp, Prdx3, Prpf19, Pttg1, Rad51b, Rif1, Rnaseh1, Slx4, Tdp2, Terf1, Tinf2, Top3a, Wrap53 | |||
Positive Correlation Cluster No. 4/5 (4%) | GO (BP): glucose metabolic process | 11 | 1.22 × 10−3 |
GO (BP): hexose metabolic process | 11 | 5.68 × 10−3 | |
GO (BP): generation of precursor metabolites and energy | 15 | 4.59 × 10−3 | |
Aldh5a1, Atp2b4, Atp6v0d1, Atp6v0e2, Ero1lb, Fads1, Gbe1, Gpi1, Hexa, Hk1, Insr, Ndst1, Ndufa8, Pdk1, Pfkp, Pgp, Phka2, Phkb, Phkg2, Prkar1b, Sdhaf3, Tmx4, Tpi1, Trpm4, Tsc2 | |||
Positive Correlation Cluster No. 6 (4%) | SP/PIR: chromatin regulator | 11 | 1.61 × 10−2 |
GO (BP): chromosome organization | 17 | 2.22 × 10−2 | |
Bnip3, Cenph, Chd7, Dtx3l, Ercc1, H2afv, Hdac2, Jade1, Kdm5d, Kmt2c, Pttg1, Rcor1, Rrp8, Smarca1, Smarca5, Smyd3, Terf1, Tinf2, Wdr5, Wrap53 | |||
Negative Correlation Cluster No. 1 (9%) | GO (BP): modification-dependent protein catabolic process | 27 | 2.39 × 10−4 |
SP/PIR: ubiquitin conjugation pathway | 26 | 3.35 × 10−4 | |
GO (BP): proteolysis | 36 | 1.09 × 10−2 | |
Adamts2, Agtpbp1, Anapc4, Atg10, Atg4a, Atg7, Btbd1, Ctsl, Ctsz, Dcaf10, Dda1, Dpp8, Fbxl17, Fbxl20, Fbxo18, Fbxw2, Kcmf1, Map1lc3b, Med8, Mmp2, Mycbp2, Oma1, Pcsk5, Pgpep1, Pmepa1, Ppp2r5c, Rad18, Rfwd2, Rnf14, Rnf2, Rnf6, Sumo3, Tpp2, Ube2b, Ube2v1, Ufm1, Vhl | |||
Negative Correlation Cluster No. 2 (9%) | GO (BP): protein localization | 38 | 4.67 × 10−5 |
GO (BP): protein transport | 34 | 7.99 × 10−5 | |
Agap1, Akap7, Ap3d1, Atg10, Atg4a, Atg7, Bax, Cav1, Clpx, Cnih1, Col4a3bp, Cry2, Dirc2, Ergic2, Fdx1l, Fkbp15, Gabarapl2, Gdi2, Gm10273, Golt1b, Hspa9, Ift46, Ipo4, Kif1bp, Kpna4, Laptm4a, Lrp4, mt-Nd4, Mtch1, Ndel1, Ndufb11, Necap1, Ppp3ca, Rab18, Rab2a, Rab6a, Rhot1, Sar1a, Sec22a, Sec31a, Sec62, Slc25a12, Slc29a1, Slc33a1, Slc35a4, Snx12, Snx13, Stx17, Timm8a1, Tomm6, Trappc6b, Trp53, Tsg101, Vps36, Vps53, Ywhag | |||
Negative Correlation Cluster No. 3 (18%) | GO (BP): regulation of transcription | 74 | 1.62 × 10−5 |
SP/PIR: transcription regulation | 55 | 1.04 × 10−3 | |
Actl6a, Agtpbp1, Ak6, Anp32a, Anp32e, Atf6b, Bckdha, Bmi1, Ccdc59, Cd3eap, Cdc5l, Cggbp1, Clk2, Cnbp, Cops7a, Crtc3, Cry2, Csrp2, Ebna1bp2, Ehmt2, Elk4, Ergic2, Fbxo18, Fip1l1, Fosb, Foxo3, Gatad2b, Gid8, Gmcl1, Gtf2h1, Gtf2h2, Gtf2h5, Harbi1, Hlx, Hmga1-rs1, Hnrnpab, Hnrnpf, Ift57, Ing2, Ints4, Ipo4, Jund, Klf11, Klf2, Klf4, Klf9, Kpna4, Mafb, Mapk1, Mdm4, Med16, Med17, Med31, Med8, Mef2a, Mettl8, Mmp2, Mnt, Morf4l2, Mta1, Mtdh, Mxd1, Mycbp2, Nabp2, Ncor2, Neo1, Nfe2l2, Nr1d2, Papd4, Parp2, Phf12, Phlpp1, Pkig, Pomp, Pop5, Ppp1r8, Ppp2r5c, Ppp3ca, Ptbp1, R3hdm4, Rab18, Rad18, Rbbp4, Rfwd2, Rnf14, Rnf2, Rnf6, Rps6ka4, Rrs1, Sap30l, Sav1, Scoc, Sfmbt1, Sin3b, Snrk, Sqstm1, Srpk2, Ssbp1, Tep1, Tgfbr3, Trim35, Trip6, Trp53, Tsg101, Ube2b, Ube2v1, Ubtf, Ufm1, Vhl, Vps36, Wiz, Xrcc5, Yeats4, Zbtb14, Zfp414, Zfp637, Zfp655, Zfp710, Zfp821 |
Phylogenetic regression of gene expression against longevity traits.
Regression against (A) Adult Weight; (B) Maximum Lifespan; (C) Female Time to Maturity; (D) Maximum Lifespan Residual; and (E) Female Time to Maturity Residual. ‘coef.all’, ‘p value.all’, and ‘q value.all’ refer to the regression slope, p value, and FDR-adjusted q value using all the species. ‘p value.robust’ and ‘q value.robust’ refer to the statistics after removing the potential outlier point. ‘p value.max’ and ‘q value.max’ refer to the maximal (least significant) regression p value and q value when each one of the species was left out, one at a time. Only genes with p value.robust<0.01 and p value.max<0.05 are shown. (F) Top hits identified by two or more longevity traits. The p value.robust against each of the four longevity traits (ML, FTM, MLres, and FTMres) as well as adult weight (AW) are shown. These genes were the input for pathway enrichment analysis. Pathway enrichment analysis of genes showing (G) positive and (H) negative correlation with longevity traits. Enrichment was performed using DAVID with default settings. Only the top 10 clusters are shown. (I) System level analyses of gene functions. The numbers of shared genes between longevity associated genes (either positively or negatively or both) and human aging genes, essential genes, transcription factor genes, and housekeeping genes are shown. The enrichment p value was calculated by Fisher’s exact test with two different background gene sets.
Amino acid levels showing consistent positive correlation with longevity traits.
For the mammalian fibroblast dataset, the number of longevity traits (out of Maximum Lifespan; Female Time to Maturity; Maximum Lifespan Residual; and Female Time to Maturity Residual) with significant positive correlation with the amino acid levels at two different cut-offs (p value.robust < 0.01 and p value.robust < 0.05) are shown. For the primate and bird fibroblast dataset, the regression was performed using primate data only, bird data only, and the pooled data of both. The regression slope p value < 0.05 are in bold.
Amino acid | Mammalian fibroblasts | Primate and bird fibroblasts | ||||||
---|---|---|---|---|---|---|---|---|
No. of longevity traits (out of four) with significant correlation | Regression slope p value with species maximum lifespan | Regression slope p value with species maximum lifespan residual | ||||||
p value.robust < 0.01 | p value.robust < 0.05 | Primates only | Birds only | Primates and birds | Primates only | Birds only | Primates and birds | |
arginine | 3 | 4 | 3.4 × 10−2 | 8.6 × 10−2 | 3.1 × 10−2 | 3.8 × 10−1 | 1.1 × 10−2 | 2.1 × 10−2 |
glutamate | 2 | 4 | 6.5 × 10−2 | 1.8 × 10−2 | 1.1 × 10−2 | 4.6 × 10−2 | 2.8 × 10−1 | 1.3 × 10−1 |
histidine | 0 | 4 | 9.4 × 10−2 | 6.0 × 10−2 | 4.3 × 10−2 | 2.3 × 10−1 | 1.4 × 10−1 | 1.7 × 10−1 |
leucine | 2 | 4 | 2.9 × 10−3 | 6.0 × 10−2 | 4.8 × 10−3 | 1.4 × 10−2 | 5.9 × 10−1 | 2.3 × 10−1 |
lysine | 3 | 3 | 9.8 × 10−3 | 8.2 × 10−2 | 1.4 × 10−2 | 9.1 × 10−2 | 2.9 × 10−1 | 2.5 × 10−1 |
methionine | 1 | 3 | 3.2 × 10−1 | 1.4 × 10−2 | 2.7 × 10−2 | 3.0 × 10−1 | 3.0 × 10−2 | 4.9 × 10−2 |
phenylalanine | 1 | 4 | 9.8 × 10−3 | 1.2 × 10−3 | 2.1 × 10−4 | 8.2 × 10−2 | 1.3 × 10−1 | 1.2 × 10−1 |
proline | 1 | 4 | 4.4 × 10−3 | 3.9 × 10−4 | 3.6 × 10−5 | 3.5 × 10−2 | 1.2 × 10−1 | 5.4 × 10−2 |
tryptophan | 2 | 4 | 9.2 × 10−3 | 7.8 × 10−4 | 1.2 × 10−4 | 2.6 × 10−2 | 2.5 × 10−1 | 1.5 × 10−1 |
tyrosine | 1 | 3 | 3.2 × 10−1 | 8.8 × 10−3 | 1.8 × 10−2 | 4.3 × 10−1 | 1.7 × 10−1 | 2.9 × 10−1 |
valine | 0 | 3 | 1.2 × 10−2 | 5.4 × 10−3 | 1.0 × 10−3 | 2.0 × 10−1 | 2.8 × 10−1 | 3.2 × 10−1 |
Phylogenetic regression of metabolite levels against longevity traits.
Regression against (A) Adult Weight; (B) Maximum Lifespan; (C) Female Time to Maturity; (D) Maximum Lifespan Residual; and (E) Female Time to Maturity Residual. ‘coef.all’, ‘p value.all’, and ‘q value.all’ refer to the regression slope, p value, and FDR-adjusted q value using all the species. ‘p value.robust’ and ‘q value.robust’ refer to the statistics after removing the potential outlier point. ‘p value.max’ and ‘q value.max’ refer to the maximal (least significant) regression p value and q value when each one of the species was left out, one at a time. Only genes with p value.robust < 0.01 and p value.max < 0.05 are shown. (F) Top hits identified by two or more longevity traits. The p value.robust against each of the four longevity traits (ML, FTM, MLres, and FTMres) as well as adult weight (AW) are shown. These metabolites were the input for pathway enrichment analysis. Pathway enrichment analysis of metabolites showing (G) positive and (H) negative correlation with longevity traits. Enrichment was performed based on hypergeometric statistics. (I) Top hits identified by two or more longevity traits, using cut-off of p value.robust < 0.05. The p value.robust against each of the four longevity traits (ML, FTM, MLres, and FTMres) as well as adult weight (AW) are shown.
Gene expression values.
(A) Raw counts. (B) log10 normalized values.
Metabolite levels.
(A) Raw values. (B) log10 normalized values.