Integrated systems analysis reveals conserved gene networks underlying response to spinal cord injury

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Spinal cord injury (SCI) is a devastating neurological condition for which there are currently no effective treatment options to restore function. A major obstacle to the development of new therapies is our fragmentary understanding of the coordinated pathophysiological processes triggered by damage to the human spinal cord. Here, we describe a systems biology approach to integrate decades of small-scale experiments with unbiased, genome-wide gene expression from the human spinal cord, revealing a gene regulatory network signature of the pathophysiological response to SCI. Our integrative analyses converge on an evolutionarily conserved gene subnetwork enriched for genes associated with the response to SCI by small-scale experiments, and whose expression is upregulated in a severity-dependent manner following injury and downregulated in functional recovery. We validate the severity-dependent upregulation of this subnetwork in rodents in primary transcriptomic and proteomic studies. Our analysis provides systems-level view of the coordinated molecular processes activated in response to SCI.

https://doi.org/10.7554/eLife.39188.001

Introduction

Spinal cord injury (SCI) results in impairment of motor, sensory, and autonomic systems, causing profound deregulation of almost every bodily function. The failure of large-scale clinical trials of drug therapies in acute SCI (Bracken et al., 1990; Geisler et al., 2001), and the lack of success in translating preclinical therapies to humans (Ramer et al., 2014), leaves clinicians without effective treatment options for SCI. As such, hemodynamic management and surgical decompression remain the only options to influence neurological outcomes immediately following acute SCI, typically with only marginal improvements (Hawryluk et al., 2015; Tee et al., 2017; Fehlings et al., 2012). The absence of an effective treatment for SCI reflects the complexity of the pathophysiologic mechanisms activated by central nervous system (CNS) injury. The additive effects of the immune response (Kigerl et al., 2009; Demjen et al., 2004), multiple forms of cell death (Springer et al., 1999; Crowe et al., 1997), neuronal growth suppression (GrandPré et al., 2000; Schnell and Schwab, 1990), and the formation of an inhibitory glial scar (Bradbury et al., 2002) pose a challenge to the development of new therapeutic strategies.

A major obstacle to the development of targeted therapies for SCI is the fragmentary state of our understanding of SCI pathophysiology. The response to trauma within the human spinal cord is mediated by multiple coordinated molecular pathways, yet these processes are rarely studied in an integrated manner. An additional challenge in translation of novel therapies is the reliance of clinical trials on standardized neurological assessments for patient enrolment and stratification (Fawcett et al., 2007). These measures are highly variable, operator-dependent, and may be impossible to perform in many SCI patients (Kwon et al., 2017). Systems biology approaches provide powerful means to elucidate the coordinated molecular processes underlying the pathophysiology of complex diseases (Voineagu et al., 2011; Parikshak et al., 2013; Zhang et al., 2013; Johnson et al., 2016). In particular, gene coexpression network analysis can complement reductionist descriptions of isolated gene functions by identifying networks of genes responsible for driving disease processes (Zhang and Horvath, 2005; Parikshak et al., 2015). Systems-level analyses may additionally have the potential to suggest novel biomarkers capable of stratifying injury severity and predicting functional recovery, and consequently to facilitate the translation of new therapies for acute SCI.

In the present study, we describe an integrated systems biology approach to study the pathophysiology of SCI. We systematically survey decades of biomedical literature in order to establish the complete set of genes implicated in the response to SCI by small-scale experiments. We then integrate this literature-curated gene set with unbiased gene expression data from the human spinal cord. We use weighted gene coexpression network analysis (WGCNA) to establish the normal biological processes within the healthy human spinal cord, and conduct a meta-analysis of publicly available gene expression data to define the gene regulatory network signature of the coordinated physiological response to SCI. We validate our findings at the transcriptomic and proteomic levels, and leverage the resulting systems-level understanding of SCI pathophysiology to define candidate biomarkers for stratification of injury severity and prediction of functional recovery.

Results

Systematic literature analysis identifies genes associated with response to SCI

Despite decades of study, an integrated understanding of the pathophysiological response to SCI remains elusive. This gap represents a central challenge to the development of targeted therapies for SCI. We hypothesized that such an integrated understanding could be achieved by integrating the vast corpus of SCI literature, collected by small-scale experimentation over several decades, within an unbiased, genome-wide framework. An overview of our experimental design is shown in Figure 1.

Figure 1

Download asset Open asset

Schematic overview of systems biology approach to SCI pathophysiology integrating small-scale experiments with high-throughput data.

Systematic analysis of over 500 manuscripts revealed the complete set of genes implicated in SCI pathophysiology by small-scale experiments. SCI genes were integrated with unbiased, genome-wide gene expression data from healthy human spinal cord to identify coexpressed gene subnetworks enriched for known SCI genes. Meta-analysis of SCI gene expression data revealed consensus patterns of subnetwork differential expression after SCI. The resulting consensus network signature of the response to SCI in human spinal cord was subjected to functional enrichment and cell type analyses, validated at the transcriptomic and proteomic levels, and leveraged to nominate quantitative biomarkers of SCI severity.

https://doi.org/10.7554/eLife.39188.002

As a first step, we sought to systematically establish the complete set of genes implicated in the physiological response to SCI. We conducted a systematic analysis of the SCI literature, reviewing over 500 papers, in order to reveal a set of 695 unique human genes associated with the response to SCI by small-scale experiments (Supplementary file 1). Of these genes, 559 were upregulated following SCI, 213 were downregulated, and the protein products of 8 were differentially phosphorylated. Among all genes, 151 were associated with the response to SCI by more than one study (Figure 2A). The complete set includes genes that have been associated with SCI in a wide range of experimental models of SCI, in addition to human injuries (Figure 2—figure supplement 1A); in multiple species, including human as well as rat, mouse, and rabbit (Figure 2—figure supplement 1B); using a range of experimental techniques (Figure 2B); and at a variety of time points, from 1 hr to 6 months post-injury (Figure 2—figure supplement 1C).

Figure 2 with 2 supplements see all

Download asset Open asset

Literature curation and validation of genes implicated in the physiological response to SCI by small-scale experiments.

(A) Number of small-scale studies implicating each gene in SCI pathophysiology in the LC gene set. (B) Experimental techniques used to associate LC genes with response to SCI in the LC gene set. (C) Enrichment for shared Gene Ontology terms among LC genes (all p < 10⁻¹⁵). BP, biological process; CC, cellular component; MF, molecular function. (D) Number of protein-protein interactions (PPIs) between LC genes observed in the high-confidence human interactome (Menche et al., 2015) (dotted line) and 1000 randomized interactome networks (density), revealing significant enrichment for PPIs between LC genes relative to random expectation (p < 10⁻³). (E) Size of the largest connected component (LCC) between LC genes in the high-confidence human interactome (dotted line) and 1000 randomized interactome networks (density), revealing LC genes occupy a distinct region of the human interactome (p < 10⁻³). (F) LC genes are prioritized by a disease gene prediction algorithm (Ghiassian et al., 2015) (p < 10⁻¹⁵, Kolmogorov–Smirnov test).

https://doi.org/10.7554/eLife.39188.003

Validation of literature-curated SCI genes

We validated the biological relevance of our literature-curated (LC) SCI gene set using multiple lines of evidence. First, we established that LC genes were more likely to share common biological functions than random sets of genes, using annotations from the Gene Ontology (Ashburner et al., 2000). Because functional annotations may be specific or broad, we confirmed that the enrichment held regardless of the number of genes to which each term was annotated (Figure 2C). Next, we investigated the tendency for the protein products of LC genes to physically interact. Significant enrichment for protein-protein interactions (PPIs) between LC genes was observed relative to random expectation (Figure 2D, empirical p < 10⁻³), and we reproduced this finding in multiple independent PPI databases (all p < 10⁻³, Figure 2—figure supplement 2A–C). Genes implicated in a variety of complex diseases by genome-wide association studies (GWAS) have been found to form distinct modules of densely interacting proteins within the human interactome (Ghiassian et al., 2015). We therefore evaluated whether this same principle held for SCI by calculating the size of the largest connected component (LCC) between LC genes, and found that LC genes collectively formed a significantly larger subnetwork than random expectation (Figure 2E, empirical p < 10⁻³), a finding that was again reproduced in independent interaction datasets (p < 10⁻³, Figure 2—figure supplement 2D–F). Literature-curated genes also displayed a significant tendency to participate in the same protein complexes (Figure 2—figure supplement 2J). Finally, LC genes were preferentially recovered by a disease gene prediction algorithm when a subset of them were randomly withheld, and the remainder used to prioritize additional disease genes (Figure 2F and Figure 2—figure supplement 2G–I). Thus, LC genes represent a biologically relevant and functionally coherent set of genes, which converge on a common protein interaction module within the human interactome.

Gene coexpression network analysis of human spinal cord

Multiple lines of evidence support the functional coherence of the set of genes implicated in SCI by small-scale experiments. However, these studies nonetheless have appreciable false positive and false negative rates, and are limited by sociological and experimental biases. We therefore sought to integrate knowledge from the SCI corpus within an unbiased, genome-wide framework. We hypothesized that unsupervised gene coexpression network analysis of human spinal cord would provide a powerful method to integrate these LC genes in a systems-level context, as this method has recently been powerfully applied to develop insights into the etiologies of a number of neurological (Langfelder et al., 2016; Delahaye-Duriez et al., 2016; Johnson et al., 2015; Zhang et al., 2013) or psychiatric diseases (Voineagu et al., 2011; Chen et al., 2013; Fromer et al., 2016).

We constructed gene coexpression networks in human spinal cord using RNA-seq data from 71 post-mortem human spinal cords from the Genotype-Tissue Expression project (GTEx) (GTEx Consortium, 2013). We applied WGCNA (Langfelder and Horvath, 2008) to group the human spinal cord transcriptome into 15 distinct modules of coexpressed genes (Supplementary file 2). These modules represent networks of genes that share highly related patterns of expression in the human spinal cord. In order to establish the reproducibility of these spinal cord gene expression modules in an independent dataset, we constructed a second human spinal cord gene coexpression network from public microarray data, using established techniques to control for batch effects (Leek et al., 2012; Vandenbon et al., 2016). Module conservation was quantified using the $Z_{summary}$ statistic (Langfelder et al., 2011). Despite the small sample size of our microarray-based human spinal cord coexpression network (n = 33), seven of 15 modules showed strong evidence of reproducibility ( $Z_{summary}$ > 10), with an additional two modules showing moderate evidence of reproducibility ( $Z_{summary}$ > 5) (Figure 3A). Only two of 15 modules showed no evidence of reproducibility ( $Z_{summary}$ < 2).

Figure 3 with 1 supplement see all

Download asset Open asset

Gene coexpression modules in the human spinal cord and their differential expression in SCI.

(A) Reproducibility of human spinal cord modules in a microarray dataset and conservation in mouse and rat. (B) Enrichment of M3 and M7 for LC SCI genes. (C) Robustness of M3 and M7 enrichment for LC SCI genes. (D) Eigengene network for human spinal cord modules. (E) Differential expression of spinal cord modules following SCI in five datasets, and consensus. (F) Evidence for differential expression of six consensus modules and one majority module (M8). (G) Time-dependent expression of spinal cord modules at acute, subacute, and chronic time points following SCI.

https://doi.org/10.7554/eLife.39188.006

Next, we investigated the evolutionary conservation of human spinal cord coexpression modules in mouse and rat, two of the most commonly used model organisms for studies of SCI pathophysiology. We compiled hundreds of microarray samples from mouse (n = 414) and rat (n = 267) spinal cords from the Gene Expression Omnibus, and constructed gene coexpression networks for the mouse and rat spinal cords, again using established batch effect correction methods. Five modules showed strong evidence of evolutionary conservation ( $Z_{summary}$ > 10) in both species, while another four modules showed moderate evidence of conservation ( $Z_{summary}$ > 5) in at least one species, and only two modules showed no evidence of conservation in either species ( $Z_{summary}$ < 2) (Figure 3A). Notably, the same five modules that showed the strongest evidence of reproducibility (M2, M3, M7, M8, and M12) also showed the strongest evidence of conservation in rat and mouse. Thus, at least at the systems level, the architecture of the spinal cord transcriptome is substantially conserved between human and model organisms, supporting our approach of integrating data from small-scale studies of mammalian model organisms.

In order to integrate the LC gene set with the spinal cord coexpression network, we next tested for enrichment of LC genes within each module (Figure 3B). Two modules, M3 and M7, were significantly enriched for LC genes (Fisher’s exact test, Bonferroni-corrected p = 9.5 $\times$ 10⁻⁸ and 2.7 $\times$ 10⁻³, respectively). These modules consist of 746 and 330 genes, respectively, and both are among the most reproducible and conserved in the spinal cord (Figure 3A). We confirmed the robustness of the observed enrichment by randomly removing seed genes from the LC set, and by randomly adding false positive genes to the LC set. Both M3 and M7 remained significantly enriched for LC genes despite the removal of a large number of seed genes, or the addition of a large number of random genes (Figure 3C): M3 remained significantly enriched for LC genes even after the removal of approximately 70% of genes from the seed set, compared to approximately 50% for M7. Moreover, M3 remained significantly enriched for seed genes even after the size of the literature-curated set was doubled by addition of random false positives. We also asked whether the observed enrichment was driven most strongly by any individual analytical technique or injury model, but found the majority of experimental methods, SCI models, and species contributed to the observed LC gene enrichment in M3 and M7 (Figure 3—figure supplement 1). Thus, M3 and M7 are robustly enriched for genes associated to the SCI response by small-scale studies, despite their divergent experimental designs.

Finally, to assess the relationships between modules, we constructed a module meta-network based on the eigengene of each module, defined as the first principal component of module expression (Langfelder and Horvath, 2007) (Figure 3D). In the resulting network, M3 and M7 clustered together, as would be expected given the strong correlation between their eigengenes (Spearman’s $ρ$ = 0.54, p = 1.6 $\times$ 10⁻⁶). These results suggest that the expression of these two modules in the spinal cord is highly correlated.

In summary, gene coexpression network analysis identified five highly conserved and reproducible modules, two of which are significantly and robustly enriched for LC genes, and whose expression is highly correlated.

Meta-analysis of coexpression network deregulation in SCI

We next characterized the role of M3 and M7, as well as other highly conserved coexpression modules, in the pathophysiological response to SCI. We performed a meta-analysis of five mouse and rat transcriptomic studies of SCI within the context of our spinal cord coexpression network, in order to identify consensus changes in the spinal cord transcriptome at the module level in response to SCI (Figure 3E). This analysis identified M3, M6, M7, and M11 as consensus upregulated, and M1 and M2 as consensus downregulated, following SCI. One other module, M8, was upregulated following SCI in four of five datasets, while the remaining eight modules did not show robust evidence of differential expression. Among all seven modules, M2, M3, and M7 consistently showed the strongest evidence of differential expression (Figure 3F, p $\leq$ 6.5 $\times$ 10⁻³⁶, 1.2 $\times$ 10⁻⁴⁸, and 1.6 $\times$ 10⁻¹⁴, respectively). Notably, among these modules, M2, M3, M7 were strongly conserved and reproducible in mouse, rat, and human networks ( $Z_{summary}$ > 10), whereas M1, M6, and M11 displayed only moderate evidence of conservation (2 < $Z_{summary}$ < 10), suggesting these modules may capture human-specific aspects of spinal cord transcriptome organization that are relevant in the response to SCI.

Because the pathophysiological processes underlying primary and secondary injury in SCI are incompletely understood, we additionally investigated the expression of spinal cord modules at acute, subacute, and chronic time points. Consensus module expression was remarkably consistent at all time points studied (Figure 3G). However, analysis of the temporal regulation of spinal cord modules revealed consensus downregulation of M9 at the most acute time point after SCI, but consensus upregulation at a chronic time point. These results suggest M9 may be specifically involved in the transition between acute and chronic physiological responses following SCI. Thus, by integrating gene coexpression network analysis with a meta-analysis of the SCI transcriptome, we reveal a consensus network signature associated with the response to SCI, and a network module specifically implicated in the transition from acute to chronic injury processes.

Functional characterization of consensus signature modules

We sought to characterize the biological significance of the modules implicated in the physiological response to SCI by integrating functional annotations from the Gene Ontology (Ashburner et al., 2000) and molecular signatures from MSigDB (Liberzon et al., 2011) (Supplementary file 3). To visualize statistically overrepresented gene sets, we constructed enrichment maps for each consensus signature module (Merico et al., 2010) (Figure 4A–B and Figure 4—figure supplements 1–4). To appreciate the cell type-specificity of each module, we additionally conducted a meta-analysis of transcriptomic and proteomic profiles from the major cell types of the CNS, incorporating both bulk and single-cell RNA-seq datasets (Zhang et al., 2014; Sharma et al., 2015; Cahoy et al., 2008; Zeisel et al., 2018) (Figure 4C and Figure 4—figure supplement 5). M1 was an oligodendrocyte module, associated with axon ensheathment and myelination, whereas M2 was a neuronal module implicated in synaptic transmission. M3 was enriched for markers of microglia and vascular endothelial cells, and biological processes such as inflammatory response and response to wounding, while M7 was a microglial module enriched for annotations related to the immune response. M9 was enriched for astrocyte markers and terms such as oxidation-reduction process, as well as the term central nervous system development, which may be related to its upregulation at chronic time points following SCI. M6 and M11 were not significantly associated with any specific cell type, and were enriched for terms including cellular protein modification process and mitochondrial translation, respectively.

Figure 4 with 5 supplements see all

Download asset Open asset

Biological characterization of spinal cord modules.

(A–B) Enrichment maps (Merico et al., 2010) for modules M3 and M7. (C) Meta-analysis of cell type-specific marker gene enrichment in human spinal cord modules at the transcriptomic and proteomic levels.

https://doi.org/10.7554/eLife.39188.008

Network analysis of SCI severity and recovery

The finding that M3 is a highly conserved and reproducible gene coexpression module, with the most significant enrichment for LC genes and strong evidence of upregulation following SCI, suggested that this module plays a key pathophysiological role in SCI. We focused on the role of M3 in SCI by investigating the relationship between M3 expression and two key clinical parameters in SCI: injury severity and recovery of sensory and motor function.

We first re-analysed gene expression data from a mouse model of severity-dependent injury to identify relationships between consensus module expression and injury severity (Di Giovanni et al., 2003; De Biase et al., 2005). Strikingly, M3 was the sole module enriched for genes positively correlated to injury severity, whereas M1, M2, and M9 were enriched for genes anti-correlated to injury severity (Figure 5A). We investigated this effect further by considering the correlations between module eigengenes, which provide a summary of the expression profile of each module, and injury severity. This analysis revealed that the M3 eigengene was the most strongly correlated with injury severity (Spearman’s $ρ$ = 0.79, p = 2.5 $\times$ 10⁻⁷), with a clear separation in M3 expression between the mild, severe, and sham injury groups at 7 days post-injury (Figure 5E).

Figure 5

Download asset Open asset

Relationship of spinal cord modules to injury severity and functional recovery.

(A) Enrichment of spinal cord modules for genes correlated or anticorrelated to injury severity in a mouse model. (B) Consensus network signature of SCI pathophysiology, validation in independent transcriptomic and proteomic datasets, and reversal in functional recovery and reduced axonal dieback. (C) Gene expression correlation to M3 eigengene predicts association to SCI severity. (D) Reproducibility and evolutionary conservation of spinal cord modules and their preservation at the proteomic level. (E–F) Relationship between M3 eigengene and injury severity at 7 days post-injury in a mouse model (E), and in our own RNA-seq (F) and proteomic (G) datasets. (H) Downregulation of the M3 eigengene following treatment with NT-3, a neurotrophic agent that promotes functional recovery in acute SCI. (I) Six genes classify moderate and severe injuries in transcriptomic data with 90% or greater accuracy. (J–K) Gene expression and protein abundance of annexin A1 in sham, moderate, and severe SCI.

https://doi.org/10.7554/eLife.39188.014

In order to validate the severity-dependent upregulation of M3 following SCI, we conducted a prospective experimental SCI study, using the field standard contusion injury model at the T10 segment, and performed RNA sequencing of the spinal cord parenchyma in rats subjected to moderate, severe, or sham injuries (n = 5 per group). Our RNA-seq data reproduced the consensus network signature derived from our meta-analysis of microarray datasets, emphasizing the robustness of this systems-level characterization of SCI pathophysiology (Figure 5F). In addition, we confirmed the significant association between injury severity and the M3 eigengene (Figure 5E; Spearman’s $ρ$ = 0.94, p = 4.2 $\times$ 10⁻⁷). Thus, insights into the network-level organization of the transcriptome in SCI derived from a meta-analysis of publicly available data replicate in an independently collected dataset.

Together, these results emphasized the severity-dependent upregulation of M3 following SCI, and suggested that the expression of a gene or combination of genes that accurately summarize the transcriptional status of M3 has the potential to serve as an objective biomarker of SCI severity. To evaluate the potential of such an indicator as a biomarker of injury severity, we focused on the hub genes of M3. These genes are the most central and interconnected within the module, based on their correlation to the module eigengene, and are highly enriched for functionally relevant genes such as drivers of disease pathophysiology (Voineagu et al., 2011) or therapeutic targets (Horvath et al., 2006). Consistent with these findings, the hubness of M3 genes (that is, their correlation to the M3 eigengene in human spinal cord) was significantly associated with their predictive power as a biomarker of injury severity (Figure 5D; Spearman’s $ρ$ = 0.23, p = 3.9 $\times$ 10⁻⁷). Among M3 hubs, six genes stratified rats by SCI severity with an accuracy greater than 90%, including Anxa1, Colgalt1, Ifngr2, Shc1, Sod2, and Tbc1d2b (Figure 5G). Remarkably, expression levels of Anxa1 (annexin A1) stratified moderately and severely injured rats with perfect accuracy (Figure 5I). Annexin A1 has previously been associated with SCI by three small-scale studies, each employing divergent model organisms, spinal cord levels, and injury models, emphasizing the robustness of the association between SCI and annexin upregulation (Didangelos et al., 2016; Moghieb et al., 2016; Gao et al., 2012).

While our integrative analyses of public and newly acquired transcriptomic data established a strong relationship between M3 expression and SCI severity, post-transcriptional regulation can result in marked differences between gene and protein expression levels, particularly in complex tissues such as those of the CNS (Sharma et al., 2015; Fortelny et al., 2017). To further explore the potential of M3 hubs as biomarkers of SCI severity, we therefore performed quantitative proteomic profiling of the same rat spinal cords. We first sought to establish that the overall structure of the spinal cord coexpression network was conserved between the transcriptomic and proteomic levels. Despite having limited power to detect module preservation due to the small size of our proteomic sample (n = 15), both M3 and M7 displayed highly significant evidence of reproducibility between the RNA and protein levels (Figure 5C; $Z_{summary}$ = 6.8 and 7.3, respectively). Furthermore, we identified substantial overall agreement between proteomic data and the consensus network signature derived from transcriptomic meta-analysis, further validating the robustness of our systems-level portrait of SCI pathophysiology (Figure 5F). Finally, we confirmed the severity-dependent upregulation of both the M3 eigengene and annexin A1 in particular (Figure 5H and J), finding that ANXA1 protein levels stratified both moderate and severe injuries with an accuracy of 93%. Thus, systems-level insights into SCI pathophysiology derived from integrative transcriptomic analyses extend to the proteomic level and nominate quantitative biomarkers of SCI severity.

Given the strong relationship between injury severity and M3 expression, we hypothesized that targeting the transcriptional profile of this module could represent a viable strategy for development of novel therapies for SCI. To explore this hypothesis, we analyzed gene expression data from a recent trial of a neurotrophic factor, neurotrophin-3 (NT-3), which promoted sensory and motor recovery after SCI (Duan et al., 2015; Yang et al., 2015). Remarkably, all six consensus modules derived from our meta-analysis, including M3, were differentially expressed at the lesion site in the opposite direction (Figure 5F) in rats treated with NT-3. Intriguingly, the sole other differentially expressed module was M9, which we previously observed to exhibit a strongly time-dependent expression profile, and which was enriched for genes associated with neurogenesis. In rats treated with NT-3, known for its role in neuronal differentiation, axonal growth, and chemotropic guidance (Alto et al., 2009; Anderson et al., 2016), M9 was strongly upregulated at the lesion site relative to the experimental control (p = 9.3 $\times$ 10⁻¹²). Moreover, the M3 eigengene was significantly downregulated in NT-3-treated rats relative to controls (Figure 5K; one-tailed Wilcoxon rank-sum test, p = 2.1 $\times$ 10⁻³). We additionally analysed gene expression data from transgenic STAT3 knockout mice (Anderson et al., 2016), a loss-of-function manipulation that increased axonal dieback following experimental SCI, and found all six consensus modules were again differentially expressed in the opposite direction in wild-type mice, relative to knockout mice (Figure 5B). These results indicate that reversal of the transcriptome changes observed in response to SCI is associated with functional recovery and decreased axonal dieback in rodent models, and highlight M3 expression as a predictor of functional recovery.

Discussion

The fragmentary understanding of the coordinated pathophysiological processes activated in the human spinal cord by SCI represents a central obstacle to the development of therapies capable of influencing neurological outcomes. In this study, we developed an integrated, systems-level approach to understand the molecular mechanisms underlying SCI pathophysiology. We leveraged large-scale RNA-seq data from healthy subjects to reveal gene regulatory relationships in the human spinal cord. By integrating multiple gene expression datasets from experimental models of SCI, we identified gene subnetworks implicated by consensus in the pathophysiological response to SCI, and reproduced these signatures at both the transcriptomic and proteomic levels in an animal trial. The observation that seven different gene modules were robustly associated with the response to SCI, either by consensus differential regulation (M1, M2, M3, M7, M11) or by a strongly time-dependent course of expression (M9), is consistent with the notion that the pathophysiology of SCI is highly complex (Ramer et al., 2014). Our results provide a framework to understand the diverse, coordinated processes in the spinal cord following SCI.

In order to prioritize gene subnetworks, we conducted a systematic analysis of the SCI literature, and integrated genes implicated in the SCI response by small-scale experiments into our network analysis. This approach is conceptually similar to the integration of GWAS or de novo mutation data into gene regulatory networks, as has previously been described for a number of diseases (e.g., Johnson et al., 2015; Delahaye-Duriez et al., 2016; Calabrese et al., 2017; Li et al., 2014; International Consortium for Blood Pressure GWAS (ICBP) et al., 2015). In the context of genetic analyses, the core assumption is that false positive and false negative associations between alleles and the phenotype of interest can be mitigated by identifying convergent molecular processes that mediate disease biology. In the context of literature curation, as employed here, we posit that the relatively high false-positive rates of small-scale experiments, as well as their appreciable false-negative rates, can be mitigated by unbiased integration of data from small-scale experiments into a genome-wide framework. Importantly, this experimental design provides an approach to extend gene coexpression network analysis to acquired and traumatic conditions, using samples from healthy tissues. However, a limitation of this approach is the implicit assumption that the molecular organization of the transcriptome in the relevant tissue of healthy human subjects is informative about the biological processes dysregulated by an acquired or traumatic condition. Although the analyses presented here indicate that this assumption appears to be valid in the case of SCI, future work will be needed to establish whether this principle holds in general.

A major challenge to the translation of preclinical therapies for acute SCI is the use of standardized neurological assessments to enrol and stratify patients in large clinical trials (Fawcett et al., 2007). In this context, objective biomarkers capable of accurately stratifying injury severity have the potential to facilitate translation by accelerating the pace of patient enrolment (Kwon et al., 2017; Streijger et al., 2017). We found that M3 was the sole module enriched for genes whose expression correlated with injury severity in a mouse model, and that its eigengene was likewise most strongly associated with severity. We subsequently reproduced this correlation in our own transcriptomic and proteomic datasets. The severity-dependent upregulation of M3 following SCI, and its preservation at the proteomic level, suggests that its expression has the potential to stratify injury severity in a clinical context. Furthermore, this expression pattern was reversed with administration of NT-3, a treatment that promotes motor and sensory recovery (Yang et al., 2015). These findings have several implications for the discovery and translation of new SCI therapies. The identification of drugs that reverse transcriptional changes associated with SCI has the potential to provide a new strategy for preclinical lead discovery. Moreover, analysing the effect of a desired treatment on M3 expression, or our consensus network signature more generally, may represent an effective technique to validate the efficacy of preclinical therapies.

Among M3 hub genes, which reflect the expression of the entire module, we found that both the RNA and protein levels of Anxa1 (annexin A1) demonstrated a strong ability to discriminate between injuries of different severities. Annexin A1 is a member of the annexin superfamily of calcium dependent phospholipid-binding proteins, and plays a role in mediating anti-inflammatory effects through inhibition of phospholipase A2 activity (Elderfield et al., 1993; Liu et al., 2007), decreasing leukocyte activation (Perretti and Flower, 1993; D'Acquisto et al., 2007) and reducing expression of pro-inflammatory cytokines (Sudlow et al., 1996; McArthur et al., 2010). Anxa1 is primarily expressed in microglia, where it regulates the selective removal of apoptotic neurons (McArthur et al., 2010). Correspondingly, Anxa1 knockout mice are characterized by exaggerated inflammatory responses, as well as a blunted response to the anti-inflammatory effects of glucocorticoids (Hannon et al., 2003). Anxa1 is upregulated in multiple diseases characterized by aberrant neuroinflammation (Elderfield et al., 1992; Elderfield et al., 1993; McArthur et al., 2010). Importantly, multiple studies have previously reported upregulation of Anxa1 in SCI (Didangelos et al., 2016; Moghieb et al., 2016; Gao et al., 2012), with peak expression at 7 days post injury (Liu et al., 2004), and upregulation of Anxa1 is associated with functional recovery after SCI (Liu et al., 2007). Notably, Anxa1 was previously identified as a biomarker of SCI severity in a study that included both rat and human samples (Moghieb et al., 2016). Our independent finding here that Anxa1 is a strong candidate for a severity-dependent biomarker of SCI suggests that our systems-level approach can drive rational selection of novel potential biomarkers. However, although we observed substantial conservation of M3 between human and rat at the systems level, this finding does not preclude the possibility that individual genes diverge in their expression following acute SCI between human and rodents. Further studies in humans are therefore needed to conclusively establish the validity of Anxa1 as a biomarker of SCI severity.

In summary, our systems biology approach identifies evolutionarily conserved and reproducible gene subnetworks with robust evidence for differential regulation following SCI, and provides a genome-wide view of the pathophysiological processes triggered by SCI. Our findings provide new, data-driven strategies to identify and translate novel therapies for SCI.

Share this article

Cite this article

Schematic overview of systems biology approach to SCI pathophysiology integrating small-scale experiments with high-throughput data.

Literature curation and validation of genes implicated in the physiological response to SCI by small-scale experiments.

Gene coexpression modules in the human spinal cord and their differential expression in SCI.

Biological characterization of spinal cord modules.

Relationship of spinal cord modules to injury severity and functional recovery.

Author details

Jordan W Squair

Contribution

Contributed equally with

Competing interests

Seth Tigchelaar

Contribution

Competing interests

Kyung-Mee Moon

Contribution

Competing interests

Jie Liu

Contribution

Competing interests

Wolfram Tetzlaff

Contribution

Competing interests

Brian K Kwon

Contribution

Competing interests

Andrei V Krassioukov

Contribution

Competing interests

Christopher R West

Contribution

Competing interests

Leonard J Foster

Contribution

Competing interests

Michael A Skinnider

Contribution

Contributed equally with

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organisms