Rewiring the bone marrow: Evolution and the transcriptional architecture of trained immunity

  1. Sarah J Sun  Is a corresponding author
  2. Raúl Aguirre-Gamboa
  3. Luis B Barreiro  Is a corresponding author
  1. Committee on Immunology, University of Chicago, United States
  2. Medical Scientist Training Program, University of Chicago, United States
  3. Section of Genetic Medicine, Department of Medicine, University of Chicago, United States
  4. Department of Human Genetics, University of Chicago, United States
  5. Committee on Genetics, Genomics, and Systems Biology, University of Chicago, United States
  6. Chan Zuckerberg Biohub Chicago Investigator, United States

Abstract

The epigenetic adaptation of innate immune cells to inflammatory stimuli, or trained immunity, represents an evolutionarily conserved feature of host defense. Recent advances have revealed that such adaptations can occur at the level of hematopoietic stem and progenitor cells, resulting in long-lasting epigenetic reprogramming of the immune system. However, a comprehensive mechanistic understanding of these processes remains incomplete, limiting our capacity to predict or therapeutically manipulate the adaptive capacity of hematopoiesis. In this review, we survey the current literature to support a model of hematopoietic memory whose stimulus-specific nuances are shaped by specific cytokine environments and driven by the combinatorial activity of key transcription factors. Comparative analyses underscore the evolutionary conservation and essential biological roles of these factors, suggesting that trained immunity may reflect the strategic repurposing of ancient transcriptional programs for the purpose of enhancing host defense.

Introduction

The innate immune system is the oldest branch of the broader human immune system and the sole source of immunity in most living species, including plants and invertebrates (Pancer and Cooper, 2006). In humans, quintessential innate immune cells, such as macrophages and dendritic cells (DCs), detect and ultimately eliminate pathogens via the expression of fixed, germline-encoded pathogen recognition receptors (PRRs), which confer innate immune cells with the ability to broadly recognize and phagocytose pathogenic microorganisms while preserving self-tolerance. Adaptive immune cells, such as T-cells, are more complex. They recombine germline-encoded sequences to generate an essentially infinite number of unique receptors with exquisite specificity. T-cells also exist in different states, such as ‘naïve’, ‘effector’, and ‘memory’, and the activation of a naïve cell by its specific antigen induces the permanent differentiation of the T-cell clone into a memory state via the induction of complex transcriptional, metabolic, and epigenetic remodeling.

Traditionally, this capacity for durable, antigen-specific memory was considered exclusive to the adaptive immune system. However, the field has evolved to appreciate the spectrum-like nature of memory (when loosely defined as short or long-term changes in immune cells that result from transient stimuli). All immune cells exhibit some ability to adapt, but the duration and specificity of these adaptations vary. Certain innate-like lymphocytes—including invariant natural killer T cells, mucosal-associated invariant T cells, and γδ T-cells—exhibit memory-like features despite utilizing a limited range of antigen receptors (Pellicci et al., 2020), blurring the classical boundaries between innate and adaptive immunity and highlighting the role of multiple types of memory in the immune response.

The expanding view of immune memory stems largely from discoveries made in the late 2000s and early 2010s, which revealed that classical innate immune cells can undergo short-term epigenetic, metabolic, and transcriptional reprogramming—typically lasting around 7 days—following PRR stimulation (Cheng et al., 2014; Foster et al., 2007; Quintin et al., 2012). These changes, in some cases, led to heightened inflammatory responses upon subsequent, nonspecific challenges (Cheng et al., 2014; Quintin et al., 2012). The term ‘trained immunity’ was coined in 2011 (Netea et al., 2011) to describe this phenomenon, which had long been observed in plants and invertebrates (Gourbal et al., 2018; Netea et al., 2019). Organisms as primitive as bacteria exhibit memory-like immunity through the CRISPR-Cas system, which incorporates viral DNA fragments into their genome to ‘remember’ and target future infections (Gourbal et al., 2018). In plants, a system called systemic acquired resistance enables adaptation to pathogens through epigenetic and transcriptional mechanisms (Gourbal et al., 2018). Evidence of innate immune memory has also been reported in organisms, such as mosquitoes and copepods, both of which rely exclusively on innate immunity (Gourbal et al., 2018; Netea et al., 2019). That such innate memory also existed within vertebrates was subsequently more formally demonstrated in human monocytes stimulated ex vivo with β-glucan (Cheng et al., 2014; Quintin et al., 2012) and shown to have an epigenetic and metabolic basis. These findings supported a broader plasticity across the immune system and, thus, a more complex evolutionary model in which the capacity for epigenetic adaptation to environmental stimuli is a basic feature of all immune cells across species, while the evolution of antigen-specific receptors marks a key innovation of adaptive immunity in vertebrates (Gourbal et al., 2018).

While in vitro studies provided evidence that short-lived innate immune cells could acquire nonspecific memory, their limited lifespan raised questions about the durability of such responses. This led to the hypothesis that long-term innate immune memory must be encoded in long-lived cell populations, such as tissue-resident innate cells and/or hematopoietic stem and progenitor cells (HSPCs). In 2018, several pivotal studies confirmed this hypothesis, showing that HSPCs can undergo persistent epigenetic, metabolic, and transcriptional reprogramming in response to inflammatory stimuli. These rewired HSPCs are capable of harboring molecular signatures of memory long-term through their ability to self-renew and are thought to continuously give rise to monocytes and neutrophils with enhanced innate immune responsiveness (Christ et al., 2018; Kaufmann et al., 2018; Mitroulis et al., 2018). This introduced a new conceptual framework: short-term, peripheral trained immunity (within mature innate immune cells) vs. a more long-term hematopoietic reprogramming wherein molecular adaptations encoded in long-lived stem and progenitor cells can be passed on to their mature immune progeny. Such progeny, in turn, often appear to harbor the classical characteristics of peripherally trained cells (Netea et al., 2020) and are capable of establishing long-term residence within inflamed tissues, further contributing to the longevity of innate immune memory (Patel et al., 2021). Together with a growing body of evidence that long-lived, self-renewing yolk-sac-derived tissue resident macrophages can harbor epigenetic memory of infections (Patel et al., 2021), the influx of reprogrammed bone-marrow-derived macrophages into inflamed tissues serves to fundamentally rewire innate immune responses within those tissues. This phenomenon is sometimes referred to as ‘central trained immunity’, although in this review, we will use the looser term of hematopoietic reprogramming to broaden our discussion to studies that, in our view, do not meet strict criteria for central trained immunity (see below for a more extensive discussion about terminology).

Extensive reviews have covered the triggers and mechanisms underlying peripheral trained immunity (Bekkering et al., 2018; Chavakis et al., 2022; Divangahi et al., 2021; Netea et al., 2020; Netea et al., 2016; Vuscan et al., 2024), as well as the growing body of work surrounding central trained immunity (Chavakis et al., 2022; Chavakis et al., 2019; Divangahi et al., 2021). However, a unified framework capable of explaining and predicting hematopoietic reprogramming across different contexts remains elusive. In this review, we aim to contribute to this ongoing discussion by focusing on memory within the hematopoietic system. In the first section, we adopt a mechanistic perspective to explore how specific stimuli can durably rewire hematopoiesis, altering innate immune output and function—emphasizing the roles of cytokine signaling and the activation of pioneering transcription factors (TFs). In the second section, we take a broader evolutionary and comparative view, highlighting the deep conservation of transcriptional machinery across species. We also explore emerging evidence of species-specific differences in hematopoietic memory mechanisms, particularly between mice and humans, and consider how these insights might guide future research in the field.

Terminology: hematopoietic reprogramming

The term ‘(peripheral) trained immunity’, since first being coined in 2011, has classically been used to refer to the ability of a primary stimulus to directly reprogram innate immune cells to mount stronger cytokine responses to diverse secondary stimuli as compared to their naïve counterparts (Netea et al., 2011). This definition highlights the difference between trained immunity and the related phenomenon of innate immune tolerance in which strong lipopolysaccharide (LPS) stimulation leads to a temporary state of innate immune hypo-responsiveness to secondary stimulation (Foster et al., 2007; Novakovic et al., 2016). Another requirement is that cells return to a transcriptional baseline in between primary and secondary stimuli, retaining only epigenetic signatures imprinted by the primary stimulus to keep the chromatin in a more accessible, ‘ready’, state (Divangahi et al., 2021). Failure to return to a transcriptional baseline in between the primary and secondary stimulus has in some of the literature been termed ‘priming’ to distinguish it from the more strict definition of ‘training’ (Mishra and Ivashkiv, 2024).

Defining trained immunity of the hematopoietic system has proven to be more complex. While stimulus-induced changes within HSPCs do often lead to the production of innate immune cells which appear classically ‘trained’, the HSPCs themselves, from which these innate cells are derived, often retain residual changes in gene expression, TF activity, cell abundances, and lineage biases, even long after presumed clearance of the initiating stimulus. Relying on in vivo systems, it then also becomes difficult to determine whether residual inflammation from the primary stimulus may exist, or whether the residual changes in HSPCs represent a true, independent, rewiring of their transcriptional circuitry. Not surprisingly, existing studies have often used diverse indices of what it means for the bone marrow to be ‘trained’. Although the studies and examples discussed in this review have been categorized as examples of central trained immunity, we will from here on use the broader term hematopoietic reprogramming to acknowledge the fact that not all studies meet the definition of trained immunity in its strictest sense. Hematopoietic reprogramming (as a substitute for central trained immunity) will be used here to refer to long-lasting changes (this may include any combination of epigenetic, transcriptional, metabolic, or other persistent alterations) in HSPCs persisting in the absence of detectable levels of the primary stimulus, and leading to enhanced downstream innate immune cell function, composition, or responses to secondary stimuli.

Cytokine-induced EM is integral to the establishment of hematopoietic reprogramming

Emergency myelopoiesis (EM) is believed to be an essential component of hematopoietic reprogramming. In general terms, it is defined as a reactive homeostatic response of HSPCs to increased myeloid cell demand (Manz and Boettcher, 2014). EM is therefore a central feature of systemic bacterial infections (Manz and Boettcher, 2014), and viral infections, such as COVID-19 (Schulte-Schrepping et al., 2020), which induce widespread inflammation and the depletion of neutrophils as they infiltrate into tissues. Noninfectious, iatrogenic causes such as myeloablative chemotherapy can also act as triggers of HSC proliferation and myeloid cell regeneration (Pietras et al., 2016), demonstrating that the mere act of myeloid cell depletion in the absence of infection is sufficient to drive this response. Mechanistically, peripheral signals of myeloid cell demand cause usually dormant, pluripotent long-term HSCs (LT-HSCs) to expand and give rise to myeloid-biased multipotent progenitors (MPPs). EM also involves the entry of normally quiescent lineage-committed granulocyte-monocyte precursors (GMPs) into the cell cycle (Swann et al., 2024). These events are largely thought to be coordinated by the action of key cytokines, including G-CSF, GM-CSF, M-CSF, IL6, IL-1β, TNFα, TGFβ, and type I and type II interferons (Manz and Boettcher, 2014; Park et al., 2024; Pascutti et al., 2016; Swann et al., 2024), although other cytokines, including IL4, have also been shown to coordinate EM under certain circumstances (LaMarche et al., 2024). Signaling by cytokines, either in combination or in isolation, results in cell proliferation at multiple points in the hematopoietic hierarchy, as well as the activation of lineage-determining TFs such as PU.1 (Heinz et al., 2010) and CEBPβ (Hirai et al., 2006), which bias stem cells toward the myeloid lineage (de Laval et al., 2020; Pietras et al., 2016; Yamashita and Passegué, 2019).

We believe that these processes, usually perceived as transient changes enabling the temporary mobilization of monocytes and granulocytes in times of need, are essential in the induction of persistent hematopoietic reprogramming and that cytokine-induced EM serves as a necessary, but likely not sufficient, trigger of hematopoietic reprogramming. In support of this hypothesis, our review of the literature (see examples below) has consistently demonstrated that an initial EM response precedes hematopoietic reprogramming. Most stimuli lead to the upregulation of at least one identifiable cytokine among IL1β, IFNγ, or IL6 (Chavakis et al., 2022; Chavakis et al., 2019; Zhao et al., 2014)—one possible exception being LPS which may directly act on HSCs via Toll-like receptor signaling (Chavakis et al., 2019; de Laval et al., 2020). Even so, TLR ligation is believed to lead to the production of cytokines such as IL6, which feedback to partially instruct EM via paracrine signaling (Chavakis et al., 2019; Zhao et al., 2014). In general, we find that differences in hematopoietic reprogramming between stimuli can be, to some extent, predicted by the cytokines they produce (Table 1).

Table 1
Cytokine-induced hematopoietic reprogramming.

The table includes stimuli of hematopoietic reprogramming from studies for which cytokines were directly investigated. The phenotype-defining cytokine is bolded. Rows are colored according to cytokine (red—IL1β, yellow—IL6 ± IFN, blue—IFN in the absence of IL6). Stimuli that induce similar cytokine repertoires also produce similar reprogramming effects.

StimulusSpecies studiedTypeKey cytokines inducedPersistent reprogramming outcomeReferences
β-GlucanMiceInfectious mimic (PRR agonist) and inflammasome activatorIL-1β, G-CSFInherent LT-HSC myeloid bias; trained EM response to chemotherapyMitroulis et al., 2018
LIPMiceSterile inflammationIL-1β, G-CSFInherent LT-HSC myeloid bias; trained EM response to LPS;
primed monocyte and neutrophil TNFα/IL-6 secretion upon LPS challenge
Li et al., 2022
BCG vaccinationMiceLive attenuated infectionIFN-γIncreased macrophage antimicrobial activity; IFN-γ dependentKaufmann et al., 2018
COVID-19 infectionHumanViral infectionIL-6Lasting monocyte chromatin remodeling and priming; persistent GMP expansion; reversed by IL-6 blockadeCheong et al., 2023
Western dietMiceMetabolic triggerIL-6, IFN-γ, IL-1β, G-CSF, GM-CSFLasting monocyte priming; GMP chromatin remodeling, differential gene expression, monocyte skewingChrist et al., 2018

β-Glucan and ligature-induced periodontitis (LIP), for example, are seemingly unrelated mouse models of hematopoietic reprogramming. Both, however, induce IL1β and similar hematopoietic reprogramming phenotypes. β-Glucan and LIP both induce a persistent myeloid bias at the level of LT-HSCs that is transplantable into naïve recipients and prime the hematopoietic system to more rapidly increase myeloid cell production in response to secondary challenges with either chemotherapy or LPS. Blockage of IL1β signaling abrogates the reprogramming of long-lived myeloid bias in response to β-glucan (Mitroulis et al., 2018) and in the model of LIP also reduces the primed secretion of TNFα and IL6 by LIP-bone marrow-derived neutrophils and monocytes in response to LPS (Li et al., 2022). In further support of the essential role of IL1β in encoding these responses at the level of LT-HSCs, studies have shown that IL1β itself is sufficient to drive the expression of PU.1, as well as macrophage colony-stimulating factor (M-CSF) and granulocyte-macrophage colony-stimulating factor (GM-CSF) within long-term hematopoietic stem cells (LT-HSCs) to drive an intrinsic myeloid bias (Pietras et al., 2016) at the very apex of the hematopoietic hierarchy.

Other stimulus-cytokine pairs further support the notion that the action of cytokines dictates hematopoietic reprogramming. In contrast to IL1β, inducers of IFNγ and IL6 appear to generate a hematopoietic reprogramming phenotype that more predominantly involves monocyte- (rather than neutrophil) skewed myeloid cell production and GMP/monocyte-centric reprogramming.

For example, vaccination with Bacillus Calmette-Guérin (BCG), a live attenuated version of Mycobacterium bovis, induces hematopoietic reprogramming in a manner that is dependent on IFNγ (Kaufmann et al., 2018). In line with IFNγ’s intrinsic ability to polarize the bone marrow toward monocyte over granulocyte production, the main hematopoietic reprogramming phenotype of BCG is the production of macrophages with an increased ability to resist Mycobacterium tuberculosis infection in vitro even in mice treated with antibiotics, an effect which is lost completely in IFNγR KO mice (Kaufmann et al., 2018).

COVID-19 induces hematopoietic reprogramming via a mechanism that appears to involve IL6. Often acting in concert with IFNγ, IL6 acts directly on MPPs by downregulating production of the TFs Cebpa and Runx1, thus further skewing downstream GMPs toward the monocyte lineage (de Bruin et al., 2012). Accordingly, COVID infection can induce the persistence of elevated GMPs detected in the peripheral circulation for at least 4–12 months following exposure and leads to the production of epigenetically modified monocytes primed to secrete elevated levels of IL6 and TNF-α in response to TLR7/8 agonist R848 or IFNα (Cheong et al., 2023). Treatment with Tocilizumab—an antibody which blocks IL6 signaling—leads to significantly reduced levels of the GMPs. Moreover, in a mouse model of COVID infection, concomitant treatment with anti-IL6 significantly decreased the post-COVID burden of inflammatory myeloid cells within both the lungs and brain (Cheong et al., 2023).

Western diet is an interesting example of a stimulus that induces multiple cytokines simultaneously, including IL6, IFNγ, and IL1β. In line with the tendency for IL6 and IFNγ to reprogram HSPCs at the level of the GMP and monocyte, western diet hematopoietic reprogramming is also characterized by lasting effects on the chromatin accessibility and gene profiles of GMPs isolated even 4 weeks after return to a control diet, GMP skewing toward the monocyte lineage, and priming of both monocytes and GMPs. Although IL1β was markedly increased in this study, whether its effects on LT-HSCs resembled those of other IL1β producers, such as β-glucan, remains unclear (Christ et al., 2018).

Altogether, the data suggest that the specific flavor of hematopoietic reprogramming that is induced by any given stimulus is influenced by the cytokines which they induce (Table 1). β-Glucan and LIP converge on IL-1β and in both examples, result in LT-HSCs with persistent myeloid bias and primed EM response to secondary challenges (Li et al., 2022; Mitroulis et al., 2018). Mycobacteria, which induce the monocyte/macrophage-polarizing cytokine IFNγ when injected intravenously, train the bone marrow of mice to give rise to macrophages with increased bacterial killing capacity (Kaufmann et al., 2018). Models of human COVID infection and western diet in mice implicate IL-6 as a key cytokine in reprogramming the bone marrow at the level of monocytes and GMPs (Cheong et al., 2023; Christ et al., 2018). When taken together, existing data suggests that blocking the signaling of key cytokines involved in programming demand-adapted myeloid cell mobilization abrogates both EM and the ability of the hematopoietic system to become reprogrammed, suggesting that EM may be a necessary component of hematopoietic reprogramming. In some cases, administration of recombinant cytokines, such as IFNγ (Kain et al., 2023), has also reproduced phenotypes of hematopoietic reprogramming, thereby also suggesting sufficiency. Nonetheless, there remains a need for more systematic necessity and sufficiency experiments to further delineate the exact role of each EM-inducing cytokine in hematopoietic reprogramming.

Remodeling hematopoiesis: how TFs encode innate immune memory

The pioneer TF machinery of EM drives hematopoietic reprogramming

In addition to cytokine signaling, the induction of myeloid lineage directing TFs is an essential step in hematopoietic reprogramming. These include TFs such as PU.1 and C/EBPβ, which are notable for their ability to act as pioneer factors by binding to previously inaccessible chromatin and setting off a cascade of subsequent TF binding events. These and other pioneer factors are believed to act in concert to remodel the chromatin landscape such that the regulatory enhancers and promoters of myeloid cells remain open, while those of lymphoid lineage cells become inaccessible. Studies suggest that the same network of pioneering TFs may simultaneously imprint the physical chromatin ‘scars’ that underlie the phenotypes of hematopoietic reprogramming (Larsen et al., 2021). In taking a systematic look at the TFs implicated in hematopoietic reprogramming across studies, we find that they are almost exclusively limited to pioneer TFs with previously known roles as, or acting in concert with, lineage-determining TFs (used to denote a subtype of pioneer factor which dictates hematopoietic lineage fate).

In an attempt to gain clarity, we have organized these TFs into four groups according to the specificity with which they are activated:

  1. C/EBPβ, which is a lineage-determining TF specific to and required for EM.

  2. STATs, which are critical in cytokine-induced EM but also important in mounting inflammatory responses in many other contexts outside of hematopoiesis.

  3. The AP-1 TFs FOS and JUN, which are pioneer factors ubiquitously induced not only in EM but in response to most stimuli.

  4. Universally cooperative factors—SP/KLF, EGR, MAZ, and ZNF—which are not specifically implicated in EM but have been shown to universally engage in highly cooperative binding and in prolonging the chromatin binding time of their partner TFs (Table 2).

Table 2
Transcription factors (TFs) implicated in hematopoietic reprogramming fall into four groups.

These range from TFs that are activated and bind chromatin in specific contexts (such as emergency myelopoiesis [EM] for CEBPB) to those that engage in widespread binding and play roles in dictating the global chromatin landscape (stripe factors). Check marks on the right indicate TF involvement within each study.

TF groupTF function specificityKey membersRole in hematopoietic reprogrammingModels/Stimuli with evidence of TF group involvement
MouseHuman
β-GlucanBCGLPSWestern dietBCGCOVID
Lineage+C/EBP (β, α, ε, δ)Required for cytokine-induced EM; downstream of G-CSF through STAT3
Cytokine responsive+ +STAT1, STAT3, STAT5Mediates cytokine signaling (IL6, IFNs, G-CSF)
Ubiquitous pioneer+ + +FOS/JUN (AP-1)Induced in many stimuli; drive enhancer formation and chromatin access
Universal stripe+ + +KLFs, SPs, EGRs, ZNFsEnables through co-binding other TFs shaping epigenetic landscape

To briefly elaborate on the functions of these four groups of TFs: The first group consists of C/EBPb—a member of the C/EBP family of TFs which share a highly conserved C-terminus leucine zipper dimerization domain, adjacent to a basic DNA binding domain (bZIP) (Landschulz et al., 1989). The most important role of C/EBPb is in cytokine-induced emergency granulopoiesis (Hirai et al., 2006). It is induced in response to G-CSF (universally important in EM and often induced concomitantly with other cytokines) signaling which triggers CEBPβ activation through the activation of JAK-STAT3. C/EBPb and STAT3 bind to the promoters of various cell cycle genes, such as c-Myc, leading to GMP proliferation (Manz and Boettcher, 2014; Zhang et al., 2010). C/EBPb is also induced within HSPCs higher up the differentiation tree, such as in LT-HSCs (de Laval et al., 2020). C/EBPε has also been implicated in some studies and is involved in the terminal differentiation of granulocytes (Yamanaka et al., 1997).

STAT TFs differ from C/EBP in that they not only orchestrate EM, but also play a dual role in driving inflammatory gene expression responses in response to cytokine stimulation of terminally differentiated cells. JAK inhibition in the context of cytokine stimulation, which blocks the ability to activate STATs, prevents PU.1 binding and the deposition of histone modifications (Ostuni et al., 2013), providing further evidence that chromatin opening in response to inflammatory cytokines is instigated by STATs. In the context of EM, STATs 1, 3, and 5 are directly upregulated by G-CSF stimulation (Manz and Boettcher, 2014). IL6 also induces the expression of Stat3 (which binds to regulatory regions together with C/EBPβ as discussed above), and GM-CSF upregulates STAT5a and STAT5b. Interferons activate STAT1, which can form a complex with STAT2 and IRF9 (ISGF3), which recruits chromatin remodelers SWI/SNF, HATs, and HDACs (Philips et al., 2022).

The third group of TFs, AP-1 TFs, has established roles as pioneer factors with de novo enhancer formation in EM but is also generally responsive to most stimuli in most tissues, and thus modulates chromatin in a plethora of contexts. Thus, they can be differentiated from STATs in that they respond to a broader range of stimuli. For example, exposure of mouse fibroblasts to serum induces the AP-1 binding to several de novo sites and association of AP-1 with components of the BAF chromatin remodeling complex (Vierbuchen et al., 2017). Ap-1 TFs are often reported to bind in combination with more specific TFs. In a survey of other studies, it was noted that factors co-enriched with AP-1 are often cell-type specific, such as C/EBPb in macrophages and NFAT in T-cells (Vierbuchen et al., 2017), suggesting that AP-1 may instigate chromatin opening to invite a second layer of more specific TF binding.

The final group of TFs is connected by their universal cooperative binding patterns. A recent study (Zhao et al., 2022) called such factors ‘stripe’ factors for the prominent vertical lines they formed on colocalization heatmaps as a result of their tendency to colocalize with most other TFs. Notable members included KLF, SP, EGR, ZNF, ZBTB, MAZ, PATZ1, and RREB1—all containing C2H2 zinc finger domains. SNPs affecting universal stripe factor binding have been shown to have a larger impact on chromatin accessibility than did SNPs targeting other TFs (Zhao et al., 2022), suggesting an integral role of USFs in dictating global chromatin architecture.

Stimulus-specific combinations of TFs shape hematopoietic reprogramming

The TF groups described above are not activated in isolation but are selectively engaged depending on the stimulus and the cytokines it induces. Below, we summarize evidence from key models of hematopoietic reprogramming to illustrate how these transcriptional programs are deployed in vivo.

The stimuli that lead to hematopoietic reprogramming engage various combinations of TFs from these classes. LPS in mice, for example, leads to persistent changes in chromatin accessibility in LT-HSCs that can be transplanted into naïve hosts to confer significant protection against Pseudomonas aeruginosa infection. These sites of altered chromatin accessibility enrich significantly for CEBPB, as well as the AP-1 members FOS and JUN (de Laval et al., 2020). The hematopoietic reprogramming stimulus β-glucan (which induces persistent LT-HSC bias in an IL1β-dependent manner as described above) induces changes in the gene expression of CEBPε targets after 7 days, again implicating the CEBP family (Mitroulis et al., 2018). In mice, BCG most predominantly modulates the expression of STAT1 downstream of IFNγ to reprogram bone marrow-derived macrophages to engage in heightened mycobacterium killing capacity (Kaufmann et al., 2018). Of note, epidermal skin stem cells form epigenetic memories in response to topical imiquimod via the formation of STAT3, FOS, and JUN complexes. Once activated by imiquimod, STAT3 recruits FOS and JUN, with JUN acting as a bookmarker. On heterologous stimulation, FOS rapidly reassociates with JUN, enabling faster gene expression to secondary injury (Larsen et al., 2021). While not demonstrated to the same level of mechanistic detail within hematopoietic stem cells, the example is consistent with the observation that small combinations of TFs drawn from STATs, AP-1, CEBP, or stripe factor TFs can orchestrate epigenetic memories.

Other stimuli engage broader swaths of TFs across larger number of classes. The universal stripe factors KLF4 and EGR1, concomitant with FOS/JUN and STAT TF activities, were implicated in western diet-reprogrammed GMPs after 4 weeks (Christ et al., 2018). Human studies, primarily of COVID or BCG, of hematopoietic reprogramming have also provided invaluable insights into the range of TFs implicated in hematopoietic reprogramming. COVID, for which IL6 signaling is critical for hematopoietic reprogramming, engages TFs of all four classes. Blockage of IL6 reduces the level of persistent C/EBPε and C/EBPβ activities found in late samples (4–12 months post-COVID), implicating the CEBP TFs downstream of IL6 in hematopoietic reprogramming. COVID infection also induces persistent changes in chromatin accessibility around STAT3, increased activity of FOS and JUN in HSPCs and monocytes 4–12 months post infection, and differential expression of KLF2 in HSPC 4–12 months post infection (Cheong et al., 2023).

Similarly, human intradermal vaccination with BCG leads to persistent changes in the chromatin accessibility of CMPs and GMPs at sites that enrich for TFs of all four classes (including CEBPB, STAT1, FOS, JUN, and TF motifs in the KLF/SP and EGR families) but for which stripe factors have the strongest enrichments. Out of the 11 TF motif families enriched within CMP and GMP progenitors with concomitant evidence of altered TF activity within HSCs upstream, six, including ‘ZNF/MAZ’, ‘KLF/SP_2’, ‘EGR1/2/3/4’, ‘KLF/SP_1’, ‘KLF2/3/6’, and ‘ZNF24’, overlapped those labeled as universal stripe factors (Sun et al., 2024). Thus, it appears that in the context of subcutaneous BCG vaccination of humans, persistently altered activity of stripe factors in HSCs and downstream changes in chromatin accessibility within progenitors at sites bound by these TFs is a key feature of the resultant hematopoiesis reprogramming.

Taken together, these studies suggest that all stimuli of hematopoietic reprogramming converge on the need to engage chromatin remodeling through pioneer or cooperative TFs. While this mechanism appears conserved, the specific TFs engaged differ by stimulus, likely due to variation in the cytokine milieu. In support of this hypothesis, both western diet and COVID-19 infections implicate IL-6 as upregulated cytokines, and both engage TFs from larger numbers of classes (3–4) compared to, for example, the IL1β inducer β-glucan, or the interferon inducer BCG in mice (Table 2). Still, the current body of literature emphasizes the need for larger numbers of studies which use the same stimuli in both mice and humans and which collect the same measurements (many studies measure cytokines or TFs but not both, and some measure TF activity through gene expression while others use chromatin accessibility measurements) to decrease the number of confounding variables that complicate interpretation of the current literature.

Species may matter: TF usage in mouse vs. human hematopoietic reprogramming

While Table 2 suggests that hematopoietic reprogramming is driven by different combinations of TFs because of different stimuli, we also reasoned that species-specific factors, rather than specific stimuli, may explain some of these differences. We noticed that the studies in mice (Christ et al., 2018; de Laval et al., 2020; Kaufmann et al., 2018; Mitroulis et al., 2018) generally contained TF representatives from fewer numbers of our TF classes compared to the studies in humans (Cheong et al., 2023; Sun et al., 2024). These TF classes were, of course, artificially defined by us and not based on any formal structural or evolutionary similarity. In an attempt to more objectively compare TF usage in hematopoietic reprogramming across species, we analyzed TF motif enrichment data from three studies spanning human and mouse HSPCs, alongside TF differential expression data from HSCs in Kaufmann et al. (which did not include chromatin accessibility data) and a mouse dataset of COVID infection published alongside the human data in Cheong et al. Studies in Table 2 without published datasets on TF expression or motif enrichment were not included. For each of the five included datasets, TF motif enrichment FDR values—or differential gene expression FDR values in the case of Kaufmann et al.—were ranked from most to least significant. This ranking enabled cross-study comparisons of preferential TF usage. We then focused on a combined set of TFs implicated in the literature as important in hematopoietic reprogramming in either mouse, human, or both, as shown in Figure 1. Viewing the data in this manner, albeit limited by small sample sizes in differences in data modalities, revealed an overall consistent usage of TFs across species and conditions. Hierarchical clustering suggests that differences between studies are likely influenced not only by stimulus but also species (Figure 1), thus TF utilization in hematopoietic reprogramming may be shaped by a complex interplay of factors, including stimulus type (live vs. inert), stimulus class (virus vs. bacteria), stimulus dose, and host species (mouse vs. human). While some TFs were enriched in a study-specific manner (e.g. EGR3 being solely enriched within human HSPCs following BCG vaccination), others displayed enrichment patterns suggestive of specificity to one species or infectious stimulus. FOS was enriched within all studies involving live infectious agents, but not in LPS injection. TFs such as MAZ and SP1 were most strongly enriched in the human studies compared to mice, being among the top quarter of most enriched motifs in humans vaccinated with BCG or infected with COVID-19, but not their mouse counterparts. In support of the idea that human hematopoietic reprogramming may involve larger complexes of factors, human studies had larger numbers of total enriched TFs compared to mouse studies of the same stimulus (human/mouse: BCG = 746/635; COVID 423/378; LPS NA/97). Nonetheless, the highly congruent TF enrichments between mouse and human COVID-infected HSPCs (both published in Jeong et al.) suggest that a significant portion of species-specific differences may be confounded by differences in the stimuli typically used. Extended versions of such analyses could provide better context for understanding the roles of different enriched TFs as either species or stimulus-specific responders.

Heatmap of human and mouse studies with transcription factors (TFs) colored according to their ranked motif or expression (Bacillus Calmette–Guérin [BCG] mouse) enrichment within hematopoietic stem and progenitor cells (HSPCs) 1 or more months following training with the indicated stimulus.

Each column represents a different study. Studies are grouped by stimulus type (top gray boxes). Species (mouse or human) is indicated on the bottom x-axis. White squares indicate no significant enrichment (NE).

Selective constraint on TFs involved in trained immunity

Despite some evidence for species-specific differences in TF usage in hematopoietic reprogramming, the data in Figure 1 suggest a high degree of overall similarity between humans and mice, especially when comparing studies that used the same stimulus (e.g. COVID). This functional conservation reinforces the biological relevance of the trained immunity program across species.

A major goal in human genetics is to identify genes that are important for disease susceptibility and fitness. One useful way to assess the biological importance of a gene is to measure its level of selective constraint—i.e., how strongly natural selection acts to eliminate deleterious variants from the population (Agarwal et al., 2023; Fuller et al., 2019). Genes that are under strong purifying selection will show a marked depletion of loss-of-function (LOF) variants, indicating that disruptions to their function reduce fitness and are thus selected against. We reasoned that if the genes central to trained immunity, such as the cytokines and TFs discussed above, are important for human immune adaptation and ultimately survival, particularly in the context of infectious disease, they should exhibit strong selective constraint. To test this, we analyzed gene-level LOF constraint metrics derived from the gnomAD v2.1 dataset, which includes exome sequences from approximately 125,000 individuals spanning 19,071 protein-coding genes (Karczewski et al., 2020). Specifically, we examined shet, a measure of the fitness reduction associated with carrying one copy of an LOF variant in each gene (Zeng et al., 2024). Larger shet values indicate stronger selection against LOF variants and, by extension, greater importance to fitness; smaller values suggest either tolerance to disruption or functional redundancy.

We focused on the key effector molecules and TFs (described above in this review) as core components of the trained immunity program. When compared to the genome-wide distribution of shet values, we found that TFs involved in trained immunity show some of the highest levels of selective constraint across the human genome (Figure 2). Notably, this elevated constraint is not simply a reflection of their being TFs; even when compared specifically to all TFs in the genome, those implicated in trained immunity are significantly more constrained (Figure 2).

Conservation scores (shet) for cytokines (green) and transcription factors (TFs) (purple) implicated across mouse and human studies of hematopoietic reprogramming, compared to all other genes (gray).

The figure shows posterior distributions of shet for individual genes, ordered by mean. Lines represent 95% credible intervals, with labeled genes represented by thick black lines. Colors represent the selection regimes. The inset panel summarizes shet values for TFs involved in hematopoietic reprogramming (HR-TFs), comparing them to all other TFs and to the rest of the genome.

This suggests that these genes play essential roles in human fitness and survival. Stripe TFs such as KLF6, KLF2, EGR3, SP1, and MAZ are among the most constrained genes in the trained immunity network, and the genome overall.

These TFs are known to play critical and broadly pleiotropic roles in chromatin regulation, cell fate decisions, and stress responses, often in coordination with lineage-specific factors. For example, KLF6 is essential for hematopoietic stem cell quiescence and vascular development, and germline inactivation in mice results in embryonic lethality (Bialkowska et al., 2017). SP1 is a master regulator involved in the transcription of thousands of genes, including those required for immune responses, metabolism, and cell cycle control; mutations are associated with severe multisystem developmental syndromes. Finally, EGR3 regulates both neuronal development and T-cell activation; although rare, LOF mutations are linked to immune dysfunction and neurodevelopmental phenotypes (Li et al., 2007).

The extreme constraint on these genes underscores their nonredundant, dosage-sensitive roles, making even partial loss deleterious. While these TFs are involved in a range of biological processes beyond trained immunity, their central positioning in immune regulatory networks likely contributes to their evolutionary conservation. This supports the idea that mechanisms underpinning trained immunity—especially those related to chromatin remodeling and transcriptional reprogramminghave been subject to strong selective pressures, likely due to their impact on host survival in the face of infection.

In contrast, many effector molecules in the trained immunity network show lower levels of constraint, even though they play important roles in immune responses. For instance, IL6 and IL1β, both critical mediators of inflammatory responses and trained immunity, exhibit moderate to low shet values. This suggests that partial or even complete LOF variants in these genes are compatible with survival and may have relatively mild phenotypic consequences.

Indeed, common variants in IL6, IL1β, and related cytokine genes are found at appreciable frequencies in the general population, including missense and regulatory variants that modulate expression or function (Gong et al., 2022; Ołdakowska et al., 2022; Smith and Humphries, 2009). Some of these are associated with variation in infection susceptibility, inflammatory disease risk, or vaccine responsiveness, but they rarely cause severe monogenic disorders. For example, polymorphisms in the IL6 promoter have been associated with differences in CRP levels and outcomes in sepsis or COVID-19 but are not causative of immunodeficiency (Gong et al., 2022; Ołdakowska et al., 2022; Smith and Humphries, 2009).

This pattern is consistent with functional redundancy in cytokine networks: many cytokines share receptor subunits (e.g. IL-6 and IL-11 signaling through gp130) or activate overlapping downstream pathways such as JAK-STAT (Murakami et al., 2019). In this modular architecture, TFs act as essential convergence points, integrating signals from diverse upstream cues, while individual cytokines may be more expendable or context-specific.

Taken together, these data support a model in which trained immunity relies on a robust and evolutionarily constrained transcriptional core, built on TFs that are indispensable for orchestrating broad and durable responses. In contrast, upstream effector molecules exhibit greater flexibility, reflecting both their redundancy and the adaptive value of variation in cytokine responsiveness across diverse human environments.

Conclusion and future directions

The extension of trained immunity to the hematopoietic compartment has revealed that diverse stimuli—both inert and infectious—can durably reprogram the lineage biases, epigenetic landscapes, and transcriptional circuits of LT-HSCs. While this diversity of stimuli has posed interpretive challenges, it has also clarified common mechanisms: hematopoietic reprogramming is consistently initiated by EM, typically coordinated by cytokines such as IL-1β, IL-6, and IFN-γ acting on LT-HSCs. Downstream, the activation of pioneer and lineage-determining TFs capable of remodeling chromatin likely constitutes the next critical step in encoding immune memory.

Comparative analyses suggest that mouse models often engage fewer TFs than human studies—whether this reflects true species-specific biology, experimental tractability, or differences in stimulus remains unclear. However, many of the TFs implicated in hematopoietic reprogramming, particularly those involved in chromatin remodeling and lineage specification, are among the most evolutionarily constrained genes in the human genome. This constraint likely reflects their indispensable roles in core developmental and cellular processes, rather than any selection pressure specific to trained immunity. Instead, trained immunity may have emerged, at least in part, as a byproduct of the reuse of this deeply conserved transcriptional architecture in the context of infection and inflammation.

Looking ahead, trained immunity offers a powerful framework for exploring fundamental questions: How is epigenetic memory maintained in self-renewing cells? How do cells preserve core identity while flexibly adapting to environmental challenges? Self-sustaining TF circuits—like the Myc-Wnt/β-catenin loop in embryonic stem cells (Fagnocchi et al., 2016) or the IRF-KLF-SPI network in DC development (Lin et al., 2015)—may provide a useful blueprint. Whether similar circuits exist in reprogrammed HSPCs remains to be explored.

A key frontier lies in decoding the thresholds that separate transient adaptation from long-term memory. Are these governed by the intensity, duration, or type of stimulusor by a combinatorial logic yet to be defined? Elucidating these parameters will not only advance our understanding of hematopoietic reprogramming and trained immunity, but also inform broader questions in immunology and regenerative medicine, from T-cell fate decisions to the design of durable cellular therapies and improved patient stratification strategies.

References

Article and author information

Author details

  1. Sarah J Sun

    1. Committee on Immunology, University of Chicago, Chicago, United States
    2. Medical Scientist Training Program, University of Chicago, Chicago, United States
    Contribution
    Investigation, Writing – original draft, Writing – review and editing
    For correspondence
    sarahjiesun@gmail.com
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-5604-7324
  2. Raúl Aguirre-Gamboa

    Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, United States
    Contribution
    Visualization, Writing – review and editing
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-2505-3574
  3. Luis B Barreiro

    1. Committee on Immunology, University of Chicago, Chicago, United States
    2. Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, United States
    3. Department of Human Genetics, University of Chicago, Chicago, United States
    4. Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, United States
    5. Chan Zuckerberg Biohub Chicago Investigator, Chicago, United States
    Contribution
    Investigation, Writing – review and editing
    For correspondence
    lbarreiro@uchicago.edu
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-9456-367X

Funding

National Institute of Allergy and Infectious Diseases (AI182148)

  • Luis B Barreiro

National Institute of Allergy and Infectious Diseases (AI190301)

  • Luis B Barreiro

National Institute of General Medical Sciences (GM152227)

  • Luis B Barreiro

National Institute of General Medical Sciences (GM150375)

  • Sarah J Sun

University of Chicago (T32GM150375)

  • Sarah J Sun

University of Chicago (NIDDK P30 DK042086)

  • Luis B Barreiro

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We thank all members of the Barreiro lab for their insightful comments. This study was supported by grant R35-GM152227, R01-AI182148, and R01-AI190301 to LBB, and the UChicago DDRCC, Center for Interdisciplinary Study of Inflammatory Intestinal Disorders (C-IID) (NIDDK P30 DK042086). SS was supported by the MSTP training grant T32GM150375.

Copyright

© 2025, Sun et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 9
    views
  • 0
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Sarah J Sun
  2. Raúl Aguirre-Gamboa
  3. Luis B Barreiro
(2025)
Rewiring the bone marrow: Evolution and the transcriptional architecture of trained immunity
eLife 14:e107551.
https://doi.org/10.7554/eLife.107551

Share this article

https://doi.org/10.7554/eLife.107551