HSPCs display withinfamily homogeneity in differentiation and proliferation despite population heterogeneity
Abstract
Highthroughput singlecell methods have uncovered substantial heterogeneity in the pool of hematopoietic stem and progenitor cells (HSPCs), but how much instruction is inherited by offspring from their heterogeneous ancestors remains unanswered. Using a method that enables simultaneous determination of common ancestor, division number, and differentiation status of a large collection of single cells, our data revealed that murine cells that derived from a common ancestor had significant similarities in their division progression and differentiation outcomes. Although each family diversifies, the overall collection of cell types observed is composed of homogeneous families. Heterogeneity between families could be explained, in part, by differences in ancestral expression of cell surface markers. Our analyses demonstrate that fate decisions of cells are largely inherited from ancestor cells, indicating the importance of common ancestor effects. These results may have ramifications for bone marrow transplantation and leukemia, where substantial heterogeneity in HSPC behavior is observed.
Introduction
The hematopoietic system has long since served as a reference model for stem cell biology, with understanding garnered from the study of hematopoietic stem cells (HSCs) successfully transferred to the clinic. In order to maintain blood cell production, rare selfrenewing HSCs produce differentiated cells called multipotent progenitors (MPPs), which proliferate and differentiate through an amplifying cascade of increasingly committed progenitors, ultimately resulting in all mature blood cell types. Underpinning this traditional model is the assumption that the HSC pool is maintained through a process of asymmetric division that results in one HSC and one MPP, while MPPs form a transient cell type that cannot persist indefinitely and must ultimately differentiate.
Recent studies have challenged this theory in multiple distinct directions. It is wellestablished that HSCs can sequentially reconstitute the blood system of several hosts (Ross et al., 1982), leading to the inference that HSCs must be able to maintain themselves (Morrison and Kimble, 2006). When observed with timelapse imaging, selfrenewal has been seen to occur through both symmetric and asymmetric cell division (Wu et al., 2007; Brummendorf et al., 1998; Ema et al., 2000; Punzel et al., 2003), which can be influenced by extrinsic signals (Ema et al., 2000; Punzel et al., 2003). Steadystate in situ lineagetracing studies have also suggested that MPPs are capable of selfrenewal (Sun et al., 2014; Busch et al., 2015). In addition, HSCs have been shown to differentiate without division into megakaryocytes in vitro (Roch et al., 2015), and common myeloid, megakaryocyte, and erythroid progenitors in vivo (Grinenko et al., 2018). Together, these findings not only questioned the necessity for HSCs to undergo asymmetric division, but they also queried the explicit link between division and differentiation. Evidence for the multipotency of HSCs and MPPs has historically derived from in vitro colonies assays and transplantation experiments (Morrison and Weissman, 1994; Osawa et al., 1996; Nakahata and Ogawa, 1982; Christensen and Weissman, 2001). Recent singlecell transplantation and cellular barcoding experiments have revealed that only a few HSCs reconstitute all of the blood lineages, with the rest being either restricted in the number of lineages they produce (Yamamoto et al., 2013; SanjuanPla et al., 2013; RodriguezFraticelli et al., 2018; Carrelha et al., 2018) or having a bias or imbalance in the proportion of cell types they create (Dykstra et al., 2007; MüllerSieburg et al., 2004). As examples of lineage restriction, it has been reported that some HSCs produce only megakaryocytes (RodriguezFraticelli et al., 2018; Carrelha et al., 2018), while others produce only myeloid cells, megakaryocytes, and erythrocytes (Yamamoto et al., 2013). Furthermore, singlecell transplantations have revealed that HSCs are heterogeneous in differentiation and proliferative output (Benz et al., 2012; MüllerSieburg et al., 2002; Sieburg et al., 2011), and transplantation experiments using populations of HSCs labeled with different fluorescent proteins have suggested that this heterogeneity might be epigenetically determined (Vwc et al., 2017). Taken together, HSCs have been shown to be a heterogeneous population, where each one of them may be committed to the production of only a few lineages, possibly through lineage priming or externally through instruction from a niche. Similarly, transplanted barcoded MPPs have been reported to produce heterogeneous patterns of restricted cell types (Naik et al., 2013), suggesting that lineage restriction may occur early in the hematopoietic tree, in the pool of HSCs and MPPs (Perié and Duffy, 2016).
Altogether, it is presently unclear how symmetric and asymmetric division combines with early lineage commitment to generate downstream diversity, and a fundamental question is how much instruction is inherited by offspring from an ancestral HSC or MPP. That matter has not been addressed previously due to technical limitations. Tackling it requires an experimental system that enables the simultaneous identification of cells that are descendent from a common ancestor, the number of divisions that has led to each of them, and their differentiation status. Towards that end, we developed a divisiondye multiplex system (Horton et al., 2018) for the study of hematopoietic system.
The data from our study revealed that cells that derived from a common ancestor were highly concordant in their division progression and similar in their differentiation pattern. This similarity is primarily propagated through divisions resulting in siblings of the same cell type. These data establish that early lineage commitment can be inherited from individual HSCs and MPPs, and that the resulting diversity of lineages is produced by a heterogeneous collection of cell families that are individually homogeneous. Our data suggests that common ancestor effects are significant and call for a revision of the assumption of independent fate decisions by cells along the hematopoietic tree.
Results
Highthroughput simultaneous tracking of the common ancestor, number of divisions, and differentiation status of HSPCs
Defining a family as all descendants from an individual marked ancestor cell, we developed a new highthroughput method that simultaneously determines for each cell’s family membership, generation (i.e., number of cell divisions), and phenotype (Horton et al., 2018Figure 1) in hematopoietic stem cells and progenitors (HSPCs). We focused our investigation on familial division and differentiation in early hematopoietic differentiation to elucidate how symmetric and asymmetric fates combine with early lineage commitment to generate downstream diversity and how much instruction is inherited by offspring from single ancestral HSPC.
To this end, we isolated bone marrow (BM) cells, labeled them with four distinguishable combinations of 5(and 6)carboxyfluorescein diacetate succinimidyl ester (CFSE) and CellTrace Violet (CTV), and used fluorescent antibody staining of cell surface markers to determine three HSPC populations, cKit^{+}Sca1^{+}CD150^{+}Flt3^{} (SLAMHSC), cKit^{+}Sca1^{+}CD150^{}Flt3^{} (STHSC), and cKit^{+}Sca1^{+}CD150^{}Flt3^{+} (MPP) (Figure 1). Wells in a 96well plate were then seeded with four cells of a single ancestral type (SLAMHSC, STHSC, or MPP), one from each of the four CFSE/CTV combinations to increase the throughput of the assay, and incubated in one of two classic cytokine cocktails (SCF and TPO, ±IL3 and IL6). At 24 or 48 hr, cells were harvested from each well and stained with fluorescent antibodies to determine their phenotypic cell type based on the expression of CD150, Flt3, Sca1, cKit, and CD16/32 (Figure 1). By examining each cell’s CFSE and CTV profile, its ancestral cell and generation number was determined (Figure 1 and Materials and methods). By index sorting labeled cells, we could relate downstream familial fate and division to ancestral cell surface expression.
The method allows easily to assess a large collection of single ancestor cells. For each ancestor type, 360 initial cells were sorted for analysis at 24 hr and 240 initial cells for analysis at 48 hr (Figure 2A). From the 600 seeded SLAMHSCs, we recovered 358 families (71%) constituting 648 cells, while 343 STHSC families (69%) were recovered with 592 cells, and 246 MPP families (49%) with 362 cells (Figure 2A). Over all conditions, 27 families (2.8% of recovered families) had cell numbers that could not have originated from a single ancestor, and so were excluded from analysis, illustrating the fidelity of the singlecell sorting.
Characterization of differentiation outcomes at the population level
At the population level, some offspring underwent no differentiation from their ancestor type while others differentiated. In both culture conditions and for each ancestor type, we obtained a diversity of myeloid cell types ranging from the initial ancestor to cKit^{} differentiated cells, including cKit^{+}Sca1^{}CD16/32^{+} (GMP), cKit^{+}Sca1^{}CD150^{}CD16/32^{} (CMP/MEP), cKit^{}CD16/32^{+} (late myeloid progenitor, late MP), and cKit^{}CD16/32 cell types (Figure 2A). We also detected cKit^{+}Sca1^{}CD16/32CD150^{+} (PreMegE) cells, previously described as megakaryocyte and erythroid progenitors (Pronk et al., 2007). As the phenotypic definitions we used after culture were originally defined on freshly isolated progenitors, we functionally tested the differentiation of the progenitors as defined phenotypically after 48 hr of culture with SCF, TPO, IL3, and IL6 using semisolid cultures (Figure 2B, C and Figure 2—figure supplement 1) and observed a similar differentiation outcome as previously published (Pronk et al., 2007).
Although all hematopoietic cells have been reported to go through a Flt3 expressing stage (Boyer et al., 2011), we found that no offspring of SLAMHSCs, and those of very few STHSCs, differentiated into Flt3 expressing MPPs in our culture conditions (Figure 2D). Some STHSC and MPP acquired CD150 expression after culture, making them resemble SLAMHSCs, but this was not further investigated. This CD150 expression cannot be due to residual fluorescence from antibodies used for cell sorting as no fluorescence was measured for any of the markers in cells kept for 24 hr in culture after sorting without further antibodies staining (Figure 2—figure supplement 2). The addition of IL3 and IL6 had no impact on the pattern of cell types produced by SLAMHSCs (Figure 2D). It did, however, change the pattern of cell types produced by STHSCs and MPPs in a statistically significant way as determined by a permutation test (Materials and methods), with an increased number of early progenitors as opposed to more differentiated progenitors as previously shown (Lui et al., 2014).
Heterogeneity between HSPC families in division and differentiation
At the level of individual families, we observed substantial heterogeneity in division history (Figure 2E) from families that did not proliferate to those with cells that had undergone six divisions. Consistent with earlier observations (Roch et al., 2017), after 24 hr most cells either remained undivided or had divided once, with a few cells having undergone two or more divisions (Figure 2A). At 48 hr, over 90% of the cells had undergone at least one division. For all three sorted ancestor cell types, we found that addition of IL3 and IL6 led to a statistically significant increase in proliferation (Figure 2E), as previously described (Domen and Weissman, 2000; BordeauxRego et al., 2010). Exploring the relationship between division and differentiation, in the culture without IL3 and IL6, the proportion of undivided ancestors that differentiated were 36.5% from SLAMHSCs, 61.1% from STHSCs, and 35.6% from MPPs (Figure 2F), which was in agreement with previous reports (Roch et al., 2015; Grinenko et al., 2018). The addition of IL3 and IL6 did not drastically change those values. SLAMHSCs preferentially differentiated without dividing into PreMegEs, as reported previously (Grinenko et al., 2018). On comparing the surface marker expression of differentiated and notdifferentiated nondivided cells (Figure 2—figure supplement 3), Sca1^{high} SLAMHSCs were more likely not to differentiate, in agreement with a previous report (Schulte et al., 2015). The addition of IL3 and IL6 only significantly impacted the differentiation pattern of the progeny of MPPs. These results show that families are heterogeneous in their division pattern, and that a nonnegligible fraction of ancestors differentiates without dividing.
Both symmetric and asymmetric fate occurred within families after the first division
Our experimental system can capture a large number of siblings after a single division, enabling the quantification of symmetric versus asymmetric fates. We defined four distinct types of symmetric or asymmetric fates depending on whether the offspring included a differentiated cell (Figure 2G). A symmetric undifferentiated fate produces two cells of the ancestor type. An asymmetric undifferentiated fate, which would be the classically defined asymmetric fate in the stem cell community, produces one cell of the ancestor type and one differentiated cell. Similarly, a symmetric differentiated fate produces two cells of the same differentiated type, and an asymmetric differentiated fate produces two cells of the distinct differentiated type. Note that an asymmetric fate cannot be distinguished from a symmetric fate followed by differentiation without division of one of the daughter cells. Pooling data over ancestor types, 70.7% of the cells had a symmetric fate after their first division, with the fate of MPPs being mostly symmetric undifferentiated (51.4%), and of STHSC being mostly symmetric differentiated (59.5%). SLAMHSCs selfrenewed with symmetric undifferentiated fates (32.1%), but also with asymmetric undifferentiated fates (10.7%). Asymmetric fates occurred for 28.6% of SLAMHSCs, 28.6% of STHSCs, and 31.4% of MPPs, consistent with previous reports (Suda et al., 1984). No statistical difference was found between ancestors cultured with or without IL3 and IL6. The pattern of cell types produced after one division (Figure 2 and Figure 2—figure supplement 3B) was similar to the pattern including all divisions (Figure 2D), suggesting that the diversity of cell types can be produced by a heterogeneous collection of cell families through symmetric fates.
HSPC family members were concordant in division
To investigate familial effects on division progression, we examined the generation numbers of cells derived from single ancestors. We found that families are highly concordant, with 81% of the 223 families that divided more than once having all of their cells in the same generation (range = 0, Figure 2H), and only four of those families (1.8%) containing cells that were more than one generation apart (range >1, Figure 2H). As not all cells are necessarily recovered from wells and sampling effects could potentially make families look more concordant, a mathematical model that accounts for that sampling was fitted to the 48 hr data to estimate the correlation in division progression decisions among cells within a family (Materials and methods). High correlation coefficients (70–90%) (Figure 2H) resulted in the best fit to the measured familial ranges, with the exception of MPPs cultured in medium without IL3 and IL6 for which no reliable estimate could be made. For guidance, the pattern of range values from the model for different correlation coefficients is also illustrated in Figure 2H. Thus, this analysis establishes that division progression is highly concordant within SLAMHSC, STHSC, and MPP families, while being heterogeneous between them.
Differentiation occurred though a diversity of paths and progressed with division
Although differentiation without division was observed, in general differentiation progressed in tandem with division (Figure 3A), as published previously (Upadhaya et al., 2018). To visualize changes in cell surface marker expression, we used the Uniform Manifold Approximation and Projection (UMAP) algorithm (McInnes and Healy, 2018) on the combined surface marker expressions of all cells obtained at both 24 and 48 hr (Figure 3B and Figure 3—figure supplement 1). When we mapped cell types determined by traditional gating onto the UMAP (Figure 3C), a smooth transition was observed from SLAMHSCs at the bottom, with further differentiated cells towards the top, indicating a gradual transition from one cell type to another. GMPs and cKit^{}Sca1^{}CD16/32^{+} (myeloid progenitors [MPs]) appear on the top left, and CMP/MEP and PreMegE on the top right. More numerous GMPs, MPs, and CMP/MEPs were seen at the top of the UMAP at 48 hr than 24 hr, suggesting that it takes between 24 and 48 hr to fully differentiate into Sca1^{} progenitors. In addition, all three ancestral cell types remain present at 48 hr, demonstrating, in particular, that HSCs can remain in an undifferentiated state for the duration of the experiment, even if their offspring experience three rounds of division. On plotting the generation numbers of offspring from each ancestor cell type on the UMAP (Figure 3D and Figure 3—figure supplement 2), SLAMHSCs appeared to be primed towards the production of CMP/MEPs and PreMegEs while still generating some STHSCs, whereas MPPs were more primed towards GMPs, and STHSCs showed a more even distribution between the two lineages. Differentiation without proliferation appeared as dark red dots outside of the regions of the sorted ancestor cells, and selfrenewal divisions as red, orange, and blue dots in the region of ancestor cells. PreMegEs were observed to be generated without division, as well as in 1–3 divisions from SLAMHSCs.
HSPC family members displayed similar differentiation outcome
Descendants from a common ancestor were not only highly concordant in their generation numbers, but they also exhibited significant similarity in differentiation outcome. At 24 hr, most families were composed of only one cell type (Figure 4A), but at 48 hr more families produced several cell types (Figure 4B), indicating that downstream asymmetries in the fate occur after a largely symmetric first division (Figure 2G). Permutation tests on phenotypically defined cell types revealed that families exhibited significantly more similarity than would be expected if there was no family component, both at 24 hr (Figure 4A) and at 48 hr (Figure 4B). This withinfamily similarity in fate is visible by the colocalization of cells from the same family on the UMAP (Figure 4C). Thus, SLAMHSC, STHSC, and MPP families are highly concordant in division and share similar differentiation outcomes in vitro, while populationlevel diversity in proliferation and cell types arises from heterogeneity across families.

Figure 4—source data 1
 https://cdn.elifesciences.org/articles/60624/elife60624fig4data1v2.xlsx

Figure 4—source data 2
 https://cdn.elifesciences.org/articles/60624/elife60624fig4data2v2.xlsx

Figure 4—source data 3
 https://cdn.elifesciences.org/articles/60624/elife60624fig4data3v2.xlsx

Figure 4—source data 4
 https://cdn.elifesciences.org/articles/60624/elife60624fig4data4v2.xlsx
The withinfamily homogeneity in division and differentiation could be intrinsically present in ancestor cells or extrinsically instructed by cytokines in the cocktail. Comparison of the median fluorescent intensities of markers from ancestors during sort with those obtained from daughters at each of the two time points (Figure 4—figure supplement 1A) revealed clear correlation between the two, supporting the hypothesis that ancestor surface expression markers were instructive in the withinfamily homogeneity in division progression and differentiation. To further investigate that hypothesis, we first explored the relationship between cell surface expression on the ancestor and division progression. Rank ordering ancestors from the least to greatest expression level for a given marker (Figure 4D, Figure 4—figure supplement 1B), the cumulative sum of the maximum division of their offspring would be expected to fall near the diagonal if there were no relationships between an ancestral expression level and division progression. If there was a negative relationship, where low expression of a given marker on the ancestor corresponded to more division progression, the cumulative sum of the maximum division would be expected to initially overshoot above the diagonal. The contrary would happen if there was a positive relationship. The statistical significance of divergence from the diagonal was tested using Jonckheere's trend test (Figure 4—source data 1). Across both cocktails and time points, only cell surface markers on ancestral SLAMHSCs were consistently instructive for division progression. CD48 correlated positively, with its strongest effect at 24 hr, and Sca1 correlated negatively, while at 48 hr cKit correlated positively (Figure 4D, Figure 4—figure supplement 1B, and Figure 4—source data 1). Notably, for both STHSCs and MPPs, even though the family data clearly indicate that there is a familial component to division progression (Figure 2H), none of the phenotypic markers exhibited strong correlation (Figure 4—figure supplement 1B), indicating the need to identify other markers.
Ancestral phenotype correlated with familial differentiation outcome
We then explored the relationship between fluorescence intensity of ancestral marker expression and familial differentiation (Figure 4E, Figure 4—figure supplement 2, and Figure 4—source data 2 and 3; for a summary of findings in table format, see Figure 4—source data 4). For SLAMHSCs, at 24 and 48 hr in both cocktails, Sca1 expression provided a strong positive correlation to selfrenewal and a negative one to production of PreMegE. At 48 hr, cKit presented the inverse dependency to Sca1. At 24 and 48 hr, CD48 correlated negatively to selfrenewal and positively to production of PreMegE when IL3 and IL6 are added. As cocktail composition did not have a major impact on the relationship between familial fate and ancestral expression, these results were suggestive that cKit and Sca1 expression levels of SLAMHSCs act as intrinsic markers for both familial progression and differentiation with high Sca1 expression and low cKit expression leading to less division (Wilson et al., 2015; Grinenko et al., 2014; Morcos et al., 2017) and less differentiation (Shin et al., 2014), and potentially resulting in better engraftment (Wilson et al., 2015; Grinenko et al., 2014; Shin et al., 2014). While low ancestral CD48 expression level has been reported to result in less division (Pietras et al., 2015; Akinduro et al., 2018), our data indicates its relationship to differentiation is dependent on extrinsic signals.
For STHSCs and MPPs, we found little evidence of correlation of ancestral expression to division progression or selfrenewal, but the same was not true of differentiation. For STHSCs, the ancestral level of CD48 and Sca1 consistently correlated negatively and positively, respectively, with dedifferentiation to SLAMHSC in the cocktail with IL3 and IL6 at both time points (Figures 4E and Figure 4—figure supplement 2). Differentiation to GMP, which occurred only in 48 hr data, correlated positively and negatively with the ancestral level of CD48 and Sca1, respectively (Figure 4E; Morcos et al., 2017). Therefore, differentiation to GMP from STHSC was dependent on the parental level of CD48 and Sca1, whereas the dedifferentiation to SLAMHSC is dependent on both extrinsic factors (IL3 and IL6) and the intrinsic ancestral level of CD48 and Sca1. The differentiation from MPPs to GMPs that was observed to occur by 48 hr correlated negatively with Sca1 ancestral expression (Morcos et al., 2017) in both cocktails. It also negatively correlated with Flt3 ancestral expression, but only in the cocktail without IL3 and IL6 (Figure 4E). In the presence of IL3 and IL6, instead, differentiation from MPP to GMP positively correlated with cKit ancestral expression. Differentiation from MPP to CMP/MEP occurred only at 48 hr in the cocktail without IL3 and IL6, and then correlated positively with cKit (Figure 4E). Thus, differentiation to GMP from MPP is dependent on the intrinsic ancestral level of Sca1, whereas the differentiation to CMP/MEP is dependent on both extrinsic factors (IL3 and IL6) and the intrinsic ancestral level of cKit. Overall, the concordance in division and similarity in fate within families is partially explained by the surface expression marker used to phenotype ancestors, but both intrinsic and extrinsic factors act to direct familial fate.
Discussion
We developed a highthroughput method that enables simultaneously determination of common ancestor, generation, and differentiation status of a large collection of single cells. Its use with HSPCs revealed that despite substantial populationlevel heterogeneity amongst offspring cells derived from a single ancestor are highly concordant in their division progression and exhibit familial effects on differentiation. The restriction in differentiated cell types within each family is propagated primarily through symmetric first divisions. Although each family is composed of several cell types, the overall collection of cell types observed in a population is composed of homogeneous families from heterogeneous ancestors. This finding opens new avenues and challenges for the hematopoietic field. The generation of a diversity of cell types is presently assumed to result from a diversification within every family, and methods for inferring differentiation trajectories using singlecell RNA sequencing data from snapshot data assume that cells all behave independently (Trapnell et al., 2014; Bendall et al., 2014). Consistent with previous observations of early lineage priming (MüllerSieburg et al., 2002; Perié and Duffy, 2016; Paul et al., 2016; Hoppe et al., 2016), our findings establish that familial dependencies that are currently unmeasured exist within the population and call for a revision of the assumption of independent fate decision by cells along the hematopoietic tree. Ancestral cell surface expression of markers used for phenotyping serves as correlates that partially predict some of these familial properties, but, in particular, a correlate that explains the highly heritable division progression of STHSC and MPP families is not contained within them. It is also the case that extrinsic properties such as cytokine signaling can play an instructive role, altering and reshaping the observed familial effects.
As HSPCs are cultured before BM transplantation in gene therapy, our results indicate that the broad range of engraftment and proliferation capacities of HSPCs could be consequences of the heterogeneity in their engrafted families. That suggests that altered culture conditions might reduce or enhance heterogeneity between families and possibly improve transplantation outcomes if this leads to more selfrenewal divisions. Indeed, changing the composition of the population of committed HSPC might be a mechanism to directly alter the balance of lineage production, with therapeutic applications that could benefit the treatment of leukemia and genetic immune disorders.
Materials and methods
Mice and cell isolation
Request a detailed protocolAll the experimental procedures were approved by the local ethics committee (Comité d’Ethique en expérimentation animale de l’Institut Curie) under approval number DAP 2016 006. BM cells were obtained from wildtype C57BL/6 of 8–16 weeks of age by bone flushing of femur tibia and iliac crest. BM cells were MACS enriched for cKit^{+} cells using CD117 MicroBeads Ultrapure (Miltenyi Biotec cat #130091224) according to the manufacturer’s protocol.
Division tracking and surface marker labeling of HSPC cKitenriched BM cells were stained with CD135 (Flt3) PE (eBiosciences 12135182), Sca1 PECF594 (BD Biosciences, 562730), CD117 (cKit) APC (Biolegend 105812), CD150 (SLAM) PC7 (Biolegend 115914), and CD48 APCCy7 (Biolegend 103432) in RPMI1640 supplemented with 10% FCS. Subsequently, cells were stained in PBS with either 2.5 µM CellTrace CFSE (ThermoFisher Scientific C34554), 2.5 µM CTV (ThermoFisher Scientific C34557), and 2.5 µM CFSE together with 1.25 µM CTV or 2.5 µM CTV together with 1.25 µM CFSE (see Figure 1A) as adapted from.
Single cKit^{+}Sca1^{+}CD150^{+}Flt3^{} (SLAMHSC), cKit^{+}Sca1^{+}CD150^{}Flt3^{} (STHSC), and cKit^{+}Sca1^{+}CD150^{}/Flt3^{+} (MPP) were sorted directly into Ubottom 96well plates containing cell culture media using an Aria III cell sorter (BD Biosciences). For each cell type, we sorted four single cells, one for each of the CellTrace stain combinations, into each well. Sorting four ancestor cells per well is a critical step in the method to ensure that at time of analysis there are enough cells in the well, which could not be obtained when sorting one ancestor cell per well. In total, 30 wells (120 single cells) were sorted per cell type per plate, with three replicates for analysis at 24 hr and two replicates for analysis at 48 hr. In addition, we sorted 100 cells of each cell type into one well for both culture conditions in order to collect enough events for reliable gate definition for cell type and generation assignment. During the sort of single cells, fluorescence intensities of each surface marker were recorded using the index sorting function.
In vitro cell culture
Request a detailed protocolCells were cultured at 37°C under 5% CO_{2} in 100 µl of StemSpan serumfree expansion medium (Stemcell Technologies 9650) supplemented with 50 ng/ml murine recombinant thrombopoietin (TPO, SigmaAldrich SRP323610UG) and 100 ng/ml stem cell factor (SCF) or 50 ng/ml TPO, 100 ng/ml SCF, 20 ng/ml IL3, and 100 ng/ml IL6 (Ema et al., 2000; Roch et al., 2017).
Division and expression marker analysis of cell progeny
Request a detailed protocolAfter 24 or 48 hr of incubation, cells in each well were stained as for sorting except for the use of CD48 BUV395 (BD Biosciences 740236), Sca1 APCCy7 (Biolegend 108125), and CD16/32 BV711 (BD Biosciences 101337). Cells from each well were analyzed at 4°C using a ZE5 Flow cytometer (BioRad) with a recovery estimate of circa 70% per well (beadsbased estimate, data not shown).
Cell type and generation assignment
Request a detailed protocolFor data analysis of FACS data, we pooled all the data from a single experiment using the concatenate function in FlowJo (FlowJo, LLC version 10.4.2). For cell type assignment, gates were set on concatenated data of both singlecell and bulk sorted samples and then applied to the singlecell data (Figure 1). Cells were separated from debris by their forward and size scatter (FSC/SSC) profile and assigned to a cell type (see Figure 1—source data 1). The generation (i.e., the number of divisions since labeling) of cells was determined on histograms of CellTrace dye fluorescence in FlowJo. For cells stained with both CFSE and CTV, we rotated the CTV/CFSE coordinates, on a logarithmic scale, by 45° degrees anticlockwise so that division dilution proceeded in parallel to the horizontal. That is, with $x$ and $y$ denoting the coordinates of CTV and CFSE levels, the histogram was calculated over a new xaxis coordinate
Generation gating was then determined based on the florescence histogram on the new x′axis on the merged data of wells from the same experiment.
Data visualization by UMAP
Request a detailed protocolUMAP (McInnes and Healy, 2018) was performed on arcsinh(x/100) transformed fluorescence intensity values of surface expression markers from all experiments using the R implementation in the UMAP package (version 0.2.0.0) with default parameters. The UMAP output was visualized using the ggplot two package (version 3.0.0) in R (version 3.4.3).
Progenitor assays in semisolid cultures
Request a detailed protocolSLAMHSC (150–200 cells), MPP (700–1500 cells), CMP/MEP (1000 cells), GMP (300–1000), PreMegE (1000 cells), and late MP (5000 cells) were plated in duplicate or triplicate in methylcellulose MethoCult 32/34 (Stemcell Technologies) with 10 ng/ml TPO (a generous gift from Kirin, Tokyo, Japan), 1 U/ml EPO (PreproTech), 10 ng/ml IL3 (Miltenyi Biotec), 10 ng/ml IL6 (Miltenyi Biotec), 100 ng/ml SCF (PreproTech), and 20 ng/ml GCSF (Miltenyi Biotec). Colonies derived from erythroid progenitors (colony forming uniterythroid [CFUE]) were counted after 2 days, but no CFUE was detected in any of the cell populations tested. Colonies derived from erythroid progenitors (burst forming uniterythroid [BFUE]), granulomonocytic (colony forming unitgranulocyte macrophage [CFUGM]), and multilineage colonies (mixed) progenitors were counted after 9 days. For megakaryocytic progenitor (CFUMK) assay, SLAMHSC (150–200 cells), MPP (2000 cells), CMP/MEP (2000 cells), GMP (2000), PreMegE (2000 cells), and late MP (5000 cells) were plated in triplicate in serumfree fibrin clot assays with SCF, IL6, and TPO. MKs and CFUMKs were evaluated at day 7 by acetylcholinesterase staining.
Confidence intervals
Request a detailed protocolThe confidence intervals at 95% level shown in Figures 2B–F and 3D were calculated via basic bootstrap (Davison and Hinkley, 1997) with 250,000 bootstrap datasets. Following this procedure, each bootstrap dataset is constructed by sampling with replacement as many cellular families (Figures 2B–H and 3D) as were in the original data. The distribution of the statistics, each calculated from one bootstrap dataset, then provided a reference from which the confidence interval was derived. Formally, given the statistic $\widehat{\theta}$ calculated from the original data, and ${\theta}_{\left(0.025\right)}^{*}$ and ${\theta}_{\left(0.975\right)}^{*}$ the 0.025 and 0.975 percentiles, respectively, derived from the bootstrapped distribution, the confidence interval of $\widehat{\theta}$ at 95% level was calculated as follows:
Statistical testing framework
Request a detailed protocolTo perform the statistical analysis, we adapted the permutation test (Lehmann and Romano, 2006) framework proposed in Horton et al., 2018. This framework was preferred over classical statistical tests as their assumptions were violated by the presence of familial dependencies in the data.
The objective of this framework was to challenge the hypothesis of independence between one or more variables in the data. For example, to test if the differentiation pattern was changed by culture conditions in Figure 2D, we challenged the null hypothesis that differentiation pattern per ancestor type (e.g., SLAMHSC) was independent of culture conditions. If that null hypothesis held true, then the pattern of differentiation would not statistically change on swapping families between the culture conditions. Thus, the first step in the procedure consists in computing a statistic for the measured data that captures a key characteristic related to the variables to be tested. In this example, we chose the statistic to be the Gtest statistic (or Gvalue) for contingency tables (Lehmann and Romano, 2006); therefore, the differentiation pattern data was transformed into the cellular frequencies from each cell type (columns) for each culture condition (rows). The second step is to perform randomization of the data, the permutation, that will be compared to the measured data. Each randomly selected permutation captures how the data would look if the differentiation pattern and the culture condition were independently assigned. Indeed, if these two variables were independent, we could shuffle cellular families between culture conditions and the composition of the resulting permuted dataset would be statistically similar to the original measured data. If one shuffled cells instead of families, then any familial dependence of cells would break down and so interfere with the testing of the independency between the differentiation pattern and the culture condition. The ability to manage familial dependencies is the reason why this statistical framework is well suited to these data. In Figure 2D, cellular families derived from the same ancestor type were permuted between the two culture conditions 250,000 times, and the Gvalue was then computed for each permuted dataset. Finally, the proportion of the Gvalues of the permuted datasets that were as, or more, extreme than the Gvalue from the original dataset determined the pvalue of the hypothesis test. This in turn indicated whether the differentiation pattern significantly varied with the culture condition. In general, for each test performed in this paper, a test statistic and data permutation class must be defined to characterize the hypothesis to be challenged and to compute the pvalue. Below, a more formal explanation is provided, followed by a paragraph with a description of the statistics and the permutation strategies specifically used throughout this work.
In more mathematical terms, a typical example of permutation testing proceeds in the following manner. A null hypothesis concerning the independence of the data, $D$, on one or more variables is first determined. Then, a collection, $Q$, of permutations of the data is identified such that, under the null hypothesis, the permuted data ${D}^{\pi}$, for any $\pi \in Q$, is equal in distribution to $D.$ In this way, for any a realvalued statistic $T$ of the data, $T\left(D\right)$ and ${T(D}^{\pi})$ are equal in distribution given the data $D$. Therefore, the distribution of $T\left(D\right)$, and its associated pvalues, can be approximated by the distribution of $\left\{T({D}^{{\pi}_{1}}),\dots ,T({D}^{{\pi}_{B}})\right\}$, which is obtained by sampling a large number $B$ of permutations ${\pi}_{i}$ from $Q$, for $i=1,\dots ,B$. Of note, the statistic $T$ should be chosen to present good sensitivity with respect to the departure of the data from the null hypothesis, a property often exhibited by classical statistics.
To further clarify the framework described above, we make explicit how the null hypothesis of independence between culture condition and differentiation pattern was challenged in Figure 2D. Under this hypothesis, the frequencies of cell types from the two culture conditions each from a different cell culture were equal in distribution. In particular, under the null hypothesis the distribution of the data in each culture condition would not change upon the shuffling of cellular families between culture conditions, which identifies a suitable set of permutations $Q$. As the variables to be tested were either discrete or categorical, the independence of celltype frequency from the culture condition was tested selecting $T$ to be the Gtest statistic for contingency tables (Lehmann and Romano, 2006). Following this rationale, the same choice for the set of permutations, $Q$, and the test statistics, $T$, was made to challenge the hypotheses of independence between culture condition and the other discrete variables: maximum division number per family (Figure 2E), differentiation pattern without division (Figure 2F), and pattern of first division (Figure 2G).
In Figure 4B, we sought to challenge the null hypothesis that differentiation diversity among cells from the same ancestor type was independent of familial membership, effectively testing whether a cell’s familial membership was independent of its type. Under this null hypothesis, the naïve assumption would be to define $Q$ as the set of permutations that swap cells between or within families, but, as cell type appeared to correlate with cell generation (Figure 3D), permuting cells with different division number would return a dataset ${D}^{\pi}$ that is not equal in distribution to $D$, the original. Leveraging the flexibility of the testing framework, it sufficed to instead restrict the set $Q$ to be permutations that leave the generations of cells unaltered, effectively solely swapping cells (between or within families) having the same division number. For this test, $T$ was set as the average number of cell types per family since this statistic is expected to decrease under the alternative hypothesis that cells with a common ancestor diversify into a smaller collection of cell types.
Finally, we tested whether the ancestor’s expression levels were independent of an ordinal variable of its offspring: division (Figure 4D) and differentiation pattern (Figure 4E). Under each null hypothesis, $Q$ was defined as the set of permutations of families amongst ancestors, which embodies the assumption that a family is assigned independently at random to an ancestor. To assess such null hypothesis when compared against the alternative that the families ranked by their ancestors’ expression levels established a trend (either increasing or decreasing) in the other familial variable, Jonckheere's trend test was chosen as the test statistic $T$.
Statistical testing formulae
Request a detailed protocolTo challenge the null hypotheses that differentiation was independent of the culture condition, using the data underlying Figure 2D we compared the population proportion per cell type. For notational purposes, the data were represented as a sequence $D={\left({\tau}_{i},{c}_{i},s\left({c}_{i}\right)\right)}_{i=1}^{N}$ of $N$ cells, where the $i$th cell was identified by cell type ${\tau}_{i}$, family ${c}_{i}$, and culture condition of the family $s\left({c}_{i}\right)$. To assess the independence of cell types $J=\{{\tau}_{i},i=1,\dots ,N\}$ from partition labels $l\in \{1,\dots ,L\}$ (relative to culture condition), the statistic $T$ of the data $D$ was defined as the loglikelihood statistic of the Gtest for the contingency table $O$, such that ${O}_{jl}={\sum}_{i=1}^{N}\chi \left({\tau}_{i}=j,s\left({c}_{i}\right)=l\right)$ with $\chi \left(A\right)=1$ if the event $A$ holds true and 0 otherwise. The Gtest statistic is classically used for the testing of independence between two sets of categories ($J$ and $\{1,\dots ,L\}$) partitioning the data counts. Therefore,
where ${E}_{jl}=\left({\sum}_{i\in J}{O}_{il}\right)\left({\sum}_{i=1}^{L}{O}_{ji}\right)$.
Under the null hypothesis that differentiation was not impacted by culture condition, $D$ is equally likely as a dataset ${D}^{\pi}={\left({\tau}_{i},\pi \left({c}_{i}\right),s\left(\pi {(c}_{i}\right))\right)}_{i=1}^{N}$ transformed by the action of any permutation $\pi \in Q$ of the set of family labels $\{{c}_{i},i=1,\dots ,N\}$. As a consequence, using Monte Carlo approximation we estimated the pvalue for the righttailed test as
where $B=\mathrm{250,000}$ and ${\pi}_{1},\dots ,{\pi}_{B}$ were uniformly and independently sampled from $Q$.
To challenge the null hypotheses that familial division was independent of the culture condition, for the data underlying Figure 2E we compared the distribution of the maximum generation reached by each family. For these procedures, it sufficed to follow the same rationale as for the tests related to Figure 2D, but for the dataset $D={\left({\tau}_{i},{c}_{i},s\left({c}_{i}\right)\right)}_{i=1}^{N}$ of $N$ families, where ${\tau}_{i}$ is the maximum generation of the $i$ th family. In particular, the testing statistic $T$ was defined as in Equation 2, and the subsequent pvalue was estimated as in Equation 3.
To challenge the null hypotheses that differentiation without division was independent of the culture condition for the data underlying Figure 2F, we compared the proportions of cell types of undivided cells (i.e., those in generation 0). For these procedures, it sufficed to follow the same rationale as for the tests related to Figure 2D, E, with $D={\left({\tau}_{i},{c}_{i},s\left({c}_{i}\right)\right)}_{i=1}^{N}$ the sequence of $N$ families in generation 0, where ${\tau}_{i}$ identifies the type of the unique cell in family ${c}_{i}$. In particular, the testing statistic $T$ was defined as in Equation 2, and the subsequent pvalue was estimated as in Equation 3.
To challenge the null hypotheses that the pattern of first division was independent of the culture condition, for the data underlying Figure 2G we compared the proportion of division types among families recovered with two cells in generation 1. For these procedures, it sufficed to follow the same rationale as for the tests in Figure 2D–F, with $D={\left({\tau}_{i},{c}_{i},s\left({c}_{i}\right)\right)}_{i=1}^{N}$ as the dataset of $N$ families with two cells generation 1, where ${\tau}_{i}$ records the pattern of division of the family ${c}_{i}$ as one out of four possibilities (outlined in Figure 2G). The test statistic $T$ was defined as in Equation 2, and the subsequent pvalue was estimated as in Equation 3.
For the data in a given time point (24 or 48 hr) underlying Figure 4B, we investigated the family effect on differentiation by challenging the null hypotheses that differentiation diversity among cells from the same ancestor type was independent of familial membership. In particular, as the cells from the data were found in different generations, we sought to take into account that division may have had an impact on differentiation (Figure 3D). These data were identified by the sequence $D={\left({\tau}_{i},{g}_{i},{c}_{i}\right)}_{i=1}^{N}$ of the $N$ cells from the same progenitor, with ${\tau}_{i},{g}_{i},{c}_{i}$ recording the type, the generation, and the family label, respectively, of the $i$ th cell. To test the null hypothesis by permutation, the set of invariant transformations $Q$ for $D$ should permute, across families, only cells that were found in the same generation. To this end, $Q$ was generated by the functions ${\pi}_{g}$ for $g\in G=\{{g}_{i},i=1,\dots ,N\}$, such that
where ${\stackrel{~}{\pi}}_{g}$ is any permutation of the set $\{i=1,\dots ,N:{g}_{i}=g\}$. Then ${D}^{\pi}={\left({\tau}_{i},{g}_{i},{c}_{\pi \left(i\right)}\right)}_{i=1}^{N}$. To measure family differentiation diversity, we defined the statistic $T$ for the average number of cell types per family, that is,
where $\{1,\dots ,M\}$ is the set of all family labels, $J=\{{\tau}_{i}:i=1,\dots ,N\}$ is the set of all cell types observed, and $\mathfrak{T}}_{c}=\left\{{\tau}_{i},i=1,\dots ,N:{c}_{i}=c\right\$. In this case, the alternative hypothesis posited that familial relationship induced a more homogeneous differentiation in terms of cell types, leading to a decreased number of cell types expected per families $T\left(D\right)$. For this reason, by Monte Carlo approximation, we estimated the pvalue for the lefttailed test as
where ${\pi}_{1},\dots ,{\pi}_{B}$ are sampled uniform and independent sampled elements from $Q$.
Using the data underlying Figure 4D, we wished to challenge the null hypotheses that family progression is independent of ancestral expression levels (CD48, cKit, Sca1). For notational purposes, the data were represented as a sequence $D={\left({\tau}_{i},{g}_{i}\right)}_{i=1}^{N}$ of $N$ families where the $i$ th family was identified by expression level ${\tau}_{i}$, relative to one marker; maximum generation of its offspring ${g}_{i}$. Given the set of maximum generations attained, $J=\left\{{g}_{i},i=1,\dots ,N\right\}$, we partitioned the data into ${D}_{j}=\left({\tau}_{i},i=1\dots ,N:{g}_{i}=j\right)={\left({\tau}_{ji}\right)}_{i=1}^{{n}_{j}}$ collections of size ${n}_{j}$, for $j\in J$. Thus, we sought to test the null hypothesis that the variables ${\tau}_{ji}$ are identically distributed, against the alternative hypothesis that, given ${m}_{j}$ the median of the distribution from which the elements of ${D}_{j}$ are drawn, for every $k,h\in J$ such that $k\le h$, are either increasing
or decreasing
where at least one inequality must be strict. To this end, the statistic $T$ of the data $D$ was defined from the Jonckheere's trend test, that is,
Under the null hypothesis that the variables ${\tau}_{ji}$ are identically distributed, $D$ is equally likely as a dataset ${D}^{\pi}={\left({\tau}_{i},{g}_{\pi \left(i\right)}\right)}_{i=1}^{N}$ transformed by the action of any permutation $\pi \in Q$ of the set of family labels $\{1,\dots ,N\}$. As a consequence, using Monte Carlo approximation we estimated the pvalue for the twotailed test as
where $B=\mathrm{250,000}$ and ${\pi}_{1},\dots ,{\pi}_{B}$ were uniformly and independently sampled from $Q$.
For the data underlying Figure 4E, we sought to challenge the null hypotheses that family differentiation to a certain cell type (SLAMHSC, PreMegE, GMP, CMP/MEP, MPP) is independent of ancestral expression levels (CD48, cKit, Flt3, Sca1). For these procedures, it sufficed to follow the same rationale as for the tests used for the data in Figure 4D, but for the dataset $D={\left({\tau}_{i},{g}_{i}\right)}_{i=1}^{N}$ of $N$ families, where, for the $i$ th family, ${\tau}_{i}$ is the expression level from one marker of its ancestor cell, while ${g}_{i}=1$ if its offspring was detected having at least one cell of the cell type under consideration, ${g}_{i}=0$ otherwise. In particular, the testing statistic $T$ was defined as in Equation 7, and the subsequent pvalue was estimated as in Equation 8.
When multiple hypotheses were tested from the same data, the familywise error rate was controlled using Holm–Bonferroni method (Lehmann and Romano, 2006). As such, given the ordered pvalues from $k$ simultaneous tests ${\widehat{p}}_{B1}^{t}\le \dots \le {\widehat{p}}_{Bk}^{t}$, the $i$ th pvalue was adjusted and recalculated as
Betabinomial model for family concordance
Request a detailed protocolTo quantify the correlation in decisions of cells to continue to divide or cease dividing, for cells from the same generation and descending from the same ancestor, we employed a stochastic mathematical model that was first described in.
In biological terms, in this mathematical model, cells divide or stop dividing with a certain probability that is correlated between cells from the same family and generation. When sampling, cells that divide less tend to be detected less, simply because cells that divide more are more abundant. Thus, it is important to take sampling into account when measuring the range of division within a family. In this model, cells have a certain probability to be measured defined by the recovery rate. The correlation coefficient that links the division of cells within the same family is fitted to the data and is used to evaluate the degree of correlation in division within families, the socalled concordance in division.
In more detail, the model is directly parameterized by the data, apart from one variable that encapsulates the correlation in decisionmaking that is fit to the data. In particular, with $n$ being the maximum generation recorded, let ${p}_{i}\in \left[\mathrm{0,1}\right]$ for $i=0,\dots ,n$ denote the proportion of cells that divide from generation $i$ to the next, which is determined from the data as follows. Set ${z}_{i}$ the total number of cells recovered in generation $i$, the ${p}_{i}$ were estimated by
for $i=0,\dots ,n1$ and by ${\widehat{p}}_{n}=0$.
In this model, given ${k}_{i}$ the number of cells from a particular family that reach generation $i$, the number of cells that continue on to divide to $i+1$ follow a betabinomial distribution with parameters ${k}_{i}$, ${a}_{i}={p}_{i}\left(1\rho \right)/\rho $, and ${b}_{i}={(1p}_{i})\left(1\rho \right)/\rho $, namely $\beta \left({k}_{i},{a}_{i},{b}_{i}\right)$, where $\rho \in \left[\mathrm{0,1}\right]$ is a free parameter. In particular, each family is generated recursively by setting ${k}_{0}=1$ and defining ${k}_{i+1}=2\beta \left({k}_{i},{a}_{i},{b}_{i}\right)$. As in the experimental system, not all cells are recovered, but the proportion that can be determined either by beads or wellvolume recovered, on generating a family with the model, we accounted for sampling effect by subsampling each cell from a family with probability $r=0.71$ independently of all other cells. The betabinomial model interpolates between cells deciding to divide again independently of one another if $\rho =0$, and when they are perfectly aligned, all making the same division decision, which occurs when $\rho =1$. A value between 0 and 1 reflects the level of concordance within each family in divisionprogression decisionmaking, but, by construction, irrespective of the values of ${p}_{i}$, that determines the population distribution among the generations. Defining the family range as the difference between maximum and minimum generations in which the cells from a family are recovered, the bestfit $\rho $ was determined to be the value that maximized the likelihood of recapitulating the family range distribution in the data.
Data availability
All data generated or analysed during this study are included in the manuscript and supporting files. Source data has been provided for Figures 14.
References

Asymmetric Cell Divisions Sustain LongTerm Hematopoiesis from Singlesorted Human Fetal Liver CellsJournal of Experimental Medicine 188:1117–1124.https://doi.org/10.1084/jem.188.6.1117

BookBootstrap Methods and Their ApplicationCambridge, UK: Cambridge University Press.https://doi.org/10.1017/CBO9780511802843

Hematopoietic Stem Cells Need Two Signals to Prevent Apoptosis; Bcl2 Can Provide One of These, Kitl/CKIT Signaling the OtherJournal of Experimental Medicine 192:1707–1718.https://doi.org/10.1084/jem.192.12.1707

In Vitro SelfRenewal Division of Hematopoietic Stem CellsJournal of Experimental Medicine 192:1281–1288.https://doi.org/10.1084/jem.192.9.1281

Clonal expansion capacity defines two consecutive developmental stages of longterm hematopoietic stem cellsJournal of Experimental Medicine 211:209–215.https://doi.org/10.1084/jem.20131115

Multiplexed Division Tracking Dyes for ProliferationBased Clonal Lineage TracingThe Journal of Immunology 201:1097–1103.https://doi.org/10.4049/jimmunol.1800481

BookTesting Statistical HypothesesSpringer Science & Business Media.https://doi.org/10.1007/038727605X

Retracing the in vivo haematopoietic tree using singlecell methodsFEBS Letters 590:4068–4083.https://doi.org/10.1002/18733468.12299

Index sorting resolves heterogeneous murine hematopoietic stem cell populationsExperimental Hematology 43:803–811.https://doi.org/10.1016/j.exphem.2015.05.006

High cKit expression identifies hematopoietic stem cells with impaired selfrenewal and megakaryocytic biasJournal of Experimental Medicine 211:217–231.https://doi.org/10.1084/jem.20131128

Kinetics of adult hematopoietic stem cell differentiation in vivoJournal of Experimental Medicine 215:2815–2832.https://doi.org/10.1084/jem.20180136

Imaging Hematopoietic Precursor Division in Real TimeCell Stem Cell 1:541–554.https://doi.org/10.1016/j.stem.2007.08.009
Decision letter

Utpal BanerjeeSenior and Reviewing Editor; University of California, Los Angeles, United States
In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.
Acceptance summary:
Tak et al. address an important problem in stem cell biology, which is how stem cells (in this case, hematopoietic stem cells, or HSCs), most of which are known to be biased or restricted in their lineage output, acquire their lineage bias. Specifically, the authors sought to uncouple the process of division and differentiation from divergent instructive signals from a niche by culturing single cells (or actually groups of 4 cells) ex vivo under defined conditions (i.e. the same extrinsic instructive signals) and examining cell division and associated lineage decisions. The authors make several conclusions from their studies. One important conclusion is that daughters of a single ancestor HSC or progenitor and their progeny divided at very similar rates, which differed between ancestral cell families, suggesting that the division rate is an intrinsic property of the ancestor cell that was passed along to its daughters. The authors also tried to link the properties of daughter cells to the levels of cell surface markers on the ancestral clone using the original index sorting information, but found that cell surface marker levels only partially correlated with the division and differentiation properties.
Decision letter after peer review:
Thank you for sending your article entitled "HSPCs display withinfamily homogeneity in differentiation and proliferation despite population heterogeneity" for peer review at eLife. Your article is being evaluated by 2 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Utpal Banerjee as the Senior Editor.
Reviewer #1:
This study reports an in vitro cell labeling method combined with mathematical modeling to simultaneously track the number of cell divisions and the differentiation progress of single murine hematopoietic stem cells and progenitors. The authors conclude that cells derived from a common ancestor have statistically significant concordant proliferation and differentiation.
I found the paper was difficult to understand in parts, not just because the mathematical modeling is outside my expertise.
1. Although the immunophenotypes used to isolate the populations from fresh bone marrow are well established, reliance on immunophenotype in cultured cells to define functional status is problematic.
2. The biology of HSC and progenitor cells is dealt with superficiallyan example in point is the casual statement that the immunophenotypic data show "dedifferentiation" of progenitors to HSC, a phenomenon that is not accepted by the field; proof of this heresy would require use of more rigorous assays than those used here.
3. The assignment of immunophenotype based on restaining cells 2448 hours after original staining requires careful controls to ensure antibody associated fluorescence is not a carryover from prior antibody staining.
4. As the culture conditions (supraphysiologic concentrations of cytokines in cell suspension without ECM or stroma) do not reflect the complexity of the normal microenvironment the relevance of the findings to normal HSPC biology is limited.
Reviewer #2:
Tak et al. address an important problem in stem cell biology, which is how stem cells (in this case hematopoietic stem cells, or HSCs), most of which are known to be biased or restricted in their lineage output, acquire their lineage bias. Specifically, the authors sought to uncouple the process of division and differentiation from divergent instructive signals from a niche by culturing single cells (or actually groups of 4 cells) ex vivo under defined conditions (i.e. the same extrinsic instructive signals) and examining cell division and associated lineage decisions. The authors make several conclusions from their studies. One important conclusion is that daughters of a single ancestor HSC or progenitor and their progeny divided at very similar rates, which differed between ancestral cell families, suggesting that the division rate was an intrinsic property of the ancestor cell that was passed along to its daughters. The authors also tried to link the properties of daughter cells to the levels of cell surface markers on the ancestral clone using the original index sorting information, but found that cell surface marker levels only partially correlated with the division and differentiation properties.
The study required sophisticated statistical analyses. An issue is that the authors defined progenitors that differentiated from the original ancestral cell using cell surface markers that were originally defined using freshly isolated cells coupled with functional assays. The authors use these markers to define progenitor types following one or two days of ex vivo culture. Markers can change in culture, and the authors did not demonstrate with functional assays that the markers reliably identified progenitors that have been in culture for 12 days. This does not impact their conclusions about the division history, but it may impact the proper identification of the more differentiated progeny of the ancestral cell.
Has the faithfulness of the marker expression in cultured cells been verified by other laboratories? If so, please provide a reference. If not, the authors should explain how they determined that marker expression correctly identified different progenitor types in culture.
https://doi.org/10.7554/eLife.60624.sa1Author response
Reviewer #1:
This study reports an in vitro cell labeling method combined with mathematical modeling to simultaneously track the number of cell divisions and the differentiation progress of single murine hematopoietic stem cells and progenitors. The authors conclude that cells derived from a common ancestor have statistically significant concordant proliferation and differentiation.
I found the paper was difficult to understand in parts, not just because the mathematical modeling is outside my expertise.
1. Although the immunophenotypes used to isolate the populations from fresh bone marrow are well established, reliance on immunophenotype in cultured cells to define functional status is problematic.
We thank the Reviewer for raising this important point. Most of our phenotypic definitions were inspired by Pronk et al., Cell Stem Cell 2017. The comment from reviewer 2 made us realize that our naming was not fully transparent (in particular for Slam+MEP) so we have changed the naming of the progenitors phenotypically defined after culture.
We agree that the cell surface markers we used are most often used for identification of cell populations in freshly isolated bone marrow rather than cultured cells (as it is the case in Pronk et al). We have now performed semisolid cultures (methylcellulose and CFUMK assays) to functionally assess the differentiation capacity of the different cell populations defined with our phenotypic definitions after 48h of culture (SCF, TPO, IL3 and IL6). As now described in the manuscript page 19 and shown in the new panels B and C for figure 2, these assays have shown that the differentiation capacity of these progenitors after culture is very close to the fresh ones.
2. The biology of HSC and progenitor cells is dealt with superficiallyan example in point is the casual statement that the immunophenotypic data show "dedifferentiation" of progenitors to HSC, a phenomenon that is not accepted by the field; proof of this heresy would require use of more rigorous assays than those used here.
While we can see how the reviewer may have come to that conclusion, we respectfully disagree that our statement was casual as its inclusion was actually entirely considered. However, the finding about phenotypic dedifferentiation is not the focus of our study. Carrying in vivo transplantation involved work that goes beyond its remit and, therefore, we have entirely removed the comment about dedifferentiation in our manuscript.
3. The assignment of immunophenotype based on restaining cells 2448 hours after original staining requires careful controls to ensure antibody associated fluorescence is not a carryover from prior antibody staining.
We were also concerned about this issue and so had already checked that the fluorescence is lost within 24h of culture. We have added this data as Figure 2—figure supplement 2.
4. As the culture conditions (supraphysiologic concentrations of cytokines in cell suspension without ECM or stroma) do not reflect the complexity of the normal microenvironment the relevance of the findings to normal HSPC biology is limited.
We agree that our study in vitro does not reproduce the in vivo complexity. in vitro studies are, of course, never physiological but are employed as reasonable models in which to study effects that are not assessable in vivo. Our approach to ameliorating those limitations was to use two distinct culture conditions, which demonstrate that the observations on concordance in division and differentiation are consistent in both. We disagree with the reviewer, and believe these results are still significant for HSPC biology.
Reviewer #2:
Tak et al. address an important problem in stem cell biology, which is how stem cells (in this case hematopoietic stem cells, or HSCs), most of which are known to be biased or restricted in their lineage output, acquire their lineage bias. Specifically, the authors sought to uncouple the process of division and differentiation from divergent instructive signals from a niche by culturing single cells (or actually groups of 4 cells) ex vivo under defined conditions (i.e. the same extrinsic instructive signals) and examining cell division and associated lineage decisions. The authors make several conclusions from their studies. One important conclusion is that daughters of a single ancestor HSC or progenitor and their progeny divided at very similar rates, which differed between ancestral cell families, suggesting that the division rate was an intrinsic property of the ancestor cell that was passed along to its daughters. The authors also tried to link the properties of daughter cells to the levels of cell surface markers on the ancestral clone using the original index sorting information, but found that cell surface marker levels only partially correlated with the division and differentiation properties.
The study required sophisticated statistical analyses. An issue is that the authors defined progenitors that differentiated from the original ancestral cell using cell surface markers that were originally defined using freshly isolated cells coupled with functional assays. The authors use these markers to define progenitor types following one or two days of ex vivo culture. Markers can change in culture, and the authors did not demonstrate with functional assays that the markers reliably identified progenitors that have been in culture for 12 days. This does not impact their conclusions about the division history, but it may impact the proper identification of the more differentiated progeny of the ancestral cell.
Has the faithfulness of the marker expression in cultured cells been verified by other laboratories? If so, please provide a reference. If not, the authors should explain how they determined that marker expression correctly identified different progenitor types in culture.
We thank the reviewer for raising this important point. First, the Reviewer’s comment made us realize that we were not using the same names than the one used in the literature (eg Pronk et al., 2007; Grinengo et al., 2018) which was misleading. We have changed the nomenclature to follow a similar scheme to Pronk et al., 2007 (e.g. Slam+MEP is now PreMegE). We agree that the cell surface markers we used are most often used for identification of cell populations in freshly isolated bone marrow than cultured cells (as it is the case in Pronk et al).
We have now performed semisolid cultures (methylcellulose and CFUMK assays) to functionally assess the differentiation capacity of the different cell populations defined with our phenotypic definitions after 48h of culture (SCF, TPO, IL3 and IL6). As now described in the manuscript page 19 and included in figure 2,these assays have shown that the differentiation capacity of these progenitors after culture is very similar to the fresh ones.
https://doi.org/10.7554/eLife.60624.sa2Article and author information
Author details
Funding
Fondation Bettencourt Schueller (ATIPAvenir)
 Leïla Perié
Centre National de la Recherche Scientifique (ATIPAvenir)
 Leïla Perié
Labex Cell(n)Scale (ANR10LBX0038)
 Leïla Perié
Idex Paris Sciences et Lettres (ANR10IDEX000102 PSL)
 Leïla Perié
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We would like to thank the members of Institut Curies Flow Facility for their help with setting up the flow cytometry experiments, the members of the Institut Curie Animalerie for their care for our experimental animals, JeanLuc Villeval for counting the methylcellulose colonies, Prof. Phil Hodgkin (Walter and Eliza Hall Institute of Medical Research) and Dr Julia Marchingo (University of Dundee) for their advice on setting up the experiment, and Stefania Pan and Emiliano Lancini (Université Paris 13) for their advice on optimization problems in graph theory. The present study was supported by an ATIPAvenir grant from CNRS and BettencourtSchueller Foundation (to LP) and two grants from the Labex CelTisPhyBio (ANR10LBX0038) and Idex ParisScienceLettres Program (ANR10IDEX000102 PSL) (to LP).
Ethics
Animal experimentation: All the experimental procedures were approved by the local ethics committee (Comité d'Ethique en expérimentation animale de l'Institut Curie) under approval number DAP 2016 006.
Senior and Reviewing Editor
 Utpal Banerjee, University of California, Los Angeles, United States
Publication history
 Received: July 1, 2020
 Accepted: May 17, 2021
 Accepted Manuscript published: May 18, 2021 (version 1)
 Version of Record published: June 3, 2021 (version 2)
Copyright
© 2021, Tak et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics

 1,254
 Page views

 186
 Downloads

 4
 Citations
Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading

 Stem Cells and Regenerative Medicine
During severe or chronic hepatic injury, biliary epithelial cells (BECs) undergo rapid activation into proliferating progenitors, a crucial step required to establish a regenerative process known as ductular reaction (DR). While DR is a hallmark of chronic liver diseases, including advanced stages of nonalcoholic fatty liver disease (NAFLD), the early events underlying BEC activation are largely unknown. Here, we demonstrate that BECs readily accumulate lipids during highfat diet feeding in mice and upon fatty acid treatment in BECderived organoids. Lipid overload induces metabolic rewiring to support the conversion of adult cholangiocytes into reactive BECs. Mechanistically, we found that lipid overload activates the E2F transcription factors in BECs, which drive cell cycle progression while promoting glycolytic metabolism. These findings demonstrate that fat overload is sufficient to reprogram BECs into progenitor cells in the early stages of NAFLD and provide new insights into the mechanistic basis of this process, revealing unexpected connections between lipid metabolism, stemness, and regeneration.

 Stem Cells and Regenerative Medicine
Stem cell differentiation requires dramatic changes in gene expression and global remodeling of chromatin architecture. How and when chromatin remodels relative to the transcriptional, behavioral, and morphological changes during differentiation remain unclear, particularly in an intact tissue context. Here, we develop a quantitative pipeline which leverages fluorescentlytagged histones and longitudinal imaging to track largescale chromatin compaction changes within individual cells in a live mouse. Applying this pipeline to epidermal stem cells, we reveal that celltocell chromatin compaction heterogeneity within the stem cell compartment emerges independent of cell cycle status, and instead is reflective of differentiation status. Chromatin compaction state gradually transitions over days as differentiating cells exit the stem cell compartment. Moreover, establishing live imaging of Keratin10 (K10) nascent RNA, which marks the onset of stem cell differentiation, we find that Keratin10 transcription is highly dynamic and largely precedes the global chromatin compaction changes associated with differentiation. Together, these analyses reveal that stem cell differentiation involves dynamic transcriptional states and gradual chromatin rearrangement.