Introduction

During gastrulation the three germ layers are established, and the body plan is laid out. In amniotes this involves the formation of the primitive streak on the posterior side of the embryo from which cells ingress beneath the epiblast and migrate to form the mesodermal and endodermal layers. An important role in gastrulation is played by evolutionarily conserved morphogens from the BMP, Wnt, Nodal, and FGF families1. The function of the first three signaling pathways is comparatively well understood: in both mouse and human a transcriptional hierarchy between BMP4, Wnt3, and Nodal plays a central role in inducing dynamic signaling gradients along the anterior posterior axis that control cell fate patterning13. For BMP and Nodal, basal receptor localization in the epiblast is crucial to form signaling activity gradients37. Wnt and Nodal signaling are directly required for mesoderm and endoderm differentiation, while the balance between BMP and Nodal signaling is important for patterning the germ layers along the body axes8.

In contrast, the role of FGF in mammalian and especially human gastrulation has been more elusive. Secreted FGFs signal through FGF receptor tyrosine kinases (FGFRs) in a heparan sulfate proteoglycan (HSPG)-dependent manner to activate several signaling pathways including MAPK/ERK, PI3K, and PLCγ9. There is evidence for a conserved requirement for FGF/ERK signaling in mesoderm formation across vertebrate model organisms1013. However, it is unknown if there are FGF- and phosphorylated ERK gradients across the mammalian primitive streak14,15, whether there are functionally relevant ERK signaling dynamics like in other contexts, or whether receptor localization is important for establishing signaling gradients1619. It is also unclear to what extent the requirement in mesoderm induction is direct, or indirect by modulating other signals14,15,20. In addition, several FGFs are co-expressed and the roles of specific FGFs are uncertain. Moreover, there may be significant interspecies differences in this respect. Notably, FGF8 is the only FGF which has been shown to be required for gastrulation in the mouse21, and its requirement in gastrulation appears to be conserved across vertebrate model organisms2224, yet FGF8 is not expressed in primate embryos25,26.

Mutants for FGF8, FGFR1, HSPG synthesis, and ERK2, as well as pharmacological inhibition of MEK and FGFR give rise to similar phenotypes in mouse gastrulation, suggesting that FGFR1 and FGF8 are the only FGF ligand-receptor pair required for gastrulation and that these act through the MAPK/ERK pathway12,20,21,2731. FGF4 expression during gastrulation is also lost in FGF8 mutants, leaving the possibility of a role for FGF4, but this remains unknown as FGF4 is also required in the blastocyst and mutants therefore do not reach gastrulation32. In all mutants and for pharmacological inhibition of FGF/ERK signaling, expression of the primitive streak marker brachyury (TBXT, also BRA, or T in mouse) is lost or severely diminished in the posterior streak and mesoderm fails to migrate away, piling up in this region. In addition, TBX6 expression is lost completely, and no lateral or paraxial mesoderm are formed. In contrast, TBXT expression in the anterior streak and extraembryonic mesoderm is not dependent on FGF/ERK signaling; extraembryonic mesoderm development is normal, and the axial mesoderm is expanded.

In vitro models offer an opportunity to disentangle the complexity of FGF signaling in human development. Here, we explore the role of FGF/ERK signaling in micropatterned human pluripotent stem cells treated with BMP4, also known as 2D human gastruloids. These give rise to concentric rings of different cell types associated with gastrulation after 2 days of differentiation. Pluripotent cells in the center are surrounded by a primitive streak-like ring, which itself is surrounded by primordial germ cells and amnion-like cells on the colony edge3335. Due to its simplicity and reproducibility, this system has proven powerful in deciphering the mechanisms underlying aspects of human gastrulation. It was used to show that BMP-Wnt-Nodal hierarchy that controls primitive streak formation during mouse gastrulation is preserved in human while providing a range of new insights into how these signals function2,3. For example, the crucial role for BMP and Nodal receptor localization in patterning was discovered in 2D gastruloids and later confirmed in the mouse embryo. This system also revealed that BMP, Wnt, and Nodal signaling patterns are highly dynamic37. It therefore also provides a promising approach to determine how FGF may function in human gastrulation.

In the 2D human gastruloid model, we show that a ring of phosphorylated ERK forms in an FGF-dependent manner and expands largely in lockstep with the formation of primitive streak-like cells but extends further inward. We demonstrate that this pattern of ERK activity is due to localized FGF receptor (FGFR) activation that requires basal FGFR localization. We then show that exogenous FGF2 in the media is required for pattern formation to occur but can be removed well before the phosphorylated (active) ERK (pERK) and PS-like rings appear, suggesting the pERK ring instead depends on endogenous FGF gradients. Using single cell RNA-sequencing, we determine the expression of FGF pathway components including FGFR1 as the main receptor and FGF2, FGF4, and FGF17 as the main ligands. Our analysis corroborates previous studies describing expression of FGF2 and FGF17 in human embryos and gastruloids but for the first time recognizes a possible role for FGF4 and includes a broad analysis of genes involved in FGF signaling modulation25,36. We find that the FGF4 and FGF17 expression are restricted to the PS-like ring and that reduction of FGF4 significantly reduces PS-like differentiation, while FGF17 reduction leads to a smaller reduction in PS-like cells and their derivatives. In summary, we have greatly advanced our understanding of how FGFs function in a model for human gastrulation.

Results

FGF/ERK signaling is required for primitive streak-like differentiation

We previously observed that pharmacological inhibition of MEK (the kinase upstream of ERK) or FGF receptors leads to a loss of differentiation to primitive streak-like and primordial germ cell-like cells, suggesting a crucial role for FGF/ERK signaling34. Moreover, a ring of active ERK (pERK) has been observed in 2D gastruloids but its connection to FGF signaling or differentiation remains unclear33. Therefore, we wanted to investigate the role of FGF/ERK signaling further.

To determine the dynamics of pERK and relate it to PS-like differentiation, we first performed time series immunofluorescence co-staining for pERK and the PS marker TBXT after BMP4 treatment. This revealed the pERK ring first emerges around 24h, around the same time as TBXT expression, and gradually gets brighter and wider (Fig.1a-c). By 42h the outer edge of the ERK ring coincided with TBXT expression, while the inner edge was less sharp and extended further towards the center than TBXT, reminiscent of the domains of WNT and NODAL signaling that extend further in than the PS-like region3,6 (Fig.1a, Supp. Fig.1ab). MEK or FGFR inhibition for 30 minutes after 41.5h eliminated ERK activity, suggesting ERK activity depends entirely on FGF and that continuous FGF signaling is necessary to maintain the ERK signaling pattern (Fig.1bc). Inhibition for the full duration of differentiation eliminated TBXT expression as well as other PS markers (Fig.1bc, Supp. Fig.1c). However, we were able to rescue TBXT expression by directly and uniformly activating ERK (Fig. 1d-f, Supp. Fig.1de) in cells stably expressing the doxycycline-inducible membrane-targeted catalytic domain of SOS (dox-SOScat)37. Although TBXT expression was rescued by ERK activation, reduced proliferation caused by FGFR inhibition was not, indicating proliferation may be regulated by additional downstream pathways (Fig. 1d). Furthermore, despite uniform induced ERK activity, TBXT expression remained excluded from the center, showing that ERK activation alone is not sufficient for TBXT induction. However, TBXT levels were much higher in the rescue (Supp. Fig. 1E), possibly reflecting the much higher level of ERK activity induced by dox-SOScat. Altogether, these data show the formation of a PS-like ring is dependent on FGF signaling through ERK.

a) Immunofluorescence stainings for pERK and TBXT and their radial intensity profiles in 2D gastruloids at different times. Error bars in graphs are standard deviations over N=4 colonies. b,c) Effect FGF receptor inhibition (FGFRi, b) or MEK inhibition (MEKi, c) on pERK and TBXT in colonies fixed at 42h after BMP treatment. Inhibition for either 30 minutes at 41.5h (rounded to 42 in figure) or throughout differentiation. d) BMP only control and FGFRi inhibition in cells expressing doxycycline-inducible SOScat with and without addition of doxycycline. e-f) Cell numbers (e) and radial profiles of TBXT positive cells for conditions in (d). g) PS marker TBXT, pERK and pluripotency marker SOX2 in Nodal knockout cells with inhibitors of Wnt secretion (WntSeci) and BMP receptors (BMPRi) with or without MEKi or FGFRi. All scale bars 50um.

This raised several questions. First, is FGF/ERK signaling required directly for PS-like differentiation, or does it act indirectly, for example, by controlling the expression of Wnt or Nodal which are both required for PS-like differentiation? These possibilities are not mutually exclusive. Second, which are the FGFs activating the ERK pathway? The pathway could be activated by exogenous FGF2 present in the pluripotency medium mTeSR1 or by endogenously expressed ligands. Third, how is ERK activity restricted to a ring?

To determine if there is a direct requirement of FGF/ERK signaling and whether this depends on exogenous FGF, we performed differentiation to PS-like cells in standard culture by exogenous activation of Wnt and Nodal signaling, bypassing the need to form endogenous Wnt and Nodal gradient as in 2D gastruloids38. We used a minimal base medium (E6) that did not include exogenous FGF or other growth factors. To rule out any additional requirement for endogenous Nodal and Wnt which may be FGF/ERK-dependent, we used Nodal knockout cells3 treated with the Wnt secretion inhibitor IWP2, thereby eliminating any endogenous activity of these pathways. We also inhibited BMP receptors using LDN193189 to rule out any indirect effect through endogenous BMP.

Under these conditions, with exogenous Wnt and Nodal activation for 24h in the absence of exogenous FGF, cells efficiently differentiated to TBXT+ PS-like cells (Fig.1g, Supp. Fig.1f). These PS-like cells had high active ERK levels just as in the 2D gastruloids. Inhibition of either FGFR or MEK abrogated ERK activity and TBXT expression while FGFR inhibition also severely reduced proliferation. In both cases significant expression of the pluripotency and ectoderm marker SOX2 was maintained, although SOX2 levels were much lower with FGFR inhibition (Supp. Fig. 1f). Given that FGF/ERK is known to be involved in pluripotency maintenance, this suggests inhibition may not be complete and that much lower ERK activity is required for pluripotency maintenance than PS-like differentiation, consistent with lower ERK activity in the pluripotent center of gastruloids. The greater loss of SOX2 with inhibition of FGFR than MEK could reflect more efficient inhibition of ERK signaling but could also indicate that other pathways downstream of FGFR, such as PI3K, play a role in maintaining SOX2 expression. Altogether, these results show that endogenous FGF expression is both necessary and sufficient for PS-like differentiation by exogenous Wnt and Nodal stimulation and demonstrate a direct requirement for ERK signaling in PS-like differentiation.

Single cell RNA-sequence reveals differentially expressed FGF pathway components

The above results suggest endogenous FGF expression may contribute to elevated ERK signaling in the PS-like ring. However, FGF/ERK signaling may be modulated at many levels (Fig. 2a). A spatial pattern of ERK activation could reflect a concentration gradient of FGF. Alternatively, differential expression of receptors and HSPG-modifying enzymes could lead to spatially patterned receptor activation and downstream ERK activity even in the absence of an FGF gradient. Even if receptor activation is uniform, downstream ERK activity could be spatially patterned by downstream negative regulators of ERK activity such as Sprouty-family proteins (SPRY) and dual-specificity phosphatases (DUSPs)40.

a) Schematic of the FGF/ERK signaling pathway. b) Distribution of pERK levels in TBXT+ cells versus ISL1+ and other cells based on data and thresholds in Supp. Fig. 2b. c) Expression of key markers in PHATE39 projection of scRNA-seq data for 42h colony. d) Cell type annotation. PS-LC: primitive streak-like cells, NasMe: nascent mesoderm, AE-LC: amniotic ectoderm-like cells: PGC-LC: primordial germ cell-like cells, endo: definitive endoderm. e) Expression of canonical FGF ligands (log transformed and smoothened). f) Comparison of FGF ligand expression in 2D human gastruloids to human, monkey, and mouse embryo, normalized within each type of sample. g-i) Expression of several genes involved in FGF signaling with strong differential expression between PS-like cells or nascent mesoderm and pluripotent cells, split by gene category.

To understand which genes modulating FGF/ERK signaling could be responsible for the elevated ERK signaling in TBXT positive cells and required for TBXT expression (Fig.1, Fig. 2b), we extended previous work using single cell RNA-sequencing to determine expression of FGF pathway components in human gastrulation25,35,36. TBXT-positive PS-like cells give rise to nascent mesoderm expressing TBX6 as well as definitive endoderm expressing SOX17 and FOXA2, both of which may still express TBXT at 42h41. To understand how TBXT expression is initiated in an FGF-dependent manner, we focused on differential expression in the PS-LC relative to pluripotent cells, reasoning that FGF modulation specific to nascent mesoderm or endoderm is a consequence rather than a cause of PS-LC differentiation. We identified different cell types based on marker genes with the PS-like cells defined as TBXT+MIXL1+POU5F1+TBX6-SOX17-FOXA2-(Fig. 2cd, Supp. Table 4). For our analysis, we combined one previously published dataset and one new replicate sample at 42h (Supp. Fig. 2a). We then analyzed expression of different categories of FGF/ERK regulators.

We found four FGF ligands with significant expression: FGF2, 4, 8, and 17 (Fig. 2e, Supp. Fig. 2b). We then compared the expression of these ligands in PS-like cells to reference data of CS7 human, CS8 monkey, and E6.5-E7.5 mouse gastrulation (Fig. 2f, Supp. Fig. 2cd). We also included FGF3 and FGF5, which are expressed during mouse gastrulation32,42,43. High expression of FGF4 and FGF17 and low expression of FGF8 was consistent between human, monkey, and 2D gastruloid. FGF2 expression was also high in the PS-like populations in 2D gastruloids and the CS8 monkey embryo, but much lower in for human embryo data. Furthermore, the monkey and human embryos expressed FGF3, which was minimally expressed in gastruloids. A similar comparison in nascent mesoderm showed much lower FGF2/4 and much higher FGF17 compared to PS across all primate samples (Supp. Fig. 2e). Expression in 2D gastruloids was consistent with published RNA-seq data for hPSC-derived PS-like cells, which expressed high FGF2 and FGF4, some FGF8 and FGF17, and no FGF3 (Supp. Fig. 2f). Comparison with mouse confirmed significant differences from primates, with high expression of FGF5, 8, much lower expression of FGF4,17, and no expression of FGF2 in the mouse21,42,43.

Of the FGF receptors, only FGFR1 and FGFR4 were strongly expressed, with FGFR1 expression about 5-fold higher (Fig. 2g, Supp. Fig. 2g). Both receptors showed graded expression that decreased from pluripotent to PS-like cells and much lower expression in amnion-like cells (Fig. 2g). Therefore, elevated ERK activity in PS-like cells cannot be accounted for by overall receptor expression. Differential expression of genes regulating heparin-sulfate proteoglycans which modulate FGF receptor binding was mild except for glypicans 3 and 4 which were downregulated in PS-like cells relative to pluripotent cells (Fig. 2h, Supp. Fig. 2g). In contrast, several intracellular negative regulators of ERK signaling were strongly differentially expressed between cell clusters, including SPRY1 and PEBP1 which were significantly higher in pluripotent cells (Fig. 2i, Supp. Fig. 2g). Surprisingly, however, expression of SPRY4 was not elevated in PS-like cells where ERK signaling is high, suggesting that transcription of SPRY4 may not be a good readout of ERK activity in human hPSCs like it is in the mouse44. On the other hand, expression of the ERK negative feedback regulator IL17RD (SEF) does seem to approximately reflect expected ERK activity in each population. Components of the core ERK pathway including ERK itself were highly expressed across all clusters (Supp. Fig. 2g).

Thus, while scRNA-seq identified specific genes involved, it is consistent with several non-exclusive mechanisms for generating an ERK pattern. For example, uniform FGF2 could form an ERK ring by equally activating receptors across the colony but with ERK response reduced by elevated expression of negative regulators in the pluripotent center. Alternatively, based on gene expression we cannot rule out HSPGs increasing receptor binding specifically in the ring. Finally, localized expression of FGF4 or FGF17 in the PS-like ring could lead to a concentration gradient responsible for increased receptor activation.

The phosphorylated ERK pattern is due to differential activation of FGF receptors

To distinguish between the different possibilities, we asked whether the spatial pattern of pERK signaling is controlled primarily below, at, or above the level of the receptors. To address this, we first visualized phosphorylated FGFR1 (pFGFR1) and found its spatial pattern to strongly resembles that of pERK (Fig.3a, Supp. Fig.3a). As expected, pFGFR1 was lost upon brief FGFR inhibition (Fig.3a, Supp. Fig.3a). In contrast, after MEK inhibition, high pFGFR1 remained while pERK was greatly reduced (Supp. Fig.3bc). This suggests the spatial pattern of ERK reflects a spatial pattern of receptor activity.

The spatial pattern in receptor activity could either be due to differential receptor expression or differential receptor activation. There was no significant upregulation of FGF receptors in PS-like cells according to our single cell RNA-sequencing data (Fig 2e). In fact, the most highly expressed receptor, FGFR1, was elevated in pluripotent cells. This suggests the ERK activity pattern is not due to a receptor expression pattern with two caveats. First, mRNA expression may not reflect protein expression. Second, multiple isoforms exist for FGF receptors 1-3 (but not 4) that have different affinity for FGF ligands, leaving open the possibility that PS-like cells could express a different FGFR1 isoform than pluripotent cells, despite overall FGFR1 expression being approximately uniform.

To determine if the spatial pattern in pFGFR1 could be explained by a spatial pattern of FGFR1 protein, we performed immunofluorescence staining of FGFR1. Consistent with the scRNA-seq data, and in sharp contrast to the pFGFR1 stain, FGFR1 protein expression was graded from center to edge with its highest levels in the pluripotent center, and therefore could not explain the signaling pattern (Fig.3b, Supp. Fig.3d). To determine if different isoforms of FGFR1 are differentially expressed in PS-like vs. pluripotent cells, we designed qPCR primers to detect the IIIb and IIIc isoforms of FGFR1 (Supp. Fig. 3e-g). These are the most common isoforms, also known as the epithelial and mesenchymal isoforms, and therefore appeared the mostly likely candidates for differential expression. We then analyzed expression of these isoforms in 42h micropatterns versus pluripotent cells and PS-like cells differentiated directly as in Fig.1d. However, the change in expression between samples was similar for each isoform, with all isoforms decreasing in 2D gastruloids and directly differentiated PS-like cells relative to pluripotent cells (Fig. 3c). To corroborate this and also determine absolute FGFR isoform expression in pluripotent and PS-like cells, we analyzed our single cell RNA-sequencing data and previously published bulk RNA-sequencing data for isoform expression in these two cell types. This revealed that for both pluripotent and PS-like cells FGFR IIIc is the dominant isoform with on average almost 900-fold higher expression than IIIb, while IIIa was not expressed at all (Fig. 3d). Altogether, these data suggest that spatial patterning in ERK is due to differential activation of FGFR receptors, either due to FGF concentration gradients or possibly spatially patterned HSPGs.

a) phospho-FGFR1 stain with and without 1h FGFRi treatment at 41h, radial intensity profile on the right. b) FGFR1 with and without 1h FGFRi treatment at 41h, radial intensity profile on the right. c) qPCR data for relative expression of FGF receptor 1 isoforms IIIb, IIIc and total FGFR1 in pluripotent cells, gastruloids at 42h and directly differentiated PS-like cells. d) RNA-seq analysis of absolute FGFR1 isoform expression. All scale bars 50 um.

FGF receptors are localized basolaterally

To maintain an endogenous FGF concentration gradient on the apical side of the colony which faces the medium, would require ligands to remain tethered to the surface. However, FGF has been reported to diffuse freely in the intercellular space45. Moreover, there is FGF2 present in the medium that would be expected to uniformly activate FGF receptors. The situation is similar for BMP4, which is present uniformly in the medium for 2D gastruloids but does not cause uniform signaling activity4. In the case of BMP, that is because receptors are localized basolaterally4 and tight junctions seal the basolateral side from the medium. Consequently, response to exogenous ligands in the medium is restricted to the colony edge, while endogenous ligands are confined to the basal side and prevented from diffusing away into the medium everywhere but the edge. This was found to be critical for patterning in both 2D gastruloids and the mouse embryo4,5. To explain the pERK pattern, we therefore hypothesized FGF receptors were similarly restricted to the basolateral side.

To test our hypothesis, we first starved pluripotent colonies of FGF by maintenance in FGF-free E6 medium for 12h and then treated with FGF2. Consistent with our hypothesis, phospho-FGFR and phosphor-ERK were both restricted to the colony edge (Fig. 4a). To determine if this could be explained by diffusion of exogenous FGF from the medium, we repeated the experiment with micropattern pluripotent cells and added fluorescent dextran with a similar molecular weight as FGF. We observed fluorescent dextran diffusing into the intercellular space from the edge inward over the same distance that ERK signaling was observed (Fig. 4b-d). We then acquired high resolution z-stacks of FGFR1 co-stained with the tight junction protein TJP1 (ZO-1) which revealed the majority of FGFR1 was below the tight junctions (Fig. 4e). To directly test that cells are more responsive to basal FGF than apical FGF, we then performed a transwell assay, where pluripotent cells were grown on filters to enable separate apical and basal stimulation. We found that only basal stimulation with FGF led to a significant increase in pERK (Fig. 4f). Finally, to determine if the center of the micropatterned colony is similarly able to respond to basal FGF stimulation and pERK levels there can be attributed to reduced FGF ligand concentration, we performed a scratch assay. Strong ERK response to FGF2 in the media was seen in the colony center around a scratch, but this response was blocked by FGFR inhibitor (Fig.4gh). If HSPGs rather than FGF ligand were limiting ERK response in the center, we would not have expected strong response around a scratch. We therefore concluded that the pERK pattern is most likely due to a basolateral FGF ligand gradient which has its maximum in the PS-like ring.

a) pERK and pFGFR1 response of pluripotent colonies in standard culture grown in FGF-free (E6) medium for 24 hrs and then treated with FGF2 for 30 minutes. b) Fluorescent dextran in micropatterned pluripotent colony with quantification. c) pERK response in micropatterned pluripotent colony after 42h in E6 with TGFβ and FGF2. d) Radial intensity profile of dextran and pERK for conditions in b,c), averaged over 2 colonies. e) Cross-section of 2D gastruloid shows FGF receptor 1 predominantly below the tight junction marked by ZO-1. White box marks are magnified in inset e’. f) pERK and pFGFR1 response in transwell experiment with cell treated with FGF either apically or basally. g) ERK signaling in scratched colonies with or without FGFRi. h) Quantification of pERK intensity as a function of distance from the scratch. Error bars represent standard deviation over colonies. Scale bars 50 micron.

Illustrations in panel f created with BioRender.com/v73i147.

Endogenous FGF4 and FGF17 gradients underly the ERK activity pattern

To identify which FGF ligands are required for PS-like differentiation, we first determined the requirement for exogenous FGF2 in the media. Differentiation of 2D gastruloids in FGF-free medium was severely reduced but restored by addition of FGF2 (Fig. 5a, Supp. Fig. 5a). However, if FGF2 was removed 24h after BMP treatment, differentiation was normal. The fact that removal 2h after BMP treatment significantly reduced differentiation suggests this was not a failure to wash out the FGF2. Furthermore, MEK inhibition at 24h severely reduces TBXT expression34 while FGF2 is known to have a half-life on the order of hours46 and no media changes are performed during standard differentiation. Altogether this suggests that exogenous FGF2 is required at the time of BMP treatment but that endogenous FGFs are sufficient for the ERK activity ring and PS-like differentiation that start a day later.

a) Pattern at 42h with exogenous FGF2 in the culture medium for different amounts of time. b) Pseudotime analysis of FGFs and TBXT, TBX6 expression. c,f) FGF4 (c), and FGF17 (f) FISH co-stained for TBXT and pERK at 32h and 40h after BMP4 treatment. d,g) Radial intensity profiles for conditions in (c,f) respectively. e,h) Scatterplots of FGF4 (e), and FGF17 (h) mRNA versus pERK colored for TBXT. i) FGF4 FISH co-stained for TBXT and pERK in ctrl (scramble) shRNA versus FGF4 shRNA cells and corresponding radial intensity profiles.j) pERK and TBXT expression in WT versus FGF17+/- cells. k) SOX17 and TFAP2C expression in WT versus FGF17+/- cells.

We reasoned that FGFs whose expression correlates with the presence of PS-like cells are more likely to be responsible for the observed ERK ring and PS-like differentiation. Our analysis in figure 2 had implicated FGF4 and FGF17 as most strongly expressed in the PS-like cells and nascent mesoderm, respectively (Fig. 2c). However, it is unclear whether expression of these ligands is a consequence of PS-like differentiation, its cause, or both, in the case of a feedback loop. To get more information about the relationship between FGF ligands and PS vs. mesoderm differentiation, we performed pseudotime analysis of our scRNA-seq data (Fig. 5b, Supp. Fig 5b). Along a pseudotime trajectory from pluripotent cells to nascent mesoderm, FGF2 expression was nearly constant. FGF8 was upregulated prior to TBXT and declined as TBXT was upregulated although its levels always remained low. FGF4 upregulation approximately coincided with TBXT, but FGF4 began decreasing as TBX6 expression increased. In contrast, FGF17 expression started after FGF4 but continued to increase after FGF4 started decreasing. Direct comparison of FGF and TBX expression also showed strong positive correlation between TBXT and both FGF4 and FGF17 but opposing relations with TBX6 (Supp. Fig. 5c). Together this suggested to us that FGF4 and FGF17 may have overlapping functions, but that initial TBXT expression may depend more on FGF4 while FGF17 could play a later role in maintaining or expanding the PS-like population or facilitating subsequent mesoderm differentiation.

Pseudotime may not accurately predict true dynamics. Therefore, we verified expression at different two different times using RNA FISH (fluorescent in situ hybridization). We found that FGF4 expression was similar at 32h and 40h (Fig.5c-e, Supp. Fig. 5d). This is consistent with its decrease in pseudotime as cell differentiate to nascent mesoderm, since in real time differentiation is asynchronous and new PS-like cells form while older ones differentiate to nascent mesoderm, which would yield a flatter temporal profile. In contrast, FGF17 strongly increased from 32h to 40h, again consistent with the pseudotime analysis (Fig.5f-h, Supp. Fig. 5e). As expected, the spatial profiles of both FGF4 and FGF17 resembled TBXT, although the peaks were displaced slightly inward (Fig.5d,g). Scatterplots further showed that expression of both FGF4 and FGF17 was restricted to TBXT positive cells with high pERK levels (Fig. 5e,h). Conversely, however, high pERK was not restricted to FGF expressing cells since the pERK ring extended significantly further inward than FGF expression. One possible explanation for this is inward diffusion of FGF ligands.

To functionally test the requirement for these FGFs we decreased their expression. We created a cell line stably expressing short hairpin RNA to knock down FGF4 which led to a decrease in TBXT and pERK (Fig. 5i). We also decreased expression by transfection with small interfering RNAs (siRNA) with a similar outcome for FGF4 (Supp. Fig. 5f). Finally, in the absence of a homozygous knockout, we characterized a heterozygous knockout for FGF17, which also led to decrease in TBXT (Fig. 5j, Supp. Fig. 5g-h, Supp. Fig. 6). The strength of this effect varied from experiment to experiment, partially related to cell density variation (Supp. Fig. 5hj). We further interrogated endoderm and primordial germ cell markers since both cell types require ERK signaling to differentiate and transiently express TBXT (Fig. 5k, Supp. Fig. 5ij). Consistent with our previous work, we observed very little expression of FOXA2 at 42h. However, SOX17 positive cells both positive for TFAP2C (primordial germ cell-like cells) and negative for TFAP2C (prospective endoderm) were reduced. Combined, these data support partially redundant functions for FGF4 and FGF17 in PS-like differentiation with FGF17 more important continued differentiation to mesoderm and endoderm.

Discussion

We have shown that in a stem cell model for early human gastrulation, FGF-dependent ERK signaling is directly required for differentiation to PS-like cells and their derivatives, and that loss of differentiation upon FGF inhibition can be rescued by directly activating ERK. We also showed that FGF receptors are polarized basolaterally, preventing the colony center from responding to exogenous FGF in the medium. Furthermore, FGF receptor activation is increased in the PS-like ring, matching the pattern of active ERK despite overall receptor expression being lower in the PS-like ring than in the pluripotent center. This suggests elevated ERK activity is due increased binding of FGFs to the receptors in this region, possibly due to increased FGF concentration. We found that both FGF4 and FGF17 have strongly increased expression in the PS-like cells, and decreased expression of either leads to a reduction in PS-like cells or their derivatives, suggesting they might be responsible for increased ERK signaling in the PS-like region.

Many questions remain. Although we showed that FGF gradients are likely the dominant cause for the spatial pattern of ERK signaling, our scRNA-seq data suggest other levels of regulation may contribute, which remains to be explored further. Furthermore, although we showed FGF/ERK signaling is directly required in PS-like differentiation, it may also act indirectly by affecting the expression of Wnt and Nodal. We will explore this and the more general question of how FGF integrates with the hierarchy of BMP, Wnt, and Nodal in a separate manuscript. Related is the indirect effect FGF may have on patterning by its control of cell growth and thereby cell density. Cell density affects the response to exogenous BMP and downstream expression of Wnt, Nodal, and various inhibitors4,47,48. It likely has other effects as well. However, the rescue of TBXT expression but not growth by ERK activation when FGF receptors are blocked suggests the FGF-dependent cell growth is not required to generate a primitive streak-like ring.

Another important question is to what extent different FGFs act redundantly or perform different functions. Our data suggest a picture where different FGFs are required consecutively, starting with exogenous FGF2, then endogenous FGF4, possibly preceded by FGF8, to support initial differentiation of PS-like cells, and finally FGF17 to maintain and expand the PS-like ring and support mesoderm and endoderm differentiation. Whether these FGFs could substitute for each other if they were expressed at similar times and levels is unclear. It also remains to be determined whether the nearly constant high expression of FGF2 we observed is required for patterning. The fact that PS-like differentiation was largely rescued by uniformly activating ERK to a supraphysiological level suggests that the role for the different FGFs is simply to sufficiently activate ERK signaling. However, future work will have to investigate the dox-SOScat rescue phenotype in much greater detail to see how differentiation and other cell behaviors such as migration are affected.

It is interesting to ask if there is a simple explanation for the seemingly large differences in FGF expression between primates and other species. FGF8 is the only FGF shown to be required for mouse gastrulation and is upstream of FGF4 in both chick and mouse, but it is barely expressed in human and monkey. If FGFs are interchangeable in simply activating ERK for primitive streak induction, variation in FGF expression is not surprising. However, FGF4 and FGF8 are the main FGFs in mouse, chick and frog gastrulation4951, suggesting a high degree of conservation. It is possible FGF8 has simply been substituted by FGF17, which is in the same subfamily9, but this does not explain why the role of FGF8 in gastrulation otherwise appears strongly conserved, or why FGF8 is upstream of FGF4 in the mouse while FGF4 is expressed before FGF17 in 2D gastruloids. Similarly, it is possible that FGF2 in primates substitutes for FGF5 in the mouse as the FGF that is highly expressed in the epiblast. Although these are not in the same subfamily and generally have different receptor affinities, they both have highest affinity for FGFR19. That leaves FGF4 as the only FGF that may be conserved across vertebrate gastrulation, and it is tempting to speculate on why. In Xenopus, FGF4 and TBXT (Xbra) function in a positive feedback loop. The strongly correlated initial transcriptional dynamics of TBXT and FGF4 in 2D gastruloids are consistent with a similar feedback loop. However, recent work suggests TBXT is not required for FGF4 or FGF17 expression52. One possibility is that FGF expression is controlled by a different primitive streak transcription factor such as EOMES/TBR2. Future work will have to investigate this further.

A final related question is the origin of the discrepancies we and others36 found between the primate (i.e., non-mouse) samples in Fig. 2 and Supp. Fig. 2: FGF3 appear only significantly expressed in the human embryo, but not in the monkey embryo and human stem cell model. This could also reflect genetic variation: if FGFs function redundantly, their relative expression levels could be relatively unconstrained between individuals. On the other hand, in the nascent mesoderm, FGF2 is only high in human gastruloids. This may reflect developmental stage: the in vivo data is developmentally more advanced and nascent mesoderm may lose FGF2 expression later. It is also possible the maintenance conditions of hPSCs lead to in vitro artefacts in FGF expression.

In conclusion, the complexity of FGF signaling, with its many ligands, modulators, and functions, has prevented a clear understanding of its role in mammalian gastrulation in general and in human gastrulation specifically. Here we showed that 2D human gastruloids provide a powerful approach to make headway and although many open questions remain, we have taken significant steps towards this goal.

Acknowledgements

We thank Lila Solnica-Krezel, Blerta Stringa, Aryeh Warmflash, and Ben Allen for discussions. We thank Jared Toettcher for the dox-SOScat plasmid. Library prep and next-generation sequencing was carried out in the Advanced Genomics Core at the University of Michigan. This work was supported by the National Institute of General Medical Sciences (NIGMS R35GM138346), and the Branco Weiss Fellowship – Society in Science. KJ was partially supported by the Michigan Pioneer Postdoctoral Fellowship and the NIH F32 Ruth L. Kirschstein Postdoctoral National Research Service Award (5F32HD108980-02).

Data availability

Raw single cell RNA sequencing data generated in this study was deposited in GEO (GSE271604), analyzed data will be added upon publication. Raw image data are available upon request.

Code availability

All code for data analysis and model simulations is available on https://github.com/idse/FGF

Competing Interests Statement

The authors declare no competing interests.

Methods

Cell lines

The cell lines used were the embryonic stem cell line ESI017 (XX), and the induced pluripotent stem cell line PGP1 (XY). The pluripotency of these cells was confirmed by immunostaining of pluripotency markers OCT3/4, SOX2, NANOG. All cells were routinely tested for mycoplasma contamination, and negative results were recorded. ESI017 cells stably expressing doxycycline inducible SOScat were made using a published Piggybac plasmid gifted by Jared Toettcher37.

Cell culture and differentiation

Human pluripotent stem cells were cultured in the commercially defined pluripotency maintaining media mTeSR1 (STEMCELL Technologies #85850) on Cultrex (R&D Systems)-coated tissue culture plates. For FGF2 starvation we used Essential 6 Medium (Thermofisher, A1516401), or mTeSR1 without bFGF, TGF-beta, LiCl, GABA and pipecolic acid (STEMCELL Technologies #05896). For routine cell maintenance, cell passaging was performed every 3 days. For whole-colony passaging, L753 was used, while single-cell passaging was performed with Accutase or TrypLE (Gibco). Single-cell suspensions were used to seed for all the differentiation experiments to control initial cell number.

To directly differentiate to primitive streak-like cells, cells were resuspended in a single-cell suspension and seeded in cultrex coated wells in mTeSR with ROCK inhibitor (RI) Y-27632 (MeChemExpress, cat# HY-10583). Cells were then maintained in mTeSR with RI for 24h before adding CHIR (3-5uM) and Activin (100ng/ml) treatment for 24-30h. Experiments were performed in 18-well Ibidi slides (cat# 81818).

To differentiate cells for 2D gastruloids by micropatterning, we followed the protocol in41. Briefly, cells were resuspended in a single-cell suspension and seeded in laminin-coated micropatterned wells in mTeSR with ROCK inhibitor (RI) (MeChemExpress, cat# HY-10583). Micropatterned wells were washed with PBS-/- 30 minutes after the initial seeding to wash off the non-attached cells. Cells were then maintained for 24 hours in mTeSR before adding the BMP4 treatment by a full medium change. For time series micropatterned differentiation, full media changes with designated treatment(s) were performed every 24 hours. Experiments were performed in micropatterned 18-well Ibidi slides (cat# 81818) prepared as previously described54. Cell signaling reagents and doses used are listed in Supplementary Table 1.

Immunofluorescence staining

Antibodies can be found in Supplementary Tables 2 and 3. For stains including pERK, we used methanol fixation, for all other stains we used PFA fixation. PFA fixation: Samples from the 18-well Ibidi slides were rinsed with PBS, fixed for 20 min in 4% paraformaldehyde, rinsed twice with PBS, and blocked for 30 min at room temperature with 3% donkey serum and 0.1% Triton X-100 in 1× PBS. After blocking, cells were incubated with primary antibodies at 4°C overnight, followed by three washes in PBST (PBS with 0.1% Tween 20). They were then incubated with secondary antibodies and DAPI for 30 min at room temperature and washed twice in PBST at room temperature. Methanol Fixation: Samples from the 18-well Ibidi slides were rinsed with PBS, fixed for 20 mins in cold methanol, then rinsed with PBS for 5 min two times, after which 10% phosphate buffered formalin was added for 20 min at room temperature. Cells were then rinsed with TBS twice followed by 5 mins of incubation in cold methanol and rehydrate in TBS for 20 mins. Blocking was done for 30 mins at room temperature with 5% donkey serum, 1% BSA, 0.2% Triton-X-100 in 1x TBS after which staining proceeded the same way as with PFA fixation.

Microscopy

Fixed sample imaging was performed with an Andor Dragonfly/Leica DMI8 spinning disk confocal microscope with a 40x, 1.1NA water and 20x 0.8NA air objectives using Andor Fusion software version 2.3.0.31 and a Nikon/Yokogawa spinning dish confocal microscope with a 40x silicon oil objective using NIS Elements AR software version 5.41.02. Live-cell imaging was performed with an Andor Dragonfly/Leica DMI8 spinning dish confocal microscope under the controlled temperature (37°C), CO2 concentration (5%), and humidity (>60%).

General image analysis

We segmented nuclei in individual z-slices based on nuclear fluorescence stained with DAPI in fixed cells using a pipeline we previously described, which integrates two machine learning approaches: Ilastik pixel classification55 and Cellpose56 (v1). Fluorescence intensities were then calculated per nucleus as mean intensities in the 3D nuclear again with our established custom image-processing pipeline. To calculate the radial fluorescence distributions in micropatterned colonies, colonies were subdivided into radial bins with equal numbers of cells, mean or median and variance were then calculated within these bins. To calculate positive fractions, markers were first thresholded based on their intensity distribution, after which the fraction of cells positive for each marker was calculated within different bins. Standard deviations were then calculated over multiple replicate colonies. For visualization purposes only, background subtraction using grayscale opening on a 100 micron scale and 5 pixel wide median filter was applied slice by slice to z-stacks of micropatterned colonies before making maximal intensity projections.

(Single-cell) RNA sequencing and analysis

Cells were collected using accutase and resuspended in ice-cold PBS. Single-cell RNA-sequencing was performed by the University of Michigan Advanced Genomics Core. Cells were barcoded using the 10X Genomics Chromium system (part numbers 1000268, 1000120, 1000215). For quality control, cDNA was quantified by Qubit High Sensitivity DNA assay and Agilent TapeStation. Sequencing was performed on the Illumina NovaSeq 6000 with NovaSeq S4 flowcell and Control Software version 1.7.0. Reads were aligned using cellranger-4.0.0 with the GRCh38 reference. Further processing was mostly done in Python, primarily using the Scanpy57 and SCVI58 packages. We integrated the two scRNA-seq datasets for 42h 2D human gastruloids using SCVI based on the top 2000 highly variable genes, which were selected using the default function in Scanpy using the seurat_v3 flavor. Cell cycle genes were regressed out during integration. Dimensional reduction for visualization was performed using PHATE39 on the SCVI latent space. Gene expression was smoothened with MAGIC59 for visualization on PHATE plots in Fig. 3c, Supp. Fig.3b. To annotate different cell types, we used MAGIC to impute that data and get smooth distributions of gene expression. We then thresholded marker genes to assign cell fate (Table 4), which produced better results than the more common unsupervised clustering using Leiden, as well as more consistent results between different datasets.

For pseudotime analysis (Fig. 5), we applied the trajectory inference tool Slingshot60 in R to compute the pseudotime trajectory in the SCVI latent representation. Slingshot determines the membership of a cell to a lineage at each branching point by its projection distance to the fitted principal curve of that lineage. We projected the mesoderm developmental trajectory onto the PHATE map for visualization. We further used TradeSeq61, which fits a generalized additive model (GAM), to infer the pseudotime gene expression trends along the mesoderm lineage trajectory from Slingshot. For visualizing the correlation between the expression of FGF ligands and marker genes (Supp. Fig. 5), we used our SCVI model to denoise the expression and mutual nearest neighbor to further harmonize the difference between samples.

To quantify isoforms from RNA sequencing salmon v1.10.0 was used62. First, gencode.v45.transcripts.fa was downloaded from gencodegenes.org. The reference was customized by removing lines for FGFR1 and replacing them with the transcript sequence for FGFRIIIa, FGFRIIIb, FGFRIIIc (sequences included as supplementary file). Three samples from GEO dataset GSM2257301 were downloaded through SRA (SRX1725562 (Pluripotent), SRX1725577 (APS), SRX1725643 (MPS))38. Adapters and the first 10 bases were trimmed from the FASTQ files using trim_galore. Finally, the Salmon “quant” command with the --validateMappings option was used to quantify transcripts from trimmed FASTQ files. For single cell datasets, https://timoast.github.io/sinto/ was used to subset the 10x aligned file by cluster. Reads for clusters of interest were converted back to raw FASTQ files and run through the same alignment-based mode of salmon but as single end reads.

Quantitative real-time PCR

RNA were extracted using RNAqueous™-Micro Total RNA Isolation Kit (CAT# AM1931, Thermo Fisher Scientific) and then cDNA synthesis was prepared from it using SuperScript™ VILO™ cDNA Synthesis Kit (CAT# 11754250, Thermo Fisher Scientific) following the manufacturer‘s protocol with adjustment for optimization. Measurements were performed with SYBR green and the primers in the Table 5. GAPDH was used for normalization in all experiments.

Fluorescent dextran assays

Fluorescent dextran essays were performed by adding 10 ug/ml fluorescein isothiocyanate-dextran (10KD, Sigma Aldrich) to the media. To calculate fluorescence intensity as a function of distance from the colony edge, Otsu thresholding followed by several morphological operations was used to create a mask for the colony in each z-slice, thereby excluding bright dextran fluorescence outside the colony. Background subtraction on each z-slice of the image was performed to further remove background signal from dextran outside the colony while maintaining signal in the intercellular space. The radial intensity profile was then calculated from the maximal intensity projection of the masked, background subtracted image.

Scratch assays

Human embryonic or induced pluripotent stem cells were differentiated on micropattern wells for 41h, then each micropattern colony was manually scratched with a serological needle. After 60mins, they were fixed with PFA or methanol followed by immunostaining with appropriate antibodies. To calculate the intensity profile as a function of distance from a scratch, we created a mask for the scratch based on the difference between an overall mask for the colony and the convex hull, calculated the distance transform of the scratch mask, then binned pixels by distance from the scratch and calculated their mean ERK intensity.

Transmembrane assays

Human embryonic or induced pluripotent stem cells were seeded into Corning 6.5 mm transwell with 0.4 µm pore polyester membrane insert (cat# 3470) with E6 supplemented with TGF1b, FGF2, and Y-27632 for 16h, then media was changed to E6 with TGF1b. After 24h, the media was changed to E6 supplemented with FGF2 from the top or the bottom side of the transwell. 30 mins later, cells were fixed followed by immunofluorescence staining. The transwell cartoon in Fig 4 was generated with Biorender.

Fluorescence In Situ Hybridization

The fluorescence in situ hybridization protocol (FISH) was performed according to the manufacturer’s instructions (ACDbio, RNAscope multiplex fluorescent manual). A list of probes and reagents can be found in Supplementary Table 6.

Gene targeting by shRNA

We used the construct hFGF4[shRNA#1] from VectorBuilder in a PiggyBac Transposon Vectors with puromycin resistance which was stably integrated into ESI17 hESCs. Transfections were performed using Lipofectamine™ Stem Transfection Reagent (CAT# STEM00001, Thermo Fisher Scientific), following the manufacturer‘s protocol.

Gene targeting by siRNA

We purchased predesigned siRNA from Millipore Sigma; NM_002007 (FGF4), NM_008004 (FGF17). The siRNA transfections were performed using Lipofectamine™ RNAiMAX Transfection Reagent (CAT# 13778075, Thermo Fisher Scientific), following the manufacturer‘s protocol with adjustment for optimization.

Statistics and reproducibility

All experiments were performed twice or more. All the attempts in replicating experiments yielded consistent results. Unless specifically noted, all the quantifications were performed with N=4 colonies within the same experimental condition.

Ethics statement

We comply with all the ethical regulations in our research. We were granted approval by the Human Pluripotent Stem Cell Research Oversight (HPSCRO) Committee at the University of Michigan to work with human embryonic stem cells and human induced pluripotent stem cells.

Cell signaling reagents.

Primary antibodies used for immunofluorescence.

Secondary antibodies.

Marker genes for cell fate annotation

qPCR primers

Probes for Fluorescent In Situ Hybridization

Supplementary Figures

a) DAPI images and ISL1 stains corresponding to Fig. 1a. b) Scatter plots of TBXT and ISL1 expression. c) Staining for three primitive streak markers with or with FGFRi or MEKi. d) DAPI stain corresponding to Fig. 1d. e) Radial intensity profile for TBXT corresponding to Fig.1d. f) Quantification of conditions in Fig.1g for five images each, N indicates total number of cells.

a) PHATE projection of scRNA-seq data colored for sample. b,c) Expression of canonical FGF ligands across different cell types in 2D gastruloids (b) and human CS7 embryo (c). Expression across clusters is normalized by row, while total expression is shown in a separate column to the directly to left. The leftmost column indicates differential expression between PS-like and pluripotent cells. The color scale of (differential) expression is cut off for better contrast. d) Threshold-based annotation (left) for human, monkey, and mouse embryos used in interspecies comparison (Fig. 2f), compared to original annotation (right). e) Interspecies comparison as in Fig. 2f but for nascent mesoderm and PS-LC + nascent mesoderm. f) Bulk RNA-seq data by Loh et al38 for expression of different FGFs in directed differentiation to mid primitive streak (MPS) or anterior primitive streak (APS) relative the human embryonic stem cells (hESCs). g) Expression of genes involved in FGF signaling.

a) DAPI image corresponding to Fig. 3a. b-c) pFGFR1 (b) or pERK (c) and TBXT stains with and without continuous MEKi treatment. d) DAPI image corresponding to Fig. 3b. e) Design of qPCR primers to detect FGFR1 isoforms, diagram adapted from Eswarakumar et al63. f) Primer sequence and properties. g) Amplicon sizes match prediction. Scale bars 50um.

a) DAPI image corresponding to Fig. 5a. b) Pseudotime dynamics from Fig.5b with each gene normalized individually. c) Correlations between FGFs and TBXT, TBX6 colored for clusters, with cluster colors matching Fig. 2d. d,e) DAPI images and overlays including DAPI corresponding to Fig.5c,f respectively. f) Representative stainings and quantification for FGF4 siRNA versus controls. g) DAPI stain, individual channels and overlay with DAPI for images corresponding to Fig. 5k. h) Scatterplot of fraction of TBXT-positive cells in repeated experiments with FGF17+/- cells colored for genotype (left) or experiment (right). i) Individual channels including DAPI and FOXA2 for images corresponding to Fig.5l. j) Radial profiles of cells positive for different markers in FGF17+/- versus WT.

a) Genotyping for FGF17+/- cells.