Cell type-specific network analysis in Diversity Outbred mice identifies genes potentially responsible for human bone mineral density GWAS associations

eLife Assessment

This is an important study that provides compelling data from a diverse set of approaches from single cell transcriptome data and network analysis from genetically diverse mouse cells to identify novel driver genes underlying human GWAS associations. The authors present evidence that network analysis of scRNA-seq data from genetically diverse mouse bone-marrow derived stromal cells can be informative for identifying human BMD GWAS driver genes. Their approach should be broadly used and applicable to other GWAS studies.

https://doi.org/10.7554/eLife.100832.3.sa0

Significance of the findings:

Important: Findings that have theoretical or practical implications beyond a single subfield

Landmark
Fundamental
Important
Valuable
Useful

Strength of evidence:

Compelling: Evidence that features methods, data and analyses more rigorous than the current state-of-the-art

Exceptional
Compelling
Convincing
Solid
Incomplete
Inadequate

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments

Abstract
Introduction
Results
Discussion
Methods
Data availability
References
Article and author information
Metrics

Abstract

Genome-wide association studies (GWASs) have identified many sources of genetic variation associated with bone mineral density (BMD), a clinical predictor of fracture risk and osteoporosis. Aside from the identification of causal genes, other difficult challenges to informing GWAS include characterizing the roles of predicted causal genes in disease and providing additional functional context, such as the cell-type predictions or biological pathways in which causal genes operate. Leveraging single-cell transcriptomics (scRNA-seq) can assist in informing BMD GWAS by linking disease-associated variants to genes and providing a cell-type context for which these causal genes drive disease. Here, we use large-scale scRNA-seq data from bone marrow-derived stromal cells cultured under osteogenic conditions (BMSC-OBs) from Diversity Outbred (DO) mice to generate cell type-specific networks and contextualize BMD GWAS-implicated genes. Using trajectories inferred from the scRNA-seq data that map cell state transitions, we identify networks enriched with genes that exhibit the most dynamic changes in expression across trajectories. We discover 21 network driver genes, which are likely to be causal for human BMD GWAS associations that colocalize with expression/splicing quantitative trait loci (eQTLs/sQTLs). These driver genes, including Fgfrl1 and Tpx2, along with their associated networks, are predicted to be novel regulators of BMD via their roles in the differentiation of mesenchymal lineage cells. In this work, we showcase the use of single-cell transcriptomics from mouse bone-relevant cells to inform human BMD GWAS and prioritize genetic targets with potential causal roles in the development of osteoporosis.

Introduction

Osteoporosis is a complex disease characterized by low bone mineral density (BMD), bone fragility, and an increased risk of fracture (Lin and Lane, 2004). BMD, a highly heritable trait, is one of the most important clinical predictors of osteoporotic fracture (Peacock et al., 2002; Johnell et al., 2005). Increasing our understanding of the genetic basis of BMD is critical for the development of approaches for the treatment and prevention of osteoporosis. In recent years, genome-wide association studies (GWAS) have made great progress in unraveling BMD genetics by identifying over 1100 independent associations (Morris et al., 2019). Now the challenge lies in pinpointing causal genes, which is necessary for the translation of genetic findings into novel therapies.

A number of approaches exist to identify genes responsible for GWAS associations (Cookson et al., 2009; Wen et al., 2017; Al-Barghouthi et al., 2022; Li and Ritchie, 2021). Most rely on population-based ‘-omics’ data (Akiyama, 2021), which are scarce for human bone, to connect associations to causal genes. Our group has used co-expression networks generated from mouse bone transcriptomic datasets to assist in the identification of genes likely responsible for BMD associations. One significant advantage of this approach is its ability to utilize the network connections of candidate genes to predict how these candidate genes may affect BMD. For example, we generated co-expression networks from bone tissue and primary osteoblasts in mouse genetic reference populations and identified multiple co-expression modules enriched with genes located in BMD associations (Calabrese et al., 2017; Sabik et al., 2020). We then cross-referenced genes in these modules with those regulated by colocalizing expression quantitative trait loci (eQTLs) from the Gene-Tissue Expression project (GTEx) (GTEx Consortium, 2013; Aguet et al., 2020) to identify ‘high-priority’ genes. Recently, we expanded our analyses to include directed networks generated via a Bayesian approach using cortical bone RNA-seq data from 192 Diversity Outbred (DO) mice. By combining key driver analysis and GTEx eQTL colocalization data, we identified 19 novel genes, such as SERTAD4 and GLT8D2, which are likely causal for human BMD GWAS associations (Al-Barghouthi et al., 2021).

To date, our analyses have been reliant on networks generated from heterogeneous bulk transcriptomics (RNA-seq) datasets from mouse bone and primary bone cells. However, leveraging single-cell transcriptomics (scRNA-seq) data would offer the added benefit of resolving the transcriptomic profiles of discrete cell types. Additionally, using scRNA-seq data has the potential to provide context by predicting the specific cell types in which causal genes and their associated networks operate. In recent work, we demonstrated the utility of bone marrow-derived stromal cells cultured under osteogenic conditions (BMSC-OB) for the generation of population-scale scRNA-seq data on bone-relevant cell types (Dillard et al., 2023). The BMSC-OB model effectively enriches for mesenchymal lineage cells (e.g. mesenchymal progenitors, osteoblasts, osteocyte-like cells [Ocy]) that are highly relevant to the regulation of BMD.

In this work, our goal was to prioritize and contextualize genes implicated by BMD GWAS using an expanded large-scale (N=80) BMSC-OB scRNA-seq dataset on bone cell types. We accomplished this by first generating co-expression and Bayesian networks (Al-Barghouthi et al., 2021) for each BMSC-OB mesenchymal cell type. We subsequently prioritized networks based on their enrichment for genes exhibiting the most dynamic changes in expression across trajectories inferred from the scRNA-seq data, thus highlighting networks likely associated with the differentiation of BMSC-OBs. We then used these networks linked to osteogenic differentiation to prioritize genes with eQTLs and/or splicing quantitative trait loci (sQTLs) which colocalize with BMD GWAS associations (Al-Barghouthi et al., 2022; Abood et al., 2023). In doing so, this analysis provides additional support for a role of these genes in the regulation of BMD and highlights their potential roles in differentiation of cell types essential to skeletal health.

Results

BMSC-OBs derived from DO mice yield diverse cell types that are enriched for mesenchymal lineage cells

We cultured BMSCs from a total of 80 DO mice, a genetically diverse outbred mouse population (Bogue et al., 2015; Churchill et al., 2012) (N=75 from the current study and N=5 from Dillard et al., 2023; N = 49 male and N = 31 females). We cultured BMSCs under osteogenic conditions and subsequently performed scRNA-seq, as described in Dillard et al., 2023. After stringent processing and quality control (Materials and methods), the dataset consisted of 21,831 expressed genes across 139,392 cells. We manually annotated 15 clusters ranging in size from 270 to 27,291 cells and identified cell types of the mesenchymal lineage, as well as various other cell types (Figure 1A, Supplementary file 1a, Figure 1—figure supplement 1).

Figure 1 with 1 supplement see all

Download asset Open asset

Analysis of single-cell RNA-seq (scRNA-seq) data for bone marrow-derived stromal cells cultured under osteogenic conditions (BMSC-OBs) derived from 80 Diversity Outbred (DO).

(A) Uniform Manifold Approximation and Projection (UMAP) of 139,392 single cells (BMSC-OBs). Cell numbers and corresponding percentages for the fifteen (15) annotated cell clusters are listed in parenthesis to the right of the annotated cluster name. (B) Dot plot (Marsh et al., 2023) portraying representative and highly expressed genes for all annotated cell clusters. Dot color indicates the scaled gene expression while the size of the dot corresponds to the percentage of cells of a given cluster that express a given gene. (C) The raw proportional abundances of seven (7) mesenchymal cell clusters and one (1) cluster (Hem) representing the remaining cells (i.e. mainly hematopoietic immune cells) across all 80 DO mice. (D) UMAP plots for mesenchymal lineage cell clusters for DO mouse 50 and DO mouse 233. (E) CELLECT (CELL-type Expression-specific integration for Complex Traits) cell-type prioritization results displaying the Bonferroni adjusted p-values for the cell clusters. The OB1, Ocy, and marrow adipogenic lineage progenitor (MALP) cell clusters (red) were significantly enriched (p_adj<0.05, red dashed line) for BMD heritability (p_adj = 0.018, 0.010, 0.006, respectively).

Based on our prior BMSC-OB scRNA-seq study (Dillard et al., 2023), we expected to identify a large proportion of mesenchymal cells and a smaller fraction of non-mesenchymal cell types. Consistent with this hypothesis, clusters associated with mesenchymal lineages accounted for 74.1% of all cells (Figure 1A). These included mesenchymal progenitor cells (MPCs), late mesenchymal progenitors (LMPs), osteoblast progenitors (OBPs), two mature osteoblast populations (OB1 and OB2), Ocy, and marrow adipogenic lineage progenitors (MALPs). The non-mesenchymal cell types observed included macrophages, monocytes, granulocytes, T-cells, B-cells, endothelial cells, and osteoclast-like cells (Figure 1A). With regard to the mesenchymal cell types, the only differences in cell clusters relative to our previous report (Dillard et al., 2023) were the presence of MPCs and two mature osteoblast clusters. Upon comparing the two distinct osteoblast clusters, OB1 and OB2 (Figure 1A), both clusters had ubiquitous expression of genes associated with mature osteoblasts (e.g. Col1a1, Bglap, Sparc, and Ibsp) (Supplementary file 1a) while many of the ‘canonical’ osteoblast markers were more highly expressed in OB1 compared to OB2 (Supplementary file 1b). Interestingly, MPCs did not have transcriptomic profiles similar to other MPCs previously identified by our group or others (Dillard et al., 2023; Zhong et al., 2020). All other mesenchymal cell types demonstrated specific expression of canonical marker genes (Figure 1A and B).

We next assessed the variability in cell-type frequencies across DO mice by quantifying the proportions of each annotated mesenchymal cell type. All other clusters, which mainly consisted of immune cells of hematopoietic origin, were aggregated into one group (Hem) for each mouse. We observed high variability in the raw proportional abundances of cell types derived from each mouse (Figure 1C, Supplementary file 1c). For example, the proportions of osteoblasts (OB1 and OB2) varied significantly among individual DO mice (Figure 1D). All mice used in the current experiment had been extensively phenotyped for a wide range of bone traits (microCT, histomorphometry, biomechanical bone properties, etc.) as part of a previous genetic analysis of bone strength (Al-Barghouthi et al., 2021). We correlated cell-type frequencies with bone traits; however, none of the cell-type proportions were strongly correlated with any bone trait (Supplementary file 1d and e).

Mesenchymal lineage cells are enriched in BMD heritability

The primary goal of this work was to prioritize and contextualize genes implicated by BMD GWAS. As a first step toward this goal, we sought to determine which cell types were the most relevant to the genetics of BMD. Using the BMD GWAS and the BMSC-OB scRNA-seq data, we performed a CELLECT (Timshel et al., 2020) cell-type prioritization analysis to identify cell clusters enriched for BMD heritability. We observed that OB1, Ocy, and MALP cell clusters were significantly enriched (p_adj<0.05, red dashed line) for BMD heritability (p_adj=0.018, 0.010, 0.006, respectively) (Figure 1E, Supplementary file 1f). None of the non-mesenchymal cells identified were significant (p_adj>0.05) (Figure 1E). As a result, all downstream efforts focused solely on using data on mesenchymal cell types to inform GWAS.

Generating mesenchymal cell type-specific Bayesian networks to inform BMD GWAS

We have previously shown that network-based approaches using bulk RNA-seq are powerful tools for the identification of putative causal genes from BMD GWAS data (Calabrese et al., 2017; Sabik et al., 2020; Al-Barghouthi et al., 2021). Here, our goal was to apply these same approaches using the BMSC-OB scRNA-seq data to prioritize and contextualize genes we previously identified as having a colocalizing eQTL (N=512) or sQTL (N=732) in a tissue from the GTEx project (Al-Barghouthi et al., 2022; Aguet et al., 2020; Abood et al., 2023). Genes identified in each study (or both) yielded a list of high-priority target genes (N=1037). While GTEx does not currently contain data for bone tissue, eQTL can be shared across many tissues and may exert their effects in cell types resident to bone (GTEx Consortium, 2017). Therefore, utilizing our previous work, we hypothesized that generating cell type-specific networks would yield more biologically relevant representations of processes occurring within particular mesenchymal cell types. Additionally, by integrating GWAS, cell type-specific networks, and dynamic gene expression as a function of differentiation, we anticipated we would identify points of intervention in which genetic variation impacts genes involved in the differentiation process.

Our network analysis begins by partitioning genes into groups based on co-expression by applying iterative weighted gene co-expression network analysis (iterativeWGCNA) (Greenfest-Allen et al., 2017) to each mesenchymal cell type (Step 1, Figure 2). In total, 535 modules were identified from the BMSC-OB scRNA-seq data, and the number of modules identified for each mesenchymal cell cluster ranged from 26 to 153 with an average of 76 modules per cluster (Supplementary file 1g and h). We sought to infer causal relationships between genes in each cell type-specific co-expression module and subsequently identify networks involved in processes relevant to BMSC-OB differentiation. To this end, we generated Bayesian networks for each co-expression module, thus enabling us to model directed interactions between co-expressed genes based on conditional independence (Al-Barghouthi et al., 2021) (Step 2, Figure 2).

Figure 2 with 1 supplement see all

Download asset Open asset

Overview of the network analysis pipeline.

Step 1: For all seven (7) of the mesenchymal lineage cell clusters (mesenchymal progenitor cell [MPC], late mesenchymal progenitors [LMP], osteoblast progenitor [OBP], OB1, OB2, osteocyte-like cell [Ocy], marrow adipogenic lineage progenitor [MALP]), cell type-specific co-expression modules were generated using iterative Weighted Gene Co-expression Network Analysis (iterativeWGCNA). Step 2: Bayesian networks were learned to generate directed networks and model causal interactions between co-expressed genes. Step 3: Differentiation driver genes (DDGs) were identified by extracting subnetworks (i.e. large three-step neighborhood) for each gene in each cell type-specific Bayesian network and highlighting those subnetworks that were enriched (p_adj<0.05) for trajectory-specific tradeSeq genes for the cell-type boundary. Step 4: DDGs (and associated networks) were prioritized if the DDG was identified previously as an expression/splicing quantitative trait loci (eQTLs/sQTLs) that colocalized with BMD genome-wide association studies (GWAS) associations. Created with Biorender.com.

Identifying putative drivers of mesenchymal cell differentiation

We hypothesized that many genes impacting BMD do so by influencing osteogenic differentiation or possibly bone marrow adipogenic differentiation of key mesenchymal cell types, as suggested by the CELLECT analysis above. Therefore, the next step of our network analysis was to identify cell type-specific Bayesian networks enriched for genes potentially driving mesenchymal differentiation (Step 3, Figure 2). To accomplish this, we first performed a pseudotime trajectory analysis to infer paths of differentiation in the mesenchymal lineage cells. We resolved three pseudotime trajectories (two osteogenic, one adipogenic) originating from the MPC cell cluster and ending in either Ocy, OB2, or MALP cell fates (Figure 3A). It is important to note that given the identification of multiple skeletal stem cells (Chan et al., 2018; Mizuhashi et al., 2018; Debnath et al., 2018; Matsushita et al., 2020), we do not view these trajectories as lineages, but rather ‘differentiation paths’ (progenitor to mature and/or terminally differentiated cells) that are likely traversed by different subsets of skeletal stem cells.

Figure 3

Download asset Open asset

Pseudotime trajectory inference analysis and establishment of cell-type boundaries for tradeSeq analysis.

(A) Three (3) trajectories (two adipogenic, one adipogenic) were inferred from the mesenchymal cell clusters of the bone marrow-derived stromal cells cultured under osteogenic condition (BMSC-OB) single-cell RNA-seq (scRNA-seq) data using Slingshot. All trajectories originate from the mesenchymal progenitor cell (MPC) and end in either osteogenic (osteocyte-like cells [Ocy], OB2) or adipogenic (marrow adipogenic lineage progenitor [MALP]) cell fates. (B) For each of the trajectories, cell-type boundaries were generated using pseudotime values along the trajectories, which encompass the majority of cells of a cell-type mapping to their respective trajectory. (C) Normalized gene expression of select genes associated with each cluster is represented in feature plots (*top*) and each gene plotted as a function of pseudotime (*bottom*) for all pseudotime trajectories (color corresponds to cell-type annotation observed throughout). Vertical lines (red) represent the cell-type (pseudotime) boundaries established for each cell type (label). The cell-type boundary for OB1 and OB2 is represented as one red line/label for visualization purposes.

To identify genes likely impacting BMSC-OB differentiation, we used tradeSeq to identify genes that exhibit dynamic changes in expression along pseudotime (Van den Berge et al., 2020). Prior to performing the tradeSeq analysis, we parsed the pseudotime trajectories into regions that encompass cells associated with each cell type along their respective trajectories (Figure 3B). We defined multiple cell-type boundaries (nine in total) using pseudotime values, which represent points along a trajectory. The tradeSeq analysis was performed for each boundary (Supplementary file 2a). For example, trajectories bifurcate in the LMP cell cluster (Figure 3A); therefore, cells belonging to the LMP cluster can map to adipogenic and/or osteogenic trajectories depending on their placement along pseudotime. Between a cell-type boundary, cells spanning a specific cluster (e.g. LMP) and mapping to a specific trajectory (e.g. osteogenic trajectory) are used as input to analyze gene expression between the start and end points of the cell-type boundary (e.g. LMP_to_OBP). We analyzed gene expression within the established cell-type boundaries for all trajectories and identified genes that exhibit the most significant differences in expression between the start and end points of the cell-type boundaries. The total number of significant trajectory-specific tradeSeq genes (p_adj<0.05) ranged from 87 to 1697 across the nine cell-type boundaries (Supplementary file 2a and b–d). The expression of representative marker genes for all cell types as a function of pseudotime was consistent with boundaries defined for each cell type (Figure 3C).

We sought to identify tradeSeq genes that may have an associated eQTL and hypothesized that eQTLs that perturb their expression would also impact the proportion of cells at different stages along the cell trajectories. We performed a cell type-specific eQTL analysis for each mesenchymal cell type from the 80 DO mice scRNA-seq data. We identified 563 genes (eGenes) regulated by a significant cis-eQTL in specific cell types of the BMSC-OB scRNA-seq data. Despite being significantly underpowered for this analysis due to our relatively smaller sample size (N=80), we identified two cell type-specific eGenes where the genotype responsible for the cis-eQTL effect was also associated with cell-type proportions. The first of these genes was Pyruvate Kinase, muscle (Pkm), which was identified as a significant global tradeSeq gene (p_adj=8.35 × 10^–8; Supplementary file 2e) associated with the transition from LMPs to OBPs along an osteogenic trajectory (Figure 4A). Moreover, Pkm served as an eGene in the LMP cell cluster (LOD = 9.72; Figure 4B, Supplementary file 2f). Mice inheriting at least one PWK allele at this locus (N=15) demonstrated lower Pkm expression (Figure 4C) and a notable reduction in mature osteoblasts (OB1) and Ocy proportions (p=0.030 and p=0.026, respectively), while LMP proportions were unaffected (Figure 4D, Supplementary file 2g).

Figure 4 with 1 supplement see all

Download asset Open asset

TradeSeq-identified genes associated with bone marrow-derived stromal cells cultured under osteogenic condition (BMSC-OB) differentiation exhibit expression quantitative trait locus (eQTL) effects.

(A) *Pkm* was identified as a significant global tradeSeq-identified gene (p_adj = 8.35 × 10^–8) for late mesenchymal progenitor (LMP) cells along an osteogenic trajectory (LMP_to_OBP) (*left*). *S100a1* was identified as a significant global tradeSeq-identified gene (p_adj=0.023) for OBP cells along osteogenic trajectory 1 (OBP_to_OB1) (*right*). (B) Plots indicating the cell type-specific eQTLs signal for both *Pkm* and *S100a1*. A negative eQTL effect on *Pkm* expression was observed in LMPs for Diversity Outbred (DO) mice with a PWK haplotype background at the *Pkm* locus (*left*). A positive eQTL effect on the expression of *S100a1* was observed in OBPs for DO mice with a 129 haplotype background at the *S100a1* locus, while a negative effect was observed for NZO mice (*right*). (C) The expression of *Pkm* and *S100a1* based on DO mouse (expression values transformed via variance stabilizing transformation [VST], as described in Methods). Genotype abbreviations correspond to DO haplotype background (legend) at the respective gene locus. Mice with at least one PWK allele (genotype abbreviation G) tend to have decreased expression of *Pkm* (*left*). Mice with at least one 129 allele (genotype abbreviation C) tend to have increased expression of *S100a1*, while NZO mice (genotype abbreviation E) have decreased expression (*right*). (D) PWK mice had a significant reduction in mature osteoblasts (OB1) and osteocyte-like cells (Ocy) proportions relative to other mice (p=0.030 and p=0.026, respectively; t-test), while LMP proportions were unaffected. Asterisks represent any of the other haplotype backgrounds. 129 mice showed a significant decrease in LMP proportion and increase in OB1 proportion (p=0.008 and p=0.016, respectively; t-test), but OBP proportions were unaffected. No significant effects on cell-type proportions were observed in NZO mice (Figure 4—figure supplement 1).

Similarly, S100 calcium binding protein A1 (S100a1) was an OBP to OB1 transition tradeSeq gene (p_adj=0.023; Figure 4A, Supplementary file 2e) and an eGene in the OBP cell cluster (LOD = 10.12; Figure 4B, Supplementary file 2f). Mice inheriting at least one 129 allele at this locus (N=30) had higher S100a1 expression, while the opposite was observed for mice inheriting NZO alleles (N=14) (Figure 4C). Additionally, mice inheriting at least one 129 allele showed a significant decrease in LMP proportion and increase in OB1 proportion (p=0.008 and p=0.016, respectively) (Figure 4D, Supplementary file 2g), while no significant differences were observed in cell-type proportions among mice inheriting NZO alleles at this locus (Figure 4—figure supplement 1, Supplementary file 2g).

Identification of DDG

In order to discover BMSC-OB differentiation genes potentially responsible for BMD GWAS associations, the next step of our network analysis leveraged the trajectory-specific tradeSeq genes identified for each cell-type boundary (Supplementary file 2b–d) to identify differentiation driver genes (DDGs) (Step 3, Figure 2). We identified DDGs by extracting subnetworks (i.e. large three-step neighborhoods; see Methods) for each gene in each cell type-specific Bayesian network and identifying those subnetworks enriched (p_adj<0.05) for trajectory-specific tradeSeq genes for the cell-type boundary. The analysis identified 408 significant DDGs (Supplementary file 2h–k). We performed a PANTHER (Thomas et al., 2022) Gene Ontology (GO) analysis for the cell-type boundaries yielding a sufficient number of DDGs and found that DDGs for the osteogenic cell-type boundaries (LMP_to_OBP, OBP_to_OB1, OBP_to_OB2) were enriched for genes associated with the cell cycle (GO:0007049; N=23, 18, 23; p=1.12 × 10^–6, 1.29×10^–13, 1.0×10^–14, respectively) (Supplementary file 3a–c). The DDGs for the adipogenic cell-type boundary (LMP_to_MALP, MALP_to_end) were enriched for genes associated with extracellular matrix organization (GO:0030198; N=10; p=1.62 × 10^–7) and lipid metabolic processes (GO:0006629; N=25; p=1.83 × 10^–11), respectively (Supplementary file 3d and e). Across all 408 DDGs, 49 (12%) were identified in one or more cell-type boundaries as genes with a significant alteration (p<0.05) of whole-body BMD when knocked out/down in mice, as reported by the International Mouse Knockout Consortium (IMPC) (Groza et al., 2023; Supplementary file 2i–k).

We used our previously generated list of potentially causal BMD GWAS genes (N=1037) to subsequently prioritize the DDGs (Step 4, Figure 2). Of the 408 DDGs, 21 DDGs in one or more cell-type boundaries were genes that have BMD GWAS associations that colocalize with sQTL/eQTL (Table 1). The majority of these DDGs were identified in LMPs along both the osteogenic (LMP_to_OBP) and adipogenic (LMP_to_MALP) trajectories (N=10 and 6, respectively; Supplementary file 2h, Supplementary file 3f). The remaining DDGs were identified in OBPs along both osteoblast trajectories (OBP_to_OB1, OBP_to_OB2; N=1 and 3, respectively) and MALPs (MALP_to_end; N=6). Additionally, 3 of the 21 DDGs (Tet1, Tpx2, Timp2) are IMPC genes that exhibit a significant alteration of BMD (Supplementary file 2h, Supplementary file 3f).

Table 1

Prioritized differentiation driver genes (DDGs) that have bone mineral density (BMD) genome-wide association studies (GWAS) associations that colocalize with splicing/expression QTL (eQTL/sQTL) identified in a Genotype-Tissue Expression (GTEx) project tissue.

The tissue with the most significant colocalization (RCP and/or H4PP) is listed for each DGG (26 total, 21 distinct), as determined from Al-Barghouthi et al., 2022, and Abood et al., 2023, for eQTL and sQTL, respectively (Al-Barghouthi et al., 2022; Abood et al., 2023). RCP=Regional Colocalization Probability (GWAS and eQTL colocalization). H4P=H4 Posterior Probability (GWAS and sQTL colocalization).

Trajectory	Cell-type boundary	DDG	GTEx Tissue with strongest eQTL colocalization(RCP)	GTEx Tissue with strongest sQTL colocalization(H4PP)	eGene identified from scRNA-seq of the 80 DO mice
1	LMP to OBP	Tet1	Adipose (Visceral); 0.3191	–	–
1	LMP to OBP	Tpx2	Testis; 0.2031	–	–
1	LMP to OBP	Cdk1	–	Pituitary; 0.7795	–
1	LMP to OBP	Ttyh3	–	Liver; 0.9350	–
1	LMP to OBP	Olfml3	Artery (aorta); 0.8048	–	–
1	LMP to OBP	Izumo4	–	Brain (hypothalamus); 0.9182	–
1	LMP to OBP	Sec24d	Nerve (tibial); 0.2677		–
1	LMP to OBP	Tmem263	Adipose (subcutaneous); 0.5704	Cultured cells (fibroblasts); 0.9716	–
1	LMP to OBP	Lmf2	–	Adrenal gland; 0.8181	–
1	LMP to OBP	Tln2	Esophagus (muscularis); 0.9697	–	–
1	OBP to OB1	Kremen1	Heart (left ventricle); 0.8686	–	–
2	OBP to OB2	Kremen1	Heart (left ventricle); 0.8686	–	–
2	OBP to OB2	Ebf1	–	Testis; 0.8760	–
2	OBP to OB2	Lrp4	Pancreas; 0.7943	–	–
3	LMP to MALP	Ttyh3	–	Liver; 0.9350	–
3	LMP to MALP	Fgfrl1	Cultured cells (fibroblasts); 0.1611	–	–
3	LMP to MALP	Ebf1	–	Testis; 0.8760	–
3	LMP to MALP	Ppp1r12b	–	Nerve (tibial); 0.8807	–
3	LMP to MALP	Rhoj	Cultured cells (fibroblasts); 0.352	Breast; 0.7844	–
3	LMP to MALP	Tln2	Esophagus (muscularis); 0.9697	–	–
3	MALP to end	Adh1	–	Esophagus (gastroesophageal junction); 0.9999	–
3	MALP to end	Fgfrl1	Cultured cells (fibroblasts); 0.1611	–	–
3	MALP to end	Adcy5	–	Esophagus (gastroesophageal junction); 0.8456	–
3	MALP to end	Cnn2	–	Spleen; 0.7743	–
3	MALP to end	Mxra8	–	Pituitary; 0.7545	–
3	MALP to end	Timp2	–	Testis; 0.9429	–

Network analysis predicts Fgfrl1 and Tpx2 as novel regulators of BMD

Here, we highlight two DDGs that putatively impact human BMD via their roles in LMP differentiation along either an adipogenic (Fgfrl1) or osteogenic (Tpx2) trajectory, which are genes with potential roles that have been minimally characterized in the context of human BMD. Based on our previous work (Al-Barghouthi et al., 2022), Fgfrl1 (fibroblast growth factor receptor-like 1) was identified as a DDG with significant human BMD GWAS associations that also colocalized with eQTL identified in the cultured fibroblast GTEx tissue (RCP = 0.1611, Table 1). The Fgfrl1 network was enriched for tradeSeq-identified genes (N=6 genes, p_adj = 7.5 × 10^–3) for LMPs along an adipogenic trajectory (Figure 5A). An increase in the expression of all tradeSeq-identified genes for the Fgfrl1 network was observed (Figure 5B, Supplementary file 2d). Importantly, the expression pattern for the tradeSeq-identified genes was consistent with the cell-type boundaries established for LMPs differentiating along the adipogenic trajectory toward the MALP cell state (Figure 5B). Furthermore, in the surrounding Fgfrl1 network, two genes (Plpp3 and Cfap100) have significant human BMD GWAS associations that also colocalized with sQTL in GTEx tissues, as reported in our previous study (Abood et al., 2023). In the Fgfrl1 network, many other genes can be associated with adipocyte function (e.g. Lpl, Plpp3, Igfbp4) (Enerbäck et al., 1992; Federico et al., 2018; Maridas et al., 2017) and the maintenance of cilia (e.g. Cfap100, St5 (Denn2b), Mark1) (Sigg et al., 2017; Kumar et al., 2022; Fumoto et al., 2019).

Figure 5

Download asset Open asset

*Fgfrl1* and *Tpx2* are prioritized differentiation driver genes (DDGs) and putative drivers of mesenchymal differentiation.

(A) *Fgfrl1* was identified as a DDG of a network for late mesenchymal progenitors (LMPs) differentiating along an adipogenic trajectory. The network is enriched (p_adj = 7.5 × 10^–3) for trajectory-specific tradeSeq-identified genes for the LMP_to_MALP cell-type boundary (*Hnmt, St5, Igfbp4, Cyp1b1, Pdzrn4, Mark1*). *Fgfrl1* was previously identified as a gene that has bone mineral density (BMD) genome-wide association studies (GWAS) associations that colocalize with an expression quantitative trait locus (eQTL) in the cultured fibroblast Genotype-Tissue Expression (GTEx) tissue. (B) An increase in the expression of tradeSeq-identified genes coincides with the LMP_to_MALP cell-type boundary in which they were identified as significant. (C) *Tpx2* was identified as a DDG of a network for LMPs differentiating along an osteogenic trajectory. The network is enriched (p_adj = 5.7 × 10^–7) for tradeSeq-identified genes for the LMP_to_OBP cell-type boundary (*Tpx2, Top2a, Kif4, Iqgap3, Prc1, Kif11, Ect2, Sgo2a, Ube2c*). *Tpx2* is both a tradeSeq gene and previously identified as a gene that has BMD GWAS associations that colocalize with an eQTL in the Testis GTEx tissue. (D) An increase in the expression of tradeSeq-identified genes coincides with the LMP_to_OBP cell-type boundary in which they were identified as significant. (E) Box plot displaying whole-body BMD measurements (excluding skull) from the International Mouse Knockout Consortium (IMPC) for *Tpx2* mutant mice, which exhibited a significant increase in BMD (genotype p-value = 1.03 × 10^–3) in both male and female mice (N=8 (M) and 8 (F) mutants; N=2574 (M) and 2633 (F) controls).

The other network we identified, the Tpx2 network, was identified for LMPs along an osteogenic trajectory (Figure 5C). Tpx2 (TPX2, microtubule-associated) is a DDG with significant human BMD GWAS associations that also colocalized with eQTL identified in the Testis GTEx tissue (RCP = 0.2031, Table 1). The network was enriched for tradeSeq-identified genes (N=9 genes, p_adj = 5.7 × 10^–7) for LMPs differentiating along the osteogenic trajectory (Figure 5C). Furthermore, the expression of the tradeSeq-identified genes for the Tpx2 network was consistent with the cell-type boundaries established for LMPs differentiating along the osteogenic trajectory toward the OBP cell state (Figure 5D; Supplementary file 2b). The expression of these genes increases as LMPs differentiate into OBPs and subsequently decreases upon reaching an OBP cell state. Additionally, Tpx2 exhibited a significant alteration of BMD in both male and female mutant mice (genotype p-value = 1.03 × 10^–3) from IMPC (Figure 5E). In regard to the constituents of the Tpx2 network, additional genes have been tested by the IMPC and result in a significant impact on BMD, such as Ube2c, Top2a, and Papss1. Many other genes in the Tpx2 network can be associated with cellular division and proliferation, including four of the genes of the kinesin family (Kif) motor protein genes (Miki et al., 2001): Kif4, Kif11, Kif15, Kif23.

Discussion

BMD GWAS has been successful at identifying thousands of SNPs associated with disease; however, the identification of causal genes and defining their functional role in disease remains challenging. The integration of ‘-omics’ data, particularly transcriptomics, can assist in overcoming this challenge. Leveraging transcriptomics data has proven invaluable to informing GWAS, as demonstrated in studies that use these data to perform eQTL mapping, transcriptome-wide association studies, and co-expression/gene regulatory network reconstruction. GWAS associations can colocalize with predicted sources of genetic variation that perturb causal gene function or expression, thus providing a potential mechanism through which associations impact disease. While bulk RNA-seq data has been the foundation of such analyses, scRNA-seq data can provide valuable biological context by predicting the cell type in which causal genes are affected. To inform BMD GWAS, the generation of population-scale transcriptomics data at single-cell resolution in bone-relevant cell types can assist in the discovery of novel gene targets. Here, we perform scRNA-seq on 80 DO mice to generate single-cell transcriptomics data of mesenchymal cell types relevant to bone. Using these data, our goal was to prioritize putative causal genes and provide biological context in which these genes potentially influence disease, at cell type-specific resolution. Through our pseudotemporal gene expression and network analyses, we identified 21 networks governed by predicted DDGs that have corresponding human BMD GWAS associations colocalizing with eQTL/sQTL in a GTEx tissue.

We demonstrate that the BMSC-OB model serves as an effective method to enrich for mesenchymal lineage cells, particularly bone-relevant cells. We characterized cells from 80 mice and identified both osteogenic and adipogenic cells derived from the mesenchymal lineage, such as two populations of osteoblasts (OB1 and OB2), Ocy, and MALPs. Our trajectory inference analysis identified three distinct trajectories in which MPCs give rise to both osteogenic and adipogenic cell types, thus portraying biologically relevant and known paths of differentiation of MPCs. Pseudotemporal gene expression was analyzed along each trajectory, in a cell type-specific fashion, to identify genes that were changing the most as a function of pseudotime (tradeSeq-identified genes). Subsequent cis-eQTL analysis indicated that the expression of some tradeSeq-identified genes was associated with the relative proportion of cell types. While instances such as these were rare, they illustrate that the potential consequence of genetic variation impacting the expression of tradeSeq-identified genes may impact differentiation and the abundances of certain cell types along a trajectory.

To inform BMD GWAS, we utilized the scRNA-seq data in a network analysis to contextualize causal genes (and their associated networks) by predicting the cell types through which these genes are likely acting. Toward this goal, we generated cell type-specific Bayesian networks from our BMSC-OB scRNA-seq data. Our approach was similar to our previous network analyses where bulk RNA-seq data was leveraged to identify genes with strong evidence of playing central roles in networks (Calabrese et al., 2017; Sabik et al., 2020; Al-Barghouthi et al., 2021). In contrast, here, we utilized scRNA-seq data to identify DDGs and prioritize networks based on the likelihood that they are involved in the differentiation of mesenchymal lineage cells (based on network connections enriched for tradeSeq-identified genes determined from inferred trajectories). Leveraging our previous work (Al-Barghouthi et al., 2022; Abood et al., 2023), we prioritized DDGs if they were genes with BMD GWAS associations colocalizing with human eQTL/sQTL in a GTEx tissue. Together, a gene being both a DDG and having BMD GWAS associations that colocalize with eQTL/sQTL is strong support of causality.

We identified 21 DDGs and associated networks, some of which have little to no known prior connection to bone. We contextualize these causal genes and their networks by not only providing cell-type predictions in which they likely operate, but also providing information regarding the biological processes they likely affect. For example, the Tpx2 network was identified in LMPs differentiating along an osteogenic trajectory. Tpx2 is a microtubule assembly factor that interacts with spindle microtubules during cellular division (Zhang et al., 2017). The expression of Tpx2 and its regulation is associated with osteosarcoma, as well as other cancers (Zhu et al., 2022). In our previous study, Tpx2 was identified as a gene that has BMD GWAS associations that colocalize with eQTL in the Testis GTEx tissue (Al-Barghouthi et al., 2022). While GTEx does not maintain bone tissue, eQTLs are shared across many tissues (GTEx Consortium, 2017); therefore, non-bone eQTLs may exert their effects in cell types associated with bone, such as LMPs, and evidence of a human eQTL effect indicates that genetic variation can modulate the expression of Tpx2. Additionally, when knocked out by IMPC, Tpx2 exhibited a significant increase in whole-body BMD in mice, thus providing strong support for Tpx2 influencing the regulation of BMD in humans. In the surrounding gene neighborhood of the Tpx2 network, other genes can be associated with cellular division as well, such as Topoisomerase 2A (Top2a) and the kinesin family (Kif) genes (Miki et al., 2001; Uusküla-Reimand and Wilson, 2022). Taken together, these results indicate a potential role of Tpx2 as a mediator of BMD and genetic variation altering its expression could affect microtubule maintenance during the expansion of osteogenic cell populations.

Additionally, the Fgfrl1 network was identified in LMPs differentiating along an adipogenic trajectory. Fibroblast growth factor receptor-like 1 (Fgfrl1) is presumed to function as a decoy receptor that interacts with FGF ligands necessary for FGF signaling (Trueb, 2011; Steinberg et al., 2010), and Fgfrl1 expression is suggested to play a role in both adipogenic and osteogenic differentiation (Kähkönen et al., 2018). Our previous study also identified Fgfrl1, which has BMD GWAS associations that colocalize with eQTL in the cultured fibroblasts GTEx tissue (Al-Barghouthi et al., 2021). In the neighborhood of the Fgfrl1 network, Lpl, Plpp3, Igfbp4 have well-established roles in adipocyte function and metabolism (Enerbäck et al., 1992; Federico et al., 2018; Maridas et al., 2017); however, other genes can be associated with cilia, such as Cfap100, St5 (Denn2b), Mark1 (Sigg et al., 2017; Kumar et al., 2022; Fumoto et al., 2019). Interestingly, the maintenance and remodeling of cilia is essential to the differentiation of mesenchymal stem cells and pre-adipocytes (e.g. MALPs) while mature adipocytes lack cilia (Hilgendorf, 2021). Moreover, the inactivation of FGF signaling is associated with the length of primary cilia (Neugebauer et al., 2009). Thus, genetic variation altering the expression of expression of Fgfrl1 may affect FGF signaling to impact the maintenance of cilia and adipogenic differentiation. Additionally, given the prioritization of MALPs in the CELLECT analysis and the well-established inverse relationship between marrow adiposity and BMD (Fazeli et al., 2013; Veldhuis‐Vlug and Rosen, 2018), skewed balance of LMP differentiation toward marrow adipogenic cell fates may affect BMD. In summary, the Fgfrl1 network harbors genes involved in adipogenic function, including cilia maintenance, which may contribute to LMP differentiation along an adipogenic trajectory. Together, these results indicate a potential role of Fgfrl1 as a mediator of BMD via its role in adipogenic differentiation and maintenance of cilia.

Analyses performed here are not without limitations to consider. Our in vitro culturing approach and the preparation of single cells for scRNA-seq could be sources of technical variation in our study. Additionally, a pitfall of scRNA-seq is the sparsity of the resulting data, which yields an increased frequency of zero values for the expression of some genes in a proportion of cells, also known as ‘drop-outs’ (Haque et al., 2017). While statistical approaches can be employed to impute missing data, the accuracy of such methods and whether or not the resulting improvement in transcriptomic signal recovery is enough to warrant such intervention is contentious (Cheng et al., 2023; Yu et al., 2021). However, this issue may be partially offset given the larger scale of the scRNA-seq performed in this study and the average expression approach performed for network and eQTL analysis. Another limitation of this study is that read alignment of the scRNA-seq data did not account for DO founder genetic variation in RNA transcripts, which could affect read mapping and gene expression measurements. An additional limitation is that the BMSC-OB model does not capture osteoclasts, another cell type associated with bone tissue. Importantly, results from our CELLECT analysis indicate that BMD heritability was not enriched for genes whose expression was more specific to osteoclast-like cells; however, these cells likely represent immature osteoclasts, as mature multinucleated cells would be too large to be captured for sequencing. Lastly, while our study employed 80 DO mice, the issue of statistical power is still a limitation; however, we demonstrate that the BMSC-OB model is amenable to high throughput and the inclusion of hundreds of mice, thus statistical power will be improved in future studies.

In summary, we showcase the use of large-scale scRNA-seq data to inform GWAS by performing a network analysis to contextualize BMD GWAS associations. Through the use of multiple single-cell analyses, we have expanded upon our understanding of the genetics of BMD. Our work exemplifies the power of single-cell transcriptomics from large populations of genetically diverse samples, and our network approach for data analysis may guide future studies to consider systems genetics strategies for the discovery of genetic determinants of disease.

Methods

Sample preparation and scRNA-seq

All animal procedures were conducted in compliance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals. The protocol for studies involving DO mice (Protocol Number 3741) was reviewed and approved by the Institutional Animal Care and Use Committee (IACUC) at the University of Virginia. We prepared our samples in the same fashion as performed previously in Dillard et al., 2023. In brief, bone marrow was extracted from the femurs of initially 77 DO mice (The Jackson Laboratory, Strain: 009376). BMSCs were grown to confluence after 3 days of incubation in 48-well plates and then underwent in vitro osteoblast differentiation for 10 days with osteogenic differentiation media (alpha MEM, 10% FBS, 1% pen/strep, 1% GlutaMAX, 50 μg/μL ascorbic acid [Sigma, St. Louis, MO, USA], 10 nM β-glycerophosphate [Sigma], 10 nM dexamethasone [Sigma]). After differentiation, single cells were liberated from mineralizing cultures via incubations with 60 mM ethylenediaminetetraacetic acid pH 7.4 (EDTA [Thermo Fisher Scientific], made in DPBS), 8 mg/mL collagenase (Gibco) in HBSS/4 mM CaCl₂ (Fisher), and 0.25% trypsin-EDTA (Gibco). After single-cell isolation, cells from mice were pooled into groups containing cells from four to five mice total and concentrated to 800 cells/μL in PBS supplemented with 0.1% BSA (bovine serum albumin). Pooled single cells were prepared for sequencing using the 10× Chromium Controller (10× Genomics, Pleasanton, CA, USA) with the Single Cell 3’ v2 reagent kit, according to the manufacturer’s protocol. Libraries were sequenced on the NextSeq500 (Illumina, San Diego, CA, USA).

scRNA-seq analysis pipeline

The data was subsequently processed using the 10× Genomics Cell Ranger toolkit (version 5.0.0) using the GRCm38 reference genome (Church et al., 2009). Using Seurat (Hao et al., 2021) (version 4.1.0), a combined Seurat object containing all cells was generated with the inclusion of features detected in at least three cells and cells with at least 200 features detected. We used Souporcell (Heaton et al., 2020) (version 2.0.0) to deconvolve the genotypes of all mice and to remove doublet cells. Cells were assigned to their associated DO mouse by making a pairwise comparison between allele calls made by the shared variants captured between Souporcell and GigaMUGA genotype arrays generated for all mice in the cohort, as previously performed in Dillard et al., 2023. Cells derived from two mice (176 and 244) were removed in some analyses due to poor genotyping of their respective Souporcell clusters, thus yielding a total of 75 DO mice from this study and 5 DO mice from our previous study (Dillard et al., 2023) for a total of 80 DO mouse biological replicates. We filtered out cells with more than 6200 reads and less than 400 reads, as well as those cells with more than 10% mitochondrial reads. Further, cells were removed if they expressed greater than 20% Rpl and 15% Rps reads, which equates to cells approximately exceeding the 98th percentile. After filtering, 139,392 cells remained, and the resulting object underwent standard normalization, scaling, and the top 3000 features were modeled from a variance stabilizing transformation (VST) using Seurat. Cell-cycle markers based on Tirosh et al., 2016, were regressed out using the ‘CellCycleScoring’ and scaling functions. For subsequent dimensionality reduction, 15 principal components (PCs) were summarized. Then, a kNN (k = 20) graph was created and the Louvain algorithm was used to cluster cells at a resolution of 0.5. Annotation of cell-type clusters was performed manually based on differential gene expression analysis using the Seurat ‘FindAllMarkers’ function (Supplementary file 1a).

For subsequent WGCNA and eQTL mapping, transcriptomic profiles for each cell-type cluster were generated for each sample using a mean expression approach, as performed similarly by others (Neavin et al., 2021; van der Wijst et al., 2018). For each sample contributing at least five cells to a given cluster, unnormalized unique molecular identifier (UMI) counts of gene expression for all cells in the cluster for the sample were averaged and then rounded to the nearest hundredth decimal place. A total of 80, 80, 77, 67, 50, 76, 80 mice contributed enough cells to the MPC, LMP, OBP, OB1, OB2, Ocy, and MALP cell-type clusters, respectively. Genes with non-zero expression values in fewer than 15 samples were removed. A total of 11,971, 15,162, 14,857, 13,674, 13,825, 14,136, and 14,534 genes remained for the MPC, LMP, OBP, OB1, OB2, Ocy, and MALP clusters, respectively. Samples were normalized by computing CPMs (counts per million) without log transformation for each gene using edgeR (Robinson et al., 2010) (version 4.0.7), then transformed via VST using DESeq2 (Love et al., 2014) (version 1.42.0), and quantile normalized using preprocessCore (version 1.60.2).

Trajectory and tradeSeq analysis

Trajectory inference analysis was performed using Slingshot (Street et al., 2018) (version 1.8.0) on the mesenchymal lineage cell clusters (seven total) of the BMSC-OB scRNA-seq data. The starting cluster was set as the MPC cluster upon the removal of a small outlier population of cells. Trajectories were inferred using 15 PCs. TradeSeq (Van den Berge et al., 2020) (version 1.4.0) was used to analyze gene expression along the trajectories by fitting a negative binomial generalized additive model (NB-GAM) to each gene using the ‘fitGAM’ function with nknots = 10, which was determined by using the ‘evaluateK’ function. Prior to performing the tradeSeq analysis, the scRNA-seq data was downsampled to reduce the size of the dataset to approximately 10,000 cells (sampled at random across all seven clusters).

All cell-type boundaries were established to encompass, on average, 78% of cells of a cell cluster (Supplementary file 2a). To identify genes significantly changing between boundaries in a trajectory-specific fashion, we first performed tradeSeq to compare gene expression within each trajectory (two osteogenic, one adipogenic) to highlight genes with a significant difference in expression between boundaries using the ‘startVsEndTest’ function (Supplementary file 2a–d). Next, we performed a global test with tradeSeq to compare gene expression between trajectories in order to highlight genes exhibiting a significant difference in expression using the ‘startVsEndTest’ function (Supplementary file 2a, Supplementary file 2e). All tests were performed with the log₂ fold change threshold (l2fc)=0.5. For all global and trajectory-specific tests, the p-values associated with each gene were adjusted to control the false discovery rate using the ‘p.adjust’ function from the stats (version 4.2.1) R package, and genes were filtered to include those with a p_adj<0.05.

CELLECT analysis

CELLECT (Timshel et al., 2020) (CELL-type Expression-specific integration for Complex Traits) (version 1.1.0) was used to identify likely etiologic cell types underlying complex traits of both the BMSC-OBs scRNA-seq data (Figure 1E, Supplementary file 1f). CELLECT p-values were adjusted using the Bonferroni correction. CELLECT quantifies the association between the GWAS signal and cell-type expression specificity using the S-LDSC genetic prioritization model (Finucane et al., 2015). Summary statistics from the UK Biobank eBMD and Fracture GWAS (Data Release 2018) and cell-type annotations from each scRNA-seq data set were used as input. Cell-type expression specificities were estimated using CELLEX (Timshel et al., 2020) (CELL-type EXpression-specificity) (version 1.2.1) (Supplementary file 3g).

WGCNA

Cell type-specific mean expression matrices (as obtained above) were used as input to generate signed co-expression network modules (Supplementary file 1g and h). IterativeWGCNA (Greenfest-Allen et al., 2017) (version 1.1.6) was used from a Singularity container built from a Docker hub image (Cartailler, 2022). A soft threshold (power) of 14, which exceeded an R² threshold of 0.85 for all cell-type clusters, was selected for module construction (Figure 2—figure supplement 1). Modules were generated using iterativeWGCNA with default parameters for the ‘blockwiseModules’ function, a minimum module size of 20 genes, minCoreKME = 0.7, and minKMEtoStay = 0.5.

Bayesian network learning

Bayesian networks were learned from each of the cell type-specific modules of co-expressed genes with the bnlearn (version 4.8.3). Gene expression matrices containing the genes for each module were used as input to the ‘mmhc’ function which employs the Max-Min Hill Climbing (MMHC) algorithm (Tsamardinos et al., 2006) to learn the underlying structure of the Bayesian network. From the generated networks, igraph (version 1.6.0) was used to resolve three-step neighborhoods (Porter and Smith, 2010). Nodes (genes) that were unconnected to a neighborhood or connected to only one neighbor were removed. Neighborhoods were filtered to include those with a size greater than 1 standard deviation from the mean across all neighborhoods generated for the network.

DDGs are genes that yield large three-step neighborhoods that are enriched (p_adj<0.05) with tradeSeq-identified genes for a given cell-type boundary. We calculated whether each neighborhood contained more tradeSeq-identified genes (for the neighborhoods’ associated cell-type boundary) than would be expected by chance using the hypergeometric distribution (‘phyper’ function) from the stats (version 4.2.1) R package. The arguments were as follows: q: (number of neighbors in a neighborhood that are also tradeSeq-identified genes for a given cell-type boundary) – 1; m: total number of tradeSeq-identified genes for a given cell-type boundary; n: (total number of identified neighborhoods) – m; k: neighborhood size (total number of neighbors); lower.tail=false. p-Values were adjusted to control the false discovery rate using the ‘p.adjust’ function from the stats (version 4.2.1) R package. These pruning steps resulted in a total of 408 DDGs and associated networks for all cell types (Supplementary file 2h–k).

DO eQTL mapping

Prior to performing the eQTL analysis, DNA was extracted from the tails of the 80 DO mice, using the PureLink Genomic DNA Mini Kit (Invitrogen) and genotyped using the GigaMUGA array by Neogen Genomics (GeneSeek; Lincoln, NE, USA). Processing and quality control of genotype data, including calculation of genotype/allele probabilities, was performed as previously described in Al-Barghouthi et al., 2021. Cell type-specific mean expression matrices (as obtained above) for mesenchymal lineage clusters were used as input for the eQTL mapping, which was performed using a linear mixed model via the ‘scan1’ function from the qtl2 (Broman et al., 2019) (version 0.30) R package with allowances for the following covariates: sex, age at sacrifice (in days), weight, length, and DO mouse generation. To identify significant eQTL, we calculated an LOD (logarithm of the odds) threshold; for each cell-type cluster, we chose 50 genes at random and then permuted them 1000 times using the ‘scan1perm’ function from qtl2. We established the LOD threshold of 9.68 and 9.49 for the autosomal chromosomes and X chromosome, respectively, by taking the average of the median LOD across each cell type. A total of 563 eQTLs exceeded the LOD thresholds and were no more than 1 Mbp from the transcription start site of the associated eGene (Supplementary file 2f).

Cell-type proportion analysis

To account for technical sources of variation often retained in scRNA-seq, cell-type proportions were transformed using the arcsin (asin) square root transformation from the speckle (Phipson et al., 2022) R package (version 0.0.3). Tests of statistical significance were performed using the propeller t-test and ANOVA functions with default parameters. The sex of the mice and the batch each mouse was associated with for sequencing were modeled as covariates. Transformed values were used as input for computing tests of statistical differences of cell-type proportions between mice, as well as correlation to phenotypic traits (Supplementary file 1c–e).

Data availability

The data that support the findings of this study are openly available in NCBI Gene Expression Omnibus database with accession codes GSE152806 and GSE269583. Processed scRNA-seq data available on Zenodo at https://doi.org/10.5281/zenodo.15299630. Code for analysis is available on GitHub at https://github.com/Farber-Lab/DO80_project (copy archived at Farber, 2025).

The following data sets were generated

(2025) NCBI Gene Expression Omnibus
ID GSE269583. Cell type-specific network analysis in Diversity Outbred mice identifies genes potentially responsible for human bone mineral density GWAS associations.

https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE269583
1. Dillard L
(2024) Zenodo
Processed Seurat Object.

https://doi.org/10.5281/zenodo.15299630

The following previously published data sets were used

1. Al-Barghouthi B
2. Mesner L
3. Calabrese G
4. Brooks D
5. Tommasini S
6. Bouxsein M
7. Horowitz M
8. Rosen C
9. Nguyen K
10. Haddox S
11. Farber E
12. Onengut-Gumuscu S
13. Pomp D
14. Farber C
(2020) NCBI Gene Expression Omnibus
ID GSE152806. Single-cell RNA-seq of bone marrow-derived stromal cells from 5 Diversity Outbred mice.

https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE152806

References

Preprint
1. Abood A
2. Mesner LD
3. Jeffery ED
4. Murali M
5. Lehe M
6. Saquing J
7. Farber CR
8. Sheynkman GM
(2023) Long-Read Proteogenomics to Connect Disease-Associated sQTLs to the Protein Isoform Effectors of Disease
bioRxiv.

https://doi.org/10.1101/2023.03.17.531557
- Google Scholar
1. Aguet F
2. Anand S
3. Ardlie KG
4. Gabriel S
5. Getz GA
6. Graubert A
7. Hadley K
8. Handsaker RE
9. Huang KH
10. Kashin S
11. Li X
12. MacArthur DG
13. Meier SR
14. Nedzel JL
15. Nguyen DT
16. Segrè AV
17. Todres E
18. Balliu B
19. Barbeira AN
20. Battle A
21. Bonazzola R
22. Brown A
23. Brown CD
24. Castel SE
25. Conrad DF
26. Cotter DJ
27. Cox N
28. Das S
29. de Goede OM
30. Dermitzakis ET
31. Einson J
32. Engelhardt BE
33. Eskin E
34. Eulalio TY
35. Ferraro NM
36. Flynn ED
37. Fresard L
38. Gamazon ER
39. Garrido-Martín D
40. Gay NR
41. Gloudemans MJ
42. Guigó R
43. Hame AR
44. He Y
45. Hoffman PJ
46. Hormozdiari F
47. Hou L
48. Im HK
49. Jo B
50. Kasela S
51. Kellis M
52. Kim-Hellmuth S
53. Kwong A
54. Lappalainen T
55. Li X
56. Liang Y
57. Mangul S
58. Mohammadi P
59. Montgomery SB
60. Muñoz-Aguirre M
61. Nachun DC
62. Nobel AB
63. Oliva M
64. Park Y
65. Park Y
66. Parsana P
67. Rao AS
68. Reverter F
69. Rouhana JM
70. Sabatti C
71. Saha A
72. Stephens M
73. Stranger BE
74. Strober BJ
75. Teran NA
76. Viñuela A
77. Wang G
78. Wen X
79. Wright F
80. Wucher V
81. Zou Y
82. Ferreira PG
83. Li G
84. Melé M
85. Yeger-Lotem E
86. Barcus ME
87. Bradbury D
88. Krubit T
89. McLean JA
90. Qi L
91. Robinson K
92. Roche NV
93. Smith AM
94. Sobin L
95. Tabor DE
96. Undale A
97. Bridge J
98. Brigham LE
99. Foster BA
100. Gillard BM
101. Hasz R
102. Hunter M
103. Johns C
104. Johnson M
105. Karasik E
106. Kopen G
107. Leinweber WF
108. McDonald A
109. Moser MT
110. Myer K
111. Ramsey KD
112. Roe B
113. Shad S
114. Thomas JA
115. Walters G
116. Washington M
117. Wheeler J
118. Jewell SD
119. Rohrer DC
120. Valley DR
121. Davis DA
122. Mash DC
123. Branton PA
124. Barker LK
125. Gardiner HM
126. Mosavel M
127. Siminoff LA
128. Flicek P
129. Haeussler M
130. Juettemann T
131. Kent WJ
132. Lee CM
133. Powell CC
134. Rosenbloom KR
135. Ruffier M
136. Sheppard D
137. Taylor K
138. Trevanion SJ
139. Zerbino DR
140. Abell NS
141. Akey J
142. Chen L
143. Demanelis K
144. Doherty JA
145. Feinberg AP
146. Hansen KD
147. Hickey PF
148. Jasmine F
149. Jiang L
150. Kaul R
151. Kibriya MG
152. Li JB
153. Li Q
154. Lin S
155. Linder SE
156. Pierce BL
157. Rizzardi LF
158. Skol AD
159. Smith KS
160. Snyder M
161. Stamatoyannopoulos J
162. Tang H
163. Wang M
164. Carithers LJ
165. Guan P
166. Koester SE
167. Little AR
168. Moore HM
169. Nierras CR
170. Rao AK
171. Vaught JB
172. Volpi S
173. The GTEx Consortium
(2020) The GTEx consortium atlas of genetic regulatory effects across human tissues
Science 369:1318–1330.

https://doi.org/10.1126/science.aaz1776
- Google Scholar
1. Akiyama M
(2021) Multi-omics study for interpretation of genome-wide association study
Journal of Human Genetics 66:3–10.

https://doi.org/10.1038/s10038-020-00842-5
- PubMed
- Google Scholar
1. Al-Barghouthi BM
2. Mesner LD
3. Calabrese GM
4. Brooks D
5. Tommasini SM
6. Bouxsein ML
7. Horowitz MC
8. Rosen CJ
9. Nguyen K
10. Haddox S
11. Farber EA
12. Onengut-Gumuscu S
13. Pomp D
14. Farber CR
(2021) Systems genetics in diversity outbred mice inform BMD GWAS and identify determinants of bone strength
Nature Communications 12:3408.

https://doi.org/10.1038/s41467-021-23649-0
- PubMed
- Google Scholar
1. Al-Barghouthi BM
2. Rosenow WT
3. Du K-P
4. Heo J
5. Maynard R
6. Mesner L
7. Calabrese G
8. Nakasone A
9. Senwar B
10. Gerstenfeld L
11. Larner J
12. Ferguson V
13. Ackert-Bicknell C
14. Morgan E
15. Brautigan D
16. Farber CR
(2022) Transcriptome-wide association study and eQTL colocalization identify potentially causal genes responsible for human bone mineral density GWAS associations
eLife 11:e77285.

https://doi.org/10.7554/eLife.77285
- PubMed
- Google Scholar
(2015) Collaborative cross and diversity outbred data resources in the mouse phenome database
Mammalian Genome 26:511–520.

https://doi.org/10.1007/s00335-015-9595-6
- PubMed
- Google Scholar
1. Broman KW
2. Gatti DM
3. Simecek P
4. Furlotte NA
5. Prins P
6. Sen Ś
7. Yandell BS
8. Churchill GA
(2019) R/qtl2: software for mapping quantitative trait loci with high-dimensional data and multiparent populations
Genetics 211:495–502.

https://doi.org/10.1534/genetics.118.301595
- PubMed
- Google Scholar
(2017) Integrating GWAS and co-expression network data identifies bone mineral density genes SPTBN1 and MARK3 and an osteoblast functional module
Cell Systems 4:46–59.

https://doi.org/10.1016/j.cels.2016.10.014
- PubMed
- Google Scholar
Software
1. Cartailler JP
(2022) Iterativewgcna, version R 3.6.3
Docker Hub.

https://hub.docker.com/r/jpcartailler/iterativewgcna
1. Chan CKF
2. Gulati GS
3. Sinha R
4. Tompkins JV
5. Lopez M
6. Carter AC
7. Ransom RC
8. Reinisch A
9. Wearda T
10. Murphy M
11. Brewer RE
12. Koepke LS
13. Marecic O
14. Manjunath A
15. Seo EY
16. Leavitt T
17. Lu WJ
18. Nguyen A
19. Conley SD
20. Salhotra A
21. Ambrosi TH
22. Borrelli MR
23. Siebel T
24. Chan K
25. Schallmoser K
26. Seita J
27. Sahoo D
28. Goodnough H
29. Bishop J
30. Gardner M
31. Majeti R
32. Wan DC
33. Goodman S
34. Weissman IL
35. Chang HY
36. Longaker MT
(2018) Identification of the human skeletal stem cell
Cell 175:43–56.

https://doi.org/10.1016/j.cell.2018.07.029
- PubMed
- Google Scholar
1. Cheng Y
2. Ma X
3. Yuan L
4. Sun Z
5. Wang P
(2023) Evaluating imputation methods for single-cell RNA-seq data
BMC Bioinformatics 24:302.

https://doi.org/10.1186/s12859-023-05417-7
- Google Scholar
1. Church DM
2. Goodstadt L
3. Hillier LW
4. Zody MC
5. Goldstein S
6. She X
7. Bult CJ
8. Agarwala R
9. Cherry JL
10. DiCuccio M
11. Hlavina W
12. Kapustin Y
13. Meric P
14. Maglott D
15. Birtle Z
16. Marques AC
17. Graves T
18. Zhou S
19. Teague B
20. Potamousis K
21. Churas C
22. Place M
23. Herschleb J
24. Runnheim R
25. Forrest D
26. Amos-Landgraf J
27. Schwartz DC
28. Cheng Z
29. Lindblad-Toh K
30. Eichler EE
31. Ponting CP
32. Mouse Genome Sequencing Consortium
(2009) Lineage-specific biology revealed by a finished genome assembly of the mouse
PLOS Biology 7:e1000112.

https://doi.org/10.1371/journal.pbio.1000112
- PubMed
- Google Scholar
(2012) The diversity outbred mouse population
Mammalian Genome 23:713–718.

https://doi.org/10.1007/s00335-012-9414-2
- PubMed
- Google Scholar
1. Cookson W
2. Liang L
3. Abecasis G
4. Moffatt M
5. Lathrop M
(2009) Mapping complex disease traits with global gene expression
Nature Reviews. Genetics 10:184–194.

https://doi.org/10.1038/nrg2537
- PubMed
- Google Scholar
1. Debnath S
2. Yallowitz AR
3. McCormick J
4. Lalani S
5. Zhang T
6. Xu R
7. Li N
8. Liu Y
9. Yang YS
10. Eiseman M
11. Shim J-H
12. Hameed M
13. Healey JH
14. Bostrom MP
15. Landau DA
16. Greenblatt MB
(2018) Discovery of a periosteal stem cell mediating intramembranous bone formation
Nature 562:133–139.

https://doi.org/10.1038/s41586-018-0554-8
- PubMed
- Google Scholar
1. Dillard LJ
2. Rosenow WT
3. Calabrese GM
4. Mesner LD
5. Al-Barghouthi BM
6. Abood A
7. Farber EA
8. Onengut-Gumuscu S
9. Tommasini SM
10. Horowitz MA
11. Rosen CJ
12. Yao L
13. Qin L
14. Farber CR
(2023) Single-cell transcriptomics of bone marrow stromal cells in diversity outbred mice: a model for population-level scRNA-seq studies
Journal of Bone and Mineral Research 38:1350–1363.

https://doi.org/10.1002/jbmr.4882
- PubMed
- Google Scholar
(1992) Characterization of the human lipoprotein lipase (LPL) promoter: evidence of two cis-regulatory regions, LP-alpha and LP-beta, of importance for the differentiation-linked induction of the LPL gene during adipogenesis
Molecular and Cellular Biology 12:4622–4633.

https://doi.org/10.1128/mcb.12.10.4622-4633.1992
- PubMed
- Google Scholar
Software
1. Farber C
(2025) DO80_project, version swh:1:rev:714435a02c00b89190e66cc065b84c7ccdf404c6
Software Heritage.

https://archive.softwareheritage.org/swh:1:dir:1f8f1969534429dfd1d16dd67d2fbde129618b40;origin=https://github.com/Farber-Lab/DO80_project;visit=swh:1:snp:c2b1ffe4885c2fdb02194ad398586af7583d853a;anchor=swh:1:rev:714435a02c00b89190e66cc065b84c7ccdf404c6
(2013) Marrow fat and bone--new perspectives
The Journal of Clinical Endocrinology and Metabolism 98:935–945.

https://doi.org/10.1210/jc.2012-3634
- PubMed
- Google Scholar
1. Federico L
2. Yang L
3. Brandon J
4. Panchatcharam M
5. Ren H
6. Mueller P
7. Sunkara M
8. Escalante-Alcalde D
9. Morris AJ
10. Smyth SS
(2018) Lipid phosphate phosphatase 3 regulates adipocyte sphingolipid synthesis, but not developmental adipogenesis or diet-induced obesity in mice
PLOS ONE 13:e0198063.

https://doi.org/10.1371/journal.pone.0198063
- PubMed
- Google Scholar
(2015) Partitioning heritability by functional annotation using genome-wide association summary statistics
Nature Genetics 47:1228–1235.

https://doi.org/10.1038/ng.3404
- PubMed
- Google Scholar
(2019) Mark1 regulates distal airspace expansion through type I pneumocyte flattening in lung development
Journal of Cell Science 132:jcs235556.

https://doi.org/10.1242/jcs.235556
- PubMed
- Google Scholar
Preprint
(2017) iterativeWGCNA: Iterative Refinement to Improve Module Detection from WGCNA Co-Expression Networks
bioRxiv.

https://doi.org/10.1101/234062
- Google Scholar
1. Groza T
2. Gomez FL
3. Mashhadi HH
4. Muñoz-Fuentes V
5. Gunes O
6. Wilson R
7. Cacheiro P
8. Frost A
9. Keskivali-Bond P
10. Vardal B
11. McCoy A
12. Cheng TK
13. Santos L
14. Wells S
15. Smedley D
16. Mallon A-M
17. Parkinson H
(2023) The international mouse Phenotyping Consortium: comprehensive knockout phenotyping underpinning the study of human disease
Nucleic Acids Research 51:D1038–D1045.

https://doi.org/10.1093/nar/gkac972
- Google Scholar
1. GTEx Consortium
(2013) The genotype-tissue expression (GTEx) project
Nature Genetics 45:580–585.

https://doi.org/10.1038/ng.2653
- PubMed
- Google Scholar
1. GTEx Consortium
(2017) Genetic effects on gene expression across human tissues
Nature 550:204–213.

https://doi.org/10.1038/nature24277
- Google Scholar
1. Hao Y
2. Hao S
3. Andersen-Nissen E
4. Mauck WM
5. Zheng S
6. Butler A
7. Lee MJ
8. Wilk AJ
9. Darby C
10. Zager M
11. Hoffman P
12. Stoeckius M
13. Papalexi E
14. Mimitou EP
15. Jain J
16. Srivastava A
17. Stuart T
18. Fleming LM
19. Yeung B
20. Rogers AJ
21. McElrath JM
22. Blish CA
23. Gottardo R
24. Smibert P
25. Satija R
(2021) Integrated analysis of multimodal single-cell data
Cell 184:3573–3587.

https://doi.org/10.1016/j.cell.2021.04.048
- PubMed
- Google Scholar
(2017) A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications
Genome Medicine 9:75.

https://doi.org/10.1186/s13073-017-0467-4
- PubMed
- Google Scholar
1. Heaton H
2. Talman AM
3. Knights A
4. Imaz M
5. Gaffney DJ
6. Durbin R
7. Hemberg M
8. Lawniczak MKN
(2020) Souporcell: robust clustering of single-cell RNA-seq data by genotype without reference genotypes
Nature Methods 17:615–620.

https://doi.org/10.1038/s41592-020-0820-1
- PubMed
- Google Scholar
1. Hilgendorf KI
(2021) Primary cilia are critical regulators of white adipose tissue expansion
Frontiers in Physiology 12:769367.

https://doi.org/10.3389/fphys.2021.769367
- PubMed
- Google Scholar
1. Johnell O
2. Kanis JA
3. Oden A
4. Johansson H
5. De Laet C
6. Delmas P
7. Eisman JA
8. Fujiwara S
9. Kroger H
10. Mellstrom D
11. Meunier PJ
12. Melton LJ
13. O’Neill T
14. Pols H
15. Reeve J
16. Silman A
17. Tenenhouse A
(2005) Predictive value of BMD for hip and other fractures
Journal of Bone and Mineral Research 20:1185–1194.

https://doi.org/10.1359/JBMR.050304
- PubMed
- Google Scholar
(2018) Role of fibroblast growth factor receptors (FGFR) and FGFR like-1 (FGFRL1) in mesenchymal stromal cell differentiation to osteoblasts and adipocytes
Molecular and Cellular Endocrinology 461:194–204.

https://doi.org/10.1016/j.mce.2017.09.015
- PubMed
- Google Scholar
(2022) A cell-based GEF assay reveals new substrates for DENN domains and a role for DENND2B in primary ciliogenesis
Science Advances 8:eabk3088.

https://doi.org/10.1126/sciadv.abk3088
- PubMed
- Google Scholar
1. Li B
2. Ritchie MD
(2021) From GWAS to gene: transcriptome-wide association studies and other methods to functionally understand GWAS discoveries
Frontiers in Genetics 12:713230.

https://doi.org/10.3389/fgene.2021.713230
- Google Scholar
1. Lin JT
2. Lane JM
(2004)
Osteoporosis: a review

Clinical Orthopaedics and Related Research 334:126–134.
- PubMed
- Google Scholar
1. Love MI
2. Huber W
3. Anders S
(2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2
Genome Biology 15:550.

https://doi.org/10.1186/s13059-014-0550-8
- PubMed
- Google Scholar
1. Maridas DE
2. DeMambro VE
3. Le PT
4. Mohan S
5. Rosen CJ
(2017) IGFBP4 is required for adipogenesis and influences the distribution of adipose depots
Endocrinology 158:3488–3500.

https://doi.org/10.1210/en.2017-00248
- PubMed
- Google Scholar
Software
(2023) Custom visualizations & functions for streamlined analyses of single cell sequencing, version 1.1.1
Zenodo.

https://doi.org/10.5281/zenodo.7534950
1. Matsushita Y
2. Nagata M
3. Kozloff KM
4. Welch JD
5. Mizuhashi K
6. Tokavanich N
7. Hallett SA
8. Link DC
9. Nagasawa T
10. Ono W
11. Ono N
(2020) A wnt-mediated transformation of the bone marrow stromal cell identity orchestrates skeletal regeneration
Nature Communications 11:332.

https://doi.org/10.1038/s41467-019-14029-w
- PubMed
- Google Scholar
1. Miki H
2. Setou M
3. Kaneshiro K
4. Hirokawa N
(2001) All kinesin superfamily protein, KIF, genes in mouse and human
PNAS 98:7004–7011.

https://doi.org/10.1073/pnas.111145398
- PubMed
- Google Scholar
1. Mizuhashi K
2. Ono W
3. Matsushita Y
4. Sakagami N
5. Takahashi A
6. Saunders TL
7. Nagasawa T
8. Kronenberg HM
9. Ono N
(2018) Resting zone of the growth plate houses a unique class of skeletal stem cells
Nature 563:254–258.

https://doi.org/10.1038/s41586-018-0662-5
- PubMed
- Google Scholar
1. Morris JA
2. Kemp JP
3. Youlten SE
4. Laurent L
5. Logan JG
6. Chai RC
7. Vulpescu NA
8. Forgetta V
9. Kleinman A
10. Mohanty ST
11. Sergio CM
12. Quinn J
13. Nguyen-Yamamoto L
14. Luco A-L
15. Vijay J
16. Simon M-M
17. Pramatarova A
18. Medina-Gomez C
19. Trajanoska K
20. Ghirardello EJ
21. Butterfield NC
22. Curry KF
23. Leitch VD
24. Sparkes PC
25. Adoum A-T
26. Mannan NS
27. Komla-Ebri DSK
28. Pollard AS
29. Dewhurst HF
30. Hassall TAD
31. Beltejar M-JG
32. Adams DJ
33. Vaillancourt SM
34. Kaptoge S
35. Baldock P
36. Cooper C
37. Reeve J
38. Ntzani EE
39. Evangelou E
40. Ohlsson C
41. Karasik D
42. Rivadeneira F
43. Kiel DP
44. Tobias JH
45. Gregson CL
46. Harvey NC
47. Grundberg E
48. Goltzman D
49. Adams DJ
50. Lelliott CJ
51. Hinds DA
52. Ackert-Bicknell CL
53. Hsu Y-H
54. Maurano MT
55. Croucher PI
56. Williams GR
57. Bassett JHD
58. Evans DM
59. Richards JB
60. 23andMe Research Team
(2019) An atlas of genetic influences on osteoporosis in humans and mice
Nature Genetics 51:258–266.

https://doi.org/10.1038/s41588-018-0302-x
- Google Scholar
1. Neavin D
2. Nguyen Q
3. Daniszewski MS
4. Liang HH
5. Chiu HS
6. Wee YK
7. Senabouth A
8. Lukowski SW
9. Crombie DE
10. Lidgerwood GE
11. Hernández D
12. Vickers JC
13. Cook AL
14. Palpant NJ
15. Pébay A
16. Hewitt AW
17. Powell JE
(2021) Single cell eQTL analysis identifies cell type-specific genetic control of gene expression in fibroblasts and reprogrammed induced pluripotent stem cells
Genome Biology 22:76.

https://doi.org/10.1186/s13059-021-02293-3
- PubMed
- Google Scholar
(2009) FGF signalling during embryo development regulates cilia length in diverse epithelia
Nature 458:651–654.

https://doi.org/10.1038/nature07753
- PubMed
- Google Scholar
1. Peacock M
2. Turner CH
3. Econs MJ
4. Foroud T
(2002) Genetics of osteoporosis
Endocrine Reviews 23:303–326.

https://doi.org/10.1210/edrv.23.3.0464
- PubMed
- Google Scholar
1. Phipson B
2. Sim CB
3. Porrello ER
4. Hewitt AW
5. Powell J
6. Oshlack A
(2022) Propeller: testing for differences in cell type proportions in single cell data
Bioinformatics 38:4720–4726.

https://doi.org/10.1093/bioinformatics/btac582
- PubMed
- Google Scholar
Conference
1. Porter MD
2. Smith R
(2010) Network neighborhood analysis
2010 IEEE International Conference on Intelligence and Security Informatics.

https://doi.org/10.1109/ISI.2010.5484781
- Google Scholar
(2010) edgeR: a Bioconductor package for differential expression analysis of digital gene expression data
Bioinformatics 26:139–140.

https://doi.org/10.1093/bioinformatics/btp616
- PubMed
- Google Scholar
(2020) Identification of a core module for bone mineral density through the integration of a co-expression network and GWAS data
Cell Reports 32:108145.

https://doi.org/10.1016/j.celrep.2020.108145
- PubMed
- Google Scholar
1. Sigg MA
2. Menchen T
3. Lee C
4. Johnson J
5. Jungnickel MK
6. Choksi SP
7. Garcia G
8. Busengdal H
9. Dougherty GW
10. Pennekamp P
11. Werner C
12. Rentzsch F
13. Florman HM
14. Krogan N
15. Wallingford JB
16. Omran H
17. Reiter JF
(2017) Evolutionary proteomics uncovers ancient associations of cilia with signaling pathways
Developmental Cell 43:744–762.

https://doi.org/10.1016/j.devcel.2017.11.014
- PubMed
- Google Scholar
1. Steinberg F
2. Zhuang L
3. Beyeler M
4. Kälin RE
5. Mullis PE
6. Brändli AW
7. Trueb B
(2010) The FGFRL1 receptor is shed from cell membranes, binds fibroblast growth factors (FGFs), and antagonizes FGF signaling in xenopus embryos
Journal of Biological Chemistry 285:2193–2202.

https://doi.org/10.1074/jbc.M109.058248
- Google Scholar
1. Street K
2. Risso D
3. Fletcher RB
4. Das D
5. Ngai J
6. Yosef N
7. Purdom E
8. Dudoit S
(2018) Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics
BMC Genomics 19:477.

https://doi.org/10.1186/s12864-018-4772-0
- PubMed
- Google Scholar
1. Thomas PD
2. Ebert D
3. Muruganujan A
4. Mushayahama T
5. Albou L-P
6. Mi H
(2022) PANTHER: making genome-scale phylogenetics accessible to all
Protein Science 31:8–22.

https://doi.org/10.1002/pro.4218
- PubMed
- Google Scholar
(2020) Genetic mapping of etiologic brain cell types for obesity
eLife 9:e55851.

https://doi.org/10.7554/eLife.55851
- PubMed
- Google Scholar
1. Tirosh I
2. Izar B
3. Prakadan SM
4. Wadsworth MH
5. Treacy D
6. Trombetta JJ
7. Rotem A
8. Rodman C
9. Lian C
10. Murphy G
11. Fallahi-Sichani M
12. Dutton-Regester K
13. Lin J-R
14. Cohen O
15. Shah P
16. Lu D
17. Genshaft AS
18. Hughes TK
19. Ziegler CGK
20. Kazer SW
21. Gaillard A
22. Kolb KE
23. Villani A-C
24. Johannessen CM
25. Andreev AY
26. Van Allen EM
27. Bertagnolli M
28. Sorger PK
29. Sullivan RJ
30. Flaherty KT
31. Frederick DT
32. Jané-Valbuena J
33. Yoon CH
34. Rozenblatt-Rosen O
35. Shalek AK
36. Regev A
37. Garraway LA
(2016) Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq
Science 352:189–196.

https://doi.org/10.1126/science.aad0501
- PubMed
- Google Scholar
1. Trueb B
(2011) Biology of FGFRL1, the fifth fibroblast growth factor receptor
Cellular and Molecular Life Sciences 68:951–964.

https://doi.org/10.1007/s00018-010-0576-3
- PubMed
- Google Scholar
(2006) The max-min hill-climbing Bayesian network structure learning algorithm
Machine Learning 65:31–78.

https://doi.org/10.1007/s10994-006-6889-7
- Google Scholar
1. Uusküla-Reimand L
2. Wilson MD
(2022) Untangling the roles of TOP2A and TOP2B in transcription and cancer
Science Advances 8:eadd4920.

https://doi.org/10.1126/sciadv.add4920
- PubMed
- Google Scholar
(2020) Trajectory-based differential expression analysis for single-cell sequencing data
Nature Communications 11:1201.

https://doi.org/10.1038/s41467-020-14766-3
- PubMed
- Google Scholar
(2018) Single-cell RNA sequencing identifies celltype-specific cis-eQTLs and co-expression QTLs
Nature Genetics 50:493–497.

https://doi.org/10.1038/s41588-018-0089-9
- PubMed
- Google Scholar
1. Veldhuis‐Vlug AG
2. Rosen CJ
(2018) Clinical implications of bone marrow adiposity
Journal of Internal Medicine 283:121–139.

https://doi.org/10.1111/joim.12718
- Google Scholar
(2017) Integrating molecular QTL data into genome-wide genetic association analysis: probabilistic assessment of enrichment and colocalization
PLOS Genetics 13:e1006646.

https://doi.org/10.1371/journal.pgen.1006646
- PubMed
- Google Scholar
(2021) Statistical and bioinformatics analysis of data from bulk and single-cell RNA sequencing experiments
Methods in Molecular Biology 2194:143–175.

https://doi.org/10.1007/978-1-0716-0849-4_9
- Google Scholar
1. Zhang R
2. Roostalu J
3. Surrey T
4. Nogales E
(2017) Structural insight into TPX2-stimulated microtubule assembly
eLife 6:e30959.

https://doi.org/10.7554/eLife.30959
- PubMed
- Google Scholar
1. Zhong L
2. Yao L
3. Tower RJ
4. Wei Y
5. Miao Z
6. Park J
7. Shrestha R
8. Wang L
9. Yu W
10. Holdreith N
11. Huang X
12. Zhang Y
13. Tong W
14. Gong Y
15. Ahn J
16. Susztak K
17. Dyment N
18. Li M
19. Long F
20. Chen C
21. Seale P
22. Qin L
(2020) Single cell transcriptomics identifies a unique adipose lineage cell population that regulates bone marrow environment
eLife 9:e54695.

https://doi.org/10.7554/eLife.54695
- PubMed
- Google Scholar
1. Zhu D
2. Xu X
3. Zhang M
4. Wang T
(2022) TPX2 regulated by miR-29c-3p induces cell proliferation in osteosarcoma via the AKT signaling pathway
Oncology Letters 23:143.

https://doi.org/10.3892/ol.2022.13262
- PubMed
- Google Scholar

Article and author information

Author details

Luke J Dillard

Department of Genome Sciences, University of Virginia, Charlottesville, United States

Contribution
Conceptualization, Data curation, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0001-8293-0479
Gina Calabrese

Department of Genome Sciences, University of Virginia, Charlottesville, United States

Contribution
Visualization

Competing interests
No competing interests declared
Larry Mesner

Department of Genome Sciences, University of Virginia, Charlottesville, United States

Contribution
Visualization

Competing interests
No competing interests declared
Charles Farber
1. Department of Genome Sciences, University of Virginia, Charlottesville, United States
2. Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, United States
Contribution
Methodology, Supervision, Funding acquisition, Writing – review and editing

For correspondence
crf2s@virginia.edu

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-6748-4711

Funding

National Institute of Arthritis and Musculoskeletal and Skin Diseases (R01AR68345)

Charles Farber

National Institute of Arthritis and Musculoskeletal and Skin Diseases (R01AR082880)

Charles Farber

National Institute of Arthritis and Musculoskeletal and Skin Diseases (R01AR077992)

Charles Farber

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

Research reported in this publication was supported in part by the National Institute of Arthritis and Musculoskeletal and Skin Diseases of the National Institutes of Health under award numbers R01AR68345, R01AR082880, and R01AR077992 to CRF.

Ethics

All animal procedures were conducted in compliance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals. The protocol for studies involving Diversity Outbred mice (Protocol Number 3741) was reviewed and approved by the Institutional Animal Care and Use Committee (IACUC) at the University of Virginia.

Version history

Preprint posted: May 21, 2024
Sent for peer review: July 24, 2024
Reviewed Preprint version 1: December 11, 2024
Reviewed Preprint version 2: September 24, 2025
Version of Record published: March 11, 2026

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.100832. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

1,164

views
81

downloads
2

citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Citations by DOI

1

citation for umbrella DOI https://doi.org/10.7554/eLife.100832

1

citation for Reviewed Preprint v1 https://doi.org/10.7554/eLife.100832.1

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Mendeley

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Luke J Dillard
Gina Calabrese
Larry Mesner
Charles Farber

(2026)

Cell type-specific network analysis in Diversity Outbred mice identifies genes potentially responsible for human bone mineral density GWAS associations

eLife 13:RP100832.

https://doi.org/10.7554/eLife.100832.3