Using population selection and sequencing to characterize natural variation of starvation resistance in Caenorhabditis elegans

Abstract
Editor's evaluation
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Starvation resistance is important to disease and fitness, but the genetic basis of its natural variation is unknown. Uncovering the genetic basis of complex, quantitative traits such as starvation resistance is technically challenging. We developed a synthetic-population (re)sequencing approach using molecular inversion probes (MIP-seq) to measure relative fitness during and after larval starvation in Caenorhabditis elegans. We applied this competitive assay to 100 genetically diverse, sequenced, wild strains, revealing natural variation in starvation resistance. We confirmed that the most starvation-resistant strains survive and recover from starvation better than the most starvation-sensitive strains using standard assays. We performed genome-wide association (GWA) with the MIP-seq trait data and identified three quantitative trait loci (QTL) for starvation resistance, and we created near isogenic lines (NILs) to validate the effect of these QTL on the trait. These QTL contain numerous candidate genes including several members of the Insulin/EGF Receptor-L Domain (irld) family. We used genome editing to show that four different irld genes have modest effects on starvation resistance. Natural variants of irld-39 and irld-52 affect starvation resistance, and increased resistance of the irld-39; irld-52 double mutant depends on daf-16/FoxO. DAF-16/FoxO is a widely conserved transcriptional effector of insulin/IGF signaling (IIS), and these results suggest that IRLD proteins modify IIS, although they may act through other mechanisms as well. This work demonstrates efficacy of using MIP-seq to dissect a complex trait and it suggests that irld genes are natural modifiers of starvation resistance in C. elegans.

Editor's evaluation

The authors identify natural genetic variants in C. elegans that are associated with variation in starvation resistance. The authors focus on a gene family (irld's) that are thought to regulate insulin signaling. These studies are very interesting in that the approach for identifying natural gene variants is highly innovative and the work provides novel information about this family of genes.

https://doi.org/10.7554/eLife.80204.sa0

Introduction

Given tremendous sequencing capacity, digitization of phenotypes by counting DNA molecules in mixed-genotype populations can provide unprecedented sensitivity and precision. Population-selection-and-sequencing approaches for genetic analysis were developed in bacteria and yeast, enabling large numbers of genetic perturbations to be assayed in parallel (Han et al., 2010; Kwon et al., 2016; Nislow et al., 2016), and CRISPR subsequently enabled related approaches in mammalian cells (Gilbert et al., 2014; Koike-Yusa et al., 2014; Shalem et al., 2014; Wang et al., 2014). With its small size, genetic toolkit, and genomic resources, the nematode C. elegans is an ideal animal model to develop selection-and-sequencing approaches to organismal phenotypes. Such approaches have been described for mapping causal loci from recombinants between a pair of divergent strains in C. elegans (Mok et al., 2017; Burga et al., 2019). We described a population-sequencing approach based on pooling many wild strains (Webster et al., 2019), but it lacked power since only very rare sequencing reads that include single-nucleotide variants (SNVs) unique to a strain in the pool informed inference of relative strain frequency. By capturing targeted sequences, MIP-seq enables extremely deep sequencing of polymorphic loci (Cantsilieris et al., 2017; Mok et al., 2017), but it has not been applied to populations of wild strains.

Enduring periods of starvation is a near-ubiquitous feature of animal life that affects survival, growth, and reproduction, making starvation resistance a fitness-proximal trait. Starvation resistance is also important to human health and disease, with direct relevance to diabetes, obesity, aging, and cancer. Despite its importance to understanding animal evolution and informing therapeutic strategies, however, the genetic basis of natural variation in starvation resistance is unclear. The nematode C. elegans is frequently starved in the wild and has robust starvation responses (Schulenburg and Félix, 2017; Baugh and Hu, 2020). Larvae that hatch in the absence of food arrest development in the first larval stage (L1 arrest) and can survive starvation for weeks (Baugh, 2013). In addition to causing mortality, extended starvation reduces growth and reproductive success upon feeding (Jobson et al., 2015; Jordan et al., 2019), and these effects can be uncoupled by genotype or condition (Roux et al., 2016; Kaplan et al., 2018; Chen et al., 2022 and reviewed in Baugh and Hu, 2020). Starvation resistance therefore integrates survival, growth rate, and reproductive success, and different genes and conditions can affect these phenotypes independently. Insulin/IGF signaling (IIS) is a critical regulator of L1 arrest (Baugh and Sternberg, 2006). There is a single known insulin/IGF-like receptor in C. elegans, DAF-2/InsR, which signals through a conserved phosphatidylinositol 3-kinase (PI3K) signaling pathway to antagonize the transcription factor DAF-16/FoxO (Lin et al., 1997; Ogg et al., 1997). When IIS is reduced, such as during starvation, DAF-16 moves to the nucleus and regulates transcription (Henderson and Johnson, 2001; Lee et al., 2001; Lin et al., 2001), promoting starvation resistance (Muñoz and Riddle, 2003; Baugh and Sternberg, 2006; Hibshman et al., 2017). (Roux et al., 2016; Kaplan et al., 2018; Baugh and Hu, 2020; Chen et al., 2022).

Here, we describe the development of MIP-seq for statistical genetic analysis of complex traits in C. elegans. We used MIP-seq to analyze starvation resistance in a pool of genetically diverse wild strains, identifying relatively starvation-resistant and sensitive strains. We identified and validated three QTL that affect starvation resistance and contain numerous candidate variants. Our results suggest that multiple members of the irld gene family affect aspects of starvation resistance, and they suggest they do so at least in part by modifying IIS.

Results

Sensitive and precise measurement of strain frequency in pooled culture using MIP-seq

We selected 103 genetically diverse, wild C. elegans strains from around the world including the laboratory reference N2 to test MIP-seq, ultimately phenotyping 100 for starvation resistance (Figure 1A–B). MIPs are designed to capture a specific region of the genome for targeted multiplex sequencing (Figure 1C). We designed MIPs to target a region containing a SNV unique to each of 103 strains. Thus, the relative frequency of each strain in a pool can be determined by the SNV frequency. We designed four such MIPs per strain to provide redundancy and increase precision. To pilot MIP-seq, we prepared sequencing libraries from an equimolar mix of genomic DNA from each of 103 strains. We determined the frequency of strain-specific reads for each MIP, and we censored probes that produced frequencies substantially different than the expected value of approximately 0.01 (Figure 1D; criteria in Materials and methods), leaving three or four reliable probes for 85% of strains and at least one MIP for 100 strains (Figure 1E, Supplementary file 1), which were used in the starvation resistance experiment. Three strains with no MIPs passing filtering were excluded from subsequent analysis. As an additional pilot, we mixed genomic DNA from a subset of strains at different concentrations to prepare a standard curve. MIP-seq accurately measured individual strain frequencies over three orders of magnitude (Figure 1F), and greater sequencing depth could theoretically expand the dynamic range.

Figure 1

Download asset Open asset

Sensitive and precise measurement of strain frequency in pooled culture using MIP-seq.

(A) The three metrics used to identify the most diverse *C. elegans* strains are plotted. ‘MAF’ stands for minor allele frequency. Concordance refers to the average pairwise concordance for the focal strain compared to all other strains, which is calculated as the number of shared variant sites divided by the total number of variants for each pair. The strains included in the MIP-seq experiments are in red. (B) Geographic locations of the strains assayed for starvation resistance. (C) Schematic of MIP-seq. MIPs are designed for loci with SNVs unique to each strain. Four MIPs were designed per strain. MIPs are 80 nt long and include ligation and extension arms to match DNA sequence surrounding the SNV, a unique molecular identifier (UMI), and P5 and P7 sequences for Illumina sequencing. MIPs are hybridized to genomic DNA, polymerized, ligated, and used as PCR template to generate an Illumina sequencing library. The alternative-to-total read frequency for each MIP/SNV locus indicates strain frequency. (D) Empirical testing of 412 MIPs with an equimolar mix of genomic DNA from 103 strains to identify reliable MIPs. 321 MIPs passed filtering and were analyzed in the starvation experiment. Outliers for filtered MIPs are for N2, which has hardly any unique SNVs because it is derived from the reference genome. N2 MIPs were included despite poor performance. (E) Number of MIPs per strain of the 321 filtered MIPs that passed filtering. (F) Genomic DNA from seven strains was combined at different known concentrations, and MIP-seq was used to generate a standard curve. Included MIPs all passed filtering. (R²=0.99).

Using MIP-seq to characterize natural variation of starvation resistance

We used MIP-seq to phenotype 100 diverse strains for starvation resistance. We cultured the strains in standard laboratory conditions, pooled them, and subjected them to starvation during L1 arrest. We aimed for approximately 5000 L1 larvae per strain in the pooled starvation culture in order to ensure representative sampling. However, we expected actual representation to vary across strains and replicates, so we collected DNA from an aliquot of L1 larvae on the first day of starvation as a ‘baseline’ sample to capture initial population composition. In addition, aliquots were taken from the pool at days 1, 9, 13, and 17 of starvation, and sampled larvae were allowed to recover with food for 4 or 5 days (depending on the duration of starvation), enabling reproduction for 1 day, and then the entire population was collected for DNA preparation (Figure 2A). DNA from baseline samples, as well as samples allowed to recover and reproduce following starvation, were sequenced with MIP-seq for each of five biological replicates. It is critical to point out that by incorporating recovery and early fecundity, this sampling scheme integrates effects of starvation on mortality as well as growth rate and reproductive success, each of which are important for starvation resistance (i.e. fitness) and can be uncoupled by certain genotypes and conditions (Baugh and Hu, 2020).

Figure 2 with 4 supplements see all

Download asset Open asset

MIP-seq determines relative starvation resistance of 100 strains.

(A) Experimental design. Worms were starved at the L1 stage ('L1 arrest').~5000 L1 larvae per strain were starved (~500,000 total). The population of starved L1 larvae was sampled initially (‘baseline’ on day 1), and then sampled on the days indicated. Samples (except baseline) were recovered with food in liquid culture, reaching adulthood and producing progeny for 1 day, and the entire population was frozen for DNA isolation. Five biological replicates were performed. (B) Principal component 1 of normalized and processed data from all replicates (replicate-level) and strains is plotted, revealing association with duration of starvation. Each point is an individual sample (MIP-seq library). (C) The relationship between two starvation-resistance metrics (Slope and PC1) produced from strain-level data (replicates averaged) is plotted. Each point is a different strain. (D) Log₂-normalized strain frequency is plotted over time for the 25 most resistant and 25 most sensitive strains in rank order (based on Slope). Only days 1, 9, and 13 are plotted. See Figure 2—figure supplement 2 for full data. Grey lines are biological replicates and black line is the mean. DL238 and EG4725 are most starvation-resistant, and NIC526 and MY2147 are most sensitive, and they are color-coded accordingly. (E) L1 starvation survival curves are plotted for starvation-resistant and sensitive strains. Individual replicate measurements are included as points to which curves were fit with logistic regression. T-tests on 50% survival time of four biological replicates. (F) Worm length following 48 hours of recovery with food after 1 or 12 days of L1 starvation. (G) Number of progeny produced between 48 and 72 hr of recovery on food following 1 or 8 days of starvation. (**F,G**) ΔΔ indicates effect size of interaction between duration of starvation and strain data plotted in that panel compared to the strain listed (the difference in differences between strains’ mean length at days 1 and 12 or between mean number of progeny at days 1 and 8). ‘MY’ is an abbreviation for MY2147 and ‘NIC’ is an abbreviation for NIC526. Linear mixed-effects model; one-way p-value of interaction between duration of starvation and strain. (**E–G**) ***p<0.001, **p<0.01, *p<0.05.

Figure 2—source data 1 Source data for manual starvation resistance assays of wild strains.: https://cdn.elifesciences.org/articles/80204/elife-80204-fig2-data1-v2.xlsx
Download elife-80204-fig2-data1-v2.xlsx

DNA from baseline samples allowed us to effectively normalize differences in pool composition in each replicate, revealing effects of starvation on strain frequency. Differences in pool composition explained the first component in principal component analysis (PCA) when strain frequencies over time were analyzed without consideration of baseline frequencies (Figure 2—figure supplement 1A). However, once the data were normalized for initial strain composition using the baseline sample for each replicate, the first principal component correlated with duration of starvation, especially across the first three time points (Figure 2B, Figure 2—figure supplement 1B). Substantial mortality occurred by day 17 (Figure 2—figure supplement 1C), and day 17 recovery cultures thus produced relatively few progeny. Consequently, differences in strain frequencies were actually smaller at day 17 than 13, but relative differences were conserved (Figure 2—figure supplement 2). After normalization, duration of starvation is the major factor accounting for differences in strain frequency across all samples, and this is robust to differences in the initial composition of the pool across replicates.

We developed two metrics to quantify relative starvation resistance for each strain. ‘Slope’ is a measure of how much a strain increases or decreases in frequency over time across days 1, 9, and 13, calculated as the slope of a linear model (Supplementary file 1). ‘PC1’ is the value of the first principal component for each strain from strain-level PCA (Figure 2B, Figure 2—figure supplement 1B). These two metrics are correlated but also show some differences (Figure 2C), suggesting they capture related but also distinct features of the data. While Slope is intuitive, it is limited by the use of a linear model. Nonetheless, Slope values are correlated with starvation-resistance values produced from a previously published population-sequencing approach with less power that included some of the same strains (Figure 2—figure supplement 1D; Webster et al., 2019). In addition, Slope is modestly correlated with the latitude from which strains were collected, suggesting possible adaptation to starvation or other correlated traits based on location (Figure 2—figure supplement 3). There is also a modest negative correlation between Slope and growth rate after only one day of starvation (control condition) (Figure 2—figure supplement 4), suggesting a possible trade-off between starvation resistance and population growth rate in the absence of stress. We used the Slope metric to order strains from most resistant to sensitive, revealing differences in starvation resistance between wild strains (Figure 2D, Figure 2—figure supplement 2). In contrast to Slope, PC1 does not assume linearity, it includes the results from day 17 of starvation, and it may be less affected by noise. PCA is also an established way to obtain trait values for GWA studies (Ried et al., 2016; Yano et al., 2019).

Our recovery-based sampling approach integrates starvation survival, recovery, and early fecundity into a single fitness assay. It is therefore unclear whether a given strain is more or less resistant because of differences in mortality, growth rate, progeny production, or some combination. It is also unclear what the absolute effect sizes are between the most resistant and sensitive strains in this competition assay. Nonetheless, our approach is intended to model the impact of larval starvation on fitness broadly, while traditional assays can be used to isolate specific effects of starvation on survival, growth, and reproduction in follow-up experiments.

We performed manual assays for starvation survival, growth rate, and early fecundity for the most resistant and sensitive strains. We found starvation-resistant strains DL238 and EG4725 survived starvation significantly longer during L1 arrest than sensitive strains MY2147 and NIC526 (Figure 2E). Differences in starvation survival among wild strains are relatively small compared to some published mutants in the N2 reference background (Baugh and Hu, 2020). After extended L1 arrest, DL238 and EG4725 recovered from starvation better than MY2147 but not NIC526, as assessed by their size following 48 hr of recovery (Figure 2F). Finally, DL238 and EG4725 exhibited a larger early brood size following extended starvation compared to both MY2147 and NIC526. Overall, this demonstrates that differences in starvation resistance among wild strains are driven by differences in survival, recovery, and early fecundity, but that sensitivity of NIC526 is apparently driven by differences in survival and early fecundity without an appreciable effect on growth. These results validate the MIP-seq approach and reveal the extent of natural variation in starvation resistance.

Natural variation in irld gene family members affects starvation resistance

We used Slope and PC1 as trait values to perform GWA using the Caenorhabditis elegans Natural Diversity Resource (CeNDR) (Cook et al., 2017). GWA identified QTL on the right arm of chromosome IV and on the left and right arms of chromosome V (Figure 3A–B, Supplementary file 2). We confirmed that each QTL affected starvation resistance by generating NILs and measuring growth rate upon recovery from starvation (Figure 3—figure supplement 1). We chose this assay, as opposed to starvation survival or fecundity, because it revealed relatively robust differences between DL238/EG4725 (resistant) and MY2147 (sensitive) (Figure 2F).

Figure 3 with 1 supplement see all

Download asset Open asset

Genetic variation in the *irld* gene family underlies differences in starvation resistance.

(A) GWA output using Slope as a trait value. Significant QTL intervals are IV: 15939340–16613710 and V: 15660911–17615557. (B) GWA output using PC1 as a trait value. Significant QTL intervals are V: 1345848–2764788 and V: 15775895–18065050. (C) WormCat Category 3 enrichments for all genes with variants in the QTL. (D) Fold-enrichment of protein domains significantly enriched among genes with variants in QTL. A hypergeometric p-value was calculated for each of 102 protein domains present, and a Bonferroni-corrected p-value of 0.00049 was used as a threshold to determine significance. Red indicates the receptor L domain, which is found in *irld* genes. (E) All variants in *irld* genes that are within significant QTL and their association with the starvation-resistance traits, Slope and PC. Each gene name is shown next to the most significant variant for that gene, but multiple variants are plotted for each gene when present. Red indicates genes selected for functional validation. (F) Slope trait values for strains based on whether they have ALT and REF alleles for specific *irld-39* and *irld-52* variants predicated to disrupt protein function. The *irld-52* variant p-value is p=0.007 for the PC1 trait value (only the Slope trait value is shown here). Significance determined from GWA fine mapping. (G) Slope trait values for strains based on whether they are hyper-divergent or not at *irld-11* and *irld-57* loci. T-test on trait values between hyper-divergent and non-divergent strains. (**F–G**) DL238, EG4725, NIC526, and MY2147 are color-coded as indicated. (H) The four *irld* genes selected for genome editing and the edits generated for each. For *irld-39 and irld-52,* N2 and MY2147 have the REF allele and were edited to have the ALT allele. *irld-11* and *irld-57* are hyper-divergent in DL238 and EG2745 backgrounds, so full gene deletions were generated in N2 and MY2147 backgrounds. (I) L1 starvation survival assays on *irld-39* and *irld-52* ALT alleles in N2 and MY2147 backgrounds. There were no significant differences between strains within a background. (J) L1 starvation survival assays on *irld-11* and *irld-57* deletions in N2 and MY2147 backgrounds. There were no significant differences between strains within a background. (**K–L**). Worm length following 48 hr recovery with food after 1 or 8 d of L1 starvation for indicated genotypes. Linear mixed-effects model; one-way p-value for interaction between strain and duration of starvation; 4–5 biological replicates per condition. ΔΔ indicates effect size of interaction between duration of starvation and strain compared to control (the difference in differences between strains’ mean length at days 1 and 8). (**F,G,K,L**) ***p<0.001, **p<0.01, *p<0.05, n.s. not significant.

Figure 3—source data 1 Source data for starvation resistance assays of irld strains.: https://cdn.elifesciences.org/articles/80204/elife-80204-fig3-data1-v2.xlsx
Download elife-80204-fig3-data1-v2.xlsx

These QTL are relatively large, ranging from 0.7 to 2.2 Mb, and include many candidate variants (Supplementary file 2) across 867 genes, which are enriched for several large gene families. WormCat analysis identified significant enrichments of serpentine receptors, nuclear hormone receptors, and C-type lectins (Figure 3C; Holdorf et al., 2020). Likewise, protein-domain enrichment analysis (Finn et al., 2011) identified seven-pass transmembrane domains and hormone receptor domains (Figure 3D). In addition, the receptor L domain was significantly enriched, which is found in proteins comprising Insulin/EGF-Receptor L Domain (IRLD) family (Dlakić, 2002). Given weak homology to DAF-2/InsR, and the critical role of IIS in regulation of starvation resistance, we were intrigued at the possibility that natural variation in irld family genes may impact starvation resistance, lthough it should be noted that the QTL contain numerous additional candidates that could affect the trait. Across all three QTL identified, there are genetic variants in 16 irld genes, and 68 genes have been identified as part of this family in C. elegans (Hobert, 2013). Multiple variants are present for most irld genes, and variants differed in the degree to which they were associated with variation in starvation resistance (Figure 3E).

We selected at least one irld gene from each QTL for functional analysis. On the left arm of chromosome IV, a variant in irld-39 was the strongest individual candidate among all genes because of its strong association with starvation resistance and because the variant is predicted to disrupt the start codon of the gene (Figure 3E and F, Supplementary file 2), likely rendering irld-39 a functional null in the starvation-resistant strain DL238. However, this was not functionally validated, and it is possible that this variant affects expression of the neighboring irld gene, hpa-1. On the right arm of chromosome V, irld-52 was identified through both Slope and PC1 phenotype metrics and contains a variant associated with starvation resistance predicted to disrupt its fifth exon with a frameshift (Figure 3E and F), though this was not functionally validated and it is unclear if the variant causes a null mutation. While analyzing variants on the left arm of chromosome V, we noticed that many irld genes are adjacent to each other and that each contain many variants. In particular, irld-11, irld-44, and irld-45 are clustered, and each gene contains over 50 genetic variants. This pattern of some loci containing many variants relative to N2 has been broadly observed, leading to identification of ‘hyper-divergent’ regions of the genome containing exceptional amounts of variation (Lee et al., 2021). irld-11, irld-44, and irld-45 are part of a hyper-divergent region, and because they are so tightly linked, they are hyper-divergent in the same strains. We found that hyper-divergence at these loci was associated with starvation resistance (Figure 3G). irld-57 is also in a hyper-divergent region on the right arm of chromosome V, and hyper-divergence at this locus is also associated with starvation resistance (Figure 3G). Given several variants predicted to disrupt protein function in each, we believe irld-11 and irld-57 are null in the hyper-divergent context, though this has not been functionally demonstrated. Notably, associations between variants or hyper-divergence and Slope (Figure 3F and G) together with their predicted negative impacts on gene function (Figure 3H) suggests that disruption of these four irld genes in backgrounds where they are functional will increase starvation resistance.

We used CRISPR-Cas9 genome editing to determine functional consequences of genetic modification of our candidate irld genes. Because irld-39 and irld-52 contain singular variants associated with starvation resistance and predicted to disrupt protein function, we generated these specific variants in the starvation-sensitive MY2147 and the laboratory-reference N2 backgrounds (Figure 3H). Since irld-11 and irld-57 contain so many candidate variants, we deleted these genes in MY2147 and N2, rendering them null at each locus (Figure 3H). Edits of irld-39 and irld-52 are more likely to approximate the effect of specific variants in the wild, because they are the exact variants present in starvation-resistant wild strains. None of the alleles in either background significantly affected survival (Figure 3I and J). A power analysis suggests there is sufficient statistical power to detect differences of approximately 2 days or greater, suggesting there is not a difference of at least this magnitude. However, alleles for all four irld genes mitigated the effect of starvation on growth rate in the MY2147 background but not N2 (Figure 3K and L). This suggests that MY2147, as a more starvation-sensitive background than N2, facilitates detection of alleles that increase starvation resistance. These results show that multiple types of variants in different irld family members reduce the effect of extended L1 starvation on recovery, suggesting four individual genes from this family affect this aspect of starvation resistance in wild strains. Notably, none of the engineered variants affected the trait to a similar extent as the NILs, suggesting that other variants within each QTL also affect the trait.

IRLD-39 and IRLD-52 act through DAF-16/FoxO

We hypothesized that irld-39 and irld-52 have additive phenotypic effects, and that combining our two engineered alleles would reveal an effect in N2. An irld-39(duk1); irld-52(duk17) double mutant did not significantly increase starvation survival in the N2 background (Figure 4A). In this case, there was sufficient statistical power to detect differences of approximately 1.5 days or greater. However, the double mutant displayed a modest but significant increase in growth following 8 days of starvation, consistent with single mutants in the MY2147 background (Figure 4B). Furthermore, the double mutant significantly increased early fecundity following starvation (Figure 4C). These results further support the conclusion that natural variation in irld-39 and irld-52 affects starvation resistance. Notably, these two variants are both present in the most starvation-resistant strain identified, DL238 (Figure 3F).

Figure 4 with 2 supplements see all

Download asset Open asset

IRLD-39 and IRLD-52 together impact starvation resistance and depend on DAF-16.

(A) Survival curves of *irld-39(duk1); irld-52(duk17*) and N2 throughout L1 starvation. The apparent increase in starvation survival in the double mutant is not statistically significant (P=0.14). (B) Worm length of *irld-39(duk1); irld-52(duk17*) and N2 following 48 hr of recovery with food after 1 or 8 days of L1 starvation. (C) Number of progeny produced between 48 and 72 hr of recovery with food after 1 or 5 days of L1 starvation. (D) Worm length of N2, *irld-39(duk1); irld-52(duk17), daf-16(mu86),* and *daf-16(mu86); irld-39(duk1); irld-52(duk17*) following 48 hr of recovery with food after 1 or 4 days of L1 starvation. (**B–D**) Linear mixed-effects model with duration of L1 starvation and genotype as fixed effects and the number of replicates as a random effect; p-value calculated for interaction between fixed effects. ΔΔ indicates effect size of interaction between duration of starvation and strain compared to control. (E) Nuclear localization of DAF-16::GFP in intestinal cells of starved L1s ~36 hr after hatching. Each point represents the result of a single independent biological replicate with 51–64 worms scored for each condition and replicate, with a line connecting the two genotypes in each replicate. The Cochran-Mantel-Haenszel test was used to determine differences in the distribution of the two categories (nuclear and cytoplasmic) between *daf-16(ot971*) (wild type) and *daf-16(ot971); irld-39(duk1); irld-52(duk17*) (*irld-39; irld-52*). Images of intestinal nuclear and cytoplasmic localization are shown. (**A–E**) Four to six biological replicates were performed per experiment. ***p<0.001, **p<0.01, *p<0.05, n.s. not significant.

Figure 4—source data 1 Source data for starvation resistance assays of irld-39(duk1); irld-52(duk17). Source data for figures resulting from MIP-seq, NIL-seq, RNA-seq, or enrichment analysis is available in Supplementary files 1-3.: https://cdn.elifesciences.org/articles/80204/elife-80204-fig4-data1-v2.xlsx
Download elife-80204-fig4-data1-v2.xlsx

Given weak homology between IRLD proteins and the extracellular domain of DAF-2/InsR, we wondered if IRLD-39 and IRLD-52 modify IIS, as originally proposed (Dlakić, 2002). We therefore hypothesized that increased starvation resistance with disruption of irld-39 and irld-52 depends on daf-16/FoxO. Again, the irld-39; irld-52 double mutant displayed significant mitigation of the effect of starvation on growth (Figure 4D). This result corroborates the effect of the double mutant after 8 days of starvation (Figure 4B), except after only 4 days in this case (4 days of starvation was used since the daf-16 mutant is starvation-sensitive). We found no significant difference in the effect of starvation on growth between the null mutant daf-16(mu86) and daf-16(mu86); irld-39(duk1); irld-52(duk17), suggesting that increased starvation resistance of irld-39(duk1); irld-52(duk17) is dependent on daf-16 (Figure 4D). This genetic epistasis is consistent with DAF-16/FoxO activity being increased in the irld-39(duk1); irld-52(duk17) double mutant. In support of this hypothesis, nuclear localization of endogenous DAF-16 (Aghayeva et al., 2020) in intestinal cells, a proxy of its activity, was significantly increased in irld-39(duk1); irld-52(duk17) mutants (Figure 4E). However, this is a relatively modest difference in nuclear localization, and it is unclear where in the animal DAF-16 activity is most relevant in this context. Nonetheless, genetic epistasis and nuclear localization assays suggest that IRLD-39 and IRLD-52 act through DAF-16/FoxO to affect starvation resistance during L1 arrest.

Discussion

Our results illustrate the power of MIP-seq as a population selection-and-sequencing approach for analysis of complex traits in C. elegans. MIP-seq can be used in any organism with known sequence variants and that can be cultured in sufficiently large numbers with the ability to select on the trait of interest. With sufficient population genetic complexity and sequencing depth, meaningful phenotypic differences too small or variable to be detected by manual assays can be discovered, leading to improved understanding of gene-by-environment interactions and the genotype-to-phenotype map. When complex traits are highly polygenic (Boyle et al., 2017), it is critical to leverage the power of sequencing to elucidate their architectures. Here we used MIP-seq with a large panel of wild strains for statistical genetic analysis, but it can also be used with panels of recombinant lines for high-resolution gene mapping (Mok et al., 2017). MIP-seq can also be used for phenotypic analysis of mutants where it is beneficial to boost sensitivity and precision by using sequencing to count exceptionally large numbers of individuals (Shendure et al., 2017; Mok et al., 2020).

We characterized natural variation in starvation resistance in a set of genetically diverse, wild strains of C. elegans using MIP-seq and traditional assays. Our results suggest relatively little phenotypic variation of this presumably fitness-proximal trait. Nonetheless, we validated three QTL and showed four irld genes in these QTL impact starvation recovery. For irld-11 and irld-57, we generated deletion mutants, which do not precisely match the variants present in wild strains. For irld-39 and irld-52, the engineered alleles match starvation-resistant strains, but we have not confirmed their loss of function. Thus, our results suggest, but do not definitively demonstrate, that variation in irld genes affects starvation resistance in this species. The irld gene family is expanded relative to other Caenorhabditis species, suggesting that expansion (or contraction) of gene families influences natural variation and possibly evolutionary adaptation in this context. In addition, two of the irld genes identified are in hyper-divergent regions of the genome, consistent with genes in these regions contributing to environmental responses (Lee et al., 2021). However, irld variants investigated each had relatively weak phenotypic effects compared to the NILs, suggesting they do not fully account for natural variation in the trait associated with the QTLs. This implies other variants (Supplementary file 2), possibly of larger effect, also contribute to phenotypic variation.

Genetic epistasis analysis suggests that the effect of irld-39 and irld-52 on starvation resistance depends on daf-16/FoxO, and the double mutant increases DAF-16 nuclear localization, suggesting that these irld genes modify IIS. However, irld-39 and irld-52 could affect DAF-16 activity independent of IIS and could also affect other signaling pathways. IRLDs also bear weak homology to EGF receptors, and irld family members hpa-1 and hpa-2 affect healthspan by modifying EGF signaling (Iwasa et al., 2010). It is not known whether EGF signaling affects starvation resistance or other aspects of L1 arrest, and future work is needed to address the possible role of irld genes affecting EGF signaling in this context.

Ideas and speculation

Given the proposal that IRLD proteins modify IIS, it is intriguing to speculate that they do so by binding any of the 40 insulin-like peptides (ILPs) that would otherwise agonize or antagonize DAF-2/InsR (Pierce et al., 2001), as suggested previously (Dlakić, 2002). DAF-2B is an alternative isoform of DAF-2/InsR that includes the extracellular domain but lacks the tyrosine kinase domain, like the IRLD proteins, and it is also thought to act this way (Martinez et al., 2020). This hypothetical mechanism is also analogous to the proposed function of insulin-like growth factor (IGF)-binding proteins, which affect circulation and receptor binding of IGF proteins (Allard and Duan, 2018). These parallels suggest the possibility that natural variation in the IGF-binding protein family (Rotwein, 2017) contributes to phenotypic variation in humans. However, we have not shown that IRLD proteins actually bind ILPs, and a variety of uncertainties remain regarding their function.

Expression analysis provides clues to how irld genes possibly influence starvation resistance. Published whole-animal mRNA-seq analysis of fed and starved L1 larvae (Webster et al., 2018) revealed relatively low expression levels of the entire irld family (Figure 4—figure supplement 1). However, about half of the irld genes were differentially expressed, and all of those were upregulated in starved larvae, suggesting a role in starvation. We also interrogated existing single-cell RNA-seq datasets. One includes the major tissue types in fed L2-stage larvae (Cao et al., 2017), and it suggests that irld genes are most prominently expressed in ciliated sensory neurons, though there is expression in other neurons and tissues (Figure 4—figure supplement 2). Another study focused on neurons in fed L4-stage larvae (Taylor et al., 2021), and it suggests that irld gene expression is more prominent in sensory neurons than other neuron types (Figure 4—figure supplement 2). irld-39 is expressed in ASJ sensory neurons, along with distal tip cells and vulval precursors (Figure 4—figure supplement 2). irld-52 is expressed in the ADL sensory neurons and also intestinal rectal muscle cells. C. elegans sensory neurons are polymodal and influence life-history traits regulated by IIS, including dauer formation, aging, and L1 arrest (Bargmann and Horvitz, 1991; Vowels and Thomas, 1992; Apfeld and Kenyon, 1999). ASJ is known to express the relatively potent ILP DAF-28 in nutrient and sensory-dependent fashion (Li et al., 2003; Kaplan et al., 2018), and daf-28 affects L1 starvation survival (Chen and Baugh, 2014). ins-4/ILP is also expressed in ASJ, and it too affects L1 starvation survival (Chen and Baugh, 2014). If IRLD-39 and IRLD-52 proteins are translated and function in the vicinity of these sensory neurons, that would allow them to exert their influence at the interface of the animal and its environment.

Share this article

Cite this article

Sensitive and precise measurement of strain frequency in pooled culture using MIP-seq.

MIP-seq determines relative starvation resistance of 100 strains.

Figure 2—source data 1

Genetic variation in the irld gene family underlies differences in starvation resistance.

Figure 3—source data 1

IRLD-39 and IRLD-52 together impact starvation resistance and depend on DAF-16.

Figure 4—source data 1

Author details

Amy K Webster

Present address

Contribution

Competing interests

Rojin Chitrakar

Contribution

Competing interests

Maya Powell

Present address

Contribution

Competing interests

Jingxian Chen

Contribution

Competing interests

Kinsey Fisher

Contribution

Competing interests

Robyn E Tanny

Contribution

Competing interests

Lewis Stevens

Present address

Contribution

Competing interests

Kathryn Evans

Contribution

Competing interests

Angela Wei

Contribution

Competing interests

Igor Antoshechkin

Contribution

Competing interests

Erik C Andersen

Contribution

Competing interests

L Ryan Baugh

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism