Decoupled maternal and zygotic genetic effects shape the evolution of development
Abstract
Evolutionary transitions from indirect to direct development involve changes in both maternal and zygotic genetic factors, with distinctive population-genetic implications, but empirical data on the genetics of such transitions are lacking. The polychaete Streblospio benedicti provides an opportunity to dissect a major transition in developmental mode using forward genetics. Females in this species produce either small eggs that develop into planktonic larvae or large eggs that develop into benthic juveniles. We identify large-effect loci that act maternally to influence larval size and independent, unlinked large-effect loci that act zygotically to affect discrete aspects of larval morphology. The likely fitness of zygotic alleles depends on their maternal background, creating a positive frequency-dependence that may homogenize local populations. Developmental and population genetics interact to shape larval evolution.
https://doi.org/10.7554/eLife.37143.001Introduction
Many animals develop indirectly via larval stages that are morphologically and ecologically distinct from their adult forms. Hundreds of lineages across animal phylogeny have secondarily lost larval forms, instead producing offspring that directly develop into adult form without a distinct larval ecological niche (Raff, 1996; Smith et al., 2007; Strathmann, 1985; Thorson, 1950). Indirect development in the sea is typically planktotrophic: females produce large numbers of small offspring that require exogenous planktonic food to develop before metamorphosing into benthic juveniles. Direct development is typically lecithotrophic: females produce a smaller number of larger eggs, each developing into a juvenile without the need for larval feeding, provisioned by yolk. Evolutionary theory suggests that these alternative developmental strategies represent stable alternative fitness peaks, while intermediate states are disfavored (Christiansen and Fenchel, 1979; Havenhand, 1995; Strathmann, 1985; Vance, 1973). Transitions from planktotrophy to lecithotrophy thus represent radical and coordinated transformations of life-history, fecundity, ecology, dispersal, and development (Duda and Palumbi, 1999; Jeffery and Emlet, 2003; McEdward and Miner, 2001; Raff, 1996; Romiguier et al., 2014; Wray, 1995). Here we dissect this transition in the polychaete annelid Streblospio benedicti, a genetically tractable species that harbors both states as heritable variation (Levin, 1984; Levin et al., 1991; Zakas and Rockman, 2014).
Streblospio benedicti is a common and widespread benthic polychaete found in estuaries throughout North America. Females brood fertilized embryos in dorsal brood pouches and release larvae into the water column. Larvae remain pelagic until they are competent to metamorphose and return to the benthos. Females fall into two classes exhibiting classic life-history trade-offs: planktotrophic mothers produce small (~100 um) eggs that develop into obligately feeding larvae that spend weeks in the plankton. These larvae grow larva-specific swimming chaetae that are thought to deter predation (Blake, 1969) (Figure 1A). Lecithotrophic mothers produce large (~200 um) eggs that develop into larvae that do not require exogenous food and quickly leave the water column as benthic juveniles. These larvae lack swimming chaetae and a second type of larva-specific morphological structure, anal cirri containing bacillary cells, which are distinctive cells that contain rhabdites of unknown function (Gibson et al., 2010) (Figure 1A). There are also differences during embryogenesis (McCain, 2008) and organogenesis (Gibson et al., 2010; Pernet and McHugh, 2010), where accelerated development of juvenile features and truncated development of larval features occurs in lecithotrophs. Despite these developmental and larval differences, adults are indistinguishable at the level of gross morphology. There are no known behavioral differences between the adults, though it is possible that preferences in mating or habitat choice occur. Unlike examples of polyphenism, environmental factors do not substantially alter egg size or subsequent offspring type (Levin and Bridges, 1994; Levin and Creed, 1986). Natural populations consist of only one of the two phenotypic extremes, often occurring in close geographic proximity but rarely directly overlapping (Levin, 1984; Schulze et al., 2000; Zakas and Wares, 2012), and larval mode in the population does not correlate with any known environmental gradient. Despite the persistence of these two dramatically different strategies in nature, they can be crossed in the lab producing an array of morphological and developmental intermediates (Levin et al., 1991; Zakas and Rockman, 2014; Figure 1B). Stable populations of different larval modes suggest that migration-selection balance on small ecological scales is acting to maintain both types in the larger population.
We genetically dissected developmental mode by crossing a lecithotrophic male from Long Beach, California, and a planktotrophic female from Bayonne, New Jersey. Studies of these populations have found them to be homogeneous in larval mode but little differentiated from one another across the genome (Rockman, 2012; Zakas and Rockman, 2014; Zakas and Rockman, 2015), consistent with ongoing gene flow between larval modes in their native East Coast range (Zakas and Wares, 2012). West Coast populations, including Long Beach, are exclusively lecithotrophic and result from a 20th century introduction from an East Coast source (Zakas and Wares, 2012).
Genetic models for evolutionary transitions from planktotrophy to lecithotrophy require traversal of intermediate states in multivariate phenotype space, which are predicted to be suboptimal with respect to fitness (Vance, 1973; Whitlock et al., 1995) (Figure 1C). The simplest scenario is that co-adapted trait combinations are affected by shared alleles, either by pleiotropy (for example, ref [Hall et al., 2006]) or tight linkage (e.g., a supergene with suppressed recombination [Jones et al., 2012; Joron et al., 2011; Yeaman, 2013]). The necessity to coordinate maternal traits (larval size) with zygotic traits (larval morphology) adds additional constraints (Kirkpatrick and Lande, 1989; Wade, 1998; Wolf and Brodie, 1998; Wolf and Wade, 2016). Figure 1C shows that alternative paths from planktotrophy to lecithotrophy make different predictions about the correlations among genetic effects. In particular, the presence of facultative planktotrophy (i.e., larvae can feed but do not require food) in lecithotrophic S. benedicti larvae (Allen and Pernet, 2007; Pernet and McArthur, 2006) provides one suggestion that evolution could traverse the larval-size axis, from small to large, prior to traversing the larval morphology axis, from ‘complex’ feeding larvae to ‘simple’ non-feeding larvae with accelerated development of juvenile features (Smith et al., 2007; Wray, 1996). Nevertheless, in natural populations we observe only the two extremes: large, ‘simple’ lecithotrophs, and small, ‘complex’ planktotrophs.
Results
From the F1 progeny of our cross between a lecithotroph and planktotroph, we generated a G2 panel by crossing one F1 male with four F1 females. We measured larval features – size, number and length of swimming chaetae, and presence or absence of anal cirri – on each G2 animal, then raised it to adulthood and crossed it to another G2 animal. From the progeny of these crosses, we measured average size of each G2 female’s G3 offspring. This extra generation is required because larval size is a maternal-effect trait (Zakas and Rockman, 2014). Indeed, we find that the larval size of a G2 female is uncorrelated with the average size of her G3 progeny (Figure 2A). The G2 larvae all develop with equivalent maternal genetic effects, as their F1 mothers are heterozygotes for planktotrophic and lecithotrophic alleles. By contrast, maternal-effect genes are segregating among G2 females, and so G3 larval sizes reflect inherited maternal-effect variation.
Next we constructed a genetic map, the first for any annelid (Supplementary file 1). We genotyped F1 and G2 animals at 702 markers, which fell into 10 autosomal and one sex-linked linkage group (LG) (Broman et al., 2003; R Core Development Team, 2015). In parallel, we defined the karyotype for S. benedicti, which includes 10 pairs of autosomes and one pair of sex chromosomes (Figure 2B–D, Figure 2—figure supplement 1). Males are heterogametic, diagnosed by a large Y chromosome. The karyotypes are grossly indistinguishable between Long Beach and Bayonne, and in each population we observe 11 bivalents at meiotic prophase in females and 12 bodies, presumably 10 autosomal bivalents and two sex-chromosome univalents, at diakinesis in males (Figure 2C,D; Figure 2—figure supplement 1).
G2 larvae are intermediate in size and variable in morphology; 15% lack larval chaetae and 25% lack anal cirri. As expected for a maternal trait, G2 larval size showed no linkage to G2 genotype. Every other phenotype implicated a small number of large-effect loci (Figure 2E; Table 1). Moreover, the loci are almost entirely independent: the maternal-effect loci shaping G3 larval size are unlinked to the zygotic-effect loci shaping G2 larval morphology. The morphology loci are also largely independent, with a major-effect anal cirri locus on LG5 and loci uniquely affecting number and length of swimming chaetae on LG8 and LG9. The two chaetal traits also share a common locus on LG3. Each QTL has a large effect (Figure 3A) and explains a large proportion of trait variance (Table 1), and the effects are largely additive (Table 2). The two exceptions to additivity are in chaetae length at the LG3 locus (the chaetae-shortening allele is dominant) and larval size, where there is significant epistasis between the two maternal-effect loci on LG6 and LG7. We corroborated the single-trait mapping results by performing a fully multivariate mapping analysis for maternal-effect larval size, chaetae number, and chaetae length. This analysis identified the same loci as the univariate analyses, and showed that the maternal effects and zygotic effects are largely independent (i.e., most effects are nearly orthogonal in this three-dimensional space: Figure 3B).
Our results imply that matings between animals of alternative developmental modes will routinely generate mismatches between larval size and morphology. This genetic architecture creates a strong selective barrier to adaptation: zygotic-effect alleles that migrate into new populations will find themselves tested against the local maternal-effect background (Kirkpatrick and Lande, 1989; Wolf and Brodie, 1998; Wolf, 2000). This maternal-by-zygotic epistasis for fitness can inhibit adaptation to local environmental conditions by creating runaway coevolution (Wade, 1998): the locally common maternal background creates a selective regime favoring matching zygotic-effect alleles.
This frequency-dependent runaway model assumes that zygotic-effect alleles are penetrant across maternal-effect backgrounds. An alternative is that zygotic effects are masked in mismatched maternal backgrounds, allowing them to persist invisibly until their matching maternal background becomes available. To test this possibility, we backcrossed 30 G2 males (which, like all of the G2s, have an F1 maternal background) to planktotrophic females, thereby shifting the maternal background to assess the penetrance of the chaetae loci. We found that the zygotic-effect loci that affect chaetae length and number in an F1 maternal background also do so in the planktotroph maternal background (length: p<10−4; number p<10−6; Figure 4). We estimate a negligible effect of the locus on LG9, but our analysis of that locus is underpowered due to unbalanced genotype frequencies among the 30 G2 males.
The backcross data demonstrate that zygotically-acting alleles are penetrant in planktotrophic maternal backgrounds, and thus these alleles cannot fully ‘hide’ in mismatched populations. However, it remains possible that variation within genotypic classes is such that an occasional larva will have a fully planktotrophic or lecithotrophic phenotype despite carrying mismatched alleles (Zakas and Rockman, 2014). This raises the possibility that maternal effects can allow for the accumulation of standing genetic variation in a population by masking unfit genotypic combinations from strong selection. Moreover, maternal-effect loci are sex-limited in their expression, and alleles can thus persist in populations at higher frequencies than unconditionally expressed alleles (Wade, 1998).
Discussion
We find that co-adapted life-history traits are strikingly modular, where each of the phenotypes has a largely independent genetic basis with QTL occurring on different chromosomes. This suggests the lecithotrophic larval strategy evolved through the stepwise accumulation of multiple genetic changes throughout the genome. Maternally-determined larval size has an independent genetic basis from other traits that are perfectly correlated in natural populations. While the most direct evolutionary path from plantkotrophy to lecithotrophy involves pleiotropic mutations that move populations diagonally across the size/shape space (Figure 1C), we observe that mutational effects are at right angles to one another. The lack of strong correlated effects among alleles is compatible with the evolution of lecithotrophy via a facultatively planktotrophic intermediate (Smith et al., 2007; Wray, 1996); that is, large larval size may have evolved prior to the loss of larval morphology (Figure 1C).
Under a broad range of assumptions, theory predicts that local adaptation in the face of ongoing gene flow should ultimately lead the causal alleles to converge at one or a few genomic loci, as opposed to an accumulation of genetic differences throughout the genome (Kirkpatrick and Barton, 2006; Yeaman and Whitlock, 2011), particularly in the context of maternal-by-zygotic epistasis (Wade, 1998). Our genetic results are therefore consistent with one component of the theoretical expectations: individual traits are influenced by one or two genomic regions of large-effect, which could in turn represent clusters of underlying genes. However, the overall modularity of these phenotypes is unexpected as the loci affect discrete traits rather than the larval syndrome as a whole. Loci affecting different traits are located on different chromosomes, indicating that correlated life-history phenotypes are genetically separable. We identified six chromosomes with large effects for four traits, implying that S. benedicti populations can segregate 36 = 729 possible genotypic combinations, most of which would produce combinations of intermediate phenotypes that are expected to be maladaptive.
Finally, we found that the evolutionary transition to lecithotrophic development requires the interaction of two genotypes, the mother and offspring. Because of this dual control of larval phenotype, the likely fitness of a zygotic allele depends on the probability it occurs in a favorable maternal genetic background. This genetic architecture provides an explanation for the phenotypic homogeneity of natural populations. The independent genetic basis of maternal and zygotic genetic effects on development shapes the accumulation of variation necessary for selection and local adaptation.
Materials and methods
Cross design
Request a detailed protocolWe crossed a single outbred Bayonne planktotrophic female to a Long Beach lecithotrophic male to generate F1s. Four F1 females were crossed to a single F1 male to produce four half-sib G2 families. We intercrossed the G2 animals to generate G3 families, and we crossed a subset of G2 males to Bayonne females to generate backcross families.
Phenotyping
Request a detailed protocolWe imaged G2 individuals live so they could be reared to adulthood and crossed to produce G3s. A panel of 266 G2 larvae was collected within 24 hr of release from their mothers and pipetted individually onto a 0.2% agar-seawater pad on a depression slide. We imaged larvae with a Zeiss Axio Imager M2 with 20x DIC objective, and washed them into individual wells in a 24-well plate where they developed to juvenile stage, provisioned with Dunaliella salina algae. We measured larval size (area), chaetae number, length of the longest chaeta, and presence/absence of pronounced anal cirri from images using ImageJ (NIH).
Once G2 animals reached adulthood, fed with defaunated mud, we paired them randomly for crosses. Because offspring size is proportional to egg size and has a strong additive maternal basis (Zakas and Rockman, 2014), we use average G3 larval size as a phenotype of the G2 mother. G3 larvae were fixed in 4% formaldehyde overnight, transferred to PBS buffer, and mounted on slides in a long-term stable mountant. We then imaged and measured larval area as above. Fixed larvae maintained their morphology relative to live larvae when compared from the same clutch. Backcross larvae were fixed and measured in the same manner.
Raw phenotype data are included as part of an Rqtl cross object in Supplementary file 1, and data for backcross larvae are included in Supplementary file 2.
Genotyping
Request a detailed protocolFor each Parent, F1 and G2 individual, we extracted DNA using the Qiagen DNeasy kit. Total worm DNA was sent to SNPsaurus for NextRAD genotyping-by-sequencing. NextRAD uses a primer-based approach to amplify a selection of sequences throughout the genome in order to have a reduced representation library for sequencing. PCR primers amplify genomic loci consistently between samples (Russello et al., 2015).
376 libraries were sequenced on Illumina HiSeq 2500 to a depth of ~20 x coverage. Reads were aligned using Samtools (Li et al., 2009) to a 805 MB (in fragments over 300 bp) reference genome assembly. The reference genome was constructed from Illumina Hiseq PE-100 reads of three male S. benedicti libraries (insert sizes of 300, 500, 800 bp) assembled with Platanus (Kajitani et al., 2014).
SNPsaurus aligned reads to the reference genome at 97% identity to reduce spurious alignments, and allowed only two mismatches per read to the reference. SNPs were called in vcftools (Danecek et al., 2011) and filtered to meet QC requirements by removing loci that had more than two alleles or less than 10% occurrence across all samples. From more than a million scaffolds in the reference assembly, 939 contained SNPs that passed QC, resulting in 1,389 SNPs.
Karyotyping
Request a detailed protocolWe imaged giemsa-stained chromosomes from cells at mitotic metaphase from laboratory-raised inbred (F5-F8) males and females from the Bayonne and Long Beach populations. We also imaged meiotic diakinesis in males and zygotene in females. We used a karyotyping protocol modeled closely on Rowell et al. (2011). Briefly, worm tissue was incubated in 0.05% colchicine in artificial seawater for 90 min, then subjected to a hypotonic series reaching 1:4 ASW:H20 over one hour. The tissue was then fixed with freshly prepared 3:1 Methanol:Acetic Acid fixative, with three fixative changes over one hour. Next, tissue was macerated on a glass slide in 60% acetic acid, using glass needles. Finally, after the slides had dried, we stained them with 5% Giemsa stain in phosphate-buffered saline. We imaged the slides on a Zeiss Axio Imager M2 with 100x objective.
Preparations from both sexes from both populations revealed a consistent karyotype of 2n = 22 and no gross differences between populations. Males are heteromorphic, carrying a distinctive Y chromosome that is substantially larger than every other chromosome. Among annelids with XY chromosomal sex determination, the Y is often larger than the X, as in species of Hediste (Tosuji et al., 2004). In Polydora curiosa, the only spionid previously known to have heteromorphic sex chromosomes (2n = 34 XY), the Y is larger than all other chromosomes (Korablev et al., 1999), as we observe in S. benedicti. Many other species of Polydora lack heteromorphic sex chromosomes (Korablev et al., 1999), leaving the potential homology of the large Y chromosomes in P. curiosa and S. benedicti uncertain.
Meiotic material from females of each population shows 11 bivalents at zygotene. In male meiotic material from each population, we observe 12 structures at diakinesis, including a mixture of ring and rod bivalents. We hypothesize that two of the 12 structures are X- and Y-chromosome univalents. This pattern of unpaired X and Y chromosomes at metaphase I is known among plant and insect species (Brady and Paliulis, 2015). Korablev et al. (1999) observed that the X and Y pair end-to-end at metaphase I in Polydora curiosa, which therefore differs from S. benedicti.
Linkage map construction
Request a detailed protocolData and annotated scripts used to generate S. benedicti linkage maps are included in Supplementary file 3.
We filtered data to include only the 313 unique samples for which the mean read depth at the 1389 SNP markers was greater than 15 and for which more than 1000 SNP genotypes were called. Unfortunately, due to sample quality issues, we were unable to generate reliable genotype calls from the two P0 animals or from the F1 male and four F1 females that parented the four F2 families. We retained 47 F1 and 266 G2 individuals.
We tested each SNP for sex linkage by applying Fisher’s exact test to genotype counts within each family and then combining the p-values using Fisher’s meta-analysis. For non-sex-linked contigs that contain multiple SNPs, we manually recoded each contig, when possible, to be a single intercross marker, with inferred P0 genotypes AA x BB. These inferences are straightforward given multiple SNPs per contig and our data from a panel of F1s and four half-sib G2 families. In some cases, individual G2 families segregated haplotypes that were not all distinguishable. For example, if P0 haplotypes were AB x AC, segregation in G2 families descended from AC x BC F1 parents is fully resolvable, but segregation from AA x BC F1 parents is not, because the P0 origin of A haplotypes in the G2 is ambiguous. In such cases, we assigned partial genotypes in the relevant families (i.e., we treated them as dominant rather than codominant markers). However, we only recoded multi-SNP contigs for which segregation in at least one G2 family was fully resolvable. Note that the assignment of genotypes to the maternal vs. paternal P0 is arbitrary because we lack genotypes for those individuals. To call a marker genotype for an individual worm, we required that all SNPs in the contig marker had to have genotype calls that conform to an intact P0 haplotype; this adds a layer of stringency in excluding genotypes that contain any erroneous SNP calls. Most multi-SNP contigs contain two or three SNPs, but one contained 16 SNPs spread across multiple sequencing reads. Because of missing data and genotype error, the requirement for intact P0 haplotypes across 16 SNPs is too stringent, and this contig was recoded into nine separate markers (five multi-SNP and four single SNP, based on the distribution of the SNPs across sequencing reads).
In recoding multi-SNP contigs as single markers, we identified five strains whose data showed very high rates of incompatibility with possible segregation patterns and 16 strains that showed probable pedigree errors (i.e., their genotypes were inconsistent with maternity by the F1 mother shown in their records but compatible with one of the other F1 mothers). These 21 strains were excluded from subsequent analysis, leaving a dataset of 292 worms distributed across five families (Table 3).
We inferred segregation classes for each autosomal marker and imputed genotypes for the five parents of the G2 families using a composite likelihood approach. For every autosomal SNP, there are four possible P0 genotype combinations (AAxBB, ABxAB, AAxAB, ABxBB), plus two for sites erroneously called as SNPs (AAxAA, BBxBB). For each SNP, we estimated likelihoods for each configuration for each family. We maximized the likelihood including a one-step error probability (i.e., AA <->AB < ->BB) allowed to range up to 10%. We next identified the segregation pattern (including the genotypes of the F1 parents of the G2 families) that maximizes the likelihood over the five families. This is a composite likelihood insofar as the genotype error probabilities are estimated for each family and the global pedigree likelihood derived from the appropriate combination of family likelihood estimates.
We then used four steps to build the S. benedicti genetic map:
Construction of an autosomal map from markers segregating in an intercross pattern, using Rqtl (Broman et al., 2003).
Construction of an autosomal map using the refined intercross marker dataset from step 1 and hundreds of additional non-intercross autosomal markers, using lep-MAP (Rastas et al., 2015; Rastas, 2017).
Addition of a sex-linked linkage-group.
Assignment of founder origin to each autosomal haplotype.
Step 1: Intercross marker map
Request a detailed protocolThe first strategy uses data from 549 non-sex-linked SNPs that individually or in contig-groups segregate in an intercross pattern (i.e., F1s are heterozygous and G2s segregate 1:2:1, implying that the parental genotypes are homozygous for alternate alleles). These SNPs include 448 from multi-SNP contigs that are recoded into 193 intercross markers, plus 101 individual SNPs identified as intercross by likelihood. We excluded 17 markers that were missing data for more than 10% of individuals and three other markers that exhibited strong departures from Mendelian proportions (p<10−10). From the resulting dataset of 274 markers and 245 G2 individuals, we constructed an autosomal genetic map in Rqtl (v.1.40 – 8) following the protocol of Broman (Broman, 2010). 271 markers readily assembled into 10 autosomal linkage groups. We ordered the markers using 20 cycles of the orderMarkers function with ripple window = 7, retaining the ordering with the highest likelihood for each linkage group. Four worms exhibited a large excess of crossovers, indicative of high genotyping error rates. We removed these animals (all are Family C females), estimated the genotyping error rate as 0.013 by maximum likelihood, and reordered the markers with another 20 cycles of orderMarkers with error.prob = 0.013.
Some apparent crossovers are associated with single genotypes that differ from both immediate flanking markers, despite very small genetic distances. These represent probable genotyping errors. Using an error lod of 6 as a threshold for probable errors, we converted 420 (of almost 63,000) genotype calls to missing data. We then reestimated the maximum likelihood error rate as 0.004 for the remaining data and reordered the markers with 50 cycles of orderMarkers and error.prob = 0.004.
Step 2. Inclusion of non-intercross markers
Request a detailed protocolReturning to the original 1,389-SNP dataset, we replaced the intercross marker genotypes with those from step 1 (i.e., replacing multi-SNP contig SNPs with their inferred intercross genotypes, converting 420 genotypes to missing data, and removing the four individuals inferred to have high genotype error rates). For the remaining non-intercross SNPs, we imputed the genotypes of the five F1 parents by likelihood, as described, and we used lepmap2 (Rastas et al., 2015) (v.0.2, downloaded March 5, 2017) to assign markers to linkage groups. The input dataset includes 288 individuals (47 F1s and 241 G2s in four families) and 1,111 markers. Processing steps included ParentCallGrandparents with XLimit = 8 and ZLimit = 8, Filtering with epsilon = 0.02 and dataTolerance = 0.0001, SeparateChromosomes with lodLimit = 15, and JoinSingles with lodLimit = 10 and lodDifference = 5. This analysis assigned 676 markers to 10 autosomes and 97 markers to two sex-linked chromosomes (Figure 2). An additional 25 markers were assigned to 10 small linkage groups (one sex-linked) with two to four markers each, and 313 markers were not assigned to linkage groups. The ten major autosomal linkage groups mapped uniquely to the ten identified using the intercross markers above at step 1.
To order the autosomal markers, we used OrderMarkers2 in lepmap3 (Rastas, 2017) (v.0.1, downloaded March 5, 2017). For each linkage group, we ran 100 iterations with randomPhase = 0 and 100 iterations with randomPhase = 1, and we then repeatedly ran OrderMarkers2 with the evaluateOrder option on the highest likelihood results for each approach to search for higher likelihood reconstructions. We then selected the result with the highest likelihood overall and retained the reconstructed genotypes (outputPhasedData = 1) for that run.
We next converted these reconstructed genotypes to conventional intercross genotypes by mapping the intercross-marker genotypes onto them and retaining the phase with the highest likelihood. We renamed the linkage groups to sort them by genetic map length and the result is Sb676. The intercross map from step 1, with linkage groups named accordingly, is Sb271.
Step 3. Sex-linked markers
Request a detailed protocolFor each of the 1,389 SNPs in the original dataset, we estimated the likelihood of each possible autosomal or sex-linked segregation pattern. We identified 32 SNPs with clean X-linked segregation (AAxBY or BBxAY) and none with Z-linkage. The X-linked SNPs mapped to one major and one minor sex-linked linkage group in the lepmap analysis at step two above. The other major lepmap-defined sex-linked linkage group contained only SNPs that segregate as fixed differences between X and Y chromosomes (i.e., the maximum likelihood segregation patterns are XAXA x XAYB and XBXB x XBYA). These are uninformative for the genetic map.
We next examined the multi-SNP contigs that contain any SNP whose sex-linkage p-value is less than 0.01, and where possible we recoded these SNPs as single multi-SNP markers, converting impossible genotypes to missing data, as done for autosomal markers in step 1. This resulted in a total of 26 markers (10 multi-SNP and 16 single-SNP). These markers were then ordered in Rqtl as in step 1. A group of three markers, which form a separate, minor sex-linked linkage group in our lepmap analysis at step 2, here form a distantly linked piece of the X chromosome (min.lod = 6, max.rf = 0.33). The final map, incorporating 702 markers, is Sb702.
The maps produced at each of the three steps are provided as Rqtl cross objects in Supplementary file 1.
Step 4. Inference of founder haplotypes
Request a detailed protocolAs noted above, we were unable to recover genotypes from the founding parents of the cross or from the F1 worms that contributed to the G2 generation, and we therefore lack information about which autosomal alleles derive from which founder. To assign parental origins to genotypes, we tested the two possible orientations using population-genomic data. We showed previously (Zakas and Rockman, 2015) that slight allele frequency differentiation between Long Beach and Bayonne populations is sufficient for population assignment. To assign the haplotypes from our cross to these populations, we generated Pool-seq libraries of each population in duplicate by pooling DNA of 25 adult females for each population, making four pool-seq libraries in total. The worms contributing to these pools were collected in 2018 from the same Long Beach and Bayonne localities as our cross founders. DNA was extracted with the Qiagen DNeasy kit and libraries were constructed with a Nextera DNA kit. These four libraries were sequenced on one lane of an Illumina NextSeq flow cell in High-Output mode. Reads were mapped to the draft reference genome, which was constructed from Bayonne individuals, using BWA and samtools (Li and Durbin, 2009) and variants were called using GATK (McKenna et al., 2010) with a low quality-filtering threshold of QD <1.0 and MQ <30.0.
Using this population sequence data, we calculated the likelihoods for the two possible genotype assignments for each autosome. To calculate likelihoods, we considered only SNPs that we inferred to be homozygous for alternate alleles in the founders of our cross (Step 1 above). Population allele frequencies were estimated from read counts in the pooled population sequence data. To account for the presence of zero counts of reads of some alleles in either population, which can be due to true absence or to low read coverage, we added one read to every allele count and then used these modified count proportions as population frequency estimates. In total, we used data from 110 SNPs (4 to 27 per chromosome).
For the hypothesis that founder genotype one derives from Bayonne and founder genotype two from Long Beach, we calculate the likelihood as L(H1):
P(genotype 1 | Bayonne allele frequencies) * P(genotype 2 | Long Beach allele frequencies),
assuming Hardy-Weinberg equilibrium within each population. We then calculated the likelihood of the alternative assignment (H2). We report the log10 likelihood ratio as a statistic.
For every chromosome, one genotype assignment was favored over the alternative assignment by a factor of >300 (Table 4). Under the favored orientations, QTL effects are all in the expected directions (for example, large egg and short chaetae alleles derive from the lecithotrophic Long Beach founder), and these polarities are those reported in the Supplementary file 1 genetic maps and analyzed throughout the paper.
QTL Mapping
Request a detailed protocolAnnotated R scripts that reproduce all linkage mapping and phenotypic analyses in the paper are included in Source code file 1.
We performed interval mapping in Rqtl to identify QTL. To accommodate the family structure in the data, we performed genome scans separately in each G2 family and summed the lod scores. To determine genome-wide significance thresholds, we generated a null distribution of maximum lod scores by applying the same structured genome scans to 1000 datasets in which phenotypes have been permuted among individuals within (but not among) G2 families (Churchill and Doerge, 2008). We used a per-trait threshold of p=0.05. All traits were mapped assuming a normal model, except for the presence or absence of anal cirri, for which we used a binary (logistic) model.
Our linkage mapping approach tests for loci that differ between the founding Long Beach and Bayonne parents of the cross, and QTL that are heterozygous in the P0 founders may be overlooked. Such loci should manifest as genetic differences among the four G2 families. We tested for family effects in each trait, and found no evidence for variation among the four G2 families for number of chaetae (p=0.06), presence of anal cirri (p=0.70), or G3 offspring size (p=0.80). We identified family effects for the length of the larval chaetae (p=0.0003) and G2 size (p=10−7).
The observed differences among G2 families may be due to one or more of (1) environmental maternal effects, including familial experimental counfounders such as the exact age of full-sib larvae at the time they were measured; (2) maternal or zygotic genetic effects that segregate among the F2 families due to heterozygosity in the P0 founders; and (3) genetic effects that differ among F2 families due to differential segregation distortion or chance allele frequency variation but not due to heterozygosity in the P0s. Because G2 size is a maternal-effect trait, the family effect for this phenotype can only be due to environmental maternal effects among the F1 mothers or segregating maternal effect factors that were heterozygous in the P0s. However, in the latter case, we expect the family effect to recur as variation among the G2 families in G3 offspring size. We observe no such effect, implying that the G2 families do not differ in their genotype distributions at maternal-effect offspring-size loci. For larval chaetae length, we cannot exclude the possibility that some of the variation among G2 families is due to QTL that were heterozygous in P0s. In a model including the significant QTL (detailed below), G2 family explains 5.7% of total variance in the larval chaetae length phenotype, while the two significant QTL explain 47.0%.
To model the genetics of each trait using the Rqtl fitqtl function, we tested for pairwise interactions between detected QTL across linkage groups and for effects of G2 family. In each case we compared a model of additive and dominant within-locus effects to models that incorporated the additional terms, and we used likelihood ratio tests to determine the statistical significance of improvements in model fit. We estimate the effect size for QTL using the best-fitting model, as described in Source code file 1. For chaetae number and presence of anal cirri, the prefered model includes QTL effects only. For chaetae length, the model includes an effect of G2 family. For G3 offspring area, the model includes an interaction between two QTL. Effect estimates are reported below in Table 4.
We find that chaetae number and length are correlated, as in Zakas and Rockman (2014), (r2 = 0.22, p<0.001). The two traits share a QTL on LG3 that may act pleiotropically. However, when variation due to the LG3 QTL is removed, there is still a strong correlation between the trait residuals (r2 = 0.1, p<0.001).
Multivariate QTL scan
Request a detailed protocolWe analyzed the three continuous traits – mean offspring size, chaetae number, and chaetae length – in a fully multivariate QTL scan (Knott and Haley, 2000). The approach searches for regions of the genome with significant effects in the three-dimensional space defined by variation in these three traits. We restricted the dataset to the 149 G2 females for which all three traits were scored (i.e., excluding G2 worms with zero chaetae).
We fit the following model at each marker: Y = XB + e, where Y is the 149 × 3 matrix of phenotypes, X is a matrix of fixed effects, B is the vector of fixed-effect coefficients to be estimated, and e is normally-distributed residual error. The fixed effects are the intercept, G2 family identity, and two vectors coding the additive and dominance variables for a single genetic marker. At each marker, we fit this model and compared it to a model with intercept and G2 family as the only fixed effects, calculating a p-value from the Pillai-Bartlett statistic. We estimated thresholds for genome-wide significance by repeating the analysis on 1000 datasets in which phenotype vectors were permuted among individuals within G2 families. R code that reproduces the analysis is included in Source code file 1.
Five QTL exceeded the genome-wide threshold for significance (p≤0.001 in each case), coinciding with the five QTL detected for these traits by univariate analysis. In a model incorporating additive and dominance effects for each QTL, five additive and two dominance effects (for the QTL on LG3 and LG6) were retained as significant (p<0.01; Table 5). We then estimated QTL effect sizes under the reduced model that includes only the significant effects.
Analysis of backcross larvae
Request a detailed protocolWe tested for an effect of G2 male genotype on the distribution of chaetal phenotypes in their backcross progeny. We used a linear mixed-effect model to account for the relatedness of larvae within a brood and then compared models with and without the loci implicated in each trait in the G2 generation (i.e., the loci in Table 2). For both traits, chaetal length and number, we detected a significant effect of genotype, with effects in the expected directions. The effect size estimates indicate that the locus on LG9 does not have a significant effect on chaetal length. We note that the 30 G2 males used in the backcross have unbalanced genotype representation at this locus, with 8 LL homozygotes, 22 LP heterozygotes, and 0 PP homozygotes. We are therefore poorly powered to detect an effect of this locus. R code reproducing our analyses are included in Source code file 1.
Data availability
Request a detailed protocolThe code and experimental data generated and analyzed during this study can be found in the supplemental files.
Data availability
All code and data generated and analyzed during this study can be found in the supplemental files.
References
-
Chromosome interaction over a distance in meiosisRoyal Society Open Science 2:e150029.https://doi.org/10.1098/rsos.150029
-
R/qtl: QTL mapping in experimental crossesBioinformatics 19:889–890.https://doi.org/10.1093/bioinformatics/btg112
-
Genetic map construction with R/qtlTechnical Report, Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison 214:.
-
Evolution of marine invertebrate reproduction patternsTheoretical Population Biology 16:267–282.https://doi.org/10.1016/0040-5809(79)90017-0
-
The variant call format and VCFtoolsBioinformatics 27:2156–2158.https://doi.org/10.1093/bioinformatics/btr330
-
BookThe evolutionary ecology of larval typesIn: McEdward L. R, editors. Ecology of Marine Invertebrate Larvae. Boca Raton: CRC Press. pp. 79–122.
-
Multitrait least squares for quantitative trait loci detectionGenetics 156:899–911.
-
Multiple patterns of development in streblospio benedicti webster(Spionidae) from 3 coasts of North AmericaThe Biological Bulletin 166:494–508.https://doi.org/10.2307/1541157
-
The sequence alignment/Map format and SAMtoolsBioinformatics 25:2078–2079.https://doi.org/10.1093/bioinformatics/btp352
-
Larval and life-cycle patterns in echinodermsCanadian Journal of Zoology 79:1125–1170.https://doi.org/10.1139/z00-218
-
SoftwareR: A language and environment for statistical computingR Foundation for Statistical Computing, Austria.
-
BookThe Shape of Life: Genes, Development, and the Evolution of Animal FormChicago: University of Chicago Press.
-
Construction of ultradense linkage maps with Lep-MAP2: stickleback F2 recombinant crosses as an exampleGenome Biology and Evolution 8:78–93.https://doi.org/10.1093/gbe/evv250
-
Patterns of nuclear genetic variation in the poecilogonous polychaete Streblospio benedictiIntegrative and Comparative Biology 52:173–180.https://doi.org/10.1093/icb/ics083
-
Chromosome analysis in invertebrates and vertebratesMethods in Molecular Biology 772:13–35.https://doi.org/10.1007/978-1-61779-228-1_2
-
Feeding and nonfeeding larval development and Life-History evolution in marine invertebratesAnnual Review of Ecology and Systematics 16:339–361.https://doi.org/10.1146/annurev.es.16.110185.002011
-
On reproductive strategies in marine benthic invertebratesThe American Naturalist 107:339–352.https://doi.org/10.1086/282838
-
Evolutionary genetics of maternal effectsMaternal Effects as Adaptations 21:827–839.
-
Multiple fitness peaks and epistasisAnnual Review of Ecology and Systematics 26:601–629.https://doi.org/10.1146/annurev.es.26.110195.003125
-
BookEvolution of larvae and developmental modeIn: larvae L. R, Edward M. c, editors. Ecology of Marine Invertebrate. Boca Raton: CRC Press. pp. 413–447.
-
Parallel evolution of nonfeeding larvae in echinoidsSystematic Biology 45:308–322.https://doi.org/10.1093/sysbio/45.3.308
-
Dimorphic development in Streblospio benedicti: genetic analysis of morphological differences between larval typesThe International Journal of Developmental Biology 58:593–599.https://doi.org/10.1387/ijdb.140088mr
Article and author information
Author details
Funding
National Science Foundation (IOS-1350926)
- Matthew V Rockman
National Institutes of Health (GM108396)
- Christina Zakas
Zegar Family Foundation
- Matthew V Rockman
New York University (Biology Master's Research Grant)
- Alex D Kay
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank D Tandon, I Ukegbu, R Freih, C Fayyazi, and L Jessell for laboratory assistance with maintenance and crossing of the animals, and L Noble, M Bernstein, J Yuen for helpful discussions of the manuscript. We also thank L Noble for valuable help with genome assembly and image analysis. We thank B Pernet for collecting the Long Beach worms. This research is funded by the Zegar Family Foundation, NSF grant IOS-1350926 to MVR, NIH grant GM108396-02 to CZ, and an NYU Biology Master’s Research Grant to ADK.
Copyright
© 2018, Zakas et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 2,494
- views
-
- 276
- downloads
-
- 23
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Ecology
- Evolutionary Biology
Seasonal polyphenism enables organisms to adapt to environmental challenges by increasing phenotypic diversity. Cacopsylla chinensis exhibits remarkable seasonal polyphenism, specifically in the form of summer-form and winter-form, which have distinct morphological phenotypes. Previous research has shown that low temperature and the temperature receptor CcTRPM regulate the transition from summer-form to winter-form in C. chinensis by impacting cuticle content and thickness. However, the underling neuroendocrine regulatory mechanism remains largely unknown. Bursicon, also known as the tanning hormone, is responsible for the hardening and darkening of the insect cuticle. In this study, we report for the first time on the novel function of Bursicon and its receptor in the transition from summer-form to winter-form in C. chinensis. Firstly, we identified CcBurs-α and CcBurs-β as two typical subunits of Bursicon in C. chinensis, which were regulated by low temperature (10 °C) and CcTRPM. Subsequently, CcBurs-α and CcBurs-β formed a heterodimer that mediated the transition from summer-form to winter-form by influencing the cuticle chitin contents and cuticle thickness. Furthermore, we demonstrated that CcBurs-R acts as the Bursicon receptor and plays a critical role in the up-stream signaling of the chitin biosynthesis pathway, regulating the transition from summer-form to winter-form. Finally, we discovered that miR-6012 directly targets CcBurs-R, contributing to the regulation of Bursicon signaling in the seasonal polyphenism of C. chinensis. In summary, these findings reveal the novel function of the neuroendocrine regulatory mechanism underlying seasonal polyphenism and provide critical insights into the insect Bursicon and its receptor.
-
- Evolutionary Biology
- Genetics and Genomics
It is well established that several Homo sapiens populations experienced admixture with extinct human species during their evolutionary history. Sometimes, such a gene flow could have played a role in modulating their capability to cope with a variety of selective pressures, thus resulting in archaic adaptive introgression events. A paradigmatic example of this evolutionary mechanism is offered by the EPAS1 gene, whose most frequent haplotype in Himalayan highlanders was proved to reduce their susceptibility to chronic mountain sickness and to be introduced in the gene pool of their ancestors by admixture with Denisovans. In this study, we aimed at further expanding the investigation of the impact of archaic introgression on more complex adaptive responses to hypobaric hypoxia evolved by populations of Tibetan/Sherpa ancestry, which have been plausibly mediated by soft selective sweeps and/or polygenic adaptations rather than by hard selective sweeps. For this purpose, we used a combination of composite-likelihood and gene network-based methods to detect adaptive loci in introgressed chromosomal segments from Tibetan WGS data and to shortlist those presenting Denisovan-like derived alleles that participate to the same functional pathways and are absent in populations of African ancestry, which are supposed to do not have experienced Denisovan admixture. According to this approach, we identified multiple genes putatively involved in archaic introgression events and that, especially as regards TBC1D1, RASGRF2, PRKAG2, and KRAS, have plausibly contributed to shape the adaptive modulation of angiogenesis and of certain cardiovascular traits in high-altitude Himalayan peoples. These findings provided unprecedented evidence about the complexity of the adaptive phenotype evolved by these human groups to cope with challenges imposed by hypobaric hypoxia, offering new insights into the tangled interplay of genetic determinants that mediates the physiological adjustments crucial for human adaptation to the high-altitude environment.