Natural genetic variation in Arabidopsis thaliana defense metabolism genes modulates field fitness
Abstract
Natural populations persist in complex environments, where biotic stressors, such as pathogen and insect communities, fluctuate temporally and spatially. These shifting biotic pressures generate heterogeneous selective forces that can maintain standing natural variation within a species. To directly test if genes containing causal variation for the Arabidopsis thaliana defensive compounds, glucosinolates (GSL), control field fitness and are therefore subject to natural selection, we conducted a multi-year field trial using lines that vary in only specific causal genes. Interestingly, we found that variation in these naturally polymorphic GSL genes affected fitness in each of our environments, but the pattern fluctuated such that highly fit genotypes in one trial displayed lower fitness in another and no GSL genotype or set of genotypes consistently outperformed the others. This was true both across locations and within the same location across years. These results indicate that environmental heterogeneity may contribute to the maintenance of GSL variation observed within Arabidopsis thaliana.
https://doi.org/10.7554/eLife.05604.001
eLife digest
‘Genetic variation’ describes the naturally occurring differences in DNA sequences that are found among individuals of the same species. These genetic differences arise from random mutations and may be passed on to their offspring. Some of these mutations may improve the ability of an individual to survive and reproduce—known as fitness—and are likely to become more common in the population. Other mutations may reduce an individual's fitness and are likely to be lost. However, it is believed that most of the mutations will have no effect on the fitness of individuals.
It is not known why many of these ‘neutral’ genetic differences are maintained in populations. Some researchers have proposed that they are kept by chance and that there is no direct advantage to the population of keeping them unless these neutral mutations later become beneficial. However, other researchers think that the genetic variation itself may improve the fitness of the population by allowing it to quickly adapt to changes in the environment.
Arabidopsis thaliana is a small plant that lives in many different environments and has high levels of genetic variation in many of its physical traits. One of these traits is the production of molecules called glucosinolates, which help the plants to defend against herbivores and infection by microbes. Previous studies have suggested that variation in the genes that make glucosinolates may improve the fitness of A. thaliana populations.
To test this idea, Kerwin et al. carried out a field trial using A. thaliana plants that were genetically identical except for some of the genes involved in the production of glucosinolates. Kerwin et al. grew the plants in several different environments over several years. The field trial shows that variation in these genes affected the fitness of the plants in each of the different environments. However, the fitness benefit depended on the environment, and no single gene variant provided the best fitness across all environments, or over all the years of the trial.
Kerwin et al.'s findings suggest that changes in the environment may contribute to the maintenance of genetic variation in the genes that make glucosinolates. This raises the questions of how many other genes in plants (or other species such as humans) have genetic variation that contributes to fitness across varied environments; and how can this link be tested in natural settings.
https://doi.org/10.7554/eLife.05604.002
Introduction
High levels of standing variation have often been observed among many natural plant and animal populations. This is particularly true for the model species Arabidopsis thaliana, which exhibits variation both within and among natural populations and/or accessions (Pigliucci and Marlow, 2001; Atwell et al., 2010; Bomblies et al., 2010; Chan et al., 2010; Platt et al., 2010; Cao et al., 2011; Debieu et al., 2013; Joseph et al., 2013; Long et al., 2013; Anwer et al., 2014; Li et al., 2014). Models based on mutation-selection balance theory predict that this observed variation will be due to rare alleles at many loci introduced through random mutations that evolution acts on to eliminate through persistent purifying natural selection (Kimura, 1968; Turelli, 1984). In agreement, studies of nucleotide variation in Arabidopsis have found an excess of low frequency polymorphisms relative to expectation (Purugganan & Suddith, 1998, 1999). However, other studies cloning causal genetic variants from natural Arabidopsis accessions have found several intriguing examples of intermediate frequency alleles maintained at polymorphic loci (Johanson et al., 2000; Long et al., 2000; Li et al., 2014). This variation among loci has led to a long-standing interest in elucidating to what extent this genetic variation is neutral in origin or, alternatively, maintained through selective forces (Levene, 1953; Hedrick et al., 1976; Bull, 1987; Stahl et al., 1999; Prasad et al., 2012).
The neutral theory posits that the majority of genetic polymorphisms have no effect on fitness and that stochastic evolutionary processes, such as genetic drift and migration, are sufficient to explain the genetic and phenotypic variation observed within and among populations (Darwin, 1859; Kimura, 1968; Duret, 2008). This hypothesis has generated numerous modeling studies demonstrating that the standing level of genetic variation in traits can be explained by the demographic history of a species not linked to fitness of an individual (Wolf et al., 2000; Barton and Turelli, 2004; Hufford et al., 2012; Pyhajarvi et al., 2013). However, for many ecologically important traits, phenotypic variation has been shown to empirically impact fitness in natural populations, suggesting that natural selection also plays an important role in the evolution of such traits (Mothershead and Marquis, 2000; Adler et al., 2001; Tian et al., 2003; Korves et al., 2007; Milla et al., 2009). A key step necessary to begin to resolve these discrepancies between theory and empirical observations requires the validation of fitness consequences of variation at specific loci or pathways in the field (Turelli and Barton, 2004; Fournier-Level et al., 2011; Hancock et al., 2011).
Determining the impact of polygenic variation upon fitness in the field informs our understanding of the potential selective and non-selective evolutionary processes that protect or maintain phenotypic variation within a species, such as genetic drift and balancing selection (Kimura, 1968; Hedrick et al., 1976; Mitchell-Olds et al., 2007; Mojica et al., 2012). However, most population level studies of evolution and selection in the field have focused on polygenic populations and have been unable to validate the link between variation at specific underlying genes and the resulting fitness consequences of this variation (Lande and Arnold, 1983; Mitchell-Olds and Rutledge, 1986; Gillespie and Turelli, 1989; Orr, 1998). Studies using structured mapping populations, such as Arabidopsis RILs, can only associate large genomic regions, rather than individual genes, with quantitative variation in fitness (Weinig et al., 2003; Stinchcombe et al., 2004; Juenger et al., 2005; Malmberg et al., 2005). More recently, genome wide studies using A. thaliana accessions have been able to associate SNPs to fitness in the field and even predict relative fitness of accessions grown in a common garden (Fournier-Level et al., 2011; Hancock et al., 2011). However, these associations between loci and fitness need more refining to validate the effect of individual genes. Testing if individual genes impact fitness in the field first requires identifying and cloning the causal genes underlying the phenotypic variation of interest (Mitchell-Olds, 1995; Tian et al., 2003; Mitchell-Olds et al., 2007). Then, these natural alleles need to be recreated as single gene lines, which can require approaches such as chemical mutation (e.g., EMS), generation of transgenic individuals via Agrobacterium-mediated transformation, and/or generation of isogenic lines through successive rounds of backcrossing. 
Therefore, empirical field testing of individual causative polymorphic genes has only been done rarely, and we do not yet have a good understanding of the extent to which individual genes impact fitness in the field (Tian et al., 2003; Schuman et al., 2012).
A. thaliana has become a key model system and is well suited for characterizing, cloning and validating genes influencing the fitness consequences of underlying natural variation. This is due, in part, to the ease of transformation as well as the abundance of genomic resources available for this organism, including an extensive library of T-DNA insertion lines and natural accessions (Alonso et al., 2003). Arabidopsis persists in many different environments and experiences selection from both abiotic pressures, such as temperature and precipitation, and biotic pressures, such as insect and pathogen populations that vary temporally and spatially (Meyerowitz, 1987; Richards et al., 2009). Potentially to maximize fitness across a broad range of biota, Arabidopsis has evolved high levels of natural variation among accessions for many important phenotypic traits, including the defense compounds, glucosinolates (GSLs) (Stahl et al., 1999; Atwell et al., 2010; Chan et al., 2010). GSLs constitute a diverse set of plant-made defensive metabolites restricted primarily to the Brassicales that are partitioned into three classes, indolic, aliphatic and aromatic, depending on their amino acid precursor. These N and S containing compounds are stored in the vacuoles of plant cells until they are activated through tissue damage, which can occur through insect feeding and pathogen attack. Natural genetic polymorphisms found among a suite of aliphatic GSL genes in Arabidopsis are responsible for the majority of GSL diversity observed in the leaf tissue (Figure 1). These aliphatic GSL genes encode enzymes, transcription factors and activation co-factors that have been identified, cloned and validated in a laboratory setting (Table 1) (Haughn et al., 1991; Li and Quiros, 2003; Hansen et al., 2007, 2008; Hirai et al., 2007; Li et al., 2008; Neal et al., 2010).
Previous studies have uncovered links between GSL variation and ecologically important traits in Arabidopsis, such as resistance to insect/pathogen damage, flowering time, and growth, suggesting that GSLs play an important role in determining plant fitness (Mauricio, 1998; Kliebenstein et al., 2002; Bidart-Bouzat and Kliebenstein, 2008; Hansen et al., 2008; Burow et al., 2010; Kerwin et al., 2011; Züst et al., 2011). Since the genes responsible for the majority of natural polymorphism in aliphatic GSL have been well characterized in a laboratory setting, the GSL pathway in Arabidopsis provides a good system for understanding the impact that individual genes might have on fitness in the field (Kliebenstein et al., 2001b; Halkier and Gershenzon, 2006; Hansen et al., 2008). In this study, we tested the fitness consequences of aliphatic GSL variation in the field by utilizing a collection of lines that vary at specific GSL genes in Arabidopsis (Col-0), which recreated observed natural variation in the aliphatic GSL pathway found among accessions (Table 2) (Mauricio, 1998; Kliebenstein et al., 2002; Bidart-Bouzat and Kliebenstein, 2008; Hansen et al., 2008; Burow et al., 2010; Kerwin et al., 2011; Züst et al., 2011).
Results
Synthetic laboratory population mimics natural GSL variation in Arabidopsis
The GSL profile of a plant is characterized by the presence and relative abundance of the various GSL structures it produces. Among Arabidopsis accessions, GSL profiles show extensive phenotypic variation across the species geographic distribution (Figure 2) (Chan et al., 2010). While previous studies have linked GSL profile variation to insect resistance, as well as correlated the geographic distribution of insect populations with GSL profile-type across Europe, it is still not known to what extent, if at all, individual GSL genes affect fitness in the field (Mauricio, 1998; Bidart-Bouzat and Kliebenstein, 2008; Züst et al., 2012). To test if standing genetic variation within the aliphatic GSL defensive pathway of A. thaliana impacts fitness in the field, we utilized an existing set of genotypes that recreate natural variation at eight specific GSL loci, with the reference accession, Col-0, as the genetic background. These transgenic lines consisted of loss-of-function T-DNA insertion lines, an EMS mutant and gain-of-function overexpression lines that were originally created to validate individual genes as causal for GSL natural variation (Table 1). For example, the AOP2 gene was found to encode an enzyme that converts methylsulfinyl (MSO) GSL into alkenyl GSL (Figure 1 and Figure 1—figure supplement 7) (Kliebenstein et al., 2001c). Importantly, the AOP2 gene is polymorphic among Arabidopsis accessions, with Col-0 accession containing a natural knockout that abolishes its function. Therefore, introducing the functional allele back into Col-0 created a single gene mimic of the natural variation found in Arabidopsis (Figure 1 and Table 1) (Kliebenstein et al., 2001c). The natural variation at the other causal genes has been similarly mimicked as described in the listed citations (Table 1). This was facilitated by the fact that all of these genes contain natural presence/absence polymorphisms (citations in Table 1).
Figure 2—source data 1: https://doi.org/10.7554/eLife.05604.024
Each of these transgenic lines had been backcrossed to Col-0 several times to remove unlinked polymorphisms in the original studies (Table 1). For this study, the transgenic lines were manually crossed to each other to represent the phenotypic variation in GSL profiles found among Arabidopsis accessions (Table 2, Figures 1, 2). This synthetic laboratory population varies at specific genes controlling aliphatic GSL variation within a single common genetic background. Utilizing this synthetic laboratory population, we can explicitly measure the impact of variation in a suite of aliphatic GSL genes on fitness components in the field without confounding variation in other regions of the genome.
We tested our population in multiple environments, which allowed us to separate the effects of genotype from environment, to determine if traits measured in the field are environmentally controlled. This could be particularly important if selection pressures fluctuate across environments. We transplanted 2 week old, greenhouse-germinated replicates of the synthetic laboratory population into the field at the University of California, Davis in Davis, CA in Spring 2012 and the University of Wyoming in Laramie, WY in Summer 2011 and Summer 2012. In each of our three field trials, which represent three environments, genotypes were replicated in 40 randomized blocks in the field, for a total of 120 blocks/replicates. To distinguish the effects of GSL variation alone from the interaction of GSL variation with field herbivory as well as assess the effects of leaf damage in the field, half of the blocks in each field trial were treated with pesticides and the other half were not (Figure 3) (Mauricio, 1998).
GSL genetic variation controls GSL profile in the field
Since the genes underlying variation in the aliphatic GSL pathway investigated in this study have been previously validated using lab techniques, we have a solid working knowledge of the resulting laboratory GSL profiles (Beekwilder et al., 2008; Hansen et al., 2008) (Figure 1—figure supplements 1–17). However, these GSL genotypes have not previously been tested in the field to determine if they produce the same GSL profiles as when grown in the laboratory. We particularly wanted to assess if variation at individual aliphatic GSL genes has the same impact on GSL profile in the field as predicted from published lab experiments when the plants are grown in different complex environments, and therefore measured GSL on all the plants grown in each of our three field trials. A mixed model analysis of field GSL revealed that the majority of variation in GSL profiles in the field was controlled by the GSL genotypes that we generated (Table 3). Importantly, the majority of the GSL genotypes produced the expected GSL profiles in the field, consistent with the lab studies (Figure 4 and Figure 1—figure supplements 1–17). To quantify the similarity in profiles between field and lab grown samples, we conducted a PCA analysis using the GSL profiles of these genotypes grown in a growth chamber. The first four vectors from our PCA were able to explain >99% of the variation in GSL profile. We utilized the loadings from the chamber PCA to estimate PCA scores of the first four vectors using the chamber GSL and field GSL. The scores for the field grown genotypes were highly correlated with the lab grown genotypes, showing that the GSL genetic variation leads to highly similar field and lab profiles (Table 4).
Table 3—source data 1: https://doi.org/10.7554/eLife.05604.027
Table 3—source data 2: https://doi.org/10.7554/eLife.05604.028
Figure 4—source data 1: https://doi.org/10.7554/eLife.05604.030
In addition to the quantitative comparison of profiles, we also investigated the specificity of each locus in producing particular GSL structures to ensure that its field behavior mimicked the lab behavior. We found that, for the most part, each GSL gene produced the expected GSL phenotype in the field. For example, all lines harboring a functional AOP2 gene produce alkenyl GSL (e.g., but-3-enyl GSL) (Figures 1, 4). Additionally, the functional/non-functional allelic state at the MAM1 locus was always predictive of the chain-length of the GSL in the field as predicted from lab experiments. The lines with a functional MAM1, like Col-0, produced more 4C GSL than 3C GSL, while genotypes with a non-functional MAM1 always produced more 3C GSL than 4C GSL (Figure 4) (Haughn et al., 1991). A functional copy of GSOH, the gene encoding the enzyme to create 2-OH-but-3-enyl, always leads to the production of 2-OH-but-3-enyl GSL from but-3-enyl GSL (Figures 1, 4) (Hansen et al., 2008). In addition to the biosynthetic genes, the MYB genes, which encode transcription factors that control accumulation of aliphatic GSLs, showed field phenotypes similar to those found in the lab (Hirai et al., 2007; Sønderby et al., 2007, 2010). Specifically, a non-functional MYB28 leads to an almost complete reduction in long chain (8C) GSL and a 60% reduction in short chain (3C and 4C) GSL (Figure 4) (Hirai et al., 2007; Sønderby et al., 2007, 2010). A non-functional MYB29 leads to a 40% reduction in short chain GSL with no significant reduction in long chain GSL (Figure 4) (Hirai et al., 2007; Sønderby et al., 2007, 2010). A double mutant in MYB28 and MYB29 leads to an almost complete loss of all aliphatic GSLs, as expected (Figure 4) (Hirai et al., 2007; Sønderby et al., 2007, 2010). The only genes for which the field and laboratory GSL profile data differ are GSOX1 and GSOX3, which are two tightly linked genes at the GSOX locus that also contains two additional genes, GSOX2 and GSOX4.
In the lab, gsox1 and gsox3 mutants accumulate higher levels of methylthio (MT) GSL than does Col-0, due to reduced expression of a flavin-monooxygenase that converts the MT to MSO GSL (Figure 1) (Hansen et al., 2007; Li et al., 2008). In the field there was no measurable accumulation of MT GSL in any line, likely due to the redundant function of the GSOX2 and GSOX4 genes (Kerwin et al., 2011, 2012; Li et al., 2008). Thus, the field results show that the laboratory work on GSL genotypes and their associated GSL profiles is translatable and predictive of the GSL profiles found in naturally fluctuating environments.
Environment and genetic variation interact to control GSL accumulation in the field
Conducting field trials in multiple environments enabled us to test the effect of different environmental conditions on our field traits. The specific composition of GSLs within a genotype largely did not change across the environments (Table 4). In contrast, the total amount of aliphatic GSL content, that is, the sum of all aliphatic GSLs per sample, showed a significant genotype by environment effect, indicating that the impact of environment on total aliphatic GSL accumulation varied among the different GSL genotypes in this study (Table 3 and Figure 5). For example, the AOP2 genotype showed a dramatic variation in total aliphatic GSL across the three field trials (Figure 5). In contrast, a number of other genotypes tended to show similar accumulation across the environments. For example, genotypes with a myb28/myb29 double knockout accumulated virtually no GSL in all three environments. Thus, the GSL genotype is the dominant determinant of GSL profile in the field while total aliphatic GSL accumulation is determined by an interaction of genotype and environment within our laboratory population.
Leaf damage in the field varies across environment
A critical way in which plant environments fluctuate is with respect to insect populations that vary both temporally and spatially in a manner that could have a profound impact on variation in plant damage (Mauricio, 1998; Richards et al., 2009). To assess if changes in environment impact herbivory levels, we measured leaf damage on a scale from 0–10 in all three field trials, with and without a pesticide treatment (Figure 6). A mixed model analysis showed that leaf damage significantly varied across the three environments but that the pesticide application did not significantly alter leaf damage in the field (Table 3). The UWY2012 field trial (mean = 2.610) had significantly higher levels of leaf damage than both UWY2011 (mean = 1.17, p value <1e-04) and UCD2012 (mean = 1.50, p value <1e-04), though UCD2012 and UWY2011 environments did not differ significantly for leaf damage (p value = 0.44). Field plots were treated with pesticides once every 2 weeks, which did not entirely eliminate leaf damage on the treated individuals. A more aggressive pesticide treatment regime would have been necessary to abolish leaf damage in the treated group. In addition, the levels of leaf damage measured in our study are low relative to other field studies in Arabidopsis (Bidart-Bouzat and Kliebenstein, 2008). The field site was located adjacent to other experimental field sites and greenhouses that also treated for pests, which may or may not have had an impact on the relative levels and/or pesticide resistance of herbivores in the vicinity. This combination of low overall leaf damage levels and the fact that the pesticide treatment did not eliminate leaf damage in the treated group is likely the cause for this lack of a treatment effect. However, there is a significant environment effect for leaf damage, indicating that this trait varied across the three field trials. In fact, we see no significant correlation of leaf damage across the three environments (Table 5). 
This suggests the three environments experienced differing herbivory pressures. Since we did not measure herbivore levels, we cannot determine whether the differences in leaf damage are the direct result of differences in insect populations. It is interesting to note that the UWY field site showed both the highest and lowest leaf damage levels, demonstrating that there can be potentially large temporal fluctuations in herbivory at a single location (Table 3—source data 2).
Figure 6—source data 1: https://doi.org/10.7554/eLife.05604.034
Environment interacts with GSL genotype to impact leaf damage in the field
GSL variation is known to affect leaf damage incurred by insect herbivory within a controlled lab setting and we wanted to test if this could also be observed within a naturally fluctuating field setting (Lambrix et al., 2001; Kliebenstein et al., 2002; Beekwilder et al., 2008; Hansen et al., 2008). Within a field environment, levels of leaf damage significantly varied across GSL genotypes, in agreement with the role of GSL in deterring herbivory (Table 3). However, the extent of leaf damage incurred by different GSL genotypes in the field fluctuated among environments, such that no particular GSL genotype showed a consistent maximal or minimal level of leaf damage across the three field trials (Figure 6). For example, the myb28/AOP2 and AOP2 genotypes had similar herbivory in UCD2012 (mean = 1.30 and 1.95, respectively) and UWY2011 (mean = 1.29 and 0.80, respectively) but strongly diverged in UWY2012 (mean = 1.45 and 5.64, respectively) (Figure 6 and Figure 6—source data 1). It has been shown in a laboratory setting that the extent to which GSL profile provides resistance varies across different herbivore species (Kroymann et al., 2003; Pfalz et al., 2007; Hansen et al., 2008). In addition, GSL have been shown to provide resistance to fungi, bacteria and nematodes, which may have also been present and variable between our environments (Manici et al., 1997; Tierens et al., 2001; Aires et al., 2009; Witzel et al., 2013). It is likely that the composition of the herbivore communities differed between the two field sites. Though we did not conduct a complete survey of the herbivores present at UWY and UCD, we did observe differences in leaf damage patterns between the two locations, suggesting that there would be differences in the composition of herbivore species present.
Together, these results show that GSL variation controls differential leaf damage in each field trial but the specific direction of effect for individual GSL genotypes is subject to environmental conditions, such as the composition of herbivores, which can vary temporally and spatially.
GSL variation and the environment impact fitness in the field
Since our laboratory population contains single gene variants, we have the ability to test the fitness consequences of individual genotypes in a field setting, an important step in connecting the GSL variation observed among Arabidopsis accessions with potential selective and non-selective evolutionary processes. To test if the GSL genotypes alter plant fitness in the three environments, we measured fecundity of each individual grown in the field. Plants were harvested from the field at maturity and the numbers of fruits, flowers and buds per plant were counted in the laboratory to yield total fruit count (TFC). TFC has previously been shown to be a good proxy for fecundity in Arabidopsis where total number of seeds per plant is highly correlated with total number of siliques (i.e., fruits) (Wolf et al., 2000; Kliebenstein et al., 2001c). Among the GSL genotypes we observed variation in silique length. Arabidopsis siliques contain two rows of seeds in a linear conformation, so that silique length strongly correlates with seed number at maturity, assuming uniform seed size. Therefore variation in silique length or seed size could affect our fecundity estimates. Silique length and seed size were measured from digital images of GSL genotypes harvested from the field and seed size showed no significant variation (data not shown). However, there was significant variation in silique length across GSL genotypes as well as a significant genotype by environment interaction (Table 3—source data 1). We concluded that the significant differences in silique lengths are likely reflective of fecundity and adjusted our fitness measurements using this information. Estimates of absolute fitness were therefore obtained for each individual as TFC multiplied by silique length both including and excluding individuals that did not survive to harvest. 
Survivorship was included in fitness estimates to avoid obtaining artificially inflated fitness estimates from GSL genotypes with higher death rates that would result from removing individuals that do not survive to fruiting and have a fitness of zero.
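The two absolute fitness estimates described above can be illustrated with a minimal sketch. The record type `Plant`, its field names, and the example values are hypothetical and are not taken from the study data; the sketch only assumes that absolute fitness is TFC multiplied by silique length and that non-survivors are assigned a fitness of zero.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Plant:
    """Hypothetical per-plant record; field names are illustrative only."""
    genotype: str
    survived: bool
    total_fruit_count: int   # fruits + flowers + buds counted at harvest (TFC)
    silique_length: float    # silique length measured from images (units assumed)


def absolute_fitness(plant: Plant) -> float:
    """TFC multiplied by silique length; zero for plants that died before harvest."""
    if not plant.survived:
        return 0.0
    return plant.total_fruit_count * plant.silique_length


def genotype_means(plants, include_survivorship: bool) -> dict:
    """Mean absolute fitness per genotype.

    include_survivorship=True keeps dead plants (fitness 0) in the mean;
    include_survivorship=False drops them before averaging.
    """
    by_genotype = {}
    for plant in plants:
        if plant.survived or include_survivorship:
            by_genotype.setdefault(plant.genotype, []).append(absolute_fitness(plant))
    return {genotype: mean(values) for genotype, values in by_genotype.items()}


# Made-up example: genotype "A" loses one of two plants before harvest.
plants = [
    Plant("A", True, 100, 10.0),
    Plant("A", False, 0, 0.0),
    Plant("B", True, 50, 12.0),
]
with_survivorship = genotype_means(plants, include_survivorship=True)
without_survivorship = genotype_means(plants, include_survivorship=False)
```

In this made-up example, excluding the dead plant doubles the estimate for genotype "A" and reverses its rank relative to "B", which is the kind of inflation that including survivorship is meant to avoid.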
In this study, GSL genotype had a significant impact on absolute field fitness (Table 6). There was also a significant interaction effect between GSL genotype and environment for absolute fitness both including and excluding survivorship, suggesting that the impact that GSL genotype has on fitness is conditioned upon the environment (Table 6). Environment did not show a significant main effect on either measure of absolute fitness, suggesting that the population mean for absolute fitness may have been comparable across the environments and instead it is the fitness of GSL genotypes relative to each other within an environment that varies. Thus, these GSL genotypes that recreate natural variation within a single common genetic background influence field fitness of A. thaliana in an environmentally dependent manner.
To visualize if the rank in absolute fitness of GSL genotypes fluctuates among the three environments and to compare the patterns of fluctuation of GSL genotypes across environments, we plotted the mean normalized fitness of all GSL genotypes in all environments for both absolute fitness measures, including and excluding survivorship (Figure 7 and Figure 7—figure supplement 1). Absolute fitness varied greatly between the highest and lowest ranked GSL genotypes within each of the environments (Figure 7 and Figure 7—source data 1). In addition, the performance of different GSL genotypes relative to each other varied across environments, so that no GSL genotype outperformed all the others in all three environments. For example, myb28/AOP2 shows the greatest fitness in the UCD2012 environment and the lowest fitness in UWY2012. In contrast, myb28/gsoh shows the opposite pattern, while other genotypes show a diversity of other patterns (Figure 7). This fluctuation in rank of GSL genotypes across environments can also be observed if we look at fluctuations of TFC with and without survivorship across the three environments, though the patterns for specific GSL genotypes vary across the different fitness measures (Figure 7—figure supplement 1). Thus, it appears that the significant interaction of GSL genotype by environment controlling fitness is caused by fluctuations in the fitness rank of different genotypes across environments (Figure 7 and Figure 7—source data 1).
Figure 7—source data 1: https://doi.org/10.7554/eLife.05604.038
Within an environment, individuals compete against their neighbors for resources during their lifetime and natural selection favors those with higher performance relative to others. Therefore, in addition to absolute fitness, we also analyzed the effect of the GSL genotype on relative fitness in the field, both with and without survivorship. We calculated relative fitness of each GSL genotype within each environment as absolute fitness divided by the population mean within that environment. Even more strongly than with our absolute fitness measurements, we found that GSL genotype and the interaction between GSL genotype and environment both had a significant impact on relative fitness in the field both including and excluding survivorship (Table 6). For example, myb28 has a higher than average relative fitness in UWY2011 but shows an average and slightly lower than average relative fitness in UWY2012 and UCD2012, respectively (Figure 8). In other cases, relative fitness of a GSL genotype is similar among the UWY field trials but differs in the UCD field trial. Two examples with opposite patterns are myb28/AOP2, which has low relative fitness in both UWY field trials but higher relative fitness in UCD, and gsm1, which has high relative fitness in both UWY field trials but lower relative fitness in UCD. This indicates that temporal and spatial fluctuations in fitness can both occur and are dependent on genotypic differences.
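The relative fitness calculation can be sketched as follows. This is an illustration under stated assumptions, not the analysis code used in the study: the function name `relative_fitness`, the record layout, and the choice to take the population mean over all individual plants in an environment (rather than over genotype means) are assumptions, and the numbers are made up.

```python
from collections import defaultdict
from statistics import mean


def relative_fitness(records):
    """Mean relative fitness per (environment, genotype).

    records: iterable of (environment, genotype, absolute_fitness), one per plant.
    Relative fitness is absolute fitness divided by the mean absolute fitness
    of all plants in the same environment (assumed definition of population mean).
    An environment whose mean fitness is zero would raise ZeroDivisionError.
    """
    records = list(records)

    # Population mean of absolute fitness within each environment.
    by_environment = defaultdict(list)
    for environment, _genotype, fitness in records:
        by_environment[environment].append(fitness)
    environment_mean = {env: mean(values) for env, values in by_environment.items()}

    # Scale each plant by its environment mean, then average per genotype.
    by_cell = defaultdict(list)
    for environment, genotype, fitness in records:
        by_cell[(environment, genotype)].append(fitness / environment_mean[environment])
    return {cell: mean(values) for cell, values in by_cell.items()}


# Made-up example with two environments and two genotypes.
records = [
    ("E1", "A", 10.0), ("E1", "B", 30.0),
    ("E2", "A", 40.0), ("E2", "B", 20.0),
]
result = relative_fitness(records)
```

Here genotype "A" falls below the population mean in "E1" (0.5) and above it in "E2" (about 1.33), a rank reversal of the kind described for the GSL genotypes.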
Interestingly, heatmaps of absolute fitness and relative fitness reveal unexpected hierarchical clustering of the environments between the two traits (Figure 8). In both cases, UCD2012 clusters with UWY2011 and the two UWY field trials do not cluster together, showing that within an environment across years there is the potential for greater variability than across environments.
Pleiotropic links to GSL genes
In our analysis, we measured flowering time and total indole GSL in the field. In a laboratory setting, GSL genes have been observed to pleiotropically alter these traits (Kerwin et al., 2011). In the field, both of these phenotypes were significantly affected by the GSL genetic variation in our synthetic population, indicating that aliphatic GSL genes can have pleiotropic impacts beyond the aliphatic GSL pathway that can be observed in natural settings (Table 3 and Table 3—source data 1). Therefore, it is possible that either of these phenotypes drives the observed variation in fitness of GSL genotypes across these environments. To test this, we calculated genetic correlations using the genotypic means for absolute fitness, flowering time and total indole GSL within each environment (Table 7). We did not observe a significant correlation between absolute fitness and our pleiotropic traits, using either parametric or non-parametric approaches, in any of our three environments (Table 7). This indicates that while the GSL genes are causing pleiotropic effects, these pleiotropic effects are probably not driving the observed fitness consequences of the GSL genotypes in our field trials.
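As an illustration of correlating genotypic means with a parametric and a non-parametric method, the sketch below computes a Pearson and a Spearman coefficient in plain Python. The article does not name the specific tests used, so these two coefficients are an assumption, and the input values are invented.

```python
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Invented genotypic means for one environment, for illustration only.
fitness = [410.0, 380.0, 455.0, 300.0, 395.0]
flowering_time = [31.0, 29.5, 30.0, 33.0, 28.0]
r_parametric = pearson(fitness, flowering_time)
r_rank = spearman(fitness, flowering_time)
```

A significance test on either coefficient would still be needed to reproduce a table such as Table 7; that step is omitted here.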
Non-random variation of GSL loci among field collected accessions
To test if natural Arabidopsis accessions show a pattern of variation consistent with fluctuating selection, we determined the GSL haplotype for a global collection of accessions using their GSL profile (Figure 2). Using the validated GSL phenotype caused by genetic variation at the eight causal genes for the aliphatic GSL pathway, we assigned a GSL haplotype to each Arabidopsis accession, given its GSL profile (Table 1 and Figure 1). Using the available GSL profile information, the underlying allelic state at each of the eight genes was assigned as functional or non-functional, based on the presence or absence of different GSL structures as well as their relative abundances. This identified 18 distinct aliphatic GSL haplotypes among the set of 144 natural Arabidopsis accessions, observed at different frequencies (Figure 9 and Figure 9—source data 1). Using the observed single locus allelic frequencies, we calculated the expected GSL haplotype frequencies for each of the 18 multi-locus genotypes (Figure 9—source data 1). These expected frequencies for the GSL genotypes represent theoretical frequencies that would be expected if no selection gradient acted upon GSL variation and no genetic drift, migration or other non-selective effect upon population structure biased the allele distribution. Comparison of the observed and expected frequencies showed that the observed distribution was highly non-random (p < 0.001) (Figure 9 and Figure 9—source data 1). Further, specific multi-locus GSL genotypes occurred significantly more or less often than expected (Figure 9 and Figure 9—source data 1). Thus, the non-random variation of GSL haplotypes within the Arabidopsis accessions supports the observations from the empirical field trials. It is also possible that this observed non-random variation is caused by non-selective processes such as migration, population structure and/or local bottlenecks. 
Significant future efforts will be required to test the extent to which this non-random variation is caused by neutral demographic processes vs potential fluctuating selection.
- Figure 9—source data 1: https://doi.org/10.7554/eLife.05604.043
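The expected multi-locus haplotype frequencies described above can be illustrated as the product of single-locus allele frequencies under independence. The sketch below is a hedged example: the allele frequencies and counts are invented, only two of the eight loci are used, and the chi-square statistic is an assumed way of comparing observed and expected values, since the article does not name the test behind the reported p value.

```python
from math import prod


def expected_haplotype_frequency(haplotype, functional_freq):
    """Product of single-locus allele frequencies, assuming independent loci.

    haplotype: {locus: True if the functional allele is present}
    functional_freq: {locus: frequency of the functional allele}
    """
    return prod(
        functional_freq[locus] if functional else 1.0 - functional_freq[locus]
        for locus, functional in haplotype.items()
    )


def chi_square_statistic(observed_counts, expected_freqs):
    """Sum of (observed - expected)^2 / expected over all haplotypes."""
    total = sum(observed_counts)
    return sum(
        (obs - total * freq) ** 2 / (total * freq)
        for obs, freq in zip(observed_counts, expected_freqs)
    )


# Invented allele frequencies for two loci, for illustration only.
functional_freq = {"AOP2": 0.5, "MAM1": 0.8}
haplotypes = [
    {"AOP2": True, "MAM1": True},
    {"AOP2": True, "MAM1": False},
    {"AOP2": False, "MAM1": True},
    {"AOP2": False, "MAM1": False},
]
expected = [expected_haplotype_frequency(h, functional_freq) for h in haplotypes]
observed = [30, 20, 30, 20]  # invented counts out of 100 accessions
statistic = chi_square_statistic(observed, expected)
```

With these invented numbers the expected counts are 40, 10, 40 and 10, so the two haplotypes carrying the non-functional MAM1 allele are over-represented relative to the independence expectation.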
Discussion
Ecologically and evolutionarily important traits often show considerable phenotypic variation in nature that is quantitative, polygenic and interacts with the environment. A clear example of this is aliphatic GSL accumulation in Arabidopsis, which is highly polygenic and environmentally dependent (Figures 1, 4, 5). However, it has been complicated to validate that specific polymorphic loci within a pathway are the actual causative basis of any changes in fitness due to the use of polygenic populations (Lande and Arnold, 1983). In this study, using a single gene manipulation approach that has allowed us, over the past decade, to recreate natural allelic diversity in the aliphatic GSL pathway, we have shown that GSL genetic variation at numerous loci directly impacts Arabidopsis fitness in the field (Table 1, Figure 7, Figure 7—figure supplement 1, Figure 8). Because we have only manipulated the GSL genes within an otherwise isogenic background, we can directly conclude that it is these specific genes and their GSL phenotypes that are determining the differences in fitness in the field. Further experiments will optimally generate the full 256-line matrix containing all combinations of alleles between all loci to fully interrogate the effects of all loci in all possible backgrounds. We should also note that even with all of our efforts to clean up the respective backgrounds and validate that the mutant phenotypes are similar to the segregating natural genotypes, it remains possible that some of the observed effects are caused by unexpected changes in the lines.
More difficult however is to ascribe the specific selective forces acting on this GSL variation to produce a fitness effect. GSL are known plant defensive compounds and variation in GSL genotype was shown to significantly impact GSL profiles, leaf damage and fitness in the field (Table 3). While GSL variation did alter measured leaf damage in the field, the patterns did not fully reflect the relative fitness spectrum of these same genotypes (Figures 6–8). One possibility is that our experiment, even with 20 blocks (10 control/10 pesticide treated) per field trial, was still insufficient to identify the underlying link, suggesting the need for larger experiments. Another possibility is that there were different herbivore populations between these environments, which agrees with the observation that there was no genetic correlation of herbivore resistance across the three field trials (Table 5). The fact that different GSLs defend against different herbivores would complicate finding the specific link between GSL loci and a population of herbivores (Kroymann et al., 2003; Falk and Gershenzon, 2007; Pfalz et al., 2007; Hansen et al., 2008; Falk et al., 2014). Additionally, our herbivory measures are limited to foliar damage, which obfuscates any potential interactions between GSL genotype and root pathogens. Supporting this idea, previous studies have found that GSL can influence a number of root pathogens and commensal microbes (Bending and Lincoln, 2000; van Dam et al., 2008; Bressan et al., 2009; Millet et al., 2010; Witzel et al., 2013). While these organisms could directly impact plant fitness, this interaction is highly difficult to detect or control in field trials.
In addition to unmeasured biotic stresses, there is the potential for causal links between GSL genes and abiotic pressures. We showed that the GSL genes have pleiotropic effects on development, such as flowering time, that had no link to fitness in our experiments but could impact fitness in other environments. Similarly, previous work has shown that individual GSL structures directly modulate stomatal closure in response to wounding (Zhao et al., 2008). Furthermore, analysis of natural variation and validation lines showed that GSL structure and amount can influence the circadian clock and flowering time (Kerwin et al., 2011). Other experiments have also identified a potential for regulatory roles with indole GSL (Clay et al., 2009). Thus, these are not indirect pleiotropies but direct regulatory links whereby GSLs may influence the plant's abiotic responses, potentially to alter the biotic interactions. It is therefore possible that the observed GSL to fitness links result from a complex web of biotic and abiotic effects. Identifying the specific selective agents affected by GSL variation will require the development of techniques for rapid and systematic identification of all foliar and root herbivores and microbes from field samples as well as a complete physiological and developmental analysis of the plant within the field. This is especially critical as the specific agents of selection may be highly variable across environments.
Within our multiple field trials, we found that effects of GSL genes on fitness are highly dependent upon the environment in which the experiment is conducted (Table 6). The fitness effects of the naturally polymorphic GSL genes were such that each environment had a different optimal set of GSL genotypes (Figure 7, Figure 7—figure supplement 1, Figure 8). Similarly, no particular GSL genotype had the maximal fitness in all environments (Figure 7, Figure 7—figure supplement 1, Figure 8). This suggests that the GSL defense pathway might be a system in which genetic variation could be stabilized by fluctuating selection across the environments. Fully exploring this hypothesis will require extensive assessment of genetic variation at the polymorphic GSL loci within natural populations and more extensive field trials of this synthetic population that recreates natural diversity at these same loci.
Within species that are highly but not exclusively selfing, such as A. thaliana, temporal variation in selection is not solely sufficient to maintain genetic diversity (Dempster, 1955; Bomblies et al., 2010). This would require either spatial variation in fitness and/or variation within a seed bank to provide extra drive for the system (Dempster, 1955; Turelli et al., 2001; Turelli and Barton, 2004). Recent work has begun to show that Arabidopsis has a robust multi-generational seed bank in natural populations (Lundemo et al., 2009; Bomblies et al., 2010). Further, there is extensive allelic variation within small local regions that contain different habitats, that would likely experience different insect pressures, providing the potential for spatial variation in fitness (Bomblies et al., 2010). Thus, both conditions necessary for fluctuating selection to maintain diversity in Arabidopsis exist, but we do not yet know enough about the extent of the seed bank or spatial variation in selection within Arabidopsis to fully model the system. This shows that a greater understanding of life history traits, seed bank history and migration rates in natural populations of Arabidopsis is necessary to determine if fluctuating selection is contributing to the maintenance of variation in this species.
Conclusions
Based on our measures of fitness in the field, we showed that GSL variation can control fitness within the field. These fitness effects were not driven by pleiotropic phenotypes like flowering, but the specific selective pressures driving these fitness differences remain to be identified. Identifying these pressures will require vastly larger surveys of natural populations and long-term field trials. Using the empirical values for fitness, we could show that the GSL system within these environments fits models where fluctuating selection can maintain standing polygenic variation. Further trials are required to test if this applies across a broader range of environments. This would require more field trials using our synthetic population to provide the capacity to empirically evaluate models of maintenance of standing variation and its influence on adaptation (Gillespie and Turelli, 1989; Orr, 1998; Agrawal, 2001). It remains to be directly tested if similar evolutionary processes drive evolution of other ecologically important traits that must respond to fluctuating environmental conditions such as pathogen populations and water availability.
Materials and methods
Synthetic laboratory population generation
The following eight loci in the aliphatic GSL pathway were modified in the synthetic laboratory population of A. thaliana genotypes: AOP2 (At4g03060), ESP (At1g54040), MYB28 (At5g61420), MYB29 (At5g07690), GSOH (At2g25450), MAM1 (At5g23010), GSOX1 (At1g65860), GSOX3 (At1g62560). The following knockout or complementation lines for these loci in A. thaliana Col-0 were used to generate the lab population: AOP2 = 35S:AOP2 (Li and Quiros, 2003), ESP = 35S:ESP (Burow et al., 2006), MYB28 = SALK_136312 (Sønderby et al., 2007), MYB29 = SM.34316 (Hirai et al., 2007), GS-OH = SALK_09807 (Hansen et al., 2008), MAM1 = EMS mutant line gsm1 (Haughn et al., 1991), GSOX1 = SALK_079493 (Li et al., 2008), GSOX3 = CSHL_GT13906 (Li et al., 2008). Mutant lines were manually crossed to each other to generate a population of plants containing homozygous combinations of mutations in the different genes mentioned above, representing a subset of the potential variation in the aliphatic GSL pathway observed among Arabidopsis accessions (Table 2). Individuals were genotyped via PCR using the primers and reaction conditions listed below.
Reaction conditions for group 1
Parameter | Initial melting | 32 cycles: denaturation | 32 cycles: annealing | 32 cycles: extension | Final extension |
---|---|---|---|---|---|
Temperature | 94°C | 94°C | 60°C | 72°C | 72°C |
Time | 30 s | 30 s | 45 s | 90 s | 10 min |
Reaction conditions for group 2
Parameter | Initial melting | 30 cycles: denaturation | 30 cycles: annealing | 30 cycles: extension | Final extension |
---|---|---|---|---|---|
Temperature | 94°C | 94°C | 61°C | 72°C | 72°C |
Time | 30 s | 30 s | 45 s | 90 s | 10 min |
Reaction conditions for group 3
Parameter | Initial melting | 30 cycles: denaturation | 30 cycles: annealing | 30 cycles: extension | Final extension |
---|---|---|---|---|---|
Temperature | 94°C | 94°C | 65°C | 72°C | 72°C |
Time | 45 s | 45 s | 45 s | 90 s | 10 min |
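As a worked example of reading the three reaction-condition tables, the sketch below totals the programmed hold times for each primer group. It assumes each table describes an initial melting step, a three-step cycle repeated the stated number of times, and a final extension. Ramp times between temperatures are ignored, so real run times would be longer.

```python
def program_seconds(initial, cycle_steps, n_cycles, final):
    """Total programmed hold time in seconds, excluding temperature ramps."""
    return initial + n_cycles * sum(cycle_steps) + final


# Hold times in seconds, transcribed from the tables above.
PROGRAMS = {
    "group 1": dict(initial=30, cycle_steps=(30, 45, 90), n_cycles=32, final=600),
    "group 2": dict(initial=30, cycle_steps=(30, 45, 90), n_cycles=30, final=600),
    "group 3": dict(initial=45, cycle_steps=(45, 45, 90), n_cycles=30, final=600),
}

totals = {name: program_seconds(**program) for name, program in PROGRAMS.items()}
minutes = {name: seconds / 60 for name, seconds in totals.items()}
```

Under these assumptions the three programs hold for 5910, 5580 and 6045 s respectively, or roughly 93 to 101 min each.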
Experimental settings
Field trials were conducted in two locations, one of them over 2 years, giving three separate environments in total. The first field trial was performed at the University of Wyoming (UWY) in Laramie, WY during Summer 2011, the second at UC Davis in Davis, CA during Spring 2012, and the third at UWY during Summer 2012. Seeds were sown into flats with 2 inch 50-celled inserts using Sunshine #5 (Sungro, Agawam, MA) potting soil containing slow release fertilizer and stratified at 4°C for 4 days before being transferred into the greenhouse at either the University of Wyoming in Laramie (UWY) or the University of California at Davis (UCD) to facilitate germination synchrony. In the UWY greenhouse, plants received a 15 hr light/9 hr dark natural photoperiod with temperatures fluctuating diurnally from 10°C to 30°C. In the UCD greenhouse, plants received a 14 hr light/10 hr dark natural photoperiod with temperatures fluctuating from 15°C to 35°C. Further, starting all the plants in the greenhouse minimizes variation in the initial seedling conditions. After germination, seedlings were thinned to one per pot and GSL genotypes were randomly organized into 40 blocks per field trial, for a total of 120 blocks and 120 replicates of each GSL genotype. Individuals were transplanted from the greenhouse into the field 2 weeks post germination. A single plant of each genotype was present in each block in all three environments and blocks were arranged into four rows of ten blocks each (Figure 3). Each row of 10 blocks is referred to as a plot, so that there were four plots per field trial and 12 plots total. Within each plot is nested a treatment by environment combination. Every 14 days, two plots (20 blocks total) per environment were treated with pesticides to decrease leaf damage due to herbivory. At UWY, plants were sprayed with the insecticide Sevin (GardenTech, Palatine, IL) to repel flea beetles. 
At UCD, plants were treated with Marathon 1% granular (OHP, Mainland, PA) and Lily Miller Slug, Snail & Insect Killer Bait (Lily Miller Brands, Walnut Creek, CA). The plants were allowed to grow in the field for 4 weeks before being harvested. At harvest, the aerial portion of each plant was collected from the field, placed into a quart sized freezer bag and transferred to 4°C for temporary storage. Once the harvest was complete, the UCD field plants were immediately placed at −80°C for storage. The UWY field plants were shipped to UC Davis overnight on dry ice and then placed at −80°C for storage.
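The block and plot structure described above can be sketched as a small layout generator. This is an illustrative reconstruction and not the authors' randomization procedure: the genotype labels are placeholders, the seed is arbitrary, and choosing the two treated plots at random is an assumption because the text does not say how they were assigned.

```python
import random

# Placeholder labels for the 17 GSL genotypes named in the statistical methods.
GENOTYPES = [f"G{i:02d}" for i in range(1, 18)]


def make_field_layout(genotypes, n_plots=4, blocks_per_plot=10,
                      n_treated_plots=2, seed=0):
    """Return one record per block for a single field trial."""
    rng = random.Random(seed)
    # Assumption: treated plots are drawn at random from the four plots.
    treated = set(rng.sample(range(n_plots), n_treated_plots))
    layout = []
    for plot in range(n_plots):
        treatment = "pesticide" if plot in treated else "control"
        for block in range(blocks_per_plot):
            positions = list(genotypes)
            rng.shuffle(positions)  # one plant per genotype, random order
            layout.append({
                "plot": plot,
                "block": plot * blocks_per_plot + block,
                "treatment": treatment,
                "positions": positions,
            })
    return layout


layout = make_field_layout(GENOTYPES)
n_treated_blocks = sum(b["treatment"] == "pesticide" for b in layout)
```

Each of the 40 blocks holds exactly one plant per genotype, and half of the blocks fall in pesticide-treated plots, which matches the counts given in the text.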
GSL extraction, HPLC separation and GSL structure identification
GSL were measured on all field trial plants to assess field effects of the genotypes on GSL accumulation. At approximately 4 weeks of age, a single, fully expanded, green leaf was collected from each plant. In order to measure leaf area of each sample, leaves from twelve plants at a time were placed on a white sheet of paper with a grid overlay. A ruler was placed on the sheet of paper below the leaves and digitally imaged using a Nikon D3100 (Tokyo, Japan). The photographed leaves were then placed directly into 96 deep well plates containing 400 μl 90% methanol and stored in the freezer until extraction. For the UWY field trial, the leaves were stored at −20°C for 3–4 weeks and shipped overnight to Davis, CA on dry ice. For the Davis field trial, all plates were stored at −20°C until extraction. After harvest, desulfoglucosinolates were extracted from all samples using a high-throughput protocol briefly described below (Kliebenstein et al., 2001a). One gram of Sephadex DEAE A-25 (Sigma–Aldrich, St. Louis, MO) was added to each well of a 96 well filter plate using a column loader. To hydrate the Sephadex, 300 μl of H2O was transferred to each well using a multichannel pipet and allowed to incubate at room temperature for 1 hr. Excess H2O was removed from the Sephadex by placing the filter plate on top of a 96 deep well discard plate (used to catch the flow through) and centrifuging at 1000 rpm for 2 min. To homogenize the samples, 96 deep well plates containing a single A. thaliana leaf, two 2.3 mm ball bearings and 400 μl of 90% methanol in each well were shaken in a Harbil 5-Gallon Mixer (Fluid Management Co., Wheeling, IL) for 3–5 min. Plates were centrifuged at 4000 rpm for 20 min. 
To bind GSL to Sephadex, 150 μl of supernatant from each well (containing the extracted organic compounds) was transferred to the corresponding well of the 96 well filter plate containing hydrated Sephadex and centrifuged at 1200 rpm for 3 min on top of the 96 deep well discard plate. To wash away all the non-binding organic compounds from the Sephadex, 150 μl of 90% methanol was added to each well and the plate was centrifuged at 1200 rpm for 3 min. To remove excess methanol, two wash steps were conducted by adding 150 μl of H2O to the plate followed by centrifugation at 1200 rpm for 3 min. To release the GSL from the Sephadex, 10 μl of Sulfatase (Sigma–Aldrich) and 100 μl of water were added to each well of the 96 well filter plate then incubated overnight in the dark. The desulfoglucosinolates were then eluted into a 96 well round bottom plate via centrifugation at 1200 rpm for 3 min. For each GSL sample, 50 μl of the 110 μl of extract was injected on an Agilent 1100 HPLC (Agilent, Santa Clara, CA) using a Lichrocart 250–4 RP18e column (Hewlett–Packard, Palo Alto, CA). Individual GSL compounds were detected at 229 nm and separated utilizing the following program with an aqueous acetonitrile gradient: a 6-min gradient from 1.5% to 5.0% (vol/vol) acetonitrile, followed by a 2-min gradient from 5% to 7% (vol/vol) acetonitrile, a 7-min gradient from 7% to 25% (vol/vol) acetonitrile, a 2-min gradient from 25% to 92% (vol/vol) acetonitrile, 6 min at 92% (vol/vol) acetonitrile, a 1-min gradient from 92% to 1.5% (vol/vol) acetonitrile, and a final 5 min at 1.5% (vol/vol) acetonitrile (Kliebenstein et al., 2001a). For each peak, the GSL structure was determined by comparing the retention time and UV absorption spectrum with known standards. 
The integral under each peak was automatically calculated and this value in milli-absorption units was converted to picomoles/mm2 tissue using response factor slopes determined from purified standards and area of leaf tissue used per sample as measured by ImageJ (Kliebenstein et al., 2001a; Reichelt et al., 2002).
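The acetonitrile program described above can be written down as a list of segments, which makes the total run time and the solvent composition at any time point easy to check. The sketch below assumes each gradient segment is a linear ramp, which the text does not state explicitly. The function names are hypothetical.

```python
# (duration in min, start % acetonitrile, end % acetonitrile), from the text.
GRADIENT = [
    (6, 1.5, 5.0),
    (2, 5.0, 7.0),
    (7, 7.0, 25.0),
    (2, 25.0, 92.0),
    (6, 92.0, 92.0),
    (1, 92.0, 1.5),
    (5, 1.5, 1.5),
]


def total_runtime(gradient=GRADIENT):
    """Total program length in minutes."""
    return sum(duration for duration, _, _ in gradient)


def acetonitrile_percent(t, gradient=GRADIENT):
    """Acetonitrile % (vol/vol) at time t in minutes, assuming linear ramps."""
    if t < 0:
        raise ValueError("time must be non-negative")
    elapsed = 0.0
    for duration, start, end in gradient:
        if t <= elapsed + duration:
            fraction = (t - elapsed) / duration
            return start + fraction * (end - start)
        elapsed += duration
    raise ValueError("time is beyond the end of the program")


runtime = total_runtime()
midpoint_first_ramp = acetonitrile_percent(3)
```

Summing the segments gives a 29-min program per injection, with the 92% plateau running from minute 17 to minute 23.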
Leaf damage measurements in the field
Leaf damage estimates were visually taken in the field at 4 weeks of age, just before tissue collection for GSL extraction. A scale of 0–10 was used to score the amount of pest damage incurred on each plant, with 0 representing no damage and 10 representing complete destruction of the plant (i.e., the plant completely eaten).
Absolute fitness and relative fitness
Absolute fitness was calculated as total fruit count (TFC) × silique length × survival. TFC was measured as the count of fruits (siliques) + flowers + buds per individual. Silique length was measured in ImageJ from digital images of harvested field plants taken using a Nikon D3100 as follows: each plant was placed flat on a white sheet of paper next to a ruler and pictures were taken using auto focus. After setting the scale in ImageJ using the ruler placed in each image, the segmented line tool was used to draw a line from the pedicel to the tip of the silique. For each plant, eight siliques were measured at random and these values were averaged to get a value for each plant. Survival was scored on a binary (0–1) scale. Plants that germinated, were transplanted into the field and survived to harvest were given a survival score of 1 and plants that germinated and were transplanted but did not survive to harvest were given a score of 0. Individuals that did not germinate or did not survive to transplantation were given an NA. Relative fitness was calculated for each GSL genotype within each environment relative to Col-0. To do this, average absolute fitness of a GSL genotype was divided by the average absolute fitness of Col-0 within an environment. Col-0 was chosen as the reference genotype given that it is the background genotype.
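The two fitness definitions in this section can be expressed as a short Python sketch. It is an illustration under stated assumptions, not the authors' code: the function names and the per-plant numbers are invented, NA is represented as `None`, and plants scored as non-survivors are given a fitness of zero, which follows from multiplying by a survival score of 0.

```python
from statistics import mean


def absolute_fitness(total_fruit_count, silique_length, survival):
    """TFC x mean silique length x survival (1, 0, or None for NA)."""
    if survival is None:
        return None  # never reached the field, so excluded from the analysis
    if survival == 0:
        return 0.0   # transplanted but did not survive to harvest
    return total_fruit_count * silique_length * survival


def relative_to_reference(genotype_values, reference_values):
    """Mean absolute fitness of a genotype divided by that of the reference."""
    return mean(genotype_values) / mean(reference_values)


# Invented per-plant values within one environment, for illustration only.
mutant = [absolute_fitness(40, 12.5, 1), absolute_fitness(0, 0.0, 0)]
col0 = [absolute_fitness(32, 12.5, 1), absolute_fitness(48, 12.5, 1)]
ratio = relative_to_reference(mutant, col0)
```

In this example the hypothetical mutant averages 250 and the Col-0 reference averages 500, giving a relative fitness of 0.5. Dropping the survival term from `absolute_fitness` would give the "excluding survivorship" variant mentioned in the results.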
Statistical analysis methods
All statistical analyses were carried out using the R statistical computing language (R Core Team, 2014). The field trial was conducted in a split plot design with each plot nested within treatment by environment. We used a restricted maximum likelihood (REML) approach to fit a linear mixed effects model to the field traits and partition the variation of each among the fixed effects (genotype, environment and treatment) and the random factor (plot nested within treatment and environment). There were 17 genotypes, where genotype refers to the GSL genotype in the synthetic laboratory population. There were three environments: Wyoming 2011, Wyoming 2012 and Davis 2012. The two treatments were control and pesticide treated. We had 4 plots per environment (2 in each treatment group) for a total of 12 plots. We used the following formula to fit this model using the lme4 package in R (Baayen et al., 2008):
lmer(Trait ∼ Genotype*Environment*Treatment + (1|Plot(Treatment:Environment))).
The Anova function from the car package in R was utilized to determine which fixed effects variables significantly altered the mean of each trait (p value <= 0.05) (Fox and Weisberg, 2011). We estimated population means (i.e., LSMeans) of each field trait for all genotypes across treatment and environment using the LSmeans function from the doBy package in R (Højsgaard et al., 2013). Dunnett's multiple comparison testing was performed on the traits to determine which genotypes had significantly different means than Col-0, our reference genotype, using the glht function from the multcomp package in R (Hothorn et al., 2014). Additionally, Tukey's multiple comparison was performed on the traits to compare all the genotypes to all the other genotypes for significant differences using the same glht function from the multcomp package in R (Hothorn et al., 2014). PCA was conducted using the princomp function from the stats package (R Core Team, 2014).
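To make the nesting and the size of the comparison families concrete, the sketch below enumerates the plots implied by the design and counts the contrasts involved in the two multiple comparison procedures. It is written in Python rather than the R used in the study, the plot labels are hypothetical, and it assumes Col-0 is one of the 17 genotypes.

```python
from itertools import product

ENVIRONMENTS = ("UWY2011", "UCD2012", "UWY2012")
TREATMENTS = ("control", "pesticide")
PLOTS_PER_CELL = 2  # two plots per treatment within each environment


def nested_plot_ids():
    """Label each plot by the environment and treatment it is nested in,
    so that plot 1 in one cell is never confused with plot 1 in another."""
    return [
        f"{env}:{trt}:plot{p}"
        for env, trt in product(ENVIRONMENTS, TREATMENTS)
        for p in range(1, PLOTS_PER_CELL + 1)
    ]


def n_comparisons(n_genotypes):
    """Contrasts tested by Dunnett (each vs. reference) and Tukey (all pairs)."""
    dunnett = n_genotypes - 1
    tukey = n_genotypes * (n_genotypes - 1) // 2
    return dunnett, tukey


plots = nested_plot_ids()
dunnett_tests, tukey_tests = n_comparisons(17)
```

The 12 unique plot labels correspond to the 12 levels of the random factor, and with 17 genotypes the Tukey family is much larger than the Dunnett family (136 versus 16 contrasts), which is why the two procedures can disagree on borderline differences.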
Data availability
- Data from: Natural Genetic Variation in Arabidopsis thaliana Defense Metabolism Genes Modulate Field Fitness. Available at Dryad Digital Repository under a CC0 Public Domain Dedication.
References
- Suppressing potato cyst nematode, Globodera rostochiensis, with extracts of Brassicacea plants. American Journal of Potato Research 86:327–333. https://doi.org/10.1007/s12230-009-9086-y
- Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language 59:390–412. https://doi.org/10.1016/j.jml.2007.12.005
- Effects of genetic drift on variance components under a general model of epistasis. Evolution; International Journal of Organic Evolution 58:2111–2132. https://doi.org/10.1111/j.0014-3820.2004.tb01591.x
- Inhibition of soil nitrifying bacteria communities and their activities by glucosinolate hydrolysis products. Soil Biology and Biochemistry 32:1261–1269. https://doi.org/10.1016/S0038-0717(00)00043-2
- Regulatory networks of glucosinolates shape Arabidopsis thaliana fitness. Current Opinion in Plant Biology 13:348–353. https://doi.org/10.1016/j.pbi.2010.02.002
- Whole-genome sequencing of multiple Arabidopsis thaliana populations. Nature Genetics 43:956–963. https://doi.org/10.1038/ng.911
- Book: On the origin of species by means of natural selection, or the preservation of favoured races in the struggle for life. London: John Murray.
- α-Keto acid elongation and glucosinolate biosynthesis in Arabidopsis thaliana. Theoretical and Applied Genetics 101:429–437. https://doi.org/10.1007/s001220051500
- Maintenance of genetic heterogeneity. Cold Spring Harbor Symposia on Quantitative Biology 20:25–32. https://doi.org/10.1101/SQB.1955.020.01.005
- The desert locust, Schistocerca gregaria, detoxifies the glucosinolates of Schouwia purpurea by desulfation. Journal of Chemical Ecology 33:1542–1555. https://doi.org/10.1007/s10886-007-9331-0
- Genotype-environment interactions and the maintenance of polygenic variation. Genetics 121:129–138.
- Biology and biochemistry of glucosinolates. Annual Review of Plant Biology 57:303–333. https://doi.org/10.1146/annurev.arplant.57.032905.105228
- Genetic polymorphism in heterogeneous environments. Annual Review of Ecology and Systematics 7:1–32. https://doi.org/10.1146/annurev.es.07.110176.000245
- Omics-based identification of Arabidopsis Myb transcription factors regulating aliphatic glucosinolate biosynthesis. Proceedings of the National Academy of Sciences of USA 104:6478–6483. https://doi.org/10.1073/pnas.0611629104
- doBy–groupwise summary statistics, LSmeans, general linear contrasts, various utilities.
- Simultaneous inference in general parametric models. Biometrical Journal 50:346–363. https://doi.org/10.1002/bimj.200810425
- Comparative population genomics of maize domestication and improvement. Nature Genetics 44:808–811. https://doi.org/10.1038/ng.2309
- Comparative quantitative trait loci mapping of aliphatic, indolic and benzylic glucosinolate production in Arabidopsis thaliana leaves and seeds. Genetics 159:359–370.
- Genetic control of natural variation in Arabidopsis glucosinolate accumulation. Plant Physiology 126:811–825. https://doi.org/10.1104/pp.126.2.811
- Comparative analysis of quantitative trait loci controlling glucosinolates, myrosinase and insect resistance in Arabidopsis thaliana. Genetics 161:325–332.
- Fitness effects associated with the major flowering time gene FRIGIDA in Arabidopsis thaliana in the field. The American Naturalist 169:E141–E157. https://doi.org/10.1086/513111
- Evolutionary dynamics of an Arabidopsis insect resistance quantitative trait locus. Proceedings of the National Academy of Sciences of USA 100:14587–14592. https://doi.org/10.1073/pnas.1734046100
- The measurement of selection on correlated characters. Evolution 37:1210–1226. https://doi.org/10.2307/2408842
- Genetic equilibrium when more than one ecological niche is available. The American Naturalist 87:331–333. https://doi.org/10.1086/281792
- In planta side-chain glucosinolate modification in Arabidopsis by introduction of dioxygenase Brassica homolog BoGSL-ALK. Theoretical and Applied Genetics 106:1116–1121. https://doi.org/10.1007/s00122-002-1161-4
- Subclade of flavin-monooxygenases involved in aliphatic glucosinolate biosynthesis. Plant Physiology 148:1721–1733. https://doi.org/10.1104/pp.108.125757
- Both naturally occurring insertions of transposable elements and intermediate frequency polymorphisms at the achaete-scute complex are associated with variation in bristle number in Drosophila melanogaster. Genetics 154:1255–1269.
- In vitro fungitoxic activity of some glucosinolates and their enzyme-derived products toward plant pathogenic fungi. Journal of Agricultural and Food Chemistry 45:2768–2773. https://doi.org/10.1021/jf9608635
- Arabidopsis thaliana. Annual Review of Genetics 21:93–111. https://doi.org/10.1146/annurev.ge.21.120187.000521
- The molecular basis of quantitative genetic variation in natural populations. Trends in Ecology & Evolution 10:324–327. https://doi.org/10.1016/S0169-5347(00)89119-3
- Quantitative genetics in natural plant populations: a review of the theory. The American Naturalist 127:379–402. https://doi.org/10.1086/284490
- Which evolutionary processes influence natural genetic variation for phenotypic traits? Nature Reviews Genetics 8:845–856. https://doi.org/10.1038/nrg2207
- Fitness impacts of herbivory through indirect effects on plant-pollinator interactions in Oenothera macrocarpa. Ecology 81:30–40.
- Molecular population genetics of the Arabidopsis CAULIFLOWER regulatory gene: Nonneutral evolution and naturally occurring variation in floral homeotic function. Proceedings of the National Academy of Sciences of USA 95:8130–8134. https://doi.org/10.1073/pnas.95.14.8130
- Molecular population genetics of floral homeotic loci: Departures from the equilibrium-neutral model at the APETALA3 and PISTILLATA genes of Arabidopsis thaliana. Genetics 151:839–848.
- Complex patterns of local adaptation in teosinte. Genome Biology and Evolution 5:1594–1609. https://doi.org/10.1093/gbe/evt109
- Perspectives on ecological and evolutionary systems biology. Annual Plant Reviews 35:331–351.
- A latitudinal cline in flowering time in Arabidopsis thaliana modulated by the flowering time gene FRIGIDA. Proceedings of the National Academy of Sciences of USA 101:4712–4717. https://doi.org/10.1073/pnas.0306401101
- Book: R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing.
- Heritable genetic variation via mutation-selection balance: Lerch's zeta meets the abdominal bristle. Theoretical Population Biology 25:138–193. https://doi.org/10.1016/0040-5809(84)90017-0
- Heterogeneous selection at specific loci in natural environments in Arabidopsis thaliana. Genetics 165:321–329.
- Genetic diversity and population structure of the serpentine endemic Calystegia collina (Convolvulaceae) in northern California. American Journal of Botany 87:1138–1146. https://doi.org/10.2307/2656650
- Using knockout mutants to reveal the growth costs of defensive traits. Proceedings Biological Sciences/The Royal Society 278:2598–2603. https://doi.org/10.1098/rspb.2010.2475
Article and author information
Author details
Funding
National Science Foundation (NSF) (DGE 0653984)
- Rachel Kerwin
National Science Foundation (NSF) (DBI 0820580)
- Julie Feusier
- Jason Corwin
- Catherine Lin
- Alise Muok
- Brandon Larson
- Baohua Li
- Bindu Joseph
- Daniel Copeland
- Daniel J Kliebenstein
National Science Foundation (NSF) (MCB 1330337)
- Julie Feusier
- Jason Corwin
- Catherine Lin
- Alise Muok
- Brandon Larson
- Baohua Li
- Bindu Joseph
- Daniel Copeland
- Daniel J Kliebenstein
Danish National Research Foundation (DNRF99)
- Daniel J Kliebenstein
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Carlos Quiros, Ute Wittstock and Bjarne G Hansen for their generous donations of seed stocks and members of the Kliebenstein and Weinig labs for assistance in the field.
Copyright
© 2015, Kerwin et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.