Discarded sequencing reads uncover natural variation in pest resistance in Thlaspi arvense

Dario Galanti; Jun Hee Jung; Caroline Müller; Oliver Bossdorf

doi:10.7554/eLife.95510.2

eLife Assessment

This important study presents a significant methodological advance by leveraging previously discarded, unmapped DNA sequence reads to estimate pest infestation loads across plant accessions, and map variation in these apparent pest loads to defense genes. The bioinformatics approach is compelling, and the results should bear broad implications for phenotype-genotype prediction, especially regarding the use of unmapped reads for GWAS.

https://doi.org/10.7554/eLife.95510.2.sa3

Significance of findings

important: Findings that have theoretical or practical implications beyond a single subfield

landmark
fundamental
important
valuable
useful

Strength of evidence

compelling: Evidence that features methods, data and analyses more rigorous than the current state-of-the-art

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Understanding the genomic basis of natural variation in plant pest resistance is an important goal in plant science, but it usually requires large and labour-intensive phenotyping experiments. Here, we explored the possibility that non-target reads from plant DNA sequencing can serve as phenotyping proxies for addressing such questions. We used data from a whole-genome and -epigenome sequencing study of 207 natural lines of field pennycress (Thlaspi arvense) that were grown in a common environment and spontaneously colonized by aphids, mildew and other microbes. We found that the numbers of non-target reads assigned to the pest species differed between populations, had significant SNP-based heritability, and were associated with climate of origin and baseline glucosinolates content. Specifically, pennycress lines from cold and thermally fluctuating habitats, presumably less favorable to aphids, showed higher aphid DNA load, i.e. decreased aphid resistance. Genome-wide association analyses identified genetic variants at known defense genes but also novel genomic regions associated with variation in aphid and mildew DNA load. Moreover, we found several differentially methylated regions associated with pathogen loads, in particular differential methylation at transposons and hypomethylation in the promoter of a gene involved in stomatal closure, likely induced by pathogens. Our study provides first insights into the defense mechanisms of Thlaspi arvense, a rising crop and model species, and demonstrates that non-target whole genome sequencing reads, usually discarded, can be leveraged to estimate intensities of plant biotic interactions. With rapidly increasing numbers of large sequencing datasets worldwide, this approach should have broad application in fundamental and applied research.

Introduction

Plant pests, such as pathogens and herbivores, can cause major yield losses in crops and often require the massive use of pesticides to control their damage. Natural plant populations, on the other hand, are constantly exposed to such biotic stressors and their higher genetic diversity often allows these populations to become locally adapted. Since many pest species are sensitive to climatic conditions, their pressure on plant communities is spatially heterogeneous, maintaining geographically structured genetic variation in plant defenses (1,2). For these reasons, natural plant populations are highly suitable to study defense mechanisms and evolution of defenses, and also a very useful source of beneficial and resistance alleles for specific pathogens and environmental conditions. This genetic variation in defense-related genes can for example be screened through Genome Wide Association (GWA) (3–6) or approaches based on known candidate genes (2).

Many pest species are also highly sensitive to temporal variation in weather conditions. This temporal heterogeneity in pathogen pressure can induce plastic responses in plants, involving gene expression and epigenetic changes (7–9), which may also be studied through stress experiments (7–9). Some plastic epigenetic responses can have a transient stability and be transmitted to the next generations through inheritance of epigenetic marks (10–13). In particular, DNA methylation has been shown to respond to biotic and abiotic stresses through gene expression regulation and transposable elements (TEs) reactivation, and can be inherited across generations (9,12,14). In plants, DNA methylation can occur in the three sequence contexts CG, CHG and CHH (H being A, T or C), which differ in their molecular machineries depositing, maintaining and removing methylation and consequently also in their transgenerational stability (15,16). While CG methylation is usually more stable across generations, CHH methylation is less stable and more responsive to stress and the sensitivity of CHG methylation lies somewhere in between (15–17).

Whether inherited or induced, some strategies of plants for defense against pathogens and herbivores include: i) physical barriers such as reinforced cell walls, leaf protective layers or closing stomata, ii) production of specialized (secondary) metabolites that reduce palatability or are toxic to pests, iii) oxidative bursts, iv) the activation of signaling cascades to induce systemic responses and v) RNA interference mechanisms to silence pathogen genes (18–22). In Brassicaceae, a particularly important and diverse class of defense metabolites are glucosinolates, which often show local adaptation driven by variation in pests and can also be induced by herbivore and pathogen attacks (1,2,23).

Studying natural variation in plant resistance, along with associated genetic and epigenetic variation, can identify genes involved in defense and their regulators, including vital genes whose function cannot be determined through knockout experiments. Such knowledge, and especially the discovery of natural resistance alleles, are crucial sources for the breeding of more pest-resistant crop varieties. Nevertheless, because of the diversity of resistance mechanisms and their often multigenic nature, plant defense mechanisms remain difficult to study. In particular, antixenosis (the prevention of pathogen settlement) and antibiosis (the repression of pathogen growth and reproduction) require extensive and time-consuming phenotyping, based for example on choice (24) or settling (9) assays, and such assays are extremely challenging to perform on large collections. On the other hand, there are increasing numbers of large sequencing datasets, which may also be used to quantify contaminants or microbiome composition (25–27) and thus as proxies for resistance phenotyping. In this study we investigated such usage of exogenous reads, i.e. reads not mapping to the target reference genome, as a source of information for quantifying herbivore and pathogen abundance in large collections.

We worked with field pennycress (Thlaspi arvense), an annual plant in the Brassicaceae family that is increasingly studied as a model species (28–32) and new biofuel and winter cover crop (33–36). In a previous study, we investigated natural epigenetic variation in a collection of 207 Thlaspi lines from across Europe (32). Prior to their whole-genome and -epigenome sequencing these lines had been grown in a common environment, an open glasshouse where the plants were spontaneously colonized by aphids and powdery mildew, as well as by other microbes. At the time of sequencing, pathogen contamination was still very limited but appeared highly variable, and preliminary analyses showed that it resulted in sizeable amounts of non-target reads assigned to the pest species, i.e. contamination of the DNA samples. Inspired by other recent studies on non-target reads (25–27), we asked if there was systematic variation in the numbers of aphid and pathogen reads among different T. arvense lines, and whether these data, together with our whole-genome plant sequencing data, could provide insights into the genomic basis of plant resistance variation.

The goals of our study were thus two-fold: i) to contribute to a mechanistic understanding of pest resistance in Thlaspi arvense, and ii) to explore whether non-target reads from plant sequencing can be used as proxies for studying plant biotic interactions. Considering that we are moving towards an increasingly sequencing-prone world, with more and larger datasets being generated for many species (37–43), the use of non-target reads has very broad potential.

Results

Reads classification and species identification

Starting from our previously published sequencing data (32), the first step of our analysis was to separate the Whole Genome Sequencing (WGS) reads of each sample into the ~99.5% mapping to the Thlaspi arvense reference genome (29) and the ~0.5% that did not, hereafter called “exogenous reads” (Fig 1A). Initially, we used all mapped reads for calling variants in Thlaspi, but after some difficulties with Genome Wide Associations (see below) we suspected that some plant reads were false and mapped to the T. arvense genome only because of the high cross-taxa similarity of some genomic regions. We therefore remapped all reads to the genomes of the aphid Acyrthosiphon pisum, its endosymbiont Buchnera aphidicola and the powdery mildew Blumeria graminis, and found that, on average, 7.4% of the reads mapped to both T. arvense and at least one of the pests. We removed these ambiguous reads from our analyses and used only the T. arvense target reads, 92.1% on average, for variant calling (Fig 1A, S1 Table).

Classification of sequencing reads in *Thlaspi arvense* WGS data.
(A) Workflow of the analyses, including reads classification (orange nodes) into target, ambiguous and exogenous reads, and downstream analysis (dark blue nodes) (see Methods). (B) Fractions of exogenous reads assigned to different taxonomic groups by MG-RAST (44,45). (C) Read counts assigned to nine selected groups in our 207 *T. arvense* samples from different European regions. (D) Aphids and mildew occurring on *T. arvense* leaves during our experiment.

We next attempted a taxonomic classification of the exogenous reads, in multiple steps. First, we used MG-RAST (44,45) to assign reads to taxonomic groups based on public sequencing databases. Out of the 78% of the exogenous reads that passed the MG-RAST quality control (S1 Table) the majority belonged to bacteria and smaller fractions to fungi, plants and animals (Fig 1B and S2 Table). For subsequent group-level analyses, we then focused on nine taxonomic groups that occurred consistently within our samples (Fig 1C), and that were particularly abundant or relevant: Erysiphales (fungi), Peronosporales (oomycetes), Aphididae and Culicidae (both insects), and five bacterial families.

Visual inspection (Fig 1D) and other sources of information narrowed down the observed aphid and mildew species to a few candidates. For aphids we considered Acyrthosiphon pisum (indicated by MG-RAST), Myzus persicae (visual match, and a generalist attacking Brassicaceae (46)) and Brevicoryne brassicae (attacks Brassicaceae including Thlaspi (47)). For powdery mildew we considered Blumeria graminis (indicated by MG-RAST), and Erysiphe cruciferarum (attacks Brassicaceae (48)). To decide among these species, we then used a competitive mapping approach (49), where the exogenous reads were aligned to a pseudo-reference composed of the same DNA sequences from the different candidate species (see Methods for details, S3 and S4 Tables). The majority (77%) of the aphid reads mapped to M. persicae, 18% to B. brassicae, and 5% to A. pisum, while 98% of the mildew reads mapped to E. cruciferarum, with only 2% to the other mildew species (S5 Table). Based on these results, we concluded that the plants in our experiment had been attacked by Myzus persicae and Erysiphe cruciferarum.

Finally, to compare the power of a large database approach (MG-RAST) versus using specific reference genomes, we also remapped all exogenous reads to the M. persicae and B. aphidicola genome assemblies (50) (www.ncbi.nlm.nih.gov/assembly/GCA_001939165.1) and used the counts from these two mappings as additional phenotypes, besides the nine taxonomic groups selected through MG-RAST (Table 1, S6 Table).

Population differences and SNP-based heritability for different types of exogenous read counts.
Population differences were tested with a linear model, SNP-based heritabilities (and their confidence intervals) estimated with the R package *heritability* (51).

Exogenous read counts are a heritable Thlaspi phenotype

As we had observed that aphid and mildew infections in the glasshouse were not random, but prevalent on plants from some origins than others, i.e. possibly reflecting heritable variation in plant resistance, we next tested for population differences and SNP-based heritability in pest and microbiome read-counts. Prior to these analyses, to avoid biases caused by different sequencing depths, we corrected the read counts for the total numbers of deduplicated reads in each library and used the residuals as unbiased estimates of aphid, mildew and microbe loads (see Methods).

For most of the nine taxonomic groups, there were significant population effects, with 20-40% of the variance in read counts explained, as well as significant SNP-based heritability, typically in the range of 0.18 - 0.30 (Table 1). The highest heritability of 0.47 was for read counts of Erysiphales, indicating particularly strong variation for resistance to mildew. Both SNP-based heritability and population differences tended to be stronger for aphid and Buchnera data based on read mapping to the reference genomes than for those based on MG-RAST, demonstrating that the former method is stronger and thus preferable if high-quality genome assemblies are available.

An alternative explanation for different aphid and mildew loads in the greenhouse could be that variation in enemy densities in the field was transmitted to the greenhouse, through maternal carry-over effects, or even as seed contamination. However, we had recorded aphid and mildew occurrence during seed sampling in the field and found no significant differences in the glasshouse between the offspring of plants that had been attacked in the field versus those that had not (S1 Fig).

Aphid and mildew loads correlate with climate of origin and glucosinolates content of plants

Having established that our method most likely captured variation in plant resistance, we were interested in the ecological drivers of this variation. As climate is known to be a major influence on many biotic interactions as well as plant defenses (1,52), we correlated the observed read counts with the climates of origin of the plants. We found negative correlations between aphid read counts and several temperature variables, in particular annual minimum temperature (Fig 2A). Aphid read counts were also positively correlated with temperature variability, i.e. the diurnal and annual ranges of temperature (Fig 2A). In other words, plants from warmer and more stable climates had consistently lower levels of aphid infestation in our glasshouse, possibly because these plants had evolved greater resistance under such benign climatic conditions where aphids thrive. We found similar, although weaker patterns, for the number of Erysiphales reads. The other analysed taxonomic groups showed different and often weaker patterns of correlation with climate, except that the read counts of several bacterial groups were positively correlated with annual maximum temperature and in particular diurnal temperature range.

Relationships between climates of origin or glucosinolate levels of plants and the exogenous reads loads.
(A) Correlations with bioclimatic variables. (B) Correlations with baseline glucosinolate (GS) levels measured in the same pennycress lines in another experiment. All correlations in (A) and (B) were done after correction for population structure. Aphid-related read counts are in green, mildew-related in gray, others in black. (C) Boxplot of the aphid reads residuals in samples where benzyl GS was detected vs. not. The P value is based on the Welch’s t-test for unequal variances.

Since glucosinolates are major defense metabolites of Brassicaceae, and their variation could thus be an explanation for variance in plant resistance, we also tested for correlations between the baseline amounts of these metabolites and the frequencies of aphid and mildew reads. Glucosinolate levels were measured on the same T. arvense lines in a separate experiment not affected by pests (S7 Table). We found positive correlations of aphid read counts with sinigrin, an aliphatic glucosinolate which is by far the most abundant in the leaves of T. arvense, and a stronger negative correlation with benzyl glucosinolates (glucotropaeolin) (Fig 2B). Although the baseline levels of benzyl glucosinolates were very low and probably sometimes below the detection level, plant lines where benzyl glucosinolate was detected had significantly lower aphid loads in the glasshouse (Fig 2C). We also detected three indole glucosinolates, but these did not show any significant correlations with aphid loads.

GWA identifies peaks near defense genes

To further investigate the genetic basis of variation in aphid, mildew and microbe loads, we next employed GWA and tested for associations between exogenous read counts and biallelic genetic variants (SNPs and short INDELs). We corrected for population structure using an IBS matrix and only tested variants with Minor Allele Frequency (MAF) > 0.04 (see Methods). Initially, we called genetic variants using all reads that mapped to the T. arvense genome and found massive peaks in some highly conserved regions of the genome, which had very high mapping coverage (S2 Fig). We suspected that this might be because some non-Thlaspi reads were very similar to these highly conserved regions and, by mapping there, generated false variants only in samples containing many non-Thlaspi reads. We therefore identified and removed ambiguous reads prior to variant calling, which eliminated the observed massive GWA peaks, indicating that they had indeed reflected false associations (S2 Fig).

After excluding the ambiguous reads, we still found significant GWA peaks for Erysiphales but not for other types of exogenous reads (excluding isolated, unreliable variants) (Fig 3A and S3 Fig). Nevertheless, when clear peaks were visible, regardless of their significance, they were usually located close to genes involved in plant defense response. An enrichment analysis (53) confirmed that stronger variants were indeed enriched close to these defense genes (S8 Table) for some exogenous read counts (Fig 3B and S3 Fig). For M. persicae load there was a peak in the proximity of Tarvense_01930, encoding a predicted pathogenesis-related peptide. The top variant in this peak had a slight but clear allelic effect on M. persicae load (Fig 3C). For Erysiphales load we detected a more persistent enrichment, with a highly significant peak in Scaffold 1, located in a region with several defense genes, including MAJOR LATEX PROTEINS (MLP) and two genes similar to Arabidopsis thaliana SALICYLATE METHYLTRANSFERASE 1 (BSMT1) (Fig 3D and E). This region is wide due to ancient TE colonization, but the top variants are clearly neighboring candidate genes involved in defense (Fig 3E). Other significant peaks for Erysiphales load were close to other genes that possibly contribute to resistance such as PBL7, involved in signaling and stomatal closure or SRF3, reinforcing cell walls by callose deposition.

Genome-wide association analyses for aphid and mildew loads.
We show only the results for *M. persicae* and MG-RAST Erysiphales read-counts; for full results see S3 Fig. (A) Manhattan plots, annotated with genes potentially affecting aphid/mildew colonization. The genome-wide significance (horizontal red line) was calculated based on unlinked variants (54), the blue line corresponds to –log(p) = 5. (B) Corresponding to the Manhattan plots on the left, enrichment of *a priori* candidates and expected false discovery rates (as in (53)) for increasing significance thresholds. (C) Allelic effects of the red-marked variants in the corresponding Manhattan plots, with genotypes on the x-axes and the read-count residuals on the y-axes. (D) The candidate genes marked in panel A, their putative functions and distances to the top variant of the neighboring peak. Candidates in dark blue are the *a priori* candidates included in the enrichment analyses and involved in defense response (GO:0006952). GS: glucosinolates. (E) Zoom-in for the Manhattan plot of Erysiphales load, around the first peak in Scaffold 1, with gene and TE models below, and *a priori* candidates in blue.

Aphid and mildew loads correlate with differential methylation at genes and transposons

Variation in phenotypes, such as our indirect estimates of pest resistance, may not only be associated with DNA sequence but also with epigenetic changes like DNA methylation. This phenotype-associated epigenetic variation can include both heritable and plastic components. The Whole Genome Bisulfite Sequencing (WGBS) data from our previous study (32) allowed us to also explore these questions and to test for associations between DNA methylation variation and pest attack. For simplicity, we limited this analysis to M. persicae and Erysiphales loads.

Our analysis had two steps: First we called Differentially Methylated Regions (DMRs) between the 20 samples with the most and 20 samples with the least M. persicae or Erysiphales loads, and then we conducted Epigenome Wide Association (EWA) analyses on individual positions located within these DMRs, using the complete dataset (188 lines - see Methods). This approach allowed us to target genomic regions of interest, while strongly reducing the multiple-testing problem of millions of cytosines in the whole genome and correcting for population structure. Using a relaxed False Discovery Rate (FDR) of 20%, we identified 162 DMRs for M. persicae load and 548 DMRs for Erysiphales load (S4 Fig, S9 and S10 Table). The majority of these were in the CG context, especially for M. persicae-related DMRs (Fig 4A and S4 Fig). As observed previously (32), DMRs in CHH were generally shorter than in the other sequence contexts (S4A Fig). Since the genome of T. arvense is rich in TEs and intergenic regions, the majority of DMRs were located in those features (S4B Fig). However, the DMR density was higher in proximity of genes and particularly in coding sequences (Fig 4A), and even DMRs assigned as intergenic (Fig 4A) were often located close to genes or promoters. In accordance with previous studies (8,9), most DMRs were hypomethylated in the infested samples (higher pathogen load), indicating that genes needed for defense might be activated through demethylation.

Differential methylation associated with aphid and mildew loads.
(A) Differentially Methylated Region (DMR) densities in different genomic features when comparing the 20 samples with the most vs. the least *M. persicae* (top) or Erysiphales (bottom) load. CDS: coding sequences. (B) Manhattan plots from EWA analyses based on individual cytosines within DMRs, with sequence contexts in different colours and annotation of genes close to low P-value cytosines. The genome-wide Bonferroni significance thresholds (dashed red lines) were calculated based on the number of DMRs. (C) Candidate genes and TEs marked in panel B, their putative functions, genomic locations of associated DMRs, and whether infested samples were hyper- or hypomethylated. (D) Zoomed-in Manhattan plot for Erysiphales load around the peak in Scaffold 4, with gene and TE models given below. The CG methylation in the 20 most and least infested samples was calculated over 50 bp bins (see S5C Fig for methylation in other contexts).

For a more detailed investigation, we turned to EWA, leveraging the power of the entire Thlaspi collection. We tested for associations between M. persicae or Erysiphales loads and the methylation at individual cytosines located within the DMRs. As in GWA, we corrected for population structure using an IBS matrix. For both types of pest loads, we found associations in the proximity of genes and especially within TEs, but no genomic feature was particularly enriched for low P-value associations (S5A Fig). M. persicae load was associated with methylation at several genomic locations, especially TEs (Fig 4B), but these associations had strongly inflated P-values (S5B Fig). For Erysiphales load the P-value distribution was more well-behaved (S5B Fig), and we found a clear association with hypomethylation of Copia family 202 TEs upstream of MAPKK KINASE 20 (MAPKKK20), a gene involved in abscisic acid (ABA) stress response and stomatal closure (Fig 4B, C and D). A coverage analysis confirmed that none of the T. arvense lines carries insertions or deletions of the TEs upstream of MAPKKK20.

Discussion

Plant pests are a major threat to food safety, causing large yield losses, and new crops such as the potential biofuel plant Thlaspi arvense must be able to resist pathogen and herbivore attacks. A powerful source for obtaining resistant varieties is natural variation in plant defenses, but phenotyping large collections can be very time-consuming and error-prone. Here we describe how an unplanned pest infestation in a glasshouse experiment, together with available WGS data, can be used to estimate aphid, mildew and microbial loads, and thus variation in plant resistance. The approach is straightforward, makes use of WGS data without microbiome-specific DNA extraction, and can in principle be applied to many other situations such as field experiments. It is not error-free, but we highlight some potential pitfalls, show how to reduce noise, and illustrate its potential to detect associations with climatic, genetic and epigenetic variation.

An important first step in our analyses was the identification and classification of pest-related reads in the plant WGS data. We began by classifying all reads as target (only mapping to T. arvense), ambiguous (mapping to T. arvense and at the same time to at least one of the pest genomes) or exogenous (not mapping to T. arvense) (Fig 1A). We demonstrated the importance of removing ambiguous reads prior to variant calling, as this prevented calling false positive variants caused by exogenous DNA that also mapped to highly conserved or repetitive sequences in the T. arvense genome. We then classified the exogenous reads using MG-RAST (44,45) or by confident mapping to specific pest genomes, and selected the eleven most relevant and/or abundant taxonomic groups to focus our analyses on. To obtain unbiased pest/microbe loads we also corrected the read-counts for the total number of deduplicated reads of each sample. A competitive mapping approach allowed us to identify the aphid and mildew species that had occurred in our experiment as the generalist aphid Myzus persicae and the powdery mildew Erysiphe cruciferaum (Fig 2).

Since we suspected a non-random colonization of pests and microbes in our T. arvense collection, we tested for population differences as well as SNP-based heritability. We found significant population differences for most pest and microbe loads, and often heritabilities above 15%, which although low, is still indicative of genetic determination (5). Moreover, Erysiphales load had a particularly high heritability of 47% (Table 1). We therefore next asked what could explain the observed variation in pest loads in our experiment. As pathogen abundances in the field are often determined by climatic conditions, we expected plants originating from climates less favorable to aphids to perform worse in our glasshouse, i.e. to have higher pathogen loads. As expected, aphid counts were negatively correlated with temperature of origin (particularly minimum temperature), and positively with temperature variability (Mean Diurnal Range and Temperature Annual Range) (Fig 3A), suggesting that plants from colder and more thermally fluctuating climates, which are less favorable to aphids, were less well defended and performed worse in our glasshouse. We found similar but weaker patterns for Erysiphales load.

As we expected the observed climate-associated variation in pest loads to be at least partially explained by variation in chemical defenses, and since in Arabidopsis thaliana glucosinolates, the main defensive compounds, are known to be geographically structured in response to aphid distributions (1), we also tested for association of aphid and mildew loads with glucosinolates in our collection. In accordance with literature on A. thaliana (55), we observed a positive correlation of aphid loads with total glucosinolates as well as with the most abundant glucosinolate sinigrin (aliphatic glucosinolate), but a negative correlation with benzyl glucosinolates (Fig 3B). These findings suggest that glucosinolate composition, rather than total amount, is important for aphid defense, and that while benzyl glucosinolates might have a deterrent effect, sinigrin might on the contrary attract M. persicae or act as a stimulant, which would be in accordance with previous observations (56).

To detect genetic variants associated with pest and microbe loads, we then conducted a GWA study. For aphid (M. persicae) load, we detected only one non-significant peak in Scaffold 5, close to a pathogenesis-related coding gene (Fig 3A and D). For Erysiphales load, however, there were several significant associations neighboring genes directly involved in defense, mostly members of the MLP family, clustered in a large peak on the first arm of Scaffold 1 (Fig 3E). MLP165, the closest gene to the most significant variant in the peak, is indirectly involved in GS biosynthesis in A. thaliana (57), which might explain why baseline GS levels were associated with Erysiphales load (Fig 3B). Further GWA peaks for Erysiphales pointed towards other genes indirectly involved in the defense response through phytohormone signaling (eg. CAR8, PBL7, GH3.1) or preventing pathogen access through cell wall reinforcement or stomatal closure (SRF3, PBL7) (Fig 3D). Further experiments would be necessary to confirm the functionality of these genes.

An important general insight from our GWA analyses was the frequent ambiguity of reads that mapped to both pest and host plant genome. Such ambiguous reads generated false variants only present in samples with pest DNA, which resulted in highly significant false associations, and it was therefore important to remove these reads before variant calling. Another potential reason for sequence similarity between host and pathogens could be defense mechanisms such as RNA interference. If T. arvense produces small or micro RNAs to silence pathogen genes, this would originate from genomic regions of high similarity between host and pathogen, and thus reflect a true association. However, a BLAST (58) of the region in which the suspicious associations occurred did not reveal any similarity to aphid or mildew genes, but instead to the highly conserved ribosomal RNA coding regions. While genetic variants are generally inherited from parents and thus reflect evolutionary processes, DNA methylation variants can be heritable but can also reflect plastic responses to environmental stresses like herbivores or pathogens. Our data do not allow to confidently distinguish between these two sources of DNA methylation variation, and thus should be interpreted with caution, especially with regard to the directionality of associations. A beneficial DNA methylation variant is expected to be associated with lower pathogen load when already present before pathogen arrival, but with higher pathogen load when plastically induced by pathogens during the experiment. For both M. persicae and Erysiphales, the majority of DMRs were hypomethylated in affected samples, which is in accordance with the loss of methylation observed in A. thaliana and T. arvense upon aphid feeding, and in diploid wheat upon powdery mildew infection (8,9,31), but we also detected hypermethylation at several loci. M. persicae load was associated with differential methylation at only few genes but several TEs, which is in accordance with the aphid or stress-induced TE reactivation observed in A. thaliana (9,14). Erysiphales load was associated with hypomethylated Copia TEs upstream of MAPKKK20, a gene involved in ABA-mediated signaling and stomatal closure. Since stomatal closure is a known defense mechanism to block pathogen access (21), it is tempting to conclude that hypomethylation of the MAPKKK20 promoter might induce its overexpression and consequent stomatal closure, thereby preventing mildew access to the leaf blade. Overall, we found associations between pathogen load and TE methylation that could potentially act both in cis (eg. Copia TE methylation in MAPKKK20 promoter) and in trans, for example through transposon reactivation (eg. LINE, Helitron and Ty3/Gypsi TEs isolated from genes). Although we cannot confidently distinguish inherited versus induced DNA methylation variants, to our knowledge this is the first coupled GWA - EWA analysis conducted on a large natural plant collection.

In summary, our study offers first insights into the defense mechanisms of Thlaspi arvense, including candidate genes and alleles which may be of interest for breeding efforts in this novel biofuel and cover crop. It also provides a proof of principle that exogenous reads from large sequencing efforts, usually discarded if not mapping to the target genome, can be leveraged to extract additional information about important biotic interactions of the target species, including its antagonists and microbiome components. We combined this approach with data from a common environment experiment to show that pest and microbiome load were geographically structured, as expected from locally adapted traits, and associated with both genetic and DNA methylation variants. In principle, our approach can be applied to many other designs. For example, field-collected samples could be used to quantify geographic pathogen distributions. With the decreasing cost of sequencing and increasing large-scale and single-species sequencing projects e.g. (37–43), the number of data-sets suitable for such analyses is set to rapidly increase in the near future.

Materials and Methods

Plant growth and sequencing

The WGS data used in this study were already published in Galanti et al. (2022) (32). Please refer to this publication for details on data generation and methods. Briefly, we collected 207 Thlaspi arvense accessions from 36 European populations in July 2018, and we grew their offspring in a glasshouse at University of Tübingen (48°32’21.3”N, 9°02’04.2”E) between August and October 2019. The glasshouse was located in biodiverse surroundings, and insects and pests could enter when the windows opened for temperature regulation. A few weeks after germination, we noticed aphid and mildew infestations. After 46 d we sampled the 3^rd or 4^th true leaf of each plant and snap-froze it in liquid nitrogen. We extracted DNA using the DNeasy Plant Mini Kit (Qiagen, Hilden, DE), sonicated (Covaris) 300 ng of genomic DNA and used the NEBNext Ultra II DNA Library Prep Kit for Illumina (New England Biolabs) to prepare the libraries. Half way through the protocol we split the DNA into 1/3 for genomic libraries and 2/3 for bisulfite libraries. For the bisulfite conversion we used the EZ-96 DNA Methylation-Gold MagPrep (ZYMO) kit. We sequenced paired-end for 150 cycles using Illumina NovaSeq 6000 (Illumina, San Diego, CA) for genomic libraries and HiSeq X Ten (Illumina, San Diego, CA) for bisulfite libraries.

Reads mapping and classification

Upon demultiplexing the raw reads, we used cutadapt (59) for quality (minimum quality of 20) and adaptor trimming, excluding reads shorter than 35 bp. We used FastQC and MultiQC (60,61) to estimate the duplication rate, and calculated the total deduplicated reads, which we later used for correcting the number of exogenous reads. We then classified the reads based on their mapping behavior. First we aligned reads to the T. arvense reference genome (29) with BWA-MEM v0.7.17 (62), excluding multimapping reads (-c 1) and marking duplicates with MarkDuplicatesSpark (63,64). We then mapped all samples again (62) to the three putative exogenous genomes of pea aphid (Acrophyson pisum), the aphid symbiont Buchnera aphidicola and powdery mildew (Blumeria graminis), using available resources (www.ncbi.nlm.nih.gov/assembly/GCF_005508785.2, www.ncbi.nlm.nih.gov/assembly/GCA_001939165.1) (65). After this, we used a custom script to collect all read IDs within a sample mapping to any of the three exogenous genomes, and removed any of these reads from the T. arvense alignment bam files. We thus removed all ambiguous reads before proceeding with variant calling. To compare coverage of specific regions with and without ambiguous reads, we used samtools bedcov (67). The numbers of reads classified by their mapping behaviour are reported in S1 Table.

Variant calling

For variant calling we used GATK4 v4.1.8.1 (63,64), following the best practices for germline short variant discovery (https://gatk.broadinstitute.org/hc/en-us/articles/360035535932-Germline-short-variant-discovery-SNPs-Indels-) with few adjustments for large datasets (32). Briefly, starting from the bam files generated after the removal of ambiguous reads, we i) ran HaplotypeCaller, ii) combined the resulting GVCF files with GenomicsDBImport and GenotypeGVCFs and iii) filtered out low quality variants with VariantFiltration (see Galanti et al. (2022) (32) for more details). Finally, we used vcftools v0.1.16 (67) to retain biallelic variants with MAF > 0.01 and a maximum of 10% missing genotype calls. We imputed these missing calls with BEAGLE 5.1 (68) to obtain a complete multisample vcf file.

Identification and classification of exogenous reads

To identify exogenous reads, we extracted all unmapped reads from the bam files created aligning WGS reads to the T. arvense genome (29). We selected reads with both mates unmapped (SAM flag 12) and excluded supplementary alignments (SAM flag 256 after running MarkDuplicatesSpark) with samtools (65). We then recovered these reads from the trimmed fastq files with seqtk subset (https://github.com/lh3/seqtk) to obtain fastq files of unmapped reads only. We used these as input for MG-RAST (44,45), a web-based tool for phylogenetic analysis of metagenomes.

We ran MG-RAST mostly with default parameters, without assembled reads, excluding dereplicated sequences, and dynamically trimming reads with a minimum Phred score of 15 in more than 5 consecutive bases. We set the “sequence screening” to A. thaliana, the closest relative of T. arvense available. We used two different approaches to extract read counts. First we classified all reads up to family level using the web-based Analysis tool from MG-RAST. We used RefSeq as query annotation database and filtered reads classified with low confidence using default settings: e-value 5, 60 %- identity, length 15 and min.abundance of 1 (S2 Table). Out of the hundreds of taxonomic groups identified by MG-RAST, we selected only a small subset for follow-up analyses, based on their biological relevance, our visual observations and/or abundance: Aphididae, Culicidae, Peronosporales, Staphylocaccaceae, Burkholderiaceae, Mycobacteriaceae and Pseudomonadaceae (Table 1). Additionally, we used a custom Python script to download individual “taxonomy” or “sequence_breakdown” results from MG-RAST API (69) and extracted the counts of the genus Buchnera, including bacterial symbionts of many aphid species, and of the order Erysiphales, to quantify the observed mildew infection (Table 1). All the code for extracting counts for all families or specific taxonomic groups are available on GitHub (https://github.com/junhee-jung/MG-RAST-read-counter).

In addition to the nine read groups selected from MG-RAST results, we also performed a highly confident mapping of exogenous reads to the M. persicae and B. aphidicola genome assemblies (50) (www.ncbi.nlm.nih.gov/assembly/GCA_001939165.1), to test whether mapping to a high quality assembly of the exact pathogen has a higher sensitivity than MG-RAST. We mapped with BWA-MEM v0.7.17 (61), using a seed length of 25 bp (70) and removing reads with MAPQ < 20 and duplicates with MarkDuplicatesSpark (63,64). We then counted all reads in the bam files.

Finally, we log transformed all read counts to approximate normality, and corrected for the total number of deduplicated reads by extracting residuals from the following linear model, log(read_count + 1) ~ log(deduplicated_reads), which allowed us to quantify non-Thlaspi loads, correcting for the sequencing depth of each sample.

Exogenous reads heritability and species identification

To exclude the possibility that aphid and mildew infestation patterns were carried-over from the field, through seed contamination or maternal effects, we used aphid and mildew presence/absence data collected in the field. We found no difference in aphid or mildew loads between samples with and without aphids or mildew on the original parental plant in the field (S1 Fig). Nevertheless, to exclude a possible bias, we excluded one outlier sample with particularly high aphid load and aphids observed in the field (S1 Fig) from the analyses.

To test for variation between populations we used a general linear model with population as a predictor. To measure SNP-based heritability, i.e. the proportion of variance explained by kinship, we used the marker_h2() function from the R package heritability (51), which uses a genetic distance matrix as predictor to compute REML-estimates of the genetic and residual variance. We used the same IBS matrix as for GWAS and for the correlations with climatic variables.

Even though MG-RAST classifies reads based on all taxonomic ranks, the accuracy of species identification of course strongly depends on the sequences available in the query databases. MG-RAST assigned our aphid reads to A. pisum, but this did not fit with our visual observations and with the poor performance of this species on Brassicaceae (71). We therefore selected three plausible aphid species and test which of these had mostly likely attacked our experiment. In addition to A. pisum, we tested two other aphid species commonly attacking Brassicaceae: Brevicoryne brassicae and Myzus persicae. While not all three species have reference genomes available, all mitochondrial genomes are available on NCBI (72) under accession numbers MN232006, NC_011594 and NC_056270. We downloaded these sequences, aligned them to each other (73), removed INDELs to retain only SNPs and combined them into a single pseudo-reference fasta file (S3 Table). We then mapped the exogenous reads from 40 randomly selected samples to this pseudo-reference, allowing for unique mappings only and counted the reads mapping to either of the three aphid species. We used the same approach for mildew except that we included only two possible species: Blumeria graminis, as suggested by MG-RAST, and Erysiphe cruciferarum which is known to attack Brassicaceae but was not in the MG-RAST query database and seemed plausible from visual inspection (Fig 2B). For the mildew pseudo-reference (S4 Table) we used the Internal Transcribed Spacer (ITS), which is publicly available for both species on NCBI (71) under accession numbers MT644878 and AF031283.

Quantification of glucosinolates

Using seed material collected from the sequenced plants, we conducted a follow-up experiment to estimate the glucosinolates (GS) content of all 207 lines in the absence of pathogens. Briefly, we sowed the seeds in petri dishes, stratified them at 4°C in the dark for two weeks and transplanted the germinated seedlings to individual 9 x 9 cm pots. We grew the plants in a growth chamber with a 14/10 h light/dark cycle at 21/17 °C and a relative humidity of ~45%. Two weeks after germination the plants were vernalized at 4°C for two more weeks in order to minimize phenological and developmental differences between winter and summer annuals. Ten days after vernalization, we collected the 3^rd or 4^th true leaf and snap-froze it in liquid nitrogen. After freeze drying, we weighed all samples and extracted the material threefold in 80% methanol, adding p-hydroxybenzl glucosinolate (Phytoplan, Heidelberg, Germany) as internal standard. After centrifugation, we applied the supernatants onto ion-exchange columns with diethylaminoethyl (DEAE) Sephadex A25 (Sigma Aldrich, St.Louis, MO, USA) in 0.5 M acetic acid buffer, pH 5. We added purified sulfatase, converting glucosinolates to desulfo glucosinolates. After one day, we eluted desulfo glucosinolates in water and analyzed them on a HPLC coupled to a DAD detector (HPLC-1200 Series, Agilent Trechnologies, Inc., Santa Clara, CA, USA) equipped with a Supelcosil LC 18 column (3 μm, 150×3 mm, Supelco, Bellefonte, PA, USA). We analysed the samples with a gradient from water to methanol starting at 5% methanol for 6 min and then increased from 5 to 95% within 13 min with a hold at 95% for 2 min, followed by a column equilibration cycle. We identified different glucosinolates based on their retention times and UV spectra in comparison to respective standards and an in-house database. We integrated peaks at 229 nm and calculated respective glucosinolate concentrations in relation to the internal standard and sample dry mass, using response factors as reported by Agerbirk et al. (2015) (74).

Drivers of exogenous reads variation

To test for associations between glucosinolate variation, as well as climate of origin, and the observed pest loads, we extracted average bioclimatic variables for the 25 years predating our experiment for our 36 study populations from the Copernicus website (75), as described in Galanti et al. 2022 (32). We then used the R package lme4qtl (76) to run mixed models that included either bioclimatic variables or glucosinolate contents as explanatory variables, and the exogenous read counts as dependent variables, while correcting for population structure with the same IBS matrix as in GWA and EWA analyses (see below).

GWA analysis

We conducted GWA with mixed models that corrected for population structure with a genetic Isolation By State (IBS) matrix as a random factor, as implemented in GEMMA (77). To obtain the IBS matrix we used PLINK v 1.90b6.12 (78). Starting from the imputed multisample vcf file obtained from variant calling, we pruned variants with LD > 0.8 in 50 variants windows, sliding by five. To produce the genetic variants used for GWAS, we also started from the imputed multisample vcf file from variant calling and filtered out variants with MAF < 0.04. As phenotypes we used the number of exogenous reads corrected for the total number of deduplicated reads, as described above. To validate our results and test for overlap with existing gene functional annotations, we performed an enrichment analysis of variants neighboring a priori candidate genes as described in Atwell et al. (2010) (53). Briefly, we attributed a priori candidate status to all variants located within 20 kb from orthologs (79) of A. thaliana genes annotated with the GO term “defense response” (GO:0006952), including nine genes similar to AtBSMT1 (S8 Table). We then calculated enrichment for incremental −log(p) thresholds as the ratio between observed frequency (significant a priori candidate / significant variants) and background frequency (total a priori candidate / total variants), and an upper bound for the FDR (32,53). We further assessed the significance of the enrichment through a previously established genome rotation scheme (80). Briefly, we calculated a null distribution of enrichments by randomly rotating the P-values and a priori candidate status of the genetic variants within each chromosome for 1M permutations. We then assessed significance by comparing the observed enrichment at the Bonferroni threshold to the null distribution. The code for these analyses is available on https://github.com/Dario-Galanti/multipheno_GWAS/tree/main/gemmaGWAS.

Methylation and DMR calling

For the methylome analyses we used the EpiDiverse toolkit (81), specifically designed for large WGBS datasets. We used the WGBS pipeline (https://github.com/EpiDiverse/wgbs) for read mapping and methylation calling, retained only uniquely-mapping reads longer than 30 bp, and obtained individual-sample bedGraph files for each sequence context. We then called DMRs using the DMR pipeline (79), with a minimum coverage of 4x. We compared the 20 samples with the most and the least M. periscae and Eriysiphales loads, resulting in two sets of DMRs for each sequence context. Since this was only the first step of our methylation analysis, meant to identify potential regions of interest, we retained all DMRs with a FDR < 20%. To understand the genomic preferences of DMRs, we intersected them with genomic features and calculated their densities in each by dividing their number by the total Mb covered by each genomic feature in the genome.

EWA analysis

Following the DMR calling, we investigated methylation-phenotype relationships in more detail, using EWA. We ran EWA similarly to GWA, enabling the “-notsnp” option available in GEMMA (76), and correcting for population structure with the same IBS matrix. To exclude possible biases, we excluded all samples with a bisulfite non-conversion rate >1 (32), which left 188 samples for analysis. To generate the methylation input files we first used custom scripts (https://github.com/Dario-Galanti/WGBS_downstream/tree/main/WGBS_simpleworkflow) (32) to unite individual-sample bedGraph files into unionbed files and retain positions with coverage > 3 in at least 95% of the samples and a methylation difference of at least 5% in at least two samples. We then intersected the unionbed files with the DMRs of the corresponding sequence context using bedtools (82) and converted unionbed to BIMBAM format as input for GEMMA.

We ran EWA for individual positions within the DMRs and calculated Bonferroni thresholds based on the number of DMRs, assuming that cytosines within the same DMR are mostly autocorrelated. To observe in which genomic features associations with lower P-values were located, we performed enrichment analyses similar to the ones performed for defense a priori candidate genes in GWA (53), but based on whole genomic features. Starting from all cytosines used for EWAS, we calculated the background frequency as the fraction of all cytosines located in each genomic feature and then calculated the observed frequency in the same way for −log(p) 0.5 increments, with enrichment as the ratio of observed and expected frequencies. All code used for EWA and the enrichment analysis in genomic features is available on https://github.com/Dario-Galanti/EWAS/tree/main/gemmaEWAS.

Code and data availability statement

The seed material from the sequenced lines is available at the Nottingham Arabidopsis Stock Centre (NASC) under stock numbers N950001 to 950204. Genomic and bisulfite sequencing raw data are available on the ENA Sequence Read Archive (www.ebi.ac.uk/ena/) under study accession number PRJEB50950. The reference genome and annotations were previously published by Nunn et al. (2022) (29). GWA and EWA results in a format compatible with the Integrative Genomics Viewer (https://www.igv.org/) are available on Zenodo (https://zenodo.org/records/10011535).

All code used in this study is available and documented on GitHub. The scripts for variant calling, filtering and imputation are on https://github.com/Dario-Galanti/BinAC_varcalling, and the scripts for the classification of sequencing reads and MG-RAST analysis are in https://github.com/Dario-Galanti/Exoreads_treasure and https://github.com/junhee-jung/MG-RAST-read-counter respectively. The pipelines for methylation and DMR calling from WGBS data can be found on the EpiDiverse GitHub (https://github.com/EpiDiverse). The workflow for downstream analysis of methylation data is on https://github.com/Dario-Galanti/WGBS_downstream/tree/main/WGBS_simpleworkflow. Finally, the scripts for running GWA and EWA analysis are on https://github.com/Dario-Galanti/multipheno_GWAS/tree/main/gemmaGWAS and https://github.com/Dario-Galanti/EWAS/tree/main/gemmaEWAS respectively.

Funding

This work was supported by the EU Horizon 2020 program under Marie Skłodowska-Curie grant agreement 764965 (Innovative Training Network EpiDiverse; https://epidiverse.eu; PhD fellowship of DG), as well as by the Deutsche Forschungsgemeinschaft (DFG) as part of the priority programme 2125 Deconstruction and Reconstruction of the Plant Microbiota, “DECRyPT” (grant 401829393 to OB; PhD fellowship of JHJ). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Supporting information

S1 Fig. Pest loads in samples with or without pests in the field. Offspring from plants with or without pathogens in the field did not differ (n.s.) in pathogen loads in the glasshouse, suggesting the pathogens were not carried with the seeds. A single outlier (TA_NL_03_04) with aphids in the field also showed high levels of aphid load in the glasshouse and was excluded from the analysis.

S2 Fig. Example of a GWA peak caused by ambiguous reads. (A) Manhattan plot of GWA with aphid load, without removing ambiguous reads prior to variant calling. The genome-wide significance (horizontal red line) was calculated based on unlinked variants (54), the suggestive line (blue) corresponds to −log(p) =5. (B) Zoomed-in peak on Scaffold 6 with coverage before and after removal of ambiguous reads.

S3 Fig. GWA results for all exogenous reads. Manhattan plots, enrichment of a priori candidate variants and QQplots for all exogenous reads. The genome-wide significance (horizontal red lines), was calculated based on unlinked variants (53), the blue line corresponds to −log(p) =5. The top variants are labeled with the neighboring genes potentially affecting exogenous reads load.

S4 Fig. Lengths and genomic locations of Differentially Methylated Regions (DMRs). (A) DMRs length distributions in different sequence contexts. (B) Numbers of hypermethylated and hypomethylated DMRs in each genomic feature.

S5 Fig. EWA enrichment in different genomic features and P-value distributions. (A) Enrichment of associations in different genomic features for increasing −log(p) thresholds, compared to expectations from all cytosines. (B) P-value distributions of EWA. (C) Complement to Fig 4D showing CHG and CHH methylation upstream and above the gene MPKKK20.

S1 Table. Classification of reads. Total reads, duplication rates, deduplicated raw reads, target, ambiguous and exogenous reads and MG-RAST reads passing QC.

S2 Table. MG-RAST classification of exogenous reads. Results of MG-RAST classification of exogenous reads at the family level, using RefSeq as query database.

S3 Table. Competitive mapping pseudo-reference for aphid identification. Fasta file combining mitochondrial sequences of B. Brassicaceae, A. pisum and M. persicae, with structural variants removed.

S4 Table. Competitive mapping pseudo-reference for mildew identification. Fasta file combining ITS sequences of Blumeria graminis and Erysiphe cruciferarum, with structural variants removed.

S5 Table. Results of competitive mapping for pest identification. Number of reads mapping uniquely to the pseudo-reference genomes of different aphid (mitochondrial DNA for either B. Brassicaceae, A. pisum or M. persicae) or mildew species (ITS sequence for either B. graminis or E. cruciferaum).

S6 Table. Exogenous reads used for downstream analyses. Classes of exogenous reads used in the analyses, including the nine groups from MG-RAST and two from mapping to the M. persicae and B. aphidicola reference genomes.

S7 Table. Quantification of glucosinolates. Glucosinolate concentrations in leaves of T. arvense, obtained from offspring of the sequenced plants, not infested by any herbivores nor pathogens.

S8 Table. A priori “defense response” candidate genes used for the GWA enrichment analysis. List of T. arvense candidate genes used for the GWA enrichment analysis: orthologs of A. thaliana genes annotated with the GO term “defense response”.

S9 Table. S9 Table. Differentially Methylated Regions based on M. persicae load. List of DMRs called between the 20 samples with the highest and the lowest M. persicae load.

S10 Table. Differentially Methylated Regions based on Erysiphales load. List of DMRs called between the 20 samples with the highest and the lowest Erysiphales load.

Acknowledgements

We thank the EpiDiverse network for its amazing support and discussions, in particular Adrián Contreras-Garrido, Bárbara Díez Rodríguez, Iris Sammarco, Adam Nunn, Daniela Ramos and Anupoma Troyee for their close collaboration. We also thank Frank Reis for his feedback and expert tips for the project, and Cecilia Heyworth for proofreading the manuscript. We thank Peter Stadler at the University of Leipzig and David Langenberger from ecSeq, which helped with computing and hosted the EpiDiverse servers. The BinAC cluster is managed by the High Performance and Cloud Computing Group at the Zentrum für Datenverarbeitung of the University of Tübingen, and supported by the state of Baden-Württemberg through bwHPC and the German Research Foundation (DFG) through grant no INST 37/935-1 FUGG.

Author Contributions

Dario Galanti: Conceptualization, Methodology, Formal analysis, Investigation, Data Curation, Writing – Original Draft Preparation, Visualization.

Jun Hee Jung: Conceptualization, Methodology, Data Curation, Writing – Review & Editing. Caroline Müller: Investigation, Writing – Review & Editing.

Oliver Bossdorf: Conceptualization, Methodology, Writing – Original Draft Preparation, Supervision, Funding Acquisition.

References

1.
1. Züst T
2. Heichinger C
3. Grossniklaus U
4. Harrington R
5. Kliebenstein DJ
6. Turnbull LA
2012Natural Enemies Drive Geographic Variation in Plant DefensesScience 338:116–9https://www.science.org/doi/full/10.1126/science.1226397 Google Scholar
2.
1. Kerwin R
2. Feusier J
3. Corwin J
4. Rubin M
5. Lin C
6. Muok A
7. et al.
2015Natural genetic variation in Arabidopsis thaliana defense metabolism genes modulates field fitness. Kant MR, editoreLife 4:e05604https://doi.org/10.7554/eLife.05604 Google Scholar
3.
1. Chan EKF
2. Rowe HC
3. Kliebenstein DJ
2010Understanding the Evolution of Defense Metabolites in Arabidopsis thaliana Using Genome-wide Association MappingGenetics 185:991–1007https://doi.org/10.1534/genetics.109.108522 Google Scholar
4.
1. Corwin JA
2. Copeland D
3. Feusier J
4. Subedy A
5. Eshbaugh R
6. Palmer C
7. et al.
2016The Quantitative Basis of the Arabidopsis Innate Immune System to Endemic Pathogens Depends on Pathogen GeneticsPLOS Genet 12:e1005789https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1005789 Google Scholar
5.
1. Thoen MPM
2. Olivas NHD
3. Kloth KJ
4. Coolen S
5. Huang PP
6. Aarts MGM
7. et al.
2017Genetic architecture of plant stress resistance: multi-trait genome-wide association mappingNew Phytol 213:1346–62https://nph.onlinelibrary.wiley.com/doi/abs/10.1111/nph.14220 Google Scholar
6.
1. Hanson AA
2. Lorenz AJ
3. Hesler LS
4. Bhusal SJ
5. Bansal R
6. Michel AP
7. et al.
2018Genome-Wide Association Mapping of Host-Plant Resistance to Soybean AphidPlant Genome 11:180011https://acsess.onlinelibrary.wiley.com/doi/abs/10.3835/plantgenome2018.02.0011 Google Scholar
7.
1. Jaouannet M
2. Morris JA
3. Hedley PE
4. Bos JIB
2015Characterization of Arabidopsis Transcriptional Responses to Different Aphid Species Reveals Genes that Contribute to Host Susceptibility and Non-host ResistancePLOS Pathog 11:e1004918https://journals.plos.org/plospathogens/article?id=10.1371/journal.ppat.1004918 Google Scholar
8.
1. Geng S
2. Kong X
3. Song G
4. Jia M
5. Guan J
6. Wang F
7. et al.
2019DNA methylation dynamics during the interaction of wheat progenitor Aegilops tauschii with the obligate biotrophic fungus Blumeria graminis f. sp. triticiNew Phytol 221:1023–35https://onlinelibrary.wiley.com/doi/abs/10.1111/nph.15432 Google Scholar
9.
1. Annacondia ML
2. Markovic D
3. Reig-Valiente JL
4. Scaltsoyiannes V
5. Pieterse CMJ
6. Ninkovic V
7. et al.
2021Aphid feeding induces the relaxation of epigenetic control and the associated regulation of the defense response in ArabidopsisNew Phytol 230:1185–200https://nph.onlinelibrary.wiley.com/doi/abs/10.1111/nph.17226 Google Scholar
10.
1. Kinoshita T
2. Seki M
2014Epigenetic Memory for Stress Response and Adaptation in PlantsPlant Cell Physiol 55:1859–63https://academic.oup.com/pcp/article/55/11/1859/2756009 Google Scholar
11.
1. Espinas NA
2. Saze H
3. Saijo Y
2016Epigenetic Control of Defense Signaling and Priming in PlantsFront Plant Sci 7:1201https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4980392/Google Scholar
12.
1. Lämke J
2. Bäurle I
2017Epigenetic and chromatin-based mechanisms in environmental stress adaptation and stress memory in plantsGenome Biol 18:124https://doi.org/10.1186/s13059-017-1263-6 Google Scholar
13.
1. He Y
2. Li Z
2018Epigenetic Environmental Memories in Plants: Establishment, Maintenance, and ReprogrammingTrends Genet http://www.sciencedirect.com/science/article/pii/S0168952518301276 Google Scholar
14.
1. Roquis D
2. Robertson M
3. Yu L
4. Thieme M
5. Julkowska M
6. Bucher E
2021Genomic impact of stress-induced transposable element mobility in ArabidopsisNucleic Acids Res 49:10431–47https://doi.org/10.1093/nar/gkab828 Google Scholar
15.
1. Law JA
2. Jacobsen SE
2010Establishing, maintaining and modifying DNA methylation patterns in plants and animalsNat Rev Genet 11:204–20https://www.nature.com/articles/nrg2719 Google Scholar
16.
1. Zhang H
2. Lang Z
3. Zhu JK
2018Dynamics and function of DNA methylation in plantsNat Rev Mol Cell Biol 1https://www.nature.com/articles/s41580-018-0016-z Google Scholar
17.
1. Liu J
2. He Z
2020Small DNA Methylation, Big Player in Plant Abiotic Stress Responses and MemoryFront Plant Sci 11https://www.frontiersin.org/article/10.3389/fpls.2020.595603 Google Scholar
18.
1. Wojtaszek P
1997Oxidative burst: an early plant response to pathogen infectionBiochem J 322:681–92https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1218243/Google Scholar
19.
1. War AR
2. Paulraj MG
3. Ahmad T
4. Buhroo AA
5. Hussain B
6. Ignacimuthu S
7. et al.
2012Mechanisms of plant defense against insect herbivoresPlant Signal Behav 7:1306–20https://doi.org/10.4161/psb.21663 Google Scholar
20.
1. Kant MR
2. Jonckheere W
3. Knegt B
4. Lemos F
5. Liu J
6. Schimmel BCJ
7. et al.
2015Mechanisms and ecological consequences of plant defence induction and suppression in herbivore communitiesAnn Bot 115:1015–51https://doi.org/10.1093/aob/mcv054 Google Scholar
21.
1. Melotto M
2. Zhang L
3. Oblessuc PR
4. He SY
2017Stomatal Defense a Decade LaterPlant Physiol 174:561–71https://doi.org/10.1104/pp.16.01853 Google Scholar
22.
1. Muhammad T
2. Zhang F
3. Zhang Y
4. Liang Y
2019RNA Interference: A Natural Immune System of Plants to Counteract Biotic StressorsCells 8:38https://www.mdpi.com/2073-4409/8/1/38 Google Scholar
23.
1. Kutyniok M
2. Müller C
2012Crosstalk between above- and belowground herbivores is mediated by minute metabolic responses of the host Arabidopsis thalianaJ Exp Bot 63:6199–210https://doi.org/10.1093/jxb/ers274 Google Scholar
24.
1. Nalam V
2. Louis J
3. Patel M
4. Shah J
2018Arabidopsis-Green Peach Aphid Interaction: Rearing the Insect, No-choice and Fecundity Assays, and Electrical Penetration Graph Technique to Study Insect Feeding BehaviorBio-Protoc 8:e2950Google Scholar
25.
1. Sangiovanni M
2. Granata I
3. Thind AS
4. Guarracino MR
2019From trash to treasure: detecting unexpected contamination in unmapped NGS dataBMC Bioinformatics 20:168https://doi.org/10.1186/s12859-019-2684-x Google Scholar
26.
1. Roman-Reyna V
2. Pinili D
3. Borja FN
4. Quibod IL
5. Groen SC
6. Alexandrov N
7. et al.
2020Characterization of the Leaf Microbiome from Whole-Genome Sequencing Data of the 3000 Rice Genomes ProjectRice 13:72https://doi.org/10.1186/s12284-020-00432-1 Google Scholar
27.
1. Gathercole LAP
2. Nocchi G
3. Brown N
4. Coker TLR
5. Plumb WJ
6. Stocks JJ
7. et al.
2021Evidence for the Widespread Occurrence of Bacteria Implicated in Acute Oak Decline from Incidental Genetic SamplingForests 12:1683https://www.mdpi.com/1999-4907/12/12/1683 Google Scholar
28.
1. Geng Y
2. Guan Y
3. Qiong L
4. Lu S
5. An M
6. Crabbe MJC
7. et al.
2021Genomic analysis of field pennycress (Thlaspi arvense) provides insights into mechanisms of adaptation to high elevationBMC Biol 19:143https://doi.org/10.1186/s12915-021-01079-0 Google Scholar
29.
1. Nunn A
2. Rodríguez-Arévalo I
3. Tandukar Z
4. Frels K
5. Contreras-Garrido A
6. Carbonell-Bejerano P
7. et al.
2022Chromosome-level Thlaspi arvense genome provides new tools for translational research and for a newly domesticated cash cover crop of the cooler climatesPlant Biotechnol J https://onlinelibrary.wiley.com/doi/abs/10.1111/pbi.13775 Google Scholar
30.
1. Hu Y
2. Wu X
3. Jin G
4. Peng J
5. Leng R
6. Li L
7. et al.
2022Rapid Genome Evolution and Adaptation of Thlaspi arvense Mediated by Recurrent RNA-Based and Tandem Gene DuplicationsFront Plant Sci 12https://www.frontiersin.org/articles/10.3389/fpls.2021.772655 Google Scholar
31.
1. Troyee AN
2. Medrano M
3. Müller C
4. Alonso C
2022Variation in DNA methylation and response to short-term herbivory in Thlaspi arvenseFlora 293:152106https://www.sciencedirect.com/science/article/pii/S0367253022001037 Google Scholar
32.
1. Galanti D
2. Ramos-Cruz D
3. Nunn A
4. Rodríguez-Arévalo I
5. Scheepens JF
6. Becker C
7. et al.
2022Genetic and environmental drivers of large-scale epigenetic variation in Thlaspi arvensePLOS Genet 18:e1010452https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1010452 Google Scholar
33.
1. Dorn KM
2. Fankhauser JD
3. Wyse DL
4. Marks MD
2015A draft genome of field pennycress (Thlaspi arvense) provides tools for the domestication of a new winter biofuel cropDNA Res 22:121–31https://academic.oup.com/dnaresearch/article/22/2/121/332636 Google Scholar
34.
1. Frels K
2. Chopra R
3. Dorn KM
4. Wyse DL
5. Marks MD
6. Anderson JA
2019Genetic Diversity of Field Pennycress (Thlaspi arvense) Reveals Untapped Variability and Paths Toward Selection for DomesticationAgronomy 9:302https://www.mdpi.com/2073-4395/9/6/302 Google Scholar
35.
1. Chopra R
2. Johnson EB
3. Emenecker R
4. Cahoon EB
5. Lyons J
6. Kliebenstein DJ
7. et al.
2019Progress toward the identification and stacking of crucial domestication traits in pennycressPlant Biology http://biorxiv.org/lookup/doi/10.1101/609990 Google Scholar
36.
1. Zhao R
2. Yang X
3. Li M
4. Peng X
5. Wei M
6. Zhang X
7. et al.
2021Biodiesel preparation from Thlaspi arvense L. seed oil utilizing a novel ionic liquid core-shell magnetic catalystInd Crops Prod 162:113316https://www.sciencedirect.com/science/article/pii/S0926669021000807 Google Scholar
37.
1. Kajiya-Kanegae H
2. Nagasaki H
3. Kaga A
4. Hirano K
5. Ogiso-Tanaka E
6. Matsuoka M
7. et al.
2021Whole-genome sequence diversity and association analysis of 198 soybean accessions in mini-core collectionsDNA Res 28:dsaa032https://doi.org/10.1093/dnares/dsaa032 Google Scholar
38.
1. Colgan TJ
2. Arce AN
3. Gill RJ
4. Ramos Rodrigues A
5. Kanteh A
6. Duncan EJ
7. et al.
2022Genomic Signatures of Recent Adaptation in a Wild BumblebeeMol Biol Evol 39:msab366https://doi.org/10.1093/molbev/msab366 Google Scholar
39.
1. Habyarimana E
2. Gorthy S
3. Baloch FS
4. Ercisli S
5. Chung G
2022Whole-genome resequencing of Sorghum bicolor and S. bicolor × S. halepense lines provides new insights for improving plant agroecological characteristicsSci Rep 12:5556https://www.nature.com/articles/s41598-022-09433-0 Google Scholar
40.
1. Mekbib Y
2. Tesfaye K
3. Dong X
4. Saina JK
5. Hu GW
6. Wang QF
2022Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical patternBMC Plant Biol 22:69https://doi.org/10.1186/s12870-022-03449-4 Google Scholar
41.
1. Metheringham CL
2. Plumb WJ
3. Stocks JJ
4. Kelly LJ
5. Gorriz MN
6. Moat J
7. et al.
2022Rapid polygenic adaptation in a wild population of ash trees under a novel fungal epidemicbioRxiv :2022.08.01.502033https://www.biorxiv.org/content/10.1101/2022.08.01.502033v3 Google Scholar
42.
1. Nocchi G
2. Brown N
3. Coker TLR
4. Plumb WJ
5. Stocks JJ
6. Denman S
7. et al.
2022Genomic structure and diversity of oak populations in British parklandsPLANTS PEOPLE PLANET 4:167–81https://onlinelibrary.wiley.com/doi/abs/10.1002/ppp3.10229 Google Scholar
43.
1. Friis G
2. Smith EG
3. Lovelock CE
4. Ortega A
5. Marshell A
6. Duarte CM
7. et al.
2024Rapid diversification of grey mangroves (Avicennia marina) driven by geographic isolation and extreme environmental conditions in the Arabian PeninsulaMol Ecol 33:e17260https://onlinelibrary.wiley.com/doi/full/10.1111/mec.17260 Google Scholar
44.
1. Meyer F
2. Paarmann D
3. D’Souza M
4. Olson R
5. Glass E
6. Kubal M
7. et al.
2008The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomesBMC Bioinformatics 9:386https://doi.org/10.1186/1471-2105-9-386 Google Scholar
45.
1. Keegan KP
2. Glass EM
3. Meyer F
2016MG-RAST, a Metagenomics Service for Analysis of Microbial Community Structure and FunctionMethods Mol Biol Clifton NJ 1399:207–33Google Scholar
46.
2021CABI. Myzus persicae (green peach aphid). CABI Compend. 2021 Dec 18 [cited 2023 Apr 7];CABI Compendium:35642. https://www.cabidigitallibrary.org/doi/10.1079/cabicompendium.35642https://www.cabidigitallibrary.org/doi/10.1079/cabicompendium.35642
47.
1. Gabryś B
2. Pawluk M
1999Acceptability of different species of Brassicaceae as hosts for the cabbage aphidEntomol Exp Appl 91:105–9https://onlinelibrary.wiley.com/doi/abs/10.1046/j.1570-7458.1999.00471.x Google Scholar
48.
1. Warwick SI
2. Francis A
3. Susko DJ
2002The biology of Canadian weeds. 9. Thlaspi arvense L. (updated)Can J Plant Sci 82:803http://www.nrcresearchpress.com/doi/abs/10.4141/P01-159 Google Scholar
49.
1. Feuerborn TR
2. Palkopoulou E
3. van der Valk T
4. von Seth J
5. Munters AR
6. Pečnerová P
7. et al.
2020Competitive mapping allows for the identification and exclusion of human DNA contamination in ancient faunal genomic datasetsBMC Genomics 21:844https://doi.org/10.1186/s12864-020-07229-y Google Scholar
50.
1. Singh KS
2. Cordeiro EMG
3. Troczka BJ
4. Pym A
5. Mackisack J
6. Mathers TC
7. et al.
2021Global patterns in genomic diversity underpinning the evolution of insecticide resistance in the aphid crop pest Myzus persicaeCommun Biol 4:1–16https://www.nature.com/articles/s42003-021-02373-x Google Scholar
51.
2019Kruijer, Willem, and with a contribution from Ian White, contains data collected by Padraic Flood and Rik Kooke. 2019. “Heritability: Marker-Based Estimation of Heritability Using Individual Plant or Plot Data.” https://CRAN.R-project.org/package=heritability.https://CRAN.R-project.org/package=heritability
52.
1. Gao J
2. Fang C
3. Zhao B
2019The latitudinal herbivory hypothesis revisited: To be part is to be wholeEcol Evol 9:3681–8https://onlinelibrary.wiley.com/doi/abs/10.1002/ece3.2759 Google Scholar
53.
1. Atwell S
2. Huang YS
3. Vilhjálmsson BJ
4. Willems G
5. Horton M
6. Li Y
7. et al.
2010Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred linesNature 465:627–31https://www.nature.com/articles/nature08800 Google Scholar
54.
1. Sobota RS
2. Shriner D
3. Kodaman N
4. Goodloe R
5. Zheng W
6. Gao YT
7. et al.
2015Addressing population-specific multiple testing burdens in genetic association studiesAnn Hum Genet 79:136–47https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4334751/Google Scholar
55.
1. Kim JH
2. Jander G
2007Myzus persicae (green peach aphid) feeding on Arabidopsis induces the formation of a deterrent indole glucosinolatePlant J 49:1008–19https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1365-313X.2006.03019.x Google Scholar
56.
1. Klingauf F
2. Şengonca Ç
3. Bennewitz H
1972Einfluß von Sinigrin auf die Nahrungsaufnahme polyphager und oligophager Blattlausarten (Aphididae) (Effect of Sinigrin on Sucrose Uptake by Some Polyphagous and Oligophagous Aphids (Aphididae))Oecologia 9:53–7https://www.jstor.org/stable/4214736 Google Scholar
57.
2000The Arabidopsis Information Resource (TAIR). 2000. www.arabidopsis.org/servlets/TairObject?id=137911&type=locushttp://www.arabidopsis.org/servlets/TairObject?id=137911&type=locus
58.
1. Camacho C
2. Coulouris G
3. Avagyan V
4. Ma N
5. Papadopoulos J
6. Bealer K
7. et al.
2009BLAST+: architecture and applicationsBMC Bioinformatics 10:421https://doi.org/10.1186/1471-2105-10-421 Google Scholar
59.
1. Martin M
2011Cutadapt removes adapter sequences from high-throughput sequencing readsEMBnet.journal 17:10–2http://journal.embnet.org/index.php/embnetjournal/article/view/200 Google Scholar
60.
1. Andrews S
2010FASTQC. A quality control tool for high throughput sequence dataBibSonomy https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
61.
1. Ewels P
2. Magnusson M
3. Lundin S
4. Käller M
2016MultiQC: summarize analysis results for multiple tools and samples in a single reportBioinformatics 32:3047–8https://doi.org/10.1093/bioinformatics/btw354 Google Scholar
62.
1. Li H
2. Durbin R
2009Fast and accurate short read alignment with Burrows–Wheeler transformBioinformatics 25:1754–60https://doi.org/10.1093/bioinformatics/btp324 Google Scholar
63.
1. Van der Auwera GA
2. Carneiro MO
3. Hartl C
4. Poplin R
5. et al.
2013From FastQ Data to High-Confidence Variant Calls: The Genome Analysis Toolkit Best Practices PipelineCurr Protoc Bioinforma 43:11–11https://currentprotocols.onlinelibrary.wiley.com/doi/abs/10.1002/0471250953.bi1110s43 Google Scholar
64.
1. Poplin R
2. Ruano-Rubio V
3. DePristo MA
4. Fennell TJ
5. Carneiro MO
6. Auwera GAV der
7. et al.
2018Scaling accurate genetic variant discovery to tens of thousands of samplesbioRxiv :201178https://www.biorxiv.org/content/10.1101/201178v3 Google Scholar
65.
1. Frantzeskakis L
2. Kracher B
3. Kusch S
4. Yoshikawa-Maekawa M
5. Bauer S
6. Pedersen C
7. et al.
2018Signatures of host specialization and a recent transposable element burst in the dynamic one-speed genome of the fungal barley powdery mildew pathogenBMC Genomics 19:381https://doi.org/10.1186/s12864-018-4750-6 Google Scholar
66.
1. Danecek P
2. Bonfield JK
3. Liddle J
4. Marshall J
5. Ohan V
6. Pollard MO
7. et al.
2021Twelve years of SAMtools and BCFtoolsGigaScience 10:giab008https://doi.org/10.1093/gigascience/giab008 Google Scholar
67.
1. Danecek P
2. Auton A
3. Abecasis G
4. Albers CA
5. Banks E
6. DePristo MA
7. et al.
2011The variant call format and VCFtoolsBioinformatics 27:2156–8https://doi.org/10.1093/bioinformatics/btr330 Google Scholar
68.
1. Browning BL
2. Zhou Y
3. Browning SR
2018A One-Penny Imputed Genome from Next-Generation Reference PanelsAm J Hum Genet 103:338–48https://www.sciencedirect.com/science/article/pii/S0002929718302428 Google Scholar
69.
1. Paczian T
2. Trimble WL
3. Gerlach W
4. Harrison T
5. Wilke A
6. Meyer F
2019The MG-RAST API explorer: an on-ramp for RESTful query compositionBMC Bioinformatics 20:561https://doi.org/10.1186/s12859-019-2993-0 Google Scholar
70.
1. Robinson KM
2. Hawkins AS
3. Santana-Cruz I
4. Adkins RS
5. Shetty AC
6. Nagaraj S
7. et al.
2017Aligner optimization increases accuracy and decreases compute times in multi-species sequence dataMicrob Genomics 3:e000122https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5643015/Google Scholar
71.
1. Prince D
2. Mugford S
3. Vincent T
4. Hogenhout S
2014Pea Aphid Survival Assays on Arabidopsis thalianaBIO-Protoc 4https://bio-protocol.org/e1251 Google Scholar
72.
1988National Center for Biotechnology Information (NCBI). Bethesda (MD): National Library of Medicine (US). 1988. NCBI. https://www.ncbi.nlm.nih.gov/https://www.ncbi.nlm.nih.gov/
73.
1. Sievers F
2. Wilm A
3. Dineen D
4. Gibson TJ
5. Karplus K
6. Li W
7. et al.
2011Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal OmegaMol Syst Biol 7:539https://www.embopress.org/doi/full/10.1038/msb.2011.75 Google Scholar
74.
1. Agerbirk N
2. Olsen CE
3. Heimes C
4. Christensen S
5. Bak S
6. Hauser TP
2015Multiple hydroxyphenethyl glucosinolate isomers and their tandem mass spectrometric distinction in a geographically structured polymorphism in the crucifer Barbarea vulgarisPhytochemistry 115:130–42https://www.sciencedirect.com/science/article/pii/S0031942214003665 Google Scholar
75.
2020Copernicus Climate Change Service. E-OBS daily gridded meteorological data for Europe from 1950 to present derived from in-situ observations. ECMWF; 2020 [cited 2022 Feb 9]. https://cds.climate.copernicus.eu/doi/10.24381/cds.151d3ec6https://cds.climate.copernicus.eu/doi/10.24381/cds.151d3ec6
76.
1. Ziyatdinov A
2. Vázquez-Santiago M
3. Brunel H
4. Martinez-Perez A
5. Aschard H
6. Soria JM
2018lme4qtl: linear mixed models with flexible covariance structure for genetic studies of related individualsBMC Bioinformatics 19:68https://doi.org/10.1186/s12859-018-2057-x Google Scholar
77.
1. Zhou X
2. Stephens M
2012Genome-wide efficient mixed-model analysis for association studiesNat Genet 44:821–4https://www.nature.com/articles/ng.2310 Google Scholar
78.
1. Purcell S
2. Neale B
3. Todd-Brown K
4. Thomas L
5. Ferreira MAR
6. Bender D
7. et al.
2007PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage AnalysesAm J Hum Genet 81:559–75https://www.sciencedirect.com/science/article/pii/S0002929707613524 Google Scholar
79.
1. Emms DM
2. Kelly S
2019OrthoFinder: phylogenetic orthology inference for comparative genomicsGenome Biol 20:238https://doi.org/10.1186/s13059-019-1832-y Google Scholar
80.
1. Nordborg M
2. Hu TT
3. Ishino Y
4. Jhaveri J
5. Toomajian C
6. Zheng H
7. et al.
2005The Pattern of Polymorphism in Arabidopsis thalianaPLOS Biology 3:e196https://doi.org/10.1371/journal.pbio.0030196 Google Scholar
81.
1. Nunn A
2. Can SN
3. Otto C
4. Fasold M
5. Díez Rodríguez B
6. Fernández-Pozo N
7. et al.
2021EpiDiverse Toolkit: a pipeline suite for the analysis of bisulfite sequencing data in ecological plant epigeneticsNAR Genomics Bioinforma 3:lqab106https://doi.org/10.1093/nargab/lqab106 Google Scholar
82.
1. Quinlan AR
2. Hall IM
2010BEDTools: a flexible suite of utilities for comparing genomic featuresBioinformatics 26:841–2https://doi.org/10.1093/bioinformatics/btq033 Google Scholar

Article and author information

Author information

Dario Galanti
Plant Evolutionary Ecology, Institute of Evolution and Ecology, University of Tübingen, 72076 Tübingen, Germany, Royal Botanic Gardens, Kew, Richmond upon Thames, UK
ORCID iD: 0000-0002-6567-1505
- For correspondence: dario.galanti@uni-tuebingen.de
Jun Hee Jung
Plant Evolutionary Ecology, Institute of Evolution and Ecology, University of Tübingen, 72076 Tübingen, Germany
ORCID iD: 0000-0002-6523-4794
Caroline Müller
Chemical Ecology, Bielefeld University, Universitätststr. 25, 33615 Bielefeld, Germany
ORCID iD: 0000-0002-8447-534X
Oliver Bossdorf
Plant Evolutionary Ecology, Institute of Evolution and Ecology, University of Tübingen, 72076 Tübingen, Germany
ORCID iD: 0000-0001-7504-6511

Version history

Sent for peer review: February 7, 2024
Preprint posted: March 4, 2024
Reviewed Preprint version 1: April 3, 2024
Reviewed Preprint version 2: October 29, 2024
Version of Record published: December 19, 2024

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.95510. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

views: 1,431
downloads: 56
citations: 2

Views, downloads and citations are aggregated across all versions of this paper published by eLife.