The genomic landscape of meiotic crossovers and gene conversions in Arabidopsis thaliana

  1. Erik Wijnker
  2. Geo Velikkakam James
  3. Jia Ding
  4. Frank Becker
  5. Jonas R Klasen
  6. Vimal Rawat
  7. Beth A Rowan
  8. Daniël F de Jong
  9. C Bastiaan de Snoo
  10. Luis Zapata
  11. Bruno Huettel
  12. Hans de Jong
  13. Stephan Ossowski
  14. Detlef Weigel
  15. Maarten Koornneef
  16. Joost JB Keurentjes  Is a corresponding author
  17. Korbinian Schneeberger  Is a corresponding author
  1. Wageningen University, Netherlands
  2. IBMP–CNRS, Université de Strasbourg, France
  3. Max Planck Institute for Plant Breeding Research, Germany
  4. Max Planck Institute for Developmental Biology, Germany
  5. Rijk Zwaan, Netherlands
  6. Center for Genomic Regulation (CRG) and Universitat Pompeu Fabra (UPF), Spain
5 figures and 3 additional files

Figures

Figure 1 with 2 supplements
Experimental design and summary of recombination events within 62 recombinants.

(A) 13 complete tetrads were generated by crossing qrt1 in a Col background to qrt1 in a Ler background, and then using single pollen tetrads from the F1 hybrids to fertilize a Cvi male sterile …

https://doi.org/10.7554/eLife.01426.003
Figure 1—figure supplement 1
Graphical genotypes of all 13 complete tetrads.

For each tetrad, all five chromosomes of all four offspring are shown (chromosome one to five, top to bottom). The four offspring genomes of one tetrad are plotted one above the other. Black lines …

https://doi.org/10.7554/eLife.01426.004
Figure 1—figure supplement 2
Graphical genotypes of 10 DH lines.

For each DH line, all five chromosomes are shown, chromosome one to five (top to bottom). Black lines depict peri-centromeric regions.

https://doi.org/10.7554/eLife.01426.005
Figure 2 with 6 supplements
Identification of gene conversions is complex because of unknown duplications and transpositions in the A. thaliana genome.

(A) Short read alignments of Ler against the reference sequence at position 13,957,178 on chromosome 1. Individual reads are shown as blue lines, while mismatches between reference sequence and …

https://doi.org/10.7554/eLife.01426.006
Figure 2—figure supplement 1
Graphical illustration of short read alignments at type-1 and type-2 markers revealing no NCO–GCs (two loci at the left) and the same loci revealing type-1 and type-2 NCO-GCs (right).

Short read alignments are shown in colors according to the respective genome, if they cannot be uniquely assigned to one unique parent they are shown in grey. At type-1 markers, the expected allele …

https://doi.org/10.7554/eLife.01426.007
Figure 2—figure supplement 2
Observed allele distribution throughout all deeply sequenced tetrad genomes.

Observed allele frequencies (AFs) were derived from allele counts (based on short read alignments) at the respective marker loci. AFs at type-1 markers (homozygous alleles) are shown in red, AFs at …

https://doi.org/10.7554/eLife.01426.008
Figure 2—figure supplement 3
NCO–GC frequency per marker per meiosis measured in the five deeply sequenced tetrads, using a range of minimal coverage thresholds and three different marker sets.

NCO–GC frequency was assessed using different marker sets. Marker sets with more stringent filtering showed a lower frequency of putative NCO–GCs, which indicates that filtering reduces the …

https://doi.org/10.7554/eLife.01426.009
Figure 2—figure supplement 4
The number of putative type-1 and type-2 NCO–GCs in all 20 deeply sequenced tetrad offspring using different marker sets.

Bar charts show the number of NCO–GC detected per offspring plant using different marker set generated with different filter stringencies in each subsequent step (top to bottom). Quality score …

https://doi.org/10.7554/eLife.01426.010
Figure 2—figure supplement 5
Transposed sequences on A. thaliana chromosome 3.

(A) Upper panel shows, analogous to Figure 2B, the location of transposed DNA between the two parental lines Ler and Col. Thin blue and red lines indicate co-linear sequences. Green thick lines show …

https://doi.org/10.7554/eLife.01426.011
Figure 2—figure supplement 6
NCO–GC frequency per marker per meiosis measured in 10 recombinant DH lines at increasing minimal coverage thresholds.

From a minimal coverage threshold of 10 onwards, the observed frequency of putative NCO–GC does not majorly change. This is in contrast to the minimal coverage requirement for the tetrad sample …

https://doi.org/10.7554/eLife.01426.012
Sequence similarity at recombination sites, length differences between COCTs and NCOCTs and crossover interference.

(A) The percentage of confidently aligned positions within the resequencing of Ler was used as proxy for local sequence similarity. The percentages at COs were compared to a background distribution …

https://doi.org/10.7554/eLife.01426.013
Recombination sites are enriched for un-methylated, AT rich promoter regions.

(A) The GC content was calculated for 67 conversion tracts (including 500 bp of flanking sequence, top) and compared to a background distribution of 5,000 equally sized random locations sampled from …

https://doi.org/10.7554/eLife.01426.014
Recombination sites are significantly associated with two sequence motifs and nucleosome-free regions.

(A) Two sequence motifs (poly-A and CTT/GAA) were found significantly enriched at recombination sites after searching for over-enriched motifs using MEME. We established background frequency …

https://doi.org/10.7554/eLife.01426.015

Additional files

Supplementary file 1

(A) Read number. (B) Number of crossovers per tetrad and DH line and chromosome. (C) CO location. (D) NCO location and extent within the tetrad samples. (E) CO location in DH lines. (F) Exact loci of CO and extent of associated GC in tetrad samples. (G) Converted markers within COCTs in tetrad samples. (H) Converted markers within NCOCTs in DH lines.

https://doi.org/10.7554/eLife.01426.016
Supplementary file 2

(A) Initial marker list used for an initial genotyping of the tetrad samples. (B) Filtered marker list used to genotype tetrad samples. (C) High quality marker list used to genotype DH samples. (D) Primer pairs used for verification of NCOs in tetrad offspring. (E) Primer pairs used for verification of NCOs in DH lines. (F) Primers used for the verification of transposed DNA on chromosome 3.

https://doi.org/10.7554/eLife.01426.017
Supplementary file 3

Includes a visualization of the exact makeup of 71 COs identified in the tetrad samples, for which sufficient sequencing information was available. The colored areas indicate the number of short read alignments for each of the positions as indicated on the x-axis. Red and blue areas refer to regions that descended from Col-0 and Ler, respectively. Grey areas cannot be assigned to either of them. Vertical lines indicate sequence differences between the parental genotypes and have the respective genotype indicated next to them. Within the CO sites (between the outer borders of both grey areas) the polymorphism data are based on hand curated short read alignments and local assemblies. Outside the flanking markers the polymorphisms encompass the marker used for reconstructing the recombinant chromosomes only. Note each tetrad sample contains one recombinant chromosome and one that is derived from the Cvi parent. The Cvi alleles are not indicated in these plots.

https://doi.org/10.7554/eLife.01426.018

Download links