Expansion of the circadian transcriptome in Brassica rapa and genome-wide diversification of paralog expression patterns
Abstract
An important challenge of crop improvement strategies is assigning function to paralogs in polyploid crops. Here, we describe the circadian transcriptome in the polyploid crop Brassica rapa. Strikingly, almost three quarters of expressed genes exhibited circadian rhythmicity. Genetic redundancy resulting from whole genome duplication is thought to facilitate evolutionary change through sub- and neo-functionalization among paralogous gene pairs. We observed genome-wide expansion of circadian expression phase among retained paralogous pairs. Using gene regulatory network models, we compared transcription factor targets between B. rapa and Arabidopsis circadian networks to reveal evidence for divergence between B. rapa paralogs that may be driven in part by variation in conserved non-coding sequences (CNS). Additionally, differential drought response among retained paralogous pairs suggests further functional diversification. These findings support the rapid expansion and divergence of the transcriptional network in a polyploid crop and offer a new approach for assessing paralog activity at the transcript level.
Data availability
Sequencing data have been deposited in GEO under accession codes GSE123654
Article and author information
Author details
Funding
National Science Foundation (IOS-1202779)
- Kathleen Greenham
National Science Foundation (IOS-1711662)
- Ryan C Sartor
National Science Foundation (IOS-1547796)
- C Robertson McClung
Rural Development Administration (Next Generation BioGreen 21,grant number SSAC PJ01327306)
- C Robertson McClung
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Reviewing Editor
- Christian S Hardtke, University of Lausanne, Switzerland
Version history
- Received: May 15, 2020
- Accepted: September 29, 2020
- Accepted Manuscript published: September 30, 2020 (version 1)
- Version of Record published: November 10, 2020 (version 2)
Copyright
© 2020, Greenham et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 2,750
- views
-
- 405
- downloads
-
- 24
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Genetics and Genomics
LD score regression (LDSC) is a method to estimate narrow-sense heritability from genome-wide association study (GWAS) summary statistics alone, making it a fast and popular approach. In this work, we present interaction-LD score (i-LDSC) regression: an extension of the original LDSC framework that accounts for interactions between genetic variants. By studying a wide range of generative models in simulations, and by re-analyzing 25 well-studied quantitative phenotypes from 349,468 individuals in the UK Biobank and up to 159,095 individuals in BioBank Japan, we show that the inclusion of a cis-interaction score (i.e. interactions between a focal variant and proximal variants) recovers genetic variance that is not captured by LDSC. For each of the 25 traits analyzed in the UK Biobank and BioBank Japan, i-LDSC detects additional variation contributed by genetic interactions. The i-LDSC software and its application to these biobanks represent a step towards resolving further genetic contributions of sources of non-additive genetic effects to complex trait variation.
-
- Evolutionary Biology
- Genetics and Genomics
Copy number variation in large gene families is well characterized for plant resistance genes, but similar studies are rare in animals. The zebrafish (Danio rerio) has hundreds of NLR immune genes, making this species ideal for studying this phenomenon. By sequencing 93 zebrafish from multiple wild and laboratory populations, we identified a total of 1513 NLRs, many more than the previously known 400. Approximately half of those are present in all wild populations, but only 4% were found in 80% or more of the individual fish. Wild fish have up to two times as many NLRs per individual and up to four times as many NLRs per population than laboratory strains. In contrast to the massive variability of gene copies, nucleotide diversity in zebrafish NLR genes is very low: around half of the copies are monomorphic and the remaining ones have very few polymorphisms, likely a signature of purifying selection.