Abundant Parent-of-origin Effect eQTL: The Framingham Heart Study

Yongtao Guan; Tianxiao Huan; Daniel Levy

doi:10.7554/eLife.107920.1

Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, public reviews, and a provisional response from the authors.

Reviewing Editor
Siming Zhao
Dartmouth College, Lebanon, United States of America
Senior Editor
Alan Moses
University of Toronto, Toronto, Canada

Reviewer #1 (Public review):

Summary:

This study presents a systematic investigation of parent-of-origin effects on gene expression using trio-based data from the Framingham Heart Study, which is notable for its relatively large number of trios. By combining whole-genome and RNA sequencing data, the authors examined the extent to which gene expression is influenced by whether genetic variants are inherited maternally or paternally.

The authors report that parent-of-origin eQTLs are widespread, identifying 15,893 eQTLs from 14,733 variants and 1,824 genes that were significant in paternal, maternal, or joint tests but not detected by traditional eQTL approaches. They further classified these associations based on the relative strength and direction of paternal and maternal effects, highlighting a subset with opposing directions. The study also highlighted eGenes linked to known imprinted genes as well as those with opposing parent-specific effects, and observed that paternal eGenes are enriched for drug targets. Finally, the work revisits previous findings in which eQTL studies were used to interpret disease-associated loci, emphasizing that conventional eQTL analyses without testing the parent-of-origin may mislead gene prioritization efforts. The study recommends that future downstream analyses, such as Mendelian randomization, take into account the provided lists of SNPs and eGenes and exclude those with strong parent-of-origin effects when linking genetic regulation to disease risk.

Strengths:

The major strength of the study lies in the scale and quality of the dataset, the trio-based design, and the systematic application of statistical tests for parent-of-origin effects. The strengths thoughtfully employed Bayes factors rather than p-values to provide stronger evidence of association, which adds rigor to their analyses. These design choices provide compelling evidence that parent-of-origin effects are widespread and that conventional eQTL analyses miss a substantial fraction of regulatory variation. The results are clearly presented and supported by robust analyses, including the identification of opposing parental effects and the enrichment of paternal eGenes for drug targets. Notably, the two examples demonstrating how these findings can reshape disease gene prioritization highlight the broader impact of the study and encourage further work in the community to incorporate parent-of-origin effects.

Weaknesses:

The main limitations of the study are threefold. First, there is a lack of replication in independent cohorts, which is understandable given the difficulty of identifying datasets with a comparable number of trios, but replication would help establish the generalizability of the findings. Second, while Bayes factors are thoughtfully used to assess evidence of association, the paper does not fully explore how the chosen thresholds translate to the expected rate of false positives. For example, a minor allele frequency cutoff of 1% was applied, which seems somewhat arbitrary, and without reporting the allele frequency distribution of the identified eQTLs, it is unclear whether rare variants disproportionately contribute to the signals, potentially affecting the reliability of discoveries. Third, the ancestry background of the study samples is not reported, which could be a confounding factor in the genetic analyses.

https://doi.org/10.7554/eLife.107920.1.sa2

Reviewer #2 (Public review):

Summary:

The authors have used 1477 sequenced trios with available gene expression data in the offspring to discover eQTLs that act in a parent-of-origin specific manner. The classified associated SNPs are tested for enrichment for GWAS hits, drug target genes, etc.

Strengths:

The manuscript presents an impressive analysis of a very rich data set of parent-of-origin eQTLs. To my knowledge, it is one of the largest studies of its kind, most analyses are sound, and the results are of interest to many in the field and potentially beyond. The different ideas of follow-up analyses are useful and make sense.

Weaknesses:

While in general the analyses are well-conducted, I noticed a major issue with the POE eQTL classification, which puts into question most of the downstream analysis. In light of this problem, most of the analysis would need to be rerun, which represents a major revision of the paper, but is straightforward to repair.

The major problem with the classification of POEs is that simply having significant maternal, but insignificant paternal effect is not an indicator of POE, this happens widely for SNPs with no POE whatsoever (it can happen by chance even when both maternal and paternal effects are the same and non-zero - the authors can see it via simulations under the null [maternal=paternal effect]). In order to be able to talk about POE, first, a significant difference between maternal and paternal effects needs to be claimed. Therefore, none of the 4 sets of POE eQTLs are justified. To me, the only relevant criterion to pick POE SNPs is the P-value when comparing the maternal and paternal effects. The definitions of the 4 groups are based on somewhat ad hoc priors, BF thresholds, etc. Also, in Section 4.6, the value of theta is arbitrarily chosen (along with the threshold of 4 to declare POE). In my opinion, the clean treatment of the 4 groups would start with a significant P-value (beta_maternal vs beta_paternal). Within this set, you can then use the original criteria presented in the paper, but only among these associations where there is solid evidence of different parental effects.

https://doi.org/10.7554/eLife.107920.1.sa1

Author response:

We thank the two anonymous reviewers who took the time and effort to read and evaluate our work. We look forward to submitting a revised version of the manuscript that addresses their comments.

A major concern shared between both reviewers is our use of Bayes factors instead pvalues to measure the strength of association. In revision, we will add a section in Supplementary to compare and constrast Bayes factor and p-values. Very briefly here, p-value is the tail probability under the null. Formally, it is defined as P(T > t|H₀), for a test statistic T with obvserved value t computed from data D. But our interest is P(H₀|D) and P(H₁|D), posterior probabilities of the null and alternative models, about which p-value says nothing. With FDR approach, a q-value, the minium FDR at which a null is rejected, which can be estimated from a collection of p-values, has a Bayesian interpretation as the probability that H₀ is true conditioning on rejecting that H₀. This is not quite P(H₀|D) but nevertheless a useful probabilistic statement. For FDR approach to work, however, the collection of tests need to be reasonably independent, and their effect sizes need to be mixed. Both implicit assumptions can fail for cis eQTL analysis.

On the other hand, with Bayes factors we can compute posterior probability P(H₀|D) and P(H₁|D) after specifying prior odds P(H₁)/P(H₀) (or equivalently P(H₁) since P(H₀)+ P(H₁) = 1). In our manuscript, the prior odds used to determined Bayes factor threshold is 1/1000, or about 1 cis eQTL per gene. Bayes factor also allows us to directly compare two non-nested alternative models P(paternal effect|D) and P(maternal effect|D), which is difficult to do using p-values.

It was suggested (by reviewer 2) that POE eQTL should be defined by testing H₀ : θ₀ = θ₁ against H₁ : θ₀ ̸= θ₁ where θ₀ and θ₁ are maternal and paternal effects respectively. This indeed was our initial approach, as evidenced in Table 1 (last column) and Section 4.5 in Methods. Our final approach is more stringent: H₀ : β₀ = β₁ = 0 against H₁ : β₀ = 0,β₁/= 0, to use test for paternal effect as an example (the test for maternal effect can be obtained in a similar fashion). That is, we not only require that paternal and maternal effects be the same, as suggested by reviewer, but also require that they are both 0 under the null. This is partially motivated by an example in Table 1 (Gene ZNF890P) where both β₀ > 0 and β₁ > 0, and β₀/= β₁. In other words, examples like this where both paternal and maternal effects are significant and their differences are also significant were not included in our downstream classification and further analysis.

https://doi.org/10.7554/eLife.107920.1.sa0

Abundant Parent-of-origin Effect eQTL: The Framingham Heart Study

Peer review process

Editors

Be the first to read new articles from eLife