Epigenome-wide analysis of DNA methylation and coronary heart disease: a nested case-control study

  1. Jiahui Si
  2. Songchun Yang
  3. Dianjianyi Sun
  4. Canqing Yu
  5. Yu Guo
  6. Yifen Lin
  7. Iona Y Millwood
  8. Robin G Walters
  9. Ling Yang
  10. Yiping Chen
  11. Huaidong Du
  12. Yujie Hua
  13. Jingchao Liu
  14. Junshi Chen
  15. Zhengming Chen
  16. Wei Chen
  17. Jun Lv  Is a corresponding author
  18. Liming Liang  Is a corresponding author
  19. Liming Li  Is a corresponding author
  20. China Kadoorie Biobank Collaborative Group
  1. Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, China
  2. Departments of Epidemiology and Biostatistics, Harvard T.H. Chan School of Public Health, United States
  3. Chinese Academy of Medical Sciences, China
  4. Department of Urology, West China Hospital, Sichuan University, China
  5. Medical Research Council Population Health Research Unit at the University of Oxford, United Kingdom
  6. Clinical Trial Service Unit & Epidemiological Studies Unit (CTSU), Nuffield Department of Population Health, University of Oxford, United Kingdom
  7. NCDs Prevention and Control Department, Suzhou CDC, China
  8. NCDs Prevention and Control Department, Wuzhong CDC, China
  9. China National Center for Food Safety Risk Assessment, China
  10. Department of Epidemiology, School of Public Health and Tropical Medicine, Tulane University, United States
  11. Key Laboratory of Molecular Cardiovascular Sciences (Peking University), Ministry of Education, China
  12. Peking University Institute of Environmental Medicine, China

Peer review process

This article was accepted for publication as part of eLife's original publishing model.

History

  1. Version of Record updated
  2. Version of Record published
  3. Accepted Manuscript published
  4. Accepted
  5. Received

Decision letter

  1. Edward D Janus
    Reviewing Editor; University of Melbourne, Australia
  2. Matthias Barton
    Senior Editor; University of Zurich, Switzerland
  3. Edward D Janus
    Reviewer; University of Melbourne, Australia
  4. Ida Karlsson
    Reviewer; Karolinska Institutet, Sweden

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The authors present the first large long term study in a non white population of the association between epigenetic changes and coronary heart disease risk over ten years. In this well conducted study there are novel findings for the increased risk of smoking and for impact on blood pressure. This advances our overall knowledge of epigenetic regulation in the pathways to coronary heart disease.

Decision letter after peer review:

Thank you for submitting your article "Epigenome-wide analysis of DNA methylation and coronary heart disease: a nested case-control study" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, including Edward D Janus as Reviewing Editor and Reviewer #1, and the evaluation has been overseen by Matthias Barton as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Ida Karlsson (Reviewer #3).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

The overall comments from the three reviewers are reproduced below. The paper is excellent. We have provided a list of comments and request you consider these in providing your revision.

Suggested recommendations:

1) Starting with the main concern, the lack of replication is of course an issue (as in most EWASs). Could you compare the top hits from the largest previous EWAS of incident CHD, to see if the direction of effect is similar and if the significance is suggestive in your data? That would tell us a little bit more about the comparability.

2) It would be helpful if some of the methods (and sample description) were provided along with the results, to better follow along the findings (without having to go to the end to read the results first). I found Figure 1 helpful in understanding the results and suggest that it could be presented with the results instead of the methods.

3) Some minor clarifications and justifications I would find helpful as a reader:

a. I am not very familiar with the co-methylation network approach. Could you provide a brief overview of the method (preferably along with the results)? As the majority of the findings stem from this method, a justification of its robustness would be helpful.

b. I agree with what you say in the discussion, that it is not possible to say what direction mediation works, but it would be good with a sentence to explain the reason of the directions selected to study mediation.

c. The sample selection: I assume you aimed to capture severe cases by including only fatal IHD or nonfatal MI counted as CHD? Why were individuals with neoplasms or cerebrovascular disease excluded?

4) A couple of questions regarding the statistical analyses:

a. The analysis description for single CpG sites only says linear regression. Were the analyses not matched for case-control status (i.e. conditional linear regression), and if so why not? That should be the more powerful and robust approach.

b. In relation to that, why adjust for the matching variables?

c. I am slightly worried about overadjustment, especially in the mediation analyses as several of the lifestyle covariates are likely correlated (e.g. physical activity and BMI). Might including these adjustments in the mediation analyses mask an effect? And for the main analyses, did you also test a simpler model with only basic adjustments for comparison?

5) Please check the number of incident cases and controls so the numbers are consistent in the abstract, figure 1, introduction, results etc. The numbers as currently shown vary slightly from 489 to 494.

6) In the first paragraph of the Results section please expand as the readers won't all look at the methodology section. See also (2) above.

Suggest – 491 cases free of CHD at baseline and developing CHD during follow up and 491 controls free of CHD at baseline and follow up and matched for age, gender, region and timing of blood sampling…

7) Table 1. Please show statistical significance p value for prevalent hypertension and diabetes and for lipids.

8) line 122, can you be specific about items related "healthy lifestyle" and are they known risk factors for CHD?

9) line 126, in my opinion, the genomic inflation factor is not helpful here as it is not a good indicator for EWAS. It is because CpGs are more much correlated than SNPs, and the inflation factor is very much dependent on the trait.

10) Line 130: The difference "-0.003" between cases and controls is very small and hard to detect. Can you also show the SD of the two CpGs, or simply plot their distributions in cases and controls.

11) Line 144: Why do you use "probe" not "CpG" in this paragraph?

12) Line 156 change none to no.

13) Line 242: Did you compare the maximum follow-up time and age distribution in the other two prospective publications? I believe the choice of different follow-up time can be an important factor as we are not sure how long it takes CpGs to have effects on CHD.

14) Line 312: "lab staff were blinded" does it also mean the cases and controls were randomized on arrays/batches. This is important as we don't want to lose study power by adjusting batch effect.

15) Line 365: Remove the repeated sentence "A total of 56 SVs were generated."

16) Line 434: How were cellular compositions controlled when you could not use SV here?

17) Can you also report the corresponding p-value of the FDR 0.05 threshold?

18) If you used β-values of DNA methylation in the analysis, please state it.Reviewer #1:

The authors examine the association between genome wide epigenetic changes and coronary heart disease in an effort to further explore the residual risk not explained by known risk factors

The major strengths are that this is a large longitudinal study over 10 years in non whites that is Chinese. Earlier studies have used small samples, been primarily in Western countries, have focused on selective genomic regions rather than the whole genome and with a few exceptions have been cross sectional which precludes establishment of a temporal and potentially causative relationship.

No major weaknesses are apparent. There has been a comprehensive effort to exclude confounders. In limitations the possibility of some residual confounders is noted.

In this well conducted study there are novel findings for impact of DNA methylation at 25 cytosine phosphate-guanine (CpG) dinucleotides, predominantly novel, on CHD risk; for the increased risk of smoking (25% mediated by methylation at one specific site) and for impact on blood pressure and intracellular signaling as well.

The authors have achieved their aims and the results support their conclusions.

This study advances our overall knowledge of epigenetic regulation in the pathways to coronary heart diseaseReviewer #2:

This study used a matched case-control design to study the prospective effect of DNA methylation on coronary heart disease. The study data included 491 cases and 491 controls which can provide enough statistical power. The authors used proper statistical methods to achieve their aims, and the results support their conclusions. This study will contribute to our better understanding of the effect of DNA methylation on coronary heart disease.

Reviewer #3:

The authors present a thorough and very well-written EWAS of incident CHD in 491 cases and 491 matched controls. The sample was drawn from the China Kadoorie Biobank, with 10-years of follow-up, and the data had rich baseline information in addition to DNA methylation measures. Single site association analyses identified two sites (FDR p-value <0.05), and a network approach 24 sites (module-specific FDR<0.01; one site overlapping with single site associations). They further conducted mediation analyses, testing a wide range of cardiometabolic and lifestyle factors, and found evidence of mediation effects at some of the sites in relation to smoking and blood pressure. A weakness is that which is common to most EWASs, namely the lack of replication. However, the issue also highlights the importance of further studies in the field. The prospective nature as well as the Asian population further strengthens the importance, as both aspects are largely lacking in the field.

The methodology is sound and support the conclusions of the paper. I especially appreciate the addition of mediation analyses, as it provides a better understanding of potential biological pathways.

https://doi.org/10.7554/eLife.68671.sa1

Author response

Suggested recommendations:

1) Starting with the main concern, the lack of replication is of course an issue (as in most EWASs). Could you compare the top hits from the largest previous EWAS of incident CHD, to see if the direction of effect is similar and if the significance is suggestive in your data? That would tell us a little bit more about the comparability.

The previous largest EWAS of incident CHD used a meta-analysis of nine population-based cohorts and 11,461 participants from the United States and European countries1. Based on HumanMethylation450 BeadChip data, methylation levels at 30 CpG sites were identified to be associated with incident CHD, and 30 were associated with incident myocardial infarction (MI). The direction of effect was not quite the same. The effect of 55.2% (CHD-associated) and 51.9% (MI-associated) CpGs showed the same directions in our study and in the previous study. (Author response table 1) showed the β coefficient and p-value of these 60 top hits in our study and in the previous study. The genetic background of the study population might be an important factor for this difference and lack of comparability. Also, as the reviewer mentioned in comment 14, age distribution might explain the difference in results. The mean age of the participants in the present study was 50.1 years, and that of the previous EWAS was 64 years.

Author response table 1
OutcomeCpG namebeta coefficient in ourstudyp-value in our studybeta coefficient in the previous studyp-valuein theprevious studyGene
CHDcg22617878-5.89E-043.75E-01−0.37191.99E-08ATP2B2
cg138272098.39E-044.47E-010.2683.76E-08TGFBR1
cg14185717//−0.28781.38E-07BNC2
cg10307345-3.62E-031.88E-01−0.14801.86E-07PTPN5
cg13822123-6.12E-041.45E-010.41382.03E-07PSME4
cg232453161.04E-031.18E-01−0.46742.17E-07TSSC1
cg249772762.83E-034.60E-02−0.32562.54E-07GTF2I
cg244477881.17E-033.20E-01−0.26794.33E-07(PTBP1**)
cg084228031.35E-033.47E-010.19947.52E-07ITGB2
cg01751802-1.43E-035.85E-010.14739.35E-07KANK2
cg024493732.25E-059.77E-010.37159.98E-07FUT1
cg02683350-1.19E-033.62E-01−0.50621.55E-06ADAMTS2
cg058203122.89E-047.87E-010.50311.60E-06TRAPPC9
cg06639874-8.25E-046.41E-01−0.25061.83E-06MLPH
cg06582394-2.56E-031.49E-010.16571.90E-06CASR
cg02155262-4.62E-043.07E-010.4771.97E-06AGA
cg127663832.26E-033.11E-02−0.61941.98E-06UBR4
cg058924845.55E-059.60E-01−0.50202.01E-06MAD1L1
cg030318681.46E-048.65E-010.34612.29E-06ESD
cg25497530-5.63E-031.45E-03−0.22252.62E-06PTPRN2
cg06596307-2.06E-031.24E-01−0.41982.99E-06IGF1R
cg10702366-2.35E-031.47E-01−0.10933.09E-06FGGY
cg264701011.24E-034.46E-010.30523.09E-06(DLX2**)
cg260420241.66E-034.16E-01−0.31093.13E-06ZFAT
cg004661217.27E-044.26E-010.46463.16E-06ZNHIT6
cg04987302-1.96E-031.85E-01−0.33783.71E-06(OTX2-AS1**)
cg088534941.82E-047.27E-010.2214.03E-06RCHY1;THAP6
cg264677251.34E-032.02E-01−0.42254.22E-06SLCO3A1
cg064421929.07E-045.78E-01−0.52414.89E-06ZNF541
cg003933731.11E-032.92E-01−0.31564.91E-06ZNF518B
MIcg228717974.77E-046.85E-01−0.5995.29E-08CYFIP1
cg249772762.83E-034.60E-02−0.3669.97E-08GTF2I
cg185988611.43E-031.98E-01−0.6711.61E-07IRF9
cg097777767.95E-047.38E-010.2872.25E-07ZNF254
cg20545941-3.22E-046.69E-01−0.8852.47E-07MPPED1
cg199358459.61E-045.30E-01−0.3364.65E-07TNXB
cg24423782-2.37E-031.33E-01−0.3985.37E-07MIR182
cg003933731.11E-032.92E-01−0.4017.68E-07ZNF518B
cg004661217.27E-044.26E-010.4877.79E-07ZNHIT6
cg19227382//−0.5048.12E-07CDH23
cg03467256//−0.4088.33E-07HPCAL1
cg251968813.45E-047.46E-01−0.2691.05E-06(THBS1**)
cg023211129.68E-058.99E-010.391.08E-06(MNX1-AS1**)
cg003557992.79E-048.27E-01−0.2161.40E-06(LOC339529**)
cg17556588-1.20E-033.90E-01−0.1541.45E-06PRRG4
cg04987302-1.96E-031.85E-01−0.4281.50E-06(OTX2-AS1**)
cg07289306-1.45E-033.24E-020.6161.71E-06(MIR138-1**)
cg058924845.55E-059.60E-01−0.5511.84E-06MAD1L1
cg10702366-2.35E-031.47E-01−0.1502.11E-06FGGY
cg22618720//−0.4242.37E-06(MIR5095**)
cg140101943.46E-047.84E-01−0.4842.71E-06GUCA1B
cg138272098.39E-044.47E-010.2852.71E-06TGFBR1
cg24318598-1.75E-032.59E-01−0.2542.79E-06ANO1
cg070157751.01E-031.36E-010.4793.13E-06ZNHIT6
cg210181562.00E-031.39E-01−0.1353.17E-06(LINC01312**)
cg07475527-1.64E-032.46E-01−0.2253.77E-06(RCAN3**)
cg200005622.06E-038.15E-020.2183.93E-06SFTA3
cg074368073.57E-047.88E-01−0.7794.10E-06STAMBPL1; ACTA2
cg14029912-1.62E-031.39E-01−0.3674.29E-06(BHLHE40**)
cg228717974.77E-046.85E-01−0.5995.29E-08CYFIP1

2) It would be helpful if some of the methods (and sample description) were provided along with the results, to better follow along the findings (without having to go to the end to read the results first). I found Figure 1 helpful in understanding the results and suggest that it could be presented with the results instead of the methods.

We thank the reviewer for the thoughtful comment. We have revised the manuscript to make it easy to follow (Line 81-88).

3) Some minor clarifications and justifications I would find helpful as a reader:

a. I am not very familiar with the co-methylation network approach. Could you provide a brief overview of the method (preferably along with the results)? As the majority of the findings stem from this method, a justification of its robustness would be helpful.

We used weighted gene co-methylation network analysis2 to identify potential co-methylation network related to CHD. This method can be used to identify clusters of highly correlated co-methylation genes and relate modules to external sample traits to find biologically or clinically significant modules. By calculating correlations among the methylation level of selected CpG sites, we constructed a gene co-methylation network. We then identified gene modules using hierarchical clustering. Next, we related gene modules to CHD outcome.

For computational reasons, we selected the top 20,000 CHD-associated CpGs from single-marker tests. This is about the maximum number of CpGs the WGCNA package can handle on our high-performance computing cluster. A previous study has been restricted 23,000 probe sets to 3600 probes to test the robustness. They found that the module detection results were generally similar2. We also additionally carried out a permutation-based test by shuffling the case-control status and re-selected the top 20,000 CpGs based on the permuted data to construct module and test for association with CHD. We found no inflated false positives due to the selection of top 20,000 CpGs (the most significant module has P>0.032, Figure 2—figure supplement 1).

We have added a brief overview of the co-methylation network approach to the “Result” section (Line 111-114).

b. I agree with what you say in the discussion, that it is not possible to say what direction mediation works, but it would be good with a sentence to explain the reason of the directions selected to study mediation.

We thank the reviewer for the thoughtful comment. DNA methylation is responsive to environmental stimuli and unhealthy lifestyles. This makes DNA methylation a potential biomarker of environmental-related and lifestyle-driven diseases of adulthood, for example, metabolic dysfunction. Unhealthy lifestyles, together with metabolic dysfunction, will further increase the risk of cardiovascular disease. We have added to address this comment (Line 59-65).

c. The sample selection: I assume you aimed to capture severe cases by including only fatal IHD or nonfatal MI counted as CHD? Why were individuals with neoplasms or cerebrovascular disease excluded?

Yes, we included fatal IHD and nonfatal acute MI to capture severe cases. Previous studies suggested that DNA methylation was also a potential biomarker for neoplasms3 or cerebrovascular disease.4,5 Thus, cases with both CHD and cerebrovascular or neoplasms could present a mixture of epigenetic changes. We excluded participants who reported at baseline or have developed neoplasms or cerebrovascular diseases during follow-up to better capture the DNA methylation change associated with incident CHD.

4) A couple of questions regarding the statistical analyses:

a. The analysis description for single CpG sites only says linear regression. Were the analyses not matched for case-control status (i.e. conditional linear regression), and if so why not? That should be the more powerful and robust approach.

We didn’t use conditional linear regression in the analysis. Instead, we followed the recommendation by Leek JT, et al.6 to use simple linear regression when yielding surrogate variable analysis for removing batch effects and other unwanted variations in high-throughput experiments. Previous studies7–10 that used matched case-control design also didn’t perform conditional analysis for the matched factors, although the authors employed different methods to remove batch effects and other unmeasured cofounding (SVA,9 adjustment for principal component,7,10 or adjustment for technical variables directly8).

b. In relation to that, why adjust for the matching variables?

We agree with the reviewer’s consideration for the matched design. Besides the reason we mentioned in the last comment, we noticed that cases were still slightly older than controls despite they were already matched by age (Table 1). Thus, we adjusted for matching factors in the model instead of matching for case-control status in the analysis, same as previous studies.7–10

c. I am slightly worried about overadjustment, especially in the mediation analyses as several of the lifestyle covariates are likely correlated (e.g. physical activity and BMI). Might including these adjustments in the mediation analyses mask an effect? And for the main analyses, did you also test a simpler model with only basic adjustments for comparison?

We re-calculated SVs and adjusted for age, sex, body mass index, smoking status, education level, study area, and all SVs for comparison with the previous largest EWAS of CHD. We called this a basic model and our original model as a full model. Please find the results of these two models in Author response table 2. Adjustment for additional lifestyle covariates did not change the association materially.

Author response table 2
CpGBasic modelFull model
βPβP
cg23398826-0.0033.44E-08-0.0031.57E-08
cg023865750.0052.76E-070.0069.61E-08
cg104009370.0021.28E-050.0021.09E-05
cg205628210.0052.95E-050.0052.42E-05
cg081066610.0036.75E-050.0033.16E-05
cg116306100.0054.22E-050.0053.83E-05
cg20302171-0.0043.86E-05-0.0044.29E-05
cg26334131-0.0053.51E-05-0.0054.44E-05
cg07560408-0.0045.73E-05-0.0054.46E-05
cg212105370.00434.81E-050.0044.85E-05
cg158334470.0050.0001070.0065.55E-05
cg025918260.0024.97E-050.0025.70E-05
cg166391380.0020.0001220.0026.46E-05
cg015454540.0026.38E-050.0027.29E-05
cg072191030.0026.77E-050.0027.35E-05
cg056816430.0047.33E-050.0047.42E-05
cg06358566-0.0028.02E-05-0.0027.74E-05
cg19583211-0.0036.46E-05-0.0037.97E-05
cg106438500.0040.0001040.0048.04E-05
cg13311494-0.0048.04E-05-0.0058.50E-05
cg117546700.0010.0001010.0018.84E-05
cg05740632-0.0044.51E-05-0.0049.07E-05
cg08484100-0.0053.69E-05-0.0049.19E-05
cg247921790.0040.0001150.0049.87E-05
cg22794712-0.0050.000173-0.0061.10E-04

Similarly, we also fitted basic models in the mediation analysis by including age, sex, BMI (exclude when BMI was exposure), smoking status (exclude when smoking was exposure), education level, study area, and batch as covariates. The results were largely retained (See Author response table 3).

Author response table 3
Basic modelFull model
Proportion mediated, %Proportion mediated, %Proportion mediated, %Proportion mediated, %
Smoking, no. of cigarettes/daycg0810666126.800.03828.500.036
Diet score (ranging 0-6)cg212105373.890.2564.660.206
cg10643850-6.730.083-6.910.088
cg0574063211.030.06211.300.068
Body mass index, kg/m2cg20302171-2.900.269-2.870.267
cg08484100-2.320.304-1.910.373
Systolic blood pressure*, mmHg cg2339882613.210.0027.650.003
cg133114943.140.08215.610.031
Diastolic blood pressure*, mmHg cg2339882616.320.0026.390.006
cg133114943.800.07912.380.045
Total cholesterol, mmol/L cg26334131-5.670.449-31.620.168
cg05740632-33.040.020-3.210.126
cg21210537-6.700.169-8.190.197
cg195832118.670.1142.730.270
Cholesterol in LDL, mmol/L cg26334131-3.250.406-32.140.135
cg20302171-4.470.263-10.480.161
cg05740632-17.770.032-3.380.110
cg195832119.030.1753.600.210
cg133114949.020.1773.700.208
cg21210537-5.500.175-8.550.177
Cholesterol in HDL, mmol/L cg15833447-3.690.293-10.770.180
cg212105373.790.2607.300.235
Random blood glucose‡, mmol/L cg104009378.920.0566.720.107
cg015454546.080.1229.580.097
cg117546701.260.61336.390.086
cg263341311.500.61232.700.109
cg072191036.760.1026.170.135
cg203021712.490.39715.770.123
  1. * We added 15 and 10 mmHg to the measured systolic blood pressure and diastolic blood pressure respectively among participants who reported usage of blood pressure-lowering medications.

  2. ‡ Additionally adjusted for treatment of diabetes (yes or no) at baseline.

5) Please check the number of incident cases and controls so the numbers are consistent in the abstract, figure 1, introduction, results etc. The numbers as currently shown vary slightly from 489 to 494.

Baseline DNA methylation was measured for 494 CHD cases and 494 matched controls. In the quality control process, we excluded sex mixed-up samples (n=2); samples with missing rate >0.01 across probes (n=2); samples measured in a distinct study batch (n=2). A total of 491 cases and 491 matched controls were retained for the single marker test. Two samples were further excluded during the network analysis because they were outliers during the sample clustering step, with 491 cases and 489 controls retained. We have revised the manuscript to avoid confusion (Line 85-86, and line 114-116).

6) In the first paragraph of the Results section please expand as the readers won't all look at the methodology section. See also (2) above.

Suggest – 491 cases free of CHD at baseline and developing CHD during follow up and 491 controls free of CHD at baseline and follow up and matched for age, gender, region and timing of blood sampling…

We thank the reviewer for the thoughtful comment. We have revised the manuscript to make it easy to follow (Line 81-88).

7) Table 1. Please show statistical significance p value for prevalent hypertension and diabetes and for lipids

We have added the corresponding p values to Table 1.

8) line 122, can you be specific about items related "healthy lifestyle" and are they known risk factors for CHD?

In our study, we found CHD cases were more likely to be daily smokers, have unhealthy dietary habits, and have higher BMI. We have revised the “Results” section to make it clear (Line 92, 93).

9) line 126, in my opinion, the genomic inflation factor is not helpful here as it is not a good indicator for EWAS. It is because CpGs are more much correlated than SNPs, and the inflation factor is very much dependent on the trait.

We showed the inflation factor for comparison with previous studies.12–16 The QQ plot was also shown in Supplementary file 2B. No evidence for inflation was observed in the QQ plots. We have revised the manuscript to address this comment (Line 98, 246). If the reviewer and editor think it is redundant to present the inflation factor, we would be happy to remove it.

10) Line 130: The difference "-0.003" between cases and controls is very small and hard to detect. Can you also show the SD of the two CpGs, or simply plot their distributions in cases and controls.

We have added a column to Table 2 to show the SD of each CpG. We also showed the SD of the top two hits in the “Result” section (Line 104, 105, and 108).

11) Line 144: Why do you use "probe" not "CpG" in this paragraph?

We thank the reviewer for pointing this out. We performed a gene enrichment analysis of the annotated genes of CpG sites from the CHD-associated module. We have revised to keep consistent and avoid confusion (Line 122, 127).

12) Line 156 change none to no

We have revised “none” to “no” (Line 159).

13) Line 242: Did you compare the maximum follow-up time and age distribution in the other two prospective publications? I believe the choice of different follow-up time can be an important factor as we are not sure how long it takes CpGs to have effects on CHD.

We have summarized the mean age and mean follow-up time of our study and two previous prospective studies as below. Our study included younger participants and followed them up for a shorter time period. We agree with the reviewer that different follow-up time and age distribution might explain the differences between our findings and previous prospective publications. We have revised the “Discussion section” to address this comment (Line 223, 227, Author response table 4).

Author response table 4
Present studyGuarrera S, et al.13Agha Golareh, et al.1
No. of cases4912921,895
Sample size98258411,461
Mean age (years)50.152.464
Mean Follow-up time (years)7.612.911.2

14) Line 312: "lab staff were blinded" does it also mean the cases and controls were randomized on arrays/batches. This is important as we don't want to lose study power by adjusting batch effect.

The cases and controls were not strictly randomized on arrays. However, the lab staff was really blinded to the case/control status. We agree with the reviewer that adjusting the batch effect may lose power to some extent. We have added it as a limitation to address this comment (Line 246-248, 292, and 293).

15) Line 365: Remove the repeated sentence "A total of 56 SVs were generated."

We thank the reviewer for pointing this out. We have deleted (Line 346).

16) Line 434: How were cellular compositions controlled when you could not use SV here?

In the association analyses of 25 CHD-associated CpGs and cardiovascular risk factors, we also have performed smartSVA for each trait. Adjustment for all SVs instead of batch did not change the association materially (Table 4–Source Data 1-7). To further address this comment, we also additionally adjusted for cellular compositions. The results were generally similar (Author response table 5) . If the reviewer and the editors think it is better to present the results with adjustment for cellular compositions, we would be happy to update all tables in the manuscript.

Author response table 5
Model 1+adjusted for CC
Effect size(SE)PEffect size(SE)P
Smoking, no. of cigarettes/day cg081066611.50E-04 (4.67E-05)0.0019.00E-05 (4.39E-05)0.041
Systolic blood pressure, mmHg cg23398826-313.75 (91.72)<0.001-255.60 (94.88)0.007
cg13311494-108.86 (48.16)0.020-101.84 (47.02)0.031
Diastolic blood pressure, mmHg cg23398826-184.53 (52.74)<0.001-151.73 (55.07)0.006
cg13311494-59.54 (27.30)0.029-54.51 (27.30)0.046

17) Can you also report the corresponding p-value of the FDR 0.05 threshold?

The corresponding p-value of the FDR = 0.05 threshold was 2.01E-07. We have added to report in the “Result” section (Line 101-102).

18) If you used β-values of DNA methylation in the analysis, please state it.

Yes, we used the β-values of DNA methylation in the analysis. We have added to clarify (Line 334).References

1. Agha Golareh, Mendelson Michael M., Ward-Caviness Cavin K., et al., Blood Leukocyte DNA Methylation Predicts Risk of Future Myocardial Infarction and Coronary Heart Disease. Circulation 2019;140(8):645–57.

2. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 2008;9(1):559.

3. Koch A, Joosten SC, Feng Z, et al., Analysis of DNA methylation in cancer: location revisited. Nat Rev Clin Oncol 2018;15(7):459–66.

4. Martínez-Iglesias O, Carrera I, Carril JC, Fernández-Novoa L, Cacabelos N, Cacabelos R. DNA Methylation in Neurodegenerative and Cerebrovascular Disorders. Int J Mol Sci 2020;21(6):2220.

5. Li X-G, Ma N, Wang B, et al., The impact of P2Y12 promoter DNA methylation on the recurrence of ischemic events in Chinese patients with ischemic cerebrovascular disease. Sci Rep 2016;6(1):34570.

6. Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinforma Oxf Engl 2012;28(6):882–3.

7. Nakatochi M, Ichihara S, Yamamoto K, et al., Epigenome-wide association of myocardial infarction with DNA methylation sites at loci related to cardiovascular disease. Clin Epigenetics 2017;9:54.

8. Guarrera S, Fiorito G, Onland-Moret NC, et al., Gene-specific DNA methylation profiles and LINE-1 hypomethylation are associated with myocardial infarction risk. Clin Epigenetics 2015;7:133.

9. Li J, Zhu X, Yu K, et al., Genome-Wide Analysis of DNA Methylation and Acute Coronary Syndrome. Circ Res 2017;120(11):1754–67.

10. Chambers JC, Loh M, Lehne B, et al., Epigenome-wide association of DNA methylation markers in peripheral blood from Indian Asians and Europeans with incident type 2 diabetes: a nested case-control study. Lancet Diabetes Endocrinol 2015;3(7):526–34.

11. Wahl S, Drong A, Lehne B, et al., Epigenome-wide association study of body mass index, and the adverse outcomes of adiposity. Nature 2017;541(7635):81–6.

12. Nakatochi M, Ichihara S, Yamamoto K, et al., Epigenome-wide association of myocardial infarction with DNA methylation sites at loci related to cardiovascular disease. Clin Epigenetics 2017;9:54.

13. Guarrera S, Fiorito G, Onland-Moret NC, et al., Gene-specific DNA methylation profiles and LINE-1 hypomethylation are associated with myocardial infarction risk. Clin Epigenetics 2015;7:133.

14. Li J, Zhu X, Yu K, et al., Genome-Wide Analysis of DNA Methylation and Acute Coronary Syndrome. Circ Res 2017;120(11):1754–67.

15. Fernández-Sanlés A, Sayols-Baixeras S, Curcio S, Subirana I, Marrugat J, Elosua R. DNA Methylation and Age-Independent Cardiovascular Risk, an Epigenome-Wide Approach: The REGICOR Study (REgistre GIroní del COR). Arterioscler Thromb Vasc Biol 2018;38(3):645–52.

16. Rask-Andersen M, Martinsson D, Ahsan M, et al., Epigenome-wide association study reveals differential DNA methylation in individuals with a history of myocardial infarction. Hum Mol Genet 2016;25(21):4739–48.

https://doi.org/10.7554/eLife.68671.sa2

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Jiahui Si
  2. Songchun Yang
  3. Dianjianyi Sun
  4. Canqing Yu
  5. Yu Guo
  6. Yifen Lin
  7. Iona Y Millwood
  8. Robin G Walters
  9. Ling Yang
  10. Yiping Chen
  11. Huaidong Du
  12. Yujie Hua
  13. Jingchao Liu
  14. Junshi Chen
  15. Zhengming Chen
  16. Wei Chen
  17. Jun Lv
  18. Liming Liang
  19. Liming Li
  20. China Kadoorie Biobank Collaborative Group
(2021)
Epigenome-wide analysis of DNA methylation and coronary heart disease: a nested case-control study
eLife 10:e68671.
https://doi.org/10.7554/eLife.68671

Share this article

https://doi.org/10.7554/eLife.68671