Biological constraints on GWAS SNPs at suggestive significance thresholds reveal additional BMI loci

  1. Reza K Hammond
  2. Matthew C Pahl
  3. Chun Su
  4. Diana L Cousminer
  5. Michelle E Leonard
  6. Sumei Lu
  7. Claudia A Doege
  8. Yadav Wagley
  9. Kenyaita M Hodge
  10. Chiara Lasconi
  11. Matthew E Johnson
  12. James A Pippin
  13. Kurt D Hankenson
  14. Rudolph L Leibel
  15. Alessandra Chesi
  16. Andrew D Wells
  17. Struan FA Grant  Is a corresponding author
  1. Center for Spatial and Functional Genomics, Children’s Hospital of Philadelphia, United States
  2. Division of Human Genetics, The Children’s Hospital of Philadelphia, United States
  3. Naomi Berrie Diabetes Center, Vagelos College of Physicians and Surgeons, Columbia University, United States
  4. Department of Pathology and Cell Biology, Vagelos College of Physicians and Surgeons, Columbia University, United States
  5. Columbia Stem Cell Initiative, Vagelos College of Physicians and Surgeons, Columbia University, United States
  6. Department of Orthopaedic Surgery, University of Michigan Medical School, United States
  7. Division of Molecular Genetics (Pediatrics) and the Naomi Berrie Diabetes Center, Columbia University Vagelos College of Physicians and Surgeons, United States
  8. Department of Pathology, The Children’s Hospital of Philadelphia, United States
  9. Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, United States
  10. University of Pennsylvania, United States
  11. The Children’s Hospital of Philadelphia, United States
8 figures, 2 tables and 2 additional files

Figures

2015 BMI Manhattan plot depicting loci identified with 2010 salvaged SNPs 2015 BMI loci identifiable with 2010 salvaged SNPs.

Cell type where locus was identified indicated below locus name. Color indicates the p-value threshold where the locus became implicated (Locke et al., 2015). Color key: Green – 5×10−8≤p<5×10−7, blue – 5×10−7≤p<5×10−6, orange – 5×10−6≤p<5×10−5, red – 5×10−5≤p<5×10−4.

Independent 2010 BMI SNPs identified via variant-to-gene mapping that go on to reach genome-wide significance by 2015, as well as the set of unconstrained 2010 suggestive SNPs that achieve genome-wide significance by 2015.

Positive predictive value is depicted as a percentage for each bar. Above these percentages, the p-value, as identified through Fisher’s exact test, is posted. These p-values depict the probability that the proportions of salvaged SNPs using variant-to-gene mapping differ from simply salvaging all suggestive SNPs within the same suggestive bin.

Figure 2—source data 1

Number of 2010 loci identified by constrained method and the number that achieved GWS by 2015 in each cell type.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig2-data1-v2.csv
Figure 2—source data 2

Number of 2010 loci identified with no constraint and the number that achieved GWS by 2015.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig2-data2-v2.csv
Flowchart of the pipeline describing each computational step.

BMI 2010–2015 data is utilized here as an example to report the number of SNPs and loci that occur at each step of the analysis.

Figure 4 with 1 supplement
Independent 2010.

BMI SNPs salvaged via variant-to-gene mapping that go on to reach genome-wide significance by 2018, as well as the set of unconstrained 2010 suggestive SNPs that achieve genome-wide significance by 2018. Positive predictive value is depicted for each bar. Above these percentages, the p-value, as identified through Fisher’s exact test, is posted. These p-values depict the probability that the proportions of salvaged SNPs using variant-to-gene mapping differ from simply salvaging all suggestive SNPs within the same suggestive bin.

Figure 4—source data 1

Number of 2010 loci identified by constrained method and the number that achieved GWS by 2018 in each cell type.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig4-data1-v2.csv
Figure 4—source data 2

Number of 2010 loci identified with no constraint and the number that achieved GWS by 2018.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig4-data2-v2.csv
Figure 4—figure supplement 1
Empirical distribution of positive predictive values of suggestive 2010 BMI SNPs achieving GWS by 2018.
Independent 2015.

BMI SNPs salvaged via variant-to-gene mapping that go on to reach genome-wide significance by 2018, as well as the set of unconstrained 2015 suggestive SNPs that achieve genome-wide significance by 2018. Positive predictive value is depicted for each bar. The posterior probability that loci identified by our chromatin-based constraint more often achieve GWS than loci with no constraint is posted above these percentages.

Figure 5—source data 1

Number of 2015 loci identified by constrained method and the number that achieved GWS by 2018 in each cell type.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig5-data1-v2.csv
Figure 5—source data 2

Number of 2015 loci identified with no constraint and the number that achieved GWS by 2018.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig5-data2-v2.csv
Figure 6 with 3 supplements
Independent 2010.

WHRadjBMI SNPs salvaged via variant-to-gene mapping that go on to reach genome-wide significance by 2018, as well as the set of unconstrained 2010 suggestive SNPs that achieve genome-wide significance by 2018. Positive predictive value is depicted for each bar. Above these percentages, the p-value, as identified through Fisher’s exact test, is posted. These p-values depict the probability that the proportions of salvaged SNPs using variant-to-gene mapping differ from simply salvaging all suggestive SNPs within the same suggestive bin.

Figure 6—source data 1

Number of 2010 loci identified by constrained method and the number that achieved GWS by 2018 in each cell type.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig6-data1-v2.csv
Figure 6—source data 2

Number of 2010 loci identified with no constraint and the number that achieved GWS by 2018.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig6-data2-v2.csv
Figure 6—figure supplement 1
Empirical distribution of positive predictive values of suggestive 2010 WHRadjBMI SNPs achieving GWS by 2018.
Figure 6—figure supplement 2
Independent 2010.

WHRadjBMI SNPs salvaged via variant-to-gene mapping that go on to reach genome-wide significance by 2015, as well as the set of unconstrained 2010 suggestive SNPs that achieve genome-wide significance by 2015. Positive predictive value is depicted for each bar. The posterior probability that loci identified by our chromatin-based constraint more often achieve GWS than loci with no constraint is posted above these percentages.

Figure 6—figure supplement 2—source data 1

Number of 2010 loci identified by constrained method and the number that achieved GWS by 2015 in each cell type.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig6-figsupp2-data1-v2.csv
Figure 6—figure supplement 2—source data 2

Number of 2010 loci identified with no constraint and the number that achieved GWS by 2015.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig6-figsupp2-data2-v2.csv
Figure 6—figure supplement 3
Independent 2015.

WHRadjBMI SNPs salvaged via variant-to-gene mapping that go on to reach genome-wide significance by 2018, as well as the set of unconstrained 2015 suggestive SNPs that achieve genome-wide significance by 2018. Positive predictive value is depicted for each bar. The posterior probability that loci identified by our chromatin-based constraint more often achieve GWS than loci with no constraint is posted above these percentages.

Figure 6—figure supplement 3—source data 1

Number of 2015 loci identified by constrained method and the number that achieved GWS by 2018 in each cell type.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig6-figsupp3-data1-v2.csv
Figure 6—figure supplement 3—source data 2

Number of 2015 loci identified with no constraint and the number that achieved GWS by 2018.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig6-figsupp3-data2-v2.csv
Independent 2010.

BMI SNPs failing the variant-to-gene mapping filter that go on to reach genome-wide significance by 2018. Positive predictive value is depicted for each bar. The posterior probability that loci identified by our chromatin-based constraint more often achieve GWS than loci with no constraint is posted above these percentages. There is no threshold where this data differs significantly from the unconstrained set.

Figure 7—source data 1

Number of 2010 loci identified by constrained method and the number that achieved GWS by 2018 in each cell type.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig7-data1-v2.csv
Independent 2010.

WHRadjBMI SNPs failing the variant-to-gene mapping filter that go on to reach genome-wide significance by 2018. Positive predictive value is depicted for each bar. The posterior probability that loci identified by our chromatin-based constraint more often achieve GWS than loci with no constraint is posted above these percentages. There is no threshold where this data differs significantly from the unconstrained set.

Figure 8—source data 1

Number of 2010 loci identified by constrained method and the number that achieved GWS by 2018 in each cell type.

https://cdn.elifesciences.org/articles/62206/elife-62206-fig8-data1-v2.csv

Tables

Table 1
2015 BMI loci that were implicated with our method in the 2010 data set.

The 2015 genome-wide significant BMI loci whose sentinel SNP was in LD with SNPs implicated from suggestive association in the 2010 BMI GWAS. Key: Notable genes from biological relevance to obesity (B); copy number variation (C); DEPICT analyses (D); GRAIL results (G); BMI-associated variant is in strong LD (r2 ≥ 0.8) with a missense variant in the indicated gene (M); gene nearest to index SNP (N); association and eQTL data converge to affect gene expression (Q) (Locke et al., 2015).

Novel as of 2015 (Locke et al.)
2015 sentinel SNP2010 implicated SNPs2015 assigned locus nameInteracting gene
rs4740619rs10810462C9orf93(C,M,N)TCONS_00015651
rs17094222rs117597828HIF1AN(N)PAX2
rs2820292rs12086240, rs2820315NAV1(N)TIMM17A
rs758747rs2238435NLRC3(N)TCONS_00024950TCONS_00024564TCONS_00024949TCONS_00024562RP11-462G12.1TCONS_00024568TCONS_00024570RP11-95P2.1TCONS_00024567TCONS_00024569TCONS_00024320TCONS_00024952
rs3736485rs7183479SCG3(B,D); DMXL2(M,N)LYSMD2SCG3, CTD-2308G16.1TMOD2
Identified between 2010 (Speliotes et al.) and 2015 (Locke et al.)
2015 Sentinel SNP2010 Implicated SNPs2015 Assigned locus nameInteracting gene
rs17024393rs72705210GNAT2(N); AMPD2(D)GSTM3, AHCYL1
rs4256980rs10840079, rs10840087, rs11041999, rs11042023, rs12803166, rs4256980,TRIM66(D,M,N); TUB(B)PBLD, RPL27A, TRIM66
Identified in 2010 (Speliotes et al.), but was not genome-wide significant
2015 Sentinel SNP2010 Implicated SNPs2015 Assigned Locus NameInteracting gene
rs10182181rs12713419, rs13012304, rs6718510, rs7597332, rs7608976ADCY3(B,M,N,Q); POMC(B,G); NCOA1(B); SH2B1(B,M,Q); APOBR(M,Q);ADCY3, TCONS_00003602
rs12016871rs7988412,MTIF3(N); GTF3A(Q) *~1 Mb from sentinelMTIF3
rs1808579rs1788783NPC1(B,G,M,Q); C18orf8(N,Q)NPC1
rs2287019rs11672660, rs34783010QPCTL(N); GIPR(B,M)GIPR
Table 2
2018 BMI loci that were identified using 2010 salvaged.

SNPs 2018 BMI loci identified as genes nearest to genome-wide significant SNPs that could be identified using SNPs salvaged from suggestive regions of the 2010 BMI GWAS.

Nearest gene to sentinelSurviving proxy SNPsLowest threshold found
ABHD17Ars893543, rs893542, rs116713475 × 10−4
AC007879.5rs11677847, rs72951700, rs11689163, rs72966483, rs11694560, rs11692026, rs964621, rs9646225 × 10−4
ADCY3rs6718510, rs7597332, rs7608976, rs13012304, rs127134195 × 10−4
ADCY9rs710893, rs2531993, rs22384355 × 10−5
AK5rs127299145 × 10−5
AP000439.5rs116057295 × 10−4
BCL7Ars72998425 × 10−4
C10orf32rs70851045 × 10−4
C18orf8rs17888265 × 10−4
C1orf61rs112644835 × 10−4
CCDC171rs108104625 × 10−4
CNNM2rs19260325 × 10−4
COQ4rs14686485 × 10−4
CRTC1rs4808845, rs48088445 × 10−4
DPYDrs120774425 × 10−4
EIF2B5rs3914188, rs356374225 × 10−4
EXOSC10rs1884429, rs120417405 × 10−4
FAIM2rs4220225 × 10−4
GAB2rs8692025 × 10−4
GIPRrs34783010, rs116726605 × 10−7
GPR61rs727052105 × 10−4
HIF1ANrs1175978285 × 10−4
HOXB1rs23260135 × 10−4
IFNGR1rs172587505 × 10−4
IPO9rs28203155 × 10−5
KCNJ12rs99060725 × 10−4
LMOD1rs20472645 × 10−4
MAP2K3rs2001651, rs37855425 × 10−4
MAP3K7CLrs9282775 × 10−4
MEF2Drs2274319, rs1925950, rs12038396, rs3818463, rs2274320, rs22743175 × 10−4
MLNrs11752353, rs6921487, rs72880511, rs1887340, rs737465095 × 10−4
MLXIPrs28530689, rs10773037, rs28737311, rs36158849, rs22805735 × 10−4
MST1Rrs3774758, rs2252833, rs64461875 × 10^−4
MTIF3rs79884125 × 10−7
MTORrs11581010, rs108644905 × 10−4
NAV1rs120862405 × 10−5
NPC1rs17887835 × 10−4
RASA2rs20428645 × 10−4
RCAN2rs39343935 × 10−5
RNU6-543Prs107616895 × 10−4
RP11-493K19.3rs131009035 × 10−4
RP11-562L8.1rs128876365 × 10−5
RP11-68I18.10rs107888005 × 10−5
RP11-707P17.1rs71834795 × 10−4
SAE1rs4664775 × 10−4
SKAP1rs16951519, rs22401215 × 10−5
STK33rs10840087, rs11041999, rs340099215 × 10−4
TNRC6Brs6001834, rs48204095 × 10−4
TRIM66rs10840079, rs4256980, rs11042023, rs128031665 × 10−6
TTC34rs64240625 × 10−5
URM1rs7859557, rs22409485 × 10−4
XXYLT1rs584349655 × 10−4

Additional files

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Reza K Hammond
  2. Matthew C Pahl
  3. Chun Su
  4. Diana L Cousminer
  5. Michelle E Leonard
  6. Sumei Lu
  7. Claudia A Doege
  8. Yadav Wagley
  9. Kenyaita M Hodge
  10. Chiara Lasconi
  11. Matthew E Johnson
  12. James A Pippin
  13. Kurt D Hankenson
  14. Rudolph L Leibel
  15. Alessandra Chesi
  16. Andrew D Wells
  17. Struan FA Grant
(2021)
Biological constraints on GWAS SNPs at suggestive significance thresholds reveal additional BMI loci
eLife 10:e62206.
https://doi.org/10.7554/eLife.62206