The lingering effects of Neanderthal introgression on human complex traits

  1. Xinzhu Wei
  2. Christopher R Robles
  3. Ali Pazokitoroudi
  4. Andrea Ganna
  5. Alexander Gusev
  6. Arun Durvasula
  7. Steven Gazal
  8. Po-Ru Loh
  9. David Reich
  10. Sriram Sankararaman  Is a corresponding author
  1. Department of Computational Biology, Cornell University, United States
  2. Department of Human Genetics, University of California, Los Angeles, United States
  3. Department of Computer Science, University of California, Los Angeles, United States
  4. Analytical and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, United States
  5. Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, United States
  6. Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, United States
  7. Dana-Farber Cancer Institute, Harvard Medical School, United States
  8. Department of Genetics, Harvard Medical School, United States
  9. Department of Human Evolutionary Biology, Harvard University, United States
  10. Center for Genetic Epidemiology, Department of Public and Population Health Sciences, University of Southern California, United States
  11. Division of Genetics,Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, United States
  12. Howard Hughes Medical Institute, Harvard Medical School, United States
  13. Department of Computational Medicine, University of California, Los Angeles, United States
13 figures and 1 additional file

Figures

Figure 1 with 2 supplements
Benchmarking approaches for estimating the heritability components of Neanderthal introgression.

We group simulations by relationships between minor allele frequency (MAF) and local linkage disequilibrium at an SNP on effect size (MAF-LD coupling): BASELINE, COMMON, RARE, HIGH, LOW. In each …

Figure 1—figure supplement 1
Benchmarking different methods for estimating the total SNP heritability.

We grouped the simulations by the five different MAF-LD coupling: BASELINE, COMMON, RARE, HIGH, LOW, as labeled on top of each column. In each group, there are 12 simulations with different levels …

Figure 1—figure supplement 2
Estimating the heritability components of Neanderthal introgression under a genetic architecture in which rare variants are enriched for phenotypic effects.

We performed simulations under an ULTRA RARE genetic architecture in which SNPs with MAF <1% account for 90% of the causal variants. We plot the distributions of two Z-scores (y-axis), one on each …

Distributions of minor allele frequency (MAF) and LD-score in Neanderthal Informative Mutations (NIMs) and modern human (MH) SNPs.

Empirical cumulative distribution functions of (a) MAF and (b) LD scores of NIMs (in solid green line) and MH SNPs (in pink dashed line) estimated in the UK Biobank (UKBB). (c) Boxplots of MAFs of …

Figure 3 with 1 supplement
Neanderthal Informative Mutation (NIM) heritability in UK Biobank (UKBB) phenotypes.

(a) Estimates of NIM heritability (h2^NIM) and (c) the Z-score of h2^NIM (testing the hypothesis that NIM heritability is positive) for each UKBB phenotype. Analogously, (b) estimates of Δ^h2 and Z-score (d) …

Figure 3—source data 1

UKBB phenotype annotation.

https://cdn.elifesciences.org/articles/80757/elife-80757-fig3-data1-v2.zip
Figure 3—source data 2

RHE-mc results with Ancestry+MAF+LD annotations and NIM PCs included in covariates applied to 96 UKBB phenotypes.

https://cdn.elifesciences.org/articles/80757/elife-80757-fig3-data2-v2.zip
Figure 3—figure supplement 1
Neanderthal Informative Mutation (NIM) heritability in UK Biobank (UKBB) phenotypes after accounting for background selection.

(a) Estimates of NIM heritability (h2^NIM) and (c) the Z-score of h2^NIM (testing the hypothesis that NIM heritability is positive) for each UKBB phenotype. Analogously, (b) estimates of Δ^h2 and Z-score (d) …

Figure 4 with 1 supplement
Comparing heritability analyses with and without controlling for minor allele frequency (MAF) and LD in UK Biobank (UKBB) phenotypes.

Each phenotype is shown with one dot colored by the phenotypic category it belongs to, on the y-axis based on its point estimate and standard error (estimated by RHE-mc with Ancestry annotation) and …

Figure 4—source data 1

RHE-mc results with Ancestry only annotation and NIM PCs included in covariates applied to 96 UKBB phenotypes.

https://cdn.elifesciences.org/articles/80757/elife-80757-fig4-data1-v2.zip
Figure 4—figure supplement 1
Comparing heritability estimates from RHE-mc without controlling for Neanderthal Informative Mutation (NIM) principal components (PCs) with Ancestry + MAF + LD annotation and RHE-mc with Ancestry annotation in UK Biobank (UKBB) phenotypes.

This figure is plotted in the same way as Figure 4. The trend that not controlling for minor allele frequency (MAF) and LD leads to underestimation of (a) total heritability h2^ , (b) NIM …

Fine mapping of Neanderthal Informative Mutations (NIMs) in simulations and the UK Biobank (UKBB).

(a) Fine mapping pipeline to identify NIMs that aims to identify genomic regions at which NIMs are likely to modulate phenotypic variation (credible NIM regions). (b) Comparison of approaches for …

Figure 6 with 3 supplements
Analysis of credible Neanderthal Informative Mutations (NIMs).

(a) Distribution of credible NIMs across the genome. (b) High and moderate impact credible NIMs annotated by SnpEff software (Cingolani et al., 2012). A total of 26 credible NIMs have high (marked …

Figure 6—figure supplement 1
Credible Neanderthal Informative Mutation (NIM) in the FCGR2A gene associated with gamma-glutamyl transferase levels.

Plot of 200 kb region surrounding rs9427397 (marked in black diamond; hg19 coordinates), a credible NIM in FCGR2A that introduces a premature stop codon and is associated with increased levels of …

Figure 6—figure supplement 2
Credible Neanderthal Informative Mutation (NIM) in the AKR1C4 gene is associated with bilirubin levels.

(a) Plot of 300 kb region surrounding rs17134592 (marked in black diamond; hg19 coordinates), a non-synonymous NIM in AKR1C4, that is associated with increased serum bilirubin levels. The plot …

Figure 6—figure supplement 3
Number of unique credible Neanderthal Informative Mutations (NIMs) that are expression quantitative trait loci (eQTLs).

The x-axis lists the tissues where credible NIMs are found as eQTLs, and the y-axis is the number of eQTLs observed in the tissue that is a credible NIM.

Appendix 2—figure 1
Benchmark stratified LDSC regression (S-LDSR) with in-sample and out-of-sample LD scores.

We group the simulations by the MAF-LD coupling: BASELINE, COMMON, RARE, HIGH, LOW, and ALL, as labeled on the x-axis. We plot the distributions of three Z-scores (y-axis), one on each panel: (a) …

Appendix 3—figure 1
Population structure within white British samples.

PC-1 from the whole genome genotypes (released by UK Biobank [UKBB]) is shown on the left, and Neanderthal Informative Mutation (NIM) PC-1 is shown on the right. We used a 20-by-20 grid along the …

Appendix 3—figure 2
Neanderthal Informative Mutation (NIM) heritability in the 96 UK Biobank (UKBB) phenotypes without controlling for NIM principal components (PCs).

This figure is plotted in the same way as Figure 3. Heritability estimates are largely similar, but fewer phenotypes are significant. Three phenotypes have significant positive NIM heritability …

Appendix 5—figure 1
Distributions of minor allele frequency (MAF) and LD-score in Neanderthal Informative Mutations (NIMs) identified by Sprime in all 1KG populations and modern human (MH) SNPs.

Empirical cumulative distribution functions of (a) MAF and (b) LD scores of NIMs identified by Sprime (in solid green line) and MH SNPs (in pink dashed line) estimated in the UK Biobank (UKBB). (c) …

Appendix 5—figure 2
Distributions of minor allele frequency (MAF) and LD-score in Neanderthal Informative Mutations (NIMs) identified by Sprime in 1KG European populations and modern human (MH) SNPs.

Empirical cumulative distribution functions of (a) MAF and (b) LD scores of NIMs identified by Sprime (in solid green line) and MH SNPs (in pink dashed line) estimated in the UK Biobank (UKBB). (c) …

Appendix 5—figure 3
Neanderthal Informative Mutation (NIM) heritability in the 96 UK Biobank (UKBB) phenotypes for the least stringent set of NIMs identified using Sprime by McArthur et al., 2021.

(a) Estimates of NIM heritability (h2^NIM) and (c) the Z-score of h2^NIM (testing the hypothesis that NIM heritability is positive) for each UKBB phenotype. Analogously, (b) estimates of Δ^h2 and Z-score (d) …

Appendix 5—figure 4
Neanderthal Informative Mutation (NIM) heritability in the 96 UK Biobank (UKBB) phenotypes for the most stringent set of NIMs identified using Sprime by McArthur et al., 2021 NIMs.

(a) Estimates of NIM heritability (h2^NIM) and (c) the Z-score of h2^NIM (testing the hypothesis that NIM heritability is positive) for each UKBB phenotype. Analogously, (b) estimates of Δ^h2 and Z-score (d) …

Additional files

Download links