Continued methodological advances have enabled numerous statistical approaches for the analysis of summary statistics from genome-wide association studies. Genetic correlation analysis within specific regions enables a new strategy for identifying pleiotropy. Genomic regions with significant ‘local’ genetic correlations can be investigated further using state-of-the-art methodologies for statistical fine-mapping and variant colocalisation. We explored the utility of a genome-wide local genetic correlation analysis approach for identifying genetic overlaps between the candidate neuropsychiatric disorders, Alzheimer’s disease, amyotrophic lateral sclerosis, frontotemporal dementia, Parkinson’s disease, and schizophrenia. The correlation analysis identified several associations between traits, the majority of which were loci in the human leukocyte antigen (HLA) region. Colocalisation analysis suggested the presence of a shared causal variant between amyotrophic lateral sclerosis and Alzheimer’s disease in this region. Our study identified candidate loci that might play a role in multiple neuropsychiatric diseases and suggested that disease-implicated variants in these loci often differ between traits. Accordingly, this suggests the role of distinct mechanisms across diseases despite shared loci. The fine-mapping and colocalisation analysis protocol designed for this study has been implemented in a flexible analysis pipeline that produces HTML reports and is available at: https://github.com/ThomasPSpargo/COLOC-reporter.
This paper presents a valuable pipeline based on state-of-the-art analytical software that was used to study genetic pleiotropy between neuropsychiatric disorders. The presented evidence supporting the claims is solid, although the paper is lacking an appropriate comparison to previously published methods as well as a more detailed exploration of some of the findings. The created pipeline is made publicly available and can thus be used by researchers from diverse fields to study different combinations of diseases and traits.
The genetic spectrum of neuropsychiatric disease is diverse and various overlaps exist between traits. For instance, genetic pleiotropy between amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) is increasingly recognised, and ALS is genetically correlated with Alzheimer’s disease (AD), Parkinson’s disease (PD), and schizophrenia1-3. Improving understanding of the genetic architecture underlying these complex diseases could facilitate future treatment discovery.
Advances in genomic research techniques have accelerated discovery of genetic variation associated with complex traits. Genome-wide association studies (GWAS), in particular, have enabled population-scale investigations of the genetic basis of human diseases and anthropometric measures4. Summary-level results from GWAS are being shared alongside publications with increasing frequency over time5, and a breadth of approaches now exist for downstream analysis based on summary statistics which can enable their interpretation and provide further biological insight.
Genetic correlation analysis allows estimation of genetic overlap between traits6-9. A ‘global’ genetic correlation approach gives a genome-wide average estimate of this overlap However, genetic relationships between traits can be obscured when correlations in opposing directions cancel out genome-wide8. Recent methods allow for a more nuanced analysis, of ‘local’ genetic correlations partitioned across the genome8,9. This stratified approach to genome-wide analysis could prove effective for identifying pleiotropic regions and designing subsequent analyses aiming to identify genetic variation shared between traits.
A number of methods aim to disentangle causality within associated regions. This is important because the focus on single nucleotide polymorphisms (SNPs), which are markers of genetic variation, in GWAS produces results that can be difficult to interpret, and causal variants are typically unclear. More so, because of linkage disequilibrium (LD), GWAS associations often comprise large sets of highly correlated SNPs spanning large genomic regions. Statistical fine-mapping is a common approach for dissecting complex LD structures and finding variants with implications for a given trait among the tens or hundreds that might be associated in the region10.
Interpretation of regions associated with multiple traits can also be challenging, since it is often unclear whether these overlaps are driven by the same causal variant. Statistical colocalisation analysis can disentangle association signals across traits to suggest whether the overlaps result from shared or distinct causal genetic factors11-13. Traditionally this analysis was restricted by the assumption of at most one causal variant for each trait in the region. However, recent extensions to the method now permit analysis based on univariate fine-mapping results for the traits compared and, therefore, analysis of regions with multiple causal variants.
Accordingly, we conducted genome-wide local genetic correlation analysis across 5 neuropsychiatric traits with recognised phenotypic and genetic overlap2,3,14-16: AD, ALS, FTD, PD, and schizophrenia. Loci highly correlated between trait pairs were further investigated with univariate fine-mapping and bivariate colocalisation techniques to examine variants driving these associations.
2.1. Sampled GWAS summary statistics
We leveraged publicly-accessible summary statistics from European ancestry GWAS meta-analyses of risk for AD17, ALS1, FTD18, PD19, and schizophrenia20. European ancestry data were selected to avoid LD mismatch between the GWAS sample and reference data from an external European population.
Figure 1 summarises the analysis protocol for this study; further details are provided below.
2.2.1. Processing of GWAS summary statistics
A standard data cleaning protocol was applied to each set of summary statistics21. We retained only single nucleotide polymorphisms (SNPs), excluding any non-SNP or strand-ambiguous variants. SNPs were filtered to those present within the 1000 Genomes phase 3 (1KG) European ancestry population reference dataset22 (N = 503). They were matched to the 1KG reference panel by GRCh37 chromosomal position using bigsnpr (version 1.11.6)23, harmonising allele order with the reference and assigning SNP IDs.
If not reported, and where possible, effective sample size (Neff) was calculated from per-SNP case and control sample sizes. When this could not be determined per-SNP, all variants were assigned a single Neff, calculated as a sum of Neff values for each cohort within the GWAS meta-analysis24.
Further processing was performed where possible, excluding SNPs with imputation INFO <0.9, p-values ≤0 or >1, and Neff >3 standard deviations from the median Neff. We filtered to include only variants with minor allele frequency (MAF) ≥0.005 in both the reference and GWAS samples and excluded SNPs with an absolute MAF difference of >0.2 between the two.
2.2.2. Genome-wide analyses
220.127.116.11. Global heritability and genetic correlations
LDSC (version 1.0.1)6,7 was applied to estimate genome-wide univariate heritability (h2) for each trait on the liability scale. The software was also applied to derive ‘global’ (i.e., genome-wide) genetic correlation estimates between trait pairs and estimate sample overlap from the bivariate intercept.
These analyses were performed using the HapMap325 SNPs and the LD score files provided with the software, calculated in the 1KG European population. No further MAF filter was applied (therefore variants with MAF >0.005 were included) and the other settings were left to their defaults.
18.104.22.168. Local genetic correlation analysis
LAVA (version 0.1.0)8 was applied to obtain local genetic correlation estimates across 2495 approximately independent blocks delineating the genome, based on patterns in LD. We used the blocks provided alongside the LAVA software which were derived from the 1KG European cohort. Bivariate intercepts from LDSC were provided to LAVA to estimate sample overlap between trait pairs.
In accordance with prior studies, genetic correlation analysis was performed following an initial filtering step. Univariate heritability was estimated for each genomic block across SNPs in-common between a pair of traits, and only loci with local h2 p-values below a threshold of 2.004×10−5 (0.05/2495) in both traits continued to the bivariate analysis. This step ensures that univariate heritability is sufficient in both traits for a robust correlation estimate.
2.2.3. Targeted genetic analyses
22.214.171.124. Fine-mapping and colocalisation analysis
Statistical fine-mapping and colocalisation techniques were applied to further analyse associations between trait pairs in regions where the false discovery rate (FDR) adjusted p-value of local genetic correlation analysis was below 0.05 (after adjusting for all bivariate comparisons performed). Additional analysis was conducted at loci where significant correlations occurred between two trait pairs but not between the final pairwise comparison across the three implicated traits.
Fine-mapping was performed with susieR (v0.12.27)10,26, which implements the ‘sum of single effects’ (SuSiE) model to represent statistical evidence of causal genetic variation within ‘credible sets’ and per-SNP posterior inclusion probabilities (PIPs). A 95% credible set indicates 95% certainty that at least one SNP included within the set has a causal association with the phenotype and higher PIPs indicate a greater posterior probability of being a causal variant within a credible set. Multiple credible sets are identified when the data suggest more than one independent causal signal.
Colocalisation analysis was implemented with coloc (v126.96.36.199)11,12,27, which calculates posterior probabilities that a causal variant exists for neither, one, or both of two compared traits, testing also whether evidence for a causal variant in both traits suggests a shared variant (i.e., hypothesis 4 (H4); colocalisation) or independent signals (Hypothesis 3 (H3)). Colocalisation analyses can be performed across all variants sampled in a region, under an assumption of at most one variant implicated per trait. It can also be performed using variants attributed to pairs of credible sets from SuSiE, relaxing the single variant assumption11. When evidence of a shared variant is found, the individual SNPs with the highest posterior probability of being that variant can be assessed. With a 95% confidence threshold, these are termed 95% credible SNPs.
Briefly, in this workflow (see Figure 1), GWAS summary statistics are harmonised across analysed traits for a specified genomic region, including only variants in common between them and available within a reference population. An LD correlation matrix across sampled variants is derived from a reference population using PLINK (v1.90)29,30.
Quality control is performed per-dataset prior to univariate fine-mapping analysis. Diagnostic tools provided with susieR are applied to test for consistency between the LD matrix and Z-scores from the GWAS and identify variants with a potential ‘allele flip’ (reversed effect estimate encoding) that can impact fine-mapping.
Fine-mapping is performed for each dataset with the coloc package runsusie function, which wraps around susie_rss from susieR and is configured to facilitate subsequent colocalisation analysis. Sample size (Neff for binary traits) is specified as the median for SNPs analysed. Colocalisation analysis can be performed with the coloc functions coloc.abf and coloc.susie when fine-mapping yields at least one credible set for both traits and otherwise using coloc.abf only. Genes located near credible sets from fine-mapping and credible SNPs from colocalisation analyses are identified via Ensembl and biomaRt (v2.54.0)31-33.
Analysis parameters can be adjusted by the user in accordance with their needs. Various utilities are included to help interpretation of fine-mapping and colocalisation results, including identification of genes nearby to putatively causal signals, HTML reports to summarise completed analyses, and figures to visualise the results and compare the examined traits.
In this study, LD correlation matrices were derived from the 1KG European cohort. SNPs flagged for potential allele flip issues in either of the compared traits were removed from the analysis. Fine-mapping was performed with the susie_rss refine=TRUE option to avoid local maxima during convergence of the algorithm, leaving the other settings to the runsusie defaults. Colocalisation analysis was performed using the default priors for coloc.susie (P1=1×10−4, P2=1×10−4, P12=5×10−6).
Colocalisation and fine-mapping analyses were performed initially using the genomic blocks defined by LAVA, since these aim to define relatively independent LD partitions across the genome8. If a 95% credible set could not be identified in one or both traits, we inspected local Manhattan plots for the region to determine whether potentially relevant signals occurred around the region boundaries. The analysis was repeated with a ±10Kb window around the LAVA-defined genomic region if p-values for SNPs at the edge of the block were p<1×10−4 for both traits and the Manhattan plots were suggestive of a ‘peak’ not represented within the original boundaries.
3.1. Genome-wide analyses
Descriptive information and heritability estimates for the sampled traits and GWAS are presented in Table 1. ALS had nominally significant global genetic correlations with schizophrenia (p = 0.045), PD (p = 0.013), and AD (p = 0.006); no other bivariate genome-wide correlations were statistically significant (see Figure S1).
A total of 605 local genetic correlation analyses were performed across all trait pairs in genomic regions where both traits passed the univariate heritability filtering step after restricting to SNPs sampled in both GWAS (see Table 2; Figure 2; Table S1). The number of loci passing to bivariate analysis varied greatly across trait pairs and was congruent with the genome-wide heritability estimates (and their uncertainty) for each trait, reflecting differences in phenotypic variance explained by measured genetic variants and statistical power for each GWAS (see Table 1).
Twenty-six bivariate comparisons were significant following FDR adjustment (pfdr <0.05), two of which also passed the stringent Bonferroni threshold (p <8.26×10−5; 0.05/605). While some regions included genome-wide significant SNPs (p <5×10−8) for one or both traits, others occurred in regions where GWAS associations were weaker (see Table 2). Five of these associations occurred at loci within the human leukocyte antigen (HLA) region (GRCh37: Chr6:28.48-33.45Mb; 6p22.1-21.340), and all five traits were implicated in at least one of these.
3.2. Targeted genetic analyses
Univariate fine-mapping and bivariate colocalisation analyses were subsequently performed to test for variants jointly implicated between trait pairs in regions with local genetic correlation Pfdr <0.05. The ALS and schizophrenia trait pair was additionally examined at Chr6:32.22-32.45Mb because significant genetic correlations were found between ALS and FTD and between schizophrenia and FTD at this locus. The correlation between ALS and schizophrenia at this locus had not been analysed owing to insufficient univariate heritability for ALS after restricting to SNPs in common with the schizophrenia GWAS.
Fine-mapping identified at least one 95% credible set for each of the compared traits for 7 of the 27 comparisons performed (see Table 3), and for one trait only in a further 5 (see Table S2; Table S3). This analysis suggested two credible sets for schizophrenia in the Chr12:56.99-58.75Mb locus, for AD in Chr6:32.45-32.54Mb, and (only when harmonised to SNPs in common with the ALS GWAS) for FTD in Chr6:32.22-32.45Mb (see Table S3).
Colocalisation analyses performed across fine-mapping credible sets and across all SNPs in a region generally gave support to the equivalent hypothesis (Table 3; Table S2). Moreover, comparisons suggesting a signal was present in one trait only were largely concordant with the identification of fine-mapping credible sets in only that trait (Table S2). Figure S2 compares per-SNP p-values across trait pairs for comparisons with evidence of a relevant signal in both traits. Figure S3 shows patterns of LD across SNPs assigned to credible sets for these analyses.
Strong evidence was found for a shared variant between ALS and AD within the HLA region (Posterior probability of shared variant = 0.9; see Figure 3). The 95% credible SNPs for this association were distributed around the MTCO3P1 pseudogene and rs9275477, the lead genome-wide significant SNP from the ALS GWAS in this region, had the highest posterior probability of being implicated in both traits. Figure S4 presents sensitivity analysis showing that the result is robust to a range of values for the shared variant hypothesis prior probability.
The other comparisons that found fine-mapping credible sets in both traits suggested that overlaps from the correlation analysis were driven by distinct causal variants (see Table 3,Table S2).
Univariate fine-mapping of PD and schizophrenia at Chr17:43.46-44.87Mb found large credible sets spanning many genes, including MAPT41-44 and CRHR145,46 which have been previously implicated in the traits we have analysed. These expansive credible sets reflect the strong LD in the region and indicate a signal that is difficult to localise (see Figure S3(F); Table S3). The colocalisation analysis suggested independent variants for each trait despite many SNPs overlapping across their respective credible sets (see Figure S3). Sensitivity analysis showed robust support for the two independent variants hypothesis across shared-variant hypothesis priors (Figure S4).
We examined genetic overlaps between the neuropsychiatric conditions Alzheimer’s disease, amyotrophic lateral sclerosis, frontotemporal dementia, Parkinson’s disease, and schizophrenia. Associated genomic regions between pairs of traits were identified with local genetic correlation analysis and further analysed with statistical fine-mapping and colocalisation techniques.
Significant correlations were most frequent across genomic blocks within the HLA region, implicating each of the studied traits in at least one comparison. Several associated regions contained genes with known relevance for the traits studied, such as KIF5A, MAPT, and CRHR1. Colocalisation analysis found strong evidence for a shared genetic variant between ALS and AD in the Chr6:32.62-32.68Mb locus within HLA, while the other colocalisation analyses suggested causal signals distinct across traits, for one trait only, or for neither trait.
The tendency for association between traits around the HLA region is reasonable, since this is a known hotspot for pleiotropy8,47. HLA is particularly known for its role in immune response and it is implicated in various types of disease48,49. Mounting evidence has linked HLA and associated genetic variation to the traits we have analysed, and mechanisms underlying these associations are beginning to be understood48-57. For instance, AD is associated with variants around the HLA-DQA1 and HLA-DRB1 genes and several SNPs in the non-coding region between them have been shown to modulate their expression58. Notably, one of the SNPs with a demonstrated regulatory role, rs9271247, had the highest probability of being causal for AD across the 95% credible set identified in the fine-mapping of the region.
Variants showing evidence for colocalisation between AD and ALS were distributed around the MTCO3P1 pseudogene in the HLA class II non-coding region between HLA-DQB1 and HLA-DQB2. MTCO3P1 has been previously identified as one of the most pleiotropic genes in the GWAS catalog59,60. Previous studies have suggested the relevance of this region in both traits. HLA-DQB1 and HLA-DQB2 are both upregulated in the spinal cord of people with ALS, alongside other genes implicated in various immunological processes for antigen processing and inflammatory response61. HLA class II complexes, and their subcomponents, have been identified as upregulated in multiple brain regions of people with AD, using both gene and protein expression techniques57,62.
Our analysis of this region gave stronger support for colocalisation between the ALS and AD GWAS than a previous study. The previous study defined a 100Kb window around the lead genome-wide significant SNP from the ALS GWAS, rs9275477, and found ∼0.50 posterior probability for each of the shared and two independent variant(s) hypotheses1. The difference between these studies reflects differences in the processing of GWAS data; in this study all summary statistics underwent quality control to ensure only high-quality variants were retained.
More broadly, our analyses suggest that regions with a strong genetic correlation between the five traits studied often result from adjacent but trait-specific signals, likely reflecting overlaps between LD blocks47. Correlations also occurred in regions with weaker overall GWAS associations (see Table 2), where fine-mapping and colocalisation analyses did not suggest causal associations in one or either trait. Such patterns likely reflect a shared polygenic trend across the region, rather than associations attributable to discrete variants. Accordingly, other approaches may be better suited for identifying regions containing genetic variation jointly causal across diseases, including the traditional approach of testing regions around overlapping genome-wide significant variants.
This study has used gold-standard statistical tools to examine genetic relationships between traits. The local genetic correlation analysis approach enabled targeted investigation of genomic regions which appear to overlap between traits. The application of colocalisation analysis alongside a prior univariate fine-mapping step allowed for associations to be tested without conflating independent but nearby signals under the single-variant assumption of colocalisation analysis across all variants sampled in a region.
The study is not without limitation. We necessarily used the 1KG European reference population to estimate LD between SNPs. Fine-mapping is ideally performed with an LD matrix from the GWAS sample and is sensitive to misspecification when inconsistencies in LD occur between the reference and GWAS cohorts. Use of a reference population is not uncommon, and diagnostic tools available within the susieR package allow testing for inconsistencies between the reference and GWAS samples10. We accordingly implemented these tools centrally into our workflow and determined that the LD matrices from the 1KG reference were suitable for the data (estimates of Z-score and LD consistency are available in Table S3). Nevertheless, repeating this study in under-represented populations would be an important future step to validate our findings.
We employed statistical methods to identify and analyse genomic regions containing variants which might be jointly implicated across traits. These approaches provide useful associations between traits identified from large-scale genomic datasets. However, they alone are not sufficient for translation into clinical practice. Future studies should aim to extend any associations found by integrating functional and multi-omics datasets to gain mechanistic insights into observed trends and facilitate treatment discovery58,63.
The fine-mapping and colocalisation analysis pipeline we have used is available as an open-access resource on GitHub to facilitate the application of these methods in future studies: https://github.com/ThomasPSpargo/COLOC-reporter. Specified genomic regions can be readily analysed by providing GWAS summary statistics for binary or quantitative traits of interest and a population-appropriate reference dataset for estimation of LD. The pipeline returns resources including detailed reports that overview the analyses performed.
All data are publicly available
This project was part funded by the MND Association and the Wellcome Trust. This is an EU Joint Programme-Neurodegenerative Disease Research (JPND) project. The project is supported through the following funding organizations under the aegis of JPND– http://www.neurodegenerationresearch.eu/ [United Kingdom, Medical Research Council (MR/L501529/1 and MR/R024804/1) and Economic and Social Research Council (ES/L008238/1)]. AAC is a NIHR Senior Investigator. AAC received salary support from the National Institute for Health Research (NIHR) Dementia Biomedical Research Unit at South London and Maudsley NHS Foundation Trust and King’s College London. The work leading up to this publication was funded by the European Community’s Health Seventh Framework Program (FP7/2007–2013; grant agreement number 259867) and Horizon 2020 Program (H2020-PHC-2014-two-stage; grant agreement number 633413). This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Programme (grant agreement no. 772376–EScORIAL. This study represents independent research part funded by the NIHR Maudsley Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR, King’s College London, or the Department of Health and Social Care. Funding was also provided by the King’s College London DRIVE-Health Centre for Doctoral Training and the Perron Institute for Neurological and Translational Science. AI is funded by South London and Maudsley NHS Foundation Trust, MND Scotland, Motor Neurone Disease Association, National Institute for Health and Care Research, Spastic Paraplegia Foundation, Rosetrees Trust, Darby Rimmer MND Foundation, the Medical Research Council (UKRI) and Alzheimer’s Research UK. OP is supported by a Sir Henry Wellcome Postdoctoral Fellowship [222811/Z/21/Z]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
This research was funded in whole or in part by the Wellcome Trust [222811/Z/21/Z]. For the purpose of open access, the author has applied a CC-BY public copyright licence to any author accepted manuscript version arising from this submission.
The authors acknowledge the use of the CREATE research computing facility at King’s College London64. We also acknowledge Health Data Research UK, which is funded by the UK Medical Research Council, Engineering and Physical Sciences Research Council, Economic and Social Research Council, Department of Health and Social Care (United Kingdom), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation and Wellcome Trust.
- 1.Common and rare variant association analyses in amyotrophic lateral sclerosis identify 15 risk loci with distinct genetic architectures and neuron-specific biologyNat Genet 53:1636–48https://doi.org/10.1038/s41588-021-00973-1
- 2.Overlapping Genetic Architecture Between Schizophrenia and Neurodegenerative DisordersFront Cell Dev Biol 9https://doi.org/10.3389/fcell.2021.797072
- 3.Multifaceted Genes in Amyotrophic Lateral Sclerosis-Frontotemporal DementiaFront Neurosci 14https://doi.org/10.3389/fnins.2020.00684
- 4.15 years of GWAS discovery: Realizing the promiseAm J Hum Genet 110:179–94https://doi.org/10.1016/j.ajhg.2022.12.011
- 5.Sharing GWAS summary statistics results in more citationsCommun Biol 6https://doi.org/10.1038/s42003-023-04497-8
- 6.LD Score regression distinguishes confounding from polygenicity in genome-wide association studiesNat Genet 47:291–5https://doi.org/10.1038/ng.3211
- 7.An atlas of genetic correlations across human diseases and traitsNat Genet 47:1236–41https://doi.org/10.1038/ng.3406
- 8.An integrated framework for local genetic correlation analysisNat Genet 54:274–82https://doi.org/10.1038/s41588-022-01017-y
- 9.SUPERGNOVA: local genetic correlation analysis reveals heterogeneous etiologic sharing of complex traitsGenome Biol 22https://doi.org/10.1186/s13059-021-02478-w
- 10.Fine-mapping from summary data with the “Sum of Single Effects” modelPLoS Genet 18https://doi.org/10.1371/journal.pgen.1010299
- 11.A more accurate method for colocalisation analysis allowing for multiple causal variantsPLoS Genet 17https://doi.org/10.1371/journal.pgen.1009440
- 12.A Bayesian framework for multiple trait colocalization from summary association statisticsBioinformatics 34:2538–45https://doi.org/10.1093/bioinformatics/bty147
- 13.A fast and efficient colocalization algorithm for identifying shared genetic risk factors across multiple traitsNat Commun 12https://doi.org/10.1038/s41467-020-20885-8
- 14.The Neuropsychiatry of Parkinson Disease: A Perfect StormAm J Geriatr Psychiatry 27:998–1018https://doi.org/10.1016/j.jagp.2019.03.002
- 15.Genetic architecture of sporadic frontotemporal dementia and overlap with Alzheimer’s and Parkinson’s diseasesJ Neurol Neurosurg Psychiatry 88:152–64https://doi.org/10.1136/jnnp-2016-314411
- 16.Large C9orf72 hexanucleotide repeat expansions are seen in multiple neurodegenerative syndromes and are more frequent than expected in the UK populationAm J Hum Genet 92:345–53https://doi.org/10.1016/j.ajhg.2013.01.011
- 17.Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processingNat Genet 51:414–30https://doi.org/10.1038/s41588-019-0358-2
- 18.Frontotemporal dementia and its subtypes: a genome-wide association studyLancet Neurol 13:686–99https://doi.org/10.1016/S1474-4422(14)70065-1
- 19.Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: a meta-analysis of genome-wide association studiesLancet Neurol 18:1091–102https://doi.org/10.1016/S1474-4422(19)30320-5
- 20.Mapping genomic loci implicates genes and synaptic biology in schizophreniaNature 604:502–8https://doi.org/10.1038/s41586-022-04434-5
- 21.Evaluation of polygenic prediction methodology within a reference-standardized frameworkPLoS Genet 17https://doi.org/10.1371/journal.pgen.1009021
- 22.A global reference for human genetic variationNature 526:68–74https://doi.org/10.1038/nature15393
- 23.Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnprBioinformatics 34:2781–7https://doi.org/10.1093/bioinformatics/bty185
- 24.Pervasive Downward Bias in Estimates of Liability-Scale Heritability in Genome-wide Association Study Meta-analysis: A Simple Solutionhttps://doi.org/10.1016/j.biopsych.2022.05.029
- 25.Integrating common and rare genetic variation in diverse human populationsNature 467:52–8https://doi.org/10.1038/nature09298
- 26.A simple new approach to variable selection in regression, with application to genetic fine mappingJournal of the Royal Statistical Society: Series B (Statistical Methodology) 82:1273–300https://doi.org/10.1111/rssb.12388
- 27.Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary StatisticsPLoS Genet 10https://doi.org/10.1371/journal.pgen.1004383
- 28.R: A language and environment for statistical computing
- 29.PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage AnalysesAm J Hum Genet 81:559–75https://doi.org/10.1086/519795
- 30.Purcell S. PLINK. 1.9.0. Available from: http://pngu.mgh.harvard.edu/purcell/plink/
- 31.BioMart and Bioconductor: a powerful link between biological databases and microarray data analysisBioinformatics 21:3439–40https://doi.org/10.1093/bioinformatics/bti525
- 32.Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRtNat Protoc 4:1184–91https://doi.org/10.1038/nprot.2009.97
- 33.Ensembl 2022Nucleic Acids Res 50:D988–D95https://doi.org/10.1093/nar/gkab1049
- 34.Gender and incidence of dementia in the Framingham Heart Study from mid-adult lifeAlzheimers Dement 11:310–20https://doi.org/10.1016/j.jalz.2013.10.005
- 35.Incidence and lifetime risk of motor neuron disease in the United Kingdom: a population-based studyEur J Neurol 16:745–51https://doi.org/10.1111/j.1468-1331.2009.02586.x
- 36.Amyotrophic lateral sclerosis in an urban setting: A population based study of inner city LondonJ Neurol 253:1642–3https://doi.org/10.1007/s00415-006-0195-y
- 37.Prevalence, characteristics, and survival of frontotemporal lobar degeneration syndromesNeurology 86https://doi.org/10.1212/WNL.0000000000002638
- 38.Parkinson’s UK. The Incidence and Prevalence of Parkinson’s in the UK: Results from the Clinical Practice Research Datalink Reference Report. 2017. Available from: https://www.parkinsons.org.uk/professionals/resources/incidence-and-prevalence-parkinsons-uk-report
- 39.A Systematic Review of the Prevalence of SchizophreniaPLoS Med 2https://doi.org/10.1371/journal.pmed.0020141
- 40.Genome Reference Consortium. Human Genome Region MHC [cited 2023 7th March]. Available from: https://www.ncbi.nlm.nih.gov/grc/human/regions/MHC?asm=GRCh37.
- 41.Association of MAPT haplotypes with Alzheimer’s disease risk and MAPT brain gene expression levelsAlzheimers Res Ther 6https://doi.org/10.1186/alzrt268
- 42.Distinct clinical and pathological phenotypes in frontotemporal dementia associated with MAPT, PGRN and C9orf72 mutationsAmyotroph Lateral Scler Frontotemporal Degener 16:497–505https://doi.org/10.3109/21678421.2015.1074700
- 43.Role of MAPT in Pure Motor Neuron Disease: Report of a Recurrent Mutation in Italian PatientsNeurodegener Dis 18:310–4https://doi.org/10.1159/000497820
- 44.Tau aggregation and seeding analyses of two novel MAPT variants found in patients with motor neuron disease and progressive parkinsonismhttps://doi.org/10.1016/j.neurobiolaging.2019.02.016
- 45.Identifying Risk Genes and Interpreting Pathogenesis for Parkinson’s Disease by a Multiomics AnalysisGenes (Basel) 11https://doi.org/10.3390/genes11091100
- 46.Genome-Wide Association Studies of Schizophrenia and Bipolar Disorder in a Diverse Cohort of US VeteransSchizophr Bull 47:517–29https://doi.org/10.1093/schbul/sbaa133
- 47.A global overview of pleiotropy and genetic architecture in complex traitsNat Genet 51:1339–48https://doi.org/10.1038/s41588-019-0481-0
- 48.HLA variation and diseaseNature Reviews Immunology 18:325–39https://doi.org/10.1038/nri.2017.143
- 49.Major Histocompatibility Complex Genomics and Human DiseaseAnnu Rev Genom Hum 14:301–23https://doi.org/10.1146/annurev-genom-091212-153455
- 50.HLA in Alzheimer’s Disease: Genetic Association and Possible Pathogenic RolesNeuromolecular Med 22:464–73https://doi.org/10.1007/s12017-020-08612-4
- 51.Major histocompatibility complex class I molecules protect motor neurons from astrocyte-induced toxicity in amyotrophic lateral sclerosisNat Med 22:397–403https://doi.org/10.1038/nm.4052
- 52.Fine mapping of the HLA locus in Parkinson’s disease in EuropeansNPJ Parkinson’s Dis 7https://doi.org/10.1038/s41531-021-00231-5
- 53.Immune-related genetic enrichment in frontotemporal dementia: An analysis of genome-wide association studiesPLoS Med 15https://doi.org/10.1371/journal.pmed.1002487
- 54.Frontotemporal dementia: insights into the biological underpinnings of disease through gene co-expression network analysisMol Neurodegener 11https://doi.org/10.1186/s13024-016-0085-4
- 55.Psychosis: an autoimmune disease?Immunology 152:388–401https://doi.org/10.1111/imm.12795
- 56.The Major Histocompatibility Complex (MHC) in Schizophrenia: A ReviewJ Clin Cell Immunol 7https://doi.org/10.4172/2155-9899.1000479
- 57.Immunogenetic Factors of Neurodegenerative Diseases: The Role of HLA Class IIBiochemistry (Moscow) 83:1104–16https://doi.org/10.1134/S0006297918090122
- 58.Regulation of the Late Onset Alzheimer’s Disease Associated HLA-DQA1/DRB1 ExpressionAm J Alzheimers Dis Other Demen 37https://doi.org/10.1177/15333175221085066
- 59.The ubiquity of pleiotropy in human diseaseHum Genet 137:39–44https://doi.org/10.1007/s00439-017-1854-z
- 60.The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resourceNucleic Acids Res 51:D977–D85https://doi.org/10.1093/nar/gkac1010
- 61.Amyotrophic lateral sclerosis, gene deregulation in the anterior horn of the spinal cord and frontal cortex area 8: implications in frontotemporal lobar degenerationAging 9:823–51https://doi.org/10.18632/aging.101195
- 62.Markers of microglia in post-mortem brain samples from patients with Alzheimer’s disease: a systematic reviewMol Psychiatry 23:177–98https://doi.org/10.1038/mp.2017.246
- 63.Harnessing Transcriptomic Signals for Amyotrophic Lateral Sclerosis to Identify Novel Drugs and Enhance Risk Predictionhttps://doi.org/10.1101/2023.01.18.23284589
- 64.King’s College London. King’s Computational Research, Engineering and Technology Environment (CREATE) 2022 [October 12 2022]. Available from: 10.18742/rnvf-m076.https://doi.org/10.18742/rnvf-m076