Research Article

Genetics and Genomics

Multi-ancestry meta-analysis of host genetic susceptibility to tuberculosis identifies shared genetic architecture

DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, South Africa
Wellcome Centre for Human Genetics, University of Oxford, United Kingdom
Massachusetts General Hospital, United States
Dana-Farber Cancer Institute, United States
Centre for the AIDS Programme of Research in South Africa, South Africa
Harvard Medical School, United States
Division of Infection and Immunity, Faculty of Medical Sciences, University College London, United Kingdom
Department of Paediatrics, University of Oxford, United Kingdom
Department of Infectious Diseases Imperial College London, United Kingdom
School of Health and Related Research, University of Sheffield, United Kingdom
Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, The University of Manchester, United Kingdom
Jenner Institute, University of Oxford, United Kingdom

Jan 15, 2024

Open access
Copyright information

Abstract
Editor's evaluation
Introduction
Results
Discussion
Methods
Data availability
References
Article and author information
Metrics

Abstract

The heritability of susceptibility to tuberculosis (TB) disease has been well recognized. Over 100 genes have been studied as candidates for TB susceptibility, and several variants were identified by genome-wide association studies (GWAS), but few replicate. We established the International Tuberculosis Host Genetics Consortium to perform a multi-ancestry meta-analysis of GWAS, including 14,153 cases and 19,536 controls of African, Asian, and European ancestry. Our analyses demonstrate a substantial degree of heritability (pooled polygenic h² = 26.3%, 95% CI 23.7–29.0%) for susceptibility to TB that is shared across ancestries, highlighting an important host genetic influence on disease. We identified one global host genetic correlate for TB at genome-wide significance (p<5 × 10^-8) in the human leukocyte antigen (HLA)-II region (rs28383206, p-value=5.2 × 10^-9) but failed to replicate variants previously associated with TB susceptibility. These data demonstrate the complex shared genetic architecture of susceptibility to TB and the importance of large-scale GWAS analysis across multiple ancestries experiencing different levels of infection pressure.

Editor's evaluation

This article describes an important multi-ancestry meta-analysis of genome-wide association studies of susceptibility to tuberculosis. It demonstrates substantial heritability from common genetic variants, although this varies across studies. The main finding of the article is a variant in the HLA region that affects tuberculosis risk, for which the evidence is solid. The results and methods will be of interest to infectious disease researchers and human genetics researchers. The article highlights both the promise and challenges of performing multi-ancestry genetic association studies of infectious disease risk.

https://doi.org/10.7554/eLife.84394.sa0

Introduction

Tuberculosis (TB), caused by Mycobacterium tuberculosis (Mtb) and related species, remains a leading cause of death globally. Around one-quarter of the global population is estimated to show immunological evidence of prior exposure to Mtb (Houben and Dodd, 2016), and in 2019 an estimated 10 million people developed the disease, resulting in 1.4 million deaths (WHO, 2020). This disease burden could be substantially reduced with action to address the social determinants of disease and equitable scale-up of existing interventions. However, tools to prevent, diagnose, and treat TB could be improved if a better understanding of the underpinning pathophysiology could help identify those at greatest risk of the disease.

The role of host genetic factors in TB susceptibility has long been of significant interest. Over 100 candidate genes have been studied, but few associations have proven reproducible (Naranbhai, 2016). This failure to replicate may be a result of the modest size of many TB genome-wide association studies (GWAS), variability in phenotyping between studies, the impact of population-specific effects, the challenge of complex population structure in some high-burden settings (e.g., admixed individuals), and, possibly, pathogen variation (Correa-Macedo et al., 2019; Daya et al., 2014a; Luo et al., 2019; Möller and Kinnear, 2020; Müller et al., 2021; Omae et al., 2017; Schurz et al., 2018). Seventeen GWAS have been reported but only two loci replicate between studies (Daya et al., 2014a; Schurz et al., 2018; Chimusa et al., 2014; The Wellcome Trust Case Control Consortium, 2007; Curtis et al., 2015; Mahasirimongkol et al., 2012; Qi et al., 2017; Thye et al., 2010; Thye et al., 2012; Quistrebert et al., 2021; Sveinbjornsson et al., 2016; Hong et al., 2017; Li et al., 2021; Luo et al., 2019; Zheng et al., 2018; Grant et al., 2016; Png et al., 2012). The WT1 locus, identified in cohorts from Ghana and Gambia, replicated in South Africa and Russia. The ASAP1 locus identified in Russia was replicated through reanalysis of prior studies (Correa-Macedo et al., 2019; Möller and Kinnear, 2020).

To address these challenges, we established the International Tuberculosis Host Genetics Consortium (ITHGC) to study the host genetics of disease through collaborative and equitable data sharing (Naranbhai, 2016). The ITHGC includes 12 case–control GWAS from nine countries in Europe, Africa, and Asia (total of 14,153 pulmonary TB cases and 19,536 healthy controls). Inclusion of multiple ancestral groups in a multi-ancestry meta-analysis has the advantage of maximizing power and enhancing fine-mapping resolution to identifying true global associated variants that influence TB susceptibility across population groups.

Here we present the first analyses of the ITHGC dataset exploring host genetic correlates of TB susceptibility using a multi-ancestry meta-analysis approach, including fine-mapping of human leukocyte antigen (HLA) loci and estimation of genetic heritability.

Results

Study overview

In total, 12 GWAS from three major ancestral groups (European, African, and Asian) were included in this study (Table 1; a more detailed table outlining the selection of cases and controls is provided in Supplementary file 1a). All individual datasets were imputed and aligned to the same reference allele before association testing, using an additive genetic model, to obtain odds ratios (OR) and p-values to be used in the meta-analysis. For each individual study (for which we had raw genotyping data), the polygenic heritability was estimated, and HLA alleles were imputed for fine-mapping of the HLA regions.

Table 1

Summary of ITHGC TB-GWAS datasets.

Dataset	Population	Cases/ controls	TB prevalence per 100 ,000 pa	Estimated proportion of controls ever exposed to Mtb (±SD)*	#SNPs	Genotyping platform	Reference
China 1†	Asian	483/587	89	0.302 (0.101)	7,710,153	Affymetrix Genome-Wide Human SNP Array 6.0	thye@bni-hamburg.de (unpublished)
China 2†	Asian	1290/1145	89	0.302 (0.101)	9,769, 029	Illumina Human OmniZhonghua-8 chips	magdakellis@gmail.com (unpublished)
China 3	Asian	972/1537	89	0.302 (0.101)	9,726,450	Illumina Human OmniZhonghua-8 chips	Qi et al., 2017
Thailand	Asian	433/295	236	0.404 (0.112)	6,723,358	Illumina Human610-Quad	Mahasirimongkol et al., 2012
Japan	Asian	751/3199	23	0.142 (0.125)	9,051,051	Illumina HumanHap550	Mahasirimongkol et al., 2012
Russia†	European	5914/6022	109	0.191 (0.093)	10,878,777	Affymetrix Genome-Wide Human SNP Array 6.0	Curtis et al., 2015
Estonia	European	239/7047	13	0.116 (0.093)	10,611,556	Illumina 370K	andres.metspalu@ut.ee (unpublished)
Germany†	European	586/333	7.8	0.067 (0.081)	10,602,193	Illumina Omni2.5+exome	thye@bni-hamburg.de (unpublished)
Gambia†	African	1316/1382	126	0.280 (0.089)	18,634,017	Affymetrix GeneChip 500K	The Wellcome Trust Case Control Consortium, 2007
Ghana†	African	1359/1952	282	0.539 (0.198)	19,029,214	Affymetrix Genome-Wide Human SNP Array 6.0	Thye et al., 2010
RSA(A)† ‡	African	19/577	717	0.436 (0.127)	9,227,330	Affymetrix 500k	Daya et al., 2014b
RSA(M)†‡	African	410/405	717	0.436 (0.127)	11,371,838	Illumina MEGA array	Schurz et al., 2018

GWAS, genome-wide association studies; ITHGC, International Tuberculosis Host Genetics Consortium; Mtb, Mycobacterium tuberculosis; TB, tuberculosis.
*

Estimated proportion of control individuals ever infected with Mtb by age 35–44 in 2010, based on data from Houben & Dodd.
†

Raw genotyping data available.
‡

RSA(A/M): South African admixed population (RSA) Affymetrix (A) and MEGA (M) array data.

The summary statistics from the individual GWAS of each dataset were used to conduct a combined, multi-ancestry meta-analysis using MR-MEGA and ancestry-specific (European, African, and Asian) fixed effects (FE) meta-analyses using GWAMA. Finally, the impact of infection pressure on the multi-ancestry meta-regression was assessed and the concordance in direction of effect for the reference allele between studies was investigated.

Polygenic heritability estimates suggest a genetic contribution to TB disease susceptibility

Twin studies estimate the narrow-sense heritability of susceptibility to TB at up to 80% (Diehl and Von, 1936; Kallmann and Reisner, 1943; Comstock, 1978), but there are few modern estimates. Using raw (unimputed) genotyping data, and assuming population prevalence of disease in each study population equivalent to the reported WHO prevalence rates for that country (WHO, 2020), we estimated polygenic heritability of susceptibility to TB in 10 contributing studies which ranged from 5 to 36% (average of 26.3%, Supplementary file 1b). Comparisons of the heritability estimates between studies from different geographical locations do not take into consideration the differences in environmental pressures between the included studies, and as such these estimates of heritability are only interpretable if the distribution of nongenetic determinants of TB is held constant (Pearce, 2011). Furthermore, variations in phenotype definition can have an impact on heritability estimates (Supplementary file 1a). This is supported by previous research by McHenry et al., 2021a, where significant differences in polygenic heritability estimates were identified between subjects with latent TB infection (LTBI), active TB, and subjects classified as resistors. (McHenry et al., 2021a). As this study includes data with varying methods of classifying TB cases and healthy controls (Supplementary file 1a), there is potential for a degree of heterogeneity and misclassification (between cases and controls) that can have an impact on the heritability estimates. Recent history has seen the near elimination of TB in several countries associated with economic development and public health action. However, while improvement of socioeconomic standing and environment has a stronger impact than host genetics, these crude estimates of polygenic heritability do indicate that TB susceptibility is, in part, heritable. These results require future, more rigorous investigations to narrow down the level of heritable risk and pinpoint genomic loci involved by accounting for population stratification to obtain more accurate heritability estimates.

Multi-ancestry meta-analysis identifies susceptibility loci for TB

For the primary multi-ancestry meta-analysis, MR-MEGA was used as it allows for differences in allelic effects of variants on disease risk between GWAS. Principal components (PCs), derived from a matrix of similarities in allele frequencies between GWAS, were plotted and revealed distinct separation between the three main ancestral groups included in the study (Figure 4) . To account for this, the first two PCs were included as covariates in MR-MEGA as they sufficiently accounted for the allele frequency differences between the study populations, as assessed via a QQ-plot and associated lambda inflation value (Figure 1—figure supplement 1, lambda = 1.00). In total, 26,620,804 variants with a minor allele frequency (MAF) > 1% and present in at least three studies were included in the analysis, of which 3,184,478 were present in all 12 datasets.

A significant association peak on chromosome 6 was identified in the HLA class II region (Figure 1). One variant (rs28383206, OR = 0.89, CI = 0.84–0.94, p-value=8.26 × 10^–9) within this peak was associated with susceptibility to TB at genome-wide significance (p<5.0e^–8, Figures 1—3, Table 2). Both the residual heterogeneity (p-value=0.012) and ancestry-correlated heterogeneity (p-value=5.28e^–6) are significant (p-value<0.05) for the associated variant. However, the evidence of ancestry-correlated heterogeneity is much stronger than for residual heterogeneity, indicating that genetic ancestry contributes more to differences in effects sizes between GWAS than does study design (e.g., phenotyping differences and potential case–control misclassification). The association peak encompasses many HLA-ll genes, including HLA-DRB1/5 (major histocompatibility complex, class II, DR beta 1/5), HLA-DQA1 (major histocompatibility complex, class II, DQ alpha 1), and HLA-DQB3 (major histocompatibility complex, class II, DQ beta 3, Figures 1 and 2). While not reaching genome-wide significance, the HLA class l locus is also indirectly tagged through the association with rs2621322, in the TAP2 (transporter 2, ATP binding cassette subfamily B member) gene, a transporter protein that restores surface expression of MHC class I molecules and has previously been implicated in TB susceptibility (Thu et al., 2016). HLA-A, DQA1, DQB1, DRB1, and TAP2 genes have previously been linked to TB susceptibility through TB candidate gene and GWAS analysis (Thu et al., 2016; Kinnear et al., 2017; Stein et al., 2017; Sveinbjornsson et al., 2016; Zhang et al., 2021). The HLA-II locus encodes several proteins crucial in antigen presentation, including HLA-DR, HLA-DQ, and HLA-DP, which are widely implicated in susceptibility to infection and autoimmunity (Kelly and Trowsdale, 2019; Shiina et al., 2009).

Figure 1 with 5 supplements see all

Download asset Open asset

Manhattan plot of p-values (more than three studies) from the MR-MEGA analysis of all 12 datasets with genomic control reveals one significant association in the *HLA-ll* region of chromosome 6 (rs28383206).

Image produced using R scripts provided by MR-MEGA (Mägi et al., 2017), and source data file has been uploaded to https://doi.org/10.5061/dryad.6wwpzgn2s.

Figure 2

Download asset Open asset

Regional association plot for the chromosome 6 *HLA-ll* rs28383206 association in the multi-ancestry analysis revealing a significant peak in the HLA-ll region.

Image produced using the online LocusZoom database with linkage disequilibrium (LD) mapping set to ‘all’ and p-values>0.01 removed (Boughton et al., 2021), and source data file has been uploaded to https://doi.org/10.5061/dryad.6wwpzgn2s.

Figure 3 with 1 supplement see all

Download asset Open asset

HLA conditioning analysis.

(A) Forest plot (odds ratio and 95% confidence interval) of the significant chromosome 6 association (rs28383206) for tuberculosis (TB) susceptibility in the multi-ancestry analysis, implemented using MR-MEGA with genomic control correction (GCC). Of the 12 studies included, 8 contained this variant. Studies that did not contain the variant are included in the plot but do not have results associated with them. (B) Forest plot for HLA DQA1*02:01 for the eight studies included in the HLA association analysis. Other studies included were obtained from literature searches of previous studies where HLA imputation and association studies were performed (Sveinbjornsson et al., 2016; Li et al., 2021; Zheng et al., 2018). For source data, see Figure 3—source data 1.

Figure 3—source data 1 HLA conditioning analysis data.: https://cdn.elifesciences.org/articles/84394/elife-84394-fig3-data1-v1.xlsx
Download elife-84394-fig3-data1-v1.xlsx

Table 2

Significant and suggestive associations (p-value ≤1e^–5) for the multi-ancestry analysis including data from all 12 datasets implementing MR-MEGA analysis with GCC.

Marker name	Chromosome	Position	Gene	Location	CADD score	EA	NEA	EAF	Sample size	Datasets	p-Value
rs28383206	6	32575167	HLA-DRB1	Intergenic	7.6	G	A	0.168	25,059	8	8.26e^–0⁹

GCC, genomic control correction; EA, effect allele; EAF, effect allele frequency; NEA, noneffect allele.

HLA-II

Given the strong association peak in the HLA-ll locus (Figures 1 and 2), we imputed HLA-ll alleles to fine-map this association. HLA alleles were imputed using the HIBAG R package that utilizes both genotyping array and population-specific reference panels to obtain the most accurate imputations for each individual dataset. Association testing was then conducted using an additive genetic model for each individual dataset before meta-analyzing the results (Source data 1, sheets 11–15).

Notwithstanding inconsistency across populations, the strongest signal in the combined global analyses is at DQA1*02:01, revealing a protective effect (OR = 0.88, 95% CI = 0.82–93, p-value=1.3e^–5, Figure 3B). The signal remains apparent in the six populations with the lead SNP at MAF > 2.5% and individual-level data available (p-value=0.0003). After conditioning on the lead SNP (rs28383206) in this subset, there is no residual significant association at DQA1*02:01 (p-value=0.44, Figure 3—figure supplement 1), suggesting that the classical allele is tagging the rs28383206 association. This observation is consistent with previous observations of HLA analysis in Icelandic (DQA1*02:01: OR = 0.82, p-value=7.39e^–4) and Han Chinese populations (DQA1*02:01: OR = 0.82, p-value=7.39e^–4), but showed opposite direction of effect in another Chinese population (DQA1*02:01: OR = 1.28, p-value=0.0193, Figure 3B; Sveinbjornsson et al., 2016; Li et al., 2021; Zheng et al., 2018).

The significant HLA associations overlap with the association peak observed in the multi-ancestry meta-analysis (Figure 2) but show more consistency in the direction of effects between the input studies compared to the lead SNPs identified in the association peak. This suggests that the rs28383206 association in the meta-analysis is tagging an HLA allele, where the different linkage disequilibrium (LD) patterns from the included ancestral populations result in the differences in effects sizes between populations at the rs28383206 association.

This variation in significant associations is, in part, attributable to the observed variation in HLA allele frequencies across all the included studies and may also reflect differential tagging of at least one unknown causal variant across populations (Source data 1, sheets 16–22).

The variable role of classical HLA alleles in different populations could be partially due to unique infectious pressures that each geographical region faces and could also explain why different strains of Mtb are more or less prevalent in different regions as they adapted to the HLA profile of the population within this region. Sequencing efforts of global mycobacterial isolates find hyperconservation of class II epitopes, suggesting pathogen advantage achieved through limiting HLA-II recognition and highlighting the potential complex interplay between pathogen and host evolution in modifying class II presentation in TB infection (Comas et al., 2010). Previous work has shown evidence of interaction between genetic variants of the host and specific strains of Mtb in Ghanaian, Ugandan, South African, and Asian populations (Möller and Kinnear, 2020; Müller et al., 2021; Correa-Macedo et al., 2019; Salie et al., 2014; Luo et al., 2015; Wampande et al., 2019; Micheni et al., 2021; McHenry et al., 2021b; McHenry et al., 2020). These interactions provide further evidence that Mtb may have undergone substantial genetic evolution, in concert with host migration and evolution of different populations (Comas et al., 2013; Coscolla and Gagneux, 2014). Some studies suggest that HLA-II epitopes may have undergone regional mutations that modify HLA-II binding, and we speculate that the heterogeneity observed in HLA-II associations between regions may, at least in part, be accounted for by different pressures exerted by varying stains of Mtb (Copin et al., 2016).

Impact of infection pressure on meta-regression

To further understand the heterogeneity across populations, we attempted to account for variation in levels of prior exposure that could serve to mask host effects given that not all controls will have been exposed to Mtb. In low transmission settings, more susceptible but unexposed individuals would be included as controls, who, had they been exposed to Mtb, might have progressed to TB disease. Overall, including each cohort’s estimated prevalence of prior exposure had a significant impact on the residual heterogeneity and association statistics of 5% of the variants included in the meta-analysis (419,460/8,355,367), which at a significance level of p-value<0.05 is what is to be expected purely by chance. Separating the results into bins according to p-values revealed that the bins where the covariate had the biggest impact were for p-values in the range of 1e^–3 to 1e^–5 (Figure 1—figure supplement 2), while significant and suggestive associations reported in this study did not show any significant changes in residual heterogeneity. While the proportion of variants significantly impacted when correcting for infection pressures is low and has the biggest impact on variants with larger p-values, there was still an overall reduction in the chi-square value for the residual heterogeneity (mean chi-square value reduced by 10). This suggests that accounting for potential lifetime of infections does account for some of the observed residual heterogeneity; it is most likely not the main driving force for these residuals.

When considering the impact of force of infection, it is important to consider not only the proportion of controls ever exposed but also the impact of recurrent exposure. There is some evidence to suggest that genetic barriers to progression to TB may be overcome if the infectious dose is high (Fox et al., 1929). Repeated exposure may be observed where TB prevalence is high, as in South Africa, and could contribute to the overall lower effects sizes observed in the GWAS enrolling RSA people. Inclusion of potential lifetime infections in meta-regression could help adjust for these effects and prove useful for not only TB, but meta-analysis of infectious diseases in general, and should be further explored.

Other suggestive loci that did not reach significance

There were four loci with suggestive associations and strong peaks on the Manhattan plot (Figure 1) that did not reach significance but should still be considered as potential variants of interest (Supplementary file 1c). One chr9 peak (rs4576509, p-value=7.40e^–07) was intergenic (Figure 1—figure supplement 3) while the second (rs6477824, p-value=2.99e^–07) is located in the 5′-UTR region of the zinc finger protein 483 (ZNF483) gene (Figure 1—figure supplement 3), previously associated with age at menarche (Demerath et al., 2013; Elks et al., 2010). The chromosome 11 peak (rs12362545, p-value=1.24e^–06) is located in the PPFIA binding protein 2 (PPFIBP2) gene (Figure 1—figure supplement 4), which plays a role in axon guidance and neuronal synapse development and has previously been implicated in cancer development (Colas et al., 2011; Wu et al., 2018). The final peak (rs35787595, p-value=5.41e^–06), on chromosome 16 (Figure 1—figure supplement 5), is located in the craniofacial development protein 1 (CFDP1) gene region and involved in chromatin organization (Messina et al., 2017). These genes have not been previously linked to TB susceptibility and a potential role is unclear, and as a result further validation of these variants is needed before any conclusions on their impact to TB susceptibility can be drawn.

Ancestry-specific meta-analysis

Concordance in the direction of effects of the risk allele between the ancestry-specific meta-analyses was examined to determine whether significant enrichment (above the expected 50%) exists at different p-value thresholds. Significant enrichment in the concordance of direction of effect was only observed when using the European ancestry as reference compared to the African meta-analysis results for SNPs with p-values>0.001 and <0.01 (p-value=0.0061, Supplementary file 1d). The lack of enrichment between the ancestries suggests significant ancestry-specific associations, which could be further compounded by the differences in local infection pressures. Due to the lack of concordance and the separation of the ancestral populations in the principal component analysis (PCA) plot (Figure 4), ancestry-specific meta-analysis was done.

Figure 4 with 4 supplements see all

Download asset Open asset

Principal component analysis (PCA) plot of all 12 studies based on the MR-MEGA mean pairwise genome-wide allele frequency differences.

Image produced using the R plot function. For source data, see Figure 4—source data 1.

Figure 4—source data 1 PCA source data.: https://cdn.elifesciences.org/articles/84394/elife-84394-fig4-data1-v1.xlsx
Download elife-84394-fig4-data1-v1.xlsx

The PCA plot (Figure 4) for the 12 studies (based on mean pairwise genome-wide allele frequency differences calculated by MR-MEGA) illustrates distinct separation between the three major population groups (Asia, Europe, and Africa). The separation observed between the African studies (Gambia/Ghana and RSA) is due to the high level of admixture in the RSA population. The RSA population is a five-way admixed South African population with genetic contributions from Bantu-speaking African, KhoeSan, European, and South and South East Asian populations, which explains the observed shift in the PCA plot (Daya et al., 2013; Figure 4).

QQ-plots for the ancestry-specific analysis show no significant inflation or deflation. After removing associations without any clear peaks on the Manhattan plots (associations driven by a single study), we found no significant associations for the ancestry-specific analysis. However, suggestive peaks that did not reach genome-wide significance were identified in the European and Asian ancestry-specific analyses (Figure 4—figure supplements 1 and 2, Supplementary file 1e). Potential causes for the lack of associations and suggestive peaks in the African analysis (Figure 4—figure supplement 3) are the increased genetic diversity within Africa, the inclusion of admixed samples (RSA), and the smaller sample size compared to the other ancestry-specific meta-analysis. While power can be increased through inclusion of greater genetic diversity, between-subpopulation differences in allele frequency can introduce confounding. Confounding by genetic background can result in both spurious associations and the masking of true associations. Such confounding may explain why the results observed elsewhere may not replicate in admixed samples. Removing the admixed data and analyzing only the Gambian and Ghanaian datasets also did not produce any significant results, although, clearly, the sample size was smaller.

For the European analysis (Figure 4—figure supplement 1), suggestive peaks were identified on chromosomes 6 (rs28383206, p-value=7.06e^–08), 8 (rs3935174, p-value=1.00e^–06), and 11 (rs12362545, p-value=1.06e^–07, Supplementary file 1e), while the Asian (Figure 4—figure supplement 2) analysis identified suggestive peaks on chromosome 6 (rs146049519, p-value=1.06e^–06) and 8 (rs62495207, p-value=5.10e^–06, Supplementary file 1e).

The suggestive peaks on chromosomes 6 and 11 in the European subgroup analysis overlap with the suggestive peaks of the multi-ancestry meta-analysis (Figure 1, Figure 4—figure supplement 4, Supplementary file 1e), but the suggestive peak on chromosome 8 is unique to this population (Figure 4—figure supplement 1, Supplementary file 1e). The strongest signal for this peak (rs3935174, OR = 0.87, p-value=1.00e^–6) is located in the ArfGAP with SH3 domain, ankyrin repeat, and PH domain 1 (ASAP1) region, which encodes an ADP-ribosylation factor (ARF) GTPase-activating protein and is potentially involved in the regulation of membrane trafficking and cytoskeleton remodeling (Brown et al., 1998). Variants in ASAP1 (rs4733781 and rs10956514) have previously been linked to TB susceptibility in a TB-GWAS analysis of the same Russian population included here (Curtis et al., 2015). While these ASAP1 variants were present in all 12 studies and had consistent direction of effects, they presented with a strong signal in the European ancestry-specific analysis only (African and Asian p-values all ≥ 0.1). These differences in association were not driven by allele frequency differences as they are similar between the included study populations. A possible explanation for the association being observed only in the European meta-analysis is that the association is driven by the Russian dataset. rs4733781 has a strong signal in the Russian dataset (p-value=2.96e^–7), but very weak signals in all other populations included in the analysis (p-value>0.01) and is in LD with rs3935174 (r2 = 0.6935 and D’ = 0.8791) identified in our analysis. rs4733781 also did not replicate in a previous GWAS from Iceland (Sveinbjornsson et al., 2016), further suggesting that this association is not specific to European populations, but rather driven by the large Russian dataset included in this study.

The suggestive peak on chromosome 8 in the Asian subgroup analysis lies in an intergenic region (Figure 4—figure supplement 2, Supplementary file 1e) and the link to TB susceptibility is unclear. Finally, the suggestive region on chromosome 6 overlaps with the significant peak from the multi-ancestry analysis (Figure 1 and Figure 4—figure supplement 2) and is located in the major histocompatibility complex, class II, DR beta 1 (HLA-DRB1), as discussed above (Figure 4—figure supplement 2, Supplementary file 1e).

Prior associations

To determine whether associations from previously published TB-GWAS, TB candidate SNPs, and SNPs within candidate gene studies replicate in this meta-analysis, we extracted all significant and suggestive associations from prior analyses and compared these to our multi-ancestry and ancestry-specific meta-analysis results (Luo et al., 2019; Schurz et al., 2018; Chimusa et al., 2014; The Wellcome Trust Case Control Consortium, 2007; Curtis et al., 2015; Mahasirimongkol et al., 2012; Qi et al., 2017; Thye et al., 2010; Thye et al., 2012; Quistrebert et al., 2021; Hong et al., 2017; Zheng et al., 2018; Grant et al., 2016; Png et al., 2012; Daya et al., 2014b). In total, 44 SNPs and 36 genes were identified from the GWAS catalog, of which 33 SNPs and all candidate genes were present in our data (Source data 1, sheet 2). We also extracted the association statistics for a further 90 previously identified candidate genes from our multi-ancestry and population-specific meta-analysis results (Source data 1, sheet 2; Naranbhai, 2016).

Using a Bonferroni-corrected p-value of 0.0015 for the number of SNPs tested (33) as the significance threshold for replication, two candidate SNPs (rs4733781: p-value=3.22e^–5; rs10956514: p-value=0.000118; Source data 1, sheets 3 and 4) replicated in the multi-ancestry meta-analysis, both located in the ASAP1 gene region (Curtis et al., 2015; Chen et al., 2019; Wang et al., 2018). However, as discussed in the previous section, these associations are driven by the Russian dataset, which is the same data used by Curtis et al., 2015, where these associations were originally discovered (Curtis et al., 2015). As the Russian population included in our analysis presenting with a strong signal for these variants, there is no independent evidence for these candidate SNPs as they did not replicate in any other population.

For the Asian ancestry-specific analysis, the replicated variant was rs41553512, located in the HLA-DRB5 gene (p-value=3.53E-05). HLA-DRB5 is located within the HLA-ll region identified in the multi-ancestry meta-analysis (Figure 1) and was previously identified by Qi et al., 2017 in a Han Chinese population. The African ancestry-specific analysis did not replicate previous associations, with the lowest p-value at rs6786408 in the FOXP1 gene (p-value=0.023). While this variant was previously identified in a North African cohort, the fact that it does not replicate here could be because of the genetic diversity within Africa and specifically the variability introduced by the five-way admixed South African population.

Discussion

This large-scale, multi-ethnic meta-analysis of genetic susceptibility to TB, involving 14,153 cases and 19,536 controls, identified one risk locus achieving genome-wide significance, and further investigation of this region revealed significant classical HLA allele associations. This association is noteworthy given we show that there is association in other studies for the same allele (Kinnear et al., 2017; Stein et al., 2017).

Based on the significant association, rs28383206, in the HLA region identified in this multi-ancestry analysis (Figure 3A), HLA-specific imputation and association testing were done to fine-map the region and identify potential HLA alleles driving this association. HLA DQA1*02:01 had the strongest signal in the meta-analysis across the eight included studies (Figure 3B), but this signal disappeared when conditioning on the significant SNP (rs28383206). HLA DQA1*02:01 has previously been identified in an Icelandic and two Chinese populations, but the direction of effect was not consistent (Sveinbjornsson et al., 2016; Li et al., 2021; Zheng, 2018). Despite these inconsistencies, the association between Mtb and HLA class II should be explored in more detail in future studies. A study investigating the outcomes of Mtb exposure in individuals of African ancestry identified protective effects of HLA class II alleles for individuals resistant to TB, highlighting the importance of HLA class II and susceptibility to TB (Dawkins et al., 2022). HLA class II is a key determinant of the immune response in TB, and Mtb has the mechanisms to directly interfere with MHC class 2 antigen presentation (Sia and Rengarajan, 2019). This is supported by studies in mice, where mice in which the MHC class ll genes were deleted died quickly when exposed to Mtb and died faster than the mice in which MHC class I genes were deleted (Sia and Rengarajan, 2019).

The p-values of residual heterogeneity in genetic effects between the studies in the multi-ancestry meta-analysis show no significant inflation between the studies. This suggests that the differences in study characteristics (phenotype definition, infection pressure, Mtb strain) are not the main contributor to the lack of significant associations. However, they certainly have an impact, which is further compounded with ancestry-correlated heterogeneity and other factors (e.g., socioeconomic standing). The ancestry-correlated heterogeneity p-values are generally lower than the residual heterogeneity, suggesting that genetic ancestry has a stronger impact on the differences in effects sizes between the studies. This is supported by the fact that previous TB genetic association studies have identified significant effects of ancestry on TB susceptibility (Chimusa et al., 2014; Daya et al., 2014b). However, the effects of genetic ancestry can be confounded by other factors not accounted for in this analysis, such as the differences in socioeconomic factors (including the differences in housing, employment, poverty, and access to healthcare), phenotype definitions, and differences in infection pressure between the included study populations (Hargreaves et al., 2011; Duarte et al., 2018; Lönnroth et al., 2009). Specifically, the lack of consistency and specificity in TB diagnosis between the included studies introduces heterogeneity and the potential for misclassification of cases and controls, which can reduce the power to detect significant associations (Supplementary file 1a). While this is a limitation of this study, the fact that the residual heterogeneity is overpowered by the ancestry-specific heterogeneity suggests that the phenotype definitions are not the main driver behind the lack of significant associations. For the ancestry-specific analysis, fewer studies result in there being less input heterogeneity to account for, but the reduced sample size was not sufficient to detect any ancestry-specific genome-wide associations. This is particularly evident for the African ancestry-specific meta-analysis where the large degree of heterogeneity, which could be a result of the high genetic diversity within Africa, in combination with differences in socioeconomic factors compared to other populations included in this study, resulted in no observable suggestive association peaks (Campbell and Tishkoff, 2008; Peprah et al., 2015). Furthermore, the suggestive associations (Supplementary file 1c and e) reported in this study should be interpreted with care, and further validation is required before any conclusions can be drawn on the impact that they could have on TB susceptibility.

Polygenic heritability estimates revealed genetic contributions to TB susceptibility for all studies, but the level of this contribution varied greatly (5–36%), suggesting that other factors are contributing to both the lack of significant associations detected in this meta-analysis and the variation observed for the polygenic heritability estimates. These factors likely include environmental, socioeconomic, and varying levels of infection pressures, as well as genetic ancestry-specific effects between the included study populations. An individual from South Africa will face a much higher force of infection than individuals in Europe, and making the assumption that environmental circumstances are equal will significantly skew these crude heritability estimates (Pearce, 2011). This argument is sustained by the fact that increasing disease prevalence (higher infection pressure) increased the level of genetic contribution to TB susceptibility up to a certain point, presumably accounted for by increasingly informative control samples, after which further increasing the infection pressure will not further impact genetic susceptibility.

To determine the impact that force of infection has on the level of genetic contribution to TB susceptibility, we modeled values for proportion of people ever infected with Mtb to include in the multi-ancestry meta-analysis and correct for the different force of infection faced by individuals in each country. Inclusion of this covariate, however, only resulted in a significant difference for 5% of the analyzed variants, what is to be expected based on chance alone, and as such we cannot conclude that a significant portion of the observed residual heterogeneity is explained by this. Limited metadata forced us to make several assumptions about the ages of study participants and the dates on which they were enrolled. With more precise metadata, or Mtb infection test results in controls, the potential impact of lifetime infection could be better quantified and may contribute to elucidating genetic TB susceptibility. Multi-ancestry meta-analysis of other infectious diseases could also potentially benefit from the inclusion of force of infection covariates. It would also be important to determine whether there is a level of exposure beyond which host genetic barriers to infection are overcome (Simmons et al., 2018).

A single significant association was identified in this multi-ancestry meta-analysis, which is small when compared to other meta-analyses of similar size. Factors contributing to this include the difficulty in analyzing multi-ancestry data, the outdated arrays and lack of suitable reference panels for the included study populations, and heterogeneity in case and control definitions between the studies. The issue of heterogeneity in definitions is especially pronounced for this study as it included unpublished data with limited information, which does not indicate how cases were confirmed and controls were collected. The complexity of TB and generally small genetic effects suggests that larger sample sizes or alternative methods of investigation are needed. Utilizing GWAS arrays that better capture diverse populations in combination with imputation making use of larger and more diverse reference panels would allow for larger and more consistent datasets for future meta-analysis. Remapping specific areas of interest such as the HLA, ASAP1, or TLR using long-read sequencing would be invaluable. Increased amounts of genetic data will also allow for more accurate TB heritability analysis and permit analysis of polygenic risk scores and exploration of host–pathogen interactions.

In conclusion, this large-scale multi-ancestry TB GWAS meta-analysis revealed significant associations and shared genetic TB susceptibility architecture across multiple populations from different genetic backgrounds. The analysis shows the value of collaboration and data sharing to solve difficult problems and elucidate what determines susceptibility to complex diseases such as TB. We hope that this publication will encourage others to make their data available for future large-scale meta-analyses.

Methods

Data

This analysis includes 12 of the 17 published (and unpublished, Table 1, Supplementary file 1) GWAS of TB (with HIV-negative cohorts) prior to 2022 (Schurz et al., 2018; Chimusa et al., 2014; The Wellcome Trust Case Control Consortium, 2007; Curtis et al., 2015; Mahasirimongkol et al., 2012; Qi et al., 2017; Thye et al., 2010; Thye et al., 2012; Daya et al., 2014b). For unpublished works, we contacted researchers that were funded for genetic TB research and acquired data-sharing agreements to obtain summary statistics (or raw data) along with any metadata that was available. It excludes data from Iceland and Vietnam (Quistrebert et al., 2021) as they declined to share data. It excludes data from China, Korea, Peru, and Japan (Luo et al., 2019; Hong et al., 2017; Li et al., 2021; Zheng, 2018; Sveinbjornsson et al., 2016) as data-sharing agreements could not be finalized in time for this analysis. The Indonesian and Moroccan data were too sparsely genotyped and not suitable for reliable imputation. In addition, the Moroccan data was family-based and thus also not suitable for this meta-analysis as this would introduce confounding effects from the inclusion of related individuals (Grant et al., 2016; Png et al., 2012). Finally, cases and controls are also available within large-scale biobanks, for example, UK Biobank, which could also be leveraged in future iterations of this analysis (Munafò et al., 2018).

Included individuals were genotyped on a variety of genotyping arrays (Table 1, Supplementary file 1), and raw genotyping data was available for eight datasets and for the remainder association testing summary statistics were obtained to use in the meta-analysis (Table 1, Supplementary file 1). Quality control (QC) of raw genotyping data (Table 1, Supplementary file 1) was done using Plink (v1.9), followed by pre-phasing using SHAPEIT and imputation with IMPUTE2 with the 1000 genomes phase 3 reference panel (Chang et al., 2015; Delaneau et al., 2013; Howie et al., 2009; Sudmant et al., 2015). QC and imputation were done as described previously (Schurz et al., 2018; Schurz et al., 2019); briefly, we used a MAF filter of 0.025 and an individual and SNP missingness filter of 0.1. Hardy–Weinberg equilibrium threshold was set at a Bonferroni-corrected p-value according to the number of SNPs testes (0.05/number of SNPs) and samples where sex could not be determined from genotyping were also removed. Imputed data was filtered at a quality score of 0.3, prior to individual and genotype filtration steps. Prior to QC and imputation, allele orientation was corrected using Genotype Harmoniser version 1.4.15, and the genome build of all datasets was checked for consistency (GRCh37) and updated if necessary using the liftOver software from the UCSC genome browser (Deelen et al., 2014; Kent et al., 2002). The four datasets with only summary statistics available (Table 1, Supplementary file 1) were imputed and QC’d during the original investigations, but the marker names and allele orientation were checked for concordance between the summary statistics and the rest of the consortium’s imputed data.

Polygenic heritability analysis

To assess the level of genetic contribution to TB susceptibility, we estimated polygenic heritability on the individual studies for which raw genotyping data was available (Table 1, Supplementary file 1). Polygenic heritability estimates were calculated using GCTA (v1.93.2), a genomic risk prediction tool (Yang et al., 2011). The genetic relationship matrix was calculated for each autosomal chromosome. Raw genotype data was pruned for SNPs in LD using a 50 SNP window, sliding by 10 SNPs at a time and removing all variants with LD > 0.5. Samples were filtered by removing cryptic relatedness (--grm-cutoff 0.025) and assuming that the causal loci have similar distribution of allele frequencies as the genotyped SNPs (--grm-adj 0). Principal components were then calculated (--pca 20) to include as covariates prior to estimating heritability. Heritability estimations were transformed onto the liability scale using the GCTA software to account for the difference in the proportion of cases in the data compared to the population prevalence (Yang et al., 2011). The average heritability estimate was calculated by taking the mean of all estimates and the confidence intervals were estimated based on the standard error across all studies and the number of studies included.

Meta-analysis

All variants with MAF > 1% and polymorphic in at least three studies (from at least two different ancestries) were included in the primary analysis. For the GWAS, summary statistics of each dataset variants with infinite confidence intervals were removed prior to the meta-analysis. A multi-ancestry meta-analysis plus separate ancestry-specific analyses for Africa, Asia, and Europe were performed. MR-MEGA (Meta-Regression of Multi-Ethnic Genetic Association, v0.20), a meta-analysis tool that maximizes power and enhances fine-mapping when combining data across different ethnicities, was used for the multi-ancestry meta-analysis (Mägi et al., 2017). To account for the expected heterogeneity in allelic effects between populations, MR-MEGA implements a multi-ancestry meta-regression that includes covariates to represent genetic ancestry, obtained from multidimensional scaling of mean pairwise genome-wide allele frequency differences. Genomic control correction (GCC) was implemented during the MR-MEGA analysis for the individual input data (if lambda was >1.05) and output statistics, and the first two PCs, calculated from the genome-wide allele frequency differences, were included as covariates in the regression. QQ-plots of p-values and associated lambda values were used to assess the quality of results prior to downstream investigation.

For the ancestry-specific analyses, the studies were grouped by the major ancestral groups (Table 1, Supplementary file 1) and all variants with a MAF of > 1% that were observed in at least two studies were included in the meta-analysis. We performed traditional fixed-effects meta-analyses in GWAMA (v2.2.2), implementing GCC and assessed the results using QQ-plots (Mägi and Morris, 2010). The genome-wide significance threshold for all association testing was set at p-value=5 × 10^-8 (Panagiotou et al., 2012).

HLA imputation

To fine-map HLA alleles over the HLA locus we imputed HLA class l and ll variants for all 8 studies for which raw data was available (Table 1 and Supplementary file 1). HLA imputation for the HLA class l regions A, B and C as well as the HLA class ll regions DPB1, DRB1, DQB1 and DQA1 was done using the R package HIBAG (version 1.5), implemented in the R free software environment (version 4.0.5) using the predict() command for imputation (R Development Core Team, 2013; Zheng, 2018; Zheng et al., 2014).

The reference datasets for HLA imputation are both genotyping panel and population-specific, and HIBAG has a database of reference data for many genotyping arrays. Each reference panel is also available for either Asian, European, or African populations or a mixture of the three (https://hibag.s3.amazonaws.com/hlares_index.html#estimates). For each dataset included for imputation, the reference panel chosen was the same as the genotyping array used for the data and the reference population was chosen to match the data as closely as possible. Asian and European reference panels were used for the Asian and European populations and African references were used for the Gambia and Ghana datasets, while mixed datasets were implemented for the admixed RSA population.

Following imputation, the HIBAG package (hlaAssocTest) command was used to implement an additive association test for the HLA alleles across the different regions limited to alleles at MAF > 2.5%. Analyses were adjusted for the first four PCs with and without the rs28383206 genotype in the model. Association testing results for the eight included studies were then combined in a fixed-effects meta-analysis using Metasoft software (Han and Eskin, 2011). Ancestry-specific meta-analysis grouped according to the major population groups (Table 1, Supplementary file 1) was also done using the same method.

Estimation of infection pressure

To generate a covariate capturing the likely cumulative exposure to Mtb for included controls, the results of Houben and Dodd, 2016 were adapted to produce a distance matrix to feed into the meta-analysis. The approach in this article fits a Gaussian process model of infection risk history to local data. To represent uncertainty in derived results, a sample of 200 estimated histories of the annual risk of TB infection in each country was used to calculate the expected fraction of control participants ever infected with Mtb, assuming that controls were uniformly aged between 35 and 44 y in 2010, which approximates the period during which controls were recruited for most of the studies. The true age of the controls was not known for all of the datasets, but as quite a substantial skew to the age distribution would be required to have an impact on the results, we believe our choice here is justified. This was done by including estimates for the potential lifetime infections for each source population as a covariate in the MR-MEGA multi-ancestry meta regression. To determine the impact of the covariate, a chi-square difference test was implemented, on an SNP-SNP basis, on the residual and association testing statistics of two meta-analysis output statistics, one including and the other excluding the potential lifetime infections covariate (Satorra and Bentler, 2001). The aim was to determine whether inclusion of potential lifetime infections in the regression explained some of the residual heterogeneity.

Concordance of direction of effect

To determine the degree to which direction of effect is shared for SNPs between the ancestry-specific meta-analysis, we followed the methodology of Mahajan et al., 2014. First, we identified all variants present in all 12 included datasets. Among these SNPs, we then identified an independent subset of variants in the European ancestry-specific meta-analysis showing nominal evidence of association (p-value≤0.001) and separated by at least 500 kb. The identified SNPs were then extracted from the Asian and African ancestry-specific meta-analysis results to calculate the number of SNPs that had the same direction of effect as in the European analysis. To determine whether significant excess in concordance of effect direction was present, a one-sided binomial test was implemented with the expected concordance set at 50%. This analysis was then repeated for other p-value thresholds (0.001<p≤0.01; 0.01<p≤0.5; and 0.5<p≤1), and also using the African and Asian meta-analysis results as reference.

Data availability

Summary statistics of all meta-analysis will be made available on Dryad (https://doi.org/10.5061/dryad.6wwpzgn2s). The summary statistics and raw data (where available) of the individual data files cannot be made available but enquiries or requests for this data can be made through the corresponding authors or authors directly responsible for the data, listed in Table 1. As the ITHGC consortium has strict data transfer and sharing agreements with the original authors/owners of the data we can not ethically share the source data files in any way, be it either anonymized, de-identified or in any other form. All data that is not restricted by these data transfer and ethical agreements has been either uploaded to the online repository (https://doi.org/10.5061/dryad.6wwpzgn2s) or submitted along with this document. If any interested researchers want to apply for access to the original raw and individual GWAS datasets or any other other data currently restricted they can contact the corresponding author of this manuscript to put them in touch with the original data owners/authors, or the original data owners/authors can be contacted directly by contacting the corresponding authors listed in Table 1. Once the original authors/owners of the data have been contacted discussions can be had to share the data using the appropriate and ethically approved methods, which could include data transfer agreements or similar application processes.

The following previously published data sets were used

1. Schurz H
2. Naranbhai V
3. Yates TA
4. Gilchrist J
5. Parks T
6. Dodd P
7. Möller M
8. Hoal EG
9. Morris A
10. Hill AV
(2022) Dryad Digital Repository
Multi-ancestry meta-analysis of host genetic susceptibility to tuberculosis identifies shared genetic architecture.

https://doi.org/10.5061/dryad.6wwpzgn2s

References

(2021) LocusZoom.js: interactive and embeddable visualization of genetic association study results
Bioinformatics 37:3017–3018.

https://doi.org/10.1093/bioinformatics/btab186
- PubMed
- Google Scholar
(1998) ASAP1, a phospholipid-dependent arf GTPase-activating protein that associates with and is phosphorylated by Src
Molecular and Cellular Biology 18:7038–7051.

https://doi.org/10.1128/MCB.18.12.7038
- PubMed
- Google Scholar
1. Campbell MC
2. Tishkoff SA
(2008) African genetic diversity: implications for human demographic history, modern human origins, and complex disease mapping
Annual Review of Genomics and Human Genetics 9:403–433.

https://doi.org/10.1146/annurev.genom.9.081307.164258
- PubMed
- Google Scholar
1. Chang CC
2. Chow CC
3. Tellier LC
4. Vattikuti S
5. Purcell SM
6. Lee JJ
(2015) Second-generation PLINK: rising to the challenge of larger and richer datasets
GigaScience 4:7.

https://doi.org/10.1186/s13742-015-0047-8
- PubMed
- Google Scholar
1. Chen C
2. Zhao Q
3. Shao Y
4. Li Y
5. Song H
6. Li G
7. Zhu L
8. Lu W
9. Xu B
(2019) A Common Variant of ASAP1 Is Associated with Tuberculosis Susceptibility in the Han Chinese Population
Disease Markers 2019:7945429.

https://doi.org/10.1155/2019/7945429
- PubMed
- Google Scholar
1. Chimusa ER
2. Zaitlen N
3. Daya M
4. Möller M
5. van Helden PD
6. Mulder NJ
7. Price AL
8. Hoal EG
(2014) Genome-wide association study of ancestry-specific TB risk in the South African Coloured population
Human Molecular Genetics 23:796–809.

https://doi.org/10.1093/hmg/ddt462
- PubMed
- Google Scholar
1. Colas E
2. Perez C
3. Cabrera S
4. Pedrola N
5. Monge M
6. Castellvi J
7. Eyzaguirre F
8. Gregorio J
9. Ruiz A
10. Llaurado M
11. Rigau M
12. Garcia M
13. Ertekin T
14. Montes M
15. Lopez-Lopez R
16. Carreras R
17. Xercavins J
18. Ortega A
19. Maes T
20. Rosell E
21. Doll A
22. Abal M
23. Reventos J
24. Gil-Moreno A
(2011) Molecular markers of endometrial carcinoma detected in uterine aspirates
International Journal of Cancer 129:2435–2444.

https://doi.org/10.1002/ijc.25901
- PubMed
- Google Scholar
1. Comas I
2. Chakravartti J
3. Small PM
4. Galagan J
5. Niemann S
6. Kremer K
7. Ernst JD
8. Gagneux S
(2010) Human T cell epitopes of Mycobacterium tuberculosis are evolutionarily hyperconserved
Nature Genetics 42:498–503.

https://doi.org/10.1038/ng.590
- PubMed
- Google Scholar
1. Comas I
2. Coscolla M
3. Luo T
4. Borrell S
5. Holt KE
6. Kato-Maeda M
7. Parkhill J
8. Malla B
9. Berg S
10. Thwaites G
11. Yeboah-Manu D
12. Bothamley G
13. Mei J
14. Wei L
15. Bentley S
16. Harris SR
17. Niemann S
18. Diel R
19. Aseffa A
20. Gao Q
21. Young D
22. Gagneux S
(2013) Out-of-Africa migration and Neolithic coexpansion of Mycobacterium tuberculosis with modern humans
Nature Genetics 45:1176–1182.

https://doi.org/10.1038/ng.2744
- PubMed
- Google Scholar
1. Comstock GW
(1978) Tuberculosis in twins: a re-analysis of the Prophit survey
The American Review of Respiratory Disease 117:621–624.

https://doi.org/10.1164/arrd.1978.117.4.621
- PubMed
- Google Scholar
1. Copin R
2. Wang X
3. Louie E
4. Escuyer V
5. Coscolla M
6. Gagneux S
7. Palmer GH
8. Ernst JD
(2016) Within host evolution selects for a dominant genotype of mycobacterium tuberculosis while T cells increase pathogen genetic diversity
PLOS Pathogens 12:e1006111.

https://doi.org/10.1371/journal.ppat.1006111
- PubMed
- Google Scholar
(2019) The interplay of human and Mycobacterium Tuberculosis genomic variability
Frontiers in Genetics 10:865.

https://doi.org/10.3389/fgene.2019.00865
- PubMed
- Google Scholar
1. Coscolla M
2. Gagneux S
(2014) Consequences of genomic diversity in Mycobacterium tuberculosis
Seminars in Immunology 26:431–444.

https://doi.org/10.1016/j.smim.2014.09.012
- PubMed
- Google Scholar
1. Curtis J
2. Luo Y
3. Zenner HL
4. Cuchet-Lourenço D
5. Wu C
6. Lo K
7. Maes M
8. Alisaac A
9. Stebbings E
10. Liu JZ
11. Kopanitsa L
12. Ignatyeva O
13. Balabanova Y
14. Nikolayevskyy V
15. Baessmann I
16. Thye T
17. Meyer CG
18. Nürnberg P
19. Horstmann RD
20. Drobniewski F
21. Plagnol V
22. Barrett JC
23. Nejentsev S
(2015) Susceptibility to tuberculosis is associated with variants in the ASAP1 gene encoding a regulator of dendritic cell migration
Nature Genetics 47:523–527.

https://doi.org/10.1038/ng.3248
- PubMed
- Google Scholar
1. Dawkins BA
2. Garman L
3. Cejda N
4. Pezant N
5. Rasmussen A
6. Rybicki BA
7. Levin AM
8. Benchek P
9. Seshadri C
10. Mayanja-Kizza H
11. Iannuzzi MC
12. Stein CM
13. Montgomery CG
(2022) Novel HLA associations with outcomes of Mycobacterium tuberculosis exposure and sarcoidosis in individuals of African ancestry using nearest-neighbor feature selection
Genetic Epidemiology 46:463–474.

https://doi.org/10.1002/gepi.22490
- PubMed
- Google Scholar
1. Daya M
2. van der Merwe L
3. Galal U
4. Möller M
5. Salie M
6. Chimusa ER
7. Galanter JM
8. van Helden PD
9. Henn BM
10. Gignoux CR
11. Hoal E
(2013) A panel of ancestry informative markers for the complex five-way admixed South African coloured population
PLOS ONE 8:e82224.

https://doi.org/10.1371/journal.pone.0082224
- PubMed
- Google Scholar
(2014a) Using multi-way admixture mapping to elucidate TB susceptibility in the South African Coloured population
BMC Genomics 15:1021.

https://doi.org/10.1186/1471-2164-15-1021
- PubMed
- Google Scholar
(2014b) The role of ancestry in TB susceptibility of an admixed South African population
Tuberculosis 94:413–420.

https://doi.org/10.1016/j.tube.2014.03.012
- Google Scholar
(2014) Genotype harmonizer: automatic strand alignment and format conversion for genotype data integration
BMC Research Notes 7:901.

https://doi.org/10.1186/1756-0500-7-901
- PubMed
- Google Scholar
(2013) Improved whole-chromosome phasing for disease and population genetic studies
Nature Methods 10:5–6.

https://doi.org/10.1038/nmeth.2307
- PubMed
- Google Scholar
1. Demerath EW
2. Liu C-T
3. Franceschini N
4. Chen G
5. Palmer JR
6. Smith EN
7. Chen CTL
8. Ambrosone CB
9. Arnold AM
10. Bandera EV
11. Berenson GS
12. Bernstein L
13. Britton A
14. Cappola AR
15. Carlson CS
16. Chanock SJ
17. Chen W
18. Chen Z
19. Deming SL
20. Elks CE
21. Evans MK
22. Gajdos Z
23. Henderson BE
24. Hu JJ
25. Ingles S
26. John EM
27. Kerr KF
28. Kolonel LN
29. Le Marchand L
30. Lu X
31. Millikan RC
32. Musani SK
33. Nock NL
34. North K
35. Nyante S
36. Press MF
37. Rodriquez-Gil JL
38. Ruiz-Narvaez EA
39. Schork NJ
40. Srinivasan SR
41. Woods NF
42. Zheng W
43. Ziegler RG
44. Zonderman A
45. Heiss G
46. Gwen Windham B
47. Wellons M
48. Murray SS
49. Nalls M
50. Pastinen T
51. Rajkovic A
52. Hirschhorn J
53. Adrienne Cupples L
54. Kooperberg C
55. Murabito JM
56. Haiman CA
(2013) Genome-wide association study of age at menarche in African-American women
Human Molecular Genetics 22:3329–3346.

https://doi.org/10.1093/hmg/ddt181
- PubMed
- Google Scholar
Book
1. Diehl K
2. Von O
(1936)
Der Erbeinfluss Bei Der Tuberkulose

Gustav Fischer.
- Google Scholar
(2018) Tuberculosis, social determinants and co-morbidities (including HIV)
Pulmonology 24:115–119.

https://doi.org/10.1016/j.rppnen.2017.11.003
- PubMed
- Google Scholar
1. Elks CE
2. Perry JRB
3. Sulem P
4. Chasman DI
5. Franceschini N
6. He C
7. Lunetta KL
8. Visser JA
9. Byrne EM
10. Cousminer DL
11. Gudbjartsson DF
12. Esko T
13. Feenstra B
14. Hottenga J-J
15. Koller DL
16. Kutalik Z
17. Lin P
18. Mangino M
19. Marongiu M
20. McArdle PF
21. Smith AV
22. Stolk L
23. van Wingerden SH
24. Zhao JH
25. Albrecht E
26. Corre T
27. Ingelsson E
28. Hayward C
29. Magnusson PKE
30. Smith EN
31. Ulivi S
32. Warrington NM
33. Zgaga L
34. Alavere H
35. Amin N
36. Aspelund T
37. Bandinelli S
38. Barroso I
39. Berenson GS
40. Bergmann S
41. Blackburn H
42. Boerwinkle E
43. Buring JE
44. Busonero F
45. Campbell H
46. Chanock SJ
47. Chen W
48. Cornelis MC
49. Couper D
50. Coviello AD
51. d’Adamo P
52. de Faire U
53. de Geus EJC
54. Deloukas P
55. Döring A
56. Smith GD
57. Easton DF
58. Eiriksdottir G
59. Emilsson V
60. Eriksson J
61. Ferrucci L
62. Folsom AR
63. Foroud T
64. Garcia M
65. Gasparini P
66. Geller F
67. Gieger C
68. GIANT Consortium
69. Gudnason V
70. Hall P
71. Hankinson SE
72. Ferreli L
73. Heath AC
74. Hernandez DG
75. Hofman A
76. Hu FB
77. Illig T
78. Järvelin M-R
79. Johnson AD
80. Karasik D
81. Khaw K-T
82. Kiel DP
83. Kilpeläinen TO
84. Kolcic I
85. Kraft P
86. Launer LJ
87. Laven JSE
88. Li S
89. Liu J
90. Levy D
91. Martin NG
92. McArdle WL
93. Melbye M
94. Mooser V
95. Murray JC
96. Murray SS
97. Nalls MA
98. Navarro P
99. Nelis M
100. Ness AR
101. Northstone K
102. Oostra BA
103. Peacock M
104. Palmer LJ
105. Palotie A
106. Paré G
107. Parker AN
108. Pedersen NL
109. Peltonen L
110. Pennell CE
111. Pharoah P
112. Polasek O
113. Plump AS
114. Pouta A
115. Porcu E
116. Rafnar T
117. Rice JP
118. Ring SM
119. Rivadeneira F
120. Rudan I
121. Sala C
122. Salomaa V
123. Sanna S
124. Schlessinger D
125. Schork NJ
126. Scuteri A
127. Segrè AV
128. Shuldiner AR
129. Soranzo N
130. Sovio U
131. Srinivasan SR
132. Strachan DP
133. Tammesoo M-L
134. Tikkanen E
135. Toniolo D
136. Tsui K
137. Tryggvadottir L
138. Tyrer J
139. Uda M
140. van Dam RM
141. van Meurs JBJ
142. Vollenweider P
143. Waeber G
144. Wareham NJ
145. Waterworth DM
146. Weedon MN
147. Wichmann HE
148. Willemsen G
149. Wilson JF
150. Wright AF
151. Young L
152. Zhai G
153. Zhuang WV
154. Bierut LJ
155. Boomsma DI
156. Boyd HA
157. Crisponi L
158. Demerath EW
159. van Duijn CM
160. Econs MJ
161. Harris TB
162. Hunter DJ
163. Loos RJF
164. Metspalu A
165. Montgomery GW
166. Ridker PM
167. Spector TD
168. Streeten EA
169. Stefansson K
170. Thorsteinsdottir U
171. Uitterlinden AG
172. Widen E
173. Murabito JM
174. Ong KK
175. Murray A
(2010) Thirty new loci for age at menarche identified by a meta-analysis of genome-wide association studies
Nature Genetics 42:1077–1085.

https://doi.org/10.1038/ng.714
- PubMed
- Google Scholar
1. Fox GJ
2. Orlova M
3. Schurr E
4. Bliska JB
(1929) Tuberculosis in newborns: The lessons of the “lübeck disaster” (1929–1933)
PLOS Pathogens 12:e1005271.

https://doi.org/10.1371/journal.ppat.1005271
- Google Scholar
1. Grant AV
2. Sabri A
3. Abid A
4. Abderrahmani Rhorfi I
5. Benkirane M
6. Souhi H
7. Naji Amrani H
8. Alaoui-Tahiri K
9. Gharbaoui Y
10. Lazrak F
11. Sentissi I
12. Manessouri M
13. Belkheiri S
14. Zaid S
15. Bouraqadi A
16. El Amraoui N
17. Hakam M
18. Belkadi A
19. Orlova M
20. Boland A
21. Deswarte C
22. Amar L
23. Bustamante J
24. Boisson-Dupuis S
25. Casanova JL
26. Schurr E
27. El Baghdadi J
28. Abel L
(2016) A genome-wide association study of pulmonary tuberculosis in Morocco
Human Genetics 135:299–307.

https://doi.org/10.1007/s00439-016-1633-2
- PubMed
- Google Scholar
1. Han B
2. Eskin E
(2011) Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies
American Journal of Human Genetics 88:586–598.

https://doi.org/10.1016/j.ajhg.2011.04.014
- PubMed
- Google Scholar
(2011) The social determinants of tuberculosis: from evidence to action
American Journal of Public Health 101:654–662.

https://doi.org/10.2105/AJPH.2010.199505
- PubMed
- Google Scholar
1. Hong EP
2. Go MJ
3. Kim HL
4. Park JW
(2017) Risk prediction of pulmonary tuberculosis using genetic and conventional risk factors in adult Korean population
PLOS ONE 12:e0174642.

https://doi.org/10.1371/journal.pone.0174642
- PubMed
- Google Scholar
1. Houben RMGJ
2. Dodd PJ
(2016) The global burden of latent tuberculosis infection: A re-estimation using mathematical modelling
PLOS Medicine 13:e1002152.

https://doi.org/10.1371/journal.pmed.1002152
- PubMed
- Google Scholar
(2009) A flexible and accurate genotype imputation method for the next generation of genome-wide association studies
PLOS Genetics 5:e1000529.

https://doi.org/10.1371/journal.pgen.1000529
- PubMed
- Google Scholar
1. Kallmann FJ
2. Reisner D
(1943)
Twin studies on the significance of genetic factors in tuberculosis

American Review of Tuberculosis 47:547–549.
- Google Scholar
1. Kelly A
2. Trowsdale J
(2019) Genetics of antigen processing and presentation
Immunogenetics 71:161–170.

https://doi.org/10.1007/s00251-018-1082-2
- PubMed
- Google Scholar
1. Kent WJ
2. Sugnet CW
3. Furey TS
4. Roskin KM
5. Pringle TH
6. Zahler AM
7. Haussler D
(2002) The human genome browser at UCSC
Genome Research 12:996–1006.

https://doi.org/10.1101/gr.229102
- PubMed
- Google Scholar
(2017) The role of human host genetics in tuberculosis resistance
Expert Review of Respiratory Medicine 11:721–737.

https://doi.org/10.1080/17476348.2017.1354700
- PubMed
- Google Scholar
1. Li M
2. Hu Y
3. Zhao B
4. Chen L
5. Huang H
6. Huai C
7. Zhang X
8. Zhang J
9. Zhou W
10. Shen L
11. Zhen Q
12. Li B
13. Wang W
14. He L
15. Qin S
(2021) A next generation sequencing combined genome-wide association study identifies novel tuberculosis susceptibility loci in Chinese population
Genomics 113:2377–2384.

https://doi.org/10.1016/j.ygeno.2021.05.035
- PubMed
- Google Scholar
(2009) Drivers of tuberculosis epidemics: the role of risk factors and social determinants
Social Science & Medicine 68:2240–2246.

https://doi.org/10.1016/j.socscimed.2009.03.041
- PubMed
- Google Scholar
1. Luo T
2. Comas I
3. Luo D
4. Lu B
5. Wu J
6. Wei L
7. Yang C
8. Liu Q
9. Gan M
10. Sun G
11. Shen X
12. Liu F
13. Gagneux S
14. Mei J
15. Lan R
16. Wan K
17. Gao Q
(2015) Southern East Asian origin and coexpansion of Mycobacterium tuberculosis Beijing family with Han Chinese
PNAS 112:8136–8141.

https://doi.org/10.1073/pnas.1424063112
- PubMed
- Google Scholar
1. Luo Y
2. Suliman S
3. Asgari S
4. Amariuta T
5. Baglaenko Y
6. Martínez-Bonet M
7. Ishigaki K
8. Gutierrez-Arcelus M
9. Calderon R
10. Lecca L
11. León SR
12. Jimenez J
13. Yataco R
14. Contreras C
15. Galea JT
16. Becerra M
17. Nejentsev S
18. Nigrovic PA
19. Moody DB
20. Murray MB
21. Raychaudhuri S
(2019) Early progression to active tuberculosis is a highly heritable trait driven by 3q23 in Peruvians
Nature Communications 10:3765.

https://doi.org/10.1038/s41467-019-11664-1
- PubMed
- Google Scholar
1. Mägi R
2. Morris AP
(2010) GWAMA: software for genome-wide association meta-analysis
BMC Bioinformatics 11:288.

https://doi.org/10.1186/1471-2105-11-288
- PubMed
- Google Scholar
(2017) Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution
Human Molecular Genetics 26:3639–3650.

https://doi.org/10.1093/hmg/ddx280
- PubMed
- Google Scholar
1. Mahajan A
2. Go MJ
3. Zhang W
4. Below JE
5. Gaulton KJ
6. Ferreira T
7. Horikoshi M
8. Johnson AD
9. Ng MCY
10. Prokopenko I
11. Saleheen D
12. Wang X
13. Zeggini E
14. Abecasis GR
15. Adair LS
16. Almgren P
17. Atalay M
18. Aung T
19. Baldassarre D
20. Balkau B
21. Bao Y
22. Barnett AH
23. Barroso I
24. Basit A
25. Been LF
26. Beilby J
27. Bell GI
28. Benediktsson R
29. Bergman RN
30. Boehm BO
31. Boerwinkle E
32. Bonnycastle LL
33. Burtt N
34. Cai Q
35. Campbell H
36. Carey J
37. Cauchi S
38. Caulfield M
39. Chan JCN
40. Chang LC
41. Chang TJ
42. Chang YC
43. Charpentier G
44. Chen CH
45. Chen H
46. Chen YT
47. Chia KS
48. Chidambaram M
49. Chines PS
50. Cho NH
51. Cho YM
52. Chuang LM
53. Collins FS
54. Cornelis MC
55. Couper DJ
56. Crenshaw AT
57. van Dam RM
58. Danesh J
59. Das D
60. de Faire U
61. Dedoussis G
62. Deloukas P
63. Dimas AS
64. Dina C
65. Doney AS
66. Donnelly PJ
67. Dorkhan M
68. van Duijn C
69. Dupuis J
70. Edkins S
71. Elliott P
72. Emilsson V
73. Erbel R
74. Eriksson JG
75. Escobedo J
76. Esko T
77. Eury E
78. Florez JC
79. Fontanillas P
80. Forouhi NG
81. Forsen T
82. Fox C
83. Fraser RM
84. Frayling TM
85. Froguel P
86. Frossard P
87. Gao Y
88. Gertow K
89. Gieger C
90. Gigante B
91. Grallert H
92. Grant GB
93. Grrop LC
94. Groves CJ
95. Grundberg E
96. Guiducci C
97. Hamsten A
98. Han BG
99. Hara K
100. Hassanali N
101. Hattersley AT
102. Hayward C
103. Hedman AK
104. Herder C
105. Hofman A
106. Holmen OL
107. Hovingh K
108. Hreidarsson AB
109. Hu C
110. Hu FB
111. Hui J
112. Humphries SE
113. Hunt SE
114. Hunter DJ
115. Hveem K
116. Hydrie ZI
117. Ikegami H
118. Illig T
119. Ingelsson E
120. Islam M
121. Isomaa B
122. Jackson AU
123. Jafar T
124. James A
125. Jia W
126. Jöckel KH
127. Jonsson A
128. Jowett JBM
129. Kadowaki T
130. Kang HM
131. Kanoni S
132. Kao WHL
133. Kathiresan S
134. Kato N
135. Katulanda P
136. Keinanen-Kiukaanniemi KM
137. Kelly AM
138. Khan H
139. Khaw KT
140. Khor CC
141. Kim HL
142. Kim S
143. Kim YJ
144. Kinnunen L
145. Klopp N
146. Kong A
147. Korpi-Hyövälti E
148. Kowlessur S
149. Kraft P
150. Kravic J
151. Kristensen MM
152. Krithika S
153. Kumar A
154. Kumate J
155. Kuusisto J
156. Kwak SH
157. Laakso M
158. Lagou V
159. Lakka TA
160. Langenberg C
161. Langford C
162. Lawrence R
163. Leander K
164. Lee JM
165. Lee NR
166. Li M
167. Li X
168. Li Y
169. Liang J
170. Liju S
171. Lim WY
172. Lind L
173. Lindgren CM
174. Lindholm E
175. Liu CT
176. Liu JJ
177. Lobbens S
178. Long J
179. Loos RJF
180. Lu W
181. Luan J
182. Lyssenko V
183. Ma RCW
184. Maeda S
185. Mägi R
186. Männisto S
187. Matthews DR
188. Meigs JB
189. Melander O
190. Metspalu A
191. Meyer J
192. Mirza G
193. Mihailov E
194. Moebus S
195. Mohan V
196. Mohlke KL
197. Morris AD
198. Mühleisen TW
199. Müller-Nurasyid M
200. Musk B
201. Nakamura J
202. Nakashima E
203. Navarro P
204. Ng PK
205. Nica AC
206. Nilsson PM
207. Njølstad I
208. Nöthen MM
209. Ohnaka K
210. Ong TH
211. Owen KR
212. Palmer CNA
213. Pankow JS
214. Park KS
215. Parkin M
216. Pechlivanis S
217. Pedersen NL
218. Peltonen L
219. Perry JRB
220. Peters A
221. Pinidiyapathirage JM
222. Platou CG
223. Potter S
224. Price JF
225. Qi L
226. Radha V
227. Rallidis L
228. Rasheed A
229. Rathman W
230. Rauramaa R
231. Raychaudhuri S
232. Rayner NW
233. Rees SD
234. Rehnberg E
235. Ripatti S
236. Robertson N
237. Roden M
238. Rossin EJ
239. Rudan I
240. Rybin D
241. Saaristo TE
242. Salomaa V
243. Saltevo J
244. Samuel M
245. Sanghera DK
246. Saramies J
247. Scott J
248. Scott LJ
249. Scott RA
250. Segrè AV
251. Sehmi J
252. Sennblad B
253. Shah N
254. Shah S
255. Shera AS
256. Shu XO
257. Shuldiner AR
258. Sigurđsson G
259. Sijbrands E
260. Silveira A
261. Sim X
262. Sivapalaratnam S
263. Small KS
264. So WY
265. Stančáková A
266. Stefansson K
267. Steinbach G
268. Steinthorsdottir V
269. Stirrups K
270. Strawbridge RJ
271. Stringham HM
272. Sun Q
273. Suo C
274. Syvänen AC
275. Takayanagi R
276. Takeuchi F
277. Tay WT
278. Teslovich TM
279. Thorand B
280. Thorleifsson G
281. Thorsteinsdottir U
282. Tikkanen E
283. Trakalo J
284. Tremoli E
285. Trip MD
286. Tsai FJ
287. Tuomi T
288. Tuomilehto J
289. Uitterlinden AG
290. Valladares-Salgado A
291. Vedantam S
292. Veglia F
293. Voight BF
294. Wang C
295. Wareham NJ
296. Wennauer R
297. Wickremasinghe AR
298. Wilsgaard T
299. Wilson JF
300. Wiltshire S
301. Winckler W
302. Wong TY
303. Wood AR
304. Wu JY
305. Wu Y
306. Yamamoto K
307. Yamauchi T
308. Yang M
309. Yengo L
310. Yokota M
311. Young R
312. Zabaneh D
313. Zhang F
314. Zhang R
315. Zheng W
316. Zimmet PZ
317. Altshuler D
318. Bowden DW
319. Cho YS
320. Cox NJ
321. Cruz M
322. Hanis CL
323. Kooner J
324. Lee JY
325. Seielstad M
326. Teo YY
327. Boehnke M
328. Parra EJ
329. Chambers JC
330. Tai ES
331. McCarthy MI
332. Morris AP
(2014) Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility
Nature Genetics 46:234–244.

https://doi.org/10.1038/ng.2897
- PubMed
- Google Scholar
(2012) Genome-wide association studies of tuberculosis in Asians identify distinct at-risk locus for young tuberculosis
Journal of Human Genetics 57:363–367.

https://doi.org/10.1038/jhg.2012.35
- PubMed
- Google Scholar
1. McHenry ML
2. Bartlett J
3. Igo RP Jr
4. Wampande EM
5. Benchek P
6. Mayanja-Kizza H
7. Fluegge K
8. Hall NB
9. Gagneux S
10. Tishkoff SA
11. Wejse C
12. Sirugo G
13. Boom WH
14. Joloba M
15. Williams SM
16. Stein CM
(2020) Interaction between host genes and Mycobacterium tuberculosis lineage can affect tuberculosis severity: Evidence for coevolution?
PLOS Genetics 16:e1008728.

https://doi.org/10.1371/journal.pgen.1008728
- PubMed
- Google Scholar
1. McHenry ML
2. Benchek P
3. Malone L
4. Nsereko M
5. Mayanja-Kizza H
6. Boom WH
7. Williams SM
8. Hawn TR
9. Stein CM
(2021a) Resistance to TST/IGRA conversion in Uganda: Heritability and Genome-Wide Association Study
EBioMedicine 74:103727.

https://doi.org/10.1016/j.ebiom.2021.103727
- PubMed
- Google Scholar
1. McHenry ML
2. Wampande EM
3. Joloba ML
4. Malone LL
5. Mayanja-Kizza H
6. Bush WS
7. Boom WH
8. Williams SM
9. Stein CM
(2021b) Interaction between M. tuberculosis Lineage and Human Genetic variants reveals Novel Pathway Associations with severity of TB
Pathogens 10:1487.

https://doi.org/10.3390/pathogens10111487
- PubMed
- Google Scholar
(2017) The human Cranio Facial Development Protein 1 (Cfdp1) gene encodes a protein required for the maintenance of higher-order chromatin organization
Scientific Reports 7:45022.

https://doi.org/10.1038/srep45022
- PubMed
- Google Scholar
1. Micheni LN
2. Kassaza K
3. Kinyi H
4. Ntulume I
5. Bazira J
(2021) Diversity of Mycobacterium tuberculosis complex lineages associated with pulmonary tuberculosis in southwestern, uganda
Tuberculosis Research and Treatment 1:5588339.

https://doi.org/10.1155/2021/5588339
- PubMed
- Google Scholar
1. Möller M
2. Kinnear CJ
(2020) Human global and population-specific genetic susceptibility to Mycobacterium tuberculosis infection and disease
Current Opinion in Pulmonary Medicine 26:302–310.

https://doi.org/10.1097/MCP.0000000000000672
- PubMed
- Google Scholar
1. Müller SJ
2. Schurz H
3. Tromp G
4. van der Spuy GD
5. Hoal EG
6. van Helden PD
7. Owusu-Dabo E
8. Meyer CG
9. Muntau B
10. Thye T
11. Niemann S
12. Warren RM
13. Streicher E
14. Möller M
15. Kinnear C
(2021) A multi-phenotype genome-wide association study of clades causing tuberculosis in A Ghanaian- and South African cohort
Genomics 113:1802–1815.

https://doi.org/10.1016/j.ygeno.2021.04.024
- PubMed
- Google Scholar
(2018) Collider scope: when selection bias can substantially influence observed associations
International Journal of Epidemiology 47:226–235.

https://doi.org/10.1093/ije/dyx206
- PubMed
- Google Scholar
1. Naranbhai V
(2016) The role of host genetics (and genomics) in tuberculosis
Microbiology Spectrum 4:.

https://doi.org/10.1128/microbiolspec.TBTB2-0011-2016
- PubMed
- Google Scholar
(2017) Pathogen lineage-based genome-wide association study identified CD53 as susceptible locus in tuberculosis
Journal of Human Genetics 62:1015–1022.

https://doi.org/10.1038/jhg.2017.82
- PubMed
- Google Scholar
(2012) What should the genome-wide significance threshold be? Empirical replication of borderline genetic associations
International Journal of Epidemiology 41:273–286.

https://doi.org/10.1093/ije/dyr178
- PubMed
- Google Scholar
1. Pearce N
(2011) Epidemiology in a changing world: variation, causation and ubiquitous risk factors
International Journal of Epidemiology 40:503–512.

https://doi.org/10.1093/ije/dyq257
- PubMed
- Google Scholar
1. Peprah E
2. Xu H
3. Tekola-Ayele F
4. Royal CD
(2015) Genome-wide association studies in Africans and African Americans: expanding the framework of the genomics of human traits and disease
Public Health Genomics 18:40–51.

https://doi.org/10.1159/000367962
- PubMed
- Google Scholar
(2012) A genome wide association study of pulmonary tuberculosis susceptibility in Indonesians
BMC Medical Genetics 13:5.

https://doi.org/10.1186/1471-2350-13-5
- PubMed
- Google Scholar
1. Qi H
2. Zhang Y-B
3. Sun L
4. Chen C
5. Xu B
6. Xu F
7. Liu J-W
8. Liu J-C
9. Chen C
10. Jiao W-W
11. Shen C
12. Xiao J
13. Li J-Q
14. Guo Y-J
15. Wang Y-H
16. Li Q-J
17. Yin Q-Q
18. Li Y-J
19. Wang T
20. Wang X-Y
21. Gu M-L
22. Yu J
23. Shen A-D
(2017) Discovery of susceptibility loci associated with tuberculosis in Han Chinese
Human Molecular Genetics 26:4752–4763.

https://doi.org/10.1093/hmg/ddx365
- PubMed
- Google Scholar
1. Quistrebert J
2. Orlova M
3. Kerner G
4. Ton LT
5. Luong NT
6. Danh NT
7. Vincent QB
8. Jabot-Hanin F
9. Seeleuthner Y
10. Bustamante J
11. Boisson-Dupuis S
12. Huong NT
13. Ba NN
14. Casanova J-L
15. Delacourt C
16. Hoal EG
17. Alcaïs A
18. Thai VH
19. Thành LT
20. Abel L
21. Schurr E
22. Cobat A
(2021) Genome-wide association study of resistance to Mycobacterium tuberculosis infection identifies a locus at 10q26.2 in three distinct populations
PLOS Genetics 17:e1009392.

https://doi.org/10.1371/journal.pgen.1009392
- PubMed
- Google Scholar
Software
1. R Development Core Team
(2013) R: A language and environment for statistical computing
R Foundation for Statistical Computing, Vienna, Austria.

https://www.r-project.org
1. Salie M
2. van der Merwe L
3. Möller M
4. Daya M
5. van der Spuy GD
6. van Helden PD
7. Martin MP
8. Gao X-J
9. Warren RM
10. Carrington M
11. Hoal EG
(2014) Associations between human leukocyte antigen class I variants and the Mycobacterium tuberculosis subtypes causing disease
The Journal of Infectious Diseases 209:216–223.

https://doi.org/10.1093/infdis/jit443
- PubMed
- Google Scholar
1. Satorra A
2. Bentler PM
(2001) A scaled difference chi-square test statistic for moment structure analysis
Psychometrika 66:507–514.

https://doi.org/10.1007/BF02296192
- Google Scholar
1. Schurz H
2. Kinnear CJ
3. Gignoux C
4. Wojcik G
5. van Helden PD
6. Tromp G
7. Henn B
8. Hoal EG
9. Möller M
(2018) A sex-stratified genome-wide association study of tuberculosis using a multi-ethnic genotyping array
Frontiers in Genetics 9:678.

https://doi.org/10.3389/fgene.2018.00678
- PubMed
- Google Scholar
1. Schurz H
2. Müller SJ
3. van Helden PD
4. Tromp G
5. Hoal EG
6. Kinnear CJ
7. Möller M
(2019) Evaluating the accuracy of imputation methods in a five-way admixed population
Frontiers in Genetics 10:34.

https://doi.org/10.3389/fgene.2019.00034
- PubMed
- Google Scholar
(2009) The HLA genomic loci map: expression, interaction, diversity and disease
Journal of Human Genetics 54:15–39.

https://doi.org/10.1038/jhg.2008.5
- PubMed
- Google Scholar
1. Sia JK
2. Rengarajan J
(2019) Immunology of Mycobacterium tuberculosis infections
Microbiology Spectrum 7:2018.

https://doi.org/10.1128/microbiolspec.GPP3-0022-2018
- Google Scholar
1. Simmons JD
2. Stein CM
3. Seshadri C
4. Campo M
5. Alter G
6. Fortune S
7. Schurr E
8. Wallis RS
9. Churchyard G
10. Mayanja-Kizza H
11. Boom WH
12. Hawn TR
(2018) Immunological mechanisms of human resistance to persistent Mycobacterium tuberculosis infection
Nature Reviews. Immunology 18:575–589.

https://doi.org/10.1038/s41577-018-0025-3
- PubMed
- Google Scholar
1. Stein CM
2. Sausville L
3. Wejse C
4. Sobota RS
5. Zetola NM
6. Hill PC
7. Boom WH
8. Scott WK
9. Sirugo G
10. Williams SM
(2017) Genomics of human pulmonary tuberculosis: from genes to pathways
Current Genetic Medicine Reports 5:149–166.

https://doi.org/10.1007/s40142-017-0130-9
- PubMed
- Google Scholar
1. Sudmant PH
2. Rausch T
3. Gardner EJ
4. Handsaker RE
5. Abyzov A
6. Huddleston J
7. Zhang Y
8. Ye K
9. Jun G
10. Fritz MH-Y
11. Konkel MK
12. Malhotra A
13. Stütz AM
14. Shi X
15. Casale FP
16. Chen J
17. Hormozdiari F
18. Dayama G
19. Chen K
20. Malig M
21. Chaisson MJP
22. Walter K
23. Meiers S
24. Kashin S
25. Garrison E
26. Auton A
27. Lam HYK
28. Mu XJ
29. Alkan C
30. Antaki D
31. Bae T
32. Cerveira E
33. Chines P
34. Chong Z
35. Clarke L
36. Dal E
37. Ding L
38. Emery S
39. Fan X
40. Gujral M
41. Kahveci F
42. Kidd JM
43. Kong Y
44. Lameijer E-W
45. McCarthy S
46. Flicek P
47. Gibbs RA
48. Marth G
49. Mason CE
50. Menelaou A
51. Muzny DM
52. Nelson BJ
53. Noor A
54. Parrish NF
55. Pendleton M
56. Quitadamo A
57. Raeder B
58. Schadt EE
59. Romanovitch M
60. Schlattl A
61. Sebra R
62. Shabalin AA
63. Untergasser A
64. Walker JA
65. Wang M
66. Yu F
67. Zhang C
68. Zhang J
69. Zheng-Bradley X
70. Zhou W
71. Zichner T
72. Sebat J
73. Batzer MA
74. McCarroll SA
75. 1000 Genomes Project Consortium
76. Mills RE
77. Gerstein MB
78. Bashir A
79. Stegle O
80. Devine SE
81. Lee C
82. Eichler EE
83. Korbel JO
(2015) An integrated map of structural variation in 2,504 human genomes
Nature 526:75–81.

https://doi.org/10.1038/nature15394
- PubMed
- Google Scholar
(2016) HLA class II sequence variants influence tuberculosis risk in populations of European ancestry
Nature Genetics 48:318–322.

https://doi.org/10.1038/ng.3498
- PubMed
- Google Scholar
1. The Wellcome Trust Case Control Consortium
(2007) Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
Nature 447:661–678.

https://doi.org/10.1038/nature05911
- Google Scholar
1. Thu KS
2. Sato N
3. Ikeda S
4. Naka-Mieno M
5. Arai T
6. Mori S
7. Sawabe M
8. Muramatsu M
9. Tanaka M
(2016) Association of polymorphisms of the transporter associated with antigen processing (TAP2) gene with pulmonary tuberculosis in an elderly Japanese population
APMIS 124:675–680.

https://doi.org/10.1111/apm.12562
- PubMed
- Google Scholar
1. Thye T
2. Vannberg FO
3. Wong SH
4. Owusu-Dabo E
5. Osei I
6. Gyapong J
7. Sirugo G
8. Sisay-Joof F
9. Enimil A
10. Chinbuah MA
11. Floyd S
12. Warndorff DK
13. Sichali L
14. Malema S
15. Crampin AC
16. Ngwira B
17. Teo YY
18. Small K
19. Rockett K
20. Kwiatkowski D
21. Fine PE
22. Hill PC
23. Newport M
24. Lienhardt C
25. Adegbola RA
26. Corrah T
27. Ziegler A
28. African TB Genetics Consortium
29. Wellcome Trust Case Control Consortium
30. Morris AP
31. Meyer CG
32. Horstmann RD
33. Hill AVS
(2010) Genome-wide association analyses identifies a susceptibility locus for tuberculosis on chromosome 18q11.2
Nature Genetics 42:739–741.

https://doi.org/10.1038/ng.639
- PubMed
- Google Scholar
1. Thye T
2. Owusu-Dabo E
3. Vannberg FO
4. van Crevel R
5. Curtis J
6. Sahiratmadja E
7. Balabanova Y
8. Ehmen C
9. Muntau B
10. Ruge G
11. Sievertsen J
12. Gyapong J
13. Nikolayevskyy V
14. Hill PC
15. Sirugo G
16. Drobniewski F
17. van de Vosse E
18. Newport M
19. Alisjahbana B
20. Nejentsev S
21. Ottenhoff THM
22. Hill AVS
23. Horstmann RD
24. Meyer CG
(2012) Common variants at 11p13 are associated with susceptibility to tuberculosis
Nature Genetics 44:257–259.

https://doi.org/10.1038/ng.1080
- PubMed
- Google Scholar
1. Wampande EM
2. Naniima P
3. Mupere E
4. Kateete DP
5. Malone LL
6. Stein CM
7. Mayanja-Kizza H
8. Gagneux S
9. Boom WH
10. Joloba ML
(2019) Genetic variability and consequence of Mycobacterium tuberculosis lineage 3 in Kampala-Uganda
PLOS ONE 14:e0221644.

https://doi.org/10.1371/journal.pone.0221644
- PubMed
- Google Scholar
1. Wang X
2. Ma A
3. Han X
4. Litifu A
5. Xue F
(2018) ASAP1 gene polymorphisms are associated with susceptibility to tuberculosis in a Chinese Xinjiang Muslim population
Experimental and Therapeutic Medicine 15:3392–3398.

https://doi.org/10.3892/etm.2018.5800
- Google Scholar
Website
1. WHO
(2020) Global tuberculosis report
Accessed October 14, 2020.

https://www.who.int/publications/i/item/9789240013131
1. Wu Y
2. Yu H
3. Zheng SL
4. Feng B
5. Kapron AL
6. Na R
7. Boyle JL
8. Shah S
9. Shi Z
10. Ewing CM
11. Wiley KE
12. Luo J
13. Walsh PC
14. Carter HB
15. Helfand BT
16. Cooney KA
17. Xu J
18. Isaacs WB
(2018) Germline mutations in PPFIBP2 are associated with lethal prostate cancer
The Prostate 78:1222–1228.

https://doi.org/10.1002/pros.23697
- PubMed
- Google Scholar
1. Yang J
2. Lee SH
3. Goddard ME
4. Visscher PM
(2011) GCTA: A tool for genome-wide complex trait analysis
American Journal of Human Genetics 88:76–82.

https://doi.org/10.1016/j.ajhg.2010.11.011
- PubMed
- Google Scholar
1. Zhang M
2. Wang X
3. Zhu Y
4. Chen S
5. Chen B
6. Liu Z
(2021) Associations of genetic variants at TAP1 and TAP2 with pulmonary tuberculosis risk among the Chinese population
Epidemiology and Infection 149:e79.

https://doi.org/10.1017/S0950268821000613
- PubMed
- Google Scholar
1. Zheng X
2. Shen J
3. Cox C
4. Wakefield JC
5. Ehm MG
6. Nelson MR
7. Weir BS
(2014) HIBAG--HLA genotype imputation with attribute bagging
The Pharmacogenomics Journal 14:192–200.

https://doi.org/10.1038/tpj.2013.18
- PubMed
- Google Scholar
Book
1. Zheng X
(2018) Imputation-based HLA typing with SNPs in GWAS studies
In: Boegel S, editors. HLA Typing: Methods and Protocols. Springer. pp. 163–176.

https://doi.org/10.1007/978-1-4939-8546-3
- Google Scholar
1. Zheng R
2. Li Z
3. He F
4. Liu H
5. Chen J
6. Chen J
7. Xie X
8. Zhou J
9. Chen H
10. Wu X
11. Wu J
12. Chen B
13. Liu Y
14. Cui H
15. Fan L
16. Sha W
17. Liu Y
18. Wang J
19. Huang X
20. Zhang L
21. Xu F
22. Wang J
23. Feng Y
24. Qin L
25. Yang H
26. Liu Z
27. Cui Z
28. Liu F
29. Chen X
30. Gao S
31. Sun S
32. Shi Y
33. Ge B
(2018) Genome-wide association study identifies two risk loci for tuberculosis in Han Chinese
Nature Communications 9:4072.

https://doi.org/10.1038/s41467-018-06539-w
- PubMed
- Google Scholar

Article and author information

Author details

Haiko Schurz

DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa

Contribution
Conceptualization, Data curation, Investigation, Methodology, Writing – original draft, Writing – review and editing

For correspondence
haikoschurz@gmail.com

Competing interests
No competing interests declared

Additional information
co-first authors

"This ORCID iD identifies the author of this article:" 0000-0002-0009-3409
Vivek Naranbhai
1. Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
2. Massachusetts General Hospital, Boston, United States
3. Dana-Farber Cancer Institute, Boston, United States
4. Centre for the AIDS Programme of Research in South Africa, Durban, South Africa
5. Harvard Medical School, Boston, United States
Contribution
Conceptualization, Data curation, Formal analysis, Methodology, Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared

Additional information
co-first authors
Tom A Yates

Division of Infection and Immunity, Faculty of Medical Sciences, University College London, London, United Kingdom

Contribution
Supervision, Methodology, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-6081-1767
James Gilchrist
1. Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
2. Department of Paediatrics, University of Oxford, Oxford, United Kingdom
Contribution
Supervision, Methodology, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-2045-6788
Tom Parks
1. Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
2. Department of Infectious Diseases Imperial College London, London, United Kingdom
Contribution
Supervision, Methodology, Writing – review and editing

Competing interests
No competing interests declared
Peter J Dodd

School of Health and Related Research, University of Sheffield, Sheffield, United Kingdom

Contribution
Supervision, Methodology, Writing – review and editing

Competing interests
No competing interests declared
Marlo Möller

DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa

Contribution
Resources, Supervision, Methodology, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-0805-6741
Eileen G Hoal

DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa

Contribution
Resources, Supervision, Methodology, Writing – review and editing

Competing interests
No competing interests declared
Andrew P Morris

Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, The University of Manchester, Manchester, United Kingdom

Contribution
Software, Supervision, Methodology, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-6805-6014
Adrian VS Hill
1. Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
2. Jenner Institute, University of Oxford, Oxford, United Kingdom
Contribution
Resources, Supervision, Methodology, Writing – review and editing

Competing interests
No competing interests declared
International Tuberculosis Host Genetics Consortium

Contribution
Conceptualization, Data curation, Writing – review and editing

Competing interests
No competing interests declared
1. Haiko Schurz, DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University,, Cape Town, South Africa
2. Vivek Naranbhai, Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
3. Tom A Yates, Division of Infection and Immunity, Faculty of Medical Sciences, University College, London, United Kingdom
4. James J Gilchrist, Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
5. Tom Parks, Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
6. Peter J Dodd, Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, The University of Manchester, Manchester, United Kingdom
7. Marlo Möller, DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University,, Cape Town, South Africa
8. Eileen G Hoal, DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University,, Cape Town, South Africa
9. Andrew P Morris, Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, The University of Manchester, Manchester, United Kingdom
10. Adrian VS Hill, Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
11. Reinout van Crevel, Department of Internal Medicine and Radboud Center for Infectious Diseases, Radboud University Medical Center, Nijmegen, Netherlands
12. Arjan van Laarhoven, Department of Internal Medicine and Radboud Center for Infectious Diseases, Radboud University Medical Center, Nijmegen, Netherlands
13. Tom HM Ottenhoff, Head Lab Dept of Infectious Diseases; Head Group Immunology and Immunogenetics of Bacterial Infectious Diseases Leiden University Medical Center, Leiden, Netherlands
14. Andres Metspalu, Estonian Genome Center, Institute of Genomics, University of Tartu, Tartu, Estonia
15. Reedik Magi, Estonian Genome Center, Institute of Genomics, University of Tartu, Tartu, Estonia
16. Christian G Meyer, Institute of Tropical Medicine, Eberhard-Karls University Tübingen, Tübingen, Germany
17. Magda Ellis, Tuberculosis Research Group, Centenary Institute, Sydney, Australia
18. Thorsten Thye, School of Health and Related Research, University of Sheffield, Sheffield, United Kingdom
19. Surakameth Mahasirimongkol, Department of Medical Sciences, Ministry of Public Health, Nonthaburi, Thailand
20. Ekawat Pasomsub, Virology Laboratory, Department of Pathology, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
21. Katsushi Tokunaga, Genome Medical Science Project, National Center for Global Health and Medicine, Tokyo, Japan
22. Yosuke Omae, Genome Medical Science Project, National Center for Global Health and Medicine, Tokyo, Japan
23. Hideki Yanai, Fukujuji Hospital and Research Institute of Tuberculosis, Japan Anti-Tuberculosis Association, Kiyose, Japan
24. Taisei Mushiroda, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
25. Michiaki Kubo, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
26. Atsushi Takahashi, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
27. Yoichiro Kamatani, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
28. Bachti Alisjahbana, Faculty of Medicine, Universitas Padjdjaran - Hasan Sadikin Hospital, Bandung, Indonesia
29. Wei Liu, Department of Plastic and Reconstructive Surgery, Shanghai Key Laboratory of Tissue Engineering, Shanghai Ninth People’s Hospital, Shanghai Jiao Tong University – School of Medicine, Shanghai, China
30. A-dong Sheng, National Clinical Research Center for Respiratory Diseases, National Key Discipline of Pediatrics, Capital Medical University, Beijing, China
31. Yurong Yang, Ningxia Medical University, Ningxia Hui Autonomous Region, Ningxia, China

Funding

National Institute for Health Research (Academic Clinical Lectureship)

James Gilchrist

Versus Arthritis (21754)

Andrew P Morris

Medical Research Council (MR/P022081/1)

Peter J Dodd

National Institute for Health Research (NIHR Clinical Lecturer)

Tom A Yates

National Institute for Health Research (CL-2020-21-001)

Tom Parks

Wellcome

https://doi.org/10.35802/222098

Tom Parks

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication. For the purpose of Open Access, the authors have applied a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission.

Acknowledgements

Computation used the Oxford Biomedical Research Computing (BMRC) facility, a joint development between the Wellcome Centre for Human Genetics and the Big Data Institute supported by Health Data Research UK and the NIHR Oxford Biomedical Research Centre. Financial support was provided by the Wellcome Trust Core Award Grant Number 203141/Z/16/Z. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health and Social Care. This work was partly supported by a Grant in-Aid for Scientific Research (B) (KAKENHI 21406006) from Japan Society for the Promotion of Science (JSPS). The clinical information and samples in Thailand, in this part, were supported by JSPS KAKENHI 17256005 and later by research grant from the Ministry of Health, Labor and Welfare (MHLW) H21-aids-12. We would like to thank all the subjects and the members of the Rotary Club of Osaka-Midosuji District 2660 Rotary International in Japan who donated their DNA for this work. We thank all members of BioBank Japan, Institute of Medical Science, The University of Tokyo, and of RIKEN Center for Genomic Medicine for their contribution to the completion of our study. This work was conducted as a part of the BioBank Japan Project that was supported by the Ministry of Education, Culture, Sports, Science and Technology of the Japanese government. As for Thai samples, we thank all of the staff and collaborators of the TB/HIV Research Project, Thailand, a research project between the Research Institute of Tuberculosis, the Japan Anti-tuberculosis Association, and the Thai Ministry of Public Health for collecting clinical data and DNA samples. We thank the German Consortium 'TB or not TB Network' (https://www.tbornottb.de/), which was responsible for collecting the German TB samples. We acknowledge the support of the DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa. This research was funded in whole, or in part, by the Wellcome Trust. For the purpose of open access, the author has applied a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission. JJG is funded by an NIHR Academic Clinical Lectureship. APM acknowledges support from Versus Arthritis (grant reference 21754). PJD was supported by a fellowship from the UK Medical Research Council (MR/P022081/1); this UK-funded award is part of the EDCTP2 program supported by the European Union. ME was supported by an NHMRC fellowship (552496). The research was supported by the NHMRC grant 1025166. AvL and RvC are supported by the National Institute of Allergy and Infectious Diseases at NIH [R01 AI136921]. TAY is an NIHR Clinical Lecturer supported by the National Institute for Health Research. TP acknowledges funding from the National Institute for Health Research (CL-2020-21-001) and the Wellcome Trust (222098/Z/20/Z). The views expressed in this publication are those of the author(s) and not necessarily those of the NHS, the National Institute for Health Research, or the Department of Health and Social Care. AM and RM are funded by the EU project no. 2014-2020.4.01.15-0012 'Gentransmed'. BA is supported by the 'Scientific Programme Indonesia Netherlands' (SPIN) under the Royal Academy of Arts and Sciences (KNAW), the Netherlands.

Ethics

A research collaboration agreement was signed by all contributors. Ethics approval for the meta-analysis presented here was granted by the Health Research Ethics Committee of Stellenbosch University (project registration number S17/01/013). In addition, all institutions involved in the ITHGC have ethics approval for their respective studies: China 1 and 2: The study protocol was approved by the Ethics Committee of the Beijing Chest Hospital, the 309 Hospital of the PLA, Shijiazhuang Fifth Hospital, the China PLA General Hospital, the Tongliao TB institute and the Center for Diseases Control and Prevention in Jalainuoer. China 3: Ethics approval was granted by the Ethics Committees of the Beijing Children's Hospital, the Beijing Geriatric Hospital, the Tuberculosis Hospital in Shaanxi Province, the Beijing Institute of Genomics, Chinese Academy of Sciences and the Center for Disease Control and Prevention of Jiangsu Province. Thailand: Ethics approval was granted by the Ethics Review Committee of the Ministry of Public Health in Thailand. Japan: Ethics approval was granted by the Institutional Review Board of the Center for Genomic Medicine, RIKEN Russia: Blood samples from all participants were collected and studied with written informed consent according to the Declaration of Helsinki and with approvals from the local ethics committees in Russia (St. Petersburg and Samara) and the UK (Human Biological Resource Ethics Committee of the University of Cambridge and the National Research Ethics Service, Cambridgeshire 1 REC, 10/H0304/71). Estonia: The Estonian Bioethics and Human Research Council (EBIN) approved the Estonian Genome Center study reported in this manuscript. Germany: The study protocol was approved by the ethics committee (EC) of the University of Luebeck, Germany (reference 07-125), and was adopted by other ethics committees covering all 18 participating centres (EC of the medical faculty of the University of Goettingen; EC of the Medical Council of Hessen, Frankfurt /Main; EC of the Medical Council Hamburg; EC of the Medical Council Lower Saxony, Hannover; EC of the Medical Faculty Carl Gustav Carus, Technical University of Dresden; EC of the Medical Council Berlin; EC of the Medical Council Bavaria, Munich; EC of the Medical Faculty, Friedrich-Alexander-University Erlangen-Nuremberg; EC of the Medical Faculty of the University of Regensburg; EC of the University of Witten/ Herdecke) Gambia: Ethics approval was granted by the Medical Research Council (MRC) and the Gambian government joint ethical committee. Ghana: Ethics approval was granted by the Committee on Human Research, Publications and Ethics, School of Medical Sciences, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana, and the Ethics Committee of the Ghana Health Service, Accra, Ghana. RSA A and RSA M: Ethics approval was granted by the Health Research Ethics Committee of Stellenbosch University (project registration numbers S17/01/013, NO6/07/132 and 95/072).

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.