Recently, loss-of-function variants in TLR7 were identified in two families in which COVID-19 segregates like an X-linked recessive disorder environmentally conditioned by SARS-CoV-2. We investigated whether the two families represent the tip of the iceberg of a subset of COVID-19 male patients.
This is a nested case-control study in which we compared male participants with extreme phenotype selected from the Italian GEN-COVID cohort of SARS-CoV-2-infected participants (<60 y, 79 severe cases versus 77 control cases). We applied the LASSO Logistic Regression analysis, considering only rare variants on young male subsets with extreme phenotype, picking up TLR7 as the most important susceptibility gene.
Overall, we found TLR7 deleterious variants in 2.1% of severely affected males and in none of the asymptomatic participants. The functional gene expression profile analysis demonstrated a reduction in TLR7-related gene expression in patients compared with controls demonstrating an impairment in type I and II IFN responses.
Young males with TLR7 loss-of-function variants and severe COVID-19 represent a subset of male patients contributing to disease susceptibility in up to 2% of severe COVID-19.
Funded by private donors for the Host Genetics Research Project, the Intesa San Paolo for 2020 charity fund, and the Host Genetics Initiative.
Coronavirus disease 2019 (COVID-19), a potentially severe systemic disease caused by coronavirus SARS-CoV-2, is characterized by a highly heterogeneous phenotypic presentation, with the large majority of infected individuals experiencing only mild or no symptoms. However, severe cases can rapidly evolve toward a critical respiratory distress syndrome and multiple organ failure (Wu and McGoogan, 2020). COVID-19 still represents an enormous challenge for the world's healthcare systems almost 1 year after the first appearance in December 2019 in Wuhan, Huanan, Hubei Province of China. Although older age and the presence of cardiovascular or metabolic comorbidities have been identified as risk factors predisposing to severe disease (Hägg et al., 2020), these factors alone do not fully explain differences in severity (Stokes et al., 2020). Stokes EK et al. reported that male patients show more severe clinical manifestations than females with a statistically significant (p<0.00001) higher prevalence of hospitalizations (16% versus 12%), ICU admissions (3% versus 2%), and deaths (6% versus 5%) (Stokes et al., 2020). These results are in line with other reports indicating that gender may influence disease outcome (Garg et al., 2020; Goodman et al., 2020).
These findings suggest a role of host predisposing genetic factors in the pathogenesis of the disease, which may be responsible for different clinical outcomes as a result of different antiviral defense mechanisms as well as specific receptor permissiveness to virus and immunogenicity.
Recent evidence suggests a fundamental role of interferon genes in modulating immunity to SARS-CoV-2; in particular, rare variants have recently been identified in the interferon type I pathway that are responsible for inborn errors of immunity in a small proportion of patients and auto-antibodies against type I interferon genes in up to 10% of severe COVID-19 cases (Zhang et al., 2020; Bastard et al., 2020).
Toll-like receptors (TLRs) are crucial components in the initiation of innate immune responses to a variety of pathogens, causing the production of pro-inflammatory cytokines (TNF-α, IL-1, and IL-6) and type I and II Interferons (IFNs), that are responsible for innate antiviral responses. In particular, the innate immunity is very sensitive in detecting potential pathogens, activating downstream signaling to induce transcription factors in the nucleus, promoting synthesis and release of type I and type II IFNs in addition to a number of other proinflammatory cytokines, and leading to a severe cytokine release syndrome which may be associated with a fatal outcome. Interestingly, among the different TLRs, TLR7 recognizes several single-stranded RNA viruses including SARS-CoV-2 (Poulas et al., 2020). We previously showed that another RNA virus, hepatitis C virus (HCV), is able to inhibit CD4 T cell function via Toll-like receptor 7 (TLR7) (Mele et al., 2017). Recently, van der Made et al., 2020 have reported two independent families in which COVID-19 segregates like an X-linked recessive monogenic disorder conditioned by SARS-CoV-2 as an environmental factor.
Here, we performed a nested case-control study within our prospectively recruited GEN-COVID cohort with the aim to determine whether the two families described by van der Made et al. represent an ultra-rare situation or the tip of the iceberg of a larger subset of young male patients.
A subset of 156 young (<60 years) male COVID-19 patients was selected from the Italian GEN-COVID cohort of 1,178 SARS-CoV-2-infected participants (https://sites.google.com/dbm.unisi.it/gen-covid) (Daga et al., 2021). The study (GEN-COVID) was consistent with Institutional guidelines and approved by the University Hospital (Azienda Ospedaliero-Universitaria Senese) Ethical Review Board, Siena, Italy (Prot n. 16929, dated March 16, 2020). We performed a nested case-control study (STREGA reporting guideline was used to support reporting of this study). Cases were selected according to the following inclusion criteria: i. male gender; ii. young age (<60 years); iii endotracheal intubation or CPAP/biPAP ventilation (79 participants). As controls, 77 participants were selected using the sole criterion of being oligo-asymptomatic not requiring hospitalization. Cases and controls represented the extreme phenotypic presentations of the GEN-COVID cohort. Exclusion criteria for both cases and controls were: i. SARS-CoV-2 infection not confirmed by PCR; ii. non-white ethnicity. Materials and methods details are listed in the Online Repository. A similar cohort from the second wave, composed of 83 young male COVID-19 patients, was used to expand the cohort.
We adopted the LASSO logistic regression, one of the most common Machine Learning algorithms for classification, that provides a feature selection method within the classification task able to enforce both the sparsity and the interpretability of the results (Tibshirani, 1996). In fact, the coefficients of the logistic regression model are directly related to the importance of the corresponding features, and LASSO regularization shrinks close to zero the coefficients of features that are not relevant in predicting the response, reducing overfitting and giving immediate interpretability of the model predictions in terms of few feature importance.
The principal components analysis (PCA) was applied prior to the LASSO logistic regression in order to remove samples that were clear outliers with respect to the first three principal components from the following analyses (deviating more than five standard deviations from the average).
A 10-fold cross-validation method was applied in order to test the performances. It provides the partition of the dataset into 10 batches, then nine batches are exploited for the training of the LASSO logistic regression and the remaining batch as a test, by repeating this procedure 10 times. The performance metrics are averaged on the 10 testing sets in order to avoid overfitting. The confusion matrix is built by summing up the predictions of the 10 testing folds. During the fitting procedure, the class unbalancing is tackled by penalizing the misclassification of the minority class with a multiplicative factor inversely proportional to the class frequencies.
In order to evaluate the significance of the association between TLR7 variants and COVID severity, the Fisher’s Exact Test was used.
For the quantitative PCR assay, the fold changes in mRNA expression level per gene were compared between the individual patients and controls using an unpaired t test on the log-transformed fold changes. p Values < 0.05 were considered statistically significant.
Peripheral blood mononuclear cells (PBMC) were isolated by Ficoll‐Hypaque (GE Healthcare Bio-Sciences AB) density gradient centrifugation as previously described (Mantovani et al., 2019). 5 × 105 PBMC from COVID-19 patients 6 months after recovery and six unaffected male and female controls were stimulated for 4 hr with the TLR7 agonist imiquimod at 5 μg/mL or cell culture medium. Total RNA extraction was performed with RNeasy Plus Mini kit and gDNA eliminator mini spin columns (QIAGEN, Hilden, Germany), following the manufacturer's instructions. First-strand cDNA was synthesized from total RNA using High-Capacity cDNA Reverse Transcription Kit following the manufacturer's instructions (Thermo Fisher Scientific, Waltham, Massachusetts, United States). The Advanced Universal SYBR Green Supermix (BioRad, Redmond, WA, United States) was used. All reactions were performed in triplicates using the CFX96 Real-Time machine detection system (BioRad, Redmond, WA, United States) and each sample was amplified in duplicate. The following primers were used:
A total of 2.5 × 105 PBMC from COVID-19 patients and healthy controls were maintained in RPMI-1640 supplemented with 10% of FCS, 1% antibiotic antimycotic solution, 1% L-glutamine and 1% Sodium Pyruvate (Sigma-Aldrich, St. Louis, MO, USA) and stimulated in vitro for 4 hr with Lipopolysaccharide (LPS) at 1 μg/ml or cell culture medium and the Protein Transport Inhibitor GolgiStop (BD Biosciences, San Diego, CA, USA). After washing, PBMC were stained for surface cell marker using mouse anti-CD14PerCP-Cy5.5 (BD Biosciences) and anti-CD3BV605 (BD Biosciences) monoclonal antibody (mAb). Cells were fixed with BD Cytofix/Cytoperm and permeabilized with the BD Perm/Wash buffer (BD Biosciences) according to the manufacturer's instructions, in the presence of anti-IL6BV421 (BD Biosciences) mAb. Ex-vivo TLR7 intracellular expression was evaluated in PBMC from patients and controls by flow cytometry. 2,5 × 105 PBMC were stained for surface markers using anti-CD19BV605, anti-CD14PerCP-Cy5.5 and anti-CD3BV421 (BD Biosciences) mAbs. Cells were fixed and permeabilized in the presence of anti-TLR7 Alexa Fluor 488 (R and D System, Minneapolis, MN, USA) mAb or isotype control as described above. After staining cells were washed, immediately fixed in CellFix solution (BD Biosciences) and analysed. Cell acquisition was performed on a 12-color FACSCelesta (BD Biosciences, San Diego, CA, USA) instrument. Data analysis was performed with the Kaluza 2.1 software (Beckman Coulter).
The protein structure of Human Toll Like Receptor, UniProtKB ID Q9NYK1 [https://www.uniprot.org/uniprot/Q9NYK1], was obtained by homology modeling using Swiss Model tool (Waterhouse et al., 2018). The selected template protein with 97% of sequence identity was the Crystal structure of monkey TLR7 with PDB ID 5GMF [https://www.rcsb.org/structure/5GMF]. The two Val to Asp missense mutations were analysed by using different protein stability predictors like Polyphen-2 (Adzhubei et al., 2010), SIFT (Ng and Henikoff, 2003), and DynaMut (Rodrigues et al., 2018).
PCR based site-directed mutagenesis was performed in pUNO-hTLR7 plasmid (Invivogen), kindly provided by Ugo D’Oro (GSK Vaccines, Siena, Italy) (Iavarone et al., 2011), to generate specific plasmids for each TLR7 variant, including those considered neutral (mutagenic primers available on request).
All point mutations except for p.Arg920Lys were confirmed by Sanger sequencing. HEK293 cells were maintained in DMEM supplemented with 10% FBS, 1% L-Glutamine and 1% penicillin/streptomycin at 37°C with 5% CO2. Transient transfections were performed using Lipofectamine 2000 (Invitrogen) according to manufacturer’s instructions: 3 × 105 cell/well were seeded the day before, and then transfected with 2 μg of DNA. After 24 hr, the cells were stimulated with Imiquimod at 1 μg/ml for 4 hr and then total RNA was extracted with RNeasy Mini Kit (QIAGEN, Hilden, Germany). For each sample, cDNA was synthesized from 1 μg of total RNA using QantiTect Reverse Transcription kit (QIAGEN, Hilden, Germany) according to manufacturer’s instructions. The expression of IFN-a in stimulated and unstimulated cells was evaluated by qRT-PCR using the same procedure as described for PBMCs.
We applied LASSO logistic regression analysis, after correcting for Principal Components, to a synthetic boolean representation of the entire set of genes of the X chromosome on the extreme phenotypic ends of the male subset of the Italian GEN-COVID cohort (https://sites.google.com/dbm.unisi.it/gen-covid) (Daga et al., 2021). The GEN-COVID study was consistent with Institutional guidelines and approved by the University Hospital (Azienda Ospedaliero-Universitaria Senese) Ethical Review Board, Siena, Italy (Prot n. 16929, dated March 16, 2020). Only rare variants (≤1% in European Non-Finnish population) were considered in the boolean representation: the gene was set to one if it included at least a missense, splicing, or loss-of-function rare variant, and 0 otherwise. Fisher Exact test was then used for the specific data validation.
Toll-like receptor 7 (TLR7) was picked up as one of the most important susceptibility genes by LASSO Logistic Regression analysis (Figure 1). We then queried the COVID-19 section of the Network of Italian Genome (NIG) database (http://www.nig.cineca.it/, specifically, http://nigdb.cineca.it) that houses the entire GEN-COVID cohort represented by more than 1000 WES data of COVID-19 patients and SARS-CoV-2 infected asymptomatic participants (Bastard et al., 2020). By selecting for young (<60 year-old) males, we obtained rare (MAF ≤ 1%) TLR7 missense variants predicted to impact on protein function (CADD > 12.28) in 5 out of 79 male patients (6.3%) with life-threatening COVID-19 (hospitalized intubated and hospitalized CPAP/BiPAP) and in none of the 77 SARS-CoV2 infected oligo-asymptomatic male participants.
We then investigated a similar cohort coming from the Italian second wave composed of male patients under 60 years of age without comorbidities (56 cases and 27 controls) was used to expand the cohort. All participants were white European. We found a TLR7 variant in one of 56 cases (1.7%) and in none of 27 controls. Overall, the association between the presence of TLR7 rare variants and severe COVID-19 was significant (p=0.037 by Fisher Exact test, Table 1).
We then investigated the presence of TLR7 rare variants in the entire male cohort of 561 COVID-19 patients (261 cases and 300 controls) regardless of age. We found TLR7 rare missense variants in three additional patients over 60 years of age, including two cases (who shared the p.Ala1032Thr variant) and one control (C1), bearing the p.Val222Asp variant, predicted to have a low impact on protein function (CADD of 5.36) (Table 2).
In order to functionally link the presence of the identified TLR7 missense variants and the effect on the downstream type I IFN-signaling, we performed a gene expression profile analysis in peripheral blood mononuclear cells (PBMCs) isolated from patients following recovery, after stimulation with the TLR7 agonist imiquimod, as reported by van der Made et al., 2020. To explore all TLR7 variants identified, we examined PBMCs from the control and all cases except P4 and P6 because them were not available. However, P4 and P5 shared the same variant. This analysis showed a statistically significant decrease of all TLR7-related genes for two variants (Ser301Pro and Ala1032Thr) identified in cases P3, P7, and P8 compared with healthy controls (Ctl) demonstrating a complete impairment of TLR7 signaling pathways in response to TLR7 stimulation (Figure 2, panel A and Table 2). The variant Val219Ile (P1) showed a hypomorphic effect determining a statistically significant decrease in mRNA levels only for IRF7 (directly activated by TLR7) and IFN-γ (Figure 2, panel A). Two Ala to Val variants identified in severely affected patients, Ala288Val and Ala448Val, were functionally neutral, that is not predicted to impair the TLR7 signaling pathways. This was confirmed by biochemical and structural analysis on the crystal structure of TLR7 protein (https://www.uniprot.org/uniprot/Q9NYK1). The prediction performed with different computational approaches showed both variants as benign with no effects on structural stabilization. Interestingly, the p.Val222Asp variant (C1) proved to be functionally neutral, in keeping with it being identified in the control and not in cases (Figure 2, panel A).
TLR7 expression was evaluated in monocytes and B cells from patients and healthy controls by flow cytometry. Patients and controls expressed the TLR7 protein at the intracellular level. The functional capacity of PBMCs was evaluated after stimulation with the TLR4 agonist lipopolysaccharide (LPS). Of note, LPS-induced production of IL6 by monocytes was similar in patients and controls (data not shown).
In order to validate the functional effect of TLR7 variants, we have performed transfection experiments in HEK293 cells, cloning a dedicated TLR7 plasmid for each of them. Transfection experiments were performed in HEK293 cells that do not express endogenous TLR7 (Chehadeh and Alkhabbaz, 2013) and expression of TLR7 protein was examined by flow cytometry 24 hr after transfection, showing expression of TLR7 protein at the intracellular level in all cases (Figure 2, panel B). We then evaluated the expression of IFN-a in imiquimod stimulated and unstimulated cells by qRT-PCR employing the same assay described for PBMCs, confirming the results obtained in PBMCs for the screened variants (Figure 2, panel C).
Segregation analysis was available for two cases, P3 and P8 (Figure 3). In the two pedigrees, the disease nicely segregated as an X-linked disorder conditioned by environmental factors, that is SARS-CoV-2 (Figure 3, panel B). This was also supported by functional analysis on all TLR7-related genes (Figure 3, panel A). For example, expression profile analysis for IRF7 gene in male mutated patient P8 confirmed a statistically significant reduction compared to the wild-type brother (Figure 3, panel A). Of note, only the infected mutated male had severe COVID-19, whereas the infected not mutated brother (II-2 of P8) was asymptomatic (Figure 3, panel C).
Our results showed that the two families reported by van der Made et al., 2020. with loss-of-function variants in males with severe COVID-19 with a mean age of 26 years represent a subset of COVID-19 male patients. Specifically, missense deleterious variants in the X-linked recessive TLR7 gene may represent the cause of disease susceptibility to COVID-19 in up to 2% of severely affected young male cases (3/135, 2.2%). The same result was obtained for the entire male cohort, irrespective of age, with TLR7 deleterious variants in 5/261 cases (1.9%). Since not all identified variants were functionally effective, the true percentage could be slightly lower in young males. Overall, males with rare missense variants shown here developed COVID-19 at a mean age of 56.5 years, considerably later than 26 years, in agreement with a predicted smaller impact on the protein than the loss of function deleterious variants reported by van der Made et al., 2020. Similarly, the identified rare missense TLR7 variants impaired the mRNA expression of TLR7 as well as the downstream pathway. The observation reported here may lead to consider TLR7 screening in severely affected male patients in order to start personalized interferon treatment for those with this specific genetic disorder.
Employing a systematic approach to biobanking and analyzing clinical and genetic data for advancing COVID-19 researchEuropean Journal of Human Genetics 1:1–15.https://doi.org/10.1038/s41431-020-00793-7
Hospitalization rates and characteristics of patients hospitalized with Laboratory-Confirmed coronavirus disease 2019 - COVID-NET, 14 states, march 1-30, 2020MMWR. Morbidity and Mortality Weekly Report 69:458–464.https://doi.org/10.15585/mmwr.mm6915e3
Age, frailty, and comorbidity as prognostic factors for Short-Term outcomes in patients with coronavirus disease 2019 in geriatric careJournal of the American Medical Directors Association 21:1555–1559.https://doi.org/10.1016/j.jamda.2020.08.014
A point mutation in the amino terminus of TLR7 abolishes signaling without affecting ligand bindingThe Journal of Immunology 186:4213–4222.https://doi.org/10.4049/jimmunol.1003585
SIFT: predicting amino acid changes that affect protein functionNucleic Acids Research 31:3812–3814.https://doi.org/10.1093/nar/gkg509
DynaMut: predicting the impact of mutations on protein conformation, flexibility and stabilityNucleic Acids Research 46:W350–W355.https://doi.org/10.1093/nar/gky300
Coronavirus disease 2019 case surveillance - United states, January 22-May 30, 2020MMWR. Morbidity and Mortality Weekly Report 69:759–765.https://doi.org/10.15585/mmwr.mm6924e2
Regression shrinkage and selection via the lassoJournal of the Royal Statistical Society: Series B 58:267–288.https://doi.org/10.1111/j.2517-6161.1996.tb02080.x
SWISS-MODEL: homology modelling of protein structures and complexesNucleic Acids Research 46:W296–W303.https://doi.org/10.1093/nar/gky427
Frank L van de VeerdonkReviewing Editor; University Medical Center, Netherlands
Jos WM van der MeerSenior Editor; University Medical Centre, Netherlands
In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.
The authors provide solid evidence for the role of Toll-like receptor 7 in host defense against SARS Coronavirus-2. Based on the initial observation by Van der Made et al. (JAMA 324:1-22, 2020) that mutations in TLR-7 may lead to severe and even lethal COVID in young males, the authors found missense deleterious TLR-7 mutations in some 2 % of severe COVID male patients. In these patients there is a severe impairment of the Type-I and type-II interferon responses.
Decision letter after peer review:
Congratulations, we are pleased to inform you that your article, "Association of Toll-like receptor 7 variants with life-threatening COVID-19 disease in males", has been accepted for publication in eLife.https://doi.org/10.7554/eLife.67569.sa1
[Editors' note: we include below the reviews that the authors received from another journal, along with the authors’ responses.]
Editor's specific comments:
Please see the reviewers' comments below.
Reviewer #1: Major comments:
The authors should include a section on Statistical Methods that includes
a brief mention of Fisher's exact test for Table 1
a brief rationale for the use of LASSO logistic regression (with a reference
a brief explanation of why principal components analysis was applied prior to the LASSO logistic regression
a description of the cross-validation method and construction of the confusion matrix (with a reference)
Added in the Online Repository file.
Fallerini et al. study TLR7 variants in males with mild compared with severe COVID19 infections in an Italian and Spanish cohort.
The methods suggest that 1,178 patients were included in the analysis, while it was 156 Italians and 122 Spanish. The fact that all were white European should be noted. Refine. The PBMC analysis of gene expression should also be noted.
A subset of 156 <60-year old male COVID-19 patients was selected from the Italian GENCOVID cohort of 1,178 SARS-CoV-2-infected subjects. We refined it in the text. We have now specified that all individuals were of European Caucasoid ethnicity in the Abstract and in the text as well as for PBMC analysis of gene expression.
Capsule summary – this section should be rewritten to highlight the key results in a quantitative format. Introductory statements should be removed.
Agreed and modified as suggested.
"strong predisposing factors". This statement should be toned down and be more precise as only 4% of the affected cohort had this variant.
As suggested by the reviewer, we have toned down the statement in the Capsule Summary.
Quantitate the relative risk of severe disease in males compared with females
More information has been provided to quantitate the relative risk of severe disease in males compared with females.
Reference 4. This reference is from 2004, when SARS-CoV-2 was not around. Amend.
Agreed and the reference replaced, updating the paragraph in the text.
The fact that details of methods are in an online repository should be stated.
5 out of 79 patients – add percentage
As suggested by the reviewer, the percentage (6.32%) has been added.
Round 0.0366 to 0.04.
Describe results in more detail/quantitatively
Agreed and rephrased as suggested.
Round 57.5 to 58 years
Table 1 – change "Marginal Row totals" to "total" in column and row headings. Round 0.0366 to 0.04.
Done (Table 1).
Table 2 – Describe in footer what is meant by "clinical category" 3 and 4.
Done (Table 2).
Figure 1 Refine some sentences in this figure legend focusing on the facts pertaining to the figure, without interpretation. Panel B currently comes after panel C – suggest reordering; or removing as it's significant is unclear.
Agreed and modified as suggested by the reviewer. More specifically, legend to Figure 1 is now more factual, and figure panels have been reordered and coordinated with the legend.
Figure 1 – Panel B might be removed. Panel E- it is unclear which line refers to which – revise.
As suggested by the reviewer, Figure 1 has been refined with the reordering of the panels and the exclusion of Panel E. Panel C (ex Panel B), reporting the confusion matrix, could be useful for the evaluation of the number of false negative/false positive of the classification.
Figure 2 – focus on comparison between affected patients and Ctl, not C1 – amend in all panels. For clarity, leave out comparison between C1 and patients – just comment in text.
We agree on the changes proposed for Figure 2 and modified statistics accordingly.
Figure 3 – table – reorder with generation I members first, then generation II members and finally generation III members. With females in the pedigree, clarify whether or not they required any hospital treatment.
As suggested by the reviewer, the generations in Figure 3 have been rearranged. We have specified in the text that females did not require hospital admission.
In this manuscript, the authors report a higher frequency of rare TLR7 variants in younger (<60 years) males with life-threatening COVID-19 than in a control group with asymptomatic or oligosymptomatic infection. PBMC from three patients with TLR7 variants and life-threatening disease, from one subject with TLR7 variant and oligosymptomatic infection and from 4 healthy controls were challenged in vitro with imiquimod (a TLR7 agonist), and impaired expression of IRF7 was demonstrated in PBMC from patients with life-threatening disease. The authors conclude that deleterious TLR7 variants may account for up to 4% of severe disease in male subjects.
The study expands on a recent observation of two families in which COVID-19 segregated as an Xlinked recessive trait conditioned by SARS-CoV-2 infection.
Overall, the study is interesting. However, some of the conclusions are overstated. Some methodological aspects need to be better defined. the organization of the manuscript should be improved, and reference to recent important findings by other groups on monogenic variants associated with life-threatening COVOD-19 must be added.
1) Some of the conclusions raised by the authors are overstated. In particular, in Figure 2, IRF7 and IFN- are the only transcripts that appear to be differentially expressed between patients with life threatening disease and healthy controls. For TLR7, this difference exists only between controls and P8, and for ISG15 between the healthy controls vs. P3 and P8.
As suggested by the reviewer, we have now tested all TLR7 variants and modified the conclusions according to our recent findings. Thus we showed a significant impairment of TLR7 signalling pathway in the Ser301Pro, His630Tyr, and Ala1032Thr variants.
In the text, the authors emphasize the difference in the expression of these genes between patients with life-threatening disease and the oligosymptomatic SARS-CoV-2 infected patients, but this is not relevant if there is no difference versus healthy controls. Furthermore, there are technical and methodological weaknesses that need to be addressed. In particular, expression of TLR7 protein should be examined by flow cytometry.
We thank the reviewer for raising this important point. Indeed, we have partially replied above to this question by reviewer 2 and we modified Figure 2 according to his/her comment. It is important to emphasize that functional experiments were carried out in those patients from whom PBMC were available for further experiments. Expression of TLR7 protein has now been examined by flow cytometry in monocytes and B cells from patients and healthy controls showing that both expressed the TLR7 protein at the intracellular level.
To formally prove that these TLR7 variants are loss of function (or hylomorphic), transfection experiments should be performed in TLR7-deficient cells, and response to the TLR7 agonist should be examined.
We thank the reviewer for this suggestion. We believe that transfections are really important when primary cells cannot be retrieved from mutated patients. The availability of PBMC carrying the different TLR7 variants identified in this study would make transfections redundant. Notably, in this revised version, we were able to expand the number of variants analyzed.
Finally, it is not known at what point in the course of the disease PBMC from the patients were collected.
PBMC from all patients were collected approximately 6 months after recovery.
An impaired response may also reflect the specific functional status of the cells in that particular moment of the infection. This is why the transfection experiments mentioned above are particularly important.
To evaluate the functional status of the cell, we stimulated PBMC from patients and healthy controls with the TLR4 agonist lipopolysaccharide (LPS). The intracellular production of IL6 was evaluated in monocytes. The frequencies of IL6+CD14+ cells were comparable in patients and healthy controls demonstrating that the cells of the patients were functionally active.
2) Important recent advances in the genetic basis of COVID-19 have been neglected, perhaps because the manuscript was submitted around the time when these discoveries were made publicly available. In any case, the recent description of deleterious variants in genes involved in type I IFN synthesis or signaling to these molecules (Zhang et al., Science 2020) should be cited and commented
As suggested by the reviewer, a proper section and the relative reference to the Zhang Q. paper has been added.
3) The manuscript suffers from some organizational deficiencies. Figure 1 is cited only once in the text, but it is composed of multiple panels which are not properly mentioned and commented. The legend to this figure reported first on panel A, then on panel C (before mentioning panel B).
We thank the reviewer for the suggestion. As also requested by reviewer 2, Figure 1 has been refined and panels reordered with the exclusion of ROC curves (Panel E). Legend to Figure 1 is now clearer and coherent with the order of panels.
4) Table 2 reports on the Clinical Category of the patients, however no mention is made in the text in regard to how were the clinical categories defined
As also suggested by reviewer 2, we have added a footer to Table 2 carrying a detailed description of the clinical categories and of all abbreviations listed in the table.
5) Patients from Spain were included to expand the number of patients studied. Mention of approval from the local Institutional Review Board(s) is missing for this patient population.
We have now mentioned the Spanish Institutional Review Board approval in the Online Repository file and in the main text.
6) Segregation of the disease in the family of P6 is shown in Figure 3. However, these are only circumstantial supportive data (due to the fact that only few individuals from this family were infected with SARS-CoV-2). As such, the figure should be moved to Supplementary. If the authors insist on commenting on it, then data on X-chromosome inactivation in PBMC lineages from female carriers of the TLR7 variant should be provided.
In addition to the segregation of the disease in the family of P6, we have performed segregation analysis also in a further available pedigree (family of P3) confirming previous findings. As suggested by the reviewer, we have also provided a functional analysis for all TLR7-related genes in both families (Figure 3).
This study points to a possible risk factor of severe Covid-19 in males carrying variants of TLR7, thus confirming and potentially extending a previously published study (van der Made et al).
A serious limitation is that only 2 variants (P2 and P3) have been functionally validated.
Thank you for this valuable comment. We have now functionally validated all TLR7 variants (see above responses to other reviewers).
Data in Figure 2 for P1 do not convincingly show a functional effect of that particular variant (Val. 219 Ile).
We agree with the reviewer that the TLR7 variant carried by patient P1 (Val219Ile) has a smaller functional impact suggesting a hypomorphic effect.
If no material is available for the other patients, it is feasible to express the variants in cell lines and to test them. An assay of interferon type I production will be overall more convincing.
We have now tested PBMC from 7 of 8 cases and, since the missing patient (P4) carried the same mutation of P5, we have now functionally validated all TLR7 variants. IFN-ɑ (type IIFN) was analyzed as gene expression (Figure 2).
– There is an inconsistency in figures, i.e number of at risk cases 150 (table 1) or 156 (text)
The total number of severely affected males was 156 (as mentioned in the text). Among them, 150 subjects did not have mutations in the TRL7 gene (Table 1).
– Were the 77 controls from the first cohort males ?
Yes, all SARS-CoV-2-infected subjects included in the analysis from both cohorts were males. We have now clarified this in the text.
– Figure 3 is actually anecdotal especially since the TLR7 variant present in this family has not been functionally validated
Segregation analysis was confirmed in two distinct pedigrees from Italy and Spain patients (P3 and P6) and also supported by functional analysis in all TLR7-related genes (Figure 3).
– The recent paper by Zhang Q. et al. on interferon I pathway variants as risk factors for severe Covid-19 should be cited (Science, 2020 (6515):abbd4570).
Response to second decision letter
Thank you to the authors for addressing my previous queries. There remain some further comments that I suggest need addressing. Line numbers refer to clean untracked manuscript:
Suggest being more specific regarding number / percentage (round all percentages to one decimal places) of patients in the Italian, Spanish and total cohort that had pathogenic TLR7 gene variants – e.g. Italian – 2/79 (2.5%) (not 5), Spanish – 1/77 (1.3%), total ?/272 (?%) – - not clear how many there were in the entire male cohort.
All percentages were rounded to one decimal place.
We have removed the numbers from the Abstract and explained better in the text: 3/156 (1,9%) pathogenic TLR7 gene variants in severely affected young males and 5/261 (1,9%) in the entire male case cohort, irrespective of age.
Two reported families is not a "fraction". What is meant by the "broader and complex host genome situation”. Suggest revising these conclusions to be more accurate and clear. Remove word "significantly".
Clinical implications – Revise to be more specific and focused "This new yet complex scenario" means little to the reader.
Capsular summary – what was the exact size of the total cohort studied?
The Italian young male cohort includes 156 patients and the Spanish one 122 patients. We have revised the sentence.
Percentages of male and female ICU admission and deaths are almost the same – add in significance levels for this and also hospitalization data. If not significant state this.
5 out of 79 patients – Table suggests that 2 of this cohort had pathogenic variants? Round percentages to one decimal place.
Were these "rare missense mutations" considered pathogenic? How many patients were there in the entire cohort – ?272?
Among the additional “rare missense mutations” found in the entire male cohort of 561 COVID-19 individuals (regardless of age), the one found in the cases has been shown to be LOF (p.Ala1032Thr) and the one found in the control has been shown to be neutral (p.Val222Asp). We have revised the sentence to make it clearer.
2% is rather small percentage of the total – suggest toning down "tip of the iceberg" and being more focused revising phrase "broader genome scenario".
Round percentage to one decimal place. 3/156 does not equate to figures given in results text above. Also does not seem to include the other two TLR7 variants found in the older males – were the variants in the older males considered pathogenic or just VUS? If the later, you might consider separating the analysis and conclusions to focus on males under 60?
Percentages were rounded to one decimal place. The two older males shared the same mutation (p.Ala1032Thr) that has been shown to be LOF. We have updated the text to make the paragraph clearer.
Table 1 – Suggest changing N of mutated patients to 3 and the terminology to pathogenic variants?
Table 1 refers to the statistical analysis of sequencing data done before functional studies on all variants.
Table 2 – Suggest listing just the 3 patients with pathogenic variants (a), or otherwise putting the VUS in a separate part of the table (b)
The table has been divided, grouping the LOF mutations together followed by the Hypo mutation and the neutral two.
Figure 3B – can you add data on the 3rd pedigree (2nd Italian family) with a pathogenic variant.
Done. We added the 3rd pedigree in Figure 3.
In the revised version of the manuscript, the authors have tones down some statements and corrected some errors as per the reviewers' recommendations. They have also added new data to address other comments, however far from clarifying the observations raised by the reviewers, these new data raise new important questions and fail to demonstrate internal consistency.
1) This reviewer had requested that the authors perform transfection experiments of TLR7 variants into TLR7 knock-out cells in order to demonstrate causality. The authors have argued that availability of patient PBMC is sufficient to address this point, as it allows functional testing. There are two problems with this. First, unless rescue experiments are performed in the patient cells, it is not possible to conclude that the functional effects are directly related to the TLR7 variants. Second, and more importantly, the TLR7 protein expression data produced by the authors in the response to reviewers (and cited as "data not shown in the text") are inconsistent with the mRNA data included in Figure 2 and 3. In particular, TLR7 mRNA expression was markedly reduced in P3, P6, P7 and P8 as compared to controls. However, TLR7 protein expression was no different in P6 and in controls. While for P7 one could conclude that TLR7 protein expression was reduced, no data are provided for P3 and P8. Although different experimental conditions were used to analyze TLR7 mRNA and protein expression, it is very hard to reconcile normal TLR7 protein expression, but markedly reduced mRNA expression, in P6. These data require a more robust experimental setting, and confirm the importance of using transfection experiments.
Regarding TLR7 expression in PBMCs, there was a misunderstanding. Figure 2 and Figure 3 refer to the mRNA fold change (activated/basal mRNA levels ratio) and not to absolute mRNA levels. Therefore, these results are not comparable with protein expression data.
Transfection experiments are usually requested when (i) patient cells are not available for every mutation presented; (ii) it is the first time that a gene is associated with a disorder. We have shown the effect of each variant in patient-specific cells and the gene has already been associated with the disease (ref. 8). Thus, functional analysis in patients’ and control PBMC represent a robust outcome to support our conclusions.
However, we have considered the request of the reviewer and, in this short time, we have performed transfection experiments for the variants expected to have a functional effect, cloning a dedicated TLR7 plasmid for each of them. PCR based site-directed mutagenesis was performed in pUNO-hTLR7 plasmid (Invivogen) to generate specific plasmids for the single variants. Transfection experiments were performed in HEK293 cells that do not express endogenous TLR7 (Chehadeh and Alkhabbaz 2013). Cells were maintained in DMEM supplemented with 10% FBS, 1% L-Glutamine and 1% pen/strep at 37°C with 5% CO2. Transient transfections were performed using Lipofectamine 2000 (Invitrogen) according to manufacturer’s protocol: 3x104 cell/well were seeded the day before in 6 well plates, and then transfected with 2μg of DNA. Expression of TLR7 protein was examined by flow cytometry 24 hours after transfection, showing expression of TLR7 protein at the intracellular level in all cases (Figure 2B).
After 24 hours from transfection, the cells were stimulated in duplicate experiments with Imiquimod at 1μg/ml for 4 hours and then total RNA was extracted with RNeasy Mini Kit (Qiagen), according to manufacturer’s protocol. cDNA was synthesized from 1μg of total RNA using QuantiTect Reverse Transcription kit (Qiagen) according to the manufacturer’s instructions. We evaluated expression of IFN-ɑ in Imiquimod stimulated and unstimulated cells by qRT-PCR using the same assay described for PBMCs, confirming the results obtained in PBMCs (Figure 2C).
2) The segregation data shown in Figure 3 are not meaningful and do not provide substantial support to the authors' claims. In particular, for pedigree II, also males who did not inherit the TLR7 variant should be tested for IFN-a, ISG15 and IFN-g mRNA expression. Without this essential internal control, the data provided do not help. Incidentally, labels on the X-axis of all mRNA expression data in Figure 3 are misaligned.
Done. Figure 3 has been updated and X-axis labels aligned.
1) Throughout the manuscript, the authors should avoid use of the word “mutation” and replace it with “variant” or “deleterious variant” as appropriate
Response to third decision letter
Reviewer #2: Thank you for addressing most of my previous comments and suggestions. A few comments remain:
1) In Abstract Results – detail number numerator/denominator (percentage) of pathological TLR7 variants found in the overall affected groups.
The requested detail has been added.
2) Authors should be able to calculate statistical significance of gender differences in the Stokes et al. paper themselves using online Chi-square calculator from the raw data in the paper – suggest amending sentence rather than say "even if they reported descriptive analyses without statistical comparisons"
Statistical significance has been calculated and the sentence has been modified
3) Percentages should be x.y, rather than x,y.
4) In text and Table IIa – suggest removing details regarding gene variants that have no (neutral) functional/clinical significance and highlighting only predicted pathological variants, as non pathological variants are of no clinical relevance / not disease causing. Remove Table IIb. Figure legends will need to be adjusted accordingly.
We have removed Table 2B as requested.
However, we did not remove information on neutral variants since these variants were not previously published and we performed structural and functional analyses to validate their functionality. We thus feel that their characterization could be an added value to the paper.
5) Table I: add percentage affected in column (N. mutated patients). Suggest revising headings to "N. WT variants; N. pathological variants", rather than mutated patients.
Minor comments: The authors have adequately revised the manuscript. They have also performed transfection experiments and tested experimentally the functional effects of the TLR7 variants identified. These are very important data that support the authors'; conclusions. Surprisingly, they have elected to show them only in the point-by-point reply to the reviewers. These data should be added to the main manuscript (as Supplementary data, if so needed), because they provide strong support to the authors' findings.
As suggested from the reviewer, we have added results of transfection experiments to the manuscript as panel B and C to new Figure 2. Text and figure legend have been modified accordingly. Experimental details have been added in the “online repository file”.https://doi.org/10.7554/eLife.67569.sa2
- Alessandra Renieri
- Alessandra Renieri
- Alessandra Renieri
- Alessandra Renieri
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
This study is part of the GEN-COVID Multicenter Study, https://sites.google.com/dbm.unisi.it/gen-covid, the Italian multicenter study aimed at identifying the COVID-19 host genetic bases. Specimens were provided by the COVID-19 Biobank of Siena, which is part of the Genetic Biobank of Siena, member of BBMRI-IT, of Telethon Network of Genetic Biobanks (project no. GTB18001), of EuroBioBank, and of RD-Connect. We thank the CINECA consortium for providing computational resources and the Network for Italian Genomes (NIG) http://www.nig.cineca.it for its support. We thank private donors for the support provided to AR (Department of Medical Biotechnologies, University of Siena) for the COVID-19 host genetics research project (D.L n.18 of March 17, 2020). We also thank the COVID-19 Host Genetics Initiative (https://www.covid19hg.org/), MIUR project ‘Dipartimenti di Eccellenza 2018–2020’ to the Department of Medical Biotechnologies University of Siena, Italy, and ‘Bando Ricerca COVID-19 Toscana’ project to Azienda Ospedaliero-Universitaria Senese. We also thank Intesa San Paolo for the 2020 charity fund dedicated to the project N B/2020/0119 ‘Identificazione delle basi genetiche determinanti la variabilità clinica della risposta a COVID-19 nella popolazione italiana’.
Clinical trial registration NCT04549831.
Human subjects: The GEN-COVID study was consistent with Institutional guidelines and approved by the University Hospital (Azienda Ospedaliero-Universitaria Senese) Ethical Review Board, Siena, Italy (Prot n. 16929, dated March 16, 2020).
- Jos WM van der Meer, University Medical Centre, Netherlands
- Frank L van de Veerdonk, University Medical Center, Netherlands
© 2021, Fallerini et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
eLife has published the following articles on SARS-CoV-2 and COVID-19.
Hybridization is a major evolutionary force that can erode genetic differentiation between species, whereas reproductive isolation maintains such differentiation. In studying a hybrid zone between the swallowtail butterflies Papilio syfanius and Papilio maackii (Lepidoptera: Papilionidae), we made the unexpected discovery that genomic substitution rates are unequal between the parental species. This phenomenon creates a novel process in hybridization, where genomic regions most affected by gene flow evolve at similar rates between species, while genomic regions with strong reproductive isolation evolve at species-specific rates. Thus, hybridization mixes evolutionary rates in a way similar to its effect on genetic ancestry. Using coalescent theory, we show that the rate-mixing process provides distinct information about levels of gene flow across different parts of genomes, and the degree of rate-mixing can be predicted quantitatively from relative sequence divergence () between the hybridizing species at equilibrium. Overall, we demonstrate that reproductive isolation maintains not only genomic differentiation, but also the rate at which differentiation accumulates. Thus, asymmetric rates of evolution provide an additional signature of loci involved in reproductive isolation.