STAT3-mediated allelic imbalance of novel genetic variant Rs1047643 and B-cell-specific super-enhancer in association with systemic lupus erythematosus
Figures
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig1-v2.tif/full/617,/0/default.jpg)
Schematic of the study design.
On the basis of the functional genomic data feature, a two-stage study was designed. Summary of data sets is available in Supplementary files 1-2.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig2-v2.tif/full/617,/0/default.jpg)
Change of allelic chromatin accessibility and expression in B cell subtypes from SLE patients and controls.
(A) Forest plot showing allelic imbalance (AI) in the chromatin state of SNP rs1047643 in both resting naive (rN) and activated (Non-rN) B cells in patients with SLE compared with healthy controls. The p-value per study and the combined p-value (summary) are calculated based on the linear regression model and Fisher’s method, respectively. The plot in the right panel displays the 95% confidence interval of the beta-value. (B–C) Boxplots showing allelic expression of SNP rs1047643 in both rN and activated B cells in patients with SLE as compared with healthy individuals. All raw data are available in Figure 2—source data 1.
-
Figure 2—source data 1
Source files for presenting results in Figure 2.
This zip archive contains all source data used for the quantitative analyses shown in Figure 2.
- https://cdn.elifesciences.org/articles/72837/elife-72837-fig2-data1-v2.zip
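The combined p-value in Figure 2A is described as coming from Fisher's method. As a minimal sketch of that technique (not the authors' code; the function name `fisher_combined_p` and the example p-values are hypothetical), per-study p-values can be combined as follows, using the closed form of the chi-square survival function for even degrees of freedom so that only the standard library is needed:

```python
import math


def fisher_combined_p(p_values):
    """Combine independent p-values with Fisher's method.

    Hypothetical helper for illustration; inputs are per-study p-values.
    """
    if not p_values:
        raise ValueError("need at least one p-value")
    if any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")

    k = len(p_values)
    # Fisher's statistic X = -2 * sum(ln p_i) follows a chi-square
    # distribution with 2k degrees of freedom under the null hypothesis.
    half_x = -sum(math.log(p) for p in p_values)

    # For even degrees of freedom (2k), the survival function is
    # P(X > x) = exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half_x / i
        total += term
    return min(1.0, math.exp(-half_x) * total)


# Example with made-up per-study p-values (not taken from the paper).
combined = fisher_combined_p([0.04, 0.20, 0.01])
print(f"combined p = {combined:.4g}")
```

The method assumes the studies are independent; a single input p-value is returned unchanged.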
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig2-figsupp1-v2.tif/full/617,/0/default.jpg)
Change of allelic chromatin accessibility in B cell subtypes from SLE patients and controls.
Forest plots showing AI of allelic chromatin state of SNP rs246367 (A) and rs72642993 (B) in both rN and activated (Non-rN) B cells in patients with SLE compared with healthy controls. The plots in the right panel display the 95% confidence interval of the beta-value.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig2-figsupp2-v2.tif/full/617,/0/default.jpg)
Expression pattern of FDFT1 and BLK across B cell subtypes in patients with SLE and healthy controls.
The data showing expression profiles for FDFT1 (A) and BLK (B) in B cell subtypes were from a case-control study (Accession ID: GSE118254).
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig2-figsupp3-v2.tif/full/617,/0/default.jpg)
Expression pattern of FDFT1 and BLK across B cell subtypes in patients with SLE and healthy controls.
The data showing expression profiles for FDFT1 (A) and BLK (B) in B cell subtypes were from a case-control study (Accession ID: GSE92387).
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig3-v2.tif/full/617,/0/default.jpg)
Association analysis and functional prediction of SNP rs1047643.
(A) Association results for the SNP rs1047643 with SLE risk in single marker analyses. MAF, minor allele frequency; OR, odds ratio; CI, confidence interval. Adjusted p-trend: after adjustment for 12 GWAS index SNPs (shown in E) in a logistic regression model. (B) Haplotype analyses of the two SNPs (SNP1: GWAS indexed SNP rs17807624; SNP2: rs1047643) in relation to SLE risk. Baseline (the reference haplotype) represents the alleles associated with a reduced risk at the two SNPs. (C) Barplot showing the genomic length of the chromHMM-annotated enhancer state on the super-enhancer region (blue highlighted in 3 C) in 43 epigenomes. (D) Plot showing the eQTL results for SNP rs1047643 in whole blood or B cells from three databases (shown on the y-axis). (E) Genomic annotations of the SNP rs1047643. The three tracks show the locations of 13 GWAS index SNPs, gene annotation and 15-state chromatin segments in CD19+ B cells at the 8p23 locus, respectively. Vertical blue and purple lines represent the locations of the super-enhancer and SNP rs1047643, respectively. (F) Long-range interaction between a super-enhancer and SNP rs1047643. The two tracks show chromatin interactions from two independent studies using whole-genome Hi-C and capture Hi-C technologies, respectively. Orange curves show the interactions between the super-enhancer and the SNP rs1047643. (G) Heatmaps showing the 3D DNA interactions at the 8p23.1 locus in eight cell lines. The rectangle represents interactions between the super-enhancer and the SNP rs1047643. All raw data are available in Figure 3—source data 1.
-
Figure 3—source data 1
Source files for presenting results in Figure 3.
This zip archive contains all source data used for the quantitative analyses shown in Figure 3.
- https://cdn.elifesciences.org/articles/72837/elife-72837-fig3-data1-v2.zip
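Figure 3A reports an odds ratio (OR) with a confidence interval (CI) for rs1047643. The caption describes a logistic regression model; the sketch below swaps in the simpler unadjusted Wald interval from a 2×2 allele-count table, purely to illustrate how an OR and its 95% CI relate. The function name `odds_ratio_ci` and all counts are hypothetical and are not taken from the study.

```python
import math


def odds_ratio_ci(case_alt, case_ref, ctrl_alt, ctrl_ref, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    Arguments are allele counts (hypothetical illustration):
    alternative/reference alleles in cases and in controls.
    z=1.96 corresponds to an approximate 95% interval.
    """
    if min(case_alt, case_ref, ctrl_alt, ctrl_ref) <= 0:
        raise ValueError("all counts must be positive")
    odds_ratio = (case_alt * ctrl_ref) / (case_ref * ctrl_alt)
    # Standard error of log(OR) for a 2x2 table (Woolf's formula).
    se_log_or = math.sqrt(
        1 / case_alt + 1 / case_ref + 1 / ctrl_alt + 1 / ctrl_ref
    )
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Made-up counts for illustration only.
or_value, lo, hi = odds_ratio_ci(30, 70, 20, 80)
print(f"OR = {or_value:.2f}, 95% CI = ({lo:.2f}, {hi:.2f})")
```

An adjusted analysis, such as the one conditioning on GWAS index SNPs described in the caption, would require fitting a regression model rather than this table-based shortcut.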
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig3-figsupp1-v2.tif/full/617,/0/default.jpg)
Chromatin interactions with the FDFT1 promoter region (marked by the green arrow) at the 8p23 locus from CHi-C data with duplicates in two types of normal T cells.
Orange arrow represents the location of super-enhancer identified in this study.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig3-figsupp2-v2.tif/full/617,/0/default.jpg)
Heatmaps of long-range chromatin interactions from Hi-C data at the 8p23 locus at 10 kb (or 20 kb) resolution in a panel of human tissues (n = 9) from the 3D Genome Browser.
The circles shown on heatmaps are the interaction density between SNP rs1047643 and SE region.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig4-v2.tif/full/617,/0/default.jpg)
Aberration of super-enhancer and FDFT1 promoter region in B cell subtypes from SLE patients.
(A) Empirical cumulative distribution of TPM values per 50 bp window across the 7 kb SE region in B cell subsets for disease and control groups. (B) Plots showing the TPM values at the third quartile (Q3) across B cell subtypes as a comparison between SLE and controls. (C) Empirical cumulative distribution of TPM values on the SE region (same as shown in A) in a comparison between two groups across four B cell subtypes. (D) Boxplots showing the TPM values per 50 bp window at the FDFT1 promoter region in B cell subtypes for SLE and controls. The black lines and grey areas represent the linear regression results towards the B cell development from T3 to DN stages, and the 95% CI. (E) Plots showing the correlation between super-enhancer and FDFT1 promoter regions based on mean TPM values with respect to B cell subtypes in SLE and controls. (F) Wiggle plot showing the enrichment of open chromatin states at the 8p23.1 locus in B cell subtypes for two individuals (a healthy individual in the upper panel, and a patient with SLE in the lower panel). Purple and green vertical lines represent the locations of the super-enhancer and FDFT1 promoter, respectively. (G–H) Quantitative comparison of chromatin accessibility states in the SE (G) and FDFT1 promoter regions (H) with respect to B cell subtypes. All raw data are available in Figure 4—source data 1.
-
Figure 4—source data 1
This txt file contains source data used for the quantitative analyses shown in Figure 4.
- https://cdn.elifesciences.org/articles/72837/elife-72837-fig4-data1-v2.txt
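Figure 4A–C summarise per-window TPM values as an empirical cumulative distribution and by their third quartile (Q3). The sketch below shows one way such summaries could be computed from a list of per-window values. It is an illustration only: the helper names `ecdf` and `third_quartile` and the input values are hypothetical, and the quantile convention used by the authors is not stated in the caption (the "inclusive" method is assumed here).

```python
import statistics


def ecdf(values):
    """Return (value, cumulative proportion) pairs in ascending order."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("need at least one value")
    # The i-th smallest value has proportion (i + 1) / n at or below it.
    return [(v, (i + 1) / n) for i, v in enumerate(ordered)]


def third_quartile(values):
    """Q3 using the 'inclusive' quantile method (an assumed convention)."""
    return statistics.quantiles(values, n=4, method="inclusive")[2]


# Made-up TPM values for five 50 bp windows (not from the paper).
windows_tpm = [3.0, 1.0, 2.0, 5.0, 4.0]
for value, proportion in ecdf(windows_tpm):
    print(f"TPM <= {value}: {proportion:.2f}")
print("Q3 =", third_quartile(windows_tpm))
```

Comparing Q3 between groups, as in panel B, reduces each distribution to a single number that is less sensitive to a few extreme windows than the maximum would be.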
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig4-figsupp1-v2.tif/full/617,/0/default.jpg)
Genome-wide background analysis of ATAC-seq data.
Left panel: empirical cumulative distribution of TPM values per 50 bp window across randomly selected regions (n = 2,000) in B cell subsets for disease and control groups. Right panel: plots showing the TPM values at the third quartile (Q3) across B cell subtypes as a comparison between SLE and controls.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig4-figsupp2-v2.tif/full/617,/0/default.jpg)
Aberration of super-enhancer in resting naive B cell subtypes from SLE patients in relation to healthy controls.
(A) Wiggle plot showing the enrichment of open chromatin states at the 8p23.1 locus in resting naive B cells from eight individuals. Blue and purple vertical lines represent the locations of SE and FDFT1 promoter, respectively. (B–C) Quantitative comparison of chromatin accessibility states in the SE and FDFT1 promoter regions in naive B cells in a comparison between SLE and controls.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig4-figsupp3-v2.tif/full/617,/0/default.jpg)
Super-enhancer activity in T cells and neutrophils from SLE patients and controls.
(A–B) Empirical cumulative distribution of TPM values per 50 bp window and enrichment of ATAC-seq reads (TPM value) across the SE region in neutrophil cell subsets from SLE patients and controls. (C–D) Empirical cumulative distribution of TPM values per 50 bp window and enrichment of ATAC-seq reads (TPM value) across the SE region in two T cell subsets from SLE patients. (E) Wiggle plot showing the enrichment of open chromatin states at 8p23.1 locus in neutrophils and T cells. Blue and purple vertical lines represent the locations of SE and FDFT1 promoter, respectively.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig5-v2.tif/full/617,/0/default.jpg)
Hypomethylation in super-enhancer region in B cell subtypes from SLE patients.
(A) Boxplots showing the CpG methylation levels per 50 bp window in 7 kb SE region in B cell subtypes for SLE and control groups. The black and red lines represent the linear regression results towards the B cell development from rN to DN stages for SLE and controls, respectively. (B) Plots showing the correlation between TPM values (y-axis) and DNA methylation levels (x-axis) averaged over each B cell type in SLE and controls. All raw data are available in Figure 5—source data 1.
-
Figure 5—source data 1
This txt file contains source data used for the quantitative analyses shown in Figure 5.
- https://cdn.elifesciences.org/articles/72837/elife-72837-fig5-data1-v2.txt
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig5-figsupp1-v2.tif/full/617,/0/default.jpg)
DNA methylation comparison across randomly selected regions in B cell subtypes between patients with SLE and controls.
Boxplots showing the CpG methylation levels per 50 bp window in 2000 randomly selected regions in B cell subtypes for SLE and control groups. The black and red lines represent the linear regression results towards the B cell development from rN to DN stages for SLE and controls, respectively.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig6-v2.tif/full/617,/0/default.jpg)
Contribution of STAT3 modulates the enhancer activity and SNP-residing locus in cultured GM11997 cells.
(A) ChIP-qPCR for H3K27ac (left lower panel), H3K4me1 (right lower panel) and pSTAT3 (B) at the 8p23 super-enhancer region following 40 μM S3I-201 treatment for 24 hr. Upper panel: UCSC genome browser showing the location of two pairs of qPCR primers (SE5 and SE3) on the SE region (yellow). The two tracks below show the enrichment of H3K27ac and H3K4me1 across the SE region. (C) Allelic ChIP-qPCR for pSTAT3 binding on rs1047643 (T vs C alleles) following S3I-201 treatment for 24 hr. (D–E) ChIP-qPCR for H3K27ac (D) and pSTAT3 (E) at the 8p23 super-enhancer region following 100 nM ML115 treatment for 6 hr. (F) Allelic ChIP-qPCR for pSTAT3 binding on rs1047643 in cells that have been challenged with ML115 for 6 hr as indicated. Note: the fold changes for the rs1047643-associated BLK and FDFT1 genes in response to small molecules compared to vehicle (0.1% DMSO) as control, which was set as one in all cases, are presented. NS, not significant; *, p < 0.05; **, p < 0.01; ***, p < 0.005.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig6-figsupp1-v2.tif/full/617,/0/default.jpg)
Quality control of ChIP experiments in GM11997 cells.
Plots showing ChIP-qPCR results for H3K27ac (A and D), H3K4me1 (B and E) and pSTAT3 (C and F) at a negative control (NC) region with the treatment of S3I-201 and ML115, respectively.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig6-figsupp2-v2.tif/full/617,/0/default.jpg)
Genotyping of SNP rs1047643 in GM11997 genomic DNA using allelic qPCR analysis.
Amplification plots are presented for two alleles.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig6-figsupp3-v2.tif/full/617,/0/default.jpg)
Validation of STAT3-mediated allelic binding in GM11997 cells.
Plots showing ChIP-qPCR results for pSTAT3 at rs1047643 following 1 μM Cucurbitacin I treatment for 24 hr (A) and 50 ng/ml IL-6 treatment for 1 hr (B), respectively. NS, not significant; *, p < 0.05.
![](https://iiif.elifesciences.org/lax/72837%2Felife-72837-fig7-v2.tif/full/617,/0/default.jpg)
Expression of two alleles on SNP rs1047643 and its linked genes in cultured cells.
Left panel: allelic RT-qPCR on SNP rs1047643 (T vs C alleles) following S3I-201 (A) and ML115 (C) treatment for 24 hr, respectively. Right panel: RT-qPCR analysis showing the fold changes for the rs1047643-associated BLK and FDFT1 genes in response to different concentrations of S3I-201 (B) and ML115 (D) compared to vehicle (0.1% DMSO) as control, which was set as one in all cases. *, p < 0.05; **, p < 0.01; ***, p < 0.005.
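The fold changes in Figure 7 are expressed relative to the vehicle control, which is set to one. The caption does not state how they were computed; the sketch below assumes the widely used 2^(-ΔΔCt) approach with a reference (housekeeping) gene, which may differ from the authors' procedure. The function name `fold_change_ddct` and all Ct values are hypothetical.

```python
def fold_change_ddct(target_ct_treated, ref_ct_treated,
                     target_ct_vehicle, ref_ct_vehicle):
    """Relative expression by the 2^(-ddCt) method (assumed, not confirmed).

    Each argument is a mean Ct value: target gene and reference gene,
    in treated cells and in vehicle-treated cells.
    """
    # Normalise the target gene to the reference gene within each condition.
    delta_treated = target_ct_treated - ref_ct_treated
    delta_vehicle = target_ct_vehicle - ref_ct_vehicle
    # Compare treated with vehicle; vehicle against itself gives exactly 1.
    delta_delta = delta_treated - delta_vehicle
    return 2.0 ** (-delta_delta)


# Made-up Ct values: the target needs one extra cycle after treatment,
# which corresponds to half the vehicle expression level.
print(fold_change_ddct(24.0, 18.0, 23.0, 18.0))
```

This form assumes roughly 100% amplification efficiency for both primer pairs; efficiency-corrected variants replace the base 2 with measured values.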
Tables
Reagent type (species) or resource | Designation | Source or reference | Identifiers | Additional information |
---|---|---|---|---|
Chemical compound, drug | ML115 | Cayman Chemical | Cayman Chemical: 15178 | Madoux et al., 2010 |
Chemical compound, drug | S3I-201 | Sigma-Aldrich | Sigma-Aldrich: SML0330 | |
Chemical compound, drug | Cucurbitacin I | Sigma-Aldrich | Sigma-Aldrich: C4493 | |
Chemical compound, drug | Recombinant human IL-6 | Cell Guidance Systems | Cell Guidance Systems: GFH10AF | |
Antibody | Phospho-STAT3 (Ser727) | Thermo Fisher Scientific | Thermo Fisher Scientific Cat# PA5-17876; RRID:AB_10980044 | |
Antibody | Anti-Histone H3 (acetyl K27) | Abcam | Abcam Cat# ab4729; RRID:AB_2118291 | |
Antibody | H3K4me1 Recombinant Polyclonal Antibody | Thermo Fisher Scientific | Thermo Fisher Scientific Cat# 710795; RRID:AB_2532764 | |
Antibody | normal mouse IgG | Santa Cruz Biotechnology | Santa Cruz Biotechnology Cat# sc-2025; RRID:AB_737182 | |
Antibody | normal rabbit IgG | Santa Cruz Biotechnology | Santa Cruz Biotechnology Cat# sc-2027; RRID:AB_737197 | |
Cell line (H. sapiens) | GM11997 | Coriell | Coriell Cat# GM11997; RRID:CVCL_5C55 | |
Sequence-based reagent | ChIP-qPCR primers | This paper | See Supplementary file 5 | |
Sequence-based reagent | RT-qPCR primers | This paper | See Supplementary file 5 | |
Sequence-based reagent | Allelic qPCR primers | This paper | See Supplementary file 5 | |
Software, algorithm | R | R Foundation | https://www.r-project.org | Version 4.0.2 |
Software, algorithm | Hisat2 | Kim et al., 2019 | Version 2 | |
Software, algorithm | Allelic imbalance analysis and plots | This paper (Zhang, 2021) | The R code used for the AI analysis can be accessed via GitHub at https://github.com/youngorchuang/Allelic-imbalance-analysis (copy archived at swh:1:rev:f0db42af8fed130ebbfe0b46abf992300dadddd6) | |
Software, algorithm | HiCUP | Wingett et al., 2015 | ||
Commercial assay or kit | Mycoplasma detection kit | Sigma-Aldrich | Sigma-Aldrich:MP0025 | |
Commercial assay or kit | SuperScript III reverse transcriptase | Thermo Fisher Scientific | Thermo Fisher Scientific:18080044 | |
Commercial assay or kit | Luna Universal qPCR Master Mix | New England Biolabs | New England Biolabs:M3003X | |
Additional files
-
Supplementary file 1
Summary of data sets used in the study.
Functional genomics data sets, including ATAC-seq, RNA-seq and RRBS-seq data sets from seven SLE case-control studies (Supplementary file 2), and Hi-C data sets in multiple cell lines, and a SNP microarray data set from a lupus GWAS study.
- https://cdn.elifesciences.org/articles/72837/elife-72837-supp1-v2.xlsx
-
Supplementary file 2
List of data sets from seven SLE case-control studies.
- https://cdn.elifesciences.org/articles/72837/elife-72837-supp2-v2.xlsx
-
Supplementary file 3
Association results for the SNP rs1047643 with SLE risk in European population.
- https://cdn.elifesciences.org/articles/72837/elife-72837-supp3-v2.xlsx
-
Supplementary file 4
LD score (r²) between SNP rs1047643 and 12 GWAS tag SNPs in European population.
- https://cdn.elifesciences.org/articles/72837/elife-72837-supp4-v2.xlsx
-
Supplementary file 5
List of primers used in this study.
- https://cdn.elifesciences.org/articles/72837/elife-72837-supp5-v2.xlsx
-
Transparent reporting form
- https://cdn.elifesciences.org/articles/72837/elife-72837-transrepform1-v2.docx