Introduction

The three-dimensional (3D) architecture of chromatin in the nucleus is highly organized and contributes to the regulation of gene expression and cellular differentiation. Within the nucleus, each chromosome forms compartments that are distinct and separate from each other, called chromosome territories1. Within chromosome territories, the 3D architecture forms many distinct local features, like loops and topologically associating domains (TADs), as well as long-range interactions between enhancers, insulators, and promoters2. Cohesin and CCCTC-binding factor (CTCF) determine the structure and boundaries of TADs and these structural elements are conserved in many different cell types and species3,4. Loss of these proteins in most cell types disrupts TAD structure and chromatin loops although surprisingly gene expression is mostly unchanged57. But in myeloid cells, loss of CTCF greatly disrupts expression of innate immune genes after acute inflammatory stimuli, suggesting local chromatin structure has a significant role in certain context specific gene expression responses8,9.

While transcription factors, like nuclear factor-kappa B (NF-κB) and interferon-regulatory factors (IRF), bind and activate specific regulatory elements in the genome10, the 3D genome architecture, via TADs and chromatin loops, bring these elements into close proximity, and increase associations between enhancers and promoters of genes in similar activation states11,12. In certain contexts, transcription factor activation can then alter the 3D genomic landscape by increasing chromatin accessibility and contact frequency of certain regions. For example, in response to type 1 and type 2 interferons, loci in the genome containing families of interferon stimulated genes, increase chromatin contacts13.

While the 3D architecture plays a major role in cellular differentiation and gene regulation, less is known about how chromatin structure changes in disease11,14. A recent study, which performed high-throughput chromatin conformation capture technology (Hi-C) on primary monocytes from two patients with systemic lupus erythematosus (SLE) and two age- and sex-matched healthy controls, found few changes in chromatin structure between healthy and disease individuals15. This would suggest that Hi-C might not be a useful tool for predicting how monocytes transform in disease, and how genome architecture influences gene expression in primary cells. On the other hand, a few studies have found many differences in chromatin structure between primary monocytes and THP-1 cells, a monocytic cell line1517. While these data suggest that 3D genome architecture may be a conserved feature of cellular differentiation and not a significant contributor to disease, more studies have to be conducted in different disease contexts to see how chromatin structure can influence monocyte gene expression. One limitation of the previous studies is that, due to the cost of these experiments, the sample size tends to be low, 1-2 patient samples, and data is often combined to increase resolution. As a result, statistical testing and disease specific comparisons have not been measured in most previous studies.

Alcohol-associated Hepatitis (AH) is a severe inflammatory disease characterized by extremely pro-inflammatory immune cells which infiltrate and damage the liver18,19. Monocytes in AH express significantly more pro-inflammatory genes in response to innate immune stimuli2022. Many innate immune genes are clustered in the genome, like the CXC-chemokine cluster on Chromosome 4, the CC-chemokine cluster on Chromosome 17, and the NK-gene receptor complex on Chromosome 12, all of which have been implicated in AH. Single cell analysis revealed that these genes have highly coordinated expression patterns, suggesting the proximity of these genes influences their expression patterns20.

Here, we present the 3D genome architecture of primary monocytes isolated from four patients with severe AH, and four age-matched healthy controls using Hi-C technology. We hypothesized that hypersensitivity to bacterial lipopolysaccharides (LPS) and heightened pro-inflammatory responses in AH might be due to a significantly perturbed 3D genome architecture. Our results show that there are extensive changes in chromatin structure throughout the genome in AH. While changes in contact frequency occurred throughout every chromosome, there were a number of hotspots with a high density of changes, and in many cases, hotspots contained important families of genes involved in immunity and have been implicated in AH. Ultimately, these results suggest the 3D genome structure is significantly altered in AH, and these changes contribute to perturbed gene expression patterns, specifically pro-inflammatory genes.

Results

Hi-C reveals the 3D genome architecture of hyper-inflammatory monocytes from AH patients

To investigate changes in 3D genome architecture, we performed Hi-C on primary monocytes isolated from four AH patients (all males) and 4 age-matched healthy controls (3 males and 1 female, Table 1, Supplemental Table 1). Cryo-preserved PBMCs were thawed and CD14+ and CD16+ monocytes were isolated by negative selection. Hi-C was then performed using the Arima kit, and all 8 libraries were sequenced to an average depth of about 570 million reads per sample (Supplemental Table 2). Using Juicer, the reads were aligned to the genome and chromatin conformation was determined at a resolution of 100kb. As expected, most of the chromatin interactions observed were local and intra-chromosomal (Figure 1, Supplemental Figure 1, Supplemental Table 2). As a result, we focused on intra-chromosomal analyses and not inter-chromosomal interactions.

Hi-C data reveals regions of the genome in close proximity

Juicebox plot showing the contact frequency map (100kb resolution) for the entire genome with each chromosome laid end to end (X and Y chromosomes at the bottom right). Each red dot consists of two regions of the genome associated with each other. Top right: HC patient monocytes. Bottom Left: AH patient monocytes

Summary of Clinical data for patient samples used for Hi-C

Using HiCRep23, we measured the correlation coefficient of the chromatin interactions for each chromosome between every sample to assess in an unbiased manner how much variation there is in genome architecture between different patients. For most comparisons, the correlation coefficient between all individuals was very high (>90%), indicative of the high conservation of 3D genome architecture in monocytes regardless of disease status. But for most chromosomes, higher correlations were observed between healthy controls or between AH samples, and not between the two groups (Figure 2), indicative of a change in genome architecture caused by disease. Notable exceptions include chromosomes 4, 11, 16, 19 and the X chromosome. For the X chromosome, one female healthy control sample (HC2) differed significantly from the 7 males, and the 3 HC males were more similar to each other than the 4 AH patients (Supplemental Figure 2). Chromosomes 4, 11, and 19 mostly showed higher similarity by disease, except the female healthy control was more similar to the AH patients. Chromosome 16 did not segregate by disease. These data indicate that while much of the 3D genome architecture of monocytes does not change with disease, there likely exist chromosome specific regions that differ significantly.

Correlation reveals differences between AH and HC

Hierarchically clustered heatmaps of correlation coefficients of the Hi-C data for each chromosome. For each heatmap, darker colors refer to higher correlation coefficients.

Differences in 3D Architecture occur throughout all chromosomes in AH

Hi-C data reveal regions of the genome in close proximity via the contact frequency between two loci. We can measure disease-specific changes in 3D genome architecture by measuring statistically significant differences in these frequencies. Using multiHiCcompare24, we measured the contact frequency between every pair of 100kb genome windows (region-region pairs) and calculated statistically significant changes between HC and AH. We observed significant change in contact frequency throughout the entire length of each chromosome (Figure 3). To ensure the method was reliable, we randomized the samples prior to statistical testing and observed almost no significant changes between regions.

Changes in Contact Frequency in Disease are Local and Long-Range

Plots showing regions with significant changes in contact frequency in AH. Each dot indicates two regions of the genome that either increased (Blue) or decreased (Red) contact frequency in AH. X- and Y-axes are position along the chromosome

For most chromosomes, the changes in contact frequency caused by disease occurred in region-region pairs that are adjacent to each other, which corresponds to the fact that most Hi-C interactions are between local genomic regions25,26. For example, interaction data from Chromosome 2 is primarily between adjacent regions and diminishes by distance, and similarly changes in contact frequency caused by disease are also mostly local (Supplemental Figure 3). Chromosome 1 is an exception because it can be separated into 3 major megadomains, and changes in genome structure in chromosome 1 were restricted to these regions, with almost nothing differing between each megadomain (Supplemental Figure 4). Some chromosomes did have long-distance structural changes between chromosome arms, including chromosomes 11, 12, 14, 16, 17, 19, and 20. The telomeric regions of Chromosome 14 had extensive increases in contact frequency with the body of the chromosome in AH. The telomeres of Chromosome 14 contain the T-cell receptor alpha (TRA) locus on one arm and the immunoglobulin heavy chain (IGH) locus on the other.

Numerous Hotspots with significant structural change in AH contain genes important for innate immunity

While there were differences in contact frequency throughout the genome in AH, there were a number of notable hotspots that contained a high density of the most significantly altered regions (Figure 4). The size of these hotspots varied significantly from 21MB down to less than 1MB. For most hotspots, the differences in contact frequency occurred entirely within the region. Many of the hotspots entirely increased or decreased contact frequency in AH while some contained separate areas that increased or decreased, further indicative of a localized rearrangement in genome structure, with some areas moving closer and others moving further away in disease.

Certain Regions in the Genome have Hotspots of Significant Change in Disease

Manhattan plot showing regions of the genome with significant changes in chromatin interaction with disease. X-axis is position along the chromosome. Y-axis is –log10(p).

Considering their size, some hotspots contained hundreds of genes, including many genes associated with immune responses and have been associated with AH (Table 2, Supplemental Tables 3 and 4). Many of the hotspots contain large families of genes that are clustered together in the genome. For example, Chromosome 4 (∼67-85M) contains the CXC-chemokines, Chromosome 5 (∼140-145M) contains the protocadherins, Chromosome 12 (∼6-15M) contains the NK-gene receptor complex, Chromosome 14 (∼21-23M) contains the T-cell receptor alpha locus and Chromosome 14 (∼90M) contains the Serpin family. One hotspot on Chromosome 19 (∼34-56M) contains multiple large families of genes that are involved in diverse aspects of immune regulation including killer-cell immunoglobulin-like receptors (KIRs), leukocyte immunoglobulin-like receptors (LILRs), sialic-acid-binding immunoglobulin-like lectins (Siglecs), C5 complement receptors, and NOD-like family of receptors (NLRPs). Additionally, hotspots also contained other notable immune genes like NFKB1 (Chromosome 4), IFNG (Chromosome 12) and IFNGR1 (Chromosome 6).

Locations and gene contents of hotspots with significant change in AH

Structural Changes Observed at Hotspots are more complex than TADs

Because hotspots represent regions of the genome that have a large number of structural changes in disease, and they contain important genes involved in innate immunity, these regions were examined with finer detail. Within the hotspots, TAD structure, such as the loss or expansion of a TAD, was unchanged. Instead, most changes in genome architecture were due to the interaction profile within individual TADs and the relationship between the TADs and the surrounding genome.

CXC-chemokines are upregulated in AH, and we previously found increased co-regulation of these genes in response to LPS in AH patients, using single cell data of monocytes20. Zooming into the hotspot on chromosome 4 containing the CXC-chemokine cassette, we see that all of the CXC-chemokine genes exist within a single TAD (Figure 5). Interestingly, while the borders of the TAD did not change, the contact frequency within that TAD increased in AH. Zooming out, we observed an alternating pattern of interactions between this TAD and the neighboring TADs.

CXC-chemokine cluster has increase contact frequency within the TAD and reduced outside

A) Schematic of the CXC-chemokine gene cluster. B) Juicebox plot showing contact frequency map of the CXC-chemokine gene cluster and the surrounding genomic region. Blue boxes are 100kb regions significantly different in disease. C) Plot corresponding to the Juicebox plot showing whether each significant region increased (Blue) or decreased (Red) contact frequency in disease.

We previously found that monocytes in AH have increased expression of CLRs, which are a family of PRRs that sense a wide diversity of PAMPs and DAMPs. Many of these CLRs are found within the NK gene receptor complex on chromosome 12, and in particular, four of these CLRs (Mincle, Dectin-2, Dectin-3 and DCIR) are in close proximity and have highly co-regulated expression in AH. Similar to the CXC-chemokines, TAD structure in this hotspot did not change, but rather there were increased interactions within the individual TADs, increased connections between TADs within the entire NK-gene receptor complex (∼7M-10M), and reduced interaction immediately outside of it (Figure 6). Another hotspot on chromosome 12 at approximately 65-69M, which contains the genes for IL-22, IL-26, and IFNG, had the opposite structure, with fewer connections within the hotspot and more connections to the surrounding genomic landscape (Figure 7).

Myeloid CLR gene cluster has increased local contacts in AH

A) Schematic of the NK-gene receptor complex. B) Juicebox plot showing contact frequency map of the NK-gene receptor complex and the surrounding genomic region. Blue boxes are 100kb regions significantly different in disease. C) Plot corresponding to the Juicebox plot showing whether each significant region increased (Blue) or decreased (Red) contact frequency in disease.

Genomic region around IL-22 and IFNG have decreased local contacts in AH

A) Juicebox plot showing contact frequency map of the region on Chromosome 12 around the IL-22 and IFNG genes. Blue boxes are regions significantly different in disease. B) Plot corresponding to the Juicebox plot showing whether each significant region increased (Blue) or decreased (Red) contact frequency in disease.

Two adjacent loci of CCL chemokine genes have increased connectivity and correlated expression

In AH, CC-chemokines are upregulated in monocytes, similar to CXC-chemokines and many other pro-inflammatory cytokines20,27. In the genome, the CC-chemokine genes are in two loci separated by about 1.5MB. From the Hi-C data, each CC-chemokine locus is within an individual TAD (Figure 8). While this region of the genome was not a hotspot, there was significant changes in 3D genome architecture, particularly in terms of the relationship between the CC-chemokine loci and the surrounding area, which increased connectivity in disease.

The Split CC-Chemokine Cluster has increased contact between loci, and highly correlated expression in AH

A) Juicebox plot showing contact frequency map of the NK-gene receptor complex and the surrounding genomic region. Blue boxes are regions significantly different in disease. B) Plot corresponding to the Juicebox plot showing whether each significant region increased (Blue) or decreased (Red) contact frequency in disease. C/D) Correlation analysis showing coordinately expressed in response to LPS (100pg/ml, for 24 hours) for healthy control (C) and AH (D). Genes are organized by chromosomal position and oriented in the same manner as A and B. Blue squares indicate genes with highly correlated expression while red squares indicate anti-correlated expression.

We previously published single-cell RNA-seq data from PBMCs isolated from patients with severe AH and healthy controls, with and without ex vivo LPS challenge (100pg/ml for 24 hours). In response to LPS, both HC and AH monocytes express chemokines, though the expression is much higher in AH. Next, we measured how well correlated gene expression was in single cells, in order to determine if neighboring genes have coordinated expression (Figure 8). In HC, we observed highly co-regulated expression of the CCL-chemokines in one of the loci (CCL3, CCL4, CCL3L1, CCL4L2). In AH, CCL-chemokine expression was highly co-regulated in both loci, including a few more genes in the first locus (CCL23, CCL18, CCL3, CCL4, CCL3L1, CCL4L2) and genes in the second locus (CCL2, CCL7, CCL13). And importantly, expression between both loci were also coordinated, suggesting the change in 3D genome architecture allowed for coordinated expression of these genes.

Discussion

Alcohol-associated Hepatitis is characterized by dysfunctional monocytes that increase systemic inflammation and can cause extensive damage to the liver and eventually end-organ failure. Monocytes from AH patients are hypersensitive to innate immune stimuli, and in response to bacterial LPS, upregulate expression of cytokines and chemokines much more dramatically than seen in healthy controls. In this study, we try to understand how changes in the 3D genome architecture of monocytes are transformed during AH and infer the impact these changes have on gene expression.

Many studies have been conducted trying to understand how changes in 3D genome architecture affects gene expression and cell development using unbiased and untargeted methods like Hi-C. Far fewer studies have been conducted on primary monocytes and how the architecture changes with disease. We hypothesized that in a disease like AH, where immune cells are so hypersensitive to LPS, alterations in 3D genome architecture may play a significant role in gene regulation. In our study, using monocytes isolated from four AH patients and 4 healthy controls, we in fact see significant changes in chromatin conformation caused by disease. Interestingly, we did not find significant changes in TAD boundaries or DNA looping patterns. But by measuring significant changes in contact frequency for every region-region pair, we were able to assess that throughout each chromosome, there were significant changes in how the TADs and larger loci associated with each other. Notably, we found a number of hotspots in the genome that contained a large number of structural changes.

Many large gene families and genes associated with innate immunity and AH were found within these hotspots. For example, in a previous publication, our group found highly coordinated expression of CXC-chemokines and some of the C-type lectin genes present within the NK-gene receptor complex20. Here, we find that the 3D genome architecture of these regions has significantly changed in disease. Moreover, throughout these hotspots we identified a number of other genes and gene families associated with AH. Chromosome 19 contained a large hotspot containing genes associated with the inflammasome (NOD-like receptors), which is an important inflammatory complex involved in IL-1B release and pyroptosis, which contributes to significant damage in liver and other organs during AH28,29. This hotspot also contained SPACA6, which is a host gene for a cluster of microRNAs that are also upregulated in AH30.

This region on chromosome 19 also contains the killer cell immunoglobulin-like receptor family, which are genes that encode transmembrane glycoproteins involved in NK-cell target recognition. Alongside the NK-gene receptor complex on Chromosome 12, another hotspot, our data suggests significant architectural changes in two different loci important for NK-cell target recognition in monocytes. While NK-cells are dysfunctional in AH31,32, the structural changes we observe in monocytes in these regions is more likely to do with the other gene families in the area, such as in the chromosome 12 hotspot, the CLR genes, and in the chromosome 19 hotspot, the leukocyte immunoglobulin-like receptors, which are highly coordinately expressed in monocytes.

On the other hand, some of the hotspots contained genes with unclear roles in monocytes or AH. Looking at single cell data, some of these hotspots contained genes with very low expression in monocytes. For example, the T-cell receptor locus on chromosome 14, which increased contact frequency throughout the length of the chromosome, but none of these genes are expressed in monocytes. But all of these hotspots are more than just the genes within and may contain important regulatory elements.

While the CC-chemokine gene cassettes on chromosome were not a clear hotspot with a large number of changes in disease, there were still significant differences. This region was of interest because many CCL chemokines are highly upregulated in response to LPS in AH patients20, and levels of circulating CCL chemokines are higher in disease33. In AH, the two CC-chemokine cassettes, which are separated by a little more than 1Mb and are in different TADs, had higher contact frequency and from single-cell data, had highly correlated expression. This suggests that these two regions came closer together in disease, and that the proximity of regulatory elements had an effect on the expression of these genes.

While there were significant changes in 3D genome architecture in AH, this study has a number of limitations. Other Hi-C studies in monocytes and THP-1 cells have been limited by small sample size, typically one patient/sample or two patients with the data combined to increase resolution. This is not ideal for human disease which is typically multifactorial and complex. In this study, we studied monocytes isolated from four healthy controls and four AH patients, in order to look at changes in contact frequency with high statistical confidence. But the trade-off was that we could not analyze this data at a much higher resolution without much deeper sequencing.

Taken together, these results indicate that the 3D genome architecture of monocytes is significantly altered in AH and suggest that perturbations in the 3D architecture contribute to differences in gene expression, especially in response to innate immune challenges. Future studies will focus on trying to understand the cause of genome restructuring in AH, as both alcohol and innate immune stimuli have the potential to alter epigenetic gene regulation34.

Additionally, more work is needed to understand how changes in local genome structure affect transcription factor dynamics and the relationship of enhancers and promoters in disease.

Finally, these studies suggest that Hi-C is a useful tool to understand how disease alters the genome, and there is an ongoing need to build more databases of this kind of data from a wider diversity of cell types, disease states, and patients.

Methods

Alcohol-related Hepatitis and Healthy Control Patient Selection

Enrolled patients had confirmed diagnosis of AH by clinicians at the Cleveland Clinic based on medical history, physical examination, and laboratory results, according to the guidelines of the American College of Gastroenterology [https://gi.org/clinical-guidelines/] (Supplemental Table 1). Healthy controls were recruited from the Clinical Research Unit at the Cleveland Clinic.

Isolation of Human PBMCs

PBMCs were isolated from human blood as previously described20,30. Isolation of mononuclear cells was performed by density gradient centrifugation on Ficoll-Paque PLUS (GE Healthcare, Uppsala, Sweden). 1 mL of freshly collected Buffy Coat was mixed at a ratio of 1:5 (vol/vol) with phosphate buffered saline (PBS) at 37°C and divided and layered onto 8 mL Ficoll-Paque PLUS in two 15 mL conical centrifuge tubes. After centrifugation at 400 × g for 30 min at 20°C (no brake), buffy coat fractions were collected, pooled, resuspended in culture media (Roswell Park Memorial Institute (RPMI)-1640 supplemented with 100 μM Penicillin-streptomycin and 10% fetal bovine serum (FBS)), and centrifuged at 400 × g for 15 min at 20°C. The pellets were resuspended in 8 mL of culture media, counted, and again centrifuged at 400 × g for 8 min at 20°C. Cells were then cryo-preserved by resuspending in freezing media (50% culture media, 40% FBS, 10% dimethyl sulfoxide (DMSO)) at a concentration of 1.5 × 106 cells/mL, and allowed to freeze slowly to −80°C in a styrofoam container. For long-term preservation, cells were stored in liquid nitrogen.

Hi-C of Isolated Monocytes from Cryopreserved PBMCs

Cryopreserved PBMCs from 4 AH patients and 4 age-matched healthy controls were thawed following the 10x protocol for cryopreserved PBMCs. Monocytes were isolated by negative selection using the EasySep Human Monocyte Enrichment Kit without CD16 Depletion (StemCell Technologies, Cambridge, MA) according to factory instructions. Hi-C was performed with the Arima Genome-wide Hi-C kit (Carlsbad, CA) according to factory instructions. Libraries were pooled and sequenced with an Illumina Novaseq 6000.

Analysis of Hi-C Data

Sequencing data for the Hi-C experiments were aligned to the genome (GRC38, release 93) using Juicer35. From the output of Juicer, we summarized the quality of the Hi-C maps using the inter.txt files (Supplementary Table 2). Chromosomal correlations were calculated using HiCRep23. Differential contact frequencies were measured using multiHiCcompare, at a resolution of 100kb and using the inter.hic files from Juicer24. Hi-C data was visualized using Juicebox to view each of the inter.hic files36. Regions with changes in contact frequency in disease were labeled using data from multiHiCcompare.

scRNA-seq Analysis and Clustering

Sequencing data was aligned to the Human genome (GRC38, release 93) using cellranger (v3.0.2). All gene expression and clustering analyses were performed using Seurat (3.1.1) as previously described20,37. Briefly, all samples were first normalized using SCTransform and then filtered to remove low quality cells (nFeature_RNA<4000, nFeature_RNA>200, percent.mt<20, which removes doublets, cells with low reads, and cells with high mitochondrial content))38. All samples were combined using the PrepSCTIntegration and FindIntegrationAnchors functions to find common anchor genes in all samples for all cell types, and then integrated using the IntegrateData function, with all normalizations using the SCT transformed data37,39. Clustering was performed using RunPCA and RunUMAP, and clusters were identified using FindNeighbors and FindClusters.

bigSCale2

For correlation analyses, bigSCale2 was used to calculate correlations using the Z-score algorithm40. For these analyses, only monocytes were used (clusters labeled CD14_Monocyte1, CD14_Monocyte2, CD14_Monocyte3, and CD16_Monocyte). All gene clusters were determined by first ordering all annotated human genes by chromosome and start codon then finding the desired clusters and selecting an arbitrary set of genes surrounding it (genes that are not hypothesized to be involved) to ensure the entire cluster was obtained. Genes were then filtered for low expression using the criteria that bigSCale2 was unable to determine a correlation coefficient. Heatmaps of the correlation coefficients were made with only the top and bottom 5% of all correlation coefficients shown, to remove noise and isolate the most important correlations.

Data and Code Availability

The Hi-C data generated for this study can be found at the database of Genotypes and Phenotypes (dbGaP) [TBD] The scRNA-seq data for this study can be found at National Center for Biotechnology Information Gene Expression Omnibus under accession number [PRJNA596980]20. All scripts used for analyses, differential expression results, for all cell types, and figure generation can be found at the author’s github (https://github.com/atomadam2/). Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

Acknowledgements

The authors thank the Clinical Research Unit, Genomics Core and Computing Services at the Cleveland Clinic. Additionally, Dr. Laura E Nagy, from the Cleveland Clinic and Northern Ohio Alcohol Center, for reagents and critical comments on the manuscript.

Additional information

Ethical Approval and Consent to Participate

The study protocol was approved by the Institutional Review Board for the Protection of Human Subjects in Research at the Cleveland Clinic and MetroHealth Hospitals, Cleveland. All methods were performed in accordance with the IRB’s guidelines and regulations and written informed consent was obtained from all subjects.

Author Contributions

AK and LEN contributed conception and design of the study. JD, AB, DS, NW, SD recruited patients and performed clinical analyses. AK wrote the first draft of the manuscript. AK and LEN wrote sections of the manuscript. All authors contributed to manuscript revision, read and approved the submitted version.

Financial Support

This work was funded by the following NIH-NIAAA grants: K99/R00AA028048 (AK), P50AA024333, U01AA026264, R01AA027456 (LEN), R01GM119174, R01DK113196, U01AA021890, U01AA026976, R56HL141744, U01DK061732, U01DK062470, R21AR071046 (SD), R01AA028763 (VV), K12HL141952, American College of Gastroenterology Clinical Research Award, K08AAAA028794 (NW).