Simultaneous trimodal single-cell measurement of transcripts, epitopes, and chromatin accessibility using TEA-seq

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Single-cell measurements of cellular characteristics have been instrumental in understanding the heterogeneous pathways that drive differentiation, cellular responses to signals, and human disease. Recent advances have allowed paired capture of protein abundance and transcriptomic state, but a lack of epigenetic information in these assays has left a missing link to gene regulation. Using the heterogeneous mixture of cells in human peripheral blood as a test case, we developed a novel scATAC-seq workflow that increases signal-to-noise and allows paired measurement of cell surface markers and chromatin accessibility: integrated cellular indexing of chromatin landscape and epitopes, called ICICLE-seq. We extended this approach using a droplet-based multiomics platform to develop a trimodal assay that simultaneously measures transcriptomics (scRNA-seq), epitopes, and chromatin accessibility (scATAC-seq) from thousands of single cells, which we term TEA-seq. Together, these multimodal single-cell assays provide a novel toolkit to identify type-specific gene regulation and expression grounded in phenotypically defined cell types.

Introduction

Peripheral blood mononuclear cells (PBMCs) purified using gradient centrifugation are a major source of clinically relevant cells for the study of human immune health and disease (Böyum, 1968). Like most other human tissues, PBMCs are a complex, heterogeneous mixture of cell types derived from common stem cell progenitors (Laurenti and Göttgens, 2018). Despite the genome being mostly invariant between different PBMC cell types, each immune cell type performs an important and distinct function. Understanding the genomic regulatory landscape that controls lineage specification, cellular maturation, activation state, and functional diversity in response to intra- and extracellular signals is key to understanding the immune system in both health and disease (Satpathy et al., 2019; Wang et al., 2020; Zheng et al., 2020).

Recent improvements in single-cell genomic methods have enabled profiling of the regulatory chromatin landscape of complex cell-type mixtures. In particular, droplet-based single-nucleus or single-cell assays for transposase-accessible chromatin (snATAC-seq, scATAC-seq, dscATAC-seq, mtscATAC-seq) allow profiling of open chromatin at single-cell resolution (Buenrostro et al., 2015; Lareau et al., 2019). Promising new methods have combined scATAC-seq with simultaneous measurement of nuclear mRNAs (e.g., sci-CAR, Cao et al., 2018; SNARE-seq, Chen et al., 2019; SHARE-seq, Ma et al., 2020) or in combination with cell surface epitopes (ASAP-seq, Mimitou et al., 2020). However, a unifying approach for all three modalities that can be applied to highly specified functional immune cell types has yet to emerge in the landscape of single-cell methods. We systematically tested whole cell and nuclear purification and preparation methods for PBMCs to overcome limitations that restricted previous assays to measurement of only nuclear components (ATAC and nuclear RNAs) or proteins on the cell surface. We found that intact, permeabilized cells perform extremely well for scATAC-seq, exceeding conventional scATAC-seq on nuclei by some measures (Figure 1b). This insight enables a new protocol analogous to Cellular Indexing of Transcriptomes and Epitopes (CITE-seq; Stoeckius et al., 2017) to measure both surface protein abundance and chromatin accessibility: integrated cellular indexing of chromatin landscape and epitopes (ICICLE-seq, Figures 1a and 3). Finally, we demonstrate that our optimized permeable cell approach can be combined with a droplet-based multiomics platform to enable the simultaneous measurement of three different molecular compartments of the cell: mRNA (by scRNA-seq), protein (using oligo-tagged antibodies), and DNA (by scATAC-seq), which we term TEA-seq after Transcription, Epitopes, and Accessibility (Figure 4). These assays enable a new, more unified view into the molecular underpinnings of gene regulation and expression at the single-cell level.

Figure 1 with 6 supplements see all

Download asset Open asset

Improvements to scATAC-seq methods to enable permeabilized cell profiling.

(a) Schematic overview of major steps in snATAC, scATAC, and ICICLE-seq methods. (b) Comparison of quality control characteristics of ATAC-seq libraries generated from nuclei isolation and permeabilized cells, with and without fluorescence-activated cell sorting. Top panels show signal-to-noise as assessed by fraction of reads in peaks on the y-axis and quantity of unique fragments per cell barcode on the x-axis. Lower panels display fragment length distributions obtained from paired-end sequencing of ATAC libraries. Colored lines represent barcodes that pass QC filters; gray lines represent barcodes failing QC (non-cell barcodes). All libraries were equally downsampled to 200 million total sequenced reads for comparison. Colors in (b) are reused in remaining panels. (c) Total coverage of Tn5 footprints summed across all transcription start sites (TSS). Tn5 footprints are 29 bp regions comprising the 9 bp target-site duplication (TSD) and 10 bp on either side, which represent accessible chromatin for each transposition event. (d) Total coverage of TSD centers summed over a set of 100,000 genomic CTCF motifs found in previously published DNase-hypersensitive sites (Meuleman et al., 2020). TSD centers are obtained by shifting +4 and −5 bp from the 5' and 3' ends of uniquely aligned fragments, respectively. (e) Barplot representations of the fraction of total aligned reads in various QC categories. Fragments overlapping a previously published peak set for peripheral blood mononuclear cell dscATAC-seq (Lareau et al., 2019) are in the 'Overlap Peaks' category. Unique fragments are the remaining uniquely aligned fragments that do not overlap peak regions. 'Waste' reads were not aligned or were assigned to cell barcodes with fewer than 1000 total reads. (f) Violin plots showing distributions of QC metrics. Median (wide bar) and 25th and 75th quantiles (whiskers and narrow bars) are overlaid on violin plots. Median values are also in Table 1. Note that the y-axis of the first panel is on a logarithmic scale; remaining panels are linear. snATAC: single-nucleus assays for transposase-accessible chromatin; scATAC: single-cell assays for transposase-accessible chromatin; ICICLE-seq: integrated cellular indexing of chromatin landscape and epitopes.

Figure 1—source data 1 Single cell metadata and QC metrics for scATAC-seq experiments .: https://cdn.elifesciences.org/articles/63632/elife-63632-fig1-data1-v1.zip
Download elife-63632-fig1-data1-v1.zip
Figure 1—source data 2 Fragment size distribution data .: https://cdn.elifesciences.org/articles/63632/elife-63632-fig1-data2-v1.zip
Download elife-63632-fig1-data2-v1.zip
Figure 1—source data 3 TSS Footprint pileups.: https://cdn.elifesciences.org/articles/63632/elife-63632-fig1-data3-v1.zip
Download elife-63632-fig1-data3-v1.zip
Figure 1—source data 4 CTCF Tn5 target site duplication center pileups.: https://cdn.elifesciences.org/articles/63632/elife-63632-fig1-data4-v1.zip
Download elife-63632-fig1-data4-v1.zip
Figure 1—source data 5 Fraction of reads per alignment category.: https://cdn.elifesciences.org/articles/63632/elife-63632-fig1-data5-v1.zip
Download elife-63632-fig1-data5-v1.zip

Table 1

QC metrics summary for experiments displayed in Figure 1.

Source type	FACS depletion	N pass QC	Median N fragments	Median mitochondrial reads	Median unique	Median in TSS	Median in peaks
Nuclei	Unsorted	4719	7344	43 \| 0.6%	5247 \| 71.6%	1332 \| 24.9%	2306 \| 43.9%
Nuclei	Dead/debris	5526	10,526	59 \| 0.6%	7284 \| 69.4%	2186.5 \| 30.1%	3647 \| 50.5%
Nuclei	Dead/debris/neutrophils	7769	19,972	136 \| 0.9%	11,528 \| 59.1%	4846 \| 41.5%	7503 \| 64.8%
Permeabilized cells	Unsorted	6329	5541	100 \| 1.9%	3308 \| 59.8%	871 \| 26.4%	1390 \| 42.2%
	Dead/debris	6956	6733.5	120 \| 2.0%	3795.5 \| 56.8%	1219.5 \| 32.6%	1874 \| 50.2%
	Dead/debris/neutrophils	9849	14,069	514 \| 4.0%	4756 \| 34.3%	2536 \| 54.4%	3650 \| 76.8%
For all metrics to the right of median N fragments, both the absolute number and a percentage are provided. Median % mitochondrial and median % unique were calculated as a fraction of total fragments; % in TSS and % in peaks were calculated as a fraction of unique fragments. TSS: transcription start sites; FACS: fluorescence-activated cell sorting.

Results

Optimization of single-nucleus and single-cell ATAC-seq of PBMCs

As a baseline for optimization and cell surface retention, we performed snATAC-seq as recommended by 10x Genomics, with a protocol based on the Omni-ATAC workflow (Corces et al., 2017). This single-nucleus assay utilizes a combination of hypotonic lysis, detergents, and a saponin to isolate nuclei without retaining mitochondrial DNA. After performing snATAC-seq using this method, sequencing, and tabulating data quality metrics (Materials and methods), we identified two major populations of cell barcodes (Figure 1b, left panel): (1) a large number of barcodes, shown in gray, that have a low number of unique fragments and a low fraction of reads in peaks (FRIP). These barcodes contain little useful information but consume 80% of total sequenced reads (Figure 1e, non-cell barcodes) at a sequencing depth of 200 million reads per library (20,000 reads per expected barcode). (2) Barcodes with higher quality as measured by FRIP (red points) that contain enough information to attempt downstream analysis.

The loss of 80% of sequenced reads to non-cell barcodes is costly. Previous studies of scRNA-seq data have shown that cellular lysis can release ambient RNA that increases the abundance of low-quality barcodes and contaminates droplets, yielding barcodes with both cellular and ambient RNAs that reduces the accuracy of the transcriptional readout (Marquina-Sanchez et al., 2020). We reasoned that nuclear isolation protocols may cause the release of ambient DNA, causing a similar effect in scATAC-seq datasets. Optimization of nuclear lysis protocols, especially changing to less stringent detergents, provided increased FRIP and decreased non-cell barcodes (Figure 1—figure supplement 1, Figure 1—figure supplement 2a). Hypotonic lysis conditions used in these protocols may also be a biophysical stressor to the native chromatin state, as previously observed (Lima et al., 2018). To reduce perturbation of chromatin and retain the cell surface for multimodal assays, we performed cell membrane permeabilization under isotonic conditions to allow access to the nuclear DNA without isolating nuclei through hypotonic lysis. The saponin digitonin was used to cause concentration-dependent selective permeabilization of cholesterol-containing membranes while leaving inner mitochondrial membranes intact, preventing high levels of Tn5 transposition in mitochondrial DNA (Adam et al., 1990; Colbeau et al., 1971). Digitonin has previously been used for ATAC-seq assays under hypotonic conditions in Fast-ATAC (Corces et al., 2016) and plate-based scATAC-seq (Chen et al., 2018b) protocols. Permeabilization of intact cells under isotonic conditions greatly reduced the number of non-cell barcodes and their contribution to sequencing libraries (Figure 1—figure supplement 2b).

Removal of neutrophils greatly increases PBMC scATAC-seq quality

We observed that PBMCs purified by leukapheresis rather than Ficoll gradient centrifugation had consistently higher FRIP scores and fewer non-cell barcodes (Figure 1—figure supplement 2c). A major difference between Ficoll-purified PBMCs and leukapheresis-purified PBMCs was the presence of residual neutrophils in Ficoll-purified samples. We tested removal of dead cells and debris with and without removal of neutrophils using fluorescence-activated cell sorting (FACS) from PBMC samples with high neutrophil content (Figure 1a, Figure 1—figure supplement 4). When applied to either nuclei (Figure 1b, left panels) or permeabilized cells (Figure 1b, right panels), there was a large increase in FRIP and reduction in non-cell barcodes in our scATAC-seq libraries (Figure 1e). Removal of neutrophils did not have an adverse effect on leukapheresis-purified PBMC data (Figure 1—figure supplement 2c, right panel). Depletion of neutrophils using anti-CD15 magnetic beads also improved data quality (Figure 1—figure supplement 2d, Figure 1—figure supplement 5), though not to the same extent as FACS-based depletion. Staining and flow cytometry using an eight-antibody panel (Supplementary file 1) on Ficoll- and leukapheresis-purified PBMCs showed minimal effect of the magnetic bead treatment on non-neutrophil cell-type abundance (Supplementary file 2).

Comparing single-cell and single-nucleus ATAC characteristics

We assessed the quantitative and qualitative differences between nuclei and permeabilized cell protocols with and without sorting by performing both protocols on a single set of input cells. To fairly compare data quality across methods, we equally downsampled raw data at the level of raw sequenced reads per well for each sample that we compare directly (see Materials and methods for details). We then utilized a uniform set of transcription start site (TSS) regions (TSS ± 2 kb) and a previously published PBMC peak set (Lareau et al., 2019, Materials and methods) as reference regions for computation of fraction of reads in TSS (FRITSS) and FRIP, respectively. Permeabilized cells yielded many more high-quality cell barcodes than nuclear preps using equal loading of cells or nuclei (15,000 loaded, expected 10,000 captured, Table 1). scATAC-seq libraries prepared from nuclei contained many more reads originating from nucleosomal DNA fragments (Figure 1b, lower panels), and non-cell barcodes from nuclei (gray lines) contained more of these fragments than cell barcodes. Thus, an overabundance of mononucleosomal fragments may indicate non-cell fragment contamination. Libraries from permeabilized cells consisted almost entirely of short fragments, suggesting that permeabilization under isotonic conditions did not loosen or release native chromatin structure at the time of tagmentation (Figure 1b, lower panels). Previous bulk ATAC-seq studies have shown that differing nuclear isolation protocols lead to varying amounts of mononucleosomal fragments (Li et al., 2019a). In agreement with in vitro experiments studying the effects of low salt on nucleosomal arrays (Allahverdi et al., 2015), this further suggests that hypotonic lysis leads to alteration of chromatin structure, raising the possibility of artifactual measurements of accessibility in nuclei-based ATAC-seq. To assess the effect that this difference has on the data obtained by each method, we overlaid Tn5 footprints near TSS ( Figure 1c) and CTCF transcription factor binding sites (TFBS, Figure 1d). The signal at TSS was retained in permeabilized cells, but positions flanking the TSS (occupied by neighboring nucleosomes) had reduced signal compared to isolated nuclei (examined in detail in Figure 1—figure supplement 6). At CTCF motifs, we observed nearly identical patterns of accessibility in both nuclei and permeabilized cells, suggesting that scATAC-seq signal at regulatory TFBS is retained in permeabilized cells. Neutrophil and dead cell removal improved the quality of nuclear scATAC-seq libraries, which yielded the highest number of unique fragments, reads in TSS, and reads in peaks, though at the cost of fewer high-quality barcodes captured when compared to permeabilized cells (Table 1). Overall, permeabilized intact cells obtained by FACS had the highest FRIP and FRITSS scores, fewest non-cell barcodes, and greatest cell capture efficiency with only a modest increase in mitochondrial reads (Table 1, Figure 1e, f).

Improved label transfer and differential analysis

We next examined the effect of methodological differences on downstream biological analyses (Figure 2). Removal of neutrophils greatly improved the ability to separate various cell types in uniform manifold approximation and projection (UMAP) projections of both nuclei and cells (Figure 2a, b). To provide ground truth for label transfer, we performed flow cytometry on an aliquot of the same PBMC sample used for scATAC-seq above. A panel of 25 antibodies (Supplementary file 3) was used to determine the proportion of each of the 12 cell types used to label the scATAC-seq cells in the PBMC sample (Figure 2—figure supplement 1 and Supplementary file 4). Label transfer was performed using the ArchR package (Granja et al., 2020) to generate gene scores that enabled label transfer from a reference scRNA-seq dataset using the method provided in the Seurat package (Stuart et al., 2019) (Materials and methods). Using these tools, removal of neutrophils improved label transfer scores, and permeabilized cells yielded more cells with high label transfer scores than nuclei-based approaches (Figure 2b, c). In addition, permeabilized cells provided labels most similar to the cell-type proportions identified by flow cytometry (Figure 2d), with identification of CD8 effector cells only observed in scATAC-seq with permeabilized cells. All methods yielded fewer CD16+ monocytes than observed by flow cytometry, suggesting that CD16+ monocytes may be lost during scATAC-seq using either nuclei or permeabilized cells, or that label transfer methods were not conducive to identifying this cell type (Figure 2d). After labeling cell types, we used ArchR to call peaks for each cell type and perform pairwise tests of differential accessibility between each pair of cell types (Figure 2—figure supplement 2a). We found many more differentially accessible sites in both cells and nuclei after removal of neutrophils. Differential accessibility was also used to identify differentially enriched TFBS motifs in each cell type (Figure 2—figure supplement 2b). Without neutrophil removal (nuclei unsorted, top panel), we were unable to identify significantly enriched motifs in B cells and NK cells that were readily apparent in data from clean nuclei or permeabilized cells (bottom two panels). Together, these results demonstrate that neutrophil removal and the use of permeabilized cells allow for identification of specific cell types and TFBS motifs that are involved in regulation of gene expression.

Figure 2 with 2 supplements see all

Download asset Open asset

Improvements to 2D projection and label transfer for scATAC-seq data.

(a) Uniform manifold approximation and projection (UMAP) projection plots for corresponding datasets in Figure 1. Points are colored based on a common scale of fraction of reads in peaks, bottom right. The number of cells in each panel is displayed in Figure 1b. (b) UMAP projection plots colored based on cell type obtained by label transfer from scRNA-seq (Materials and methods). Colors for cell types are below, to the right. (c) To visualize the number and quality of transferred labels, we ranked all cells based on the Seurat label transfer score obtained from label transfer results and plotted lines through the score (y-axis) vs. rank (x-axis) values. (d) Barplot showing the fraction of cells in each dataset that were assigned each cell-type label. The top row shows cell-type proportions for the same peripheral blood mononuclear cell sample obtained by 25-color immunotyping flow cytometry (Materials and methods, Supplementary file 3, and Figure 2—figure supplement 1). Root-mean-square deviation values were computed by comparison of labeled cell-type proportions to values derived from flow cytometry (Supplementary file 4). Colors for cell types are to the right of the barplot.

Figure 2—source data 1 Single cell UMAP coordinates and labeling scores.: https://cdn.elifesciences.org/articles/63632/elife-63632-fig2-data1-v1.zip
Download elife-63632-fig2-data1-v1.zip
Figure 2—source data 2 Fractions of cells assigned to each type by flow cytometry and scATAC-seq.: https://cdn.elifesciences.org/articles/63632/elife-63632-fig2-data2-v1.zip
Download elife-63632-fig2-data2-v1.zip

Joint measurement of accessibility and epitopes with ICICLE-seq

Under standard scATAC-seq protocols, removal of the cell membrane severs the connection between the cell surface and the chromatin state of cells. To test our ability to measure cell surface proteins and chromatin state simultaneously on permeabilized cells, we modified our optimized permeabilized cell scATAC-seq methodology to incorporate measurements using commercially available barcoded antibody reagents, and we term this new method ICICLE-seq (Figure 3 and Materials and methods). The ICICLE-seq protocol utilizes a custom Tn5 transposome complex with capture sequences compatible with the 10x Genomics 3' scRNA-seq gel bead capture reaction for simultaneous capture of ATAC fragments and polyadenylated antibody barcode sequences (Figure 3—figure supplement 1 and Supplementary file 5). Antibody-derived tags (ADTs) from oligo-antibody conjugates and ATAC-seq fragments can then be selectively amplified by PCR to generate separate libraries for sequencing (Figure 3—figure supplement 1). Due to the nature of fragment capture in this system, we obtain both a cell barcode and a single-end scATAC-seq read from the two ends of the paired-end sequencing reaction. We performed ICICLE-seq on a leukapheresis-purified PBMC sample using a 46-antibody panel (Supplementary file 6) and were able to obtain 10,227 single cells with both scATAC-seq and ADT data from three capture wells that passed adjusted QC criteria: >500 unique ATAC fragments (median = 761), FRIP >0.65 (median = 0.725). Cells passing ATAC QC had a median of 3871 ADT unique molecule indexes (UMIs) per cell (Figure 3—figure supplement 2b). UMAP projection and ATAC label transfer on ICICLE-seq data had resolution similar to scATAC-seq on intact permeabilized cells after dead cell and debris removal (Figure 3b). However, the data quality was limited by the poly-A-based capture method and the single-end readout of transposed fragments. Nonetheless, the ICICLE-seq results demonstrated that permeabilized cells enabled simultaneous capture of chromatin accessibility in the nucleus and high-quality cell surface epitope quantification. We were able to leverage the additional ADT data to cluster and identify cell types based on their cell surface antigens (Figure 3c) UMAP based on ADT data, and Jaccard–Louvain clustering allowed identification of cell-type-specific clusters (Figure 3d) based on clear association of cell-type-specific markers with clusters (Figure 3e, Figure 3—figure supplement 2c). Thus, ICICLE-seq provided a key proof of principle for directly linked capture of functional cell types and chromatin accessibility.

Figure 3 with 2 supplements see all

Download asset Open asset

Simultaneous profiling of chromatin accessibility and cell surface epitopes.

(a) Uniform manifold approximation and projection (UMAP) projection plot of integrated cellular indexing of chromatin landscape and epitopes (ICICLE-seq) cells based on single-cell assays for transposase-accessible chromatin (scATAC-seq) data. Cells are colored based on fraction of reads in peaks. n = 10,227 cells passing QC criteria are displayed. (b) UMAP projection of scATAC-seq data, as in (a). Cells are colored based on cell-type labels obtained by ArchR label transfer (Materials and methods). (c) UMAP projection plot of ICICLE-seq cells based on antibody-derived tag (ADT) data. Cells are colored according to the total number of unique molecule indexes across all markers. (d) UMAP projection based on ADT data, as in (c), colored according to cell-type labels derived from marker expression (Materials and methods). (e) Heatmap of median ADT count values for each marker in each cell type labeled in (d). Values are separately scaled in each row between zero and the maximum value (right column) for each marker.

Figure 3—source data 1 Cell type labels and UMAP coordinates for ICICLE-seq cells.: https://cdn.elifesciences.org/articles/63632/elife-63632-fig3-data1-v1.zip
Download elife-63632-fig3-data1-v1.zip
Figure 3—source data 2 ADT count data for ICICLE-seq.: https://cdn.elifesciences.org/articles/63632/elife-63632-fig3-data2-v1.zip
Download elife-63632-fig3-data2-v1.zip

Trimodal measurement of transcripts, epitopes, and accessibility with TEA-seq

With the release of a commercially available platform for simultaneous capture of RNA-seq and ATAC-seq from single nuclei, we reasoned that permeabilized cells could be used to perform simultaneous capture of three major molecular compartments: DNA can be captured using scATAC-seq, RNA could be captured using scRNA-seq, and protein epitope abundance can be captured using polyadenylated antibody barcodes, which we term TEA-seq after Transcripts, Epitopes, and Accessibility. After trials and optimization of key steps, we were able to obtain libraries on the 10x Genomics Multiome ATAC plus Gene Expression platform that combined all three of these measurements for thousands of single cells (Figure 4a) using a panel of 46 oligo-tagged antibodies (Supp Table 6). After initial data processing for cells loaded into four wells, we identified 29,264 cell barcodes that passed the QC criteria for scATAC-seq described above and had >2,500,500 unique ATAC-seq fragments (median = 8762 unique ATAC fragments, Figure 4b, d) and had >500 genes detected by scRNA-seq (median = 2399 RNA UMIs; median = 1,249,249 genes detected, Figure 4b), and >500 ADT UMIs (median = 1171 ADT UMIs, Figure 4c). Additional QC metrics are provided in Figure 4—figure supplement 1.

Figure 4 with 4 supplements see all

Download asset Open asset

Trimodal measurement of transcription, epitopes, and accessibility.

(a) Workflow diagram for the major steps in transcription, epitopes, and accessibility (TEA-seq). (b) Scatterplot comparing unique single-cell assays for transposase-accessible chromatin (scATAC-seq) fragments and scRNA-seq unique molecule indexes (UMIs) for each TEA-seq cell barcode. In (a, b, d, and e), n = 227,390,390 barcodes are displayed in total; 29,264 passing QC criteria are represented by purple points. (c) Scatterplot comparing unique scATAC-seq fragments and antibody-derived tags UMIs for each cell barcode. (d) scATAC-seq QC scatterplot comparing unique scATAC-seq fragments and fraction of reads in peaks scores for each cell barcode. n = 34,757,757 total cells are displayed (those with >1,000,000 unique ATAC fragments); 29,264 passing QC criteria are represented by purple points. (e) Uniform manifold approximation and projection (UMAP) projections generated using each of the three modalities separately. Only cells passing QC (n = 7,939,939 barcodes) are presented in (e, f). (f) A joint UMAP projection generated using three-way weighted nearest neighbors that leverages all three of the measured modalities. (g) Scatterplot showing the peak-to-RNA correlations (x-axis) and peak-to-protein (y-axis) correlation values for each peak (points) that was found to be correlated with the CCR2 gene or CD192 antibody in TEA-seq data. Histograms at the margins show the distribution of scores for RNA correlations (top) and protein correlations (right). Dashed line shows 1:1 correspondence. Peaks not found to be correlated in each method were assigned a score of 0. (h) Genome tracks showing links between peaks (red hashes) and the CCR2 gene based on protein expression (above peaks) and gene expression (below peaks). Correlations are represented by arcs colored based on the correlation score (color scale for both panels to the left). The bottom panel shows the gene neighborhood around CCR2. All coordinates are from the Hg38 genome assembly. (i), as in (g), for the CD38 gene and correlated peaks. (j), as in (h), for the CD38 gene locus.

Figure 4—source data 1 Single cell quality metrics for TEA-seq samples.: https://cdn.elifesciences.org/articles/63632/elife-63632-fig4-data1-v1.zip
Download elife-63632-fig4-data1-v1.zip
Figure 4—source data 2 Cell type labels and UMAP coordinates for TEA-seq samples.: https://cdn.elifesciences.org/articles/63632/elife-63632-fig4-data2-v1.zip
Download elife-63632-fig4-data2-v1.zip
Figure 4—source data 3 Peak to gene and peak to protein link correlations.: https://cdn.elifesciences.org/articles/63632/elife-63632-fig4-data3-v1.zip
Download elife-63632-fig4-data3-v1.zip

With the addition of scRNA-seq data, we were able to perform cell-type label transfer from RNA to RNA using Seurat label transfer (Stuart et al., 2019) rather than cross-modality transfer from RNA to ATAC (Materials and methods). For each modality, we were able to perform dimensionality reduction and UMAP projection to get separate views of the relationships between cells (Figure 4e), as for ICICLE-seq, above (Figure 3a, b). However, these views do not leverage the strength of linked multimodal measurements. To begin to take advantage of our tri-modal measurements, we extended a method recently described by Hao and Hao et al. for paired weighted nearest-neighbors (WNN) analysis (Hao et al., 2020) to allow an arbitrary number of simultaneously measured modalities to contribute to the WNN network (Materials and methods). After applying this three-way WNN, we could generate a UMAP embedding with contributions from all three simultaneously measured modalities (Figure 4f), which enhanced cell-type separation. We found that marker expression is consistent across all three modalities for some markers, such as CD8A, though we found that consistency across all modalities is not a universal characteristic of functional markers (Figure 4—figure supplement 2a). In agreement with observations by Hao and Hao et al. on two-modality data, the weights contributed by different modalities to the WNN analyses vary by cell type (Figure 4—figure supplement 2b).

Expanding cis-regulatory module identification with TEA-seq

Identification of putative cis-regulatory modules (CRMs) that control the functional state of cells is a distinct advantage of directly linked measurements of chromatin accessibility and gene expression (Cao et al., 2018; Ma et al., 2020). This linkage is dependent on reliable detection of gene (or protein) expression, which is greatly enhanced by the high sensitivity of epitope detection in methods like CITE-seq (Hao et al., 2020; Stoeckius et al., 2017), and now TEA-seq. Because TEA-seq allows direct linkage of chromatin accessibility to both gene expression and protein detection, we used these complementary modalities to investigate peak-to-gene (and peak-to-protein) correlations among the proteins we measured in our TEA-seq epitope panel. We leveraged the analytical capabilities of the ArchR package to compute peak-to-gene correlation (Granja et al., 2020), then substituted the more sensitive measurements afforded by ADT counts to compute peak-to-protein correlations. In cases where both gene and protein expression were reliably detected, these comparisons were confirmatory – many of the same putative CRMs that were identified in each comparison (Figure 4—figure supplement 3). For some targets, however, more sensitive protein abundance measures allowed detection of many additional correlated CRMs. We present two examples – one of CCR2/CD192 (Figure 4g, h), a chemokine receptor expressed on monocytes (Tsou et al., 2007), and CD38 (Figure 4i, j), a glycoprotein expressed on the surface of multiple immune cell types (Ferrero and Malavasi, 2002), and a target of therapies for multiple myeloma (Dima et al., 2020; Sanchez et al., 2016). In both cases, peak-to-protein correlations identified many CRMs were missed with peak-to-gene correlation alone. The converse is also observed in our experiment: reliably detected genes may enable detection of some regulatory regions that are not identified using protein detection alone – and peak-to-gene links based on scRNA-seq can be generated genome-wide for all expressed genes, whether or not they are targeted by the epitope panel.

Comparisons of TEA-seq to CITE-seq and scATAC-seq

To perform a direct assessment of TEA-seq data quality relative to previously published assays, we performed a comparison to standalone 10x scATAC-seq on permeabilized cells, CITE-seq using the 10x 3′ scRNA-seq kit, and both unstained nuclei and unstained permeabilized cells on the 10x Multiome kit (RNA + ATAC). All samples originated from the same pool of thawed and FACS sorted PBMCs (with dead cells, debris, and neutrophils removed). Separate aliquots were taken for scATAC-seq and the two unstained Multiome kit preps, while CITE-seq and TEA-seq samples were taken from the same pool of ADT-stained PBMCs (Figure 4—figure supplement 4a). In each case, 15,000 cells were used as input, and we downsampled a single well from the resulting libraries to equal sequencing depth for a fair comparison (200 × 10⁶ total fragments per well for RNA and ATAC libraries; 48 × 10⁶ for ADT libraries). We performed four wells of TEA-seq to generate a large reference dataset (shown in Figure 4), and one well for each other assay. For all comparisons presented in Figure 4—figure supplement 4, we used a single well of TEA-seq. In these direct comparisons, we found that TEA-seq results appeared qualitatively similar to CITE-seq and scATAC-seq across assays (Figure 4—figure supplement 4b, e, i). The alignment method used for these datasets allowed us to quantify the number of RNA UMIs that aligned to intronic vs. exonic regions of the transcriptome. We found that the fraction of reads mapping to exons was lower than that found in whole-cell CITE-seq data, similar to that seen using a lysed-cell nuclei preparation (median fraction of reads with exonic mapping: TEA-seq, 51.8%, Nuclei Multiome, 52.2%; CITE-seq, 69.5% Figure 4—figure supplement 4c), suggesting the primary source of RNAs in TEA-seq is intron-containing nuclear transcripts. We assessed gene detection by calculating the fraction of cells with UMIs > 0 for each gene and plotted comparisons between TEA-seq and 10x Multiome nuclei (Figure 4—figure supplement 4d) as well as CITE-seq (Figure 4—figure supplement 4g). We found that gene detection was most like nuclei and lower in many cases than CITE-seq. One group of genes stands out as particularly low, originating from ribosomal gene transcripts (with gene symbols starting with RPL and RPS, highlighted in blue). In comparisons of scATAC-seq metrics, we found that TEA-seq performed comparably to standalone permeabilized cell scATAC-seq or Multiome RNA + ATAC using permeabilized cells, with higher FRIP than purified nuclei used for the same assay (Figure 4—figure supplement 4e, f). Finally, in comparisons of protein detection by ADTs, we found that antibody performance in TEA-seq varied by target. Many bimodal distributions were retained (e.g., CD4, CD8a, CD86), while some separations were less apparent (e.g., CD45RO and CD45RA; Figure 4—figure supplement 4h), resulting in reduced separation in UMAP analysis for some types (i.e., naïve vs. mature CD4 cells), though most major cell types were able to be separated (Figure 4—figure supplement 4i). Together, these results show that TEA-seq data is of high quality but may not always be the method of choice if a particular modality is of greatest interest to researchers. The strength of TEA-seq is its ability to measure all three aspects – transcription, epitopes, and accessibility – simultaneously on each cell, enabling deeper insights into the connections between regulation, expression, and function as demonstrated in Figure 4.

Discussion

Multimodal data collection from permeabilized PBMCs will be of use to many researchers in the immunology field and beyond who seek to get the most high-quality data from precious clinical samples and to facilitate cross-datastream integration with flow cytometry. We found that isotonic cell permeabilization allows the generation of scATAC-seq libraries with high quality as measured by FRIP and low nucleosomal content, suggesting that chromatin state in the nucleus is unperturbed (Figure 1, Figure 2). While a previous study utilized FACS followed by scATAC on individually sorted cells (Pi-ATAC; Chen et al., 2018c), the use of permeabilized cells enables simultaneous interrogation of chromatin accessibility state in the nucleus and the functional state of cells based on their cell surface proteins (ICICLE-seq, Figure 3) together with mRNA (TEA-seq, Figure 4) at unprecedented scale for the first time. Our methods utilize permeabilized cells to allow the addition of protein measurements that provide a direct link between high-quality scRNA-seq data and scATAC-seq data using truly paired methods extending beyond SNARE-seq (Chen et al., 2019) and SHARE-seq (Ma et al., 2020) by taking advantage of 10x Genomics Single Cell Multiome sequencing reagents as a platform for nucleic acid capture. The use of a droplet-based method allows multimodal measurements at a much higher scale by capturing thousands of single cells at a time. We expect that the datasets generated by our study will be useful beyond researchers studying PBMC biology, including those seeking to develop new analytical tools to remove background noise from existing sn/scATAC-seq data (e.g., SCALE, Xiong et al., 2019, or scOpen, Li et al., 2019b) and tools to accurately identify links across modalities (e.g., AI-TAC, Tasaki et al., 2020).

We have focused on PBMCs during this initial development and demonstration of the utility of TEA-seq. PBMCs are a widely used source of clinically relevant heterogeneous cell populations for monitoring clinical disease status and research. As such, TEA-seq optimized for PBMCs will provide unprecedented biological insight into the molecular underpinnings of human immune health and disease. It is likely that the use of TEA-seq in other cell types will require sample-specific optimization. For example, the requirement for neutrophil depletion may be limited to studies related to the immune system or immune-infiltrated tissues. However, the value of debris and dead cell removal is likely to be applicable to disaggregated tissue samples from biopsies (Donlin et al., 2018). Likewise, researchers should take care when assessing fragment length distributions as a quality control metric. In PBMCs, we found that high abundance of mononucleosomal fragments may indicate neutrophil-driven background, but this indicator of poor data quality may not be observed in other cell types or tissues. Early versions of nuclei-based ATAC-seq suffered issues high levels of mitochondrial reads (Corces et al., 2017), which may be used to profile clonal populations based on their mitochondrial genomes (Lareau et al., 2020). Here, we have used low concentrations of digitonin to selectively permeabilize cholesterol-containing membranes, thereby leaving the inner mitochondrial membrane intact (Adam et al., 1990; Colbeau et al., 1971). Different cell types have been shown to have differing sensitivity to digitonin concentrations, perhaps reflecting natural variance in membrane cholesterol composition (Holden and Horton, 2009). As such, cell permeabilization for TEA-seq may need to be optimized for other cell types. This can be readily tested using bulk ATAC-seq and by measuring library complexity, FRIP score, and mitochondrial contamination to determine data quality.

In addition to the Methods presented in this article, we are maintaining an online protocol to aid the research community in further development and adoption of TEA-seq on the Protocols.io platform (https://doi.org/10.17504/protocols.io.bqagmsbw). Additional development could allow simultaneous measurement of additional aspects of cell biology (e.g., CpG sequencing methods, scCUT&Tag; Bartosovic et al., 2020), as well as improvements to the modalities in the TEA-seq trio (e.g., s3-ATAC, Mulqueen et al., 2021). In direct comparisons of TEA-seq to existing methods (CITE-seq and scATAC-seq), we found that the sensitivity of gene detection was lower and distributions of antibody detections were altered relative to CITE-seq; however, ATAC-seq data quality remained comparable to the standalone assay (Figure 4—figure supplement 4). We anticipate that simultaneous, truly multimodal measurement across molecular compartments will be an essential tool to expand our view of the full picture of immune cell state in health and disease, with many applications extending throughout the genomics research community.

Materials and methods

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Biological sample (Homo sapiens)	Primary peripheral blood mononuclear cells	BioIVT, Westbury, NY, USA	SERATRIALS-18002; sample BRH1291132	WIRB protocol # 20190318; Ficoll-purified
Biological sample (Homo sapiens)	Primary peripheral blood mononuclear cells	BioIVT, Westbury, NY, USA	SERATRIALS-18002; sample HMN85396	WIRB protocol # 20190318; leukapheresis-purified
Biological sample (Homo sapiens)	Primary peripheral blood mononuclear cells	BioIVT, Westbury, NY, USA	SERATRIALS-18002; sample RG1131	WIRB protocol # 20190318; Ficoll-purified
Biological sample (Homo sapiens)	Primary peripheral nlood mononuclear cells	Bloodworks NW, Seattle, WA, USA	BT001; sample 5716BW	WIRB protocol # 20141589; Ficoll-purified
Biological sample (Homo sapiens)	Primary peripheral blood mononuclear cells	Bloodworks NW, Seattle, WA, USA	BT001; sample 7811BW	WIRB protocol # 20141589; Ficoll-purified
Antibody	Biotin anti-human CD15 (SSEA-1) antibody (mouse monoclonal)	BioLegend	Cat# 301913, RRID:AB_2561325	(0.15 µL per million cells)
Antibody	BUV395 anti-human CD14 antibody (mouse monoclonal)	BD Biosciences	Cat# 563561, RRID:AB_2744288	(1 µL per million cells)
Antibody	BUV496 mouse anti-human CD45 antibody (mouse monoclonal)	BD Biosciences	Cat# 750179, RRID:AB_2868405	(1 µL per million cells)
Antibody	BUV563 mouse anti-human CD15 antibody (mouse monoclonal)	BD Biosciences	Cat# 741417, RRID:AB_2868406	(1 µL per million cells)
Antibody	BUV615 mouse anti-human CD56 antibody (mouse monoclonal)	BD Biosciences	Cat# 613001, RRID:AB_2868413	(2 µL per million cells)
Antibody	Brilliant Violet 650 anti-human CD3 antibody (mouse monoclonal)	BioLegend	Cat# 300468, RRID:AB_2629574	(2 µL per million cells)
Antibody	Mouse anti-human CD19 High Parameter Custom Antibody BB790-P Conjugate (mouse monoclonal)	BD Biosciences	Cat# 624296	Custom reagent; conjugate of mouse anti-human CD19 (clone HIB19) and BB790-P (1 µL per million cells)
Antibody	PE/Cy5 anti-human CD16 antibody (mouse monoclonal)	BioLegend	Cat# 302009, RRID:AB_314209	(1 µL per million cells)
Antibody	BUV395 mouse anti-human CD3 antibody (mouse monoclonal)	BD Biosciences	Cat# 563546, RRID:AB_2744387	Custom reagent; conjugate of mouse anti-human CD3 (clone UCHT1) and BUV395 (1:50 in 100 µL volume)
Antibody	Mouse anti-human CD45RA High Parameter Custom Antibody BUV615-P Conjugate (mouse monoclonal)	BD Biosciences	Cat# 624297	Custom reagent; conjugate of mouse anti-human CD45RA (clone HI100) and BUV615-P (1:5 in 100 µL volume)
Antibody	BUV661 mouse anti-human CD14 antibody (mouse monoclonal)	BD Biosciences	Cat# 741684, RRID:AB_2868407	(1:50 in 100 µL volume)
Antibody	BUV737 mouse anti-human CD8 antibody (mouse monoclonal)	BD Biosciences	Cat# 749367, RRID:AB_2868408	(1:5 in 100 µL volume)
Antibody	BUV805 mouse anti-human CD11c antibody (mouse monoclonal)	BD Biosciences	Cat# 742005, RRID:AB_2868409	(1:33 in 100 µL volume)
Antibody	BV421 mouse anti-human CD25 antibody (mouse monoclonal)	BD Biosciences	Cat# 562442, RRID:AB_11154578	(1:50 in 100 µL volume)
Antibody	BV480 mouse anti-human CD4 (mouse monoclonal)	BD Biosciences	Cat# 566104, RRID:AB_2739506	(1:5 in 100 µL volume)
Antibody	BV605 mouse anti-Human CD16 (FcγRIII) antibody (mouse monoclonal)	BD Biosciences	Cat# 563172, RRID:AB_2744297	(1:50 in 100 µL volume)
Antibody	Brilliant Violet 650 anti-human CD123 antibody (mouse monoclonal)	BioLegend	Cat# 306020, RRID:AB_2563827	(1:50 in 100 µL volume)
Antibody	Brilliant Violet 711 anti-human CD127 (IL-7Rα) antibody (mouse monoclonal)	BioLegend	Cat# 351328, RRID:AB_2562908	(1:50 in 100 µL volume)
Antibody	BV750 mouse anti-human IgD antibody (mouse monoclonal)	BD Biosciences	Cat# 747484, RRID:AB_2868411	(1:20 in 100 µL volume)
Antibody	BV786 mouse anti-human neuropilin-1 (CD304) antibody (mouse monoclonal)	BD Biosciences	Cat# 743132, RRID:AB_2741299	(1:50 in 100 µL volume)
Antibody	BB515 mouse anti-human CD141 antibody (mouse monoclonal)	BD Biosciences	Cat# 565084, RRID:AB_2739058	(1:33 in 100 µL volume)
Antibody	CD11b monoclonal antibody, PerCP-Cy5.5 conjugated (rat monoclonal)	BD Biosciences	Cat# 561114, RRID:AB_2033995	(1:5 in 100 µL volume)
Antibody	PE anti-human CD27 antibody (mouse monoclonal)	BioLegend	Cat# 302808, RRID:AB_314300	(1:20 in 100 µL volume)
Antibody	PE/Dazzle 594 anti-human TCRα/βantibody (mouse monoclonal)	BioLegend	Cat# 306726, RRID:AB_2566599	(1:33 in 100 µL volume)
Antibody	Mouse anti-CD34 monoclonal antibody, PE-Cy5 conjugated (mouse monoclonal)	BD Biosciences	Cat# 555823, RRID:AB_396152	(1:20 in 100 µL volume)
Antibody	PE/Cy7 anti-human CD197 (CCR7) antibody (mouse monoclonal)	BioLegend	Cat# 353226, RRID:AB_11126145	(1:33 in 100 µL volume)
Antibody	APC anti-human CD38 antibody (mouse monoclonal)	BioLegend	Cat# 356606, RRID:AB_2561902	(1:20 in 100 µL volume)
Antibody	APC-R700 mouse anti-human CD56 (NCAM-1) antibody (mouse monoclonal)	BD Biosciences	Cat# 565139, RRID:AB_2744429	(1:20 in 100 µL volume)
Antibody	APC/Cyanine7 anti-human HLA-DR antibody (mouse monoclonal)	BioLegend	Cat# 307618, RRID:AB_493586	(1:20 in 100 µL volume)
Antibody	TotalSeq-A0062 anti-human CD10 antibody (mouse monoclonal)	BioLegend	Cat# 312231, RRID:AB_2734286	(0.5 µg per million cells)
Antibody	TotalSeq-A0161 anti-human CD11b antibody (mouse monoclonal)	BioLegend	Cat# 301353, RRID:AB_2734249	(0.05 µg per million cells)
Antibody	TotalSeq-A0053 anti-human CD11c antibody (mouse monoclonal)	BioLegend	Cat# 371519, RRID:AB_2749971	(0.025 µg per million cells)
Antibody	TotalSeq-A0064 anti-human CD123 antibody (mouse monoclonal)	BioLegend	Cat# 306037, RRID:AB_2749977	(0.1 µg per million cells)
Antibody	TotalSeq-A0390 anti-human CD127 (IL-7Rα) antibody (mouse monoclonal)	BioLegend	Cat# 351352, RRID:AB_2734366	(0.025 µg per million cells)
Antibody	TotalSeq-A0081 anti-human CD14 antibody (mouse monoclonal)	BioLegend	Cat# 301855, RRID:AB_2734254	(0.2 µg per million cells)
Antibody	TotalSeq-A0163 anti-human CD141 (thrombomodulin) antibody (mouse monoclonal)	BioLegend	Cat# 344121, RRID:AB_2783229	(0.1 µg per million cells)
Antibody	TotalSeq-A0083 anti-human CD16 antibody (mouse monoclonal)	BioLegend	Cat# 302061, RRID:AB_2734255	(0.05 µg per million cells)
Antibody	TotalSeq-A0408 anti-human CD172a (SIRPα) antibody (mouse monoclonal)	BioLegend	Cat# 372109, RRID:AB_2783285	(0.25 µg per million cells)
Antibody	TotalSeq-A0144 anti-human CD185 (CXCR5) antibody (mouse monoclonal)	BioLegend	Cat# 356937, RRID:AB_2750356	(0.125 µg per million cells)
Antibody	TotalSeq-A0050 anti-human CD19 antibody (mouse monoclonal)	BioLegend	Cat# 302259, RRID:AB_2734256	(0.2 µg per million cells)
Antibody	TotalSeq-A0242 anti-human CD192 (CCR2) antibody (mouse monoclonal)	BioLegend	Cat# 357229, RRID:AB_2750501	(0.5 µg per million cells)
Antibody	TotalSeq-A0148 anti-human CD197 (CCR7) antibody (mouse monoclonal)	BioLegend	Cat# 353247, RRID:AB_2750357	(0.5 µg per million cells)
Antibody	TotalSeq-A0181 anti-human CD21 antibody (mouse monoclonal)	BioLegend	Cat# 354915, RRID:AB_2750006	(0.05 µg per million cells)
Antibody	TotalSeq-A0180 anti-human CD24 antibody (mouse monoclonal)	BioLegend	Cat# 311137, RRID:AB_2750374	(0.5 µg per million cells)
Antibody	TotalSeq-A0085 anti-human CD25 antibody (mouse monoclonal)	BioLegend	Cat# 302643, RRID:AB_2734258	(0.08 µg per million cells)
Antibody	TotalSeq-A0056 anti-human CD269 (BCMA) antibody (mouse monoclonal)	BioLegend	Cat# 357521, RRID:AB_2749974	(0.5 µg per million cells)
Antibody	TotalSeq-A0191 anti-mouse/rat/human CD27 antibody (Armenian hamster monoclonal)	BioLegend	Cat# 124235, RRID:AB_2750344	(0.125 µg per million cells)
Antibody	TotalSeq-A0171 anti-human/mouse/rat CD278 (ICOS) antibody (Armenian hamster monoclonal)	BioLegend	Cat# 313555, RRID:AB_2800824	(0.01 µg per million cells)
Antibody	TotalSeq-A0088 anti-human CD279 (PD-1) antibody (mouse monoclonal)	BioLegend	Cat# 329955, RRID:AB_2734322	(0.1 µg per million cells)
Antibody	TotalSeq-A0034 anti-human CD3 antibody (mouse monoclonal)	BioLegend	Cat# 300475, RRID:AB_2734246	(0.05 µg per million cells)
Antibody	TotalSeq-A0406 anti-human CD304 (neuropilin-1) antibody	BioLegend	Cat# 354525, RRID:AB_2783261	(0.1 µg per million cells)
Antibody	TotalSeq-A0830 anti-human CD319 (CRACC) antibody (mouse monoclonal)	BioLegend	Cat# 331821, RRID:AB_2800872	(0.5 µg per million cells)
Antibody	TotalSeq-A0410 anti-human CD38 antibody (mouse monoclonal)	BioLegend	Cat# 356635, RRID:AB_2800967	(0.05 µg per million cells)
Antibody	TotalSeq-A0176 anti-human CD39 antibody (mouse monoclonal)	BioLegend	Cat# 328233, RRID:AB_2750005	(0.05 µg per million cells)
Antibody	TotalSeq-A0072 anti-human CD4 antibody (mouse monoclonal)	BioLegend	Cat# 300563, RRID:AB_2734247	(0.1 µg per million cells)
Antibody	TotalSeq-A0031 anti-human CD40 antibody (mouse monoclonal)	BioLegend	Cat# 334346, RRID:AB_2749968	(0.25 µg per million cells)
Antibody	TotalSeq-A0063 anti-human CD45RA antibody (mouse monoclonal)	BioLegend	Cat# 304157, RRID:AB_2734267	(0.0625 µg per million cells)
Antibody	TotalSeq-A0087 anti-human CD45RO antibody (mouse monoclonal)	BioLegend	Cat# 304255, RRID:AB_2734268	(0.1 µg per million cells)
Antibody	TotalSeq-A0047 anti-human CD56 (NCAM) antibody (mouse monoclonal)	BioLegend	Cat# 362557, RRID:AB_2749970	(0.15 µg per million cells)
Antibody	TotalSeq-A0166 anti-human CD66b antibody (mouse monoclonal)	BioLegend	Cat# 392905, RRID:AB_2750372	(0.25 µg per million cells)
Antibody	TotalSeq-A0394 anti-human CD71 antibody (mouse monoclonal)	BioLegend	Cat# 334123, RRID:AB_2800884	(0.025 µg per million cells)
Antibody	TotalSeq-A0005 anti-human CD80 antibody (mouse monoclonal)	BioLegend	Cat# 305239, RRID:AB_2749958	(0.5 µg per million cells)
Antibody	TotalSeq-A0006 anti-human CD86 antibody (mouse monoclonal)	BioLegend	Cat# 305443, RRID:AB_2734273	(0.05 µg per million cells)
Antibody	TotalSeq-A0080 anti-human CD8a antibody (mouse monoclonal)	BioLegend	Cat# 301067, RRID:AB_2734248	(0.2 µg per million cells)
Antibody	TotalSeq-A0156 anti-human CD95 (Fas) antibody (mouse monoclonal)	BioLegend	Cat# 305649, RRID:AB_2750368	(0.1 µg per million cells)
Antibody	TotalSeq-A0352 anti-human FcεRIα antibody (mouse monoclonal)	BioLegend	Cat# 334641, RRID:AB_2750503	(0.5 µg per million cells)
Antibody	TotalSeq-A0159 anti-human HLA-DR antibody (mouse monoclonal)	BioLegend	Cat# 307659, RRID:AB_2750001	(0.25 µg per million cells)
Antibody	TotalSeq-A0384 anti-human IgD antibody (mouse monoclonal)	BioLegend	Cat# 348243, RRID:AB_2783238	(0.05 µg per million cells)
Antibody	TotalSeq-A0090 Mouse IgG1, κ isotype Ctrl antibody (mouse monoclonal)	BioLegend	Cat# 400199, RRID:AB_2868412	(0.5 µg per million cells)
Antibody	TotalSeq-A0136 anti-human IgM antibody (mouse monoclonal)	BioLegend	Cat# 314541, RRID:AB_2749992	(0.05 µg per million cells)
Antibody	TotalSeq-A0153 anti-human KLRG1 (MAFA) antibody (mouse monoclonal)	BioLegend	Cat# 367721, RRID:AB_2750373	(0.25 µg per million cells)
Antibody	TotalSeq-A0584 anti-human TCR Vα24-Jα18 (iNKT cell) antibody (mouse monoclonal)	BioLegend	Cat# 342923, RRID:AB_2783227	(0.5 µg per million cells)
Antibody	TotalSeq-A0581 anti-human TCR Vα7.2 antibody (mouse monoclonal)	BioLegend	Cat# 351733, RRID:AB_2783246	(0.05 µg per million cells)
Antibody	TotalSeq-A0224 anti-human TCR α/β antibody (mouse monoclonal)	BioLegend	Cat# 306737, RRID:AB_2783167	(0.125 µg per million cells)
Antibody	TotalSeq-A0139 anti-human TCR γ/δ antibody (mouse monoclonal)	BioLegend	Cat# 331229, RRID:AB_2734325	(0.5 µg per million cells)
Sequence-based reagent	MOSAIC_Bot	Integrated DNA Technologies	Tn5 complex oligo	5'-/5phos/CTGTCTCTTATACACATCT-'3; standard desalting
Sequence-based reagent	Tn5ME-s7_Top	Integrated DNA Technologies	Tn5 complex oligo	5'-GTCTCGTGGGCTCGGAGA TGTGTATAAGAGACAG-3'; standard desalting
Sequence-based reagent	Poly-A Top-L	Integrated DNA Technologies	Tn5 complex oligo	5'-TTTTTTTTTTTTTTT TTTTTTTTTTTTTTT VNAGATGTGTATAAGAGACAG -3'; standard desalting
Sequence-based reagent	SI-P5-22	Integrated DNA Technologies	PCR primer	5'-AATGATACGGCGACCACCG AGATCTACACTCTTTCCCTACAC GACGCTCTTCCGATCT-3'; standard desalting
Sequence-based reagent	F BC primer	Integrated DNA Technologies	PCR primer	5'-CTACACGACGCT CTTCCGATCT-3'; standard desalting
Sequence-based reagent	ADT-Rev-AMP	Integrated DNA Technologies	PCR primer	5'-CCTTGGCACCCGAGAATTCC-3'; standard desalting
Sequence-based reagent	TruSeq R1 Seq primer	Integrated DNA Technologies	PCR primer	5'-ACACTCTTTCCCTACACG ACGCTCTTCCGATCT-3'; standard desalting
Sequence-based reagent	ADT-i7 primer	Integrated DNA Technologies	PCR primer	5'-CAAGCAGAAGACGGCA TACGAGATNNNNNNNN GTGACTGGAGTTCCT TGGCACCCGAGAATTCCA-3'; PAGE purification; = phosphorothioate bond; N’s indicate barcode position
Sequence-based reagent	SI-PCR-Oligo	Integrated DNA Technologies	PCR primer	5'-AATGATACGGCGACCA CCGAGATCTACACTCTT TCCCTACACGACGCTC-3'; standard desalting
Peptide, recombinant protein	Tn5 transposase	Beta Lifescience	TN5-BL01
Commercial assay or kit	Chromium Next GEM Single Cell 3' GEM, Library & Gel Bead Kit v3.1	10x Genomics	1000121
Commercial assay or kit	Next GEM Chip G Single Cell Kit	10x Genomics	1000120
Commercial assay or kit	Single Index Kit T Set A	10x Genomics	1000213
Commercial assay or kit	Chromium Next GEM Single Cell ATAC Library & Gel Bead Kit v1.1	10x Genomics	1000175
Commercial kit or assay	Next GEM Chip H Single Cell Kit	10x Genomics	1000161
Commercial kit or assay	Single Index Kit N Set A	10x Genomics	1000212
Commercial kit or assay	Chromium Next GEM Single Cell Multiome ATAC + Gene Expression Reagent Bundle	10x Genomics	1000283
Commercial kit or assay	Next GEM Chip J Single Kit	10x Genomics	1000234
Commercial kit or assay	Dual Index Kit TT Set A	10x Genomics	1000215
Commercial kit or assay	Quant-iT PicoGreen dsDNA Assay Kit	Thermo Fisher Scientific	P7589
Commercial kit or assay	Library Quantification Kit, Illumina Platform	KAPA Biosystems	KK4844
Commercial kit or assay	Bioanalyzer high-sensitivity DNA analysis	Agilent Technologies	5067-4626
Software, algorithm	R	The R Foundation	RRID:SCR_001905	v3.6.3 and >4.0.2
Software, algorithm	ArchR	Jeffrey Granja and Ryan Corces		v1.0.1
Software, algorithm	BarCounter	Elliott Swanson; this study
Software, algorithm	cellranger	10x Genomics		v5.0.0
Software, algorithm	cellranger-atac	10x Genomics		v1.1.0 and v1.2.0
Software, algorithm	cellranger-arc	10x Genomics		v1.0.0
Software, algorithm	Seurat	Paul Hoffman, Satija Lab, and Collaborators	RRID:SCR_016341	v4.0-beta
Software, algorithm	FlowJo	BD		v10.7
Other	Digitonin	MP Biomedicals	0215948082
Other	Human TruStain FcX	BioLegend	422302
Other	ViaStain acridine orange/propidium iodide solution	Nexcelom Bioscience	CS2-0106-25mL
Other	Dynabeads MyOne SILANE	Thermo Fisher Scientific	37002D
Other	Fixable Viability Stain 510	BD Biosciences	564406
Other	IGEPAL-CA630	Sigma	I8896	CAS 9002-93-1
Other	SPRIselect	Beckman Coulter	B23319
Other	NEBNext Ultra II Q5 Master Mix	New England Biolabs	M0544
Other	KAPA HiFi HotStart ReadyMix	KAPA Biosystems	KM2602
Other	Protector RNase Inhibitor	Sigma-Aldrich	03335399001
Other	Fixable Viability Stain 510	BD Biosciences	Cat# 564406, RRID:AB_2869572

Share this article

Cite this article

Improvements to scATAC-seq methods to enable permeabilized cell profiling.

Figure 1—source data 1

Figure 1—source data 2

Figure 1—source data 3

Figure 1—source data 4

Figure 1—source data 5

QC metrics summary for experiments displayed in Figure 1.

Improvements to 2D projection and label transfer for scATAC-seq data.

Figure 2—source data 1

Figure 2—source data 2

Simultaneous profiling of chromatin accessibility and cell surface epitopes.

Figure 3—source data 1

Figure 3—source data 2

Trimodal measurement of transcription, epitopes, and accessibility.

Figure 4—source data 1

Figure 4—source data 2

Figure 4—source data 3

Author details

Elliott Swanson

Contribution

Competing interests

Cara Lord

Present address

Contribution

Competing interests

Julian Reading

Contribution

Competing interests

Alexander T Heubeck

Contribution

Competing interests

Palak C Genge

Contribution

Competing interests

Zachary Thomson

Contribution

Competing interests

Morgan DA Weiss

Contribution

Competing interests

Xiao-jun Li

Contribution

Competing interests

Adam K Savage

Contribution

Competing interests

Richard R Green

Present address

Contribution

Competing interests

Troy R Torgerson

Contribution

Competing interests

Thomas F Bumol

Contribution

Competing interests

Lucas T Graybuck

Contribution

For correspondence

Competing interests

Peter J Skene

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism