Schematic of potential off-target binding in 10x Genomics Xenium.

In this illustration, the arms of the padlock probes were designed to bind an RNA sequence intended to correspond to a target gene (green). However, these probes exhibit off-target binding and bind to an RNA sequence in a different off-target gene (red). The probe is circularized and subsequently amplified via rolling circle amplification (RCA). Hybridization of fluorescent probes to the RCA product enables the generation a fluorescent signal that is used to quantify RNA expression within cells.

OPT output of genes with predicted off-target binding based on perfect sequence homology using GENCODE v47.

This table shows the 37 genes whose probes in the 10x Genomics Xenium v1 Human Breast Gene Expression Panel exhibit predicted off-target probe binding, where each off-target alignment involves a perfect 40bp match to the probe sequence. Although OPT predicted off-target binding of CCPG1 probe sequences to the DNAAF1-CCPG1 gene, we manually excluded it from our list because DNAAF1-CCPG1 is a read-through gene containing portions of both DNAAF1 and CCPG1. The final column shows the gene types, in order, of each of the off-target genes shown in column 3. Abbreviations: PC = protein-coding; PG = pseudogene; NMD = nonsense-mediated decay; lncRNA = long non-coding RNA.

Comparison of spatial gene expression patterns between Xenium and Visium.

(A) Spatial gene expression of MS4A1 overlaid on the corresponding histological images for Xenium and Visium, accompanied by a density plot comparing Xenium vs. Visium MS4A1 expression. The dotted line indicates the identity line (X = Y), and the solid line represents the line of best fit. (B) Gene expression patterns for APOBEC3B: Xenium expression, Visium expression, the aggregated Visium expression combining APOBEC3B and its predicted off-target gene’s expression APOBEC3D and APOBEC3F, and Visium expression of APOBEC3B’s predicted off-targets APOBEC3D and APOBEC3F. Two density plots are shown: one comparing Xenium vs. Visium for APOBEC3B alone, and one comparing Xenium vs. the aggregated Visium expression of APOBEC3B with all off-targets. The dotted line indicates the identity line (X = Y), and the solid line represents the line of best fit. (C) Scatterplot of log-transformed total expression counts (with a pseudocount) for 307 genes comparing Visium and Xenium data. The dotted line indicates the identity line (X = Y), and points (genes) are colored by probe information.

Comparison of single-cell gene expression patterns between Xenium and scRNA-seq.

(A) Harmonized UMAP visualization of MS4A1 expression for Xenium and scRNA-seq data, accompanied by a scatterplot comparing Xenium vs. scRNA-seq MS4A1 cluster expression. The dotted line indicates the identity line (X = Y), and the solid line represents the line of best fit. (B) Comparison of APOBEC3B expression patterns on harmonized UMAP: Xenium expression, scRNA-seq expression, an aggregated scRNA-seq profile combining APOBEC3B and its predicted off-target gene’s expression APOBEC3D and APOBEC3F, and scRNA-seq expression of APOBEC3B’s predicted off-targets APOBEC3D and APOBEC3F. Two scatterplots are shown: one comparing Xenium vs. scRNA-seq for APOBEC3B cluster expression alone, and one comparing Xenium vs. the aggregated scRNA-seq cluster expression of APOBEC3B and it’s predicted off-targets. The dotted line indicates the identity line (X = Y), and the solid line represents the line of best fit. (C) Scatterplot of log-transformed total expression counts (with a pseudocount) for 313 genes between Visium and scRNA-seq data. The dotted line indicates the identity line (X = Y), and points (genes) are colored by probe information.

Screenshot from the Integrated Genome Viewer (IGV) showing the 40bp probe sequence (ID: ENSG00000196154|S100A4|ab4e3dc) that matches both S100A5 and S100A4.

Shown are 6 isoforms from the CHESS v3.1 annotation, 4 from GENCODE basic v47, and 3 from RefSeq v110 for S100A4, as well as 2 RefSeq isoforms for the neighboring S100A5. The probe sequence aligns to the overlapping region between S100A5 and S100A4 gene loci. Matching probe shown in a zoomed-in view below. The forward- and reverse-strand sequences of the probe are shown, and the highlighted area indicates approximately where the probe falls within the gene.

UpSet plot illustrating the overlap of protein-coding genes across three genome annotations: GENCODE basic, RefSeq, and CHESS.

(A) Overlap regions between Visium (orange outline) and Xenium (blue outline) data, shown on the Xenium histological image and the Visium histological image, respectively. (B) Log transformed aggregated total gene counts for spots (∼55μm x 55μm) in both Xenium and Visium datasets, overlaid on their corresponding histological image.

Screenshots from the Integrated Genome Viewer (IGV) showing alignments of (A) four 40bp probes targeting APOBEC3B (ENSG00000179750|APOBEC3B|17d76bb, ENSG00000179750|APOBEC3B|e03f8ab, ENSG00000179750|APOBEC3B|5991db9, and ENSG00000179750|APOBEC3B|59c9349). (B) All four probes align to their intended target gene APOBEC3B while two of the four probes align to each off-target gene: (C) APOBEC3D and (D) APOBEC3F. The forward- and reverse-strand sequences of the probe are shown, and the highlighted areas indicate approximately where the probe falls within the gene. Panels (B-D) share a common legend.

UMAP visualization of integrated scRNA-seq and Xenium datasets: (A) before harmony batch correction and (B) after harmony batch correction. (C) Leiden clustering results on the harmonized UMAP.

(A) Schematic illustrating that hybridization may still occur even when there is a sequence mismatch at the non-ligated ends of the probe sequence. (B) Schematic depicting how probes could bind to each other instead of to their intended target.

(A) Gene expression patterns for ACTG2: Xenium expression, Visium expression, the aggregated Visium expression combining ACTG2 and its predicted off-target gene’s expression ACTA1, ACTB, and POTEM, and Visium expression of ACTG2’s predicted off-targets ACTA1, ACTB, and POTEM. Two density plots are shown: one comparing Xenium vs. Visium for ACTG2 alone, and one comparing Xenium vs. the aggregated Visium expression. The dotted line indicates the identity line (X = Y), and the solid line represents the line of best fit. (B) Gene expression patterns for TUBB2B: Xenium expression, Visium expression, the aggregated Visium expression combining TUBB2B and its predicted off-target gene’s expression TUBB2B and TUBB2A, and Visium expression of TUBB2B’s predicted off-target TUBB2A. Two density plots are shown: one comparing Xenium vs. Visium for TUBB2B alone, and one comparing Xenium vs. the aggregated Visium expression. The dotted line indicates the identity line (X = Y), and the solid line represents the line of best fit.

(A) Comparison of ACTG2 expression patterns on harmonized UMAP: Xenium expression, scRNA-seq expression, an aggregated scRNA-seq profile combining ACTG2 and its predicted off-target gene’s expression ACTB, POTEM, POTEE, POTEF, POTEI, POTEJ, and ACTA1, and scRNA-seq expression of ACTG2’s potential off-targets. Two scatterplots are shown: one comparing Xenium vs. scRNA-seq for ACTG2 cluster expression alone, and one comparing Xenium vs. the aggregated scRNA-seq cluster expression. The dotted line indicates the identity line (X = Y), and the solid line represents the line of best fit. (B) Comparison of TUBB2B expression patterns on harmonized UMAP: Xenium expression, scRNA-seq expression, an aggregated scRNA-seq profile combining TUBB2B and its predicted off-target gene’s expression TUBB2A, and scRNA-seq expression of TUBB2B’s potential off-target TUBB2A. Two scatterplots are shown: one comparing Xenium vs. scRNA-seq for TUBB2B cluster expression alone, and one comparing Xenium vs. the aggregated scRNA-seq cluster expression. The dotted line indicates the identity line (X = Y), and the solid line represents the line of best fit.

Heatmap visualization of target genes and their predicted off-target genes of the HuBMAP placenta custom probe panel using a corresponding placenta bulk RNA-seq dataset.

Gene expression values are counts-per-million (CPM) normalized, with a pseudocount added prior to log transformation. The first column shows the expression of each target gene in the placenta bulk RNA-seq dataset, while the remaining columns display the expression of the corresponding predicted off-target genes.

Heatmap visualizations of target genes and their predicted off-target genes of the HuBMAP multi custom probe panel using a corresponding (A) kidney, (B) lung, and (C) heart scRNA-seq datasets.

Gene expression values are counts-per-million (CPM) normalized, with a pseudocount added prior to log transformation. The first column shows the expression of each target gene in their respective scRNA-seq dataset, while the remaining columns display the expression of the corresponding predicted off-target genes.

Heatmap visualizations of target genes and their predicted off-target genes for the Janesick et al. probes using (A) pseudo-bulked scRNA-seq and (B) Visium data.

Gene expression values are counts-per-million (CPM) normalized, with a pseudocount added prior to log transformation. The first column shows the expression of each target gene in their respective scRNA-seq dataset, while the remaining columns display the expression of the corresponding predicted off-target genes.

(A) Gene expression patterns for ADH1B: Xenium expression, Visium expression, the aggregated Visium expression combining ADH1B and its predicted off-target gene’s expression ADH1A and ADH1C, and Visium expression of ADH1B’s predicted off-targets ADH1A and ADH1C. Two density plots are shown: one comparing Xenium vs. Visium for ADH1B alone, and one comparing Xenium vs. the aggregated Visium expression. The dotted line indicates the identity line (X = Y), and the solid line represents the line of best fit. (B) Comparison of ADH1B expression patterns on harmonized UMAP: Xenium expression, scRNA-seq expression, an aggregated scRNA-seq profile combining ADH1B and its predicted off-target gene’s expression ADH1A and ADH1C, and scRNA-seq expression of ADH1B’s potential off-targets ADH1A and ADH1C. Two scatterplots are shown: one comparing Xenium vs. scRNA-seq for ADH1B cluster expression alone, and one comparing Xenium vs. the aggregated scRNA-seq cluster expression. The dotted line indicates the identity line (X = Y), and the solid line represents the line of best fit.

(A) Spatial gene expression of HDC overlaid on the corresponding histological images for Xenium and Visium. (B) Harmonized UMAP visualization of HDC expression for Xenium and scRNA-seq data.

Screenshots from the Integrated Genome Viewer (IGV) illustrating annotation-dependent differences in probe alignment.

(A) A 40bp probe (ID: ENSG00000125878|TCF15|5d3cbc2) aligns to an exonic region in GENCODE v28 but to an upstream region in GENCODE v47. (B) A 40bp probe (ID: ENSG00000169083|AR|a0c6719) aligns to an exonic region in GENCODE v28 but to an intronic region in GENCODE v47. Matching probe shown in a zoomed-in view below. The forward- and reverse-strand sequences of the probe are shown, and the highlighted areas indicate approximately where the probe falls within the gene. Panels (A-B) share a common legend.

OPT output of genes with predicted off-target binding based on perfect sequence homology in RefSeq.

This table shows the 14 genes whose probes in the 10x Genomics Xenium v1 Human Breast Gene Expression Panel that exhibit predicted off-target probe binding, where each off-target alignment involves a perfect 40bp match to the probe sequence. The final column shows the gene types, in order, of each of the off-target genes shown in column 3. Off-target alignments between CCPG1 probes and DNAAF1-CCPG1 were excluded. Abbreviations: PC = protein-coding; PG = pseudogene; precursor_RNA = precursor RNA; misc_RNA = miscellaneous RNA; ncRNA = non-coding RNA.

OPT output of genes with predicted off-target binding based on perfect sequence homology in CHESS.

This table shows the 23 genes whose probes in the 10x Genomics Xenium v1 Human Breast Gene Expression Panel that exhibit predicted off-target probe binding, where each off-target alignment involves a perfect 40bp match to the probe sequence. The final column shows the gene types, in order, of each of the off-target genes shown in column 3. Off-target alignments between CCPG1 probes and DNAAF1-CCPG1 were excluded. Abbreviations: PC = protein-coding; PG = pseudogene; miRNA = microRNA.

The number of off-target probes and affected genes (from the set of 313 genes in the Xenium panel) found when looking for perfect matches between probe sequences and transcripts in four different reference annotations: GENCODE basic, GENCODE comprehensive, RefSeq, and CHESS.

Off-target alignments between CCPG1 probes and DNAAF1-CCPG1 were excluded.

Union set of protein-coding genes that OPT predicts to be affected by off-target binding, across three different reference annotations: GENCODE basic, RefSeq, and CHESS.

16 additional genes that were identified to exhibit potential off-target probe binding when allowing for a 10bp error margin on either side of the binding site when using GENCODE v47.

Abbreviations: PC = protein-coding; PG = pseudogene; lncRNA = long non-coding RNA.

9 genes that were previously predicted to be affected by off-target binding based on perfect matching now show additional predicted off-target interactions when a 10bp error margin is allowed on either side of the binding site when using GENCODE v47.

This effect is observed either through the accumulation of new probes with predicted off-target binding or via the identification of additional predicted off-target genes per probe. Abbreviations: PC = protein-coding; PG = pseudogene; lncRNA = long non-coding RNA.

OPT output for 49 genes with predicted off-target binding in the HuBMAP placenta custom probe panel, generated using GENCODE v47 and allowing a 10bp mismatch on either end of each probe.

The final column lists the gene types, in order, corresponding to the off-target genes shown in column 3. Abbreviations: PC = protein-coding; PG = pseudogene; precursor_RNA = precursor RNA; misc_RNA = miscellaneous RNA; ncRNA = non-coding RNA.

OPT output for 24 genes with predicted off-target binding in the HuBMAP multi custom probe panel, generated using GENCODE v47 and allowing a 10bp mismatch on either end of each probe.

The final column lists the gene types, in order, corresponding to the off-target genes shown in column 3. Abbreviations: PC = protein-coding; PG = pseudogene; precursor_RNA = precursor RNA; misc_RNA = miscellaneous RNA; ncRNA = non-coding RNA.