1. Developmental Biology
  2. Genetics and Genomics
Download icon

Mapping the mouse Allelome reveals tissue-specific regulation of allelic expression

  1. Daniel Andergassen
  2. Christoph P Dotter
  3. Daniel Wenzel
  4. Verena Sigl
  5. Philipp C Bammer
  6. Markus Muckenhuber
  7. Daniela Mayer
  8. Tomasz M Kulinski
  9. Hans-Christian Theussl
  10. Josef M Penninger
  11. Christoph Bock
  12. Denise P Barlow  Is a corresponding author
  13. Florian M Pauler  Is a corresponding author
  14. Quanah J Hudson  Is a corresponding author
  1. Research Center for Molecular Medicine of the Austrian Academy of Sciences, Austria
  2. Institute of Molecular Biotechnology of the Austrian Academy of Sciences, Austria
  3. Institute of Molecular Pathology, Austria
Tools and Resources
Cite this article as: eLife 2017;6:e25125 doi: 10.7554/eLife.25125
6 figures, 1 table and 1 additional file

Figures

Figure 1 with 3 supplements
Defining the mouse Allelome.

(A) Strategy for detecting allelic expression from RNA-seq data from 23 mouse tissues and developmental stages using Allelome.PRO. Every gene in the annotation is classified into one of seven different allelic expression categories indicated by different colors in the key and explained in the text. These colors are used in figures throughout the manuscript. The sex of the tissues is indicated by XX (female) and XY (male). Individuals were used except for indicated embryonic tissues where an entire litter was pooled (XX/XY). (B) Allelome.PRO classification of the allelic expression status of protein-coding genes in each tissue. (C) Allelome.PRO classification of non-coding genes. Tissues examined were placenta (Pl embryonic day (E) 12.5, E16.5), visceral yolk sac endoderm (VE E9.5, E12.5, E16.5), embryonic stem cells (ESC), mouse embryonic fibroblasts (MEF E12.5), embryonic liver (Li E12.5, E16.5), embryonic heart (He E16.5), embryonic and neonatal brain (Br E16.5, 3 days postnatal (dpn)), neonatal tongue (To 3dpn), adult brain (aBr), adult lactating female brain (lfBr), adult virgin mammary glands (vMG), adult lactating female mammary glands (lfMG), adult lung (aLu), adult leg muscle (aLM), adult heart (aHe), adult thymus (aTh), adult liver (aLi) and adult spleen (aSp). Embryo and placenta diagrams adapted from Hudson et al. (2011). Allelome.PRO settings: FDR 1%, allelic-ratio cutoff 0.7, minread 2.

https://doi.org/10.7554/eLife.25125.002
Figure 1—figure supplement 1
Clustering of tissues by their RNA-seq expression data matches the expected developmental relationships.

Cluster analysis confirms identity and similarity of sequenced tissues and replicates. The Heatmap shows unsupervised clustering of a Spearman correlation matrix from log-transformed gene expression data across 92 samples (23 tissues x four replicates, Refseq genes). In an exception, clustering did not distinguish virgin and lactating mammary glands (*).

https://doi.org/10.7554/eLife.25125.003
Figure 1—figure supplement 2
The Allelome.PRO pipeline output and quality controls.

(A) Allelome.PRO strategy to classify monoallelic (strain biased or imprinted) and biallelic (BAE) expression (FDR 1%, allelic ratio cutoff 0.7, minread 2). The three different categories of biallelic expression are shown. For more details see the Materials and methods and the Allelome.PRO methods paper and manual (Andergassen et al., 2015). (B) Left: the percentage of each biallelic category in each tissue. Right: histograms showing the allelic ratio distribution for each of the three biallelic categories. (C) Allelic ratios of genes show a similar distribution between tissues (outliers not shown in boxplots). (D) X-linked genes align to the correct allele in XY males, indicating that detection of X-inactivation escapers is not due to an alignment artifact.

https://doi.org/10.7554/eLife.25125.004
Figure 1—figure supplement 3
Novel non-coding RNAs show a similar non-coding potential to annotated Refseq non-coding RNAs.

Novel annotated loci outside of RefSeq are non-protein coding. The boxplots show the estimated RNA-code (top) (Washietl et al., 2011) and Coding Potential Calculator (CPC) scores (bottom) (Kong et al., 2007) for mRNAs (RefSeq NM), non-coding RNAs (RefSeq NR) and loci annotated in this study (XLOC, outliers not shown). p-Values were calculated using a t-test. For more details see Materials and methods.

https://doi.org/10.7554/eLife.25125.005
Figure 2 with 1 supplement
The Allelome reveals tissue-specific expression of strain-biased genes.

(A) The percentage of strain-biased genes from total informative genes for each tissue for protein-coding (pc, black) and non-coding (nc, grey) genes. (B) The percentage of tissues where pc- and nc- genes maintained their biallic (left) or strain-biased (right), allelic expression status (calculated relative to number of tissues where a gene was informative). Allelome.PRO settings: FDR 1%, allelic-ratio cutoff 0.7, minread 2, dotted lines indicates the outcome with a 0.8 allelic ratio cutoff. (C) The enrichment of H3K27ac ±50 kb from the transcription start site (TSS) of genes that show strain-biased expression in either E12.5 VE or Li, and biallelic expression in the other tissue. Top: H3K27ac enrichment near strain-biased genes. The enrichment over random of allelic H3K27ac 4 kb windows was calculated. Bottom: The same analysis for the same set of genes where they show biallelic expression. Analysis detailed in Materials and methods. (D) An example of putative enhancer switching: Glrx switches from FVB strain-biased expression in liver to BAE expression in VE. This is associated with a switch in putative enhancers that matches the allelic expression status. Allelome.PRO settings for H3K27ac: FDR 1%, allelic-ratio cutoff 0.7, minread 1. (E) Strain-biased genes with biallelic H3K4me3 on their promoter are enriched for genes that show biallelic expression in their introns and strain-biased expression in their exons, indicating post-transcriptional differences in stability may explain strain-biased expression. Scatter plots comparing the allelic ratio of strain-biased genes in their exons and introns. Top: Strain-biased genes with biallelic H3K4me3 ChIP-seq enrichment on their promoter. Bottom: Strain-biased genes with supporting monoallelic enrichment of H3K4me3 on their promoter (key indicates H3K4me3 color code and tissue). Color code as in Figure 1.

https://doi.org/10.7554/eLife.25125.006
Figure 2—figure supplement 1
Characterization of the mouse Allelome reveals that protein-coding genes switch their allelic status among tissue and development.

(A) Distribution of allelic ratios in mouse tissues. A scatterplot of the median allelic ratios of autosomes in the FVBxCAST (x2) versus CASTxFVB (x2) crosses determined by Allelome.PRO analysis of RNA-seq data from the 23 tissues examined (abbreviations defined in Figure 1). Genes are color-coded according to their allelic bias classification as indicated in the key. (B) Most biallelic pc-genes (top) showed biallelic expression in the majority of tissues, while nc-genes (bottom) showed biallelic expression mainly in a single or few tissues. (C) Tissue clustering by allelic ratio reflects developmental relationships. Unsupervised clustered was performed using the median allelic ratios of the 765 strain-biased genes that were informative in all 23 tissues (Hierarchical Clustering Method (hClust) = ‘complete’ in R). Genes were color coded according to the allelic ratio using a gradient between 0 (CAST brown), 0.5 (BAE green) and 1 (FVB turquoise) (see key). The SNP coverage per gene is shown in the box plot on the right. (D) Biallelic enrichment of H3K27ac is frequent in the genome. Strain biased H3K27ac 4 kb enrichment windows occur at between 1–5% the frequency of biallelic enrichment windows.

https://doi.org/10.7554/eLife.25125.007
Figure 3 with 2 supplements
X chromosome inactivation (XCI) escapers appear to be highly tissue-specific.

(A) Circos plot showing the mouse chromosome X Allelome for 19 female tissues (Figure 1). Outer layers: 16 embryonic, neonatal and adult tissues showing random XCI (FVB X preferentially inactivated in CAST/FVB F1 tissues (skewed XCI)). Middle layer: Giemsa banding (UCSC genome browser). Inside layers: three extra-embryonic tissues showing imprinted XCI (paternal X chromosome inactivated). The top 25/225 escapers from random XCI are indicated on the outside of the Circos plot, while the top 20/43 escapers from imprinted XCI are indicated on the inside (escapers ranked by number of tissues). An asterisk marks known escapers. Dystrophin (Dmd) escapes specifically in muscle (aLM and To 3dpn, indicated by arrow). Color code as in Figure 1 except non-escapers are partially transparent (20% opacity, CAST in embryonic and adult tissues, MAT in extra-embryonic tissues). Allelome.PRO settings: FDR 1%, allelic-ratio cutoff 0.6, minread 2. (B) Top: the percentage of pc-genes (black) escaping XCI from all informative pc for each female tissue. The number of informative pc-genes is given above the barplot, while the number of pc escapers is given above each bar. Middle: the same analysis performed for nc-genes (grey). Bottom: the allelic ratio of Xist for each replicate in extra-embryonic tissues (Xist expressed paternally (blue)) and in non extra-embryonic tissues (Xist preferentially expressed from the FVB allele (turquoise)). (C) Leg muscle XCI escapers are expressed at a higher level than non-escapers. Protein coding X-chromosome escaper genes are expressed significantly higher than non-escapers on the X (t-test). Box plots indicate the expression levels of genes in the different categories (outliers not shown). (D) The distance of parental-specific H3K27ac window to the closest non-escaper (331) and escaper (36) gene in placenta E12.5. Maternal H3K27ac windows with a distance higher than 500 kb were not included in the analysis. A boxplot including median values is shown (outliers not shown). After correcting for sample size, a significant difference was observed between escapers and non-escapers (Fisher’s exact test, p<1×10−17, details in Materials and methods).

https://doi.org/10.7554/eLife.25125.008
Figure 3—figure supplement 1
The distribution of X chromosome inactivation escapers across tissues.

(A) X chromosome inactivation (XCI) escapers in 19 mouse tissues. A scatterplot of allelic ratios of X-linked genes in the FVBxCAST versus CASTxFVB crosses determined by Allelome.PRO analysis of RNA-seq data from the 19 XX tissues examined (abbreviations defined in Figure 1). Escapers, non-escapers, and genes excluded from the analysis (see Materials and methods) are indicated in the key. Xist is indicated in each scatter plot. (B) XCI escapers are detected over a wide range of expression levels. First row: number of informative X-linked genes in each of 5 expression bins for each female tissue. Second row: the number of XCI escaper numbers for each expression bin for each tissue. Third row: percentage of informative X-linked genes that escape XCI in each expression bin for each tissue. Fourth row: Xist expression level does not correlate with the number XCI escapers. Shown are the Xist expression levels for each tissue. Right: Genes with low expression levels are not more likely to be classified as XCI escapers. Box plots display the percentage of informative genes that escape X-inactivation in each expression bin across the 19 female tissues assayed in this study.

https://doi.org/10.7554/eLife.25125.009
Figure 3—figure supplement 2
Validation of X chromosome inactivation escapers in an independent dataset.

(A) Genes escaping X-inactivation confirmed in an independent dataset. Scatter plots of the median allelic ratio from the forward and reverse crosses (two replicates) of X-linked genes classified as escapers, non-escapers, or excluded from the analysis (as described in methods, color code in key). Left: adult liver (aLi) and adult leg muscle (aLM) from this study using FVBxCAST reciprocal crosses (FVBxCAST SNPs from Sanger). Right: re-analyzed adult liver and adult leg muscle data from the Gregg lab (Bonthuis et al., 2015) using C57BL/6xCAST reciprocal crosses (read number adjusted to match this study, C57BL/6xCAST SNPs from Sanger). Escapers detected in both datasets are indicated in yellow (see key). (B) Novel X-chromosome escapers show a lower SNP coverage than known escapers. Boxplots show the SNP coverage of known and novel escapers in adult brain (aBr), spleen (aSp) and placenta (Pl) (Berletch et al., 2015; Finn et al., 2014). Outliers are not shown. Red line indicates a SNP coverage of 50.

https://doi.org/10.7554/eLife.25125.010
Figure 4 with 1 supplement
The Allelome reveals tissue-specific regulation of imprinted protein-coding genes.

(A) The number of known and candidate imprinted genes detected in this study by RNA-seq before validation. Known genes were RefSeq genes listed by the Otago or Harwell imprinted databases on 24th Sept 2015 (Glaser et al., 2006; Williamson et al., 2013). (B) The number of known and candidate novel imprinted genes found among different tissues and developmental stages. Tissues important for the energy transfer from the mother to the offspring are indicated (§). (C) The heatmap shows the allelic pattern for all 126 known imprinted genes among the different tissues. Gene names (left side) colored black are confirmed in this study, while gene names colored grey could not be confirmed. ! Indicates disputed genes. ° Probable RNA-seq strand bleed through. Examples given in D for variable (‘a’, in ≤70% expressing tissues) and consistent (‘b’ in >70% expressing tissues) imprinted expression are indicated. The chromosome number and the base pair coordinates (in Mb) for each gene are indicated on the right side. The imprinted allelic ratio for each gene and tissue (white) is given only if all four replicates show a bias in the same direction (1 = 100% expression from the maternal allele, 0 = 100% expression from the paternal allele). The sex for each tissue is indicated on the bottom of the heatmap: F (female, XX), M (male, XY), P (pooled XX/XY). Note: allelic analysis of the X chromosome can only be done for female tissues. (D) The percentage of tissues that maintain imprinted expression of protein-coding genes (top) and nc-genes (bottom) (calculated as the number of tissues showing imprinted expression, divided by the number of informative tissues for each gene). The analysis was done for the 69 known imprinted genes confirmed by RNA-seq in this study (Peg3os and Hymai were excluded due to probable RNA-seq bleed-through). Dotted boxes indicate genes that show variable (‘a’, in ≤70% expressing tissues) and consistent (‘b’ in >70% expressing tissues) imprinted expression. Examples are positioned according to the percentage of expressing tissues where they show imprinted expression. Color key as in Figure 1 except novel maternal and paternal imprinted candidates are pale red and blue, respectively (Allelome.PRO settings: FDR 1%, allelic-ratio cutoff 0.7, minread 2).

https://doi.org/10.7554/eLife.25125.011
Figure 4—figure supplement 1
The allelic categorization of candidate novel imprinted genes across tissues.

Novel imprinted genes identified in this study. The heatmap shows the allelic pattern among the different tissues for the 76 novel imprinted genes detected by RNA-seq. Bold gene names (left side) indicate non-coding genes. (*) Indicates maternally expressed genes restricted to placenta. The chromosome number and the base pair coordinates (in Mb) are indicated on the right side. The imprinted allelic ratio for each gene and tissue (white) is given only if the allelic direction agrees for all four replicates (1 = 100% expression from the maternal allele, 0 = 100% expression from the paternal allele). The sex for each tissue is indicated on the bottom of the heatmap: F (female), M (male), P (pooled). Note allelic analysis of the X chromosome can only be done in females.

https://doi.org/10.7554/eLife.25125.012
Figure 5 with 1 supplement
Novel validated imprinted genes belong to known clusters.

(A) A novel X-linked imprinted nc-gene is transcribed anti-sense to Firre lncRNA. Maternal imprinted expression of CrossFirre (Gm34612) detected from RNA-seq in adult brain (top) was validated by maternal H3K4me3 enrichment in MEFs (middle). The gene body of CrossFirre is enriched for LINE repetitive elements (bottom). Highlighted in grey are H3K4me3 peaks over the CrossFirre promoter (MAT), the canonical Firre promoter (CAST), and multiple peaks in the Firre gene body (FVB) (UCSC genome browser screenshot). (B) Slc38a4 forms a cluster with a novel imprinted lncRNA. The Slc38a4 promoter is associated with maternal DNA methylation of the gametic differentially methylated region (gDMR, red square). In E12.5 VE, biallelic expression of Slc38a4 is associated with biallelic H3K4me3 enrichment over an alternative TSS (highlighted by grey bar). Paternal expression of the novel upstream imprinted lncRNA XLOC_032279 was validated by paternal H3K4me3 enrichment (UCSC genome browser screenshot). (C) The allelic ratio in embryo-transferred placentas distinguishes maternal imprinted expression from maternal contamination. CAST (F) x FVB (M) blastocysts were transferred into pseudo-pregnant FVB females, and placentas collected at E12.5 and subject to ribosome RNA depleted RNA-seq. Genes showing maternal imprinted expression show a bias toward the maternal CAST allele. Genes expressed in maternal blood or decidua (maternal contamination) show a bias toward the FVB allele of the host mother. Novel imprinted genes that showed imprinted expression only in placenta or visceral yolk sac endoderm are displayed. Dotted line indicates 0.7 allelic ratio cutoff, (*) indicates genes with too low SNP coverage to be informative in this analysis. (D) The Airn promoter deletion (R2∆) demonstrates that genes over a 10 Mb region are subject to imprinted silencing by Airn. First row: Known and novel imprinted genes detected by Allelome.PRO analysis of RNA-seq from E12.5 placenta in 10 Mb region surrounding the known Igf2r cluster (UCSC genome browser screenshot). Second row: The SNP coverage (reads over SNPs) of imprinted genes and selected biallelic controls between Arid1b and Thbs2 on chromosome 17 for CASTxFVB and CASTxFVB(R2Δ) E12.5 placentas. Third row: Differential expression analysis calculated using Cuffdiff (version 2.2.1) ** adjusted p-value<0.01* adjusted p-value<0.05, ns non-significant. Fourth row: The allelic ratio (median and standard deviation) for the same genes calculated using the Allelome.PRO pipeline (0.5 = 100% maternal and −0.5 = 100% paternal expression). (E) A summary of known and candidate imprinted genes confirmed in this study. After validation, imprinted genes from the literature were confirmed or not (RefSeq genes listed by the Otago or Harwell imprinted databases on 24th Sept 2015 [Glaser et al., 2006; Williamson et al., 2013]). * 70 confirmed known genes include Meg3, Rian and Mirg, which our data indicates is a single imprinted transcript (Figure 5—figure supplement 1B). Novel imprinted candidates were classified as ‘confirmed’ (validated or supported), ‘fragments’ (supported nc-genes near known imprinted genes without evidence they are independent transcripts), ‘candidates’ (detected in one tissue by RNA-seq without further supporting evidence), and ‘maternal contamination’ (bias toward host mother allele in embryo transfer and/or expression in maternal decidua and blood). The 23 confirmed novel imprinted genes are shown in Table 1. Classification of imprinted gene candidates following validation is further explained in the text.

https://doi.org/10.7554/eLife.25125.013
Figure 5—figure supplement 1
Validation of novel imprinted genes.

(A) The lncRNA Firre locus contains a full-length CAST biased transcript sense overlapped by multiple shorter FVB biased transcripts. The Firre locus in MEFs has a CAST biased H3K4me3 ChIP-seq peak at its canonical 5´end, and multiple internal FVB biased H3K4me3 peaks. 10/13 of the internal FVB H3K4me3 peaks overlap with the previously reported RRD repeats (Hacisuleyman et al., 2014). The Firre locus contains multiple cap analysis gene expression (CAGE) profiles from the FANTOM5 consortium, indicating transcription start sites (TSS). Cufflinks assembly of the MEF RNA-seq data finds transcripts with the canonical TSS plus alternative transcripts indicating six alternative TSS, all of which overlap FVB biased H3K4me3 peaks (see Materials and methods for details of alignment and Cufflinks assembly). (B) Known imprinted lncRNAs Meg3, Rian, and Mirg may form single maternally imprinted transcript in MEFs. Maternal enrichment of H3K4me3 is found over the Meg3 TSS, but not the annotated Rian and Mirg TSS. Cufflinks assembly from RNA-seq data indicates that transcripts can extend over the entire Meg3/Rian/Mirg locus. (C) Maternal tissue in placental samples may result in false-positive maternal imprinted expression. The mean decidua/placenta RPKM ratio (3x E12.5 decidua and placenta) for extra-embryonic specific candidate maternal imprinted genes (19 genes) presented on a log(2) scale. The error bar represents the standard deviation between replicates. (D) Embryo transfer confirms maternal imprinted expression of known imprinted genes detected in placenta. Allelic ratio from E12.5 placentas from CAST (F) x FVB (M) embryos transferred into FVB host mother shown. (E) Novel placental imprinted genes are detected in BALBc x CAST reciprocal crosses. The imprinted ratio of novel placenta imprinted genes detected from ribosome RNA depleted RNA-seq from E12.5 BALBc/CAST placentas is shown. Dotted line indicates 0.7 allelic ratio cutoff.

https://doi.org/10.7554/eLife.25125.014
Figure 6 with 1 supplement
Tissue and developmental-specific expansion and contraction of imprinted clusters correlates with parental-specific histone modification.

(A) A summary of imprinted genes detected in this study. Mouse chromosomes with the positions of known (left side of the chromosome) and novel supported or validated (right side of the chromosome) imprinted pc (−) and nc (~) genes. Candidate imprinted genes that are not supported or validated are indicated in grey. Imprint control elements (ICE), known and candidate gDMRs are indicated (Proudhon et al., 2012; Xie et al., 2012). * Indicates maternally expressed genes restricted to placenta. The base pair coordinates (Mb) are indicated on the left side. Underlined are new imprinted clusters. Dashed boxes indicate the Crossfirre locus shown in Figure 5A, the Slc38a4 cluster shown in Figure 5B, and the Igf2r cluster shown in Figure 5D. Color code as in Figure 1. For more details see Materials and methods and Supplementary file 1, sheets G-J. (B) The Igf2r and Kcnq1 cluster size during development and between tissues (tissue abbreviations as in Figure 1). The number of imprinted genes for each developmental stage/tissue is indicated at the top of the bar. (C) Top: Allelic H3K27ac enrichment (4 kb sliding windows) over the expanded Igf2r cluster for E12.5 Liver, VE and placenta (UCSC genome browser screenshot). Numbers indicate tissue where a gene shows imprinted expression. Bottom: The number of parental-specific H3K27ac 4 kb sliding windows within non-overlapping 100 kb count windows for E12.5 VE and placenta (Pl). Counts over the background cutoff are shown (defined as the maximum count detected outside of imprinted regions for each tissue). For more details see Materials and methods. NA = not available (no parental-specific windows available for analysis).

https://doi.org/10.7554/eLife.25125.016
Figure 6—figure supplement 1
Expansion and contraction of imprinted clusters during development.

The cluster size in Mb of confirmed and novel validated/supported imprinted regions was plotted. Underlined names indicate novel imprinted regions.

https://doi.org/10.7554/eLife.25125.017

Tables

Table 1

Novel imprinted genes. The list of novel imprinted genes detected in this study and confirmed by extra evidence (more detail is given in Supplementary file 1, sheet I). Imprinted genes were classified as novel if they were not listed in the Otago or Harwell imprinted gene databases (downloaded 24th Sept 2015, [Glaser et al., 2006; Williamson et al., 2013]). Previous reports not in these databases are shown.

https://doi.org/10.7554/eLife.25125.015
NameChrM/PValidation evidencePrevious reports
XLOC_0478442PATChIP, BALBc, multiple tissues, proximity
Mafb2MATProximity
XLOC_0507392PATChIP, BALBc, multiple tissues, proximity
2400006E01Rik3PATBALBc, proximity
Fkbp65PATChIP(Strogantsev et al., 2015)
XLOC_0759917PATsDMR, multiple tissues
XLOC_0761437PATChIP, BALBc, multiple tissues, proximity
BC02040210PATProximity
XLOC_01103911MATChIP, BALBc, multiple tissues, proximity
Platr2011PATChIP(Babak et al., 2015)
XLOC_01162911PATChIP, multiple tissues
Smoc112PATChIP, BALBc
XLOC_03227915PATChIP, BALBc, multiple tissues, proximity
Galnt615PATMultiple tissues, proximity
XLOC_03741417MATChIP, embryo transfer, proximity
Arid1b17MATGenetic, ChIP, proximity
Qk17MATProximity(Calabrese et al., 2015)
Park217MATGenetic, embryo transfer, BALBc, multiple tissues, proximity(Calabrese et al., 2015)
XLOC_03761517MATProximity
Dact217MATGenetic, ChIP, BALBc, proximity(Babak et al., 2015; Calabrese et al., 2015)
Smoc217MATGenetic, ChiP, embryo transfer, BALBc, proximity
Thbs217MATGenetic, embryo transfer, BALBc, multiple tissues, proximity
CrossFirre (Gm35612)XMATChIP
  1. Key: Genetic: validation of novel candidates in proximity by genetic deletion of the lncRNA Airn; ChIP: validation by allele-specific H3K4me3 ChIP-seq; sDMR: Transcription start side overlaps sDMRs mapped in Brain (Xie et al., 2012); BALBc: validation by RNA-seq in CastxBALBc background; Embryo transfer: Embryo transfer supports maternal expression in placenta; Multiple tissues: Imprinted expression detected in multiple tissues and developmental stages; Proximity: distance between novel candidate and gDMR or known imprinted region < 7 Mb

Additional files

Supplementary file 1

Validation of X chromosome inactivation (XCI) escapers and imprinted expression.

(A) Samples analyzed in this study. (B) Equalizing the input reads between samples. Input reads were adjusted to obtain approximately 15Mio uniquely aligned fragments for RNA-seq (30Mio single reads) and approximately 20Mio uniquely aligned reads for H3K27ac ChIP-seq per replicate. (C) Significant enrichment of allelic H3K27ac based on 100x random permutations of the allelic tags (D) X chromosome inactivation (XCI) escaper candidates from epiblast-derived tissues (embryonic and adult tissues that show skewed XCI in CAST/FVB). (E) XCI escaper candidates from extra-embryonic tissues that show imprinted XCI. (F) XCI escaper validation. (G) Validation of imprinted expression detected by RNA-seq using H3K4me3 ChIP-seq differential enrichment on the promoter (4 kb window). (H) Validation of imprinted expression. (I) Confirmed and novel validated independent imprinted genes. (J) Summary of novel imprinted gene candidates. (K) Tissue and developmental specific expansion and contraction of imprinted clusters.

https://doi.org/10.7554/eLife.25125.018

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Download citations (links to download the citations from this article in formats compatible with various reference manager tools)

Open citations (links to open the citations from this article in various online reference manager services)