Application of ATAC-Seq for genome-wide analysis of the chromatin state at single myofiber resolution

  1. Korin Sahinyan
  2. Darren M Blackburn
  3. Marie-Michelle Simon
  4. Felicia Lazure
  5. Tony Kwan
  6. Guillaume Bourque
  7. Vahab D Soleimani (corresponding author)

  1. Department of Human Genetics, McGill University, Canada
  2. Lady Davis Institute for Medical Research, Jewish General Hospital, Canada
  3. McGill Genome Centre, Canada
  4. Canadian Centre for Computational Genomics, Canada
6 figures, 6 tables and 2 additional files

Figures

Figure 1 with 1 supplement
Schematic of ATAC-seq performed on a single myofiber.

Schematic of the steps and reactions involved in the preparation of sequencing ready libraries of single myofiber DNA for ATAC-Seq. Briefly, myofibers were isolated from the EDL muscle and an …

Figure 1—figure supplement 1
Quality control of ATAC-Seq libraries.

(A) Bioanalyzer profile of an ATAC-Seq library prepared from 5,000 MuSCs. (B) Example bioanalyzer profile of ATAC-Seq library prepared from a single myofiber. (C) Representative picture of a ready …

Figure 2 with 3 supplements
smfATAC-Seq can effectively identify the accessible regions on a single myofiber.

(A) Representative picture of an isolated WT C57BL/6 J uninjured myofiber stained with Hoechst showing the presence and location of myonuclei. Scale bar = 50 µm. (B) Representative picture of an …

Figure 2—figure supplement 1
IGV snapshots of non-myogenic genes.

(A) Platelet and Endothelial Cell Adhesion Molecule 1 (Pecam1) expressed in endothelial cells. (B) Resistin (Retn) as a marker of adipocytes. (C) CD45 expressed in hematopoietic cells. (D) CD90 (Thy1

Figure 2—figure supplement 2
Correlation analysis between biological replicates of each condition.

(A–C) IGV snapshots of genes expressed in myofibers and MuSCs for all the replicates of each condition that were pooled together for further analysis. DNase-Seq track added to demonstrate …

Figure 2—figure supplement 3
IGV snapshots of Myogenic Regulatory Factors (MRFs).

(A) Myogenic Factor 5 (Myf5). (B) MyoD. (C) Myogenin (Myog). (D) Myogenic factor 6 (Myf6). *ATAC-Seq was performed in biological replicates (n = 3 MuSCs, n = 3 injured myofibers, n = 2 uninjured …

Figure 3 with 3 supplements
Uninjured and injured myofibers and MuSCs display distinct chromatin states.

(A) Heatmap clustering of Pearson correlation coefficients showing the correlation between the replicates of the conditions in the regions defined by the union peakset (merged peaks of all …

Figure 3—figure supplement 1
Correlation analysis between uninjured and injured myofibers only.

(A) Heatmap clustering of Pearson correlation coefficients showing the correlation between the replicates of the injured and uninjured conditions in the regions defined by the union peakset (merged …

Figure 3—figure supplement 2
IGV snapshots of genes expressed in fast and slow muscle fiber types.

(A) Troponin I2 (Tnni2) expressed in fast skeletal muscle fibers. (B) Troponin T3 (Tnnt3) expressed in fast skeletal muscle fibers. (C) Troponin T1 (Tnnt1) expressed in slow skeletal muscle fibers. (D)…

Figure 3—figure supplement 3
Unique peaks between different conditions indicate a distinct chromatin state for each cell type.

(A) Peak score distribution (calculated by the MACS2 peak calling algorithm) for each of the different conditions. Peak score = -log10 (FDR). (B) Heatmap showing the read count ±500 bp of the center …
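The peak score defined in this caption can be illustrated with a short sketch. The function name `peak_score` and the example FDR values are hypothetical and are not taken from the paper; the snippet only shows the stated transformation, in which a smaller FDR yields a larger score.

```python
import math


def peak_score(fdr: float) -> float:
    """Return -log10(FDR), the peak score as defined in the caption."""
    if not 0.0 < fdr <= 1.0:
        raise ValueError("FDR must be in the interval (0, 1]")
    return -math.log10(fdr)


# Illustrative FDR values only (not data from the study).
for fdr in (0.05, 0.01, 1e-5):
    print(f"FDR = {fdr:g} -> peak score = {peak_score(fdr):.2f}")
```

On this scale, a peak called at FDR = 0.01 scores about 2, and each tenfold drop in FDR adds one unit to the score.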

Figure 4 with 2 supplements
Comparative analysis of chromatin state between uninjured myofibers and MuSCs and between uninjured myofibers and injured myofibers.

(A–C) Gene Ontology (GO Biological Process) analysis of genes associated with ATAC-Seq peaks based on association by proximity using Genomic Regions Enrichment of Annotations Tool (GREAT) (McLean et …

Figure 4—figure supplement 1
Gene Ontology analysis of unique and common peaks between conditions.

(A) Gene Ontology (GO Biological Process) analysis of genes associated with unique peaks present in the uninjured myofiber compared to MuSCs, based on the proximity of the peaks to the genes. (B) GO …

Figure 4—figure supplement 2
Top enriched motifs in the ATAC-Seq peaks of uninjured and injured myofibers.

(A) Top 10 significantly enriched motifs in the peaks that are common between uninjured and injured myofibers overlapping the promoters (±5 kb). (B) Top 10 significantly enriched motifs in the peaks …

Figure 5 with 1 supplement
Identification of cell type specific pathways by global analysis of chromatin accessibility.

(A) Gene Set Enrichment Analysis performed on genes nearest to the differentially accessible regions/peaks for uninjured myofibers compared to injured myofibers. Top 10 enriched pathways are shown …

Figure 5—figure supplement 1
Analysis of Notch and TGFβ signalling pathways reveals differential accessibility between MuSCs and uninjured myofibers, and between injured and uninjured myofibers.

(A) Heatmap showing genes involved in the Notch signalling pathway based on read counts of MuSCs, uninjured fibers and injured fibers, ±1 kb of the TSS of each gene in the pathway. (B) IGV snapshot …

Figure 6 with 5 supplements
Comparative analysis of chromatin state between MDX and WT myofibers.

(A) Heatmaps showing enrichment at transcription start site (TSS) for the ATAC-Seq libraries of MDX and WT myofibers respectively. (B) Peak annotation pie charts for ATAC-Seq peaks of MDX and WT …

Figure 6—figure supplement 1
Correlation analysis between biological replicates of mdx and WT myofiber ATAC-Seq samples.

(A–I) Scatter plot showing the Pearson correlation between the replicates. (J,K) IGV snapshots of muscle creatine kinase (Ckm) and housekeeping gene Rps2 for all the replicates of each condition …

Figure 6—figure supplement 2
IGV snapshots of myogenic and non-myogenic genes for the quality control of mdx and WT smfATAC-Seq.

(A–F) IGV snapshots of genes known to be expressed in muscle fibers, displaying accessibility at their respective TSSs. (A) The muscle creatine kinase (Ckm). (B) Actin alpha 1 (Acta1). (C) Part of the …

Figure 6—figure supplement 3
Gene Ontology analysis of total mdx and WT peaks.

(A) Gene Ontology (GO Biological Process) analysis of genes associated with all peaks present in the mdx myofiber, based on the proximity of the peaks to the genes. (B) GO term analysis of genes …

Figure 6—figure supplement 4
Correlation analysis between injured, mdx and WT myofibers.

(A) Heatmap clustering of Pearson correlation coefficients showing the correlation between the replicates of the conditions in the regions defined by the union peakset (merged peaks of all …

Figure 6—figure supplement 5
Top enriched motifs in the ATAC-Seq peaks of mdx and WT myofibers.

(A) Top 10 significantly enriched motifs in the peaks that are common between mdx and WT myofibers overlapping the promoters (±5 kb). (B) Top 10 significantly enriched motifs in the peaks that are …

Tables

Table 1
Sequencing read information for smfATAC-Seq and MuSCs ATAC-Seq libraries.
| Library | Number of raw reads | Number of surviving reads | Aligned filtered reads (mm10 reference) | Duplicate reads | Mitochondrial reads | Percentage of mitochondrial reads (%) | Final reads aligned | Number of peaks | Fraction in peaks (FrIP) |
|---|---|---|---|---|---|---|---|---|---|
| Muscle Stem Cells_1 | 175924734 | 113938436 | 103130186 | 47623836 | 529,967 | 0.51 | 54976383 | 65,568 | 0.3642 |
| Muscle Stem Cells_2 | 174965936 | 117357212 | 103570009 | 43672484 | 374,176 | 0.36 | 59523349 | 68,658 | 0.1971 |
| Muscle Stem Cells_3 | 131990380 | 91261584 | 79944121 | 31299456 | 223,540 | 0.28 | 48421125 | 69,573 | 0.1296 |
| Injured_1 | 229935426 | 117212678 | 90040002 | 81024926 | 830,215 | 0.92 | 8184861 | 32,853 | 0.2885 |
| Injured_2 | 194563870 | 129934972 | 98752157 | 88549329 | 1300615 | 1.32 | 8902213 | 28,351 | 0.2863 |
| Injured_3 | 142411536 | 62888552 | 52132455 | 42271079 | 868,808 | 1.67 | 8992568 | 25,002 | 0.2325 |
| Uninjured_1 | 145465410 | 75781456 | 61034569 | 52588315 | 1274332 | 2.09 | 7171922 | 12,276 | 0.2181 |
| Uninjured_2 | 151015852 | 64192706 | 50120282 | 45914841 | 965,037 | 1.93 | 3240404 | 14,742 | 0.3208 |
| MDX_1 | 107540762 | 50979732 | 40485803 | 36205908 | 802,561 | 1.98 | 3477334 | 40,833 | 0.7256 |
| MDX_2 | 103130726 | 54209722 | 46455472 | 37291531 | 1099747 | 2.37 | 8064194 | 39,254 | 0.4932 |
| MDX_3 | 108130662 | 48920904 | 40677359 | 34484003 | 1171316 | 2.88 | 5022040 | 35,691 | 0.5589 |
| WT_1 | 104219578 | 43914902 | 34162142 | 28600498 | 1651199 | 4.83 | 3910445 | 26,873 | 0.7283 |
| WT_2 | 110108692 | 37411936 | 31299222 | 25345321 | 1143317 | 3.65 | 4810584 | 28,430 | 0.64 |
| WT_3 | 183583506 | 72489354 | 56983637 | 49310923 | 1840265 | 3.23 | 5832449 | 39,178 | 0.7611 |
| WT_4 | 86533840 | 36708706 | 28714893 | 25157965 | 1404712 | 4.89 | 2152216 | 21,252 | 0.752 |
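The read-count columns of Table 1 appear to be related by simple arithmetic: in each row, the final aligned reads equal the aligned filtered reads minus the duplicate and mitochondrial reads, and the mitochondrial percentage is taken relative to the aligned filtered reads. This relationship is inferred from the numbers and is not stated in the table itself. The sketch below checks it for the Muscle Stem Cells_1 row; the function names are hypothetical.

```python
def final_reads(aligned: int, duplicates: int, mitochondrial: int) -> int:
    """Reads left after removing duplicate and mitochondrial reads."""
    return aligned - duplicates - mitochondrial


def percent_mitochondrial(mitochondrial: int, aligned: int) -> float:
    """Mitochondrial reads as a percentage of aligned filtered reads."""
    return 100.0 * mitochondrial / aligned


# Values copied from the Muscle Stem Cells_1 row of Table 1.
aligned, duplicates, mitochondrial = 103_130_186, 47_623_836, 529_967

print(final_reads(aligned, duplicates, mitochondrial))
print(round(percent_mitochondrial(mitochondrial, aligned), 2))
```

The same check can be repeated on any other row by substituting its three input values.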
Table 2
Percentage of ATAC-Seq peaks that overlap with the TSS±500 bp by at least 1 bp.
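| Gene set | Sample | Number of overlapping peaks | Total number of peaks | % overlapping peaks |
|---|---|---|---|---|
| Top 100 genes expressed in whole muscle but not in myofiber | Uninjured_Fiber | 12 | 19,704 | 0.0609013 |
| Top 100 genes expressed in whole muscle but not in myofiber | Injured_Fiber | 65 | 47,112 | 0.1379691 |
| Top 100 genes expressed in whole muscle but not in myofiber | EDL_Whole_Muscle | 198 | 60,719 | 0.3260923 |
| Top 50 genes expressed in whole muscle but not in myofiber | Uninjured_Fiber | 3 | 19,704 | 0.0152253 |
| Top 50 genes expressed in whole muscle but not in myofiber | Injured_Fiber | 12 | 47,112 | 0.0254712 |
| Top 50 genes expressed in whole muscle but not in myofiber | EDL_Whole_Muscle | 65 | 60,719 | 0.1070505 |
| All genes expressed in whole muscle tissue | Uninjured_Fiber | 7,865 | 19,704 | 39.915753 |
| All genes expressed in whole muscle tissue | Injured_Fiber | 14,259 | 47,112 | 30.266174 |
| All genes expressed in whole muscle tissue | EDL_Whole_Muscle | 18,419 | 60,719 | 30.334821 |
| All genes in the genome | Uninjured_Fiber | 12,995 | 19,704 | 65.951076 |
| All genes in the genome | Injured_Fiber | 26,198 | 47,112 | 55.607913 |
| All genes in the genome | EDL_Whole_Muscle | 33,048 | 60,719 | 54.427774 |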
  1. Genes identified as being expressed solely in whole muscle but not in myofibers were retrieved from “High-resolution genome-wide expression analysis of single myofibers using SMART-Seq, JBC, Blackburn et al., 2019” and were defined as any gene with an expression of at least 10 RPM in the whole muscle RNA-seq but 0 RPM in the single myofiber RNA-seq. All genes expressed in whole muscle tissue were defined as any gene with a value of at least 10 RPM in the whole muscle RNA-seq data of Blackburn et al., 2019, accessible through the GEO accession number GSE138591.
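The overlap criterion used in Table 2 (a peak counts if it shares at least 1 bp with a TSS ±500 bp window) can be sketched as follows. This is a minimal illustration on toy coordinates, not the authors' pipeline: the function names, the half-open interval convention and the example peaks are all assumptions.

```python
def shared_bp(a_start: int, a_end: int, b_start: int, b_end: int) -> int:
    """Number of bases shared by two half-open intervals [start, end)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start))


def percent_peaks_at_tss(peaks, tss_sites, flank=500, min_bp=1):
    """Percentage of peaks sharing at least min_bp with any TSS +/- flank window.

    peaks: list of (chrom, start, end); tss_sites: list of (chrom, position).
    """
    windows = [(chrom, max(0, pos - flank), pos + flank) for chrom, pos in tss_sites]
    hits = 0
    for chrom, start, end in peaks:
        if any(c == chrom and shared_bp(start, end, ws, we) >= min_bp
               for c, ws, we in windows):
            hits += 1
    return 100.0 * hits / len(peaks)


# Toy data (hypothetical coordinates, not from the study).
toy_peaks = [("chr1", 100, 400), ("chr1", 5000, 5300), ("chr2", 900, 1200)]
toy_tss = [("chr1", 600), ("chr2", 5000)]
print(f"{percent_peaks_at_tss(toy_peaks, toy_tss):.2f}% of peaks overlap a TSS window")
```

Each percentage in the table is the number of overlapping peaks divided by the total number of peaks for that sample, for example 100 × 12,995 / 19,704 for all genes in the uninjured fiber.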

Table 3
Percentage of overlapping peaks between smfATAC-Seq from uninjured myofibers and whole EDL muscle ATAC-Seq.
| | Percent overlap (%) |
|---|---|
| smfATAC-Seq peaks that overlap with EDL-ATAC-Seq by at least 1 bp | 65.9510759 |
| smfATAC-Seq peaks that overlap with EDL-ATAC-Seq by at least 20% | 61.4951279 |
| smfATAC-Seq peaks that overlap with EDL-ATAC-Seq by at least 40% | 52.6136825 |
| smfATAC-Seq peaks that overlap with EDL-ATAC-Seq by at least 60% | 42.1082014 |
| smfATAC-Seq peaks that overlap with EDL-ATAC-Seq by at least 90% | 24.3453106 |
  1. Whole EDL muscle ATAC-Seq was retrieved from “Dynamic enhancers control skeletal muscle identity and reprogramming, Ramachandran et al., 2019.” This data is accessible through the GEO accession number GSM3981673.
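The fractional thresholds in Table 3 can be illustrated with a small sketch. It assumes that the fraction is measured relative to the length of the smfATAC-Seq peak (comparable to a minimum-overlap fraction of the query interval in common interval tools); the table does not state which interval the percentage refers to. The function names and the single-chromosome toy intervals are hypothetical.

```python
def overlap_fraction(peak, other):
    """Fraction of `peak` (half-open start, end) covered by `other`."""
    start, end = peak
    o_start, o_end = other
    shared = max(0, min(end, o_end) - max(start, o_start))
    return shared / (end - start)


def percent_overlapping(peaks, reference, min_fraction):
    """Percentage of peaks covered by some reference peak to at least min_fraction."""
    hits = sum(
        1 for p in peaks
        if any(overlap_fraction(p, r) >= min_fraction for r in reference)
    )
    return 100.0 * hits / len(peaks)


# Toy intervals on one chromosome (hypothetical, not data from the study).
smf_peaks = [(0, 100), (200, 300), (400, 500)]
edl_peaks = [(50, 150), (210, 290), (600, 700)]

for threshold in (0.2, 0.6, 0.9):
    pct = percent_overlapping(smf_peaks, edl_peaks, threshold)
    print(f"at least {threshold:.0%} overlap: {pct:.1f}% of peaks")
```

As in the table, the percentage can only stay the same or fall as the required overlap fraction rises.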

Table 4
Percentage of total peaks found in each genomic feature.
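| Genomic feature | Muscle stem cells (%) | Injured myofiber (%) | Uninjured myofiber (%) | MDX myofiber (%) | WT myofiber (%) |
|---|---|---|---|---|---|
| Promoter (±1 kb TSS) | 20.66 | 31.61 | 56.54 | 35.58 | 35.15 |
| Promoter (±1 kb and/or ±2 kb TSS) | 4.81 | 4.84 | 3.45 | 3.78 | 4.53 |
| Promoter (±2 kb and/or ±3 kb TSS) | 4.37 | 3.92 | 3.01 | 4.14 | 4.30 |
| 5'UTR | 0.34 | 0.27 | 0.23 | 0.46 | 0.39 |
| 3'UTR | 2.50 | 1.82 | 1.15 | 2.86 | 2.58 |
| First Exon | 1.83 | 1.47 | 1.53 | 1.94 | 1.78 |
| Other Exon | 4.75 | 3.42 | 2.19 | 4.74 | 4.25 |
| First Intron | 11.85 | 10.87 | 7.35 | 10.93 | 10.56 |
| Other Intron | 20.80 | 18.84 | 10.35 | 18.81 | 18.30 |
| Downstream (≤ 300 kb) | 1.16 | 1.01 | 0.69 | 1.02 | 0.99 |
| Distal Intergenic | 26.95 | 21.93 | 13.51 | 15.74 | 17.15 |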
Table 5
Percentage of differential peaks in each genomic feature.
| Genomic feature | Uninjured myofiber vs MuSCs (%) | Uninjured vs injured myofiber (%) | WT vs MDX myofiber (%) |
|---|---|---|---|
| Promoter (±1 kb TSS) | 43.07 | 25 | 29.92 |
| Promoter (±1 kb and/or ±2 kb TSS) | 3.36 | 7.81 | 3.68 |
| Promoter (±2 kb and/or ±3 kb TSS) | 3.39 | 3.12 | 4.49 |
| 5'UTR | 0.37 | 0.78 | 0.46 |
| 3'UTR | 1.95 | 3.12 | 2.99 |
| First Exon | 2.29 | 3.91 | 1.84 |
| Other Exon | 3.85 | 7.81 | 4.49 |
| First Intron | 9.16 | 13.28 | 14.84 |
| Other Intron | 13.34 | 18.75 | 24.86 |
| Downstream (≤ 300 kb) | 0.83 | 0.78 | 0.12 |
| Distal Intergenic | 18.39 | 15.62 | 12.31 |
Key resources table
| Reagent type (species) or resource | Designation | Source or reference | Identifiers | Additional information |
|---|---|---|---|---|
| Genetic reagent (M. musculus) | C57BL/6 J | The Jackson Laboratory | Stock #: 000664 | |
| Genetic reagent (M. musculus) | C57BL/10ScSnJ | The Jackson Laboratory | Stock #: 000476 | |
| Genetic reagent (M. musculus) | C57BL/10ScSn-Dmdmdx/J | The Jackson Laboratory | Stock #: 001801 | |
| Genetic reagent (M. musculus) | Tg(Pax7-EGFE)#Tagb (Pax7-nGFP) | Sambasivan, R. et al. Distinct Regulatory Cascades Govern Extraocular and Pharyngeal Arch Muscle Progenitor Cell Fates. Developmental Cell, (2009). (Sambasivan et al., 2009) | PMID:19531352 | Dr. Shahragim Tajbakhsh (Institut Pasteur) |
| Commercial kit or assay | Tn5 transposase | Illumina | Cat #: 20034197 | |
| Commercial kit or assay | Nextera XT adaptors | Illumina | Cat #: FC-131–1001 | |
| Commercial kit or assay | QIAquick PCR purification kit | Qiagen | Cat #: 28104 | |
| Chemical compound, drug | Triton X-100 | Sigma-Aldrich | Cat #: T9284 | |
| Chemical compound, drug | Tween-20 | Sigma-Aldrich | Cat #: P1379-1L | |
| Chemical compound, drug | Digitonin | Promega | Cat #: G9441 | |
| Chemical compound, drug | Collagenase D | Roche | Cat #: 11088882001 | 2.4 U/mL |
| Chemical compound, drug | Collagenase | Sigma-Aldrich | Cat #: C0130 | 1000 U/mL |
| Chemical compound, drug | Dispase II | Roche | Cat #: 39307800 | 12 U/mL |
| Chemical compound, drug | Cardiotoxin | Sigma Aldrich | Cat #: 11061-96-4 | |
| Sequence-based reagent | MyoD_L | This paper | PCR primers | TGCTCCTTTGAGACAGCAGA |
| Sequence-based reagent | MyoD_R | This paper | PCR primers | AGTAGGGAAGTGTGCGTGCT |
| Other | Q5 High Fidelity DNA polymerase | New England Biolabs | Cat #: M0491S | For amplification of DNA post Tn5 tagmentation (see Library Preparation) |
| Chemical compound | DAPI stain | Invitrogen | Cat #: D3671 | (5 mg/mL) |
| Other | Ampure XP beads | Beckman | Cat #: A63880 | For library size selection at a concentration of 0.85 x (see Library Preparation) |
| Chemical compound | Hoechst | Molecular Probes | Cat #: H1399 | (5 mg/mL) |

Additional files

Transparent reporting form
https://cdn.elifesciences.org/articles/72792/elife-72792-transrepform1-v2.pdf
Source data 1

Quality control source data.

(A) Unlabeled agarose gel (1.25%) of MuSC ATAC-Seq sequence ready libraries. (B) Unlabeled agarose gel (1.25%) of uninjured myofiber ATAC-Seq sequence ready library. (C) Labeled agarose gel (1.25%) image of MuSC and uninjured myofiber ATAC-Seq sequence ready libraries. (D) Raw file of bioanalyzer results from single myofiber sequence ready ATAC-Seq libraries.

https://cdn.elifesciences.org/articles/72792/elife-72792-data1-v2.zip
