Research Article

Genetics and Genomics

Loss of centromere function drives karyotype evolution in closely related Malassezia species

Molecular Mycology Laboratory, Molecular Biology and Genetics Unit, Jawaharlal Nehru Centre for Advanced Scientific Research, India
Department of Molecular Genetics and Microbiology, Duke University Medical Center, United States
The Institute of Mathematical Sciences/HBNI, India
National Genomics Infrastructure, Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Sweden
Skin Research Institute Singapore, Agency for Science, Technology and Research (A*STAR), Singapore
Department of Drug Discovery, Medical University of South Carolina, School of Pharmacy, United States

Jan 20, 2020

https://doi.org/10.7554/eLife.53944

Open access
Copyright information

Figures
Tables
Additional files

8 figures, 5 tables and 2 additional files

Figures

Figure 1 with 1 supplement

Download asset Open asset

Genome assembly and karyotype diversity in representative *Malassezia* species.

The genomes of (A) *M. globosa*, (B) *M. slooffiae*, and (C) *M. furfur* were sequenced and assembled in this study, whereas the genome assembly of (D) *M. sympodialis* was reported earlier (Zhu et al., 2017) and is shown for comparison. In each panel, bar plots represent the assembled chromosomes of the indicated *Malassezia* species, with the telomeres and the ribosomal DNA (rDNA) marked as dark gray and blue bars, respectively. Telomere-repeat motifs are shown at the 5′- and 3′-ends of the Chr1 in each species. Electrophoretic karyotypes of *M. globosa* (Mg) and *M. sloffiiae* (*Msl*), are shown in (A) and (B), respectively, with chromosome sizes estimated from the genome assembly. Chromosomes of *Saccharomyces cerevisiae* (Sc) served as size markers. The chromosome containing the rDNA (marked with an ‘r’), in *M. globosa,* co-migrates with Chr3. This was assessed by chromoblot hybridization using unique sequences from Chr3, Chr4, Chr5, and Chr6 as probes (regions indicated by red arrowheads). Chromosomes of similar size (denoted in red) migrate together in the gel and appear as a doublet band (i.e. MgChr3–MgChr5, MslChr2–MslChr3, and MslChr5–MslChr6).

Figure 1—figure supplement 1

Download asset Open asset

Statistics of the genome assemblies of *M. globosa*, *M. slooffiae,* and *M. furfur* generated in this study.

Figure 1—figure supplement 1—source data 1 Statistics of the genome assemblies of M. globosa, M. slooffiae, and M. furfur generated in this study.: https://cdn.elifesciences.org/articles/53944/elife-53944-fig1-figsupp1-data1-v2.docx
Download elife-53944-fig1-figsupp1-data1-v2.docx

Figure 2 with 1 supplement

Download asset Open asset

Kinetochores cluster and localize at the nuclear periphery in *M. sympodialis*.

(A) Schematic of the kinetochore organization of *M. sympodialis*. Gray boxes indicate proteins absent in *M. sympodialis*. The outer kinetochore protein Mtw1 (a component of Mis12C) served as the kinetochore marker in the present study. (B) Line diagram representation of Mtw1 tagged with GFP at the N- terminus. Immunoblot analysis of whole-cell lysates prepared from the untagged *M. sympodialis* strain (ATCC42132) and from GFP-Mtw1 expressing cells (MSY001) probed with anti-GFP antibodies and anti-PSTAIRE antibodies. PSTAIRE served as a loading control. Relative intensity values normalized to PSTAIRE are indicated below each lane. (C) Logarithmically grown MSY001 cells expressing GFP-Mtw1 were fixed and stained with DAPI (blue) and anti-GFP antibodies (pseudo-colored in red). Scale bar, 2.5 µm. (D) Cell cycle stage-specific localization dynamics of GFP-Mtw1. Scale bar, 2.5 µm.

Figure 2—figure supplement 1

Download asset Open asset

Identification of kinetochore proteins in *M. sympodialis* by BLAST.

Putative homologs of the indicated proteins in *M. sympodialis* were identified by BLAST analysis using corresponding protein sequences of *C. neoformans* as the query. An asterisk (*) indicates cases where corresponding protein sequences from *Ustilago maydis* were used as query. To detect proteins of the CBF3 complex, protein sequences of Ndc10, Cep3, and Ctf13 from *S. cerevisiae* were used as query. ‘n.d.’ indicates homolog not detected.

Figure 2—figure supplement 1—source data 1 Identification of kinetochore proteins in M. sympodialisby BLAST. Putative homologs of the indicated proteins in M. sympodialis were identified by BLAST analysis using corresponding protein sequences of C. neoformans as the query. Asterisk (*) indicates cases where corresponding protein sequences from Ustilago maydis were used as query. To detect proteins of the CBF3 complex, protein sequences of Ndc10, Cep3, and Ctf13 from S. cerevisiae were used as query. ‘n.d.’ indicates homolog not detected.: https://cdn.elifesciences.org/articles/53944/elife-53944-fig2-figsupp1-data1-v2.docx
Download elife-53944-fig2-figsupp1-data1-v2.docx

Figure 3 with 3 supplements

Download asset Open asset

Localization of Mtw1 to single-peaks identifies centromeres on each of the eight chromosomes of *M. sympodialis.*

(A) GFP-Mtw1 ChIP-seq reads were mapped to each chromosome. The x-axis indicates chromosomal coordinates (in kb) and the y-axis indicates read depth. ‘Input’, reads from total DNA; ‘IP,’ reads from immunoprecipitated sample; *CEN,* Mtw1-enriched regions derived by subtracting input reads from those of the IP sample (peak values 0–2500). Additional peaks observed in both IP and input tracks on Chr5 are from the rDNA locus. (B) A 13 kb-window of the Mtw1 enrichment profile (*CEN*, represented as peaks and heat-map in two different tracks, red) plotted along with the GC content (%GC, blue) and regions of transcription (RNA-seq, green). Numbers in the topmost track in every panel indicate chromosomal coordinates (in kb). The scales for the y-axis are as follows: *CEN* (0–8000), %GC (0–75), RNA-seq reads (0–1500). Gray arrows in each panel indicate predicted ORFs based on RNA-seq data with arrowheads indicating the direction of transcription of the corresponding gene, also marked as ‘+’ and ‘–’ in the axis label. (C) Fold difference in Mtw1 and histone H3 enrichment at *CEN8*, as compared to a non-centromeric control region (190 kb away on the right of *CEN1*) by qPCR analysis. Schematic of a 4-kb region of Chr8 with the *CEN8* core (yellow box) is depicted below the graph. Blue lines indicate regions assayed by PCR: core-region corresponding to the GC trough; L1 and R1, 750 bp away from the core; L2, 1500 bp away from the core; and a non-centromeric control region (190 kb away from centromere in Chr1). The x-axis indicates regions across the *CEN8* probed by PCR and the y-axis indicates fold difference in the enrichment of Mtw1 and histone H3 as compared to the control region. Error bars indicate standard deviations (SD). Values from three experiments, each performed with three technical replicates, were used to generate the plot. (D) Logo of the consensus DNA sequence identified from *M. sympodialis* centromeres, graphically represented with the size of the base corresponding with the frequency of occurrence.

Figure 3—source data 1 Source raw data for Figure 3C (ChIP-qPCR for GFP-Mtw1 and Histone H3 across MsyCEN8).: https://cdn.elifesciences.org/articles/53944/elife-53944-fig3-data1-v2.xlsx
Download elife-53944-fig3-data1-v2.xlsx

Figure 3—figure supplement 1

Download asset Open asset

Mtw1-enriched regions in *M. sympodialis* contain transcriptionally active genes.

(A) Normalized read counts (RPKM) for transcripts from ORFs flanking the centromere as compared with the genome average RPKM value calculated from RNA-seq data of *M. sympodialis* (ATCC42132). The x-axis represents ORFs labeled L1–L5 and R1–R5, with L1 and R1 being proximal to the GC trough of the centromere. The genome average considered for comparison is labeled as global. The y-axis represents Log₂ RPKM values. (B) ChIP-qPCR assays validating the enrichment of Mtw1 at the centromeres identified by ChIP-sequencing analysis. The x-axis indicates individual *CEN* regions assayed with primers mapping to the chromosomal GC troughs (see Supplementary file 1 for primer sequences) and the y-axis indicates enrichment of Mtw1 over a centromere-unlinked control region as ‘normalized % input immunoprecipitated’. Values from three experiments, each performed with three technical replicates, were used to generate the plot. Error bars indicate standard deviations (SD).

Figure 3—figure supplement 1—source data 1 Source raw data for Figure 3—figure supplement 1B (ChIP-qPCR for GFP-Mtw1 across all M. sympodialis centromeres).: https://cdn.elifesciences.org/articles/53944/elife-53944-fig3-figsupp1-data1-v2.xlsx
Download elife-53944-fig3-figsupp1-data1-v2.xlsx

Figure 3—figure supplement 2

Download asset Open asset

Sequence features of centromeres in *M. sympodialis.*

(A) The genome of *M. sympodialis* was scanned for matches to the 12-bp AT-rich motif using a 500 bp sliding window. Hit counts (y-axis) were plotted against the chromosomal coordinates (x-axis, in kb). Red asterisks near the line corresponding to maximum enrichment in every chromosome mark the regions predicted as centromeres. (B) Comparison of the maximum motif counts in a 500-bp window in each *M. sympodialis* chromosome (y-axis) estimated with the position weight matrix (PWM) of the 12 bp motif (blue bars, labeled real) and a scrambled version of the matrix (orange bars, labeled shuffled). The x-axis indicates individual chromosomes in *M. sympodialis.* (C) A dot-plot generated by plotting sequences that include 5 kb flanking the Mtw1-enriched regions in *M. sympodialis* against themselves is depicted. The coordinates of regions used in *M. sympodialis* are: *CEN1*, 781716–791716; *CEN2*, 350852–360852; *CEN3*, 232777–242777; *CEN4*, 413320–423320; *CEN5*, 120251–130251; *CEN6*, 97796–107796; *CEN7*, 426611–436611; and *CEN8*, 19905–29905.

Figure 3—figure supplement 3

Download asset Open asset

Enrichment of the 12-bp motif at the centromere core in *M. sympodialis*.

The sequences corresponding to the Mtw1-enriched regions in Chr1 (with the AT-rich core region highlighted in pink) are taken as being representative of *M. sympodialis* centromeres. The sequences corresponding to the 12-bp motif shown in Figure 3D are highlighted. Red and blue boxes highlight the forward and reverse orientation of motif occurrence.

Figure 4 with 2 supplements

Download asset Open asset

Centromeres in *M. furfur*, *M. slooffiae,* and *M. globosa* map to global GC troughs on each chromosome.

(A) Schematic of epitope tagging of CENP-A with 3xFLAG at the C-terminus. (Bottom) Immunoblot analysis using whole-cell lysates prepared from the untagged wild-type *M. furfur* (CBS14141) cells and CENP-A-3xFLAG expressing cells (MF001) probed with anti-FLAG antibodies and anti-PSTAIRE antibodies. PSTAIRE served as the loading control. (B) Abundance of CENP-A at each of the predicted *M. furfur* centromeres by qPCR analysis of DNA immunoprecipitated with anti-FLAG affinity gel in MF001 cells expressing CENP-A-3xFLAG. The x-axis indicates individual *CEN* regions assayed with primers that are homologous to the GC troughs on each chromosome that were predicted to be centromeres (see Supplementary file 1 for primer sequences). The non-centromeric control sequence maps to a region 1.3 Mb away from predicted *CEN1*. The y-axis indicates enrichment of CENP-A estimated as the percentage of input immunoprecipitated. (C) Abundance of CENP-A across *MfCEN7* by ChIP-qPCR analysis in MF001 cells. A schematic representation of a 2-kb region is shown below the graph. The yellow bar indicates the centromere core of *CEN7* corresponding to the GC trough. Blue bars indicate regions analyzed by qPCR: L1 and R1, which are 750 bp away from the centromere core. ‘Control’ refers to a region 1.3 Mb away from *CEN1*. CENP-A enrichment is plotted along the y-axis as the percentage of input immunoprecipitated. (**D, E**) Comparison of the relative abundance of histone H3 compared to histone H4 at the predicted centromeres to a non-centromeric control locus in *M. slooffiae* (D) and *M. globosa* (E), respectively. Enrichment was estimated as the percentage of input immunoprecipitated with histone H3 and histone H4 antibodies and their ratio is plotted as relative enrichment (y-axis). The x-axis indicates centromeres in each species and these were assayed with primers homologous to GC troughs (i.e. predicted centromeres) in each chromosome (see Supplementary file 1 for primer sequences). The control region that is unlinked to the centromere corresponds to a locus 630 kb away from *CEN1* in *M. slooffiae* and 416 kb away from *CEN2* in *M. globosa*. (F) The relative abundance of histone H3 compared to histone H4 across *MslCEN1* as determined by qPCR analysis of the DNA immunoprecipitated using histone H3 and histone H4 antibodies. A schematic of the *MslCEN1* locus is shown below the graph. The yellow bar indicates the *CEN1* core region corresponding to the GC trough. Blue bars indicate regions analyzed by qPCR: L1 and L2 map to regions 750 bp and 1.5 kb to the left of the *CEN1* core; R1 maps to a region 750 bp to the right of the *CEN1* core. The control region corresponds to a locus 630 kb away from the *CEN1* core. The ratio of enrichment of histone H3 to that of histone H4 is plotted as the relative enrichment (y-axis). (G) The relative abundance of histone H3 compared to histone H4 across *MgCEN2* by qPCR analysis of the DNA immunoprecipitated using histone H3 and histone H4 antibodies. A schematic of a 5-kb region containing *MgCEN2* is shown below the graph. The yellow bar indicates the *CEN2* core region corresponding to the GC trough. Blue bars indicate regions analyzed by qPCR: L1 and L2 indicate regions 750 bp and 1.5 kb to the left of the *CEN2* core; R1 and R2 indicate regions 750 bp and 1.5 kb to the right of the *CEN2* core. The control region corresponds to a locus 416 kb away from the *CEN2* core. The ratio of enrichment of histone H3 to that of histone H4 is plotted as the relative enrichment (y-axis). Values from three experiments, each performed with three technical replicates, were used to generate the plots in panels (B–G). Error bars indicate standard deviations (SD). Statistical significance was tested by one-way ANOVA: *, significant at p<0.05, *** , significant at p<0.001.

Figure 4—source data 1 Source raw data for Figure 4B (ChIP-qPCR for CENP-A-3xFLAG across all M. furfur centromeres).: https://cdn.elifesciences.org/articles/53944/elife-53944-fig4-data1-v2.xlsx
Download elife-53944-fig4-data1-v2.xlsx
Figure 4—source data 2 Source raw data for Figure 4C (ChIP-qPCR for CENP-A and Histone H3 across MfCEN7).: https://cdn.elifesciences.org/articles/53944/elife-53944-fig4-data2-v2.xlsx
Download elife-53944-fig4-data2-v2.xlsx
Figure 4—source data 3 Source raw data for Figure 4D (ChIP-qPCR for Histone H3 and Histone H4 across all M. slooffiae centromeres).: https://cdn.elifesciences.org/articles/53944/elife-53944-fig4-data3-v2.xlsx
Download elife-53944-fig4-data3-v2.xlsx
Figure 4—source data 4 Source raw data for Figure 4E (ChIP-qPCR for Histone H3 and Histone H4 across all M. globosa centromeres).: https://cdn.elifesciences.org/articles/53944/elife-53944-fig4-data4-v2.xlsx
Download elife-53944-fig4-data4-v2.xlsx
Figure 4—source data 5 Source raw data for Figure 4F (ChIP-qPCR for Histone H3 and Histone H4 across MslCEN1).: https://cdn.elifesciences.org/articles/53944/elife-53944-fig4-data5-v2.xlsx
Download elife-53944-fig4-data5-v2.xlsx
Figure 4—source data 6 Source raw data for Figure 4G (ChIP-qPCR for Histone H3 and Histone H4 across MgCEN2).: https://cdn.elifesciences.org/articles/53944/elife-53944-fig4-data6-v2.xlsx
Download elife-53944-fig4-data6-v2.xlsx

Figure 4—figure supplement 1

Download asset Open asset

Putative centromeres of *M. furfur*, *M. slooffiae*, *M. globosa,* and *M. restricta* map to global GC troughs in each chromosome.

(**A–D**) Graphs indicating the GC content (red lines) and GC3 content (black lines) of each chromosome of *M. furfur*, *M. slooffiae, M. globosa,* and *M. restricta.* The GC3 content is measured by estimating the G or C content at the third position in each codon of all genes across a chromosome (Lynch et al., 2010; Zhu et al., 2017). Position of putative centromeres mapping to chromosomal GC minima are marked in blue. The x-axis indicates chromosomal coordinates in Mb.

Figure 4—figure supplement 2

Download asset Open asset

The 12-bp AT-rich motif is enriched at the putative centromeres of *M. furfur, M. slooffiae, M. globosa,* and *M. restricta*.

(**A–D**) The genomes of *M. furfur, M. slooffiae, M. globosa,* and *M. restricta* were scanned for matches to the 12-bp AT-rich motif using a 500-bp sliding window. Hit counts (y-axis) were plotted against the chromosomal coordinates (x-axis, in kb) for each of the above species. Red asterisks near the line corresponding to maximum enrichment in every chromosome or scaffold mark the regions predicted as centromeres in each species.

Figure 5

Download asset Open asset

*MgCEN2* and *MslCEN5* map to a synteny breakpoint in *M. sympodialis*.

(**A, B**) Circos plots depicting the gene synteny blocks that are conserved between *M. globosa* and *M. sympodialis* chromosomes and between *M. slooffiae* and *M. sympodialis* chromosomes. Tracks from outside to inside represent the positions of centromeres and telomeres, GC content (plotted as blue and red lines indicating GC content below or above genome average, calculated in 0.4-kb non-overlapping windows), and colored connectors indicate regions of conserved gene synteny between the two species. (**C, D**) Linear chromosome plots depicting syntenic regions between Chr2 of *M. globosa* and Chr5 of *M. slooffiae* with chromosomes of *M. sympodialis.* GC content (in %) is shown as red/blue lines above each chromosome. Circular labels mark the gene synteny breakpoints. The synteny breakpoint at *MgCEN2* and *MslCEN5* is marked as III. The regions on MsyChr2 and MsyChr4 where the homologs of ORFs flanking the breakpoint are located are marked II and IV, respectively. Labels I and V indicate gene synteny conservation on the other side of the fusion points II and IV on MsyChr2 and MsyChr4 as compared to *M. globosa* and *M. slooffiae* chromosomes, respectively. (**E, F**) Zoomed-in image of the gene synteny breakpoint at *MgCEN2* and *MslCEN5*, representing the conservation of genes flanking these centromeres in *M. sympodialis* chromosomes at the ORF level.

Figure 6 with 1 supplement

Download asset Open asset

Inactivation of *CEN8* and *CEN9* of *M. slooffiae* and *M. globosa* in MfChr3 resulted in reduction in chromosome number in *M. furfur.*

(**A, B**) Circos plots depicting the gene synteny blocks that are conserved between the *M. slooffiae* or *M. globosa* chromosomes and the *M. furfur* chromosomes. Tracks from outside to inside represent positions of centromeres and telomeres, GC content (plotted as blue and red lines indicating GC content below or above genome average, calculated in 0.4-kb non-overlapping windows), and colored connectors indicate regions of conserved gene synteny between each pair of two species. (C) Linear chromosome plot depicting the regions that show synteny between Chr3 of *M. furfur* and Chr3, Chr4, Chr7, Chr8, and Chr9 of *M. slooffiae.* GC content (in %) is shown as red/blue lines above each chromosome. Regions corresponding to *MslCEN8* and *MslCEN9* in MfChr3 are marked I and II, respectively. (D) Linear chromosome plot depicting the conservation of gene synteny between Chr3 of *M. furfur* and Chr1, Chr4, Chr8, and Chr9 of *M. globosa*. Regions corresponding to *MgCEN9* and *MgCEN8* in MfChr3 are marked I and II, respectively.

Figure 6—figure supplement 1

Download asset Open asset

Loss of AT-content in regions corresponding to *CEN8* and *CEN9* of *M. slooffiae* and *M. globosa* in MfChr3.

(**A, B**) Zoomed-in view of regions marked I (A) and II (B) in Figure 6C, corresponding to inactivated *MslCEN8* and *MslCEN9* in *M. furfur*, respectively, along with the regions of gene synteny are represented. GC content (in %) is shown as a red/blue line diagram above each chromosome. (**C, D**) Zoomed-in view of the regions marked I (C) and II (D) in Figure 6D, corresponding to inactivated *MgCEN9* and *MgCEN8* in *M. furfur*, respectively, along with the regions of gene synteny are represented. GC content (in %) is shown as red/blue lines above each chromosome.

Figure 7 with 4 supplements

Download asset Open asset

Karyotype evolution by loss of centromere function in *Malassezia* species.

(A) Phylogenetic relationships between the *Malassezia* species analyzed in this study are represented and their chromosome numbers are shown. Species representing each clade are color-coded on the basis of previous reports (Theelen et al., 2018). The chromosome numbers for *M. slooffiae* and *M. globosa* are based on results from this study. In the case of *M. sympodialis, M. restricta,* and *M. furfur*, the chromosome numbers are based on previous reports (Boekhout and Bosboom, 1994; Senczek et al., 1999; Zhu et al., 2017). For *M. dermatis, M. nana*, *M. vespertilionis,* and *M. japonica,* the number of chromosomes were estimated from the predicted number of centromeres. The nodes corresponding to the ancestral state for Clade A and Clade B are labeled ‘Anc. A’ and ‘Anc. B’, respectively. Green and red circles indicate the origins of karyotypes that have eight and seven chromosomes, respectively, from an ancestral state of nine chromosomes. (B) Schematic of the centromere loss by breakage and the resulting reduction in chromosome number as observed in *M. sympodialis* (represented as the current state). A karyotype with nine chromosomes (as shown for *M. globosa*) is depicted as the ancestral state. (C) Proposed model of centromere inactivation observed in *M. furfur* as a consequence of fusion of AnChr8 and AnChr9 to the AnMfChr3 equivalent, resulting in a seven-chromosome configuration. The fusion product corresponding to extant MfChr3 is represented as the current state.

Figure 7—figure supplement 1

Download asset Open asset

Putative centromeres of *M. dermatis*, *M. nana*, *M. vespertilionis,* and *M. japonica* map to global GC troughs in each chromosome.

(**A–D**) Graphs indicating the GC content (red lines) and GC3 content (black lines) of each chromosome of *M. dermatis*, *M. nana*, *M. vespertilionis,* and *M. japonica.* The positions of putative centromeres mapping to chromosomal GC minima are marked in blue. The x-axis indicates chromosomal coordinates in Mb.

Figure 7—figure supplement 2

Download asset Open asset

The 12-bp AT-rich motif is enriched at the putative centromeres of *M. dermatis, M. nana, M. vespertilionis,* and *M. japonica*.

The genomes of *M. dermatis, M. nana, M. vespertilionis,* and *M. japonica* were scanned for matches to the 12-bp AT-rich motif using a 500 bp sliding window. Hit counts (y-axis) were plotted against the chromosomal coordinates (x-axis, in kb) for each of the above species. Red asterisks near the line corresponding to maximum enrichment in every chromosome or scaffold mark the regions predicted as centromeres in each species.

Figure 7—figure supplement 3

Download asset Open asset

All of the predicted centromeres of *M. sympodialis*, *M. nana,* and *M. dermatis* belong to gene synteny blocks that are also conserved in species containing nine chromosomes.

The gene order and synteny of ORFs flanking the centromeres of six *Malassezia* species, which are representative of clades B and C, analyzed by BLAST analysis using the protein sequences from *M. sympodialis* as query. Species included are: *Mna, M. nana*; *Mde, M. dermatis*; *Msy, M. sympodialis*; *Mgl, M. globosa*; *Mre, M. restricta*; and *Msl, M. slooffiae*. Chromosome/scaffold numbers are indicated at the start of every track. Boxes represent ORFs with the numbers inside them indicating percentage identity from BLAST analysis. Broken lines marked with a filled circle towards the ends indicate gene synteny breakpoints. Red arrows indicate inverted orientation of the scaffold/chromosome.

Figure 7—figure supplement 4

Download asset Open asset

*MgCEN2* and *MgCEN3* are predicted to form secondary structures.

The secondary structure of the centromere cores of *MgCEN2* and *MgCEN3* predicted by the ViennaRNA, generated using Geneious 9.0 (Lorenz et al., 2011). Blue and red circles mark the 5′- and the 3′- DNA ends, respectively.

Author response image 1

Download asset Open asset

Figure corresponding to the generation of antibodies against CENP-A in *Malassezia* species.

(A) A line diagram of MfCENP-A indicating the peptide sequence (EANYASSASA) at positions 75-84 aa used for generation of antibodies in rabbit. (B) Detection of the peptide epitope by antibodies before and after purification by ELISA. For this assay, 50 ng peptide was coated in each well and probed with varying dilutions of the antisera (1:4000, 1:32000, 1:64000). Absorbance at 450 nm was recorded after the addition of stopping solution and absorbance values are plotted for each sample in the y-axis. NC- no antigen control, (**C–E**) whole cell extracts from *M. globosa (Mg*, CBS7966 strain), *M. sympodialis (Msy*, ATCC42132 strain) and *M. furfur (Mf*, CBS14141 strain) were probed with pre-immune sera, purified CENP-A antisera, and histone H3 antisera in various dilutions as indicated. Lane L denotes the protein molecular weight markers. Both pre-immune sera and the purified antibodies were tested at a dilution of 1:200 while histone H3 antibodies were used at 1:2500 dilution.

Tables

Table 1

Coordinates of centromeres and their GC content in M. sympodialis.

Coordinates and length of Mtw1-enriched regions in comparison with those of the core centromeres in M. sympodialis.

Chromosome number	Core centromere				Full-length centromere
	Coordinates		Length (bp)	%GC	Coordinates		Length (bp)
	Start	End	Length (bp)	%GC	Start	End	Length (bp)
1	786,541	787,061	520	16.4	784,833	788,599	3767
2	355,760	355,841	81	20	354,218	357,486	3269
3	237,534	238,686	1152	15.6	235,615	239,940	4326
4	418,202	418,728	526	15.2	415,985	420,656	4672
5	125,056	125,220	164	18	123,219	127,284	4066
6	101,950	102,502	552	14.4	100,342	105,251	4910
7	431,542	431,987	445	13.2	430,028	433,194	3167
8	24,694	25,564	870	18.4	22,334	27,476	5143

Genome average GC content (in %): 58.5.

Table 2

Coordinates, length, and GC content (in %) of the centromeres predicted in M. furfur, M. globosa, M. slooffiae, and M. restricta.

	Chr./scaffold	CEN	Core centromere				% GC genome
	Chr./scaffold	CEN	Start	End	Length (bp)	% GC	% GC genome
M. furfur CBS14141	Chr1	CEN1	2,850,135	2,850,402	268	15.7	64.9
	Chr2	CEN2	68,763	68,931	168	15.4
	Chr3	CEN3	717,557	718,084	528	22.9
	Chr4	CEN4	155,897	156,301	405	18.3
	Chr5	CEN5	342,885	343,372	488	21.5
	Chr6	CEN6	86,112	86,832	721	27
	Chr7	CEN7	56,894	57,339	445	20.9
M. globosa CBS7966	Chr1	CEN1	981,894	982,242	349	17.7	52.05
	Chr2	CEN2	362,480	362,807	327	25.9
	Chr3	CEN3	219,647	220,121	474	27.2
	Chr4	CEN4	152,635	152,994	359	18.3
	Chr5	CEN5	215,437	215,595	158	17
	Chr6	CEN6	464,007	464,114	107	32.4
	Chr7	CEN7	736,701	737,015	314	18.1
	Chr8	CEN8	59,472	59,817	345	19.7
	Chr9	CEN9	114,080	114,535	455	23.5
M. slooffiae CBS7956	Chr1	CEN1	138,919	139,465	547	26	66.31
	Chr2	CEN2	132,717	133,193	477	23.1
	Chr3	CEN3	367,665	368,177	513	23.8
	Chr4	CEN4	130,942	131,501	560	27
	Chr5	CEN5	183,442	183,981	540	28.5
	Chr6	CEN6	411,984	412,552	569	27.4
	Chr7	CEN7	54,307	54,889	583	30
	Chr8	CEN8	497,637	498,149	513	24
	Chr9	CEN9	55,948	56,479	532	26.7
M. restricta CBS877	Chr1	CEN1	347,813	348,406	594	29.7	55.73
	Chr2	CEN2	87,190	87,806	617	33.3
	Chr3	CEN3	1,101,494	1,102,083	590	33.9
	Chr4	CEN4	754,356	754,989	634	34.2
	Chr5	CEN6	621,177	621,863	687	31.7
	Chr6	CEN7	390,657	391,286	630	35.1
	Chr7	CEN8	362,842	363,381	540	32
	Chr8	CEN9	117,021	117,603	583	32.8
	Chr9	CENR	70,306	70,913	608	36.3

Table 3

Synteny of centromeres across all of the Malassezia species analyzed in this study.

Clade C	Clade B1			Clade B2		Clade A
M. slooffiae (9 Chr)	M. dermatis (8 Chr)	M. nana (8 Chr)	M. sympodialis (8 Chr)	M. globosa (9 Chr)	M. restricta (9 Chr)	M. vespertilionis (9 Chr)	M. japonica (9 Chr)	M. furfur (7 Chr)
CEN2	CEN1	CEN1	CEN1	CEN3	CEN5 partial	CEN6	CEN8	CEN4
CEN6	CEN2	CEN2	CEN2	CEN6	CEN7	CEN9	CEN4 partial	CEN2 partial
CEN3	CEN3	CEN3	CEN3	CEN1	CEN3	CEN7 partial	CEN6 partial	CEN3 partial
CEN4	CEN4	CEN4	CEN4	CEN4	CEN1	CEN1	CEN2	CEN7 partial
CEN1	CEN6	CEN6	CEN5	CEN5	CEN2	CEN4	CEN7	CEN1
CEN7	CEN5	CEN5	CEN6	CEN7	CEN6	CEN5 partial	CEN5 partial	CEN6
CEN8	CEN7	CEN7	CEN7	CEN9	CEN4	CEN2	CEN9 partial	Inactivated
CEN9	CEN8	CEN8	CEN8	CEN8	CEN9	CEN8	CEN1	Inactivated
CEN5	BP	BP	BP	CEN2	CEN8 partial	CEN3	CEN3 partial	CEN5

‘BP’ indicates the presence of a gene synteny break. ‘Inactivated’ indicates centromere inactivation resulting from sequence divergence and erosion of AT-richness.

Table 4

Coordinates, length, and GC content (in %) of the centromeres predicted in M. dermatis, M. nana, M. vespertilionis, and M. japonica.

	Chr./scaffold	CEN	Core centromere				% GC genome
	Chr./scaffold	CEN	Start	End	Length (bp)	% GC	% GC genome
M. dermatis JCM11348	BCKX01000001.1 (Scf1)	CEN1	711,456	711,978	523	22.8	59.05
	BCKX01000002.1 (Scf2)	CEN2	1,014,281	1,014,977	697	31.7
	BCKX01000003.1 (Scf3)	CEN3	232,065	232,795	731	29.3
	BCKX01000004.1 (Scf4)	CEN4	409,839	410,631	793	29.5
	BCKX01000005.1 (Scf5)	CEN5	94,520	95,018	499	18.2
	BCKX01000006.1 (Scf6)	CEN6	473,487	474,334	848	30.4
	BCKX01000007.1 (Scf7)	CEN7	76,361	76,975	615	26
	BCKX01000008.1 (Scf8)	CEN8	17,893	18,540	648	26.4
M. nana JCM12085	BCLA01000001.1 (Scf1)	CEN1	715,036	715,592	557	27.8	57.95
	BCLA01000002.1 (Scf2)	CEN2	349,428	350,120	693	33
	BCLA01000003.1 (Scf3)	CEN3	220,773	221,345	573	27.9
	BCLA01000004.1 (Scf4)	CEN4	410,594	411,387	794	33.2
	BCLA01000005.1 (Scf5)	CEN5	524,594	525,105	512	24.8
	BCLA01000006.1 (Scf6)	CEN6	133,647	134,324	678	33.6
	BCLA01000007.1 (Scf7)	CEN7	408,363	409,067	705	34.2
	BCLA01000008.1 (Scf8)	CEN8	398,756	399,423	668	32.5
M. vespertilionis CBS15041	KZ454987.1 (Scf1)	CEN1	410,820	411,340	521	15.7	56.6
	KZ454988.1 (Scf2)	CEN2	1,275,509	1,276,238	730	25.8
	KZ454989.1 (Scf3)	CEN3	322,361	323,277	917	38.2
	KZ454990.1 (Scf4)	CEN4	583,450	584,319	870	28.9
	KZ454991.1 (Scf5)	CEN5	802,843	804,042	1200	28.8
	KZ454992.1 (Scf6)	CEN6	739,896	740,558	663	22.3
	KZ454993.1 (Scf7)	CEN7	268,699	269,626	928	28.8
	KZ454994.1 (Scf8)	CEN8	10,985	11,865	881	28
	KZ454995.1 (Scf9)	CEN9	19,047	19,724	678	29.1
M. japonica JCM11963	BCKY01000001.1 (Scf1)	CEN1	1,068,050	1,068,614	564	25.1	62.35
	BCKY01000002.1 (Scf2)	CEN2	139,423	139,920	497	20.3
	BCKY01000003.1 (Scf3)	CEN3	350,068	350,603	535	24.3
	BCKY01000004.1 (Scf4)	CEN4	380,877	381,439	562	24.2
	BCKY01000005.1 (Scf5)	CEN5	507,632	508,230	598	24.5
	BCKY01000006.1 (Scf6)	CEN6	240,968	250,550	582	23.7
	BCKY01000007.1 (Scf7)	CEN7	286,711	287,234	523	24.9
	BCKY01000008.1 (Scf8)	CEN8	87,314	87,873	559	23.9
	BCKY01000010.1 (Scf10)	CEN9	230,906	231,456	530	24.1

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Identifier	Additional information
Biological sample (Malassezia globosa)	CBS7966	CBS		Wild-type strain
Biological sample (Malassezia slooffiae)	CBS7956	CBS		Wild-type strain
Biological sample (Malassezia furfur)	CBS14141	CBS		Wild-type strain
Biological sample (Malassezia sympodialis)	ATCC42132	ATCC		Wild-type strain
Biological sample (Saccharomyces cerevisiae)	BY4741			Wild-type strain (MATa his3Δ1 leu2Δ0 met15Δ0 ura3Δ0)
Genetic reagent (Malassezia sympodialis)	MSY001	This study		M. sympodialis (ATCC42132) cells expressing GFP-Mtw1
Genetic reagent (Malassezia furfur)	MF001	This study		M. furfur (CBS14141) cells expressing CENP-A-3xFLAG
RecombinantDNA reagent	Plasmid pGI3	(Ianiri et al., 2017b)		Vector backbone for Agrobacterium-mediated transconjugation
RecombinantDNA reagent	Plasmid pAIM1	(Ianiri et al., 2016)		Plasmid with NAT marker, used as template for PCR
RecombinantDNA reagent	Plasmid pVY7	(Kozubowski et al., 2013)		Plasmid with GFP, used as template for PCR
RecombinantDNA reagent	Plasmid pMHR04	This study		Construct used to tag MsyMtw1 with GFP tag at the amino-terminus
RecombinantDNA reagent	Plasmid pMF1	This study		Construct used to tag MfCENP-A with 3xFLAG tag at the carboxyl terminus
Antibody	Anti-GFP (mouse monoclonal)	Roche	11814460001	IF (1:100) WB (1:3000)
Antibody	Anti-PSTAIRE (mouse monoclonal)	Abcam	Cat. no. 10345	WB (1:5000)
Antibody	Anti-mouse IgG HRP (goat polyclonal)	Abcam	Cat. no. ab97023	WB (1:10,000)
Antibody	Anti-H3 (rabbit polyclonal)	Abcam	Cat. no. ab1791	ChIP (5 μL per 500 μL IP fraction)
Antibody	Anti-H4 (rabbit polyclonal)	Abcam	Cat. no. ab10158	ChIP (5 μL per 500 μL IP fraction)
Antibody	Anti-FLAG (M2) (mouse monoclonal)	Sigma	Cat. no. F1804	WB (1: 2500)
Other	GFP-trap beads	ChromoTek	Cat. no. gta-20	ChIP (20 μL per 500 μL fraction)
Other	Blocked agarose beads	ChromoTek	Cat. no. bab-20	ChIP (20 μL per 500 μL fraction)
Other	Protein-A sepharose beads	Sigma	Cat. no. P3391	ChIP (20 μL per 500 μL fraction)
Other	M2 anti-FLAG affinity gel	Sigma	Cat. no. A2220	ChIP (20 μL per 500 μL fraction)
Other	Lysing enzymes from Trichoderma harzianum	Sigma	Cat. no. L1412	Enzyme to prepare spheroplasts
Other	Zymolyase 20T	MP biomedicals	Cat. no. 320921	Enzyme to prepare spheroplasts
Other	Chitosanase	Sigma	Cat. no. C0794	Enzyme to prepare spheroplasts
Other	SensiFAST SYBR No ROX Kit	Bioline	Cat. no. BIO-98020	qPCR assay reagent
Chemical compound, drug	2-mercaptoethanol	HiMedia	Cat. no. MB041
Software, algorithm	Geneious 9.0	Biomatters Ltd.
Software, algorithm	SyMap	(Soderlund et al., 2011)
Software, algorithm	PhylloGibbs-MP	(Siddharthan, 2008)
Software, algorithm	Circos	(Krzywinski et al., 2009)
Software, algorithm	Satsuma	(Grabherr et al., 2010)
Software, algorithm	Easyfig	(Sullivan et al., 2011)
Software, algorithm	GET_HOMOLOGUES	(Contreras-Moreira and Vinuesa, 2013)
Software, algorithm	IQ-TREE v1.6.5	(Nguyen et al., 2015)
Software, algorithm	iTOL v4.3.3	(Letunic and Bork, 2019)
Software, algorithm	SMRTPortal v2.3	PacBio, Menlo Park, CA, USA
Software, algorithm	Fiji	National Institutes of Health
Software, algorithm	Illustrator	Adobe Systems
Software, algorithm	Excel	Microsoft
Software, algorithm	Word	Microsoft
Sequence-based reagent	List of primers used in this study	Sigma		In Supplementary file 1

Additional files

Supplementary file 1 This file contains the names, sequences, and description of primers used in this study.: https://cdn.elifesciences.org/articles/53944/elife-53944-supp1-v2.docx
Download elife-53944-supp1-v2.docx
Transparent reporting form: https://cdn.elifesciences.org/articles/53944/elife-53944-transrepform-v2.pdf
Download elife-53944-transrepform-v2.pdf

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Mendeley

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Sundar Ram Sankaranarayanan
Giuseppe Ianiri
Marco A Coelho
Md Hashim Reza
Bhagya C Thimmappa
Promit Ganguly
Rakesh Netha Vadnala
Sheng Sun
Rahul Siddharthan
Christian Tellgren-Roth
Thomas L Dawson Jnr
Joseph Heitman
Kaustuv Sanyal

(2020)

Loss of centromere function drives karyotype evolution in closely related Malassezia species

eLife 9:e53944.

https://doi.org/10.7554/eLife.53944

Share this article

Cite this article

Genome assembly and karyotype diversity in representative Malassezia species.

Statistics of the genome assemblies of M. globosa, M. slooffiae, and M. furfur generated in this study.

Figure 1—figure supplement 1—source data 1

Kinetochores cluster and localize at the nuclear periphery in M. sympodialis.

Identification of kinetochore proteins in M. sympodialis by BLAST.

Figure 2—figure supplement 1—source data 1

Localization of Mtw1 to single-peaks identifies centromeres on each of the eight chromosomes of M. sympodialis.

Figure 3—source data 1

Mtw1-enriched regions in M. sympodialis contain transcriptionally active genes.

Figure 3—figure supplement 1—source data 1

Sequence features of centromeres in M. sympodialis.

Enrichment of the 12-bp motif at the centromere core in M. sympodialis.

Centromeres in M. furfur, M. slooffiae, and M. globosa map to global GC troughs on each chromosome.

Figure 4—source data 1

Figure 4—source data 2

Figure 4—source data 3

Figure 4—source data 4

Figure 4—source data 5

Figure 4—source data 6

Putative centromeres of M. furfur, M. slooffiae, M. globosa, and M. restricta map to global GC troughs in each chromosome.

The 12-bp AT-rich motif is enriched at the putative centromeres of M. furfur, M. slooffiae, M. globosa, and M. restricta.

MgCEN2 and MslCEN5 map to a synteny breakpoint in M. sympodialis.

Inactivation of CEN8 and CEN9 of M. slooffiae and M. globosa in MfChr3 resulted in reduction in chromosome number in M. furfur.

Loss of AT-content in regions corresponding to CEN8 and CEN9 of M. slooffiae and M. globosa in MfChr3.

Karyotype evolution by loss of centromere function in Malassezia species.

Putative centromeres of M. dermatis, M. nana, M. vespertilionis, and M. japonica map to global GC troughs in each chromosome.

The 12-bp AT-rich motif is enriched at the putative centromeres of M. dermatis, M. nana, M. vespertilionis, and M. japonica.

All of the predicted centromeres of M. sympodialis, M. nana, and M. dermatis belong to gene synteny blocks that are also conserved in species containing nine chromosomes.

MgCEN2 and MgCEN3 are predicted to form secondary structures.

Figure corresponding to the generation of antibodies against CENP-A in Malassezia species.

Coordinates of centromeres and their GC content in M. sympodialis.

Coordinates, length, and GC content (in %) of the centromeres predicted in M. furfur, M. globosa, M. slooffiae, and M. restricta.

Synteny of centromeres across all of the Malassezia species analyzed in this study.

Coordinates, length, and GC content (in %) of the centromeres predicted in M. dermatis, M. nana, M. vespertilionis, and M. japonica.

Supplementary file 1

Transparent reporting form

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)