PFG is a target gene of AP2-G and is expressed in female gametocytes

A. Schematic diagram of PFG from P. berghei and its homologs in apicomplexan parasites. Regions homologous to Regions 1 and 2, which are highly conserved among Plasmodium species, are shown as yellow and blue rectangles, respectively. Nuclear localization signals were predicted using the cNLS mapper (http://nls-10mapper.iab.keio.ac.jp/cgi-bin/NLS_Mapper_form.cgi). The gene IDs of P. berghei PFG, P. falciparum PFG, and their homologs in Toxoplasma gondii, Eimeria tenella and Vitrella brassicaformis are PBANKA_0902300, PF3D7_1146800, TGGT1_239670, ETH2_1252400, and Vbra_10234, respectively. B. Alignment of amino acid sequences of Region 1 and Region 2 from P. berghei and P. falciparum PFGs was performed using the ClustalW program (https://www.genome.jp/tools-bin/clustalw). C. The amino acid sequences of Regions 1 and 2 from P. berghei PFG and its homologs from other apicomplexan parasites in (A) were aligned using the ClustalW program in MEGA X. The positions at which all these sequences have identical amino acids are indicated by two asterisks, and positions with amino acid residues possessing the same properties are indicated by one asterisk. D. Expression of PFG in mature male and female gametocytes of PFG::GFP parasites. Nuclei were stained with Hoechst 33342. Scale bars: 5 μm. E. Integrative Genomics Viewer (IGV) images from the ChIP-seq data of AP2-G and AP2-FG in the upstream region of PFG. Data were obtained from previous papers (8)(9). The purple and red bars indicate the binding motifs of AP2-G and AP2-FG in the peak region. F. Time-course observations of PFG expression during gametocyte development using PFG::GFP parasites. Parasites were observed every two hours from 12 hpi onwards. Nuclei were stained with Hoechst 33342.

Development of female gametocytes was impaired in PFG(-)parasites.

A. Proliferation of blood-stage parasites in wild-type and PFG(−) parasites. Parasitemia was calculated by counting infected red blood cells using Giemsa staining of blood smears. B. Giemsa-stained images of mature male and female gametocytes in PFG(-) parasites. C. Number of oocysts in the mosquito midgut 14 days after infective blood meal of wild-type and PFG(-) parasites. D. Ookinete cultures of wild type and PFG(-) parasites. Ratios of zygotes, retort-form, and banana-shaped ookinetes to all female-derived cells are shown in black, grey, and white, respectively. E. A cross-fertilization experiment was performed between PFG(-), P48/45(-) (males are infertile), and P47(-) (females are infertile) parasites. The rate of conversion of females to mature ookinetes is shown. F. Flow cytometry analysis of 820 parasites and PFG(-) parasites derived from 820 parasites. G. Volcano plot showing differential expression of genes in PFG(-) compared to wild-type parasites. Red dots represent female-enriched genes, and horizontal and vertical lines indicate p-value of 0.001 and log2(fold change) of −2, respectively. A pie graph on the top right of the plot area shows the number of female-enriched genes among the significantly downregulated genes.

PFG is co-localized with AP2-FG A. IGV images from ChIP-seq experiments 1 and 2 of PFG on chromosome 14. Histograms show row read coverage in the ChIP data at each base. The Venn diagram on the right shows the number of peaks common in ChIP-seq experiments 1 and 2. B. Representative peak images of PFG and AP2-FG upstream of genes that were significantly downregulated in PFG(-) parasites. Positions of ten-base motifs are indicated by red bars. ChIP-seq data for AP2-FG were obtained from a previous paper (23). C. The ten-base motif was enriched around the PFG peaks. Sequence logos were constructed using WebLogo 3 (http://weblogo.threeplusone.com/create.cgi). D. Heat maps showing coverage in ChIP-seq of PFG (left) and AP-FG (right) with positioning summits of PFG peaks at the center. E. IGV images of ChIP-seq peaks of AP2-FG that lacked the corresponding PFG peak. Histograms show the row read coverage in the ChIP data at each base.

PFG mediates AP2-FG binding to the ten-base motif

A. Venn diagram showing the number of common peaks between experiments 1 and 2 in ChIP-seq of PFG using PFG::GFPAP2-FG(-) parasites. B. IGV images from the ChIP-seq peaks of PFG on part of chromosome 14. The top panel shows the ChIP peaks using PFG::GFP parasites, and the bottom panel shows those using PFG::GFPAP2-FG(-) parasites. Histograms show row read coverage in the ChIP data at each base. C. Heat maps showing coverage in ChIP-seq of PFG using PFG::GFP parasites (left) and PFG using PFG::GFPAP2-FG(-) parasites, with positioning summits of peaks obtained in the other ChIP-seq at the center. D. Motifs enriched around the summits of peaks identified in ChIP-seq of PFG using PFG::GFPAP2-FG(-) parasites. The sequence logo was depicted using WebLogo 3. E. Venn diagram showing the number of common peaks between experiments 1 and 2 in ChIP-seq of AP2-FG using AP2-FG::GFPPFG(-) parasites. F. IGV images of ChIP-seq peaks of AP2-FG and PFG in a region on which several AP2-FG peaks lack the corresponding PFG peak (chromosome 14:1,150 kb-1,500 kb). The top panel shows ChIP peaks of AP2-FG using AP2-FG::GFP. The middle and bottom panels show the ChIP-seq peaks of PFG using PFG::GFP and the ChIP peaks of AP2-FG using AP2-FG::GFPPFG(-) parasites, respectively. The AP2-FG peaks lacking their corresponding PFG peaks are highlighted by rectangles. Histograms show row read coverage in the ChIP data at each base. G. Motifs enriched around the summits of peaks identified in the ChIP-seq of AP2-FG using AP2-FG::GFPPFG(-) parasites. Sequence logos were depicted using WebLogo 3. H. Peak image of PFG and the ten-base motif under the summit upstream of a CPW-WPC family protein gene (PBANKA_1346300). The motif was mutated using the CRISPR/Cas9 system, and the mutation was confirmed by Sanger sequencing (the lowest panel; mutated nucleic acid residues are highlighted). The protospacer adjacent motif (PAM) sequence used for targeting is also highlighted by a rectangle. I. ChIP-qPCR analysis of PFG in the upstream region of PBANKA_1346300. Gray and white bars indicate the results using wild-type and motif-mutated parasites, respectively. Three independent biological experiments were performed. The results are shown as a percentage input. Error bars indicate standard error. Experiments with another CPW-WPC family protein gene (PBANKA_1123200) and TRAP were performed as positive and negative controls, respectively.

sAP2-FG binds to five-base motifs that function as a cis-acting element independent of the ten-base motif.

A. Motifs enriched around peaks identified in DIP-seq of the AP2 domain of AP2-FG. Sequence logos were depicted using WebLogo 3. B. Histogram showing the distance between the peak summit and the nearest five-base motifs in (A) for each ChIP peak of sAP2-FG. C. Peak image of sAP2-FG upstream of NEK2 and the five-base motif under the summit. The motif was mutated, and the sequence was confirmed by Sanger sequencing (lowest panel). The protospacer adjacent motif (PAM) sequence used for targeting is highlighted by a rectangle. Mutated nucleic acid residues have also been highlighted. D. ChIP-qPCR analysis of sAP2-FG in the upstream region of NEK2. Gray and white bars indicate the results obtained for wild-type and motif-mutated parasites, respectively. The results are shown as mean %input values from three biologically independent experiments. Error bars indicate standard error. Experiments with MTIP and IMC1i were performed as positive controls, and that in TRAP as a negative control. E. Schematic diagram of a Plasmodium centromere plasmid used to assessing promoter activity of P28. Mutations introduced into the ten-base and five-base motifs of the P28 promoter are described in a grey box. Pbcen, a sequence of the P. berghei chromosome 5 centromere, 5’-ELF1α, a bidirectional promoter of the elongation 1α gene for conferring constitutive expression F. FACS analysis of parasites harboring the P28-reporter plasmid (E). Parasites were gated on forward-scatter and staining with Hoechst 33342, and then on GFP. The percentages of mCherry-positive parasites in all gated cells are shown in the left panel. The histogram on the right shows the number of mCherry-positive parasites at different signal strengths.

Target genes of PFG (cAP2-FG) and sAP2-FG contain different sets of genes.

A. Functional classification of the PFG target genes (cAP2-FG) (572 annotated genes in total). In this graph, different subgroups related to female gametocyte-specific functions are collectively shown as a group ‘female-specific functions’ (see also Table S3A). Hypothetical protein genes were not included. The number of members in each group is shown in the chart. B. Functional classification of the target genes of sAP2-FG (240 annotated genes in total). Hypothetical protein genes were not included. C. Venn diagram showing an overlap between the target genes of sAP2-FG and PFG (cAP2-FG). The target genes in ‘female-specific functions’ are written according to their belongings: unique to sAP2-FG, common, and unique to cAP2-FG. The assignment of these genes into each subgroup was based on functional annotation in PlasmoDB and the following references(16)(24)(25)(26)(27)(28)(29)(30)(31)(14)(32)(33)(18)(34)(35)(36)(37)(38)(39)(40)(41)(42). D. IGV images from the ChIP-seq data of AP2-G, PFG, and sAP2-FG in the upstream region of PFG. The data of the ChIP-seq peaks for AP2-G were obtained from a previous study (8). The peak region’s binding motifs of AP2-G, ten-base and five-base motifs are indicated by purple, red, and orange bars. E. A putative cascade of TFs starting from AP2-G is suggested in this and previous studies. F. Box-and-whisker plots showing log2(fold change) for target genes unique to sAP2-FG and PFG (cAP2-FG) and common for both. Cross marks in the boxes indicate the average values. G. A model of transcriptional regulation during female development. Female-specific genes harbor either or both of the female-specific cis-activating elements, 5-base and ten-base elements in the upstream. In early female gametocytes, AP2-FG binds to a five-base cis-activating element via its AP2 domain and activates genes for gametocyte development and some genes for zygote/ookinete development. In the later stage, when PFG is highly expressed, AP2-FG is predominantly recruited to the PFG on the ten-base cis-activating element, and the AP2-FG/PFG complex activates a full repertoire of genes for zygote/ookinete development.

Preparation of transgenic parasites.

A. Preparation of PFG::GFP parasites. B. Preparation of PFG(-) and PFG(-)820 parasites. C. Preparation of PFG::GFPAP2-FG(-) parasites. PFG::GFPAP2-FG(-) parasites were prepared from AP2-FG(-) parasites using the sulfadiazine-resistant dihydropteroate synthase (DHPS) gene as a selectable marker gene. D. Preparation of AP2-FG::GFPPFG(-) parasites. AP2-FG::GFPPFG(-) parasites were prepared using CRISPR/Cas9 in two steps. First, PFG was disrupted in pbcas9, and then GFP was fused to the 3’-portion of the AP2-FG gene.