
Germline stem cells (GSCs) are of essential importance for genome transmission from generation to generation [1]. Although unipotent, GSCs have a unique capability to continuously generate gametes. In recent years, extensive efforts have been made to understand the specification of primordial germ cells (PGCs) and the profound epigenetic reprogramming (including genome-wide DNA demethylation and histone remodeling) which is necessary for the zygote to acquire totipotency after fertilization [2, 3]. Much less is known about the regulatory mechanisms that govern the fundamental properties of mammalian female GSCs. It has long been believed that female mammals lose the ability to produce oocytes at birth [4–8]. However, this concept has been reshaped by recent studies in which female germline stem cells (FGSCs) have been identified in postnatal ovaries of various mammalian organisms [9–14]. For example, some promoter regions in ESCs are co-marked by H3K4me3 and H3K27me3 and have been termed bivalent domains. These bivalent genes are poised at the ESC stage and could be activated in downstream development stages [16]. A study of H3K4me1 and H3K27ac, histone modifications marking enhancers, indicated that they are cell type-specific and involved in determining cellular identity [17]. Identifying and characterizing regulatory DNA elements (e.g., promoters and enhancers) is difficult owing to the lack of recognizable and consistent sequence features, but epigenetic profiling in ESCs has proven to be a powerful tool for delineating them. These profiling analyses also provide insights into stem cell biology.

In previous reports, we generated mouse FGSC lines and demonstrated that FGSCs could undergo oogenesis once transplanted into ovaries of infertile mice and give rise to offspring [9, 13]. Although this work was viewed as useful for both basic research and medicine [18], the regulatory mechanisms that govern the identity of FGSCs remain elusive. Here, we carried out extensive epigenomic profiling and RNA sequencing (RNA-Seq) analyses with the aim of understanding epigenetic and genetic control in mouse FGSCs.


FGSCs exhibit lineage-specific gene expression signatures

We first characterized the cultured FGSCs by examining the molecular signatures associated with germline development. Immunocytochemical analysis indicated that FGSCs are positive for germline-specific markers Mvh and Fragilis (Additional file 1: Figure S1a). Then we examined the expression of other germ cell-specific markers by reverse transcription polymerase chain reaction (RT-PCR). We found that Dazl and Stella are expressed in FGSCs (Additional file 1: Figure S1b). The characteristics detected in this study are consistent with the observations we reported previously [9, 13].

To obtain a global view of the transcription pattern of FGSCs, we performed transcriptional profiling of mRNA with strand-specific RNA-Seq (Fig. 1a) and compared our data with those from ESCs [19]. In contrast to Nanog and Sox2, which are specifically expressed in ESCs, we found that Ifitm3/Fragilis, Ptx3, and GM1673 are selectively expressed in FGSCs (Fig. 1b). Moreover, we found that Akt1 is highly expressed in FGSCs; the Akt1 pathway is involved in self-renewal of mouse germline stem cells [20]. Piwi proteins bind piwi-interacting RNA (piRNA), which is responsible for repetitive element silencing during germline development. Intriguingly, we observed that the Piwi proteins Mili, Miwi, and Miwi2 are not actively expressed, probably because the Piwi–piRNA pathway is particularly involved in gametogenesis [21]. Thus, these observations verify the known molecular signatures of FGSCs.

Generation of genome-wide epigenome reference maps in FGSCs. b Scatter plot of FGSC/ESC expression data sets. Orange dots indicate genes with significantly differential expression (p < 0.01).

Extensive mapping of chromatin marks in FGSCs

To explore the chromatin state and its effect on the properties of FGSCs, we performed chromatin immunoprecipitation sequencing (ChIP-Seq) to generate genome-wide maps by profiling four histone modifications (H3K4me1, H3K27ac, H3K4me3, and H3K27me3) and RNA polymerase II (RNA Pol II) occupancy. We also profiled global DNA methylation by MethylCap-Seq. In addition, we measured gene expression levels with RNA-Seq and generated more than 108 million uniquely mapped reads for detecting gene expression in FGSCs (Fig. 1a; Additional file 1: Table S1). For each high-throughput sequencing analysis, at least two biological replicates were performed, which are fairly correlated (Additional file 1: Figure S2a). All data sets have been deposited in a public database.

The sequencing data were visualized in the Integrative Genomics Viewer (IGV) by generating histograms of normalized densities of ChIP fragments across the FGSC genome (Fig. 1c; Additional file 1: Figure S3) [22]. The map of histone modifications and DNA methylation shows signal distributions that are consistent with their functions [14]. For example, H3K4me3 has been regarded as a hallmark of transcription initiation and is primarily localized at promoters [23]. In our study, we found that 90 % of H3K4me3 sites are located at promoter regions. A case in point is the presence of strong H3K4me3 at the promoter of Ifitm3/Fragilis, which encodes a protein used to generate the FGSC line [9, 16]. Taken together, these observations suggest that the data sets we generated here can be used to identify the cis-regulatory elements in FGSCs.

Active enhancers distinguish FGSCs from ESCs

It has been recognized that enhancer regions are marked by H3K4me1 in a cell type-specific manner and involved in determining cellular identity [17, 24]. Moreover, these cis-regulatory elements could be further classified into “active” or “poised” enhancers based on the presence of H3K27ac [25]. Consistent with these observations, we found both types of enhancer sites in FGSCs (Fig. 2a). Examination of the genomic distribution of both types of enhancers relative to transcription start sites (TSSs) indicated that these enhancers exhibit a similar distribution pattern, with the majority located away from TSSs (Fig. 2b).
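The active/poised distinction described above can be sketched as a simple interval-overlap rule. The following is a minimal illustration in Python and not the pipeline used in this study: the peak coordinates, the function names, and the rule that any overlap with an H3K27ac peak marks an H3K4me1 site as "active" are all assumptions made for the example.

```python
from typing import List, Tuple

# A peak is (chromosome, start, end) with half-open coordinates (assumed convention).
Peak = Tuple[str, int, int]


def overlaps(a: Peak, b: Peak) -> bool:
    """Return True if two peaks share at least one base on the same chromosome."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def classify_enhancers(h3k4me1: List[Peak], h3k27ac: List[Peak]):
    """Split H3K4me1 peaks into active (with H3K27ac) and poised (without H3K27ac)."""
    active, poised = [], []
    for peak in h3k4me1:
        if any(overlaps(peak, ac) for ac in h3k27ac):
            active.append(peak)
        else:
            poised.append(peak)
    return active, poised


# Hypothetical peaks, for illustration only.
h3k4me1_peaks = [("chr1", 1000, 2000), ("chr1", 5000, 6000), ("chr2", 300, 900)]
h3k27ac_peaks = [("chr1", 1500, 2500), ("chr2", 1000, 1400)]

active, poised = classify_enhancers(h3k4me1_peaks, h3k27ac_peaks)
print("active:", active)
print("poised:", poised)
```

A real analysis would typically also exclude peaks close to TSSs before calling enhancers; that filter is omitted here for brevity.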

Epigenetic profiling identifies the enhancer regions in FGSCs. b Distribution of active and poised enhancers relative to their closest UCSC gene transcription start site (TSS). c K-means clustering of H3K4me1 and H3K27ac ChIP-Seq signals, the predictors of active enhancers, in ESCs and FGSCs. A window of 10 kb (−5 kb to +5 kb) around the peak center is shown. d Gene expression was measured as fragments per kilobase of exon per million fragments mapped (FPKM) and calculated for all mouse UCSC genes (blue) and for those closest (within 200 kb) to FGSC-specific active enhancers (red) (class 2 in c). Transcription levels in both cell types are presented as box plots (p values were calculated using paired Wilcoxon tests). e Enriched mouse phenotypes for nearest genes within 200 kb of FGSC enhancer signatures (p < 0.05). Loss of genes (e.g., Npr2 and Ptgs2) with FGSC-specific enhancer signatures causes abnormal reproductive system physiology [57, 58]. f Number of bivalent promoters in ESCs and FGSCs
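For readers unfamiliar with the expression unit used in panel d, FPKM can be computed from three quantities, as in the minimal Python sketch below. The function name and the example numbers are hypothetical and are not taken from the data sets in this study.

```python
def fpkm(fragments: int, exon_length_bp: int, total_mapped_fragments: int) -> float:
    """Fragments per kilobase of exon per million fragments mapped.

    fragments: fragments assigned to the gene
    exon_length_bp: summed exon length of the gene in base pairs
    total_mapped_fragments: all mapped fragments in the library
    """
    if exon_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("exon length and library size must be positive")
    # 1e9 = 1e3 (per kilobase) * 1e6 (per million mapped fragments)
    return fragments * 1e9 / (exon_length_bp * total_mapped_fragments)


# Hypothetical gene: 500 fragments, 2 kb of exon, 20 million mapped fragments.
example_fpkm = fpkm(500, 2000, 20_000_000)
print(example_fpkm)
```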

Both ESCs and FGSCs are capable of self-renewal in vitro, but they possess different developmental potentials. To examine the underlying regulatory elements exclusively involved in FGSCs, we performed a K-means clustering analysis with active enhancer sites co-modified by H3K4me1 and H3K27ac and generated four major classes (Fig. 2c). Not surprisingly, genes associated with the FGSC-specific active enhancer regions (single nearest genes within 200 kb) exhibit higher transcriptional activity in FGSCs compared with ESCs (Fig. 2d). To understand how the lineage-specific enhancers contribute to FGSC identity, we performed gene ontology (GO) analysis with the Genomic Regions Enrichment of Annotations Tool (GREAT) [26]. We found that the FGSC-specific enhancer peaks (class 2) are highly enriched for genes involved in reproduction-related phenotypes (Fig. 2e; Additional file 2: Table S2), including reproductive system physiology (e.g., Notch2, Npr2, and Nr2f2) and female fertility (e.g., Ptgs2, Ptx3, and Vrk1) (Additional file 1: Figure S4a). Meanwhile, we found that ESC-specific active enhancers (Fig. 2c, class 3) are mainly involved in embryogenesis and that the active enhancers shared by ESCs and FGSCs are enriched for mitotic cell cycle-related genes (Additional file 1: Figure S4b).

A bivalent domain chromatin signature is not widespread in FGSCs

Bivalent domain chromatin has been reported to be involved in developmental plasticity of ESCs [16]. In this study, we examined the presence of bivalent promoters in FGSCs. To our surprise, we observed that bivalent promoters are much less prevalent in FGSCs than in ESCs (Fig. 2f; Additional file 3: Table S3). It is unlikely that this observation results from inefficient H3K27me3 ChIP-Seq, as a considerable number of H3K27me3-marked regions are identified in FGSCs (Additional file 3: Table S3). A similar phenomenon was reported in multipotent neural crest cells [27]. These observations suggest that an “epigenetic code” other than the bivalent domain is responsible for the developmental plasticity of stem/progenitor cells.

DNA methylation contributes to FGSC identity by suppressing the somatic program

One of the major issues in germline stem cell biology is how unipotency is maintained. During the specification of germ cells, the Blimp1/Prmt5 complex plays an important role in the maintenance of unipotency through repressing targets by generation of repressive H2A/H4R3me2s and this complex translocates from the nucleus to cytoplasm in embryonic day (E)11.5 PGCs [28]. Moreover, we found that Prdm1, the gene encoding Blimp1, is not actively expressed in FGSCs. To understand how the unipotency of FGSCs is maintained, we examined the presence of DNA methylation, another epigenetic mark critically involved in gene silencing, across the FGSC genome. To this end, we generated a DNA methylation profile by MethylCap-Seq and compared our data with the data sets of the precursors of FGSCs generated by MeDIP-seq. We observed a remarkable difference in global DNA methylation patterns among ESCs, E11.5 PGCs, and FGSCs; the correlation between FGSCs and E11.5 PGCs is 0.05 and between FGSCs and ESCs is 0.27 (Fig. 3a, b).

Genomic DNA methylation contributes to the identity of FGSCs. c K-means clustering of DNA methylation at promoter regions for ESCs, PGCs, and FGSCs. d Functional enrichment of FGSC-specific methylated regions by GREAT analysis (p < 0.05). e Expression levels of genes methylated at the promoter or gene body only or hypomethylated. f Quantitative RT-PCR analysis of development-related genes in Dnmt1 knockdown FGSCs. Calculation of relative expression levels was based on comparison with the control. Error bars indicate standard deviations of three biological replicates

To understand the significance of DNA methylation in FGSCs, we performed a clustering analysis with the data sets of ESCs, E11.5 PGCs, and FGSCs; the FGSC-specific methylated promoter regions (Fig. 3c, class 1) were used for GREAT analysis [26]. Notably, functional annotation revealed that the FGSC-specific methylated genes are mainly involved in somatic developmental processes (Fig. 3d), including Hox, Fox, and Tbx family transcription factors. Although DNA methylation is viewed as a silencing epigenetic mark, growing evidence has revealed that its effect on transcription depends on the genomic context [29]. We categorized the methylated regions of the FGSC genome into three groups and examined the relationship between DNA methylation and transcription activity. Similar to the previous study [30], we found that the transcription levels of genes with methylated promoters are lower than those of genes in the other two categories (Fig. 3e). Surprisingly, we observed that more than 90 % of genes with a methylated promoter are simultaneously methylated at the gene body (Additional file 4: Table S4), an epigenetic signature associated with active transcription [29]. To further explore the contribution of DNA methylation in FGSCs, we knocked down Dnmt1 and found that several somatic development-related genes with DNA methylation and low occupancy of RNA Pol II at promoter regions were remarkably up-regulated (Fig. 3f). These results suggest that DNA methylation critically contributes to FGSC identity.

Differential DNA methylation is involved in sexual identity maintenance of FGSCs

Germ cells are sexually bipotential in the early embryonic gonad and commit to either male or female development by E13.5 [31]. Although it is generally recognized that sex determination of germ cells is primarily determined by signaling molecules from the soma, increasing evidence has suggested the “sex” of the soma and germ cells must match each other for proper gametogenesis [32]. Nevertheless, it remains unclear how FGSCs intrinsically match the soma to maintain sexual identity; we thus asked whether DNA methylation is involved in this process.

To address this issue, we compared the DNA methylation pattern in FGSCs with that in male germline stem cells (MGSCs) measured by bisulfite sequencing [33] (Fig. 4a). We analyzed the DNA methylation datasets with the method reported previously [34] and found the Pearson correlation to be 0.229 (Fig. 4b), suggesting a low correlation of DNA methylation between male and female GSCs. We particularly compared the methylated promoter regions. Among the methylated promoter regions of 11,936 genes, only 2689 (22.5 %) exhibit a similar DNA methylation level, whereas the majority exhibit a gender-specific methylation pattern (Fig. 4c; Additional file 5: Table S5). GO analysis of these female-specific methylated genes indicated that they are enriched for terms related to male sexual development (Fig. 4d). For example, Bcl2l11, Nr0b1, and Sfrp2 play a critical role in development of male characteristics in mouse [35–37]. Here, we observed that the promoters of these three genes are exclusively methylated in FGSCs and hypomethylated in MGSCs (Additional file 1: Figure S5). Transposable elements constitute 37 % of the mouse genome [38] and we investigated the DNA methylation of six categories of transposable elements that overlap with CpG islands. In contrast to the higher DNA methylation frequency of long interspersed nuclear element (LINE) L1, long terminal repeat ERV1, and intracisternal A-type particles (IAPs) in MGSCs, DNA transposons, SINE B1, and SINE B2 are more frequently methylated in FGSCs (Fig. 4e). We also investigated DNA methylation at imprinted loci. We observed a sex-specific DNA methylation pattern at differentially methylated regions (DMRs) of the imprinted loci examined (Fig. 4f; Additional file 1: Figure S6). Together, these results suggest that DNA methylation is involved in sexual identity maintenance of FGSCs.
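The comparison above rests on two simple calculations: a Pearson correlation between per-window methylation levels in the two cell types, and the fraction of methylated promoters shared between them. The sketch below illustrates both in plain Python. The per-window values are hypothetical; only the gene counts (2689 of 11,936) come from the text, and the code is an illustration rather than the analysis code used in this study.

```python
import math
from typing import Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("inputs must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant vector")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical methylation levels for the same promoter windows in two cell types.
fgsc_windows = [0.10, 0.80, 0.40, 0.90, 0.20]
mgsc_windows = [0.70, 0.30, 0.50, 0.60, 0.10]
r = pearson(fgsc_windows, mgsc_windows)
print(f"Pearson r = {r:.3f}")

# Fraction of methylated promoters with a similar methylation level (counts from the text).
shared_percent = round(100 * 2689 / 11936, 1)
print(f"shared: {shared_percent} %")
```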

Comparison of DNA methylation state in FGSCs and MGSCs. c The number of genes methylated at TSS regions (−2 kb to 500 bp) in FGSCs and MGSCs. d Functional annotation of genes with FGSC-specific (left) or MGSC-specific methylation (right) (p < 0.05). e The DNA methylation frequency at transposable element loci. f DNA methylation status of imprinting genes (H19 and Peg10) in FGSCs and MGSCs. IAP intracisternal A-type particle, LINE long interspersed nuclear element, LTR long terminal repeat, SINE short interspersed nuclear element

Prmt5 is implicated in FGSC biology

As mentioned above, Prmt5 forms a complex with the PGC determinant Blimp1 and is involved in the commitment of germ cell lineage, whereas the Blimp1/Prmt5 complex translocates from the nucleus to cytoplasm after E11.5 [28]. Our RNA-Seq data show Prmt5 is actively expressed in FGSCs. Therefore, it remains intriguing to explore its function in FGSCs. Given the subcellular localization dynamics of Prmt5 during the early germline development, we first examined its localization and found that Prmt5 is primarily localized in the cytoplasm of FGSCs (Fig. 5a). We then performed a Prmt5 knockdown assay (Additional file 1: Figure S7) and examined the biological consequences. We found that the meiosis-related genes (including Figla, Sycp3, and Sycp1) and oogenesis-related genes (including Zp2 and Zp3) are up-regulated upon Prmt5 knockdown (Fig. 5b), suggesting that Prmt5 is involved in maintenance of the undifferentiated status of FGSCs.

Prmt5 is involved in FGSC biology. b Quantitative RT-PCR analysis of meiosis-related genes in Prmt5 knockdown FGSCs. Relative expression levels were normalized to the control. Error bars indicate standard deviations of three biological replicates. c Scatter plot of RNA-Seq reads in control (x-axis) and Prmt5 knockdown (y-axis) cells. Red dots indicate genes that are up-regulated in Prmt5 knockdown FGSCs. d GO analysis of up-regulated genes in Prmt5 knockdown FGSCs (p < 0.05)

To gain a global view of the effect of Prmt5 on gene expression, we performed RNA-Seq analysis using Prmt5 knockdown FGSCs and examined the Prmt5-responsive genes. Compared with the control, 2916 genes were found to be statistically up-regulated upon Prmt5 knockdown (p < 0.05; Fig. 5c; Additional file 6: Table S6). Using DAVID [39], we performed GO analysis and found that, in addition to meiosis-related GO terms, some development-related biological processes (including heart development, embryonic development ending in birth or egg hatching, in utero embryonic development, developmental growth, and respiratory system development) are statistically enriched (p < 0.05) (Fig. 5d). These results suggest Prmt5 is implicated in FGSC identity, possibly through suppressing both terminal differentiation and the somatic program.


Germline stem cells are critical for passing genetic information from generation to generation. In our previous work we generated mouse FGSCs [9] and several studies have reported the generation of FGSCs in other mammalian species [16]. In this study, we found that such a bivalent domain chromatin signature is less prevalent throughout the FGSC genome. Instead, we observed that DNA methylation is actively involved in repression of the somatic program (Fig. 3d). Moreover, we found that DNA methylation of developmental genes is present only in FGSCs and not in MGSCs (Fig. 4d). Similar observations were reported in zebrafish germ cells [42], suggesting that such gender-specific DNA methylation patterns are probably conserved between teleost vertebrates and mammals. Among genes with a methylated promoter, more than 90 % are also methylated at the gene body region and most of these genes are involved in development (Additional file 4: Table S4). Given the lower presence of bivalent genes in FGSCs, we speculate that FGSCs utilize alternative epigenetic mechanisms, such as methylation at both promoters (the repressive transcription mark) and gene bodies (the active transcription mark), to maintain developmental plasticity. In addition to DNA methylation, we found Prmt5 is also involved in such repression (Fig. 5d). These observations suggest that multiple layers of regulation restrict the somatic program in FGSCs.

The relatively simple characteristics of FGSCs (self-renewal and unipotency) make them an ideal model for stem cell biology. Although the pluripotency genes Nanog, Sox2, Esrrb, and Tcl1 [43] are repressed in both FGSCs and MGSCs [33], which suggests that GSCs may maintain unipotency by preventing the expression of the core pluripotency circuit, our recent work demonstrated that FGSCs could be converted into pluripotent stem cells [44]. Here, we found that several mechanisms are shared by both types of stem cells. A previous report demonstrated that Prmt5 associates with Mep50 and methylates cytosolic histone H2A (H2AR3me2s) to inhibit differentiation of ESCs [45]. Similarly, our results indicate that Prmt5 is involved in FGSC biology by repression of meiosis- and oogenesis-related genes (Fig. 5b, d). Although Prmt5 is localized in the cytoplasm of FGSCs, it remains to be elucidated whether it forms a complex with Mep50 to exert the repression activity. In addition to Prmt5, Max was found to repress meiosis-related genes in ESCs and the expression of Stra8 and Sycp3 is significantly up-regulated in Max knockdown ESCs [46]. Consistent with this observation, we found that the expression of some meiosis-related genes (including Stra8, Sycp3, and Figla) is significantly up-regulated when Max is knocked down in FGSCs (Additional file 1: Figure S9). These observations suggest that FGSCs and ESCs share some mechanisms to inhibit differentiation. Given the conserved mechanisms in various types of stem cells, studies on FGSCs, despite their relative simplicity, may provide insights into the mechanisms involved in other types of stem cells with complicated properties.


Our extensive epigenomic profiling analysis revealed that DNA methylation contributes to the unipotency of FGSCs primarily by suppressing somatic programs and is potentially involved in the maintenance of FGSC sexual identity. The genome-wide epigenetic signatures and the transcription regulators identified here provide an invaluable resource for understanding the fundamental features of mouse FGSCs.


Cell culture

FGSCs at passage number 32–35 were isolated as in our previous report [34]. Briefly, we first divided the UCSC known gene promoter regions into 500-bp windows, calculated the methylation level in each window, and then calculated the Pearson correlation coefficient between them. To examine the methylation levels in FGSCs, we used the relative methylation score (rms), calculated with the R package MEDIPS; for the MGSC whole-genome bisulfite sequencing dataset, the absolute methylation signal (ams), calculated with the R package methylKit, was used. To further analyze the difference, we divided the genome into 1-kb tiling windows and defined the highly methylated regions in each cell population. For FGSCs, the MEDIPS package was used and the regions that met the criteria (false discovery rate (FDR) adjusted p value <0.01 and fold change >2) were defined as highly methylated. For MGSCs, the exact binomial test was used to test the significance of the ratio of C/(C + T); regions with a ratio significantly larger than 0.25 (FDR-adjusted p value <0.01) were regarded as methylated and those with a ratio significantly larger than 0.75 (FDR-adjusted p value <0.01) were regarded as highly methylated.
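The MGSC window classification described above can be illustrated with a short Python sketch. It assumes that C and T read counts are pooled per window, uses a one-sided exact binomial test against the 0.25 and 0.75 thresholds, and applies the Benjamini-Hochberg procedure as the FDR adjustment. The pooling, the choice of Benjamini-Hochberg, the function names, and the example counts are assumptions for illustration and may differ from the implementation used in this study.

```python
import math
from typing import List, Tuple


def binom_upper_tail(k: int, n: int, p0: float) -> float:
    """Exact one-sided p value: P(X >= k) for X ~ Binomial(n, p0)."""
    return sum(math.comb(n, i) * p0**i * (1 - p0) ** (n - i) for i in range(k, n + 1))


def bh_adjust(pvalues: List[float]) -> List[float]:
    """Benjamini-Hochberg FDR-adjusted p values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def classify_windows(counts: List[Tuple[int, int]], alpha: float = 0.01) -> List[str]:
    """Label each window from its (C reads, T reads) counts."""
    q_meth = bh_adjust([binom_upper_tail(c, c + t, 0.25) for c, t in counts])
    q_high = bh_adjust([binom_upper_tail(c, c + t, 0.75) for c, t in counts])
    labels = []
    for qm, qh in zip(q_meth, q_high):
        if qh < alpha:
            labels.append("highly methylated")
        elif qm < alpha:
            labels.append("methylated")
        else:
            labels.append("not significant")
    return labels


# Hypothetical (C, T) read counts for three 1-kb windows.
window_counts = [(95, 5), (50, 50), (5, 95)]
labels = classify_windows(window_counts)
print(labels)
```

Testing against both thresholds separately mirrors the two-tier definition in the text: a window can pass the 0.25 threshold without passing the 0.75 threshold.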


