Abstract
Organization of the genome into euchromatin and heterochromatin appears to be evolutionarily conserved and relatively stable during lineage differentiation. In an effort to unravel the basic principle underlying genome folding, here we focus on the genome itself and report a fundamental role for L1 (LINE1 or LINE-1) and B1/Alu retrotransposons, the most abundant subclasses of repetitive sequences, in chromatin compartmentalization. We find that homotypic clustering of L1 and B1/Alu demarcates the genome into grossly exclusive domains, and characterizes and predicts Hi-C compartments. Spatial segregation of L1-rich sequences in the nuclear and nucleolar peripheries and B1/Alu-rich sequences in the nuclear interior is conserved in mouse and human cells and occurs dynamically during the cell cycle. In addition, de novo establishment of L1 and B1 nuclear segregation is coincident with the formation of higher-order chromatin structures during early embryogenesis and appears to be critically regulated by L1 and B1 transcripts. Importantly, depletion of L1 transcripts in embryonic stem cells drastically weakens homotypic repeat contacts and compartmental strength, and disrupts the nuclear segregation of L1- or B1-rich chromosomal sequences at genome-wide and individual sites. Mechanistically, nuclear co-localization and liquid droplet formation of L1 repeat DNA and RNA with heterochromatin protein HP1α suggest a phase-separation mechanism by which L1 promotes heterochromatin compartmentalization. Taken together, we propose a genetically encoded model in which L1 and B1/Alu repeats blueprint chromatin macrostructure. Our model explains the robustness of genome folding into a common conserved core, on which dynamic gene regulation is overlaid across cells.
Similar content being viewed by others
Introduction
The mammalian genomic DNA that is roughly 2 meters long in a cell is folded extensively in order to fit the size of the nucleus with a diameter of ~5–10 μm.1 Microscopic and 3C-based approaches reveal a hierarchical organization of the genome.2,3,4,5 At the megabase scale, chromatin is subdivided into two spatially segregated compartments, arbitrarily labeled as A and B, with distinct transcriptional activity and histone modification as well as other features such as CpG frequency and DNA replication timing.6,7,8,9,10 The euchromatic A compartment adopts a central position, whereas the heterochromatic B compartment moves towards the nuclear periphery and nucleolar regions.11 This nuclear organization appears to be conserved from ciliates to humans and has been maintained in eukaryotes over 500 million years of evolution.12 Within compartments at the kilobase-to-megabase scale, chromatin is organized in topologically associated domains (TADs), which serve as functional platforms for physical interactions between co-regulated genes and regulatory elements.13 At a finer scale, TADs are divided into smaller loop domains, in which distal regulatory elements such as enhancers come into direct contact with their target genes via chromatin loops.14 Intriguingly, most A/B compartments and TADs are relatively stable in different mouse and human cell types (Supplementary information, Text S1), whereas sub-TAD loops and a small fraction of lineage-specific regions with less pronounced compartment associations tend to be more variable for differential gene expression during cell-fate transition.13,15,16,17,18,19
Evidence suggests that compartments and TADs may be formed by distinct mechanisms. TADs are thought to be formed by active extrusion of chromatin loops by the ring-shaped cohesin complex, which co-localizes with the insulator protein CTCF at the boundaries and anchor regions of contact domains and loops.20,21 Depletion of CTCF disrupted TAD boundaries but failed to impact compartmentalization, whereas cohesion loss made TADs disappear but increased compartmentalization, although both eliminated sub-TAD loop contacts.22,23,24,25,26,27,28,29,30 These results indicate that compartmentalization of mammalian chromosomes emerges independently of proper insulation of TADs. A few mechanisms have been proposed for compartmentalization, such as anchoring heterochromatin to the nuclear lamina,31,32,33,34,35 preferential attraction of chromatin harboring similar histone modifications and regulators,4,36,37,38,39 and hypothetical models involving pairing of homologous sequences mediated by active transcription and phase separation of block copolymers.36,40,41,42,43,44 Although lamin-associated domains (LADs) contribute to a basal chromosome architecture, a large body of work has demonstrated a secondary role for lamina scaffolding in compartmental segregation of heterochromatin and euchromatin.31,32,33,34,35,45,46,47,48,49 In vitro assembled nucleosomal arrays harboring histone H3 lysine 9 di- and tri-methylation (H3K9me2 and H3K9me3) marks undergo phase separation with heterochromatin protein 1 (HP1) and associated proteins to form macromolecule-enriched liquid droplets, reminiscent of heterochromatin.38 However, the role of histone modifications in regulating compartmentalization in vivo remains uncertain. Taken SUV39H H3K9 methyltransferases for example, SUV39H double-null cells still exhibit DAPI-dense heterochromatin foci despite the loss of pericentric H3K9me3 marks;39 and double knockout mice of SUV39H survive at birth with abnormalities.48 A phase-separation model of block copolymers with similar activity appears attractive in explaining compartmental formation.12,34,40,41,42,43,44 However, this hypothesis remains inconclusive, owing to a large void of identification and experimental validation of the molecular drivers that underlie compartmental segregation of euchromatin and heterochromatin.
Repetitive elements comprise more than half of human and mouse genomes.50,51 Once regarded as genomic parasites,52 retrotransposons have been recently implicated in playing active roles in re-wiring the genome and gene expression programs in diverse biological processes.53,54,55,56,57,58,59,60,61 Long and short interspersed nuclear elements (LINEs and SINEs, respectively) are the two predominant subfamilies of retrotransposons in most mammals.62 L1 (also named as LINE1 or LINE-1) is the most abundant subclass of all repeats, making up to 19% and 17% (0.9–1.0 million copies) of the genome in mouse and human, respectively.63 B1 in mouse and its closely related, primate-specific Alu elements in human are the most abundant subclass of SINEs, constituting 3%–11% (0.6–1.3 million copies) of mouse and human genomes.64,65 L1 and B1/Alu have distinct nucleotide compositions and sequence lengths. L1 elements are 6–7 kb long and AT-rich, while Alu elements are ~300 bp long and rich in G and C nucleotides.66 Analysis of metaphase chromosome banding showed roughly inverse distributions of L1 and Alu elements in chromosomal regions with distinct biochemical properties.45,67,68 Initial studies suggested that Alu/B1 elements appear to be enriched in gene-rich, euchromatic A compartments, whereas L1 elements tend to be enriched in gene-poor, heterochromatic B compartments that interact with lamina-associated domains.35,45,47,69,70,71 However, evidence to pinpoint a role for L1 and B1/Alu repeats in organizing the genome has to our knowledge not been reported, albeit fragmented information about their localizations in scattered reports (Supplementary information, Text S1). Systematic map** and visualization of L1 and B1/Alu distributions are still lacking.
We have postulated that the primary DNA sequences, particularly abundant repetitive elements embedded in the genome, may instruct genome folding.61 Here, we report that L1 and B1/Alu repeats tend to cluster with sequences from their own repeat subfamily and form grossly exclusive domains in the nuclear space, which efficiently explains and predicts the compartmental organization revealed by Hi-C. The segregated pattern of L1-rich sequences in the nuclear and nucleolar peripheries and B1/Alu-rich sequences in the nuclear interior is highly conserved across a variety of mouse and human cells, and re-occurs during the cell cycle. In addition, de novo establishment of nuclear segregation of L1- and B1-rich compartments is coincident with the formation of higher-order chromatin structures during early embryogenesis, and appears to be critically regulated by L1 and B1 repeat RNA. Importantly, depletion of L1 RNA in mouse embryonic stem cells (mESCs) significantly weakens spatial contacts of homotypic repeat DNA, disrupts the nuclear localization and segregation of L1- or B1-rich chromosomal sequences, and leads to attenuated compartmentalization of the higher-order chromatin structure. Moreover, we show that recombinant HP1α is able to bind RNA and to phase separate in the presence of RNA or DNA in vitro. Genome-wide co-localization of L1 and HP1α renders these repeat DNA and RNA sequences an advantage in promoting HP1α phase separation in heterochromatin contexts. Altogether, our findings suggest a genetically encoded mechanism by which L1 and B1/Alu repeats organize chromatin macrostructure at the compartmental level, providing an important clue to the conservation and robustness of the higher-order chromatin structure across mouse and human.
Results
L1 and B1/Alu distributions correlate with global compartmentalization in mouse and human
We analyzed the genomic positions of the major repeat subfamilies in mouse and observed positive correlations within L1 or SINE B1 subfamilies, but strong inverse correlations between them (Supplementary information, Fig. S1a). This suggests that L1 and B1 elements tend to be positioned away from each other in the genome, while repeats from the same subfamily tend to be clustered. The non-random positioning of repeat sequences in the genome prompted us to examine their relative distributions in high-order chromatin structures. We first analyzed the published Hi-C data from mESCs.72 Dense L1 and B1 repeats appear to be enriched in distinct compartments across the mouse genome, and within a compartment they are evenly distributed without obvious bias towards the boundary (Fig. 1a, b). L2 repeats show weak enrichments in B1-rich compartments, whereas other types of retrotransposons such as ERV1 and ERVK tend to be randomly distributed (Fig. 1a and Supplementary information, Fig. S1b). The compartments marked by B1 repeats show enrichment of active histone marks (H3K4me3, H3K9ac, H3K27ac, H3K36me3), strong binding of RNA polymerase II (Pol II), and high levels of chromatin accessibility and transcription activity. In contrast, the compartments marked by L1 repeats show signatures of heterochromatin, including enrichment of the repressive H3K9me2 and H3K9me3 marks, and strong binding of heterochromatin proteins such as HP1α and the nuclear corepressor KRAB-associated protein-1 (KAP1 or TRIM28) (Fig. 1a and Supplementary information, Fig. S1b).
We then performed a quantitative sequence analysis of annotated A/B compartments in six distinct mouse and human cell types.20,72 All cells exhibit consistently high levels of SINE repeats (including B1, B2, B4 in mouse and Alu in human) in the A compartments and L1 repeats (including truncated or intact, and evolutionary old or young L1s) in the B compartments (Fig. 1b and Supplementary information, Fig. S1c–e). In contrast, L2 and ERV1 repeats fail to show consistent enrichments across mouse and human. In addition, unsupervised clustering revealed that the genomic positions of A/B compartments are highly similar across six cell types, with an average Spearman correlation coefficient > 0.73 within species and > 0.52 between species (Supplementary information, Fig. S1f). Compared to other subclass repeats, L1 and B1/Alu are most strongly related to the high-order chromatin structures, and their distributions appear to be conserved in homologous regions of the mouse and human genomes (Supplementary information, Fig. S1f). For example, a region in mouse chromosome 2 (chr2: 140–170 Mb) and its syntenic region in human chromosome 20 (chr20: 6–50 Mb) show similar patterns of Hi-C contact probabilities, and gene and repeat compositions and distributions in the corresponding A and B compartments along the DNA sequences (Fig. 1c).
We further analyzed the published datasets of higher-order chromatin interactions in 21 primary human tissues and cell lines.17 On the basis of the PC1 values of a principal components analysis on the Hi-C correlation matrix reported by Schmitt et al.,17 we found that A/B-compartmental associations are highly correlated across all 21 examined samples with correlation coefficients ranging 0.47–0.99 and a median value of 0.79 (Supplementary information, Fig. S2a). The degree of compartmental conservation is highly significant (P < 2.2e-16), as ~80% of the genome shows consistent compartmental labeling in at least 16 samples and ~40% is invariant in all 21 samples, in contrast to 7% and 0% to be expected by chance, respectively (Supplementary information, Fig. S2b, c). Most of compartmental switches that account for 20% of the genome occur in one or few (≤ 5) samples with less pronounced compartmental labeling (gray highlighted regions with low absolute values of PC1 in Supplementary information, Fig. S2d; see also Supplementary information, Text S1). Thus, despite some switching events occurring in individual cells, global compartmentalization is rather stable. Consistently, the genomic regions with conserved A or B compartments across samples exhibit significantly higher levels of Alu or L1 repeats, respectively (Supplementary information, Fig. S2e). Altogether, these results indicate that co-segregation of B1/Alu and L1 repeats with the A and B compartments appears to be stable in different cell types in mouse and human.
Homotypic clustering of L1 and B1 repeats characterizes and predicts compartmental organization
To have a close look at repeat distribution and the higher-order chromatin structure, we took mouse chromosome 17 (chr17, 95 Mb in length) as an example to overlay L1 and B1 features on the Hi-C interaction matrix of mESCs. Interestingly, the plaid pattern of enriched and depleted interaction blocks in the Hi-C map is largely correlated with the compositions and distributions of B1 and L1 along the whole chr17 (Fig. 1d). In a 42-Mb region of chr17, four L1-rich compartments (denoted by c, e, g and i) and three B1-rich compartments (denoted by D, F and H) are alternately positioned along the linear DNA sequence (Fig. 1e). Strong interactions were observed within L1-rich compartments (represented by ce, cg, eg, ei and gi, dotted boxes) or B1-rich compartments (represented by DF, DH and FH, dotted boxes), but not between these two compartments (Fig. 1e). The interaction frequencies between D and F (DF) or between c and e (ce) are much stronger than those of D or F with c or e (cD, De, eF), despite the fact that these regions are closer in the linear sequence. Note that L1- or B1-rich segments often span several adjacent TADs (Fig. 1e and Supplementary information, Fig. S2f), consistent with the findings that TADs are smaller, structural units of compartments.73,74 L1 and B1 compositions within a TAD also exhibit strong anti-correlations (Pearson correlation coefficient <−0.75) across 2200 annotated TADs in mESCs (Supplementary information, Fig. S2g). This observation is consistent with repeat analyses at the genome-wide and compartmental levels (Fig. 1a and Supplementary information, Fig. S1a), illustrating a mutually exclusive distribution of L1 or B1-rich sequences along the genome.
Conversion of Hi-C contact frequencies into Pearson correlation coefficients sharpened our view of the long-range chromatin interactions (Fig. 1f). By visual inspection, we found that the plaid pattern of the Hi-C correlation map precisely matches the distribution and interaction status of L1 and B1. L1-rich or B1-rich regions show strong enrichment of contacts with regions containing the same repeat type (red blocks in Fig. 1f). We refer to these as homotypic contacts. Contacts between regions containing the other repeat type (heterotypic interactions) are strongly depleted (blue blocks in Fig. 1f). For example, in one region of chr17 (35 to 95 Mb), L1-rich segments (from e to u) and B1-rich segments (from F to T) exhibit high frequencies of homotypic contacts (Fig. 1f, highlighted by arrows), but strong depletion of heterotypic contacts. Similarly, homotypic contacts between L1-rich regions or B1-rich regions were also observed between chromosomes, as illustrated by chromosomes 17 and 19 (Supplementary information, Fig. S2h). These results indicate that genomic regions containing B1 or L1 repeats tend to interact with genomic regions containing repeat sequences from similar subfamilies, but not from different subfamilies, regardless of linear proximity, which characterizes the organization at intra- and inter-chromosomal levels.
Next, we sought to predict compartmental organization based on repeat distributions. We used the criterion of log2 ratio of B1 to L1 density [log2(B1/L1)] larger or smaller than 0 for B1-rich or L1-rich compartments, respectively. About 540 B1-rich and 648 L1-rich compartments were identified with a median size of 1.2 Mb across the mouse genome (Supplementary information, Table S1). The numbers and sizes of these B1- and L1-rich compartments called de novo are comparable to those of A and B compartments annotated by Hi-C in mESCs (366 and 364, respectively, with a median size of 1.9 Mb). Importantly, 82% of B1-rich compartments and 77% of L1-rich compartments are overlapped with annotated A or B compartments, respectively (Fig. 1g and Supplementary information, Fig. S3). Only 18% to 23% of compartments show inconsistent labeling between our prediction and Hi-C. We then analyzed genomic features in these ‘falsely’ labeled regions. Intriguingly, L1-rich regions that fall into Hi-C-annotated A compartments (designated as ‘L1.A’) still exhibit a high level of heterochromatic H3K9me3 mark and low levels of chromatin accessibility and gene expression, and contain genes enriched in specialized functions such as responses to pheromone and immunoglobulin and synapse (Supplementary information, Fig. S4a–c). Similarly, B1-rich regions that fall into Hi-C-annotated B compartments (designated as ‘B1.B’) exhibit high levels of chromatin accessibility and gene expression but low H3K9me3 binding (Supplementary information, Fig. S4a–c). In addition, L1.A and B1.B regions exhibit significantly less pronounced PC1 values (close to zero) than those consistent regions (B1.A and L1.B) (Supplementary information, Fig. S4a). Thus, a mere usage of B1 to L1 density ratios successfully re-constructs most of A and B compartments annotated by Hi-C, which suggests that the linear genomic DNA repeats contain the macroscopic structural information. Taken together, homotypic clustering of regions rich in B1 or L1 repeats nicely explains and predicts genome organization at the compartmental level.
Nuclear segregation of L1- and B1/Alu-rich compartments is conserved
High-resolution imaging of L1 and B1 distributions in the conventional nucleus remains lacking, despite initial evidence of their differential localization.45,68 To visualize their positioning in the nuclear space, we performed dual-color DNA fluorescence in situ hybridization (FISH) using fluorescence-tagged oligonucleotide probes that specifically target the consensus sequences of B1 and L1 elements (Fig. 2a). Strikingly, L1 and B1 exhibit distinct yet complementary nuclear localizations in mESCs (Fig. 2b and Supplementary information, Fig. S5a). B1 DNA shows punctate signals in the nuclear interior. In contrast, L1 DNA exhibits highly organized and concentrated signals that line the periphery of the nucleus and nucleolus. Weak L1 signals were also detected in a few areas of the nuclear interior subregions where B1 signals were absent. Both B1 and L1 signals are absent from DAPI-dense regions, which likely represent satellite repeat-enriched chromocenters.75
To confirm the L1 localization at the nucleolar periphery in mESCs, we performed DNA immuno-FISH, using an antibody against the nucleolar marker Nucleolin (NCL). Indeed, L1 signals surround and partially overlap with the ring-shaped signals of NCL at the nucleolar periphery (Fig. 2c). The localization of L1 surrounding the nucleus and nucleolus is consistent with sequencing-based analysis of nucleolus- and lamina-associated domains (NADs and LADs), in which L1-rich sequences are sequestered.61,69,76,77 In addition, to further confirm nuclear colocalization of L1-rich sequences in B compartments and B1-rich sequences in A compartments, we performed Oligopaint DNA FISH for five representative loci, each of which ranging from ~100 to 1 Mb was targeted by a set of 500–4500 DNA probes (targeting single-copy sequences) at a density of 200–300 bp per probe. Indeed, three regions (F, H, and R) annotated in the A compartment are colocalized with B1 FISH signals in the nuclear interior (205 out of 217 nuclei), whereas two B compartment-associated regions (g and q) are colocalized with L1 FISH signals in either LAD or NAD (248 out of 251 nuclei) (Fig. 2d and Supplementary information, Fig. S5b, c).
Moreover, to ask whether L1 and B1 localizations might vary with cell type, we analyzed four additional cell lines, including mouse neural stem cells (NSC), fibroblasts (NIH3T3), and myoblasts (C2C12), and human cervical cancer cells (HeLa) (Fig. 2c). Similar to mESCs, all these cells show non-overlap** localizations of B1/Alu in the nuclear interior and L1 at the nuclear and nucleolar peripheries. Thus, consistent with Hi-C results, the segregated staining pattern of B1/Alu and L1 further demonstrates that homotypic clustering of similar repeat sequences in the nuclear space divides the nucleus into distinct territories. This pattern is conserved across different cell types in mouse and human.
Dynamic re-construction of L1 and B1/Alu segregation during the cell cycle
We then asked whether the nuclear segregation of L1- and B1-rich compartments could be re-constructed during mitosis when chromatin structure undergoes dynamic reorganization. DNA FISH analysis of synchronous mESCs showed that L1 and B1 localizations change dramatically at different cell cycle stages (Fig. 3a and Supplementary information, Fig. S6a). S-phase cells show non-overlap** and complementary localizations of L1 and B1 repeats (Figs. 2b and 3a). This is similar to the pattern we observed previously in asynchronous mESCs, more than half of which are in the S phase of the cell cycle (Supplementary information, Fig. S6a). However, L1 and B1 DNA signals are mixed on mitotic chromosomes in metaphase (M phase, including prophase and anaphase), when the nuclear membrane and nucleoli are disassembled. As the cell cycle progresses into the G1 phase, L1 and B1 DNA start to segregate again (Fig. 3a and Supplementary information, Fig. S6b). To quantify the degree of segregation, we defined a FISH-based segregation index as the negative value of Pearson’s correlation coefficient of L1 and B1 DNA signals in the nucleus. The FISH segregation index is lowest in M-phase cells, but increases significantly in the G1 phase and peaks in the S phase (Fig. 3b).
To provide molecular evidence for the segregation of repeats during the cell cycle, we analyzed the published Hi-C data from cell-cycle synchronized mESCs and HeLa cells.78,79 In both cell types, G1-phase cells exhibit a classic plaid pattern of hierarchical interactions, with enriched and depleted interaction blocks outside of the diagonal region of the Hi-C interaction heatmap (Fig. 3c, d and Supplementary information, Fig. S6c, d). In contrast, M-phase cells exhibit stronger signals along the diagonal, which represent the linearly organized, longitudinally compressed array of consecutive chromatin loops. To quantify this difference, we defined a Hi-C-based segregation index by calculating the ratio of homotypic versus heterotypic interaction frequencies between L1 and B1/Alu subfamilies. Indeed, the Hi-C segregation index is significantly higher in G1-phase cells than in M-phase cells (Fig. 3c and Supplementary information, Fig. S6c). These results indicate that segregation of B1/Alu and L1 repeats is dispersed by mitosis, and is re-established when the higher-order chromatin structure forms during each cell cycle in both mouse and human cells. This finding agrees with previous reports that in metaphase, chromosome folding becomes homogeneous and large megabase-scale A and B compartments are lost, whilst in interphase, chromosomes return to a highly compartmentalized state.78,79
Dynamic establishment of L1 and B1 segregation in early embryogenesis
After fertilization, the chromatin undergoes extensive reprogramming from a markedly relaxed state in zygotes to fully organized structures in blastocysts.80,81,82 We performed a time-course DNA FISH analysis of L1 and B1 in early mouse embryos. During embryonic divisions, L1 and B1 signals are largely overlap** in zygotes, and become progressively more segregated in 2-cell, 4-cell, morula and blastocyst embryos (Fig. 3e, f). Consistently, analysis of the published Hi-C data of early embryos81 showed that early 2-cell embryos exhibit prevalent cis-chromosomal contacts along the diagonal of the Hi-C interaction map, whereas the plaid patterns of Hi-C interactions become readily detectable in late 2-cell embryos and are fully established in the inner cell mass (ICM) cells of blastocysts (Fig. 3g and Supplementary information, Fig. S7a). Plotting the FISH and Hi-C segregation indexes showed a gradual increase of L1 and B1 segregation along the course of blastocyst development, which reaches the highest level in blastocysts or in mESCs (Fig. 3f, h). We conclude that, in early embryos, compartmentalization of L1- and B1-rich regions appears to be established in a stepwise manner, coincident with de novo establishment of higher-order chromatin structures. Notably, the greatest change (steepest trend-line) of FISH segregation indexes occurs between the zygote and the late 2-cell stage (Fig. 3f), which implies that the initiation of B1 and L1 compartmentalization may coincide with the zygotic genome activation, during which massive transcription switches on.
It was reported that inhibition of Pol II by α-amanitin caused embryonic arrest at the late 2-cell stage,83 yet the higher-order chromatin structure could still be established.80,81 However, compared to the control groups, we found that α-amanitin-treated embryos exhibited significantly lower L1/B1 segregation indexes and less clear patterns of Hi-C plaids (Fig. 3g, h). At 20 h, α-amanitin-treated embryos showed a low median level of L1/B1 segregation indexes and a Hi-C pattern with extensive diagonal signals that are similar to those of early 2-cell embryos, while the control group had proceeded into the late 2-cell stage (Fig. 3g, h). At 45 h, the control group had proceeded into the 8-cell and morula stages, whereas α-amanitin-treated embryos (45 h) still showed low L1/B1 segregation and a Hi-C plaid pattern similar to that of late 2-cell embryos, despite the segregation index modestly increases compared with that of embryos at 20 h (Fig. 3g, h and Supplementary information, Fig. S7b). These results indicate a delayed and incomplete formation of the higher-order chromatin structure in the absence of zygotic Pol II transcription in mouse.
In accordance with delayed chromatin folding in embryos, treatments of mESCs with the drug 5,6-dichloro-1-β-d-ribofuranosylbenzimidizole (DRB) which inhibits Pol II transcription elongation, led to a partial loss of L1 perinucleolar localization and a gain of mixed nuclear L1 and B1 signals (Fig. 4a, b and Supplementary information, Fig. S7c). Inhibition of both Pol I and II by a high concentration of actinomycin D (ActD) had a more severe effect compared to DRB treatment (Fig. 4a, b). Thus, in both ESCs and early embryos, inhibition of Pol II transcription appears to partially, but not completely, block L1/B1 segregation. These results imply that L1/B1 compartmentalization is likely to be autonomously initiated and subsequently facilitated by transcription.
Repeat transcripts promote L1 and B1 segregation in embryonic cells
In an effort to link repeat function with chromatin structure, we sought to explore the role of repeat RNA that is transcribed from L1 and B1 sequences. Both L1 and B1 repeats are activated and highly expressed in two-cell embryos (Supplementary information, Fig. S7d).71,84,85,86 We have reported previously that depletion of L1 RNA by an antisense morpholino (AMO) in mouse embryos led to arrest at the 2-cell stage; and in mESCs, its depletion led to reduced proliferation and global de-repression of hundreds of L1-associated genes; however, it did not alter the expression of OCT4 and NANOG, two known master regulators of the pluripotency program, nor induced ESC differentiation.61,87 Using the same L1 AMO sequence, we depleted L1 RNA by 17.4% on average shown by RNA FISH (n = 16 embryos; Supplementary information, Fig. S8a). This modest depletion is consistent with the general consensus that AMO acts through steric blockage of its target RNA rather than inducing RNA degradation. In concordance with the previous report by Percharde et al.,84 more than 91.3% of embryos (42 out of 46) were arrested at the 2-cell stage in contrast to only 15.4% of embryos treated with scramble AMO that were arrested (Fig. 4c), indicating effective inhibition of L1 RNA.
We also sought to perturb B1 expression by microinjecting B1 antisense oligonucleotides (ASO) into mouse zygotes. Two B1 ASOs significantly downregulated B1 RNA levels by 36% shown by RNA FISH (n = 17 embryos; Supplementary information, Fig. S8b). Strikingly, embryos depleted of B1 RNA were able to pass the first embryonic division, but failed to divide further and became arrested at the 2-cell stage (n = 79 embryos; Fig. 4c), indicating an essential requirement for B1 RNA in embryonic development. We collected these embryos for DNA FISH analysis when the control group injected with scramble AMO or ASO had grown to the late 2-cell stage. Compared to the control embryos, both L1- and B1-depleted embryos exhibited significantly lower L1/B1 FISH segregation indexes (Fig. 4d and Supplementary information, Fig. S8c, d), indicating delayed segregation of L1 and B1 compartments.
In order to dissect the effects independent of embryonic progression, we then tried to deplete B1 and L1 transcripts in mESCs. B1/Alu repeats have been broadly implicated in diverse processes, including transcription, RNA processing, and nuclear export.88,89,90,91 Treatment of mESCs with B1 ASO led to severe cell death within hours of transfection (data not shown), which precluded direct assessment of B1 RNA in chromatin organization. It has been suggested that nuclear organization is critically dependent on interactions within heterochromatin,34,92 where L1, the most abundant one of all repeat subclasses, is predominantly enriched. In subsequent analysis, we in-depth characterized the effects of depleting L1 RNA on chromatin organization.
Treatments of mESCs with L1 AMO led to a depletion of L1 RNA by 28% (n = 30 cells) (Supplementary information, Fig. S8e, f). At both 12 and 36 h post transfection of L1 AMO, nucleoplasmic signals of L1 DNA were obviously increased and perinucleolar L1 signals became fuzzy or absent (Fig. 4e). B1 signals became more uniformly dispersed in the nucleoplasm, in contrast to punctate staining of B1 in the control mESCs. Emergence of the overlap** L1 and B1 FISH signals is indicative of decreased homotypic clustering and segregation of L1 and B1 DNA. In comparison, treatment with the drug azidothymidine (AZT), which blocks L1 retrotransposition activity,S13a). For comparison, we also generated two synthetic DNA and RNA sequences in 1-kb length, comprising 8 tandem copies of either B1 element or scrambled B1 (designated as 8× B1 or 8× SCR, respectively). L1 mix as well as two synthetic fragments in DNA or RNA efficiently pulled down recombinant HP1α (Supplementary information, Fig. S13b). In addition, L1 RNA–HP1α interactions were robustly detected in highly stringent conditions with up to 1 M of salt and urea (Fig. 7e). These results indicate that HP1α exhibits strong binding activities towards RNA and DNA in vitro.
Recent studies have reported that HP1α forms phase-separated droplets in the presence of DNA or nucleosomes in vitro, and heterochromatin formation may entail a phase-separation mechanism.37,38,100,101,102 Indeed, the L1 DNA mix as well as two synthetic DNA controls (8× B1 and 8× SCR) promote the phase separation of HP1α, which fails to phase separate on its own (Supplementary information, Fig. S13c, d). Consistent with its strong RNA-binding activity, HP1α also phase separates with the L1 RNA mix in a concentration-dependent manner to form spherical droplets with liquid-like properties, such as fusion of droplets and rapid recovery after photo-bleaching (Fig. 7f and Supplementary information, Fig. S13d–f). Intriguingly, L1 RNA fragments from F3 to F6 with low GC contents (< 40%) covering the inter-ORF and central conserved ORF2 sequence (3.2-kb) tend to have higher activities in promoting HP1α droplet formation compared to F1 and F8 fragments with GC contents of 45%–56% (Supplementary information, Fig. S13g, h). In addition, careful examination of HP1α ChIP-seq in mESCs showed that HP1α preferentially binds to the central region of L1 repeat DNA (Supplementary information, Fig. S13i), which hints some degree of specificity in L1–HP1α interplay. However, there was no obvious difference detected between L1 mix and 8× B1 (60% in GC) RNA/DNA in HP1α phase separation (Supplementary information, Fig. S13j). Although we cannot conclude a sequence specificity of HP1α, clearly, HP1α shows strong RNA- and DNA-binding activities and phase-separates in the presence of RNA and DNA in vitro. Given the abundance and co-residence of L1 repeat DNA and RNA with HP1α within the nucleus, it is tempting to speculate that their co-localization may provide a means of specificity for L1 in promoting HP1α phase separation during heterochromatin formation.
Discussion
Although tremendous efforts have been dedicated to studies of structural chromatin proteins and cataloging chromatin maps, the role of DNA sequences in 3D genome organization has been largely ignored. Interestingly, the overall higher-order chromatin structure has been reported to be stable across different cell types and conserved in related species, despite occasional compartment switches in a small portion of the genome in a given cell (Supplementary information, Text S1). This remarkable conservation of chromatin compartments suggests a fundamental principle which all cells stick to while co** with shifting signals in different cell fates. Compared to transcription and epigenetic modifications, the primary DNA sequence has an unparalleled advantage to directly control and govern the stability of 3D genome folding due to its static nature during development. Then, the question comes what DNA sequences serve such a task as the blueprint of 3D genome folding. By employing in silico polymer simulations to interpret microscopy and Hi-C data, Mirny and colleagues have suggested that compartmental segregation may occur through a microphase-separation mechanism of block copolymers.29,34 Together with the Solovei group, they further proposed that interactions between heterochromatic regions, rather than euchromatic contacts and lamina-heterochromatin interactions, are crucial for compartmentalization of the genome in both inverted and conventional nuclei.29,34 However, what was unknown in their model is the molecular determinants, particularly the genetic information of block copolymers that are responsible for chromatin compartmentalization.
In this study, we reveal a remarkable correlation between repeat distribution and compartmental organization of the higher-order chromatin structure. First, using complementary genomics and imaging approaches, we demonstrate that the self-clustering of L1 and B1/Alu repeats forms grossly exclusive nuclear domains that are highly correlated with and predict the known A/B compartments, and that nuclear segregation of L1-rich and B1/Alu-rich sequences is conserved across mouse and human cells, and can be dynamically established during cell division and early embryogenesis. Second, we show that depletion of L1 RNA by AMOs drastically alters repeat segregation and compartmentalization on a global scale and at individual loci by Hi-C, DNA FISH and Oligopaint FISH. Collectively, the overall positive correlation and the essentiality of L1 RNA in compartmental organization suggest a functional role for L1 repeats in driving genome folding. These results disfavor the notion that L1 or B1/Alu repeats are merely markers of large chromosomal segments with a different activity, although we cannot firmly exclude this possibility. Our model is also consistent with the growing evidence showing active roles of retrotransposons in re-wiring the genome and regulatory programs.53,54,55,56,57,58,59,60,61 As often a challenge for genome organization studies, we note that the current support for going beyond correlative evidence to really show a causative role of L1 is still limited.
L1 RNA tends to co-localize with L1 DNA sequences in regions enriched for the binding of HP1α and H3K9me3. Intriguingly, L1 RNAs can also be detected outside the nuclear and nucleolar periphery (Supplementary information, Fig. S12b). L1 RNA has a short half-life of 40 min.61 Torres-Padilla and colleagues reported previously that exogenous L1 RNA fails to rescue the chromatin defects upon abnormal silencing of L1 in mouse zygotes.71 Although these observations disfavor a trans-acting mechanism, it is possible that L1 transcripts might be mobilized to distal L1 DNA sequences. As 20%–40% of L1 repeats are located in annotated euchromatic compartments (Supplementary information, Fig. S1e), these L1 transcripts may be more readily visualized by microscopy. For the majority of L1 enriched in transcriptionally silenced heterochromatic environments, their expression might be temporally regulated in a more transient way, thus creating difficulty for direct visualization and detection. Studies of X chromosome inactivation revealed different roles for silenced and actively transcribed L1s in regulating heterochromatin formation induced by ** by RNA in gene regulatory elements. Science 350, 978-981 (2015)." href="/article/10.1038/s41422-020-00466-6#ref-CR105" id="ref-link-section-d206178417e2522">105), or acting as a scaffold to anchor L1-rich chromosomal segments to the nucleolar and nuclear peripheries. Cases for RNA in organizing subnuclear domains have been reported. For example, ** of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009)." href="/article/10.1038/s41422-020-00466-6#ref-CR6" id="ref-link-section-d206178417e2761">6 and compartmentalization strength23,34,94 were identified as described previously.
Segregation index
The FISH-based segregation index is defined as the negative value of Pearson’s correlation coefficient of L1 and B1 DNA signals in the nucleus. The Hi-C-based segregation index is defined as the ratio of homotypic versus heterotypic interaction frequencies between L1 and B1/Alu subfamilies.
In vitro pull-down and phase separation assays
A series of eight 1-kb fragments, designated as F1 to F8, were produced to cover the full-length 6544-kb L1 sequence by PCR (Supplementary information, Fig. 13a). Two artificial fragments comprising eight tandem copies of either B1 element (8× B1, in 1-kb length) or scrambled B1 sequence (8× SCR, in 1-kb length) were used for comparison. Biotin-labeled RNA was obtained by in vitro transcription for the pull-down experiment. We purified the recombinant human HP1α protein as previously described.38 In phase separation assays, recombinant HP1α with DNA or RNA fragments for L1, 8× B1, or 8× SCR were incubated at 4 °C overnight.
Statistical analysis
Statistical analyses were carried out using Excel or R (version 3.4.3).
Please also see Supplementary information, Data S1 for details.
Data availability
Sequencing data generated in this study have been deposited into the Gene Expression Omnibus database under accession numbers GSE123806 and GSE125766.
References
Annunziato, A. T. DNA packaging: nucleosomes and chromatin. Nature Educ. 1, 26 (2008).
Mirny, L. A. The fractal globule as a model of chromatin architecture in the cell. Chromosome Res. 19, 37–51 (2011).
Boettiger, A. N. et al. Super-resolution imaging reveals distinct chromatin folding for different epigenetic states. Nature 529, 418–422 (2016).
Wang, S. et al. Spatial organization of chromatin domains and compartments in single chromosomes. Science 353, 598–602 (2016).
Dekker, J., Rippe, K., Dekker, M. & Kleckner, N. Capturing chromosome conformation. Science 295, 1306–1311 (2002).
Lieberman-Aiden, E. et al. Comprehensive map** of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
Dekker, J., Marti-Renom, M. A. & Mirny, L. A. Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data. Nat. Rev. Genet. 14, 390–403 (2013).
Rivera-Mulia, J. C. & Gilbert, D. M. Replication timing and transcriptional control: beyond cause and effect-part III. Curr. Opin. Cell Biol. 40, 168–178 (2016).
Gibcus, J. H. & Dekker, J. The hierarchy of the 3D genome. Mol. Cell 49, 773–782 (2013).
Bonev, B. & Cavalli, G. Organization and function of the 3D genome. Nat. Rev. Genet. 17, 661–678 (2016).
Buchwalter, A., Kaneshiro, J. M. & Hetzer, M. W. Coaching from the sidelines: the nuclear periphery in genome regulation. Nat. Rev. Genet. 20, 39–50 (2019).
Solovei, I., Thanisch, K. & Feodorova, Y. How to rule the nucleus: divide et impera. Curr. Opin. Cell Biol. 40, 47–59 (2016).
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Kadauke, S. & Blobel, G. A. Chromatin loops in gene regulation. Biochim. Biophys. Acta 1789, 17–25 (2009).
Dixon, J. R. et al. Chromatin architecture reorganization during stem cell differentiation. Nature 518, 331–336 (2015).
Stadhouders, R. et al. Transcription factors orchestrate dynamic interplay between genome topology and gene regulation during cell reprogramming. Nat. Genet. 50, 238–249 (2018).
Schmitt, A. D. et al. A compendium of chromatin contact maps reveals spatially active regions in the human genome. Cell Rep. 17, 2042–2059 (2016).
Rennie, S., Dalby, M., van Duin, L. & Andersson, R. Transcriptional decomposition reveals active chromatin architectures and cell specific regulatory interactions. Nat. Commun. 9, 487 (2018).
Stevens, T. J. et al. 3D structures of individual mammalian genomes studied by single-cell Hi-C. Nature 544, 59–64 (2017).
Rao, S. S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin loo**. Cell 159, 1665–1680 (2014).
Splinter, E. et al. CTCF mediates long-range chromatin loo** and local histone modification in the beta-globin locus. Genes Dev. 20, 2349–2354 (2006).
Rao, S. S. P. et al. Cohesin loss eliminates all loop domains. Cell 171, 305–320 (2017).
Nora, E. P. et al. Targeted degradation of CTCF decouples local insulation of chromosome domains from genomic compartmentalization. Cell 169, 930–944 (2017).
Wutz, G. et al. Topologically associating domains and chromatin loops depend on cohesin and are regulated by CTCF, WAPL, and PDS5 proteins. EMBO J. 36, 3573–3599 (2017).
Schwarzer, W. et al. Two independent modes of chromatin organization revealed by cohesin removal. Nature 551, 51–56 (2017).
Bintu, B. et al. Super-resolution chromatin tracing reveals domains and cooperative interactions in single cells. Science 362, eaau1783 (2018).
Gassler, J. et al. A mechanism of cohesin-dependent loop extrusion organizes zygotic genome architecture. Embo J. 36, 3600–3618 (2017).
Haarhuis, J. H. I. et al. The cohesin release factor WAPL restricts chromatin loop extension. Cell 169, 693–707 e614 (2017).
Nuebler, J., Fudenberg, G., Imakaev, M., Abdennur, N. & Mirny, L. A. Chromatin organization by an interplay of loop extrusion and compartmental segregation. Proc. Natl. Acad. Sci. USA 115, E6697–E6706 (2018).
Kubo, N. et al. Preservation of chromatin organization after acute loss of CTCF in mouse embryonic stem cells. bioRxiv https://doi.org/10.1101/118737 (2017).
van Steensel, B. & Belmont, A. S. Lamina-associated domains: links with chromosome architecture, heterochromatin, and gene repression. Cell 169, 780–791 (2017).
Guelen, L. et al. Domain organization of human chromosomes revealed by map** of nuclear lamina interactions. Nature 453, 948–951 (2008).
Zullo, J. M. et al. DNA sequence-dependent compartmentalization and silencing of chromatin at the nuclear lamina. Cell 149, 1474–1487 (2012).
Falk, M. et al. Heterochromatin drives compartmentalization of inverted and conventional nuclei. Nature 570, 395–399 (2019).
Wijchers, P. J. et al. Characterization and dynamics of pericentromere-associated domains in mice. Genome Res. 25, 958–969 (2015).
Jost, D., Carrivain, P., Cavalli, G. & Vaillant, C. Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucleic Acids Res. 42, 9553–9561 (2014).
Strom, A. R. et al. Phase separation drives heterochromatin domain formation. Nature 547, 241–245 (2017).
Wang, L. et al. Histone modifications regulate chromatin compartmentalization by contributing to a phase separation mechanism. Mol. Cell 76, 646–659 (2019).
Maison, C. et al. Higher-order structure in pericentric heterochromatin involves a distinct pattern of histone modification and an RNA component. Nat. Genet. 30, 329–334 (2002).
Cook, P. R. & Marenduzzo, D. Transcription-driven genome organization: a model for chromosome structure and the regulation of gene expression tested through simulations. Nucleic Acids Res. 46, 9895–9906 (2018).
Rowley, M. J. et al. Evolutionarily conserved principles predict 3D chromatin organization. Mol. Cell 67, 837–852 (2017).
Cook, P. R. Predicting three-dimensional genome structure from transcriptional activity. Nat. Genet. 32, 347–352 (2002).
Tang, S. J. Potential role of phase separation of repetitive DNA in chromosomal organization. Genes (Basel) 8, 279 (2017).
van de Werken, H. J. G. et al. Small chromosomal regions position themselves autonomously according to their chromatin class. Genome Res. 27, 922–933 (2017).
Solovei, I. et al. Nuclear architecture of rod photoreceptor cells adapts to vision in mammalian evolution. Cell 137, 356–368 (2009).
Amendola, M. & van Steensel, B. Nuclear lamins are not required for lamina-associated domain organization in mouse embryonic stem cells. EMBO Rep. 16, 610–617 (2015).
Lenain, C. et al. Massive resha** of genome-nuclear lamina interactions during oncogene-induced senescence. Genome Res. 27, 1634–1644 (2017).
Zheng, X. et al. Lamins organize the global three-dimensional genome from the nuclear periphery. Mol. Cell 71, 802–815 (2018).
Chang, L. et al. Nuclear peripheral chromatin-lamin B1 interaction is required for global integrity of chromatin architecture and dynamics in human cells. Protein Cell https://doi.org/10.1007/s13238-020-00794-8 (2020).
Biemont, C. A brief history of the status of transposable elements: from junk DNA to major players in evolution. Genetics 186, 1085–1093 (2010).
de Koning, A. P., Gu, W., Castoe, T. A., Batzer, M. A. & Pollock, D. D. Repetitive elements may comprise over two-thirds of the human genome. PLoS Genet. 7, e1002384 (2011).
Orgel, L. E. & Crick, F. H. Selfish DNA: the ultimate parasite. Nature 284, 604–607 (1980).
Rebollo, R., Romanish, M. T. & Mager, D. L. Transposable elements: an abundant and natural source of regulatory sequences for host genes. Annu. Rev. Genet. 46, 21–42 (2012).
Bourque, G. et al. Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res. 18, 1752–1762 (2008).
Lynch, V. J., Leclerc, R. D., May, G. & Wagner, G. P. Transposon-mediated rewiring of gene regulatory networks contributed to the evolution of pregnancy in mammals. Nat. Genet. 43, 1154–1159 (2011).
Grow, E. J. et al. Intrinsic retroviral reactivation in human preimplantation embryos and pluripotent cells. Nature 522, 221–225 (2015).
Chuong, E. B., Elde, N. C. & Feschotte, C. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science 351, 1083–1087 (2016).
Durruthy-Durruthy, J. et al. The primate-specific noncoding RNA HPAT5 regulates pluripotency during human preimplantation development and nuclear reprogramming. Nat. Genet. 48, 44–52 (2016).
Wang, J. et al. Primate-specific endogenous retrovirus-driven transcription defines naive-like stem cells. Nature 516, 405–409 (2014).
Liu, N. et al. Selective silencing of euchromatic L1s revealed by genome-wide screens for L1 regulators. Nature 553, 228–232 (2018).
Lu, J. Y. et al. Genomic repeats categorize genes with distinct functions for orchestrated regulation. Cell Rep. 30, 3296–3311 (2020).
Mandal, P. K. & Kazazian, H. H. Jr. SnapShot: vertebrate transposons. Cell 135, 192–192.e1 (2008).
Taylor, M. S. et al. Affinity proteomics reveals human host factors implicated in discrete stages of LINE-1 retrotransposition. Cell 155, 1034–1048 (2013).
Mouse Genome Sequencing Consortium et al. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002).
Lander, E. S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
Jurka, J., Kohany, O., Pavlicek, A., Kapitonov, V. V. & Jurka, M. V. Duplication, coclustering, and selection of human Alu retrotransposons. Proc. Natl. Acad. Sci. USA 101, 1268–1272 (2004).
Korenberg, J. R. & Rykowski, M. C. Human genome organization: Alu, lines, and the molecular structure of metaphase chromosome bands. Cell 53, 391–400 (1988).
Bolzer, A. et al. Three-dimensional maps of all chromosomes in human male fibroblast nuclei and prometaphase rosettes. PLoS Biol. 3, e157 (2005).
Meuleman, W. et al. Constitutive nuclear lamina-genome interactions are highly conserved and associated with A/T-rich sequence. Genome Res. 23, 270–280 (2013).
Deininger, P. Alu elements: know the SINEs. Genome Biol. 12, 236 (2011).
Jachowicz, J. W. et al. LINE-1 activation after fertilization regulates global chromatin accessibility in the early mouse embryo. Nat. Genet. 49, 1502–1510 (2017).
Fraser, J. et al. Hierarchical folding and reorganization of chromosomes are linked to transcriptional changes in cellular differentiation. Mol. Syst. Biol. 11, 852 (2015).
Sexton, T. & Cavalli, G. The role of chromosome domains in sha** the functional genome. Cell 160, 1049–1059 (2015).
Dixon, J. R., Gorkin, D. U. & Ren, B. Chromatin domains: the unit of chromosome organization. Mol. Cell 62, 668–680 (2016).
Mateos-Langerak, J. et al. Pericentromeric heterochromatin domains are maintained without accumulation of HP1. Mol. Biol. Cell 18, 1464–1471 (2007).
Peric-Hupkes, D. et al. Molecular maps of the reorganization of genome-nuclear lamina interactions during differentiation. Mol. Cell 38, 603–613 (2010).
Nemeth, A. et al. Initial genomics of the human nucleolus. PLoS Genet. 6, e1000889 (2010).
Nagano, T. et al. Cell-cycle dynamics of chromosomal organization at single-cell resolution. Nature 547, 61–67 (2017).
Naumova, N. et al. Organization of the mitotic chromosome. Science 342, 948–953 (2013).
Ke, Y. et al. 3D chromatin structures of mature gametes and structural reprogramming during mammalian embryogenesis. Cell 170, 367–381 e320 (2017).
Du, Z. et al. Allelic reprogramming of 3D chromatin architecture during early mammalian development. Nature 547, 232–235 (2017).
Flyamer, I. M. et al. Single-nucleus Hi-C reveals unique chromatin reorganization at oocyte-to-zygote transition. Nature 544, 110–114 (2017).
Qiu, J. J. et al. Delay of ZGA initiation occurred in 2-cell blocked mouse embryos. Cell Res. 13, 179–185 (2003).
Fadloun, A. et al. Chromatin signatures and retrotransposon profiling in mouse embryos reveal regulation of LINE-1 by RNA. Nat. Struct. Mol. Biol. 20, 332–338 (2013).
Rothstein, J. L. et al. Gene-expression during preimplantation mouse development. Gene Dev. 6, 1190–1201 (1992).
Abe, K. et al. The first murine zygotic transcription is promiscuous and uncoupled from splicing and 3’ processing. EMBO J. 34, 1523–1537 (2015).
Percharde, M. et al. A LINE1-nucleolin partnership regulates early development and ESC Identity. Cell 174, 391–405 (2018).
Chen, C., Ara, T. & Gautheret, D. Using Alu elements as polyadenylation sites: a case of retroposon exaptation. Mol. Biol. Evol. 26, 327–334 (2009).
Dominissini, D., Moshitch-Moshkovitz, S., Amariglio, N. & Rechavi, G. Adenosine-to-inosine RNA editing meets cancer. Carcinogenesis 32, 1569–1577 (2011).
Lubelsky, Y. & Ulitsky, I. Sequences enriched in Alu repeats drive nuclear localization of long RNAs in human cells. Nature 555, 107–111 (2018).
Polak, P. & Domany, E. Alu elements contain many binding sites for transcription factors and may play a role in regulation of developmental processes. BMC Genom. 7, 133 (2006).
Houda Belaghzal, T. B. et al. Compartment-dependent chromatin interaction dynamics revealed by liquid chromatin Hi-C. bioRxiv https://doi.org/10.1101/704957 (2019).
**e, Y., Rosser, J. M., Thompson, T. L., Boeke, J. D. & An, W. Characterization of L1 retrotransposition with high-throughput dual-luciferase assays. Nucleic Acids Res. 39, e16 (2010).
Abramo, K. et al. A chromosome folding intermediate at the condensin-to-cohesin transition during telophase. Nat. Cell Biol. 21, 1393–1402 (2019).
Rhodes, J. D. P. et al. Cohesin disrupts polycomb-dependent chromosome interactions in embryonic stem cells. Cell Rep. 30, 820–835 (2020).
Keller, C. et al. HP1(Swi6) mediates the recognition and destruction of heterochromatic RNA transcripts. Mol. Cell 47, 215–227 (2012).
Stunnenberg, R. et al. H3K9 methylation extends across natural boundaries of heterochromatin in the absence of an HP1 protein. EMBO J. 34, 2789–2803 (2015).
Muchardt, C. et al. Coordinated methyl and RNA binding is required for heterochromatin localization of mammalian HP1 alpha. EMBO Rep. 3, 975–981 (2002).
Maison, C. et al. SUMOylation promotes de novo targeting of HP1alpha to pericentric heterochromatin. Nat. Genet. 43, 220–227 (2011).
Larson, A. G. et al. Liquid droplet formation by HP1alpha suggests a role for phase separation in heterochromatin. Nature 547, 236–240 (2017).
Zwicker, D., Decker, M., Jaensch, S., Hyman, A. A. & Julicher, F. Centrosomes are autocatalytic droplets of pericentriolar material organized by centrioles. Proc. Natl. Acad. Sci. USA 111, E2636–E2645 (2014).
Li, P. et al. Phase transitions in the assembly of multivalent signalling proteins. Nature 483, 336–340 (2012).
Chow, J. C. et al. LINE-1 activity in facultative heterochromatin formation during X chromosome inactivation. Cell 141, 956–969 (2010).
Marenduzzo, D., Finan, K. & Cook, P.R. The depletion attraction: an underappreciated force driving cellular organization. J. Cell Biol. 175, 681–686 (2006).
Sigova, A. A. et al. Transcription factor trap** by RNA in gene regulatory elements. Science 350, 978-981 (2015).
McHugh, C. A. et al. The **st lncRNA interacts directly with SHARP to silence transcription through HDAC3. Nature 521, 232–236 (2015).
Probst, A. V. et al. A strand-specific burst in transcription of pericentric satellites is required for chromocenter formation and early mouse development. Dev. Cell 19, 625–638 (2010).
Velazquez Camacho, O. et al. Major satellite repeat RNA stabilize heterochromatin retention of Suv39h enzymes by RNA-nucleosome association and RNA: DNA hybrid formation. Elife 6, e25293 (2017).
Quinodoz, S. A. et al. Higher-order inter-chromosomal hubs shape 3D genome organization in the nucleus. Cell 174, 744–757 (2018).
Chong, S. et al. Imaging dynamic and selective low-complexity domain interactions that control gene transcription. Science 361. eaar2555 (2018).
Falahati, H., Pelham-Webb, B., Blythe, S. & Wieschaus, E. Nucleation by rRNA dictates the precision of nucleolus assembly. Curr. Biol. 26, 277–285 (2016).
Tjong, H. et al. Population-based 3D genome structure analysis reveals driving forces in spatial genome organization. Proc. Natl. Acad. Sci. USA 113, E1663–E1672 (2016).
Tsirigos, A. & Rigoutsos, I. Alu and b1 repeats have been selectively retained in the upstream and intronic regions of genes of specific functional classes. PLoS Comput. Biol. 5, e1000610 (2009).
Buckley, R. M., Kortschak, R. D., Raison, J. M. & Adelson, D. L. Similar evolutionary trajectories for retrotransposon accumulation in mammals. Genome Biol. Evol. 9, 2336–2353 (2017).
Sultana, T. et al. The landscape of L1 retrotransposons in the human genome is shaped by pre-insertion sequence biases and post-insertion selection. Mol. Cell 74, 555–570 e557 (2019).
Flasch, D. A. et al. Genome-wide de novo L1 retrotransposition connects endonuclease activity with replication. Cell 177, 837–851 (2019).
Pavlicek, A. et al. Similar integration but different stability of Alus and LINEs in the human genome. Gene 276, 39–45 (2001).
Graham, T. & Boissinot, S. The genomic distribution of L1 elements: the role of insertion bias and natural selection. J. Biomed. Biotechnol. 2006, 75327 (2006).
Kruse, K. et al. Transposable elements drive reorganisation of 3D chromatin during early embryogenesis. bioRxiv https://doi.org/10.1101/523712 (2019).
Zhang, Y. et al. Transcriptionally active HERV-H retrotransposons demarcate topologically associating domains in human pluripotent stem cells. Nat. Genet. 51, 1380–1388 (2019).
Anfinsen, C. B. The formation and stabilization of protein structure. Biochem. J. 128, 737–749 (1972).
Anfinsen, C. B., Redfield, R. R., Choate, W. L., Page, J. & Carroll, W. R. Studies on the gross structure, cross-linkages, and terminal sequences in ribonuclease. J. Biol. Chem. 207, 201–210 (1954).
Tempel, S. Using and understanding RepeatMasker. Methods Mol. Biol. 859, 29–51 (2012).
Shen, X. et al. Jumonji modulates polycomb activity and self-renewal versus differentiation of stem cells. Cell 139, 1303–1314 (2009).
Becker, J. S. et al. Genomic and proteomic resolution of heterochromatin and its restriction of alternate fate genes. Mol. Cell 68, 1023–1037 (2017).
Bhatt, D. M. et al. Transcript dynamics of proinflammatory genes revealed by sequence analysis of subcellular RNA fractions. Cell 150, 279–290 (2012).
Tsanov, N. et al. smiFISH and FISH-quant–a flexible single RNA detection approach with super-resolution capability. Nucleic Acids Res. 44, e165 (2016).
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
Acknowledgements
We thank the Shen Laboratory members for insightful discussions, and we thank Benoit G. Bruneau and Robert J. Klose for providing CTCF-AID and RAD21-AID mESCs. This work was supported in part by the National Basic Research Program of China (2018YFA0107604, 2017YFA0504204 to X.S.); the National Natural Science Foundation of China (31925015, 31630095 to X.S.; 21825401 to Y.S.); the National Key R&D Program of China (2017YFA0505300 to Y.S.); Bei**g Advanced Innovation Center for Structural Biology (to X.S). J.Y.L. is a Shuimu Tsinghua Scholar.
Author information
Authors and Affiliations
Contributions
X.S. and J.Y.L. conceived of the study. X.S. and Y.S. supervised the study. X.S., Y.S., J.Y.L., and L.C. designed the experiments. J.Y.L. performed bioinformatics analysis. L.C. conducted all imaging experiments with help from Y.Y., G.Z., T.W., X.H., and J.Z. T.L. performed phase separation and RNA pull-down experiments. K.Z., Q.P., and P.Y. prepared the Hi-C library. T.W., W.L., and P.W. did embryo injections. P.L. and L.W. provide HP1α proteins. W.X., Y.T., J.N., H.Z., X.B., W.S., Z.W., R.M., B.Z., M.R.-S., M.P., Z.C., Y. Hong, and Y. Hou contributed technical assistance/suggestions. X.S., J.Y.L., and L.C. wrote the manuscript with input from all authors. T.W., Y.Y., G.Z., X.H., K.Z. and Y.T. contributed equally to this work.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lu, J.Y., Chang, L., Li, T. et al. Homotypic clustering of L1 and B1/Alu repeats compartmentalizes the 3D genome. Cell Res 31, 613–630 (2021). https://doi.org/10.1038/s41422-020-00466-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41422-020-00466-6
- Springer Nature Singapore Pte Ltd.
This article is cited by
-
Single-cell 3D genome structure reveals distinct human pluripotent states
Genome Biology (2024)
-
Emergence of replication timing during early mammalian development
Nature (2024)
-
Map** crossover events of mouse meiotic recombination by restriction fragment ligation-based Refresh-seq
Cell Discovery (2024)
-
Genome folding principles uncovered in condensin-depleted mitotic chromosomes
Nature Genetics (2024)
-
Regulation and function of transposable elements in cancer genomes
Cellular and Molecular Life Sciences (2024)