Transposable element-derived sequences in vertebrate development

Etchegaray, Ema; Naville, Magali; Volff, Jean-Nicolas; Haftek-Terreau, Zofia

doi:10.1186/s13100-020-00229-5

Transposable element-derived sequences in vertebrate development

Review
Open access
Published: 06 January 2021

Volume 12, article number 1, (2021)
Cite this article

Download PDF

You have full access to this open access article

Mobile DNA Aims and scope Submit manuscript

Transposable element-derived sequences in vertebrate development

Download PDF

Ema Etchegaray ORCID: orcid.org/0000-0001-9596-6040¹,
Magali Naville¹,
Jean-Nicolas Volff¹ &
…
Zofia Haftek-Terreau¹

10k Accesses
39 Citations
9 Altmetric
Explore all metrics

Abstract

Transposable elements (TEs) are major components of all vertebrate genomes that can cause deleterious insertions and genomic instability. However, depending on the specific genomic context of their insertion site, TE sequences can sometimes get positively selected, leading to what are called “exaptation” events. TE sequence exaptation constitutes an important source of novelties for gene, genome and organism evolution, giving rise to new regulatory sequences, protein-coding exons/genes and non-coding RNAs, which can play various roles beneficial to the host. In this review, we focus on the development of vertebrates, which present many derived traits such as bones, adaptive immunity and a complex brain. We illustrate how TE-derived sequences have given rise to developmental innovations in vertebrates and how they thereby contributed to the evolutionary success of this lineage.

Evolutionary impact of transposable elements on genomic diversity and lineage-specific innovation in vertebrates

Article 22 September 2015

Mammalian Genome Plasticity: Expression Analysis of Transposable Elements

The Relationship between Transposons and Transcription Factors in the Evolution of Eukaryotes

Article 01 January 2019

Background

Transposable elements (TEs) were discovered by Barbara McClintock in the 1940s and described as moving DNA sequences that can cause genomic instability [1]. As she was able to link TE activity with variations in maize kernel colors, she coined them “controlling elements”, underlying their apparent involvement in gene regulation. TEs are nowadays known to be major components of genomes and have been found in every species that has been looked at, including prokaryotes, protists, fungi, plants and animals [2,3,4].

TEs are classified into two main classes according to their transposition mechanism [5, 6]. The transposition of retrotransposons (class I TEs) occurs through the reverse transcription of an RNA intermediate into a cDNA molecule that is subsequently inserted into a new locus [7, 8]. This replicative transposition process, a “copy-and-paste” mechanism called retrotransposition, leads to the expansion of the retroelement family in the host genome. Retrotransposons gather both Long Terminal Repeat retrotransposons (LTRs), with flanking repeated sequences in direct orientation necessary for the expression and integration of the element, and non-LTR retrotransposons, also called Long Interspersed Nuclear Elements (LINEs). Autonomous retrotransposons encode a reverse transcriptase (RT) and other proteins necessary for integration (an integrase for LTRs and an endonuclease for LINEs) and other aspects of transposition [7,8,9]. In contrast, non-autonomous retrotransposons, including Short Interspersed Nuclear Elements (SINEs) that are mobilized by autonomous non-LTR retrotransposons, do not encode any proteins and rely on those produced in trans by autonomous elements to transpose [10, 11]. DNA transposons (class II TEs) do not require the reverse transcription of an RNA intermediate for their transposition [12]. They mostly use a “cut-and-paste” mechanism, the TE copy being excised from its original locus and integrated elsewhere into the genome. Many DNA transposons, including the widespread DDE transposon family, classically encode a transposase (with the DDE motif forming its active site in DDE transposons) and are flanked by Terminal Inverted Repeat (TIR) sequences that are bound by the transposase for excision and integration [9, 12]. Other types of DNA transposons include Helitrons [13, 14], which are rolling-circle DNA transposons with no TIRs encoding a helicase, and Polintons/Mavericks [15, 16], which are self-synthesizing DNA transposons with long TIRs encoding a DNA polymerase. Non-autonomous elements called Miniature Inverted Repeat Transposable Elements (MITEs) are mobilized in trans by related autonomous DNA transposons [12].

Each species genome is characterized by a specific composition in TEs, both quantitatively and qualitatively. For instance, the genome of the maize Zea mays is composed of nearly 85% of transposable elements [17], whereas the genome of the yeast Saccharomyces cerevisiae contains less than 4% of TEs [18]. In unicellular organisms, the genome of Trichomonas vaginalis contains almost exclusively DNA transposons, while almost only retrotransposons are found in Entamoeba histolytica [19, 20]. A marked variability in TE content and diversity has been also observed among vertebrates [21]. Indeed, the genomic amount of TEs ranges from 6% in the pufferfish Tetraodon nigroviridis up to 55% in the zebrafish Danio rerio. Some groups of TEs are found in most vertebrate species (LINE retrotransposons or Tc-Mariner DNA transposons for instance), whereas others are restricted to certain vertebrate sublineages and absent from others, such as the DIRS and Copia retrotransposons that are present in fish and amphibians but absent from mammals and birds [21].

Most TE insertions are thought to be either neutral or deleterious, depending on the context of the genomic region where they are inserted. TE insertions can be deleterious for instance by disrupting open reading frames (ORFs) or by altering gene transcriptional regulations. However, and despite their “selfish” characteristics, TEs are subject to the drift-selection balance and can be positively selected if they are beneficial to the host [12]. Indeed, some insertions have been shown to play a positive role in species evolution by contributing to new regulatory and coding sequences (Fig. 1) [22,23,24,25,26,27,28]. Such a recruitment by the host to fulfil useful functions is called exaptation or molecular domestication. The ability of TE sequences to give rise to evolutionary innovations has been more and more documented in the past years and becomes of growing interest, helped by the recent technological developments in genome sequencing and gene expression profile analysis. The structural and functional characteristics of different TE families might confer them with different potential to be exapted. TEs can contain different functional ORFs encoding proteins with various properties such as endonucleases, integrases, transposases, reverse transcriptases and other proteins with DNA/RNA/protein-binding domains, and diverse transcriptional regulatory sequences such as promoters or enhancers. For example, LINE L1 elements contain an internal RNA polymerase II promotor and encode beside an RT an RNA-binding protein and an endonuclease; SINEs in contrast do not carry any ORF and have an RNA polymerase III promoter; LTR retrotransposons present transcriptional regulatory sequences in their long terminal repeats and generally encode an integrase, a protease, a RNase H and a structural protein called GAG in addition to their RT, with an additional Envelope gene that Endogenous Retroviruses (ERVs) have occasionally kept from their infectious ancestors; DNA transposons can among others code for transposases, helicases and DNA polymerases. These functional ORFs and regulatory sequences can be reused to the host benefits. The mobilome can thus be regarded as an evolutionary toolbox, as TEs bring with them in host genomes sequences encoding proteins able to bind, replicate, cut, rearrange or degrade nucleic acids, and to associate with and modify other proteins, among other biologically relevant properties.

Vertebrates constitute a geographically widely expanded taxonomic group that appeared more than 500 million years ago and has colonized almost all ecological environments [29]. The emergence of vertebrates represents a major evolutionary transition. This group has acquired many derived traits, namely: a unique nervous system composed of a complex brain with forebrain, midbrain and hindbrain specialized regions, and cranial nerves, spinal cord and ganglia; the sensory placodes and the sensory organs they give rise to (olfactory bulbs, vestibular apparatus and otic placode for example); the neural crest, which develops into cranium, branchial skeleton and sensory ganglia; a complex endocrine system allowing the apparition of new hormones and new organs such as the placenta; bones and cartilages contributing to the skull, jaws and vertebrae; paired appendages; adaptive immunity [30,31,32]. These novelties, which subsequently diversified in different sublineages, have contributed to the evolutionary success of vertebrates, allowing them to improve the sense of and the move in their environment, to develop new organs and complexify them, and to turn to extensive predation.

At the origin of vertebrates, two events of whole genome duplications allowed a massive expansion of the gene repertoire [33]. However, the sole emergence of paralogous genes may not explain all the innovations that appeared, and it has been also proposed that regulatory divergence might account for major organismal diversification [34, 35]. Accordingly, the analysis of the genome of the cephalochordate amphioxus, a sister outgroup species of vertebrates, has underlined the specialization of gene expression and the complexification of gene regulation during invertebrate to vertebrate transition, mainly due to the recruitment of new regulatory networks [36]. The precise understanding of the genetic and evolutionary mechanisms underlying this transition is of particular interest, and we propose to explore the role of TEs in this context. Several examples of TE recruitment events crucial for vertebrate development have been documented in the last years. In this review, we discuss the different mechanisms through which TE-derived sequences have played a role in vertebrate genome evolution. We focus on selected examples illustrating the innovative potential of transposable elements as a source of new protein-coding sequences, new small and long non-coding RNA genes and new regulatory elements having driven the evolution of vertebrate development.

TE-derived sequences as new protein-coding sequences

TE exonization

Inserted TE sequences can occasionally be recruited as new exons of pre-existing genes, a process called TE exonization (Fig. 1a). Exonization is defined as the formation of a novel exon from an intronic or intergenic sequence carrying splicing sites. Such new exons can be protein-coding but might also constitute new 5′ or 3′ untranslated regions with possible regulatory functions.

TE exonization is not an anecdotal process and has been largely documented in mammals and other vertebrates, where it occurs more frequently than in non-vertebrate species [37,38,39]. In the human genome, among 233,785 exons, more than 3000 (~ 1%) are derived from TEs [37, 40]. Among them, about 1640 correspond to Alu SINE elements, 640 to LINEs, 310 to MIRs (Mammalian-wide Interspersed Repeats, SINE elements), 300 to LTRs and 230 to DNA transposons [37]. Human exonized TEs are generally alternatively spliced, allowing protein variability [41,42,43]. It was also hypothesized that many TE-derived exons act as post-transcriptional gene regulators instead of being part of the protein-coding sequence itself [40]. The prevalence of Alu elements as TE-derived exons can be linked not only to their high copy number -with 1200,000 copies, they constitute as much as 10% of the human genome [44], but also to the fact that Alu sequences contain many potential splicing sites [45]. Alu elements indeed present up to ten 5′ and thirteen 3′ cryptic splicing sites that can be activated into functional splice sites through mutations or modifications such as adenosine-to-inosine RNA editing [38, 41]. Alu exons often modulate translational efficiency and can lead to lineage-specific regulations of gene translation [46]. Alu exonization can also cause genetic diseases in human such as the Alport syndrome, which is characterized by progressive renal failure, hearing loss and ocular abnormalities [47]. LINEs and to a lesser extent LTR retroelements can be exonized too [48, 49].

Exonization of intronic insertions is influenced by multiple factors. In the human genome, exonization is promoted by large intron size, high intronic GC content, and, importantly, by the presence of young transposable elements, in particular close to transcription starting sites [50]. These factors might contribute to a decrease of RNA polymerase II elongation rate and to a reduction of spliceosomal efficiency, allowing an increase of the “window of opportunity” for spliceosomal recognition and thus for exonization. Other mechanisms inhibit Alu exonization. It has been shown in human that the RNA-binding protein hnRNP C prevents Alu exonization by avoiding the binding of splicing factor U2AF65 to Alu cryptic exons, thus blocking Alu splicing sites; this prohibits Alu exon inclusion that would potentially lead to the formation of aberrant transcripts [51]. The binding of hnRNP C to Alu RNA is highly dependent on two poly(U) tracts present in Alu sequences inserted and transcribed in antisense orientation compared to the gene. These poly(U) arise from the antisense transcription by the gene promoter of the Alu terminal poly(A) and the internal poly(A) linker separating the two arms of Alu sequences (Alu are dimeric elements). Point mutations in these Alu poly(U) sequences are sufficient to impair the binding of hnRNP C [51]. Thus, the accumulation of mutations preventing hnRNP C binding can favor Alu exon inclusion.

Some examples illustrate well how intronic TEs can drive transcriptome and proteome diversification through the formation of lineage- and tissue-specific alternative exons. The vertebrate lamina-associated polypeptide 2 gene (tmpo for thymopoetin) encodes several membrane protein isoforms including LAP2β suggested to control nuclear lamina dynamics at the nuclear periphery by binding specifically to B-type lamins. Another isoform, the mammalian-specific LAP2α protein, has a domain derived from the gag ORF of a DIRS1-like retrotransposon [52]. Unlike other isoforms, LAP2α is a non-membrane protein that binds to A-type lamins in the nucleoplasm [53]. This isoform is implicated in nuclear organization dynamics during the cell cycle [54, 55]. A mutation in the TE-derived domain of LAP2α has been associated with dilated cardiomyopathy in humans [56].

In mammals, the gene prl3c1 belonging to the prolactin gene family encodes a cytokine expressed in uterine decidua and implicated in the establishment of pregnancy. In rodents, this gene has acquired a novel transcript variant in a common ancestor of the house mouse Mus musculus, M. spretus and M. caroli through the insertion of a composite TE into its first intron [57]. The inserted TE, which consists of an LTR element interrupted by a LINE, gave rise to an alternative promoter and an alternative first exon. In contrast to the “classical” transcript, the new variant is expressed in the Leydig cells of the testis. The variant protein shows a different intracellular localization and modulates the growth of testes and their capacity to produce testosterone and sperm. Such a TE co-option might contribute to the diversity of testicular development and functioning.

The rtdpoz-T1 and rtdpoz-T2 retrogenes, specifically expressed in testis and in the develo** embryo in rat, and supposed to encode nuclear scaffold proteins functioning as transcription regulators, have multiple exons deriving from TE sequences [58, 59]. For example, rtdpoz-T1 has 5 out of 8 exons and an alternative polyadenylation signal that are derived from various TEs, mainly L1 and ERVs. These TE-derived exons may be implicated in the translational regulation of these transcripts, notably through the formation of upstream ORFs [59].

The vertebrate insulin-like growth factor 1 (IGF-1) is a hormone involved in the development and growth of many tissues. IGF-1 plays a role for instance in synapse maturation and skeletal muscle development. Three isoforms of IGF-1 are known, IGF-1Ea, IGF-1Eb and IGF-1Ec [60]. The IGF-1Ea isoform is conserved among vertebrates, whereas the two others are mammal-specific and coincide with the insertion of a MIR-b SINE element that allows the formation of a fifth exon [61]. This fifth exon adds a disordered tail to IGF-1, which is highly suspected to be the source of post-translational modifications and regulatory functions. This allows a lineage-specific regulation of IGF-1.

Finally, the exonization of an Alu-J SINE element has been linked to the evolution of hemochorial placentation in anthropoid primates [62]. Hemochorial placentation is a placental implantation specific to rodents and higher order primates. In this type of placenta, the maternal blood is separated from the fetal blood by only one barrier, the chorion. This may optimize nutrient and gas exchange but makes the immune tolerance more challenging. The chorionic gonadotropin (CG) is a heterodimeric glycoprotein hormone formed by an alpha subunit, the glycoprotein hormone alpha (GPHA), and a beta subunit CGB [63]. CG is involved in the regulation of ovarian, testicular and placental functions. An Alu-J is inserted in the gpha gene in anthropoid primates, and its alternative exonization induces the formation of a GPHA isoform called Alu-GPHA that contains an additional N-terminus [62]. This isoform is only expressed in chorionic villus tissues and placenta, while the GPHA isoform without the Alu is expressed in other tissues. In human, the heterodimer Alu-hCG formed with the subunit Alu-GPHA shows a longer serum half-life and has a better trophoblast invasion activity compared to hCG, allowing the improvement of placenta implantation and invasion.

TE molecular domestication to form new protein-coding genes

TEs can give rise to new functional host genes, a process known as molecular domestication (Fig. 1b). In the human genome, more than hundred protein-coding genes are thought to be derived from TEs [64, 65], representing about 0.5% of the complete set of human protein-coding genes. For example, the mammalian centromere protein B (CENP-B) is derived from the transposase of a pogo-like DNA transposon [66, 67]. Like its transposase ancestor, this protein is able to bind DNA. CENP-B is involved in centromere formation during both interphase and mitosis, and directs kinetochore assembly. Ty3/gypsy LTR retrotransposons have given rise to several multigenic gene families including the Paraneoplastic (PNMA, also called Ma genes, 15 genes), MART (12 genes) and SCAN families (56 genes) [68,69,70,71]. Overall, at least 103 genes derived from GAG proteins of Gypsy LTR retrotransposons have been identified in mammalian genomes, 85 being present in the human genome.

TE domestication and lymphocyte development

Two important TE-derived proteins in jawed vertebrates are RAG1 and RAG2 (Recombination Activating Gene 1 and 2) that together catalyze the V(D)J somatic recombination, a mechanism essential for the establishment of the vertebrate immune repertoire [72]. This genetic recombination, which takes place in develo** lymphocytes, is at the basis of the adaptive immune system, since it allows the formation of diverse antibodies and T-cell receptors capable of specifically recognizing a great variety of pathogens. Pathogen recognition is ensured by the antigen-binding domain, which is encoded after assembling gene segments called variable (V), diversity (D) and joining (J). The joining of different V, D and J segments generates, in association with additional mutational processes, the great diversity of antibodies that can be produced by a jawed vertebrate.

RAG1 and RAG2 lymphoid-specific endonucleases are key enzymes for this somatic recombination. Both proteins associate as a recombinase to introduce double-strand breaks in DNA at recombination signal sequences (RSSs) that frame each V, D and J gene segment. This DNA cleavage resembles the transposition mechanism of DNA transposons in early steps. Indeed, the rag1 and rag2 genes have been derived from a RAG transposon related to Transib DNA transposons approx. 500–600 million years ago [73,74,75]. The RSSs recognized by RAG1/RAG2 might be derived from the TIRs of the ancestral transposon. The hypothesis is that, at the basis of deuterostomes, a Transib element originally containing only a rag1 transposase might have captured an additional rag2 ORF, leading to a RAG transposon with increased transposition activity [76]. By comparing vertebrate RAG proteins to a RAG transposon from the amphioxus genome that carries both rag1- and rag2-like genes [76, 77], putative key mutations in the domestication process, that impaired the transposition ability of the rag genes in the post-cleavage steps, have been identified [78]. This example of molecular domestication illustrates well how a specific genomic context may favor the selection and domestication of a transposable element. Indeed, for the emergence of the V(D)J recombination, the insertion of a TE with its RSS sequences into a gene encoding an immunoglobulin-domain receptor protein was probably a prerequisite to the formation of the ancestral fragmented antigen receptor gene [78].

TE domestication and brain development

Several retrotransposon-derived genes are implicated in vertebrate brain development, such as members of the PNMA, MART, SCAN and ARC gene families, that are all derived from gag genes of Ty3/gypsy LTR retrotransposons [68,69,70,71].

The pnma10 gene (aka sizn1/zcchc12/pnma7a) from the PNMA gene family is involved in mouse forebrain development and mutations are associated with X-linked mental retardation in human [79]. The pnma5 gene shows a neocortex-specific expression in primate adult brain particularly in the association areas [80]. Higher order association areas are primate-specific areas responsible for the integration of multiple inputs such as somatosensory, visuospatial, auditory and memory processes; they contribute to perception, cognition and behavior [81]. The pnma5 gene is also present in mice but its neocortex-specific expression is not conserved. Thus, pnma5 is thought to be one of the major genes involved in the expansion and specialization of association areas in the primate brain [80].

The protein encoded by the eutherian gene sirh11 (aka mart4/rtl4), which belongs to the MART gene family, has conserved the gag zinc finger domain necessary for its binding to nucleic acids [70]. Sirh11 is of crucial function for cognition [82]. Indeed, mice sirh11 knockout mutants show impulsivity, attention and working memory defects as well as hyperactivity, suggesting a critical role in behavior. As this gene is present in eutherians only and could have conferred an essential advantage for competition by develo** cognitive functions, it has been suggested to have played an important role in eutherian evolution [82].

The placental mammal gene peg3 (zscan24) from the SCAN gene family has been also shown to be involved in mouse behavior [70]. This gene is paternally expressed during embryonic development and in adult brain. Its inactivation leads to growth retardation and abnormal maternal behavior for nest building, pup retrieval and crouching over pups, which can cause offspring death [83]. Moreover, mutant mothers present milk ejection defects. This phenotype has been related to a reduced number of oxytocin neurons. Growth retardation and abnormal maternal behavior are suggested to be due to impaired neuronal connectivity [83].

Finally, the arc tetrapod gene was shown in mice to be essential for synapse maturation and synaptic plasticity, and is involved in major neuronal processes of learning [70, 84]. Arc mutations have also been linked to several human disorders such as Alzheimer’s disease, Angelman neurodevelopmental disease, schizophrenia and autism among others, highlighting the crucial role of the arc gene in brain development and functioning [85,86,87,88,89,90,91,92]. The ARC protein has conserved structural properties similar to those of GAG proteins. Particularly, it forms capsid-like structures that transport RNA molecules across synapses and thus mediate intercellular communication between neurons [93]. Interestingly, arc-like genes called darc have been identified as duplicated copies in the genome of Drosophila melanogaster. Although tetrapod arc and Drosophila darc genes have been formed from Ty3/gypsy retrotransposons by independent molecular domestication events, they present similar properties of mRNA trafficking, suggesting evolutionary convergence [93, 94].

TE domestication and placenta development

TE molecular domestication probably played crucial roles in the appearance and diversification of placenta development during mammalian evolution (Fig. 2). For instance, the MART genes peg10 (aka mart2/rtl2) and peg11 (aka mart1/rtl1) are placental genes derived from gag and partial pol sequences of Sushi Ty3/gypsy LTR retrotransposons [95, 96]. Peg10 influences the development of the spongiotrophoblast and labyrinth layers, which are the cell layers separating the embryo from the maternal tissues of the placenta, and peg11 maintains the fetal capillary endothelial cells. Mutation of the sirh7 (aka mart7/rtl7/ldoc1) gene leads to dysregulation of placental cell differentiation and maturation linked to placental hormone overproduction [97].

Syncytin genes also play a central role in placenta development. They are derived from endogenous retrovirus envelope (env) sequences, which encode membrane proteins that allow viral fusion with the target cells necessary for infection. The SYNCYTIN proteins have kept some properties of the ancestral ENV proteins. They are able to promote cell-cell fusion, allowing trophoblast differentiation and the formation of the syncytiotrophoblast tissue, which triggers the exchange of nutrients and gases between mother and child [98,99,100]. Moreover, some SYNCYTIN proteins play a role in maternal immune tolerance, this being probably linked to the capacity of parental retroviruses to target and repress immune cells thanks to the immunosuppressive activity of the ENV protein [101,102,103]. Indeed, at least one human (SYNCYTIN-2) and one mouse SYNCYTIN (SYNCYTIN-B) show immunosuppressive activity in vivo in mouse [104].

Among placental mammals, 14 different syncytin genes have been identified in different lineages presenting various placenta structures characterized by different invasion levels of the uterus by trophoblast cells. The different syncytin genes, their expression and their properties may play a role in the placental morphological diversity observed among mammals. In sheep, the env gene of a very recently endogenized Jaagsiekte Sheep Retrovirus (JSRV), present at ca. 20 copies in the genome, has functions similar to those of syncytin domesticated genes [105]. This env gene indeed contributes to trophectoderm (first epithelium of the mammalian embryo) development and leads to pregnancy loss when downregulated. This might represent an example of a retrovirus gene being on the way of molecular domestication. Additionally, the human gene suppressyn has also been identified as an ERV env-derived gene [106]. Its protein product acts as a regulator of SYNCYTIN by binding to SYNCYTIN-1 receptor, thus inhibiting SYNCYTIN-1-mediated cell fusion.

Interestingly, syncytin genes in different lineages are not orthologous and have been formed by independent events of molecular domestication of ERV envelope genes, testifying for a fascinating case of convergent evolution. This underlines how TEs can represent (almost) ready-to-use molecular material that can be repurposed independently several times during the evolution of different lineages. In addition, it has been recently demonstrated that ERV env sequence captures are not specific of eutherian mammals, since other syncytin genes of independent origins have been found in marsupials and even in some viviparous lizards [107, 108].

Mammalian placenta evolution through the molecular domestication of several different retrotransposon and retrovirus genes has been proposed to follow a “baton pass” mechanism [109]. First, the early birth and high conservation of the three LTR retrotransposon-derived genes peg10, peg11 and sirh7 among mammals suggest that they could be at the origin of the primitive placenta at the base of placental mammals. Subsequently, an ancestral gene responsible for cell fusion may have been substituted by syncytin gene(s), which might have then replaced one another, ensuring or even improving the function and the performance of the previous syncytin gene, and allowing placenta morphological innovations [109, 110].

Placenta appears thus to be the place of multiple events of TE co-option. Some studies suggest that these domestications may have been facilitated by the hypomethylation of DNA in placenta compared to other tissues, allowing higher TE expression and subsequent easier TE recruitment [111, 112].

TE domestication and the diverse roles of the ZBED family

The ZBED gene family derives from hAT DNA transposons, and more precisely from the BED zinc finger domain of their transposase, which is involved in DNA binding [113]. This gene family is implicated in various aspects of tissue or organ development in vertebrates. For example, the mammalian ZBED3 binds to the AXIN protein to form a complex that regulates the Wnt/β-catenin signaling pathway, which is essential for embryogenesis and carcinogenesis [114]. In addition to the BED domain, zbed1, zbed4 and zbed6 also kept the DDE catalytic domain of the ancestral TE transposase, which contains an ⍺-helical domain and a dimerization domain. Present in bony vertebrates, zbed4 is proposed to be involved in retinal morphogenesis and in the functioning of Müller retinal glial cells by activating the transcription of genes expressed in Müller cells or by regulating their nuclear hormone receptors [115]. The placental mammal gene zbed6 encodes a transcription factor essential for muscle development. A single nucleotide (nt) mutation in an igf2 intronic sequence prevents the repression of this gene by ZBED6, leading to an increase in muscle growth and heart size and to a decrease in fat deposition [116]. ChIP-sequencing experiments have revealed about 1200 additional putative genes targeted by ZBED6, with particular enrichment in genes involved in development, cell differentiation, morphogenesis, neurogenesis, cell-cell signaling and muscle development. Finally, the vertebrate gene zbed1 is implicated in cell proliferation by regulating several ribosomal protein genes [117, 118].

TEs as a source of new non-coding RNA genes

TE-derived small non-coding RNAs

TE sequences can be a source of small non-coding RNAs (sncRNAs) (Fig. 1c). Several studies have shown that some sncRNAs can derive from TEs, such as microRNAs (miRNAs) [119] and Piwi-interacting RNAs (piRNAs) [120]. These sncRNAs generally constitute TE silencing factors, but they have also shown abilities to regulate host gene expression by sequence complementarity through mRNA degradation and translation inhibition (Fig. 3a). sncRNAs can also induce DNA methylation of the loci close to the nascent mRNA their target. This can induce heterochromatinization, which can spread in the targeted genomic region and thus can potentially lead to the transcriptional repression of neighboring genes (Fig. 3a) [121].

Conclusions

In this review, we present an overview of the multiple TE resources and functionalities that can be co-opted by host genomes (Fig. 4). TEs can be the source of developmental innovations through their recruitment as new coding sequences and new ncRNAs, and by acting as regulatory sequences, even if TEs are probably less active in gene regulation than expected from their abundance in vertebrate genomes [215]. Particularly, TEs have been instrumental to the evolution of brain, placenta, immunity and embryonic development in vertebrates. The pace of TE recruitment in vertebrate developmental program remains to be investigated. According to the developmental gene hypothesis for punctuated equilibrium, developmental regulatory genes essential for organism morphogenesis are extremely conserved and intolerant to mutations, maintaining an equilibrium state [279]. Changes might not be progressive but rather punctuated, this being often due to transposable elements accumulation and co-option as regulatory sequences to give rise to bursts of morphological innovations and species divergence.

Concerning the formation of new genes, Ohno proposed in 1999 that gene duplication is the main mechanism sha** evolutionary transitions [33]. New genes can also be formed from scratch, but this mechanism is very rare. We show here that TEs are a major source of material for the birth of novel protein-coding and RNA genes. In the absence of events of whole genome duplications, it has been estimated in primates that 53% of new genes originate at least partially from TE exaptation (mostly in primate-specific regions) compared to 24% from gene duplication and 5.5% de novo from non-coding sequences (the origin of the last 17.5% is still unclear) [280]. The contribution of TEs in this process is thus quantitatively important, in addition to the new functions they provide to the genome.

Several characteristics could modulate the propensity of TEs to be exapted. First, the different characteristics of each TE, such as the presence/absence of internal promoters, protein-binding motifs and ORFs encoding proteins with various properties, might favor the domestication of certain families depending on the needs of the host. For instance, ERVs have greater capacities to become gene regulatory drivers than most other TE families [215]. This has been proposed to be linked to the frequent loss of functional internal genes in ERVs, which abolish their transposition ability but leaves LTRs in genomes that can be readily repurposed. ERVs are frequently non-repressed in hypomethylated tissues, this also possibly facilitates their recruitment. Second, the age of the TE sequences might also be of importance. Repressive silencing being relaxed in old TEs, the repression of younger elements in the genome might limit their chance to be recruited by the host. Third, the activity, copy number and diversity of a TE family probably influence its evolutionary potential for the host. Even if low copy number elements can also lead to important innovations, as shown for the Izanagi transposon in the sex determination cascade of the medaka fish [236], high copy number and diversity of TEs might increase the probability of generating an element advantageous for the host at both sequence and localization levels. On the other hand, maintenance of transposition activity and recombination opportunity with other TE copies might hinder the fixation of a beneficial TE-derived sequence at a specific position in the genome. Fourth, the insertion preferences of TEs or the strength of the selection pressure against their maintenance certainly impact their possible recruitment. While TEs inserting or better tolerated in gene-poor regions will probably undergo less counter-selection, they might be often silenced in heterochromatin. On the other hand, TE preferential insertion or tolerance in gene-rich regions might be more frequently deleterious but could also increase the chance of generating a beneficial combination between TE and host sequences [27]. This might for example be the case for Alu elements in primates, which are probably better tolerated than LINEs in gene-rich regions due to their smaller size and therefore more frequently recruited in exaptation processes. The major factor influencing the co-option of a TE is probably the context of its insertion, as proposed for the domestication of the Transib-like DNA transposon at the origin of the V(D)J recombination [281]. A significant part (36.5% in the human genome) of TE-derived genes are positioned head-to-head to a host gene and share with him a bidirectional promoter containing a CpG island [282]. Since CpG islands correspond to open and actively transcribed chromatin regions, these promoters could be targeted by TE insertions and would provide them with a permissive transcriptional context for their expression, favoring the TE recruitment by the host as new transcribed sequences. TE domestication might also be facilitated by an insertion close to a promoter, or when the insertion results in a fusion with a host gene, with the TE possibly benefiting from the regulatory elements of the linked host gene if this gene is expressed in the germ line [64, 283, 284]. Fifth, if a novel TE is acquired by horizontal transfer, it will transiently escape the repression mechanisms of the host, bringing new evolutionary potentialities and recruitment opportunities.

Developmental pathways are closely linked to those causing cancer. Illustrating this, several examples of TE-derived developmental innovations have also been associated to cancer formation. The human syncytin-1 gene, involved in immunomodulation and cell-cell fusion in placenta, is expressed in several cancers such as colorectal and breast cancers, and endometrial carcinoma [285,286,287]. Several genes of the PNMA family have also been implicated in cancers, such as pnma5 or pnma7a, which acts as an oncogene in thyroid cancers [288, 289]. Finally, the RAG1/RAG2 recombinase, which catalyzes the V(D)J recombination, is a driver of the genetic instability linked to lymphoblastic leukemia [290].

To conclude, Barbara McClintock’s initial model [1] is now widely illustrated. In addition to form “controlling elements”, TEs are also a rich source of new host coding and RNA sequences. Most current examples illustrating the role of TE-derived sequences in vertebrate developmental innovation stems from mammals, but it is reasonable to think that TEs play also a major role in the evolution of other vertebrate species, which generally present even a higher diversity of transposable elements compared to mammals [21]. More studies in other vertebrate sub-lineages are therefore needed. For instance, an accumulation of TE sequences in the Hox gene clusters has been recently reported in four species of squamates (green-anole lizard, slow-worm, corn snake and gecko), which contrasts with the extremely conserved structure of Hox clusters in other vertebrates [291, 292]. It has been suggested that these TEs may provide new coding and non-coding regions or novel regulations of transcription to the cluster genes. The emergence of such elements inside the Hox clusters may explain the observed morphological diversity of squamates, but this hypothesis must now be tested at the functional level [292, 293]. The accurate characterization of the whole mobilome of multiple and divergent vertebrate species, i.e. the accurate and complete genome-wide identification and annotation of TEs and TE-derived sequences in genomes along with their evolutionary and functional characteristics, is an ongoing challenge that will allow to better assess the impact of TEs on vertebrate evolution.

Availability of data and materials

Not applicable.

Abbreviations
2C:

Two-Cell stage

ERV:

Endogenous Retroviruse

HERV:

Human Endogenous RetroVirus

JSRV:

Jaagsiekte Sheep Retrovirus

KRAB-ZFP:

Kruppel-associated box zinc finger proteins

lincRNA:

long intergenic non-coding RNAs

LINE:

Long Interspersed Nuclear Elements

lncRNA:

long non-coding RNAs

LTR:

Long Terminal Repeat

MER:

Medium Reiteration frequency

MIR:

Mammalian-wide Interspersed Repeat

miRNA :

microRNA

MITE:

Miniature Inverted Repeat Transposable Element

nt:

nucleotide

ORF:

Open Reading Frame

piRNA:

PIWI-interacting RNAs

RSS:

Recombination Signal Sequence

RT:

Reverse Transcriptase

SINE:

Short Interspersed Nuclear Elements

siteRNA:

small intronic transposable element RNA

sncRNA:

small non-coding RNA

TAD:

Topologically Associated Domains

TBP:

TATA box-binding protein

TE:

Transposable Element

TF:

Transcription Factor

TFBS:

Transcription Factor Binding Site

TIR:

Terminal Inverted Repeat

TSC:

Trophoblast Stem Cell

UTR:

Untranslated Region

References
McClintock B. Controlling elements and the gene. Cold Spring Harb Symp Quant Biol. 1956;21:197–216.
Article CAS PubMed Google Scholar
Kazazian HH. Mobile elements: drivers of genome evolution. Science. 2004;303(5664):1626–32.
Article CAS PubMed Google Scholar
Biémont C, Vieira C. Junk DNA as an evolutionary force. Nature. 2006;443(7111):521–4.
Article PubMed CAS Google Scholar
Bourque G, Burns KH, Gehring M, Gorbunova V, Seluanov A, Hammell M, et al. Ten things you should know about transposable elements. Genome Biol. 2018;19(1):199.
Article CAS PubMed PubMed Central Google Scholar
Wicker T, Sabot F, Hua-Van A, Bennetzen JL, Capy P, Chalhoub B, et al. A unified classification system for eukaryotic transposable elements. Nat Rev Genet. 2007;8(12):973–82.
Article CAS PubMed Google Scholar
Kapitonov VV, Jurka J. A universal classification of eukaryotic transposable elements implemented in Repbase. Nat Rev Genet. 2008;9(5):411–2.
Article PubMed Google Scholar
Beauregard A, Curcio MJ, Belfort M. The take and give between retrotransposable elements and their hosts. Annu Rev Genet. 2008;42(1):587–617.
Article CAS PubMed PubMed Central Google Scholar
Goodier JL. Restricting retrotransposons: a review. Mobile DNA. 2016;7(1):16.
Article PubMed PubMed Central Google Scholar
Curcio MJ, Derbyshire KM. The outs and ins of transposition: from mu to kangaroo. Nat Rev Mol Cell Biol. 2003;4(11):865–77.
Article CAS PubMed Google Scholar
Dewannieux M, Esnault C, Heidmann T. LINE-mediated retrotransposition of marked Alu sequences. Nat Genet. 2003;35(1):41–8.
Article CAS PubMed Google Scholar
Richardson SR, Doucet AJ, Kopera HC, Moldovan JB, Garcia-Perez JL, Moran JV. The influence of LINE-1 and SINE retrotransposons on mammalian genomes. Microbiol Spectr. 2015;3(2):MDNA3–0061–2014.
Article Google Scholar
Feschotte C, Pritham EJ. DNA transposons and the evolution of eukaryotic genomes. Annu Rev Genet. 2007;41:331–68.
Article CAS PubMed PubMed Central Google Scholar
Kapitonov VV, Jurka J. Helitrons on a roll: eukaryotic rolling-circle transposons. Trends Genet. 2007;23(10):521–9.
Article CAS PubMed Google Scholar
Thomas J, Pritham EJ. Helitrons, the eukaryotic rolling-circle transposable elements. Microbiol Spectr. 2015;3(4):893–926.
Kapitonov VV, Jurka J. Self-synthesizing DNA transposons in eukaryotes. Proc Natl Acad Sci U S A. 2006;103(12):4540–5.
Article CAS PubMed PubMed Central Google Scholar
Krupovic M, Koonin EV. Polintons: a hotbed of eukaryotic virus, transposon and plasmid evolution. Nat Rev Microbiol. 2015;13(2):105–15.
Article CAS PubMed Google Scholar
Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, et al. The B73 maize genome: complexity, diversity, and dynamics. Science. 2009;326(5956):1112–5.
Article CAS PubMed Google Scholar
Carr M, Bensasson D, Bergman CM. Evolutionary genomics of transposable elements in Saccharomyces cerevisiae. PLoS ONE. 2012;7(11):e50978.
Article CAS PubMed PubMed Central Google Scholar
Pritham EJ, Feschotte C, Wessler SR. Unexpected diversity and differential success of DNA transposons in four species of Entamoeba protozoans. Mol Biol Evol. 2005;22(9):1751–63.
Article CAS PubMed Google Scholar
Carlton JM, Hirt RP, Silva JC, Delcher AL, Schatz M, Zhao Q, et al. Draft genome sequence of the sexually transmitted pathogen Trichomonas vaginalis. Science. 2007;315(5809):207–12.
Article PubMed PubMed Central Google Scholar
Chalopin D, Naville M, Plard F, Galiana D, Volff J-N. Comparative analysis of transposable elements highlights mobilome diversity and evolution in vertebrates. Genome Biol Evol. 2015;7(2):567–80.
Article CAS PubMed PubMed Central Google Scholar
Kidwell MG, Lisch DR. Transposable elements and host genome evolution. Trends Ecol Evol. 2000;15(3):95–9.
Article CAS PubMed Google Scholar
Warren IA, Naville M, Chalopin D, Levin P, Berger CS, Galiana D, et al. Evolutionary impact of transposable elements on genomic diversity and lineage-specific innovation in vertebrates. Chromosome Res. 2015;23(3):505–31.
Article CAS PubMed Google Scholar
Lee H-E, Ayarpadikannan S, Kim H-S. Role of transposable elements in genomic rearrangement, evolution, gene regulation and epigenetics in primates. Genes Genet Syst. 2015;90(5):245–57.
Article CAS PubMed Google Scholar
Garcia-Perez JL, Widmann TJ, Adams IR. The impact of transposable elements on mammalian development. Development. 2016;143(22):4101–14.
Article CAS PubMed Google Scholar
Chuong EB, Elde NC, Feschotte C. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science. 2016;351(6277):1083–7.
Article CAS PubMed PubMed Central Google Scholar
Chuong EB, Elde NC, Feschotte C. Regulatory activities of transposable elements: from conflicts to benefits. Nat Rev Genet. 2017;18(2):71–86.
Article CAS PubMed Google Scholar
Jangam D, Feschotte C, Betrán E. Transposable element domestication as an adaptation to evolutionary conflicts. Trends Genet. 2017;33(11):817–31.
Article CAS PubMed PubMed Central Google Scholar
Kumar S, Hedges SB. A molecular timescale for vertebrate evolution. Nature. 1998;392(6679):917–20.
Article CAS PubMed Google Scholar
Shimeld SM, Holland PWH. Vertebrate innovations. Proc Natl Acad Sci U S A. 2000;97(9):4449–52.
Article CAS PubMed PubMed Central Google Scholar
Khaner O. Evolutionary innovations of the vertebrates. Integr Zool. 2007;2(2):60–7.
Article PubMed Google Scholar
Sugahara F, Murakami Y, Pascual-Anaya J, Kuratani S. Reconstructing the ancestral vertebrate brain. Develop Growth Differ. 2017;59(4):163–74.
Article Google Scholar
Ohno S. Gene duplication and the uniqueness of vertebrate genomes circa 1970–1999. Semin Cell Dev Biol. 1999;10(5):517–22.
Article CAS PubMed Google Scholar
King M, Wilson A. Evolution at two levels in humans and chimpanzees. Science. 1975;188(4184):107–16.
Article CAS PubMed Google Scholar
Carroll SB, Grenier JK, Weatherbee SD. From DNA to diversity: molecular genetics and the evolution of animal design. 2nd ed. Malden: Blackwell Pub; 2005. p. 258.
Google Scholar
Marlétaz F, Firbas PN, Maeso I, Tena JJ, Bogdanovic O, Perry M, et al. Amphioxus functional genomics and the origins of vertebrate gene regulation. Nature. 2018;564(7734):64–70.
Article PubMed PubMed Central CAS Google Scholar
Sela N, Mersch B, Gal-Mark N, Lev-Maor G, Hotz-Wagenblatt A, Ast G. Comparative analysis of transposed element insertion within human and mouse genomes reveals Alu’s unique role in sha** the human transcriptome. Genome Biol. 2007;8(6):R127.
Article PubMed PubMed Central CAS Google Scholar
Sela N, Mersch B, Hotz-Wagenblatt A, Ast G. Characteristics of transposable element exonization within human and mouse. PLoS ONE. 2010;5(6):e10907.
Article PubMed PubMed Central CAS Google Scholar
Sela N, Kim E, Ast G. The role of transposable elements in the evolution of non-mammalian vertebrates and invertebrates. Genome Biol. 2010;11(6):R59.
Article PubMed PubMed Central Google Scholar
Piriyapongsa J, Rutledge MT, Patel S, Borodovsky M, Jordan IK. Evaluating the protein coding potential of exonized transposable element sequences. Biol Direct. 2007;2(1):31.
Article PubMed PubMed Central CAS Google Scholar
Sorek R, Ast G, Graur D. Alu-containing exons are alternatively spliced. Genome Res. 2002;12(7):1060–7.
Article CAS PubMed PubMed Central Google Scholar
Modrek B, Lee CJ. Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss. Nat Genet. 2003;34(2):177–80.
Article CAS PubMed Google Scholar
Alekseyenko AV, Kim N, Lee CJ. Global analysis of exon creation versus loss and the role of alternative splicing in 17 vertebrate genomes. RNA. 2007;13(5):661–70.
Article CAS PubMed PubMed Central Google Scholar
International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature. 2001;409(6822):860–921.
Article Google Scholar
Krull M, Brosius J, Schmitz J. Alu-SINE exonization: En route to protein-coding function. Mol Biol Evol. 2005;22(8):1702–11.
Article CAS PubMed Google Scholar
Shen S, Lin L, Cai JJ, Jiang P, Kenkel EJ, Stroik MR, et al. Widespread establishment and regulatory impact of Alu exons in human genes. Proc Natl Acad Sci U S A. 2011;108(7):2837–42.
Article CAS PubMed PubMed Central Google Scholar
Nozu K, Iijima K, Ohtsuka Y, Fu XJ, Kaito H, Nakanishi K, et al. Alport syndrome caused by a COL4A5 deletion and exonization of an adjacent AluY. Mol Genet Genomic Med. 2014;2(5):451–3.
Article CAS PubMed PubMed Central Google Scholar
Piriyapongsa J, Polavarapu N, Borodovsky M, McDonald J. Exonization of the LTR transposable elements in human genome. BMC Genomics. 2007;8:291.
Article PubMed PubMed Central CAS Google Scholar
Attig J, Agostini F, Gooding C, Chakrabarti AM, Singh A, Haberman N, et al. Heteromeric RNP assembly at LINEs controls lineage-specific RNA processing. Cell. 2018;174(5):1067–1081.e17.
Article CAS PubMed PubMed Central Google Scholar
Avgan N, Wang JI, Fernandez-Chamorro J, Weatheritt RJ. Multilayered control of exon acquisition permits the emergence of novel forms of regulatory control. Genome Biol. 2019;20(1):141.
Article PubMed PubMed Central CAS Google Scholar
Zarnack K, König J, Tajnik M, Martincorena I, Eustermann S, Stévant I, et al. Direct competition between hnRNP C and U2AF65 protects the transcriptome from the exonization of Alu elements. Cell. 2013;152(3):453–66.
Article CAS PubMed PubMed Central Google Scholar
Abascal F, Tress ML, Valencia A. Alternative splicing and co-option of transposable elements: the case of TMPO/LAP2α and ZNF451 in mammals. Bioinformatics. 2015;31(14):2257–61.
Article CAS PubMed PubMed Central Google Scholar
Dechat T, Korbei B, Vaughan OA, Vlcek S, Hutchison CJ, Foisner R. Lamina-associated polypeptide 2alpha binds intranuclear A-type lamins. J Cell Sci. 2000;113(Pt 19):3473–84.
Article CAS PubMed Google Scholar
Dechat T. Detergent-salt resistance of LAP2alpha in interphase nuclei and phosphorylation-dependent association with chromosomes early in nuclear assembly implies functions in nuclear structure dynamics. EMBO J. 1998;17(16):4887–902.
Article CAS PubMed PubMed Central Google Scholar
Vlcek S. Just H, Dechat T, Foisner R. Functional diversity of LAP2α and LAP2β in postmitotic chromosome association is caused by an α-specific nuclear targeting domain. EMBO J. 1999;18(22):6370–84.
Article CAS PubMed PubMed Central Google Scholar
Taylor MRG, Slavov D, Gajewski A, Vlcek S, Ku L, Fain PR, et al. Thymopoietin (lamina-associated polypeptide 2) gene mutation associated with dilated cardiomyopathy. Hum Mutat. 2005;26(6):566–74.
Article CAS PubMed Google Scholar
Bu P, Yagi S, Shiota K, Alam SMK, Vivian JL, Wolfe MW, et al. Origin of a rapidly evolving homeostatic control system programming testis function. J Endocrinol. 2017;234(2):217–32.
Article CAS PubMed PubMed Central Google Scholar
Huang C-J, Chen C-Y, Chen H-H, Tsai S-F, Choo K-BTDPOZ. a family of bipartite animal and plant proteins that contain the TRAF (TD) and POZ/BTB domains. Gene. 2004;324:117–27.
Article CAS PubMed Google Scholar
Huang C-J, Lin W-Y, Chang C-M, Choo K-B. Transcription of the rat testis-specific Rtdpoz-T1 and -T2 retrogenes during embryo development: co-transcription and frequent exonisation of transposable element sequences. BMC Mol Biol. 2009;10(1):74.
Article PubMed PubMed Central CAS Google Scholar
Barton ER. The ABCs of IGF-I isoforms: impact on muscle hypertrophy and implications for repair. Appl Physiol Nutr Metab. 2006;31(6):791–7.
Article CAS PubMed Google Scholar
Annibalini G, Bielli P, De Santi M, Agostini D, Guescini M, Sisti D, et al. MIR retroposon exonization promotes evolutionary variability and generates species-specific expression of IGF-1 splice variants. Biochim Biophys Acta. 2016;1859(5):757–68.
Article CAS PubMed Google Scholar
Chen H, Chen L, Wu Y, Shen H, Yang G, Deng C. The exonization and functionalization of an Alu-J element in the protein coding region of glycoprotein hormone alpha gene represent a novel mechanism to the evolution of hemochorial placentation in primates. Mol Biol Evol. 2017;34(12):3216–31.
Article CAS PubMed Google Scholar
Fournier T, Guibourdenche J, Review E-BD. hCGs: Different sources of production, different glycoforms and functions. Placenta. 2015;36:S60–5.
Article CAS PubMed Google Scholar
Volff J-N. Turning junk into gold: domestication of transposable elements and the creation of new genes in eukaryotes. Bioessays. 2006;28(9):913–22.
Article CAS PubMed Google Scholar
Alzohairy AM, Gyulai G, Jansen RK, Bahieldin A. Transposable elements domesticated and neofunctionalized by eukaryotic genomes. Plasmid. 2013;69(1):1–15.
Article CAS PubMed Google Scholar
Tudor M, Lobocka M, Goodell M, Pettitt J, O’Hare K. The pogo transposable element family of Drosophila melanogaster. Mol Gen Genet. 1992;232(1):126–34.
Article CAS PubMed Google Scholar
Smit AF, Riggs AD. Tiggers and DNA transposon fossils in the human genome. Proc Natl Acad Sci U S A. 1996;93(4):1443–8.
Article CAS PubMed PubMed Central Google Scholar
Volff J-N, Körting C, Schartl M. Ty3/Gypsy retrotransposon fossils in mammalian genomes: Did they evolve into new cellular functions? Mol Biol Evol. 2001;18(2):266–70.
Article CAS PubMed Google Scholar
Brandt J, Veith AM, Volff J-N. A family of neofunctionalized Ty3/gypsy retrotransposon genes in mammalian genomes. Cytogenet Genome Res. 2005;110(1–4):307–17.
Article CAS PubMed Google Scholar
Campillos M, Doerks T, Shah PK, Bork P. Computational characterization of multiple Gag-like human proteins. Trends Genet. 2006;22(11):585–9.
Article CAS PubMed Google Scholar
Chalopin D, Galiana D, Volff J-N. Genetic innovation in vertebrates: gypsy integrase genes and other genes derived from transposable elements. Int J Evol Biol. 2012;2012:1–11.
Article CAS Google Scholar
Thompson CB. New insights into V(D) J recombination and its role in the evolution of the immune system. Immunity. 1995;3(5):531–9.
Article CAS PubMed Google Scholar
Kapitonov VV, Jurka J. RAG1 core and V(D) J recombination signal sequences were derived from Transib transposons. PLoS Biol. 2005;3(6):e181.
Article PubMed PubMed Central CAS Google Scholar
Kapitonov VV, Koonin EV. Evolution of the RAG1-RAG2 locus: both proteins came from the same transposon. Biol Direct. 2015;10(1):20.
Article PubMed PubMed Central CAS Google Scholar
Carmona LM, Schatz DG. New insights into the evolutionary origins of the recombination-activating gene proteins and V(D) J recombination. FEBS J. 2017;284(11):1590–605.
Article CAS PubMed PubMed Central Google Scholar
Carmona LM, Fugmann SD, Schatz DG. Collaboration of RAG2 with RAG1-like proteins during the evolution of V(D) J recombination. Genes Dev. 2016;30(8):909–17.
Article CAS PubMed PubMed Central Google Scholar
Huang S, Tao X, Yuan S, Zhang Y, Li P, Beilinson HA, et al. Discovery of an active RAG transposon illuminates the origins of V(D) J recombination. Cell. 2016;166(1):102–14.
Article CAS PubMed PubMed Central Google Scholar
Zhang Y, Cheng TC, Huang G, Lu Q, Surleac MD, Mandell JD, et al. Transposon molecular domestication and the evolution of the RAG recombinase. Nature. 2019;569(7754):79–84.
Article CAS PubMed PubMed Central Google Scholar
Cho G, Lim Y, Golden JA. XLMR candidate mouse gene, Zcchc12 (Sizn1) is a novel marker of Cajal–Retzius cells. Gene Expr Patterns. 2011;11(3–4):216–20.
Article CAS PubMed Google Scholar
Takaji M, Komatsu Y, Watakabe A, Hashikawa T, Yamamori T. Paraneoplastic antigen-like 5 gene (PNMA5) is preferentially expressed in the association areas in a primate specific manner. Cereb Cortex. 2009;19(12):2865–79.
Article PubMed PubMed Central Google Scholar
Yamamori T. Selective gene expression in regions of primate neocortex: Implications for cortical specialization. Prog Neurobiol. 2011;94(3):201–22.
Article CAS PubMed Google Scholar
Irie M, Yoshikawa M, Ono R, Iwafune H, Furuse T, Yamada I, et al. Cognitive function related to the Sirh11/Zcchc16 gene acquired from an LTR retrotransposon in eutherians. PLoS Genet. 2015;11(9):e1005521.
Article PubMed PubMed Central CAS Google Scholar
Li L, Keverne EB, Aparicio SA, Ishino F, Barton SC, Surani MA. Regulation of maternal behavior and offspring growth by paternally expressed Peg3. Science. 1999;284(5412):330–3.
Article CAS PubMed Google Scholar
Plath N, Ohana O, Dammermann B, Errington ML, Schmitz D, Gross C, et al. Arc/Arg3.1 Is essential for the consolidation of synaptic plasticity and memories. Neuron. 2006;52(3):437–44.
Article CAS PubMed Google Scholar
Park S, Park JM, Kim S, Kim J-A, Shepherd JD, Smith-Hicks CL, et al. Elongation factor 2 and fragile X mental retardation protein control the dynamic translation of Arc/Arg3.1 essential for mGluR-LTD. Neuron. 2008;59(1):70–83.
Article CAS PubMed PubMed Central Google Scholar
Greer PL, Hanayama R, Bloodgood BL, Mardinly AR, Lipton DM, Flavell SW, et al. The Angelman Syndrome protein Ube3A regulates synapse development by ubiquitinating Arc. Cell. 2010;140(5):704–16.
Article CAS PubMed PubMed Central Google Scholar
Wu J, Petralia RS, Kurushima H, Patel H, Jung M, Volk L, et al. Arc/Arg3.1 regulates an endosomal pathway essential for activity-dependent β-amyloid generation. Cell. 2011;147(3):615–28.
Article CAS PubMed PubMed Central Google Scholar
Fromer M, Pocklington AJ, Kavanagh DH, Williams HJ, Dwyer S, Gormley P, et al. De novo mutations in schizophrenia implicate synaptic networks. Nature. 2014;506(7487):179–84.
Article CAS PubMed PubMed Central Google Scholar
Purcell SM, Moran JL, Fromer M, Ruderfer D, Solovieff N, Roussos P, et al. A polygenic burden of rare disruptive mutations in schizophrenia. Nature. 2014;506(7487):185–90.
Article CAS PubMed PubMed Central Google Scholar
Alhowikan AM. Activity-regulated cytoskeleton-associated protein dysfunction may contribute to memory disorder and earlier detection of autism spectrum disorders. Med Princ Pract. 2016;25(4):350–4.
Article PubMed PubMed Central Google Scholar
Managò F, Mereu M, Mastwal S, Mastrogiacomo R, Scheggia D, Emanuele M, et al. Genetic disruption of Arc/Arg3.1 in mice causes alterations in dopamine and neurobehavioral phenotypes related to schizophrenia. Cell Rep. 2016;16(8):2116–28.
Article PubMed PubMed Central CAS Google Scholar
Pastuzyn ED, Shepherd JD. Activity-dependent Arc expression and homeostatic synaptic plasticity are altered in neurons from a mouse model of Angelman syndrome. Front Mol Neurosci. 2017;10:234.
Article PubMed PubMed Central CAS Google Scholar
Pastuzyn ED, Day CE, Kearns RB, Kyrke-Smith M, Taibi AV, McCormick J, et al. The neuronal gene Arc encodes a repurposed retrotransposon gag protein that mediates intercellular RNA transfer. Cell. 2018;172(1–2):275–288.e18.
Article CAS PubMed PubMed Central Google Scholar
Ashley J, Cordy B, Lucia D, Fradkin LG, Budnik V, Thomson T. Retrovirus-like gag protein Arc1 binds RNA and traffics across synaptic boutons. Cell. 2018;172(1–2):262–274.e11.
Article CAS PubMed PubMed Central Google Scholar
Ono R, Nakamura K, Inoue K, Naruse M, Usami T, Wakisaka-Saito N, et al. Deletion of Peg10, an imprinted gene acquired from a retrotransposon, causes early embryonic lethality. Nat Genet. 2006;38(1):101–6.
Article CAS PubMed Google Scholar
Sekita Y, Wagatsuma H, Nakamura K, Ono R, Kagami M, Wakisaka N, et al. Role of retrotransposon-derived imprinted gene, Rtl1, in the feto-maternal interface of mouse placenta. Nat Genet. 2008;40(2):243–8.
Article CAS PubMed Google Scholar
Naruse M, Ono R, Irie M, Nakamura K, Furuse T, Hino T, et al. Sirh7/Ldoc1 knockout mice exhibit placental P4 overproduction and delayed parturition. Development. 2014;141(24):4763–71.
Article CAS PubMed PubMed Central Google Scholar
Frendo J-L, Olivier D, Cheynet V, Blond J-L, Bouton O, Vidaud M, et al. Direct involvement of HERV-W Env glycoprotein in human trophoblast cell fusion and differentiation. Mol Cell Biol. 2003;23(10):3566–74.
Article CAS PubMed PubMed Central Google Scholar
Mallet F, Bouton O, Prudhomme S, Cheynet V, Oriol G, Bonnaud B, et al. The endogenous retroviral locus ERVWE1 is a bona fide gene involved in hominoid placental physiology. Proc Natl Acad Sci U S A. 2004;101(6):1731–6.
Article CAS PubMed PubMed Central Google Scholar
Dupressoir A, Vernochet C, Harper F, Guegan J, Dessen P, Pierron G, et al. A pair of co-opted retroviral envelope syncytin genes is required for formation of the two-layered murine placental syncytiotrophoblast. Proc Natl Acad Sci U S A. 2011;108(46):E1164–73.
Article CAS PubMed PubMed Central Google Scholar
Cianciolo G, Copeland T, Oroszlan S, Snyderman R. Inhibition of lymphocyte proliferation by a synthetic peptide homologous to retroviral envelope proteins. Science. 1985;230(4724):453–5.
Article CAS PubMed Google Scholar
Haraguchi S, Good RA, James-Yarish M, Cianciolo GJ, Day NK. Differential modulation of Th1- and Th2-related cytokine mRNA expression by a synthetic peptide homologous to a conserved domain within retroviral envelope protein. Proc Natl Acad Sci U S A. 1995;92(8):3611–5.
Article CAS PubMed PubMed Central Google Scholar
Schlecht-Louf G, Renard M, Mangeney M, Letzelter C, Richaud A, Ducos B, et al. Retroviral infection in vivo requires an immune escape virulence factor encrypted in the envelope protein of oncoretroviruses. Proc Natl Acad Sci U S A. 2010;107(8):3782–7.
Article CAS PubMed PubMed Central Google Scholar
Mangeney M, Renard M, Schlecht-Louf G, Bouallaga I, Heidmann O, Letzelter C, et al. Placental syncytins: Genetic disjunction between the fusogenic and immunosuppressive activity of retroviral envelope proteins. Proc Natl Acad Sci U S A. 2007;104(51):20534–9.
Article CAS PubMed PubMed Central Google Scholar
Dunlap KA, Palmarini M, Varela M, Burghardt RC, Hayashi K, Farmer JL, et al. Endogenous retroviruses regulate periimplantation placental growth and differentiation. Proc Natl Acad Sci U S A. 2006;103(39):14390–5.
Article CAS PubMed PubMed Central Google Scholar
Sugimoto J, Sugimoto M, Bernstein H, **no Y, Schust D. A novel human endogenous retroviral protein inhibits cell-cell fusion. Sci Rep. 2013;3(1):1462.
Article PubMed PubMed Central CAS Google Scholar
Cornelis G, Vernochet C, Carradec Q, Souquere S, Mulot B, Catzeflis F, et al. Retroviral envelope gene captures and syncytin exaptation for placentation in marsupials. Proc Natl Acad Sci U S A. 2015;112(5):E487–96.
Article CAS PubMed PubMed Central Google Scholar
Cornelis G, Funk M, Vernochet C, Leal F, Tarazona OA, Meurice G, et al. An endogenous retroviral envelope syncytin and its cognate receptor identified in the viviparous placental Mabuya lizard. Proc Natl Acad Sci U S A. 2017;114(51):E10991–1000.
Article CAS PubMed PubMed Central Google Scholar
Imakawa K, Nakagawa S, Miyazawa T. Baton pass hypothesis: successive incorporation of unconserved endogenous retroviral genes for placentation during mammalian evolution. Genes Cells. 2015;20(10):771–88.
Article CAS PubMed Google Scholar
Lavialle C, Cornelis G, Dupressoir A, Esnault C, Heidmann O, Vernochet C, et al. Paleovirology of ‘ syncytins ’, retroviral env genes exapted for a role in placentation. Philos Trans R Soc Lond B Biol Sci. 2013;368(1626):20120507.
Article PubMed PubMed Central CAS Google Scholar
Chapman V, Forrester L, Sanford J, Hastie N, Rossant J. Cell lineage-specific undermethylation of mouse repetitive DNA. Nature. 1984;307(5948):284–6.
Article CAS PubMed Google Scholar
Chuong EB. Retroviruses facilitate the rapid evolution of the mammalian placenta: Insights & Perspectives. BioEssays. 2013;35(10):853–61.
Hayward A, Ghazal A, Andersson G, Andersson L, Jern P. ZBED evolution: Repeated utilization of DNA transposons as regulators of diverse host functions. PLoS ONE. 2013;8(3):e59940.
Article CAS PubMed PubMed Central Google Scholar
Chen T, Li M, Ding Y, Zhang L, ** Y, Pan W, et al. Identification of zinc-finger BED domain-containing 3 (Zbed3) as a novel Axin-interacting protein that activates Wnt/β-catenin signaling. J Biol Chem. 2009;284(11):6683–9.
Article CAS PubMed PubMed Central Google Scholar
Saghizadeh M, Gribanova Y, Akhmedov NB, Farber DB. ZBED4, a cone and Müller cell protein in human retina, has a different cellular expression in mouse. Mol Vis. 2011;17:2011–8.
CAS PubMed PubMed Central Google Scholar
Markljung E, Jiang L, Jaffe JD, Mikkelsen TS, Wallerman O, Larhammar M, et al. ZBED6, a novel transcription factor derived from a domesticated DNA transposon regulates IGF2 expression and muscle growth. PLoS Biol. 2009;7(12):e1000256.
Article PubMed PubMed Central CAS Google Scholar
Ohshima N, Takahashi M, Hirose F. Identification of a human homologue of the DREF transcription factor with a potential role in regulation of the histone H1 gene. J Biol Chem. 2003;278(25):22928–38.
Article CAS PubMed Google Scholar
Yamashita D, Sano Y, Adachi Y, Okamoto Y, Osada H, Takahashi T, et al. hDREF regulates cell proliferation and expression of ribosomal protein genes. Mol Cell Biol. 2007;27(6):2003–13.
Article CAS PubMed PubMed Central Google Scholar
Qin S, ** P, Zhou X, Chen L, Ma F. The role of transposable elements in the origin and evolution of microRNAs in human. PLoS ONE. 2015;10(6):e0131365.
Article PubMed PubMed Central CAS Google Scholar
Betel D, Sheridan R, Marks DS, Sander C. Computational analysis of mouse piRNA sequence and biogenesis. PLoS Comput Biol. 2007;3(11):e222.
Article PubMed PubMed Central CAS Google Scholar
Rebollo R, Karimi MM, Bilenky M, Gagnier L, Miceli-Royer K, Zhang Y, et al. Retrotransposon-induced heterochromatin spreading in the mouse revealed by insertional polymorphisms. PLoS Genet. 2011;7(9):e1002301.
Article CAS PubMed PubMed Central Google Scholar
Bartel DP. MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell. 2004;116(2):281–97.
Article CAS PubMed Google Scholar
Smalheiser N, Torvik V. Mammalian microRNAs derived from genomic repeats. Trends Genet. 2005;21(6):322–6.
Article CAS PubMed Google Scholar
Piriyapongsa J, Mariño-Ramírez L, Jordan IK. Origin and evolution of human microRNAs from transposable elements. Genetics. 2007;176(2):1323–37.
Article CAS PubMed PubMed Central Google Scholar
Piriyapongsa J, Jordan IK. A family of human microRNA genes from miniature inverted-repeat transposable elements. PLoS ONE. 2007;2(2):e203.
Article PubMed PubMed Central CAS Google Scholar
Borchert GM, Holton NW, Williams JD, Hernan WL, Bishop IP, Dembosky JA, et al. Comprehensive analysis of microRNA genomic loci identifies pervasive repetitive-element origins. Mob Genet Elements. 2011;1(1):8–17.
Article PubMed PubMed Central Google Scholar
Roberts JT, Cooper EA, Favreau CJ, Howell JS, Lane LG, Mills JE, et al. Continuing analysis of microRNA origins: Formation from transposable element insertions and noncoding RNA mutations. Mob Genet Elements. 2013;3(6):e27755.
Article PubMed Google Scholar
Spengler RM, Oakley CK, Davidson BL. Functional microRNAs and target sites are created by lineage-specific transposition. Hum Mol Genet. 2014;23(7):1783–93.
Article CAS PubMed Google Scholar
Smalheiser N, Torvik V. Alu elements within human mRNAs are probable microRNA targets. Trends Genet. 2006;22(10):532–6.
Article CAS PubMed Google Scholar
Jahangirimoez M, Medlej A, Tavallaie M, Soltani B. Hsa-miR-587 regulates TGFβ/SMAD signaling and promotes cell cycle progression. Cell J. 2019;22(2):158–64.
Esau C, Davis S, Murray SF, Yu XX, Pandey SK, Pear M, et al. miR-122 regulation of lipid metabolism revealed by in vivo antisense targeting. Cell Metab. 2006;3(2):87–98.
Article CAS PubMed Google Scholar
Xu R-R, Zhang C-W, Cao Y, Wang Q. mir122 deficiency inhibits differentiation of zebrafish hepatoblast into hepatocyte. Hereditas (Bei**g). 2013;35(4):488–94.
Article CAS Google Scholar
Ward JR, Heath PR, Catto JW, Whyte MKB, Milo M, Renshaw SA. Regulation of neutrophil senescence by microRNAs. PLoS ONE. 2011;6(1):e15810.
Article CAS PubMed PubMed Central Google Scholar
Allantaz F, Cheng DT, Bergauer T, Ravindran P, Rossier MF, Ebeling M, et al. Expression profiling of human immune cell subsets identifies miRNA-mRNA regulatory relationships correlated with cell type specific expression. PLoS ONE. 2012;7(1):e29979.
Article CAS PubMed PubMed Central Google Scholar
Molnár V, Érsek B, Wiener Z, Tömböl Z, Szabó PM, Igaz P, et al. MicroRNA-132 targets HB-EGF upon IgE-mediated activation in murine and human mast cells. Cell Mol Life Sci. 2012;69(5):793–808.
Article PubMed CAS Google Scholar
Gilicze AB, Wiener Z, Tóth S, Buzás E, Pállinger É, Falcone FH, et al. Myeloid-derived microRNAs, miR-223, miR27a, and miR-652, are dominant players in myeloid regulation. BioMed Res Int. 2014;2014:1–9.
Article CAS Google Scholar
Krist B, Podkalicka P, Mucha O, Mendel M, Sępioł A, Rusiecka OM, et al. miR-378a influences vascularization in skeletal muscles. Cardiovasc Res. 2020;116(7):1386–97.
Trockenbacher A, Suckow V, Foerster J, Winter J, Krauß S, Ropers H-H, et al. MID1, mutated in Opitz syndrome, encodes an ubiquitin ligase that targets phosphatase 2A for degradation. Nat Genet. 2001;29(3):287–94.
Article CAS PubMed Google Scholar
Liu E, Knutzen CA, Krauss S, Schweiger S, Chiang GG. Control of mTORC1 signaling by the Opitz syndrome protein MID1. Proc Natl Acad Sci U S A. 2011;108(21):8680–5.
Article CAS PubMed PubMed Central Google Scholar
Unterbruner K, Matthes F, Schilling J, Nalavade R, Weber S, Winter J, et al. MicroRNAs miR-19, miR-340, miR-374 and miR-542 regulate MID1 protein expression. PLoS ONE. 2018;13(1):e0190437.
Article PubMed PubMed Central CAS Google Scholar
Quaderi NA, Schweiger S, Gaudenz K, Franco B, Rugarli EI, Berger W, et al. Opitz G/BBB syndrome, a defect of midline development, is due to mutations in a new RING finger gene on Xp22. Nat Genet. 1997;17(3):285–91.
Article CAS PubMed Google Scholar
Ma Z, Sun X, Xu D, **ong Y, Zuo B. MicroRNA, miR-374b, directly targets Myf6 and negatively regulates C2C12 myoblasts differentiation. Biochem Biophys Res Commun. 2015;467(4):670–5.
Article CAS PubMed Google Scholar
Jee YH, Wang J, Yue S, Jennings M, Clokie SJ, Nilsson O, et al. mir-374-5p, mir-379-5p, and mir-503-5p regulate proliferation and hypertrophic differentiation of growth plate chondrocytes in male rats. Endocrinology. 2018;159(3):1469–78.
Article CAS PubMed PubMed Central Google Scholar
Rasheed VA, Sreekanth S, Dhanesh SB, Divya MS, Divya TS, Akhila PK, et al. Developmental wave of Brn3b expression leading to RGC fate specification is synergistically maintained by miR-23a and miR-374: miR-23a and 374 in RGC differentiation. Dev Neurobiol. 2014;74(12):1155–71.
Article CAS PubMed Google Scholar
Pan S, Zheng Y, Zhao R, Yang X. miRNA-374 regulates dexamethasone-induced differentiation of primary cultures of porcine adipocytes. Horm Metab Res. 2013;45(07):518–25.
Article CAS PubMed Google Scholar
Su R, Fu S, Zhang Y, Wang R, Zhou Y, Li J, et al. Comparative genomic approach reveals novel conserved microRNAs in Inner Mongolia cashmere goat skin and longissimus dorsi. Mol Biol Rep. 2015;42(5):989–95.
Article CAS PubMed Google Scholar
Sun Z, Zhang Y, Zhang R, Qi X, Su B. Functional divergence of the rapidly evolving miR-513 subfamily in primates. BMC Evol Biol. 2013;13(1):255.
Article PubMed PubMed Central CAS Google Scholar
Schmidt EE, Ohbayashi T, Makino Y, Tamura T, Schibler U. Spermatid-specific overexpression of the TATA-binding protein gene involves recruitment of two potent testis-specific promoters. J Biol Chem. 1997;272(8):5326–34.
Article CAS PubMed Google Scholar
Aravin AA, Sachidanandam R, Girard A, Fejes-Toth K, Hannon GJ. Developmentally regulated piRNA clusters implicate MILI in transposon control. Science. 2007;316(5825):744–7.
Article CAS PubMed Google Scholar
Vourekas A, Zheng Q, Alexiou P, Maragkakis M, Kirino Y, Gregory BD, et al. Mili and Miwi target RNA repertoire reveals piRNA biogenesis and function of Miwi in spermiogenesis. Nat Struct Mol Biol. 2012;19(8):773–81.
Article CAS PubMed PubMed Central Google Scholar
Gou L-T, Dai P, Yang J-H, Xue Y, Hu Y-P, Zhou Y, et al. Pachytene piRNAs instruct massive mRNA elimination during late spermiogenesis. Cell Res. 2014;24(6):680–700.
Article CAS PubMed PubMed Central Google Scholar
Grivna ST, Pyhtila B. Lin H. MIWI associates with translational machinery and PIWI-interacting RNAs (piRNAs) in regulating spermatogenesis. Proc Natl Acad Sci U S A. 2006;103(36):13415–20.
Article CAS PubMed PubMed Central Google Scholar
Aravin AA, Sachidanandam R, Bourc’his D, Schaefer C, Pezic D, Toth KF, et al. A piRNA pathway primed by individual transposons is linked to de novo DNA methylation in mice. Molecular Cell. 2008;31(6):785–99.
Article CAS PubMed PubMed Central Google Scholar
Zhang P, Kang J-Y, Gou L-T, Wang J, Xue Y, Skogerboe G, et al. MIWI and piRNA-mediated cleavage of messenger RNAs in mouse testes. Cell Res. 2015;25(2):193–207.
Article CAS PubMed PubMed Central Google Scholar
Ernst C, Odom DT, Kutter C. The emergence of piRNAs against transposon invasion to preserve mammalian genome integrity. Nat Commun. 2017;8(1):1411.
Article PubMed PubMed Central CAS Google Scholar
Grimson A, Srivastava M, Fahey B, Woodcroft BJ, Chiang HR, King N, et al. Early origins and evolution of microRNAs and Piwi-interacting RNAs in animals. Nature. 2008;455(7217):1193–7.
Article CAS PubMed Google Scholar
Sarkar A, Volff J-N, Vaury C. piRNAs and their diverse roles: a transposable element-driven tactic for gene regulation? FASEB J. 2017;31(2):436–46.
Article CAS PubMed Google Scholar
Assis R, Kondrashov AS. Rapid repetitive element-mediated expansion of piRNA clusters in mammalian evolution. Proc Natl Acad Sci U S A. 2009;106(17):7079–82.
Article CAS PubMed PubMed Central Google Scholar
Zheng K, Wang PJ. Blockade of pachytene piRNA biogenesis reveals a novel requirement for maintaining post-meiotic germline genome integrity. PLoS Genet. 2012;8(11):e1003038.
Article CAS PubMed PubMed Central Google Scholar
Watanabe T, Cheng E, Zhong M, Lin H. Retrotransposons and pseudogenes regulate mRNAs and lncRNAs via the piRNA pathway in the germline. Genome Res. 2015;25(3):368–80.
Article PubMed PubMed Central CAS Google Scholar
Kuramochi-Miyagawa S, Watanabe T, Gotoh K, Totoki Y, Toyoda A, Ikawa M, et al. DNA methylation of retrotransposon genes is regulated by Piwi family members MILI and MIWI2 in murine fetal testes. Genes Dev. 2008;22(7):908–17.
Article CAS PubMed PubMed Central Google Scholar
Aravin A, Gaidatzis D, Pfeffer S, Lagos-Quintana M, Landgraf P, Iovino N, et al. A novel class of small RNAs bind to MILI protein in mouse testes. Nature. 2006;442(7099):203–7.
Article CAS PubMed Google Scholar
Fu A, Jacobs DI, Zhu Y. Epigenome-wide analysis of piRNAs in gene-specific DNA methylation. RNA Biology. 2014;11(10):1301–12.
Article PubMed Google Scholar
Gan H, Lin X, Zhang Z, Zhang W, Liao S, Wang L, et al. piRNA profiling during specific stages of mouse spermatogenesis. RNA. 2011;17(7):1191–203.
Article CAS PubMed PubMed Central Google Scholar
Roovers EF, Rosenkranz D, Mahdipour M, Han C-T, He N, Chuva de Sousa Lopes SM, et al. Piwi proteins and piRNAs in mammalian oocytes and early embryos. Cell Rep. 2015;10(12):2069–82.
Article CAS PubMed Google Scholar
Harding JL, Horswell S, Heliot C, Armisen J, Zimmerman LB, Luscombe NM, et al. Small RNA profiling of Xenopus embryos reveals novel miRNAs and a new class of small RNAs derived from intronic transposable elements. Genome Res. 2014;24(1):96–106.
Article CAS PubMed PubMed Central Google Scholar
Ransohoff JD, Wei Y, Khavari PA. The functions and unique features of long intergenic non-coding RNA. Nat Rev Mol Cell Biol. 2018;19(3):143–57.
Article CAS PubMed Google Scholar
Bhat SA, Ahmad SM, Mumtaz PT, Malik AA, Dar MA, Urwat U, et al. Long non-coding RNAs: Mechanism of action and functional utility. Noncoding RNA Res. 2016;1(1):43–50.
Article PubMed PubMed Central Google Scholar
Loewer S, Cabili MN, Guttman M, Loh Y-H, Thomas K, Park IH, et al. Large intergenic non-coding RNA-RoR modulates reprogramming of human induced pluripotent stem cells. Nat Genet. 2010;42(12):1113–7.
Article CAS PubMed PubMed Central Google Scholar
Rinn JL, Chang HY. Genome regulation by long noncoding RNAs. Annu Rev Biochem. 2012;81(1):145–66.
Article CAS PubMed Google Scholar
Brockdorff N, Ashworth A, Kay GF, McCabe VM, Norris DP, Cooper PJ, et al. The product of the mouse **st gene is a 15 kb inactive X-specific transcript containing no conserved ORF and located in the nucleus. Cell. 1992;71(3):515–26.
Article CAS PubMed Google Scholar
Elisaphenko EA, Kolesnikov NN, Shevchenko AI, Rogozin IB, Nesterova TB, Brockdorff N, et al. A dual origin of the **st gene from a protein-coding gene and a set of transposable elements. PLoS ONE. 2008;3(6):e2521.
Article PubMed PubMed Central CAS Google Scholar
Pandey RR, Mondal T, Mohammad F, Enroth S, Redrup L, Komorowski J, et al. Kcnq1ot1 antisense noncoding RNA mediates lineage-specific transcriptional silencing through chromatin-level regulation. Mol Cell. 2008;32(2):232–46.
Article CAS PubMed Google Scholar
Nagano T, Mitchell JA, Sanz LA, Pauler FM, Ferguson-Smith AC, Feil R, et al. The Air noncoding RNA epigenetically silences transcription by targeting G9a to chromatin. Science. 2008;322(5908):1717–20.
Article CAS PubMed Google Scholar
Delás MJ, Hannon GJ. lncRNAs in development and disease: from functions to mechanisms. Open Biol. 2017;7(7):170121.
Article PubMed PubMed Central CAS Google Scholar
Wilkes MC, Repellin CE, Sakamoto KM. Beyond mRNA: The role of non-coding RNAs in normal and aberrant hematopoiesis. Mol Genet Metab. 2017;122(3):28–38.
Article CAS PubMed PubMed Central Google Scholar
Ng S-Y, Lin L, Soh BS, Stanton LW. Long noncoding RNAs in development and disease of the central nervous system. Trends Genet. 2013;29(8):461–8.
Article CAS PubMed Google Scholar
Necsulea A, Soumillon M, Warnefors M, Liechti A, Daish T, Zeller U, et al. The evolution of lncRNA repertoires and expression patterns in tetrapods. Nature. 2014;505(7485):635–40.
Article CAS PubMed Google Scholar
Hezroni H, Koppstein D, Schwartz MG, Avrutin A, Bartel DP, Ulitsky I. Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species. Cell Rep. 2015;11(7):1110–22.
Article CAS PubMed PubMed Central Google Scholar
Kutter C, Watt S, Stefflova K, Wilson MD, Goncalves A, Ponting CP, et al. Rapid turnover of long noncoding RNAs and the evolution of gene expression. PLoS Genet. 2012;8(7):e1002841.
Article CAS PubMed PubMed Central Google Scholar
Popadin K, Gutierrez-Arcelus M, Dermitzakis ET, Antonarakis SE. Genetic and epigenetic regulation of human lincRNA gene expression. Am J Hum Genet. 2013;93(6):1015–26.
Article CAS PubMed PubMed Central Google Scholar
Washietl S, Kellis M, Garber M. Evolutionary dynamics and tissue specificity of human long noncoding RNAs in six mammals. Genome Res. 2014;24(4):616–28.
Article CAS PubMed PubMed Central Google Scholar
Kelley D, Rinn J. Transposable elements reveal a stem cell-specific class of long noncoding RNAs. Genome Biol. 2012;13(11):R107.
Article PubMed PubMed Central CAS Google Scholar
Kapusta A, Kronenberg Z, Lynch VJ, Zhuo X, Ramsay L, Bourque G, et al. Transposable elements are major contributors to the origin, diversification, and regulation of vertebrate long noncoding RNAs. PLoS Genet. 2013;9(4):e1003470.
Article CAS PubMed PubMed Central Google Scholar
Kannan S, Chernikova D, Rogozin IB, Poliakov E, Managadze D, Koonin EV, et al. Transposable element insertions in long intergenic non-coding RNA genes. Front Bioeng Biotechnol. 2015;3:71.
Carlevaro-Fita J, Polidori T, Das M, Navarro C, Zoller TI, Johnson R. Ancient exapted transposable elements promote nuclear enrichment of human long noncoding RNAs. Genome Res. 2019;29(2):208–22.
Article CAS PubMed PubMed Central Google Scholar
Krchňáková Z, Thakur PK, Krausová M, Bieberstein N, Haberman N. Müller-McNicoll M, et al. Splicing of long non-coding RNAs primarily depends on polypyrimidine tract and 5′ splice-site sequences due to weak interactions with SR proteins. Nucleic Acids Res. 2019;47(2):911–28.
Article PubMed CAS Google Scholar
Johnson R, Guigo R. The RIDL hypothesis: transposable elements as functional domains of long noncoding RNAs. RNA. 2014;20(7):959–76.
Article CAS PubMed PubMed Central Google Scholar
Loda A. Heard E. **st RNA in action: Past, present, and future. PLoS Genet. 2019;15(9):e1008333.
Article PubMed PubMed Central CAS Google Scholar
Lyon MF. The Lyon and the LINE hypothesis. Semin Cell Dev Biol. 2003;14(6):313–8.
Article CAS PubMed Google Scholar
Tang YA, Huntley D, Montana G, Cerase A, Nesterova TB, Brockdorff N. Efficiency of **st-mediated silencing on autosomes is linked to chromosomal domain organisation. Epigenetics Chromatin. 2010;3(1):10.
Article PubMed PubMed Central CAS Google Scholar
Chow JC, Ciaudo C, Fazzari MJ, Mise N, Servant N, Glass JL, et al. LINE-1 activity in facultative heterochromatin formation during X chromosome inactivation. Cell. 2010;141(6):956–69.
Article CAS PubMed Google Scholar
Casanova M, Moscatelli M, Chauvière LÉ, Huret C, Samson J, Liyakat Ali TM, et al. A primate-specific retroviral enhancer wires the XACT lncRNA into the core pluripotency network in humans. Nat Commun. 2019;10(1):5652.
Article PubMed PubMed Central CAS Google Scholar
Ramsay L, Marchetto MC, Caron M, Chen S-H, Busche S, Kwan T, et al. Conserved expression of transposon-derived non-coding transcripts in primate stem cells. BMC Genomics. 2017;18(1):214.
Article PubMed PubMed Central CAS Google Scholar
The FANTOM Consortium, Fort A, Hashimoto K, Yamada D, Salimullah M, Keya CA, et al. Deep transcriptome profiling of mammalian stem cells supports a regulatory role for retrotransposons in pluripotency maintenance. Nat Genet. 2014;46(6):558–66.
Article CAS Google Scholar
Lu X, Sachs F, Ramsay L, Jacques P-É, Göke J, Bourque G, et al. The retrovirus HERVH is a long noncoding RNA required for human embryonic stem cell identity. Nat Struct Mol Biol. 2014;21(4):423–5.
Article CAS PubMed Google Scholar
Wang J, **e G, Singh M, Ghanbarian AT, Raskó T, Szvetnik A, et al. Primate-specific endogenous retrovirus-driven transcription defines naive-like stem cells. Nature. 2014;516(7531):405–9.
Article CAS PubMed Google Scholar
Durruthy-Durruthy J, Sebastiano V, Wossidlo M, Cepeda D, Cui J, Grow EJ, et al. The primate-specific noncoding RNA HPAT5 regulates pluripotency during human preimplantation development and nuclear reprogramming. Nat Genet. 2016;48(1):44–52.
Article CAS PubMed Google Scholar
Jachowicz JW, Bing X, Pontabry J, Bošković A, Rando OJ, Torres-Padilla M-E. LINE-1 activation after fertilization regulates global chromatin accessibility in the early mouse embryo. Nat Genet. 2017;49(10):1502–10.
Article CAS PubMed Google Scholar
Percharde M, Lin C-J, Yin Y, Guan J, Peixoto GA, Bulut-Karslioglu A, et al. A LINE1-Nucleolin partnership regulates early development and ESC identity. Cell. 2018;174(2):391–405.e19.
Article CAS PubMed PubMed Central Google Scholar
Zucchelli S, Fasolo F, Russo R, Cimatti L, Patrucco L, Takahashi H, et al. SINEUPs are modular antisense long non-coding RNAs that increase synthesis of target proteins in cells. Front Cell Neurosci. 2015;9:174.
Podbevšek P, Fasolo F, Bon C, Cimatti L, Reißer S, Carninci P, et al. Structural determinants of the SINE B2 element embedded in the long non-coding RNA activator of translation AS Uchl1. Sci Rep. 2018;8(1):3189.
Article PubMed PubMed Central CAS Google Scholar
Fasolo F, Patrucco L, Volpe M, Bon C, Peano C, Mignone F, et al. The RNA-binding protein ILF3 binds to transposable element sequences in SINEUP lncRNAs. FASEB J. 2019;33(12):13572–89.
Article CAS PubMed PubMed Central Google Scholar
Liu Y, Fallon L, Lashuel HA, Liu Z, Lansbury PT. The UCH-L1 gene encodes two opposing enzymatic activities that affect α-synuclein degradation and Parkinson’s disease susceptibility. Cell. 2002;111(2):209–18.
Article CAS PubMed Google Scholar
Carrieri C, Cimatti L, Biagioli M, Beugnet A, Zucchelli S, Fedele S, et al. Long non-coding antisense RNA controls Uchl1 translation through an embedded SINEB2 repeat. Nature. 2012;491(7424):454–7.
Article CAS PubMed Google Scholar
Schein A, Zucchelli S, Kauppinen S, Gustincich S, Carninci P. Identification of antisense long noncoding RNAs that function as SINEUPs in human cells. Sci Rep. 2016;6(1):33605.
Article CAS PubMed PubMed Central Google Scholar
Hughes JJ, Alkhunaizi E, Kruszka P, Pyle LC, Grange DK, Berger SI, et al. Loss-of-function variants in PPP1R12A: from isolated sex reversal to holoprosencephaly spectrum and urogenital malformations. Am J Hum Genet. 2020;106(1):121–8.
Article CAS PubMed Google Scholar
Barresi MJF, Burton S, Dipietrantonio K, Amsterdam A, Hopkins N, Karlstrom RO. Essential genes for astroglial development and axon pathfinding during zebrafish embryogenesis. Dev Dyn. 2010;239(10):2603–18.
Article CAS PubMed PubMed Central Google Scholar
Ulitsky I, Shkumatava A, Jan CH, Sive H, Bartel DP. Conserved function of lincRNAs in vertebrate embryonic development despite rapid sequence evolution. Cell. 2011;147(7):1537–50.
Article CAS PubMed PubMed Central Google Scholar
Sarangdhar MA, Chaubey D, Srikakulam N, Pillai B. Parentally inherited long non-coding RNA Cyrano is involved in zebrafish neurodevelopment. Nucleic Acids Res. 2018;46(18):9726–35.
Article CAS PubMed PubMed Central Google Scholar
Bourque G, Leong B, Vega VB, Chen X, Lee YL, Srinivasan KG, et al. Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res. 2008;18(11):1752–62.
Article CAS PubMed PubMed Central Google Scholar
Sundaram V, Cheng Y, Ma Z, Li D, **ng X, Edge P, et al. Widespread contribution of transposable elements to the innovation of gene regulatory networks. Genome Res. 2014;24(12):1963–76.
Article CAS PubMed PubMed Central Google Scholar
Nikitin D, Garazha A, Sorokin M, Penzar D, Tkachev V, Markov A, et al. Retroelement—linked transcription factor binding patterns point to quickly develo** molecular pathways in human evolution. Cells. 2019;8(2):130.
Article CAS PubMed Central Google Scholar
Trizzino M, Kapusta A, Brown CD. Transposable elements generate regulatory novelty in a tissue-specific fashion. BMC Genomics. 2018;19(1):468.
Article PubMed PubMed Central CAS Google Scholar
Simonti CN, Pavličev M, Capra JA. Transposable element exaptation into regulatory regions is rare, influenced by evolutionary age, and subject to pleiotropic constraints. Mol Biol Evol. 2017;34(11):2856–69.
Article CAS PubMed PubMed Central Google Scholar
Ferrigno O, Virolle T, Djabari Z, Ortonne J-P, White RJ, Aberdam D. Transposable B2 SINE elements can provide mobile RNA polymerase II promoters. Nat Genet. 2001;28(1):77–81.
Article CAS PubMed Google Scholar
Shankar R, Grover D, Brahmachari SK, Mukerji M. Evolution and distribution of RNA polymerase II regulatory sites from RNA polymerase III dependant mobile Alu elements. BMC Evol Biol. 2004;4(1):37.
Article PubMed PubMed Central CAS Google Scholar
Cohen CJ, Lock WM, Mager DL. Endogenous retroviral LTRs as promoters for human genes: A critical assessment. Gene. 2009;448(2):105–14.
Article CAS PubMed Google Scholar
Nishihara H, Kobayashi N, Kimura-Yoshida C, Yan K, Bormuth O, Ding Q, et al. Coordinately co-opted multiple transposable elements constitute an enhancer for wnt5a expression in the mammalian secondary palate. PLoS Genet. 2016;12(10):e1006380.
Article PubMed PubMed Central CAS Google Scholar
Yamaguchi TP, Bradley A, McMahon AP, Jones S. A Wnt5a pathway underlies outgrowth of multiple structures in the vertebrate embryo. Development. 1999;126(6):1211–23.
Article CAS PubMed Google Scholar
Ge SX. Exploratory bioinformatics investigation reveals importance of “junk” DNA in early embryo development. BMC Genomics. 2017;18(1):200.
Article PubMed PubMed Central CAS Google Scholar
Jacques P-É, Jeyakani J, Bourque G. The majority of primate-specific regulatory sequences are derived from transposable elements. PLoS Genet. 2013;9(5):e1003504.
Article CAS PubMed PubMed Central Google Scholar
Kunarso G, Chia N-Y, Jeyakani J, Hwang C, Lu X, Chan Y-S, et al. Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nat Genet. 2010;42(7):631–4.
Article CAS PubMed Google Scholar
Macfarlan TS, Gifford WD, Driscoll S, Lettieri K, Rowe HM, Bonanomi D, et al. Embryonic stem cell potency fluctuates with endogenous retrovirus activity. Nature. 2012;487(7405):57–63.
Article CAS PubMed PubMed Central Google Scholar
Ito J, Sugimoto R, Nakaoka H, Yamada S, Kimura T, Hayano T, et al. Systematic identification and characterization of regulatory elements derived from human endogenous retroviruses. PLoS Genet. 2017;13(7):e1006883.
Article PubMed PubMed Central CAS Google Scholar
Ecco G, Cassano M, Kauzlaric A, Duc J, Coluccio A, Offner S, et al. Transposable elements and their KRAB-ZFP controllers regulate gene expression in adult tissues. Dev Cell. 2016;36(6):611–23.
Article CAS PubMed PubMed Central Google Scholar
Sasaki T, Nishihara H, Hirakawa M, Fujimura K, Tanaka M, Kokubo N, et al. Possible involvement of SINEs in mammalian-specific brain formation. Proc Natl Acad Sci U S A. 2008;105(11):4220–5.
Article CAS PubMed PubMed Central Google Scholar
Alcamo EA, Chirivella L, Dautzenberg M, Dobreva G, Fariñas I, Grosschedl R, et al. Satb2 regulates callosal projection neuron identity in the develo** cerebral cortex. Neuron. 2008;57(3):364–77.
Article CAS PubMed Google Scholar
Britanova O, de Juan Romero C, Cheung A, Kwan KY, Schwark M, Gyorgy A, et al. Satb2 is a postmitotic determinant for upper-layer neuron specification in the neocortex. Neuron. 2008;57(3):378–92.
Article CAS PubMed Google Scholar
Notwell JH, Chung T, Heavner W, Bejerano G. A family of transposable elements co-opted into developmental enhancers in the mouse neocortex. Nat Commun. 2015;6(1):6644.
Article CAS PubMed Google Scholar
Uemura O, Okada Y, Ando H, Guedj M, Higashijima S, Shimazaki T, et al. Comparative functional genomics revealed conservation and diversification of three enhancers of the isl1 gene for motor and sensory neuron-specific expression. Dev Biol. 2005;278(2):587–606.
Article CAS PubMed Google Scholar
Bejerano G, Lowe CB, Ahituv N, King B, Siepel A, Salama SR, et al. A distal enhancer and an ultraconserved exon are derived from a novel retroposon. Nature. 2006;441(7089):87–90.
Article CAS PubMed Google Scholar
Crepaldi L, Policarpi C, Coatti A, Sherlock WT, Jongbloets BC, Down TA, et al. Binding of TFIIIC to SINE elements controls the relocation of activity-dependent neuronal genes to transcription factories. PLoS Genet. 2013;9(8):e1003699.
Article CAS PubMed PubMed Central Google Scholar
**e M, Hong C, Zhang B, Lowdon RF, **ng X, Li D, et al. DNA hypomethylation within specific transposable element families associates with tissue-specific enhancer landscape. Nat Genet. 2013;45(7):836–41.
Article CAS PubMed PubMed Central Google Scholar
Trizzino M, Park Y, Holsbach-Beltrame M, Aracena K, Mika K, Caliskan M, et al. Transposable elements are the primary source of novelty in primate gene regulation. Genome Res. 2017;27(10):1623–33.
Article CAS PubMed PubMed Central Google Scholar
Herpin A, Braasch I, Kraeussling M, Schmidt C, Thoma EC, Nakamura S, et al. Transcriptional rewiring of the sex determining dmrt1 gene duplicate by transposable elements. PLoS Genet. 2010;6(2):e1000844.
Article PubMed PubMed Central CAS Google Scholar
Nishihara H. Retrotransposons spread potential cis-regulatory elements during mammary gland evolution. Nucleic Acids Res. 2019;47(22):11551–62.
CAS PubMed PubMed Central Google Scholar
Peaston AE, Evsikov AV, Graber JH, de Vries WN, Holbrook AE, Solter D, et al. Retrotransposons regulate host genes in mouse oocytes and preimplantation embryos. Dev Cell. 2004;7(4):597–606.
Article CAS PubMed Google Scholar
Franke V, Ganesh S, Karlic R, Malik R, Pasulka J, Horvat F, et al. Long terminal repeats power evolution of genes and gene expression programs in mammalian oocytes and zygotes. Genome Res. 2017;27(8):1384–94.
Article CAS PubMed PubMed Central Google Scholar
Flemr M, Malik R, Franke V, Nejepinska J, Sedlacek R, Vlahovicek K, et al. A retrotransposon-driven dicer isoform directs endogenous small interfering RNA production in mouse oocytes. Cell. 2013;155(4):807–16.
Article CAS PubMed Google Scholar
Davis MP, Carrieri C, Saini HK, Dongen S, Leonardi T, Bussotti G, et al. Transposon-driven transcription is a conserved feature of vertebrate spermatogenesis and transcript evolution. EMBO Rep. 2017;18(7):1231–47.
Article CAS PubMed PubMed Central Google Scholar
Prudhomme S, Oriol G, Mallet F. A retroviral promoter and a cellular enhancer define a bipartite element which controls env ERVWE1 placental expression. J Virol. 2004;78(22):12157–68.
Article CAS PubMed PubMed Central Google Scholar
Lynch VJ, Nnamani MC, Kapusta A, Brayer K, Plaza SL, Mazur EC, et al. Ancient transposable elements transformed the uterine regulatory landscape and transcriptome during the evolution of mammalian pregnancy. Cell Rep. 2015;10(4):551–61.
Article CAS PubMed PubMed Central Google Scholar
Lynch VJ, Leclerc RD, May G, Wagner GP. Transposon-mediated rewiring of gene regulatory networks contributed to the evolution of pregnancy in mammals. Nat Genet. 2011;43(11):1154–9.
Article CAS PubMed Google Scholar
Schulte AM, Lai S, Kurtz A, Czubayko F, Riegel AT, Wellstein A. Human trophoblast and choriocarcinoma expression of the growth factor pleiotrophin attributable to germ-line insertion of an endogenous retrovirus. Proc Natl Acad Sci. 1996;93(25):14759–64.
Article CAS PubMed PubMed Central Google Scholar
Bi S, Gavrilova O, Gong D-W, Mason MM, Reitman M. Identification of a placental enhancer for the human leptin gene. J Biol Chem. 1997;272(48):30583–8.
Article CAS PubMed Google Scholar
Ball M, Carmody M, Wynne F, Dockery P, Aigner A, Cameron I, et al. Expression of pleiotrophin and its receptors in human placenta suggests roles in trophoblast life cycle and angiogenesis. Placenta. 2009;30(7):649–53.
Article CAS PubMed Google Scholar
Pérez-Pérez A, Toro A, Vilariño-García T, Maymó J, Guadix P, Dueñas JL, et al. Leptin action in normal and pathological pregnancies. J Cell Mol Med. 2017;22(2):716–27.
Kamat A, Hinshelwood MM, Murry BA, Mendelson CR. Mechanisms in tissue-specific regulation of estrogen biosynthesis in humans. Trends Endocrinol Metab. 2002;13(3):122–8.
Article CAS PubMed Google Scholar
van de Lagemaat LN, Landry J-R, Mager DL, Medstrand P. Transposable elements in mammals promote regulatory variation and diversification of genes with specialized functions. Trends Genet. 2003;19(10):530–6.
Article PubMed CAS Google Scholar
Stocco C. Tissue physiology and pathology of aromatase. Steroids. 2012;77(1–2):27–35.
Article CAS PubMed Google Scholar
Chishima T, Iwakiri J, Hamada M. Identification of transposable elements contributing to tissue-specific expression of long non-coding RNAs. Genes. 2018;9(1):23.
Article PubMed Central CAS Google Scholar
Gerlo S, Davis JRE, Mager DL, Kooijman R. Prolactin in man: a tale of two promoters. Bioessays. 2006;28(10):1051–5.
Article CAS PubMed PubMed Central Google Scholar
Jabbour H, Critchley H. Potential roles of decidual prolactin in early pregnancy. Reproduction. 2001;121(2):197–205.
Emera D, Casola C, Lynch VJ, Wildman DE, Agnew D, Wagner GP. Convergent evolution of endometrial prolactin expression in primates, mice, and elephants through the independent recruitment of transposable elements. Mol Biol Evol. 2012;29(1):239–47.
Article CAS PubMed Google Scholar
Chuong EB, Rumi MAK, Soares MJ, Baker JC. Endogenous retroviruses function as species-specific enhancer elements in the placenta. Nat Genet. 2013;45(3):325–9.
Article CAS PubMed PubMed Central Google Scholar
Zheng H, **e W. The role of 3D genome organization in development and cell differentiation. Nat Rev Mol Cell Biol. 2019;20(9):535–50.
Article CAS PubMed Google Scholar
Lupiáñez DG, Kraft K, Heinrich V, Krawitz P, Brancati F, Klopocki E, et al. Disruptions of topological chromatin domains cause pathogenic rewiring of gene-enhancer interactions. Cell. 2015;161(5):1012–25.
Article PubMed PubMed Central CAS Google Scholar
Medrano-Fernández A, Barco A. Nuclear organization and 3D chromatin architecture in cognition and neuropsychiatric disorders. Mol Brain. 2016;9(1):83.
Article PubMed PubMed Central CAS Google Scholar
Davis L, Onn I, Elliott E. The emerging roles for the chromatin structure regulators CTCF and cohesin in neurodevelopment and behavior. Cell Mol Life Sci. 2018;75(7):1205–14.
Article CAS PubMed Google Scholar
Udvardy A. Dividing the empire: boundary chromatin elements delimit the territory of enhancers. EMBO J. 1999;18(1):1–8.
Article CAS PubMed PubMed Central Google Scholar
Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485(7398):376–80.
Article CAS PubMed PubMed Central Google Scholar
Bell AC, West AG, Felsenfeld G. The protein CTCF is required for the enhancer blocking activity of vertebrate insulators. Cell. 1999;98(3):387–96.
Article CAS PubMed Google Scholar
Choudhary MN, Friedman RZ, Wang JT, Jang HS, Zhuo X, Wang T. Co-opted transposons help perpetuate conserved higher-order chromosomal structures. Genome Biol. 2020;21(1):16.
Article CAS PubMed PubMed Central Google Scholar
Schmidt D, Schwalie PC, Wilson MD, Ballester B, Gonçalves Â, Kutter C, et al. Waves of retrotransposon expansion remodel genome organization and CTCF binding in multiple mammalian lineages. Cell. 2012;148(1–2):335–48.
Article CAS PubMed PubMed Central Google Scholar
Thybert D, Roller M, FCP N, Fiddes I, Streeter I, Feig C, et al. Repeat associated mechanisms of genome evolution and function revealed by the Mus caroli and Mus pahari genomes. Genome Res. 2018;28(4):448–59.
Diehl AG, Ouyang N, Boyle AP. Transposable elements contribute to cell and species-specific chromatin loo** and gene regulation in mammalian genomes. Nat Commun. 2020;11(1):1796.
Article CAS PubMed PubMed Central Google Scholar
Kaaij LJT, Mohn F, van der Weide RH, de Wit E, Bühler M. The ChAHP Complex Counteracts Chromatin Loo** at CTCF Sites that Emerged from SINE Expansions in Mouse. Cell. 2019;178(6):1437–1451.e14.
Article CAS PubMed Google Scholar
Zhang Y, Li T, Preissl S, Amaral ML, Grinstein JD, Farah EN, et al. Transcriptionally active HERV-H retrotransposons demarcate topologically associating domains in human pluripotent stem cells. Nat Genet. 2019;51(9):1380–8.
Article CAS PubMed PubMed Central Google Scholar
Wang J, Vicente-García C, Seruggia D, Moltó E, Fernandez-Miñán A, Neto A, et al. MIR retrotransposon sequences provide insulators to the human genome. Proc Natl Acad Sci U S A. 2015;112(32):E4428–37.
Article CAS PubMed PubMed Central Google Scholar
Lunyak VV, Prefontaine GG, Núñez E, Cramer T, Ju B-G, Ohgi KA, et al. Developmentally regulated activation of a SINE B2 repeat as a domain boundary in organogenesis. Science. 2007;317(5835):248–51.
Article CAS PubMed Google Scholar
Roman AC, Benitez DA, Carvajal-Gonzalez JM, Fernandez-Salguero PM. Genome-wide B1 retrotransposon binds the transcription factors dioxin receptor and Slug and regulates gene expression in vivo. Proc Natl Acad Sci U S A. 2008;105(5):1632–7.
Article CAS PubMed PubMed Central Google Scholar
Roman AC, Gonzalez-Rico FJ, Molto E, Hernando H, Neto A, Vicente-Garcia C, et al. Dioxin receptor and SLUG transcription factors regulate the insulator activity of B1 SINE retrotransposons via an RNA polymerase switch. Genome Res. 2011;21(3):422–32.
Article CAS PubMed PubMed Central Google Scholar
Soibam B. Super-lncRNAs: identification of lncRNAs that target super-enhancers via RNA:DNA:DNA triplex formation. RNA. 2017;23(11):1729–42.
Article CAS PubMed PubMed Central Google Scholar
Engreitz JM, Ollikainen N, Guttman M. Long non-coding RNAs: spatial amplifiers that control nuclear structure and gene expression. Nat Rev Mol Cell Biol. 2016;17(12):756–70.
Article CAS PubMed Google Scholar
da Rocha ST, Boeva V, Escamilla-Del-Arenal M, Ancelin K, Granier C, Matias NR, et al. Jarid2 is implicated in the initial **st-induced targeting of PRC2 to the inactive X chromosome. Molecular Cell. 2014;53(2):301–16.
Article PubMed CAS Google Scholar
Beletskii A, Hong Y-K, Pehrson J, Egholm M, Strauss WM. PNA interference map** demonstrates functional domains in the noncoding RNA **st. Proc Natl Acad Sci U S A. 2001;98(16):9215–20.
Article CAS PubMed PubMed Central Google Scholar
Wutz A, Rasmussen TP, Jaenisch R. Chromosomal silencing and localization are mediated by different domains of **st RNA. Nat Genet. 2002;30(2):167–74.
Article CAS PubMed Google Scholar
Casanova EL, Konkel MK. The developmental gene hypothesis for punctuated equilibrium: combined roles of developmental regulatory genes and transposable elements. Bioessays. 2020;42(2):1900173.
Article Google Scholar
Toll-Riera M, Bosch N, Bellora N, Castelo R, Armengol L, Estivill X, et al. Origin of primate orphan genes: A comparative genomics approach. Mol Biol Evol. 2009;26(3):603–12.
Article CAS PubMed Google Scholar
Sniezewski L, Janik S, Laszkiewicz A, Majkowski M, Kisielow P, Cebrat M. The evolutionary conservation of the bidirectional activity of the NWC gene promoter in jawed vertebrates and the domestication of the RAG transposon. Dev Comp Immunol. 2018;81:105–15.
Article CAS PubMed Google Scholar
Kalitsis P, Saffery R. Inherent promoter bidirectionality facilitates maintenance of sequence integrity and transcription of parasitic DNA in mammalian genomes. BMC Genomics. 2009;10:498.
Article PubMed PubMed Central CAS Google Scholar
Long M, Betrán E, Thornton K, Wang W. The origin of new genes: glimpses from the young and old. Nat Rev Genet. 2003;4(11):865–75.
Article CAS PubMed Google Scholar
Gotea V, Makalowski W. Do transposable elements really contribute to proteomes? Trends Genet. 2006;22(5):260–7.
Article CAS PubMed Google Scholar
Bjerregaard B, Holck S, Christensen IJ, Larsson L-I. Syncytin is involved in breast cancer-endothelial cell fusions. Cell Mol Life Sci. 2006;63(16):1906–11.
Article CAS PubMed Google Scholar
Larsen JM, Christensen IJ, Nielsen HJ, Hansen U, Bjerregaard B, Talts JF, et al. Syncytin immunoreactivity in colorectal cancer: Potential prognostic impact. Cancer Lett. 2009;280(1):44–9.
Article CAS PubMed Google Scholar
Strick R, Ackermann S, Langbein M, Swiatek J, Schubert SW, Hashemolhosseini S, et al. Proliferation and cell–cell fusion of endometrial carcinoma are induced by the human endogenous retroviral Syncytin-1 and regulated by TGF-β. J Mol Med. 2006;85(1):23–38.
Article PubMed CAS Google Scholar
Wang O, Zheng Z, Wang Q, ** Y, ** W, Wang Y, et al. ZCCHC12, a novel oncogene in papillary thyroid cancer. J Cancer Res Clin Oncol. 2017;143(9):1679–86.
Article CAS PubMed Google Scholar
Pang SW, Lahiri C, Poh CL, Tan KO. PNMA family: Protein interaction network and cell signalling pathways implicated in cancer and apoptosis. Cell Signal. 2018;45:54–62.
Article CAS PubMed Google Scholar
Papaemmanuil E, Rapado I, Li Y, Potter NE, Wedge DC, Tubio J, et al. RAG-mediated recombination is the predominant driver of oncogenic rearrangement in ETV6-RUNX1 acute lymphoblastic leukemia. Nat Genet. 2014;46(2):116–25.
Article CAS PubMed PubMed Central Google Scholar
Di-Poi N, Montoya-Burgos JI, Duboule D. Atypical relaxation of structural constraints in Hox gene clusters of the green Anole lizard. Genome Res. 2009;19(4):602–10.
Article CAS PubMed PubMed Central Google Scholar
Di-Poï N, Montoya-Burgos JI, Miller H, Pourquié O, Milinkovitch MC, Duboule D. Changes in Hox genes’ structure and function during the evolution of the squamate body plan. Nature. 2010;464(7285):99–103.
Article PubMed CAS Google Scholar
Boissinot S, Bourgeois Y, Manthey JD, Ruggiero RP. The mobilome of reptiles: evolution, structure, and function. Cytogenet Genome Res. 2019;157(1–2):21–33.
Article PubMed Google Scholar
Siomi MC, Sato K, Pezic D, Aravin AA. PIWI-interacting small RNAs: the vanguard of genome defence. Nat Rev Mol Cell Biol. 2011;12(4):246–58.
Article CAS PubMed Google Scholar
Download references
Acknowledgements
Not applicable.
Funding
Our work is supported by grants from the French National Research Agency ANR (EVOBOOSTER project) and the Ecole Normale Supérieure de Lyon (emerging project grant) (to JNV). EE is the recipient of a competitive PhD fellowship from the French Ministry of Higher Education, Research and Innovation.
Author information
Authors and Affiliations
Institut de Genomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, 46 allee d’Italie, F-69364, Lyon, France
Ema Etchegaray, Magali Naville, Jean-Nicolas Volff & Zofia Haftek-Terreau
Authors
Ema Etchegaray
View author publications
You can also search for this author in PubMed Google Scholar
Magali Naville
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Nicolas Volff
View author publications
You can also search for this author in PubMed Google Scholar
Zofia Haftek-Terreau
View author publications
You can also search for this author in PubMed Google Scholar
Contributions
EE has drafted the initial version of the review and designed the figures; MN, JNV and ZH have contributed to the writing of the manuscript. All authors have approved the final version.
Corresponding author
Correspondence to Ema Etchegaray.
Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions
About this article
Cite this article
Etchegaray, E., Naville, M., Volff, JN. et al. Transposable element-derived sequences in vertebrate development. Mobile DNA 12, 1 (2021). https://doi.org/10.1186/s13100-020-00229-5
Download citation
Received: 22 July 2020
Accepted: 15 December 2020
Published: 06 January 2021
DOI: https://doi.org/10.1186/s13100-020-00229-5
Keywords
Transposable elements
Vertebrates
Development
Genetic innovation
Exaptation
Genome evolution

Associated Content

Part of a collection:

All Reviews Collection

Advertisement

Transposable element-derived sequences in vertebrate development

Abstract

Similar content being viewed by others

Evolutionary impact of transposable elements on genomic diversity and lineage-specific innovation in vertebrates

Mammalian Genome Plasticity: Expression Analysis of Transposable Elements

The Relationship between Transposons and Transcription Factors in the Evolution of Eukaryotes

Background