Background

Rats and mice have been the most widely used models in biomedical research and drug development for many years [1,2,3]. However, a shift has taken place that mice rapidly overtake rats as the major model of choice [4]. As a result, the proportion of publications using mice models has increased from about 20% in the 1970s and 1980s to over 50% in the recent neuroscience-related researches. This shift might result from genome knockout technique, which was first used in mice, rather than in rats [5]. However, the rat is the preferred animal model for physiology, toxicology, nutrition, behavior and neoplasia studies. In addition, the rat can reduce the spread of drugs following intracranial injections [6]. These lead to urgent demands to study gene regulation patterns in rat. Benefiting from the creation and evolution of Rat Genome Database (RGD) [7] and the completion of the rat genome sequence in 2004 [8], we could look deep into the genetic rat models.

Transposable elements (TEs) were first discovered in maize and described as “controlling elements” of nearby genes [9]. At present, TEs have been found to exist in almost all species, with the proportions varying from ~1% in Fusarium graminearum to ~85% in maize genome [10,11,12]. It could be categorized into retrotransposons and DNA transposons. The former could be amplified through a copy-and-paste mechanism with an intermediate of the element-encoded RNA, while the latter utilizes a cut-and-paste mechanism to self-propagate with the intermediate of DNA [13, 14]. Retrotransposons could be further subdivided into long terminal repeats (LTRs), long interspersed elements (LINEs) and short interspersed elements (SINEs). L1 elements are the main retrotransposons in mammalian genomes with important roles in mutagenesis [15] and early cancer diagnosis [Full size image

In this study, TE subfamilies were considered to be expressed with the averaged RPKM ≥ 1. A TE subfamily was defined as “commonly expressed TEs” if it was expressed in all organs, developmental stages and sexes. Circos [41] was used to draw the graph of the number of DETEs among organs and links between organs and classes. The clustering of TE subfamilies was performed using Average linkage in MATLAB. Principal variance component analysis (PVCA) leverages the strengths of principal components analysis and variance components analysis to quantify the corresponding proportion of variation of each effect [42]. In this study, it was used to quantify the relative contributions of effects (organ, age, sex and replicate) to total model variance based on the expression matrix of TE subfamilies in different samples.

Identification of differentially expressed and organ-enriched TE subfamilies

In order to compare with expression of genes, we adopted same methods with Yu et al. [34] to identify enriched TEs. For the sake of completeness, we would describe these methods in brief.

A TE subfamily was defined as the DETE between two organs if t-test with a Bonferroni-corrected P-value was ≤ 0.05 and fold change (FC) was ≥ 2 (overexpressed) or ≤ 0.5 (underexpressed). The intersection of DETEs that were overexpressed in any other 10 organs were defined as organ-enriched TE subfamilies. The development-dependent DETEs were evaluated by comparing different developmental stages for each organ. The condition was FC ≥2 or ≤0.5 plus Bonferroni-corrected P-value ≤ 0.05. Except for testis and uterus samples, other 288 samples were separated into 36 groups according to the organ types and developmental stages. FC and t-test were also performed between male and female to identify sex-dependent DETEs in each group.

In each organ, the FC was calculated between two adjacent developmental stages, with the older developmental stage as numerator, in other words, 104- versus 21-weeks old, 21- versus 6-weeks old and 6- versus 2-weeks old.

A TE subfamily with FC ≥2 or ≤0.5 plus Bonferroni-corrected P-value ≤ 0.05 was divided into the “up” pattern or “decrease” pattern, respectively. The other TE subfamilies were divided into “maintain”. Therefore, a TE subfamily could be divided into 1 out of 27 patterns in each organ, ranging from up-up-up (UUU), maintain-maintain-maintain (MMM), to decrease-decrease-decrease (DDD).