Background

Kidney transplantation is the most effective procedure for clinical treatment of patients with end-stage kidney failure. Based on the stable work of transplanted kidney, electrolyte and acid–base balance could be maintained and patients no longer need the support of dialysis to eliminate the metabolic waste. However, due to the immune response of human body, kidney transplant rejection (KTR), including antibody-medicated rejection (AMR) and T cell-mediated rejection (CMR), is observed to have irreversible effects or even results in the loss of renal allograft function [1]. Although a series of clinical symptoms, e.g., increased serum creatinine, decreased urine output, distending pain in the transplantation area, and fever, are measured for indicating KTR occurrence, the specificity is still limited since the infection and drug response could cause similar phenotypes. Thus, screening sensitive biomarkers to predict and monitor the development of rejection after allogeneic kidney transplantation is of great significance.

MicroRNAs (miRNAs) are small single-stranded non-coding RNAs with the potential as biomarkers for a variety of pathologies. Recent studies found that miRNAs are functional regulators and indicators in the immune processes related to kidney transplantation. For example, Liu et al. reported that downregulated miR-10b could mediate rejection of renal allografts through inhibiting BCL2L11 expression [2]. ** et al. decoded a novel mechanism between miR-650 and BCL11B for preventing rejection after kidney transplantation [

Table 1 Datasets used in this study

The differentially expression analysis were performed on the normalized data to extract differentially expressed (DE) miRNAs and mRNAs between KTR and normal groups. According to the comparison of methods for microarray-based DE-gene identification [15], the empirical bayes (eBayes) method was recommended for raw p-value calculation [16] and the Benjamini–Hochberg false discovery rate (FDR) was calculated for multiple testing and p-value adjustment. For the gene related to multiple probes, the probe with the most significant variation was assigned. The cutoff for DE-miRNA and DE-mRNA extraction was set as the adjusted p-value (adj.p-value) < 0.05.

The miRNA-mRNA regulatory data were retrieved from our previous work, where a human global miRNA-mRNA network was constructed based on the integration of miRNA-mRNA pairs from both experimentally validated and computationally predicted databases [10]. For human PPI data, the online tool STRING (v11.0) was applied and PPIs with the score ≥ 0.7 (high confidence) and active interaction sources except text-mining were chosen for network modeling and analysis [17].

Multi-omics network characterization and miRNA biomarker identification

As described in Fig. 1, an integrated KTR-specific network was constructed by merging interactive data at two omics levels to quantify the power of miRNAs for gene regulation:

First, the DE-miRNAs and DE-mRNAs were mapped onto the human global miRNA-mRNA network, and a KTR-specific miRNA-mRNA network was extracted to measure miRNA regulation at the post-transcriptional level.

Second, it should be acknowledged that genes in the network could influence the downstream biological processes and eventually cause disease phenotypes via interacting with each other. To better prioritize the interactive activity of genes regulated by miRNAs, a KTR-specific PPI network was further developed by integrating KTR-specific mRNAs with human PPI data using STRING online [17]. Here three topological features, i.e., degree centrality (DC), closeness centrality (CC), and betweenness centrality (BC), were calculated to indicate the structural significance of genes in PPIN [18]:

$${\text{DC}}\left( {\text{i}} \right) \, = {\text{ k}}$$

where k is the number of edges that are linked to a given node i. Nodes with high DC are critical in the network.

$${\text{CC}}\left( {\text{i}} \right) \, = \frac{{\text{N}}}{{\sum\nolimits_{{\text{j = 1}}}^{{\text{N}}} {{\text{d}}\left( {{\text{i}},{\text{j}}} \right)} }}$$

where N is the total number of nodes in the network, and d(i, j) represents the distance between i and j. This index represents how close the given node is to all the other nodes in the network. In general, the node with high CC holds the optimum position for evaluating the information flow.

$${\text{BC}}\left( {\text{i}} \right) \, = \sum\nolimits_{x \ne i \ne y} {\frac{{{\text{m}}_{xy}^{i} }}{{{\text{n}}_{xy} }}}$$

where nxy is the number of the shortest paths connecting node x and y, and mixy is the number of the shortest paths connecting node x and y that contain the given node i. Since the interaction of two non-adjacent nodes can be influenced by nodes that lie between them, this index indicates the significance of a given node based on the number of penetrated shortest paths.

In this study, genes whose DC, CC and BC are greater than their average values are defined as CORE set in the network (see Additional file 1), and the KTR-specific miRNA-mRNA network were upgraded by merging PPIN to a cross-level miRNA-mRNA-PPI network (miR-PPIN). In this novel framework, the structural importance of target genes is prioritized according to their contributions in the network, and miRNAs with more CORE targets have the theoretical priority for biomarker discovery.

Based on the above definitions, two feature parameters, i.e., Number of ALL Targeted Genes (NTGA) and Number of Targeted CORE Genes (NTGC), were respectively defined, where the former is numerically equal to the number of genes regulated by miRNAs and the latter is the number of CORE genes targeted by a given miRNA. As hub genes are reported to be functionally important in the network system [19], miRNAs with high NTGA values have strong power in regulation. Moreover, NTGC index strengthens such regulatory pattern by reasonably weighting the contribution of target genes across multi-omics networks. Hence candidate miRNA biomarkers were computationally identified with the following two steps:

Step 1: miRNAs with significantly high NTGA values (p-value < 0.05, Wilcoxon signed-rank test) were extracted from the KTR-specific network.

Step 2: miRNAs with significantly high NTGC values (p-value < 0.05, Wilcoxon signed-rank test) were selected from those screened in Step 1 as candidate biomarkers for expression and functional validation.

ROC analysis and performance validation

The receiver-operating characteristic (ROC) analysis was performed and compared using the statistical tool MedCalc (v20.009) at two levels, i.e., the identified miRNAs as single predictors and a combined signature that incorporated all identified miRNAs. Here the Logistic regression was applied to predict the value of combined signature based on the expression data of each miRNA variables. The area under ROC curve (AUC) was calculated to quantify the predictive power of biomarkers for KTR diagnosis (KTR vs. nKTR) and subty** (AMR vs. CMR).

The potential of miRNAs as biomarkers was further validated by literature-based pathogenic survey, and keywords including “kidney/renal transplantation”, “rejection”, “immune response”, etc. were used to investigate the associations of identified miRNAs with KTR through text-mining in PubMed and Web of Science.

Knowledge-guided key regulatory module extraction and functional exploration

The pathogenesis of identified miRNAs was explored based on a three dimensional “miRNA-gene-pathway” paradigm. First, functional targets of biomarker miRNAs were retrieved from KTR-specific miR-PPIN network for Gene Ontology (GO) enrichment at the biological process (BP) level using the Database for Annotation, Visualization and Integrated Discovery (DAVID, v6.8) online [20]. The criterion for significant term selection was set as p-value < 0.05. Then PPIs in KTR-specific miR-PPIN were clustered by the plug-in Molecular Complex Detection (MCODE) of Cytoscape for molecular complex analysis with default thresholds [21]. Finally, significant clusters were identified and key regulatory modules were extracted by integrating BP knowledge and miRNA regulations for translational “genotype-phenotype” etiologic understanding.