Introduction

Since ancient times, human civilizations have based their own health care on medicinal plants, making the latter one of the greatest heritages of modern mankind.

In the last decades, several pharmaceutical companies have focused their attention on plant secondary metabolites, especially polyphenols, due to their nutraceutical effect on mammalian systems. In detail, over 40% of the modern drugs consists of plant-derived substances or molecules synthesized considering the chemical structure of plant compounds [1].

Several studies have demonstrated the strong biological effects, including antioxidant and anti-inflammatory potential, of a wide range of phytochemicals that might represent natural raw substances by which develo** new anticancer drugs [2].

Malva sylvestris L., also known as common mallow, is one of the most used species in traditional phytomedicine. It is a biennial, or perennial, erect herbaceous plant belonging to the Malvaceae family and native to Europe, Asia, and North Africa [3, 4]. Although the entire plant possesses therapeutic properties, leaves and flowers exhibit the most significant pharmacological activities, due to the presence of mucilage and flavonoids [5], useful for treating gastrointestinal disorders, respiratory diseases, and urological problems [6, 7]. In addition, some studies have demonstrated the capacity of mallow aqueous and hydroalcoholic extract to act as antinociceptive, reducing prostaglandin synthesis, oxidative stress levels, and the expression of proinflammatory factors [8, 9]. In general, M. sylvestris has been documented to be rich in anthocyanins. These secondary metabolites represent a widespread group of flavonoids, mainly responsible for flower and fruit staining in the range from black/dark blue to light pink. Indeed, several natural food colorants derive from these polyphenols [10]. In vitro experimental studies have proved that anthocyanin-rich extracts possess a potential capacity in promoting human health, thanks to their biological properties, including antioxidant, antiinflammatory, antidiabetic, antimicrobial, antineoplastic, and vasoprotective capacity [11,12,13,14,15,16], suggesting a key role at the expense of these flavonoids for the mallow. All this evidence has underlined the necessity to extend the scientific research towards clinical and toxicological aspects of M. sylvestris, in order to clarify the potential mechanism of action of its phytocomplex [3,4,5,6,7,8,9,10,11,12,13,14,15,16,17].

The phytocomplex is the set of chemical substances synthesized and accumulated in the tissues of a specific plant organism, living in a determinate environment. It includes both active and non-bioactive compounds, such as plant secondary metabolites (e.g., flavonoids, alkaloids, terpenes), sugars, and vitamins, which work in synergy. The phytocomplexes can exert their functions both on plant and animal systems, including humans. This potential would represent the scientific basis underlying the concept of ancient and modern phytomedicine/phytotherapy. Recently, one could hypothesise that even microRNAs (miRNAs, miRs) may be considered part of the phytocomplex. In this regard, in the last decade, various scholars have demonstrated and sustained the existence of a cross-kingdom regulation (CKR) mediated by plant miRNAs acquired through the diet [18,19,20,21]. Indeed, miRNAs have been classified as a group of small non-coding RNAs able to regulate the gene expression machinery, promoting degradation or translational arrest of their mRNA targets [22]. In support of CKR, it has been proved that the plant miRNAs can survive to heat cooking treatment (e.g., boiling), digestive processes (e.g., low pH), and enzyme activity, even persisting a few hours into the gastrointestinal tract before moving into bloodstream [42, Full size image

Considering that M. sylvestris is usually consumed as an aqueous decoction or as fresh material and that ethanol determines toxicity in the human body, we decided to dry out totally our hydroalcoholic plant extracts, to resuspend them in pure ddH2O (aq extracts), and finally to perform spectrophotometric analyses for evaluating the residual compounds preserved in these more biocompatible preparations. Total phenols and flavonoids for all aq extracts followed the same trend shown by the pure extracts (Fig. 1E-H). In detail, the concentration of flavonoids in leaves (L aq; Fig. 1F) and phenols in flowers (F aq; Fig. 1G) were perfectly in line with those detected in the respective pure samples (Fig. 1B, C), while the levels of phenolics in the leaf aq extracts (Fig. 1E) appeared lower than its pure counterpart (Fig. 1A). This effect might be linked to the fact that a significant portion of L phenols was soluble only in the organic fraction of the solvent. By contrast, the accumulation of polyphenols in aq F samples compared to their original ones (Fig. 1H vs D) was probably due to the increased solubility of these metabolites (which probably remained aggregated in presence of ethanol), confirming our previous glycosylation theory. In addition, this datum could be justified by two other elements: the presence of anthocyanins or other water-soluble pigments in the petals of the common mallow and the capacity of polyphenols to bind proteins and generate insoluble complexes [48, 49].

An HPLC–DAD analysis was carried out to further characterize M. sylvestris pure and aqueous extracts, detecting and quantifying 20 plant metabolites (Fig. 1I, J; Tables 1 and 2). All the selected molecules were found at least in one sample of leaves and flowers, confirming their ubiquitous nature. Quercetin-3-glucoside was the most abundant compound identified in L pure hydroalcoholic extracts (i.e., 50WEL and 70WEL), reaching the levels of 146.9 and 159.08 ng per mg FMW respectively, while F pure hydroalcoholic extracts (i.e., 50WE, 70WE, and E) were particularly typified by Rosmarinic acid (33.09, 49.15, and 30.42 ng per mg FMW, respectively).

Table 1 HPLC–DAD analysis. Results of M. sylvestris leaf extracts expressed as ng per mg of FMW
Table 2 HPLC–DAD analysis. Results of M. sylvestris flower extracts expressed as ng per mg of FMW

Even though only one portion of the phytocomplex was defined in this chromatographic analysis, results showed that L samples were richer in phenols and flavonoids than F ones, as already evidenced through the previous spectrophotometric measurements. Remarkably, some chemical compounds appeared undetectable in the pure hydroalcoholic extracts but were present in the respective aq samples, and vice versa (although in lesser size). Obviously, this phenomenon might be linked with a different solubility of the plant metabolites between water and ethanol, also explaining the increase of total phenolic and flavonoid compounds estimated by spectrophotometric assays in the aq extracts. In general, L and F 50WE and 70WE macerations revealed the highest contents of the series for the chosen secondary metabolites, indicating that these two extraction procedures were the most profitable in terms of yield and composition. Indeed, it is widely documented that organic solvents are poor diluents for phenolics and flavonoids if not mixed with a certain percentage of water [50,51,52].

As M. sylvestris is known to contain anthocyanins [53], we decided to estimate the level of these phytochemicals in all our extracts, in order to reach a higher level of molecular characterization. To do it, we applied a spectrophotometric method exploiting the peculiar absorption range of the different anthocyanins (i.e., from 490 to 550 nm) [54]. No anthocyanin was recorded in the L samples, while their signals were easily detectable in the F extracts (Fig. 1K, L). At this point, it is necessary to underline that the levels of total flavonoids and those of the anthocyanins would not seem to be in line, being the latter higher than the first ones. However, a reasonable explanation for this evidence could be associated to the fact that anthocyanins are prone to undergo structural transformations and complexation reactions [55] and that maybe the aluminum chloride method is mainly devoted for the quantitation of simple flavonoids. Although anthocyanins are water soluble pigments, we observed that the extraction of these compounds increased when the organic solvent was added to the maceration, reaching the maximum yields with 70%WE and then decreasing with further supplementation of ethanol, as already reported by Cacace and Mazza [56]. MAL3G and PTD3G were the most abundant anthocyanins present in the petals of common mallow, followed by PLG, CYN, DEL, and PEO, in that order (Fig. 1K). It is unsurprising that PEO was detected in low concentration, considering that usually this anthocyanin is usually not typical of the common mallow [3]. In the aq extracts (Fig. 1L), a decrease of anthocyanin concentration was detected compared to the respective pure samples, although they remained stable one each other in terms of proportion. This phenomenon could be linked to degradation events occurring during the lyophilization process and/or to the reduced capacity of the new solvent (i.e., ddH2O) to resuspend these molecules.

All together, these metabolomic analyses allowed us to define more in detail the phytocomplex of leaves and flowers from M. sylvestris and to investigate the solubilization properties of plant secondary metabolites in different extraction solvents.

MiRNome analysis

Recently, beyond secondary metabolites, plant miRNAs have attracted interest and curiosity from the scientific community, due to their CKR capacity of the human gene expression [18,19,20]. Considering this premise, it would be possible to suppose that the beneficial properties of M. sylvestris [57] are attributable also to its content in miRNAs. Thus, first, the presence of two plant miRNAs (i.e., miR397-5p, miR159), chosen for their wide distribution in various plant species and their elevated nucleotide conservation in the plant kingdom [58, 59], was investigated (Fig. 2A). Positive (UniSp6; 5S RNA) and negative (Neg1; Neg2) controls (described in Materials and Methods section) were performed to confirm accuracy and efficiency of the extraction and detection methods. As shown in Fig. 2A, only one out of two selected plant miRNAs (i.e., miR159) was confirmed in the mallow samples (data also validated then by the NGS approach).

Fig. 2
figure 2

MiRNome analysis. A Validation of presence and extraction procedure for plant miRNAs. After miRNA isolation from M. sylvestris flowers (left panel) and leaves (right panel) and retrotranscription reaction, PCR amplifications were carried out and the relative products fractioned on agarose gel and visualised under UV-light. The signals for the UniSp6 (the positive control of the retrotranscription kit), the plant 5S rRNA, and one out of two selected ubiquitous plant miRNAs (i.e., miR397-5p, and miR159) are shown. In addition, the absence of amplicons in the lanes of the negative controls (Neg 1 and Neg 2; see Materials and Methods section) can be noticed. (MW, molecular weight). The full-length image of the gel is reported in Supplemental material 3. B PCA graph based on presence and abundances of the miRNAs detected in the common mallow leaves (L, green) and flowers (F, violet). Red arrows indicate the effect of each miRNA on the first (PC1) and second (PC2) component. C PCoA graph based on the frequency of miRNAs present in M. sylvestris leaves (green spots) and flowers (violet spots). D Graph of the differential abundance for the miRNAs found in leaves (green bars) and flowers (violet bars) of the common mallow. The relative abundance is plotted in log10 on the y-axis. E Heatmap of the abundance of the the detected miRNAs in the different flower and leaf samples, with relative hierarchical clustering

After this preliminary check, taking into account the potential inhomogeneity in miRNA expression linked to possible physiological and metabolic variations and to provide statically significant results, 6 pools of miRNAs were isolated from leaves (i.e., LE01, LE02, LE03) and flowers (i.e., F01, F02, F03) of different M. sylvestris specimens and subjected to NGS analysis. Small-RNA sequencing detected 419.188 reads map** to all samples, for a total of 33 different miRNAs, all attributable to already known families of plant miRNAs. The distribution of the reads for each miRNA among the samples is reported in Table 3, together with their nucleotide sequences. In particular, 31 miRNAs were identified in the leaves, although only 20 present in the whole LE series, while 23 were found in the flowers, with just 17 detectable in all F specimens. Two miRNAs (i.e., miR396c-3p, miR399e-3p) characterized only the flower samples, while 12 were typical of leaves (i.e., miR3600, miR159c-3p, miR172b-3p, miR172c-5p, miR2118, miR398b, miR399c, miR399f-3p, miR403b-3p, miR482a, miR530b, miR8051-5p). The most abundant miRNAs in flowers were miR159, miR166f-p, and miR6300, while in leaves were miR167e-5p, miR159, miR166f-p, miR172c, miR396a, and miR396f-5p.

Table 3 Identified miRNAs. List of the expressed miRNAs detected by NGS analysis in the common mallow, together with their respective nucleotide sequence. The number of raw sequencing reads counted for each miRNA in the samples of leaves (i.e., LE01, LE02, LE03) and flowers (i.e., F01, F02, F03) were indicated

To explore the diversity of the miRNA presence and frequency between leaves and flowers, we performed a PCA (Fig. 2B). The two principal components (PC1 and PC2) together explained 87.4% of the whole dataset variance, that is they were able to separate clearly and group the detected miRNAs in two main clusters, according to their flower or leaf origin. We confirmed this evidence also by carrying out a PCoA (Fig. 2C), a method which considers the dissimilarities among data and not their similarities like in the case of PCA. Furthermore, the distance matrix used as input for the PCoA was employed to assess the statistical significance of the two clusters. In detail, the matrix was evaluated by using the PERMANOVA test. This analysis allowed to verify the predictive effect of the presence for each miRNA on the basis of the plant tissue (i.e., flower vs. leaf) and/or the investigated specimen (i.e., 01 vs. 02 vs. 03), through the calculation of the relative pseudoF, R2, and p-values. Our results evidenced that the localization in plant was the only parameter able to significantly predict for miRNAs’ presence, explained the 91% of the variance.

Small RNA-seq outcomes were normalized and analyzed using the DESeq2 R package, to investigate more in detail the validity of our data. The abundance of all miRNAs in flowers and leaves was reported, although only 16 of them significantly differed between the two plant organs. In detail, 8 miRNAs typified the leaves (i.e., miR396a, miR403b-3p, miR396f-5p, miR172b-3p, miR482a, miR160a, miR167e-5p, miR172c-5p), while other 8 the flowers (i.e., miR396c-3p, miR159c-3p, miR6300, miR3954b-5p, miR395c-3p, miR160b-5p, miR166g-3p, miR164d-5p). The visualization of the expression levels of all miRNAs was reported in Fig. 2D. The results relative to miRNAs’ frequency were also shown as a heatmap (Fig. 2E), where the colors provide a rapid information about the differences found in their abundance in the whole studied series. The hierarchical clustering deriving from the heatmap corroborated our previous suppositions.

It is important to remember that miRNAs play crucial roles in plants, modulating their propagation, reproduction, development, growth, homeostasis, signaling, response to biotic and abiotic stressors, etc. [60]. Moreover, it has been documented that the transcriptional patterns of these small nucleic acids are finely regulated and dependent on time (i.e., life phase), space (i.e., tissue), and exogenous stimuli (e.g., temperature variation) [61,62,63]. Therefore, a bioinformatics analysis was carried out to identify the putative targets of the miRNAs detected in common mallow, to understand their potential functions in the plant and clarify if their expression profiles could vary according to the tissue (i.e., leaf, flower) in which they were synthesized.

The list of the plant mRNA targets predicted for each M. sylvestris miRNA was reported in detail in Supplemental material 1. However, to make all this information more usable to the reader, the most significant results were summarized in Table 4. Overall, 19 miRNAs out of 33 presented as targets at least one transcript encoding for chromosome scaffold proteins (CSSs), which are responsible for holding the chromatin in compact form [64]. Although the condensed structure of heterochromatin mainly depends on histone-related proteins [65], it is possible to hypothesize that also CSSs may be involved in the regulation of DNA close conformation [66]. Therefore, the miRNAs potentially able to modulate CSSs’ mRNAs could exert a key role in the gene expression of M. sylvestris, acting as epigenetic mediators. Examples might be miR398b, miR399c, miR403b-3p, and miR530b.

Table 4 Summary of the bioinformatics prediction against plant transcriptome. For each investigated miRNA (Query), the putative plant target mRNAs showing an expectation (Exp) ≤ 1 (where possible, otherwise the lowest registered values) were listed and described as Unigene-DFCI Gene Index-GORAGI IDs (Accession) and in full (Description). The complete output of this bioinformatics analysis was reported in Supplemental material 1

Regarding the miRNAs found only in the leaves, particular targets were predicted for miR172b-3p, miR172c-5p, miR482a, and miR2118. In detail, the first one would seem to bind and modulate the Ethylene-responsive transcription factors related to APETALA2. As these proteins are involved in several processes (e.g., primary and secondary metabolism, growth programs, response to environmental stimuli) and can act both as activators and repressors of the transcription [67], it is complicated to define the specific role of miR172b-3p in our case. By contrast, the synthesis of miR172c-5p, miR482a, and miR2118 in LE samples suggested a fine regulation of the mitochondrial and plastid functions, since their putative targets were linked to plastid division, cytochrome activity, ATP degradation, and hydrolysis of complex sugars [68, 69].

The two miRNAs typifying the flowers of common mallow, that is miR396c-3p and miR399e-3p, were found to potentially interact with Calcium binding protein and MYB transcription factor, in that order, excluding Chromosome scaffold proteins (already discussed above). Literature reports that calcium and MYB transcription factor are related to flower development and color [70,71,72]; therefore, our results would suggest that these factors were probably no more necessary in the analysed flowers (so, to be down-regulated), as already completely mature.

In general, at this stage, we can only suppose the plant functions in which the detected miRNAs are involved because further molecular investigations specifically focused on each one of them are necessary to clearly understand their role. However, the demonstration of their existence in the common mallow might open new perspectives in the study of the gene regulation for this plant species.

Despite some scholars believe plant miRNAs cannot be absorbed by diet and ascribe the evidence of their presence in human samples to artifacts or contamination [73,74,75], a great part of the scientific community has proved that food and medicinal plant-derived miRNAs can be acquired and transported in the bloodstream, released in specific human tissues, and even capable to perform CKRs of the hosts’ gene expression [18,19,20,21]. This testimony suggests that plant miRNAs may play a key role in modulating human health in physiological and pathological states. In this regard, in COVID time, ginger and grapefruit miRNAs have been even proposed as possible antiviral agents, potentially targeting SARS-CoV-2 genes [76].

Considering the previous premise, the last step of our work consisted in the bioinformatics prediction of the putative human targets, involved in 5 fundamental mammalian cellular processes (i.e., apoptosis, senescence, cell-cycle, oxidative stress, and invasiveness), for the 33 miRNAs sequenced from common mallow. The miRNA target prediction tool used in this work [39, 40] identified, in the human transcriptome, a total of 383 mRNAs that theoretically could be bound and regulated by M. sylvestris miRNAs (Supplemental material 2). Only two miRNAs (i.e., miR530b and miR8051-5p) did not show any human target at high prediction efficiency. To resume the huge amount of data obtained by this analysis, the three most significant human targets for each miRNA were reported in Table 5. In addition, the whole set of predictions were schematized in the graphs shown in Fig. 3, where the number of putative human transcripts was indicated for the most significantly expressed miRNAs in leaves and flowers. Based on this bioinformatics clustering, miR160a, miR168-5p, and miR398b-3p isolated from leaves and miR160b-5p and miR164d-5p from flowers presented the highest number of targets for the analysed pathways. Interestingly, miR172b-3p from leaves showed only targets linked to the oxidative stress, while miR396c-3p and miR6300 from flowers revealed the lowest number of total targets, exactly 5 per each.

Table 5 Summary of the bioinformatics prediction against human transcriptome. For each investigated miRNA (Query), the most significant putative human target mRNAs, showing at least a percentage of maximum likelihood > 0.86, were listed and described as NCBI acronyms (Code) and in full (Description). The complete output of this bioinformatics analysis was reported in Supplemental material 2
Fig. 3
figure 3

Human target prediction. Number of the total putative human mRNA targets predicted for each miRNA present in the common mallow leaf (green graphs) and flower (violet graphs), grouped for cell biological processes (A and B: Apoptosis; C and D: Invasion; E and F: Senescence; G and H: Cell Cycle; I and J: Oxidative stress). Data elaborated from the information reported in Supplemental material 2

In order to identify the main biological processes potentially linked to the whole set of putative targets for M. sylvestris miRNAs, GO enrichment analyses were carried out (Fig. 4). The targets predicted for leaf miRNAs appeared in their entirety able to act on chromatin organization and to negatively influence metabolic and cellular processes in plants (Fig. 4A, B). In parallel, always in plants, the profile of the transcripts modulated by the flower miRNAs converged on regulative mechanisms for chromatin arrangement, epigenetics, development, and response to heat (Fig. 4C, D). These results were quite expected because, as already previously stated, miRNAs would seem to modulate plant gene expression also influencing the heterochromatin/euchromatin ratio. Interestingly, the biological function associated to the temperature variation (i.e., GO:1,900,036) could suggest that flowers were living a change of the environmental condition of their habitat at the sampling time. On the other hand, assuming the existence of the CKR phenomenon, the human GOs detected by the enrichment analysis using the leaf miRNAs’ targets as input data were listed in Fig. 4E, F, and I. In this case, apoptosis, platelet activation, and cell cycle, morphogenesis, and adhesion were the main processes predicted to be potentially controlled. A very similar prediction was also obtained considering the targets of the miRNA profile from flowers (Fig. 4G, H, J), although two new functions were detected (i.e., cell migration and rhythmic process). This last evidence would indicate that miRNAs might represent a significant bioactive component of M. sylvestris derivatives (e.g., decoctions, fresh and dry leaves) on human health, clarifying the medicinal role that this species has played in folk phytotherapy since time immemorial.

Fig. 4
figure 4

Gene Ontology (GO) enrichment analyses. A Bar graph of plant GO enriched terms across input gene lists relative to leaf miRNAs’ putative plant targets. B Top-level plant GO biological processes relative to GO enriched terms reported in panel (A). C Bar graph of plant GO enriched terms across input gene lists relative to flower miRNAs’ putative plant targets. D Top-level plant GO biological processes relative to GO enriched terms reported in panel (C). E Bar graph of human GO enriched terms across input gene lists relative to leaf miRNAs’ putative human targets. F Top-level human GO biological processes relative to GO enriched terms reported in panel (E). G) Bar graph of human GO enriched terms across input gene lists relative to flower miRNAs’ putative human targets. H Top-level human GO biological processes relative to GO enriched terms reported in panel (G). I Network of GO enriched terms relative to the analysis reported in panel (E) and (F). J) Network of GO enriched terms relative to the analysis reported in panel (G) and (H). In the bar graphs the colours indicate different p-values. In the networks, cluster IDs are coloured in different ways, nodes that share the same cluster ID are typically close, and the colour intensity of the conjunctions indicate different p-value (i.e., terms containing more genes tend to have a major significance)