Introduction

Climate changes and anthropogenic pressures are the main drivers of a biodiversity decline characterized by an unprecedented rate of species extinction1,2. In addition to a decrease in species composition, habitat reduction and degradation are also causing a loss of species’ genetic diversity3,4. The genetic diversity of natural populations and communities ensures adaptive potential and evolvability5, reducing the risk of extinction6. To address these problems, monitoring actions and conservation strategies are essential. The inter-governmental group on Earth Observations and Biodiversity Observation Network (GEO BON) has developed the Essential Biodiversity Variables (EBVs), which are a set of variables aiming to capture different levels of biodiversity and are divided into 6 classes: genetic composition, species populations, species traits, community composition, ecosystem function, ecosystem structure. Specifically, the class of genetic composition includes genetic diversity and genetic differentiation or genetic distance; the class of species populations includes species distribution and abundances; the class of community composition includes community abundance and taxonomic/phylogenetic diversity7.

The advent of environmental DNA (eDNA) metabarcoding as a high-throughput tool for biodiversity assessment is certainly facilitating data collection for biodiversity monitoring.

Environmental DNA (eDNA) refers to DNA from an environmental sample and it is possible to identify two main ways to obtain this genetic material8,9. The first method involves DNA to be extracted from a bulk sample of organisms. The bulk sampling method has been applied to small planktonic organisms, but also to larger and multicellular species10. The second method uses traces of DNA collected by filtering water or using soil or sediment samples11. The sources of this DNA include cells, faeces, mucous, gametes, degrading cells or tissues and fragments resulting from predation9.

eDNA metabarcoding as a biodiversity assessment tool has already been applied in a variety of environments indicating its capability to unveil the macro-guilds used as ecological indicators as well as ecological communities structure12,12,Statistical OTU diversity estimation

OTU richness has been calculated as the total number of OTUs per sampling site. OTU occurrence has been calculated as the percentage of OTUs occurring in the same number of sampling sites. All calculations have been done through the software Microsoft Excel. Bray–Curtis distance-based Redundancy Analysis (db-RDA) was performed by using the R Package Vegan. Response variables are represented by the occurrence of OTUs in each sampling site, while explanatory variables are represented by the mean abiotic parameter values registered in the Aquatina lagoon during routine surveys, specifically during the sampling period (Salinity, Temperature, Dissolved Oxygen, and pH). Two-way ANOVA was performed for significance values. The results are visualised as an RDA plot obtained using the R Package ggplot2.

Neighbour-joining phylogenetic analysis

OTUs consensus sequences were multiple aligned using Clustal Omega 1.2.2 through the Geneious Prime alignment options, with settings automatically adjusted based on the number of sequences. The obtained multiple sequence alignment was used to construct the Neighbour-Joining tree. For the tree construction, Resampling Method was set on Bootstrap with 674,476 random seeds and 1000 replicates. The genetic distance model was set on Jukes-Cantor, with no outgroup. Tree Build Method was set on Neighbour-Joining.

Results

eDNA detects phytoplankton biodiversity in the lagoon

To analyse the biodiversity of phytoplankton communities characterizing the study area, we identified seven sampling sites along the entire lagoon. DNA was extracted from water samples of the selected sites, and a region of the 18S rRNA gene marker was amplified with primers spanning the V4 region of the gene. Illumina sequencing generated a total of about 3,000,000 raw 18S rRNA reads with an average of about 420,000 sequences per sample, of which 41% were retained after bioinformatic processing (Table S1). The Good’s coverage and Shannon Index rarefaction curve of filtered sequences indicated that the samples reached a plateau (Figs. S1, S2), and therefore sequencing depth was appropriate in all the samples.

Following taxonomic assignment, only the OTUs annotated as phytoplankton taxa and classified beyond the D2 level of the SILVA 138 database were retained, resulting in a total of 2293 phytoplankton OTUs. This eDNA metabarcoding experiment detected a large fraction of phytoplankton groups (Table S2), of which 48.8% were assigned up to genus level and 9% up to species level. Analysing the biodiversity of phytoplankton OTUs, 47% of OTUs belong to the Ochrophyta phylum, 37% of OTUs belong to the Myzozoa phylum, 7% and 6% of OTUs belong to Cryptophyta and Chlorophyta phyla, respectively (Fig. 2). eDNA analysis detected OTUs annotated as nano- and pico-phytoplanktonic genera such as Ostreococcus, Micromonas and Bolidomonas.

Figure 2
figure 2

Total percentage of Phytoplankton OTUs at phyla level.

Interestingly, eDNA from water samples, in our experimental conditions, was able to detect a wide fraction of phytoplankton taxa ranging over different phytoplankton size classes. In addition, both the total and relative numbers of OTUs belonging to the identified phyla present a specific and different distribution in the seven selected sites of the lagoon (Fig. 3).

Figure 3
figure 3

Number of OTUs at phyla level in each of the seven sampling sites.

Phytoplankton OTU richness and occurrence across lagoon sites

To quantify the differences in phytoplankton OTU richness and composition, we compiled the OTU presence/absence matrix (Table S3). Based on this matrix, OTU richness and OTU occurrence were calculated.

OTU richness represents the number of phytoplankton OTUs observed per sampling site, revealing that sites A (n. of OTUs 1771), E (n. of OTUs 1599), and C (n. of OTUs 1417) host most of the OTU richness, site AF (n. of OTUs 496) appears to be the poorest, and sites B (n. of OTUs 899), D (n. of OTUs 909), and F (n. of OTUs 959) present intermediate values.

The OTU occurrence has been calculated as the percentage of phytoplankton OTUs occurring in the same number of sampling sites and visualised as a bar plot (Fig. 4). The majority of OTUs (26.72%) occur in two sampling sites, while the minority of OTUs (3.46%) occur in all seven sampling sites.

Figure 4
figure 4

OTUs occurrence analysis calculated as the percentage of OTUs occurring in the same number of sampling sites.

Abiotic parameters explain occurrence heterogeneity among lagoon sites

To infer the dissimilarity between sampling sites and correlate it to the main abiotic drivers that characterize the environment under study through a constrained ordination, we performed a Bray–Curtis distance-based Redundancy Analysis (db-RDA). All loading and score values are summarised in Table S4. The most commonly routinely surveyed environmental parameters have been used as explanatory variables (independent variables): Temperature (T), Salinity (PSU), Dissolved Oxygen (DO) and pH (Fig. 5). Specifically, these environmental parameters are particularly relevant for transitional water ecosystems, in which they describe their heterogeneity. The explained variables (dependent variables) are represented by the OTUs composition in each sampling site. According to the p-values calculated for each of the abiotic parameters, it results that salinity (p < 0.01) and temperature (p < 0.01) are the main explanatory variables for the phytoplankton OTUs distribution across the sampling sites, while pH (p < 0.05) and especially DO (p < 0.1) do not seem to affect it.

Figure 5
figure 5

Bray–Curtis distance-based Redundancy Analysis (db-RDA) calculated to explain the differences in the phytoplankton OTUs distribution across the Aquatina di Frigole lagoon. PSU = Salinity (PSU); T = Temperature (°C); DO = Dissolved oxygen (mg/L). Black dots and letters indicate the seven sampling sites.

The biplot of this analysis displays sites A, C, and D grouped, while sites AF, B, E, and F do not show any grou** pattern. This result reflects the similar environmental conditions occurring in sites A, C, and D, where comparable levels of the considered environmental variables have been registered. The proximity of these three sites in the biplot also indicates similarities in the phytoplankton OTUs composition across these sites. At the same time, the four sites far apart in the biplot represent areas with distinct environmental differences, influencing the phytoplankton OTUs distribution in a way that separates them apart and from the other three sites.

Exploring genetic diversity across the lagoon: the case of Tetraselmis genus

The analysis of the presence/absence matrix of OTUs highlighted that different OTUs assigned to the same taxon exhibited a different distribution across sampling sites. Specifically, we focused on multiple OTUs assigned to the Tetraselmis genus because the relative and different OTUs presented a clearly different distribution among sampling sites. The bioinformatics analysis recovered 8 OTUs assigned to the Tetraselmis genus highly differently occurring across the sampling sites (Table S5). 2 OTUs occurred only in the marine site AF, 4 OTUs presented wide occurrence across sampling sites, and 2 OTUs presented a peculiar occurrence in two sampling sites. We analysed the genetic distance, calculated as nucleotide substitutions in a phylogenetic tree, to inspect if the differences in the spatial distribution reflect genetic variations (Fig. 6).

Figure 6
figure 6

Neighbour-joining phylogenetic tree (Jukes-Cantor genetic distance model with 674,476 random seed and 1000 bootstrap replicates) of OTUs of Tetraselmis genus for V4 region of 18S marker. The Clustal omega multiple sequence alignment consists of about 390 bp region. Branch labels indicate substitutions per site.

Interestingly, the OTUs present only in the marine site AF (Tetraselmis 6 and 8) clustered and separated from OTUs widely distributed across the sampling sites, especially Tetraselmis 1, 2, 3, but significantly also from Tetraselmis 5.

This analysis suggests that the different spatial distribution of OTUs, at least for the Tetraselmis genus, presents a relation with the genetic background.

Discussion

Coastal lagoons provide valuable ecosystem services and are suitable areas for biodiversity conservation24. However, they are sensitive to climatic and anthropogenic pressures, which are compromising their ecological integrity25, 26. Among the ecological indicators, eukaryotic phytoplankton is known to exhibit remarkable species richness in transitional waters27. However, molecular phytoplankton surveys are still in their infancy in these environments, where they are only limited to the detection of harmful algal blooms28. In this research, we conducted eDNA metabarcoding with the 18S rRNA gene marker in a protected Mediterranean coastal lagoon. We selected 18S as a gene marker because it is a largely represented barcode gene for phytoplankton in Mediterranean ecosystems29. We assessed an experimental plan aimed to overcome specific issues related to lagoon ecosystems such as water turbidity and abundance of humic substances. We extracted the DNA present in the water column, recovering phytoplankton biodiversity ranging over all the phytoplankton size classes. In general, for planktonic organisms has been applied the bulk sample approach. We used the eDNA metabarcoding method and the results highlight the identification of a wide fraction of nanophytoplankton and microphytoplankton, representing about 80% of OTUs. The high percentage of OTUs belonging to these phytoplanktonic size classes could reflect the different coverage of phytoplankton species in the DNA barcode reference databases.

In this study, eDNA also detected significant correlations between phytoplankton OTUs occurrence and lagoon abiotic components. Specifically, the significant environmental explanatory variables were salinity and temperature (p-values < 0.01), displaying a high difference between site AF, located at the interface between the sea and the lagoon, characterized by high salinity and low temperature, site F, located in proximity to the freshwater input, characterized by low salinity and higher temperature, and the other sites, characterized by a greater intermediate homogeneity. The specific pattern observed in the db-RDA biplot shows that similar environmental conditions correspond to similarities in the phytoplankton community composition, while distinct environmental differences reflect in separated communities. These results confirm that environmental heterogeneity can represent the spatial factor playing a key role in influencing the phytoplankton community structure in transitional water ecosystems30.

Different OTUs were taxonomically assigned to the Tetraselmis genus. The absence of species assignment is probably related to the absence of 18S DNA barcodes for Mediterranean Tetraselmis species in the reference databases. OTUs belonging to the Tetraselmis genus and highly differently distributed across the sampling sites were analysed for the genetic distances. The results outlined that genetic backgrounds correlate with specific spatial distribution. Although the analysis is limited to a short fragment of a conserved gene, the results suggest that sequencing data from eDNA amplicons could be useful to inspect directional genetic variations.

A high genetic variation is important for the ability of phytoplanktonic assemblages to be extremely viable in highly heterogeneous environments such as coastal lagoons, which are subjected to strong variations driven by extrinsic factors such as climate and tides, other than by intrinsic variability given by abiotic heterogeneity.

Overall, this work demonstrates that eDNA metabarcoding is a valuable tool for rapidly surveying the phytoplanktonic communities and describing their spatial distribution in transitional water ecosystems.