Background

Understanding assembly mechanisms of microbial community across geographic and taxonomic scales is a fundamental issue of microbial ecology (Zhou and Ning 2017). As the two fundamental theories describing community assembly processes, niche-based theory hypothesizes that deterministic processes such as environmental selection and interspecies interactions govern community assembly (Chesson and Kuang 2008; Letten et al. 2016), while neutral theory assumes that community assemblies are governed by stochastic processes such as dispersal, ecological drift (including random birth and death), and speciation/diversification (McGill 2003; Volkov et al. 2003). As the fusion of the two theories has led to the general consensus that both deterministic and stochastic processes contributed to community assembly, the central focus of microbial community assembly mechanisms is to quantify the relative importance of deterministic and stochastic processes (Vellend 2010). As two most popular and influential models specifically developed for microbial communities, Sloan’s Neutral Model as a neutral-theory-based process-oriented model (Sloan et al. 2006) and/or Stegen’s two-step Null Model based on phylogenetic signal in niche differences between species (Stegen et al. 2013) have been extensively used to infer assembly processes of microbial communities across a broad range of ecosystems or habitats including marine water (Sun et al. 2023; Wu et al. 2020), river (Isabwe et al. 2022; Yang et al. 2023b), lake (Yan et al. 2017; Yang et al. 2023a), soil (Barnett et al. 2020; Tripathi et al. 2018; Xu et al. 2023), human lung (Venkataraman et al. 2015), and aquatic animal (Wang et al. 2020b). Although the core assumptions of the two approaches differ, the deviation of observed patterns from the neutral or null distribution can indicate the extent of determinism relative to stochasticity in sha** microbial communities (Stegen et al. 2012; Venkataraman et al. 2015), thus providing important insights into the balance of ecological processes in governing microbial community assembly. However, given the high diversity and the broad fitness of microbes, quantifying community assembly mechanisms at the whole community level is limited due to the neglect of taxonomically dependent processes, since various ecological processes commonly act on the finer taxonomic levels rather than the whole communities (Nemergut et al. 2013). Previous works based on null models at the community level have reported contrasting assembly mechanisms in global or regional marine waters between microbial domains/kingdoms, including bacteria vs. archaea (Wang et al. 2020a), bacteria vs. protists (Wu et al. 2018), and prokaryotes vs. microeukaryotes (Logares et al. 2020), suggesting the taxonomic dependency at a high taxonomic level. However, assembly mechanisms of different bacterial taxonomic groups across complex coastal waters and their determinants have not been well understood, especially at the regional scale.

According to some previous discussions, including ours, about the pros and cons of neutral and null models (Wang et al. 2020a; Zhou and Ning 2017), we propose that simultaneously considering two methods could improve the inference of the microbial assembly processes. The typical results of either neutral or null models can reflect the general pattern in relative importance of deterministic and stochastic processes (or specific ecological processes) in sha** microbial communities in the study areas (Logares et al. 2020; Wu et al. 2018; Yan et al. 2017). However, at the larger geographic scale (i.e., regional scale), the understanding of underlying mechanism sha** microbial biogeography could be oversimplified without further characterization of the spatial variability in assembly processes (Wang et al. 2019; Yan et al. 2021). Therefore, evaluating the heterogeneity of assembly processes of microbial communities across space is essential to understanding the mechanism sha** the spatial assembly of microbes, especially at or beyond the regional scale. However, the spatial heterogeneity in the assembly of total bacteria and different taxonomic groups across complex coastal waters at the regional scale has not been comprehensively investigated.

In marine waters, several previous studies have suggested that contrasting community assembly mechanisms of prokaryotes and picoeukaryotes were driven by their differences in dormancy potential and species composition (Kong et al. 2022; Logares et al. 2020), while the distinct assembly mechanisms between bacterial and protist communities depended on niche breadth and cellular size (Wu et al. 2018). Our previous work found that domain-dependency patterns in prokaryotes corresponded to differences in niche breadth and bacteria and archaea population sizes (Wang et al. 2020a). For a finer perspective of sub-communities, most previous efforts compared assembly mechanisms of abundant and rare communities (Alonso-Sáez et al. 2015; Logares et al. 2014; Mo et al. 2018; Wu et al. 2017). For example, taxa abundance and diversity were suggested to contribute to the differences in assembly mechanisms of abundant and rare communities of bacteria in subtropical bays (Mo et al. 2018). However, key factors mediating the taxonomic dependency or spatial variability in assembly processes of marine bacteria have not been extensively revealed. Seawater density and temperature were suggested to be the most important environmental modulators of the balance between stochastic and deterministic assembly processes of prokaryotes along a ~ 2000-km longitudinal transect (Allen et al. 2020). Our previous work suggested suspended particles as a crucial factor driving the balance between deterministic and stochastic assembly processes of bacteria across the coastal waters in the East China Sea (Wang et al. 2020a). However, determinants of taxonomic dependency and spatial heterogeneity in assembly mechanisms of bacteria across complex coastal waters remain largely unknown.

To characterize taxonomic dependency and spatial heterogeneity in assembly mechanisms of bacteria and their regulation in coastal waters, we used the coastal area of northern Zhejiang, East China Sea, with spatially structured environmental gradients (primarily salinity and nutrient-related factors including dissolved inorganic nitrogen, phosphate, and suspended particles) (Wang et al. 2015), as a model system. A 16S rRNA microbiome dataset with regionally high coverage was analyzed with both neutral and null models, and with corresponding visualization methods to test three hypotheses: (1) there would be pronounced taxonomic dependency in ecological processes governing bacterial assembly; (2) spatial heterogeneity in assembly processes of bacteria along the environmental gradients would be common across taxonomic groups; and (3) the extent and determinants of spatial heterogeneity would also be taxonomically dependent. Our work could provide a baseline for assessing the impact of regional environmental changes on the mechanisms of maintenance of bacterial diversity and aggregation.

Methods

Sampling scheme, measurements of water physicochemical parameters, 16S rRNA gene amplicon sequencing, and sequence processing

The study area and sampling procedures were described in our previous work (Wang et al. 2015). Briefly, we used a high-coverage sampling scheme at a ~ 200-km scale across the coastal area of northern Zhejiang Province, China. A total of 95 surface water samples (at 0.5-m depth) were collected from 95 stations, affiliated to eight zones: Hangzhou Bay (HZ), Zhoushan archipelago (ZSI, including three subzones: ZSI_north (northern part of the archipelago), ZSI_mouth (in the mouth of HZ), and ZSI_other (others)), ** prokaryote community structure. Environ Microbiol 8(4):732–740. https://doi.org/10.1111/j.1462-2920.2005.00956.x " href="/article/10.1186/s13717-023-00480-7#ref-CR46" id="ref-link-section-d218292237e710">2006). Briefly, the relationship between the frequency of occurrence of ZOTUs in the local communities of 82 stations and their abundance in the metacommunity (estimated by the mean relative abundance across all local communities) was fitted by the neutral model. The model predicts that more abundant species (as referred to ZOTUs here) of a metacommunity will be more ubiquitous across local communities, because of their higher probability to be randomly dispersed and then to colonize in a local community, while less abundant species are more likely to be lost or replaced by others due to ecological drift (Burns et al. 2016). The R code from Burns et al. (2016) was used for the neutral model fitting, the goodness of model fitting was evaluated by R2, ranging from ≤ 0 (not fit) to 1 (perfectly fit). The 95% confidence intervals of the model were calculated by bootstrap** with 1,000 replicates. The estimated migration rate (m), presenting the probability that stochastic losses of individuals in local communities replaced by dispersal from the metacommunity, was calculated using a non-linear least-squares fitting with the R package ‘minpack.lm’ (Burns et al. 2016; Elzhov et al. 2013). This parameter can be considered as an indicator of dispersal limitation, that is, higher m values mean less dispersal limited (Burns et al. 2016).

The ZOTUs that fall within the 95% confidence intervals of the neutral model are considered as neutrally distributed, which are likely assembled into local communities by stochastic dispersal from the metacommunity and ecological drift (Venkataraman et al. 2015). The ZOTUs that were overrepresented compared to the neutral prediction hold a strong probability of preference for certain local conditions, thus being selected for, while the ZOTUs that were underrepresented compared to the neutral prediction are likely selected against by most of local conditions and/or governed by dispersal limitation from the metacommunity (Venkataraman et al. 2015). The cumulative relative abundances of neutrally distributed and non-neutrally distributed (above and below prediction) species were calculated as a metric to infer the relative influence of dispersal and drift (stochastic processes) and selection (deterministic processes) in governing the assembly of bacteria at the community level (Venkataraman et al. 2015). To assess the taxonomic dependency in relative importance of deterministic and stochastic processes, we calculated the cumulative relative abundance of non-neutrally and neutrally distributed ZOTUs of total bacterial communities and different taxonomic groups. Furthermore, the abundance ratio of non-neutrally and neutrally distributed ZOTUs (hereinafter referred to as non-neutral-to-neutral ratio) of each community at each station was calculated as following:

$${\text{Non-neutral-to-neutral ratio}} = \frac{{\sum}_{{\text{i}}= 1}^{\text{M}} {{\text{Above}}}_{\text{i}}+{\sum}_{{\text{j}}=1}^{\text{N}} {{\text{Below}}}_{\text{j}}}{{\sum}_{{\text{k}}=1}^{\text{T}}{{\text{Neutral}}}_{\text{k}}},$$
(1)

where Abovei, Belowj, and Neutralk are the relative abundance of overrepresented ZOTU i, underrepresented ZOTU j, and neutrally distributed ZOTU k in a given community, respectively. Then non-neutral-to-neutral ratio of each station was visualized using ArcGIS Desktop 10.4 to evaluate the spatial heterogeneity in assembly processes of bacteria.

Inference and visualization of assembly processes of bacterial communities on between-station basis using the null models

The assembly processes of bacterial communities on the basis of pairwise comparison between stations were inferred using the null models (Stegen et al. 2013). This approach (Stegen et al. 2013) and spatial visualization of assembly processes (Wang et al. 2019) have been described previously. Briefly, the first step of this approach is using the deviation of observed phylogenetic turnover (based on β-Mean Nearest Taxon Distance (βMNTD)) from the null expectation, that is β-Nearest Taxon Index (βNTI), to distinguish deterministic and stochastic processes:

$${{\beta\text{MNTD}}}=0.5\left[\sum_{{i}_{k}=1}^{{n}_{k}}{f}_{{i}_{k}}min \left({\Delta }_{{i}_{k}{j}_{m}}\right)+\sum_{{i}_{m}=1}^{{n}_{m}}{f}_{{i}_{m}}min \left({\Delta}_{{i}_{m}{j}_{k}}\right)\right],$$
(2)

where \({f}_{{i}_{k}}\) is the relative abundance of ZOTU i in community k, nk is the number of ZOTUs in k, and \(min\left({\Delta}_{{i}_{k}{j}_{m}}\right)\) is the minimum phylogenetic distance between ZOTU i in community k and all ZOTUs j in community m;

$${\beta\text{NTI}}=\frac{{{\beta\text{MNTD}}}_{obs}-\overline{{{\beta\text{MNTD}}}_{null}}}{{\text{sd}({\beta\text{MNTD}}}_{null})},$$
(3)

where βMNTDobs is phylogenetic distances between two observed communities, βMNTDnull is that between two randomized communities, \(\overline{{{{\beta}\text{MNTD}} }_{null}}\) is the mean βMNTDnull from 999 randomization, and sd(βMNTDnull) is standard deviations of 999 βMNTDnull. The significant difference (|βNTI|> 2) indicates the dominance of deterministic processes for a given pair of communities, and βNTI >  + 2 or <  − 2 suggests that heterogeneous or homogeneous selection governs between-community difference or similarity, respectively. For all the pairs of communities with |βNTI|< 2, which suggests stochastic processes, the second step uses Raup–Crick metric based on Bray–Curtis dissimilarity (RCbray) to estimate the standardized deviation of observed ZOTU turnover from the null expectation, thus disentangling various stochastic processes (Chase and Myers 2011; Stegen et al. 2013). When |βNTI|< 2, the significant difference, that is RCbray >  + 0.95 or <  − 0.95, suggests that dispersal limitation or homogenizing dispersal governs between-community difference or similarity, respectively, while |RCbray|< 0.95 suggests that the turnover between a given pair of communities is undominated by any processes (Stegen et al. 2015). Subsequently, the spatial distribution of assembly processes of total bacterial communities or different taxonomic groups between stations was visualized using ArcGIS Desktop 10.4 (Yan et al. 2021).

Calculation of niche breadth of bacteria

Levins’ niche breadth was used to present habitat specialization and generalization of each bacterial ZOTU, based on the abundance of species in different resource states (Levins 1968). Here, resource states were defined by non-hierarchical clustering as previously described (Wang et al. 2020a; Yan et al. 2022). The ZOTU tables of total bacterial community and different taxonomic groups were then converted into resource matrices (Krebs 2014). Niche breadth of ZOTUs was calculated and standardized as following (Pandit et al. 2009).

$$\text{Levins' }\text{niche}\; \text{breadth}\; \text{index}\;(B): {{B}}_{j}=1/{\sum}_{i=1}^{N}{P}_{\text{ij}}^{2},$$
(4)
$${\text{Levins'}\; \text{standardized}\; \text{niche}\; \text{breadth}}\;\left({{\text{B}}_{A}} \right):B_{A} = \left( {{ \text{B}} -1} \right)/\left( {{ \text{N}}- 1} \right),$$
(5)

where Bj is the niche breadth of ZOTU j, Pij is the proportion of ZOTU j in resource state i, N is the total number of resource states. The arithmetic average BA of all ZOTUs in a given bacterial community or groups were calculated as niche breadth at the levels of the total community or taxonomic group (Wu et al. 2018). The habitat specialists and generalists were defined according to BA value of a given ZOTU as previously described (Liao et al. 2016). Additional details of the threshold of habitat specialists and generalists are provided in Additional file 1.

Inference of potential microbial interactions by association network analysis

Direct microbial associations were inferred using FlashWeave (sensitive = true, heterogeneous = false, alpha = 0.001, normalize = true) (Tackmann et al. 2019). FlashWeave was used because of its merits on detecting and removing indirect (i.e., purely correlational) associations to construct direct association networks based on local-to-global learning approach, a constraint-based causal inference framework for the prediction of direct relationships between variables, thus reducing false or suspicious associations. It furthermore allows to estimate influence of environmental factors on microbial associations and then to remove indirect associations driven by them. The total bacterial network with non-environmentally driven edges was generated, and then was divided into sub-networks for seven bacterial groups according to the edges connected to the nodes (ZOTUs) of each bacterial group. Community cohesions and cohesion ratio (|negative cohesion/positive cohesion|), as metrics evaluating the degree of connectivity and relative importance of negative and positive relationships between taxa in a community, were calculated based on the associations revealed in total bacterial network and seven sub-networks as previously described (Hernandez et al. 2021; Herren and McMahon 2017). Furthermore, station-based networks were extracted from the total bacterial network and seven bacterial sub-networks according to the edges connected to the nodes (ZOTUs) present in the local community (Ma et al. 2016) and then topological features including modularity and average degree, as metrics evaluating community stability and potential interaction strength (Hernandez et al. 2021; Wan et al. 2020), were calculated using the R package “igraph”.

Estimating the direct and indirect effects of different factor categories on bacterial community assembly

Partial least squares path modeling (PLS-PM) (Sanchez et al. 2023) was conducted to obtain a systematic understanding of the direct and indirect effects of factor categories including Longitude, Latitude, basic abiotic constraints (Basic; including pH and DO), inorganic resources (Inorganic; including salinity, DIN (dissolved inorganic nitrogen; sum of NO3 (nitrate), NO2 (nitrite), NH4 (ammonium), and PO4 (phosphate)), organic resources (Organic; including SP (suspended particles), COD (chemical oxygen demand), and oil), chlorophyll-a (Chl-a), niche breadth of bacterial community (Niche), bacterial alpha diversity indices (Diversity; including phylogenetic diversity, ZOTU richness, and Shannon–Wiener index), relative abundance (Abundance), and the features reflecting potential microbial interactions (Interaction; including cohesion ratio (|negative cohesion/positive cohesion|), modularity, and average degree) on bacterial community assembly mechanisms (as expressed by the ratio of the relative abundance of non-neutrally ZOTUs to that of neutrally distributed ones) with the R package ‘plspm’ (Sanchez et al. 2023). The GoF index is regarded as goodness of fit of the entire model. The total effects are the sum of direct and indirect effects. The direct effects are expressed as the path coefficients, and the indirect effects are expressed as the product of the path coefficients by taking an indirect path. Partial least squares path modeling shows the path coefficients (direct effects) of the above ten factor categories, significance of linear model fitting between pairwise factor categories were checked by bootstrap t-test.

General statistical analyses

The geo-statistics were performed in ArcGIS Desktop 10.4. Kruskal–Wallis analysis was applied to test the significance of differences in the ecological features including non-neutral-to-neutral ratio, niche breadth, alpha diversity indices, and the features of microbial associations across bacterial communities using IBM SPSS Statistics Version 22.0. Spearman rank correlations between assembly mechanisms (as expressed by non-neutral-to-neutral ratio) of bacterial communities and other community ecological features were tested in IBM SPSS Statistics Version 22.0. Distance-based redundancy analysis (db-RDA) was performed to determine key environmental driver of compositional variation of bacterial communities using the ‘capscale’ function of the R package “vegan”.

Results

Assembly processes of bacterial communities

Our analyses focused on seven bacterial taxonomic groups at the phylum and proteobacterial class levels, accounting for 92.8% of reads of the metacommunity. Overall, the assembly of total bacterial communities fit the neutral model (R2 = 0.77; Additional file 1: Fig. S2). According to the cumulative relative abundance of three categories of ZOTUs (Zero-radius Operational Taxonomic Units) indicating the relative importance of different ecological processes, neutral (stochastic) processes had slightly more contribution to total bacterial community assembly compared with that of selection (deterministic (above or below prediction)) processes (58.4% vs. 41.6%) (Fig. 1A). However, the relative importance of neutral and selection processes was highly variable across the seven bacterial groups, that is, Actinobacteria, Gammaproteobacteria, Alphaproteobacteria, and Cyanobacteria were more dominantly governed by neutral processes; and Bacteroidetes, Planctomycetes, and Deltaproteobacteria were more shaped by selection processes, with more selection against in the assembly of Bacteroidetes and Planctomycetes but more selection for in the assembly of Deltaproteobacteria.

Fig. 1
figure 1

Relative importance of assembly processes of bacteria using neutral (A) and null models (B). A Cumulative relative abundance of bacterial ZOTUs above prediction, below prediction, and neutrally distributed. B The percentage of ecological processes governing the spatial turnover of total bacterial communities or seven taxonomic groups in all pairwise comparisons between stations according to βNTI values

In order to quantify phylogenetic turnover of bacterial communities using the null model based on βNTI (β-Nearest Taxon Index), we first tested for phylogenetic signals for total bacterial community or taxonomic groups (Additional file 1: Fig. S3), and confirmed significant signals across relatively short phylogenetic distances (typically < 13% of the maximum) (Stegen et al. 2012). The null models showed that total bacterial communities were equally governed by deterministic and stochastic processes (Fig. 1B). Similar to the pattern revealed by the neutral model, Actinobacteria, Gammaproteobacteria, Alphaproteobacteria, and Cyanobacteria were more governed by stochastic processes, of which the relative importance was even higher than that shown by neutral model, while the enhanced stochasticity made Bacteroidetes equally shaped by deterministic and stochastic processes (50.2% vs. 49.8%) (Fig. 1B). However, Planctomycetes and Deltaproteobacteria were governed more by stochastic processes, showing the opposite pattern as revealed by the neutral model. Additionally, we found that all bacterial groups were governed more by deterministic processes when quantifying with RCbray (Raup–Crick metric based on Bray–Curtis dissimilarity) alone than with βNTI alone (Additional file 1: Fig. S4).

Spatial variability of assembly processes of bacterial communities

The ratio of deterministic and stochastic assembly processes at each station quantified by non-neutral-to-neutral ratio was mapped to illustrate the spatial heterogeneity of bacterial community assembly mechanism (Fig. 2). Total bacterial communities were more shaped by deterministic processes (selection) in Hangzhou Bay (HZ) and Yushan Reserve (YS), serving as two ends of multiple environmental gradients (including salinity and nutrient-related factors). In other zones among the intermediate interval of the environmental gradients, stochastic (neutral) processes showed more power in governing the assembly of total bacterial communities. The assembly processes of the seven bacterial groups showed distinct spatial patterns. Bacteroidetes showed a similar pattern as the total bacterial community, with the zones dominated by determinism extending to the northern part of Zhoushan archipelago (ZSI_north) and Jiushan Islands (JS). The assemblies of Alphaproteobacteria and Gammaproteobacteria were dominantly governed by deterministic processes in HZ, while stochastic processes in other zones. Actinobacteria and Cyanobacteria were generally shaped by stochastic processes across the entire study area (except several HZ stations for Cyanobacteria), while the assembly of Deltaproteobacteria was dominated by deterministic processes. The assembly of Planctomycetes was dominated by deterministic processes in most zones except the east boundary of the Island-chain (BIC). The degree of heterogeneity in assembly mechanisms of bacteria estimated by coefficient of variation (CV) of non-neutral-to-neutral ratio varied from 0.36 to 1.35 (data not shown). Taxonomic groups with higher spatial heterogeneity were Bacteroidetes (1.35), Alphaproteobacteria (1.18), Cyanobacteria (1.12), and Planctomycetes (1.06), while those with lower heterogeneity were Actinobacteria (0.69), Gammaproteobacteria (0.42), and Deltaproteobacteria (0.36).

Fig. 2
figure 2

Kriged maps illustrating the spatial variability of the ratio of relative abundance of non-neutrally (sum of above-prediction and below-prediction) distributed ZOTUs to that of neutrally distributed ZOTUs (defined by the neutral model) in total bacterial communities or seven taxonomic groups. The colors of the stations correspond to different zones, and the stations in the Zhoushan archipelago were grouped into three subzones including ZSI_north (northern part of the archipelago), ZSI_mouth (in the mouth of HZ), and ZSI_other (others), which were shown as square, circle, and triangle symbols, respectively

From the perspective of pairwise comparisons between zones (as indicated by the ratio of deterministic processes between zones according to βNTI), total bacterial community and taxonomic groups including Bacteroidetes, Deltaproteobacteria, and Planctomycetes showed higher spatial heterogeneity in assembly processes, compared with other taxonomic groups (Fig. 3 and Additional file 1: Fig. S5). The extent of heterogeneity across taxonomic groups overall corresponded to those revealed by the neutral model, except Alphaproteobacteria and Gammaproteobacteria, which showed much less heterogeneity compared with that based on the neutral model. For the total bacterial community, a determinism-dominated pattern was more frequently detected between ZSI/** prokaryote community structure. Environ Microbiol 8(4):732–740. https://doi.org/10.1111/j.1462-2920.2005.00956.x " href="/article/10.1186/s13717-023-00480-7#ref-CR46" id="ref-link-section-d218292237e2398">2006). The second possible explanation could be the endogenous difference in the basis for inferring community assembly process, that is the local-metacommunity relationship of each species for the neutral model and pairwise comparison between local communities based on community-level metrics for the null model. These results emphasize the necessity of using different models to complementarily interpret assembly mechanisms of microbial communities, especially for taxonomic groups with lower relative abundance, since conflicting results from the two models were more likely to occur in bacterial groups with lower relative abundance. Despite that, both models confirmed that taxonomic dependencies exist in the assembly mechanisms of bacterial groups in terms of dominant assembly mechanisms and specific ecological processes. Due to the limitation of sequencing depth and unevenness in sequences and coverage across taxonomic scales for different samples, our analyses did not expand to the finer taxonomic resolutions. Future efforts should be made to assess taxonomic scale dependency and hierarchical determinants of bacterial assembly mechanisms.

The current knowledge about taxonomic dependency determinants in community assembly mechanisms is scarce. Niche breadth at the community level has been proposed as a major determinant of differences in assembly mechanisms across microbial domains (Logares et al. 2020; Wang et al. 2020a; Wu et al. 2018), since microorganisms with wider niche breadths are less sensitive to environmental changes and are less governed by environmental selection, thus leading to stronger stochastic assembly relative to deterministic assembly (Jiao et al. 2020; Liao et al. 2016). In this study, we also found that niche breadth showed a strong negative correlation with the determinism-to-stochasticity ratio in the assembly of bacterial groups, suggesting this principle could also apply to taxonomic dependency in assembly mechanisms within the domain Bacteria. Furthermore, the relative abundance of habitat specialists and generalists (as defined by the range of niche breadth) in a given community could also determine its assembly mechanism, and the community with higher proportion or abundance of specialists tends to be more governed by selection (Mo et al. 2012; Vellend 2010; Zhou and Ning 2017). However, most of the studies interpreted microbial community assembly mechanisms in a general manner across various geographic scales (Cheng et al. 2023; Logares et al. 2020; Wu et al. 2018). The spatial variability or heterogeneity of community assembly mechanisms of microorganisms has been neglected for a long time. Our previous work has demonstrated the remarkable spatial variability of community assembly processes of total archaea and the dominant archaeal groups (Marine Groups I and II) (Wang et al. 2019). We also found that the extent of spatial heterogeneity of microbial assembly mechanisms might largely depend on the range of environmental gradients across similar geographic scales, that is, broader environmental gradients led to higher spatial heterogeneity of assembly mechanisms (Wang et al. 2019, 2020a). In this study, by using customized spatial visualization methods for both neutral and null models, we confirmed the prevalence of spatial heterogeneity in assembly processes of total bacterial community and taxonomic groups, and the differences in the degree of spatial heterogeneity in assembly processes across the seven bacterial groups suggested taxonomic dependency in spatial heterogeneity of assembly mechanisms. As the two ends of the environmental gradients (including salinity and nutrients) in our study area, the Hangzhou Bay (HZ) and Yushan Reserve (YS) served as the only two hot spots of determinism-dominated mechanism for the total bacterial community. This corresponded to the stronger selection triggered by more extreme local environmental conditions in these two zones, which harbored very distinct community composition compared with those in the other zones (Wang et al. 2015). As one of the most eutrophic coastal area in China, the study area forms a strong-to-weak gradient of anthropogenic/terrestrial disturbances from HZ to YS (MEE 2023), due to the emissions from the intensive economic development of the big/mega cities surrounding HZ and the terrestrial runoffs from the Qiantang River (Chen et al. 2009; Sun et al. 2013; Yang et al. 2012). The stochasticity-dominated assembly mechanism of the total bacterial community found in the zones across the intermediate range of gradients suggests that intermediate anthropogenic/terrestrial disturbances could lead to more stochastic assembly of bacteria. These results indicate an ‘intermediate disturbance hypothesis’ (Connell 1979) of heterogeneity in microbial community assembly mechanisms. This has been shown in a soil ecosystem/microcosm experiment where stochasticity overwhelmed determinism in bacterial community assembly processes at neutral pH/moderate disturbance conditions but showed the opposite pattern at two poles of pH value/disturbance frequency (Santillan et al. 2019; Tripathi et al. 2018).

Although the degree of heterogeneity varied across bacterial groups, HZ served as the hot spot of determinism-dominated mechanisms for more than half of the taxonomic groups including Alphaproteobacteria, Gammaproteobacteria, Planctomycetes, and Bacteroidetes, emphasizing that these bacterial taxa tend to be more deterministically assembled under more intensive disturbances. Several previous studies also found enhanced determinism (niche selection) coupled with more intensive perturbation such as anthropogenic activities and extreme climates like heavy rain and desiccation across various ecosystems including freshwater lakes (Obieze et al. 2022; Wu et al. 2023a), rock pools (Vass et al. 2020), and coastal sediments (Valverde et al. 2014). Furthermore, eutrophic waters in HZ could increase the proportion of deterministic processes of these taxa as other researchers found that planktonic Vibrio communities were more deterministically assembled in eutrophic waters compared with those in mesotrophic waters in a marine subtropical gulf (Li et al. 2020). Furthermore, YS also served as the hot spot of determinism-dominated mechanisms for Bacteroidetes, which corresponded to the dominant Bacteroidetes likely triggered by the phytoplankton bloom in this zone as previously reported (Wang et al. 2015). Collectively, spatial heterogeneity in the assembly mechanisms of bacteria was prevalent across the study area.

Our understanding of the determinants of spatial heterogeneity in bacterial assembly mechanisms in marine waters is poor at best. Some studies have demonstrated that temperature was the major factor mediating the balance between stochastic and deterministic assembly processes of bacteria in the sediments of hot springs (He et al. 2021) and in the oligotrophic ocean at a ~ 2,000-km scale (Allen et al. 2020). Our previous study across coastal waters at a ~ 300-km scale found that suspended particles (SP) and phosphate had a great impact on spatial variability of bacterial assembly processes (Wang et al. 2020a), while salinity largely regulated that along an exorheic river (Shi et al. 2023). Here, we found that environmental determinants including pH, dissolved oxygen (DO), salinity, dissolved inorganic nitrogen (DIN), and phosphate directly affected the spatial heterogeneity of the determinism-to-stochasticity ratio of total bacterial community. Among them, the nutrient concentrations (mainly DIN and phosphate) formed a high-to-low gradient from HZ to YS, while salinity and pH showed a high-to-low gradient from YS to HZ, corresponding to the determinism-dominance in total bacterial assembly in these two zones, but stochasticity-dominated pattern in other zones as discussed above. Besides strong environmental constraints on total bacterial community assembly mechanisms over space, potential interactions between taxa also contributed to spatial assembly patterns of total bacteria, emphasizing the role of specific microbial interactions in enhancing deterministic community assembly as we discussed above.

In general, many factors could affect the spatial assembly of multiple bacterial groups, but how they acted on distinct communities subtly differed. As we hypothesized, the determinants of spatial heterogeneity in assembly mechanisms were also taxonomically dependent. Similar to the total bacterial community, spatial variability in the assembly mechanism of Alphaproteobacteria was strongly and directly affected by basic abiotic constraints and inorganic resources. But the underestimation of a key alphaproteobacterial group (Pelagibacterales, aka. SAR11 clade) by the current primer set could influence the assessment of processes and determinants of Alphaproteobacteria assembly, which should be evaluated in the future with the modified primer set. Bacteroidetes was strongly and directly affected by its relative abundance, corresponding to its thriving following a phytoplankton bloom in YS as mentioned above. It is well known that the spatial distribution of abundance and diversity of marine Cyanobacteria is mainly driven by the combination of light, temperature, and inorganic nutrients including N, P, and Fe (Cunningham and John 2017; Flombaum et al. 2013), but the factors determining the spatial variability in its assembly processes are barely known. We found that spatial heterogeneity in the assembly of Cyanobacteria was directly regulated by organic resources including SP and chemical oxygen demand (COD), enforcing determinism-dominance in the mouth of HZ, which may reflect underlying cruciality of light and nutrients. Given the importance of Cyanobacteria in marine endogenous organic carbon flux, this association between cyanobacterial assembly and exogenous organic matter indicates the complex roles of cyanobacteria in organic carbon turnover in the transitional zone between land and sea. Among all the tested bacterial groups, only the assembly of Planctomycetes was simultaneously and directly controlled by geographic, environmental, and community ecological features. This suggests complexity in the mechanisms governing the spatial assembly of Planctomycetes. Given that longitude showed the strongest effect on Planctomycetes, we speculated that unmeasured factors such as water temperature highly associated with longitude might be the actual drivers, which deserves further investigation.

Compared with the above bacterial groups, the assembly mechanism of Gammaproteobacteria, Actinobacteria, and Deltaproteobacteria showed much lower spatial heterogeneity. The spatial heterogeneity in assembly mechanism of Gammaproteobacteria was strongly and directly affected by its alpha diversity, which was largely conditioned by environmental and geographic factors, suggesting diversification as a force in governing deterministic assembly in HZ. Alpha diversity also directly influence spatial assembly patterns of Actinobacteria and Deltaproteobacteria but with opposite manners and different co-factors, suggesting distinct mechanisms underlying the observed patterns. Collectively, the degree and determinants of spatial heterogeneity in community assembly mechanisms varied across bacterial groups. The ones with higher heterogeneity in assembly mechanism were more related to environmental and/or geographic factors (except Bacteroidetes), while those with lower heterogeneity were more related to community ecological features.

Conclusions

This study systematically tested the existence and extent of taxonomic dependency and spatial heterogeneity in assembly mechanisms of marine bacteria in a coastal ecosystem with spatially structured regional environmental gradients. Our results confirmed the variability of assembly processes of bacteria with taxonomic group and with space. The assembly of total bacterial communities was balancedly governed by deterministic and stochastic processes, while only the Bacteroidetes were dominated by determinism among the seven dominant bacterial groups. The taxonomic dependency of bacterial assembly processes was mainly related to the differences in niche breadth and negative-to-positive cohesion ratio, followed by alpha diversity and relative abundance of bacterial taxa. The spatial distribution patterns of assembly processes commonly varied across bacterial groups, and were driven by various combinations of factors, suggesting that spatial heterogeneity of assembly processes of bacteria also exhibited taxonomic dependency. Collectively, this work assessed the pervasiveness of taxonomic dependency and spatial heterogeneity in bacterial community assembly from the perspectives of one-station (local-community) basis and pairwise between-station comparisons, providing a comprehensive understanding of the regulation of bacterial community assembly across taxa and space.