INTRODUCTION
Marine microbial communities, composed of bacteria, archaea, and protists, as well as viruses, play essential roles in the functioning and regulation of Earth’s biogeochemical cycles (
1). Their roles within planktonic ecosystems have typically been studied under the prism of bottom-up research, namely, understanding how resources and abiotic factors affect their abundance, diversity, and functions. On the other hand, the effects of mortality, allelopathy, symbiosis, and other biotic processes are also likely to shape their communities and to exert strong selective pressures, yet they have been studied much less. With concentrations reaching 10
7 protists (
2) and 10
9 prokaryotes (
3,
4) per liter of sea water, biotic interactions are likely to impact community structure from the microscale to the ecosystem level (
5).
Among marine protists, diatoms (Bacillariophyta) are of key ecological importance. They are a ubiquitous and predominant component of phytoplankton, characterized by their ornate silica cell walls, and are considered to be responsible for approximately 40% of marine net primary productivity (NPP) (
6,
7). The array of biotic interactions in which marine diatoms have been described is vast. They are fed upon by heterotrophic microzooplankton such as ciliates and phagotrophic dinoflagellates (
8–10) as well as by metazoan grazers such as copepods (
11–14). Other known interactions include symbioses with nitrogen-fixing cyanobacteria (
15,
16) and tintinnids (
17), parasitism by chytrids and diplonemids (
18), diatom-targeted allelopathy by algicidal prokaryotes and dinoflagellates (
19,
20), and allelopathy mediated by diatom-derived compounds detrimental to copepod growth (
21,
22). Beyond direct biotic interactions, diatoms are also known to thrive in high-nutrient and high-turbulence environments, such as upwelling regions, at the expense of the other major phytoplankton groups, for instance, dinoflagellates and haptophytes (
23,
24). Competition for silicon between diatoms and radiolarians, other silicifying members of plankton, has also been noted (
25,
26).
Diatoms are one of the most diverse planktonic groups in terms of species, widely distributed across the world’s sunlit ocean (
27) and capable of generating massive “blooms” in which the diatom biomass can increase up to 3 orders of magnitude in just a few days (
28). Their success has been attributed, in part, to a broad range of predation avoidance mechanisms (
29), such as their solid mineral skeleton (
30), chain and spine formation in some species, and toxic aldehyde production (
31,
32). However, a global view of their capacity to interact with other organisms and an assessment of the impact of diatom interactions on community composition are still lacking.
Cooccurrence networks using meta-omics data are increasingly being used to study microbial communities and interactions (
33,
34), e.g., in human and soil microbiomes (
35,
36) as well as in marine and lake bacterioplankton (
37–39). Such networks provide an opportunity to extend community analysis beyond alpha and beta diversity toward a simulated representation of the relational roles played by different organisms, many of which are uncultured and uncharacterized (
40,
41). Over large spatial scales, nonrandom patterns according to which organisms frequently or never occur in the same samples are the result of several processes, such as biotic interactions, habitat filtering, as well as neutral processes (
42). Quantifying the relative importance of each component is still in its infancy. However, these networks can be used to reveal niche spaces, to identify potential biotic interactions, and to guide more focused studies. Much like in protein-protein networks, interpreting microbial association networks also relies on literature-curated gold-standard databases (
34), although such references are woefully incomplete for most planktonic groups (
43).
As part of the recent
Tara Oceans expedition (
44,
45), determinants of community structure in global ocean plankton communities were assessed using cooccurrence networks (
46), based on the abundances of viruses, bacteria, metazoans, and protists across 68
Tara Oceans stations in two depth layers in the photic zone. Pairwise links between species were computed based on how frequently they were found to cooccur in similar samples (positive correlations; here named copresences) or, on the contrary, if the presence of one organism negatively correlated with the presence of another (negative correlations; here named exclusions). It should be noted that our use here of the terms copresence and exclusion does not imply any type of biotic interaction or active process from either of the partners. In order to prevent spurious correlations due to the presence of additional confounding components such as abiotic factors, interaction information was furthermore calculated to assess whether or not correlations were driven by an environmental parameter. The
Tara Oceans interactome has global coverage and reports over 90,000 statistically significant correlations, with ∼68,000 of them being positive, ∼26,000 of them being negative, and ∼9,000 being due to the simultaneous higher correlation of two organisms (operational taxonomic units [OTUs]) with a third environmental parameter.
In this study, we provide an in-depth analysis of the diatom interactome in the open ocean, involving both prokaryotic and eukaryotic partners. We show how species distribution patterns reveal segregation between diatoms and specific taxonomic groups. We further investigate network properties involving the groups with which diatoms display the highest numbers of associations and reveal ecologically relevant areas of potential research by comparing the diatom interactome with literature previously published on the topic.
DISCUSSION
The
Tara Oceans interactome represents an ideal case study to investigate global-scale community structure involving diatoms, as it maximizes spatiotemporal variance across a global sampling campaign and captures systems-level properties. Here, we reveal that diatoms and polycystines are the organismal groups with the highest proportions of exclusions within the
Tara Oceans interactome and classify them as segregators according to a definition described previously (
47), as they display more negative than positive associations. Diatoms and polycystines prevent their cooccurrence with a range of potentially harmful organisms over broad spatial scales (
Fig. 1a and
d), a pattern unseen in the other photosynthetic classes examined (
Fig. 1b and
c), reflected by a significant exclusion of major functional groups of predators, parasites, and competitors such as copepods, Syndiniales, and Dinophyceae (
Fig. 1e).
Diatoms are known to have developed an effective arsenal composed of silicified cell walls, spines, toxic oxylipins, and chain formation to increase size, so we propose that the observed exclusion pattern reflects the worldwide impact of the diatom arms race against potential competitors, grazers, and parasites. Additionally, building upon the phylogenetic affiliation of individual sequences, barcodes can be assigned to a plankton functional type that refers to traits such as the trophic strategy and role in biogeochemical cycles (
58). As demonstrated in the
Tara Oceans interactome (
46), diatoms compose the “phytoplankton silicifiers” metanode and display a variety of mutual exclusions that again distinguish them from other phytoplankton groups. The role of biotic interactions is emphasized by the fact that out of the complete diatom association network, colocalization and coexclusion of diatoms with other organisms are due to shared preferences for an environmental niche in 13% of the cases, emphasizing the importance of biotic factors in 87% of the associations (
Fig. 2).
Diatom-MAST and diatom-MALV networks display more specialist interactions than diatom-copepod and diatom-Dinophyceae networks (
Fig. 3b). Correlation values reveal stronger exclusion patterns of diatoms against MASTs and MALVs (
Fig. 3c). These properties are conserved in the other segregator group, polycystines. Yet diatoms outcompete polycystines with higher strengths of exclusions based on correlation values and denser networks suggesting more species-specific interactions in polycystines (
Fig. 3c to
e). Previous work exploring abundance patterns among planktonic silicifiers in the
Tara Oceans data (
26) revealed strong size-fractionated communities: while the smallest-sized fraction (0.8 to 5 μm) contained a large diversity of silicifying organisms in nearly constant proportions, cooccurrence of diatoms and polycystines was rare in larger-sized fractions (20 to 180 μm), where the presence of one organism appeared to exclude the presence of the other.
Analysis at the genus level shows that abundant diatoms such as
Attheya do not prevail in the network, contrary to
Synedra, which, on a global scale, is less significant in terms of abundance but is highly connected to the plankton community. We show the existence of a species-level segregation effect that can be attributed to harmful traits (
54) (
Fig. 4a), reflected by blooming and endemic distribution patterns for the top segregating diatoms (
Fig. 4b to
d). These results support previously reported observations indicating the importance of biotic interactions in affecting ocean planktonic blooms and distribution (
29,
59). However, we cannot discount environmental parameters, as diatom blooms are also known to be triggered by light and nutrient perturbation.
Our literature survey reveals a skewed knowledge, focusing on freshwater diatoms and interactions with macroorganisms, with very few parasitic, photosymbiotic, or bacterial associations (
Fig. 6a). The relative paucity of marine microbial studies can be explained by the difficulty in accessing these interactions in the field, which obviously limits our understanding of how such interactions structure the community on a global scale. Comparing empirical knowledge and data-driven association networks reveals understudied genera, such as
Leptocylindrus and
Actinocyclus, and those that are not even present in the literature, such as
Proboscia and
Haslea (
Fig. 6b and
c). However,
Proboscia is a homotypic synonym of
Rhizosolenia that is found in the interactome, which illustrates the consequences of nonuniversal taxonomic denominations on diversity analysis.
While 18.5% of the literature database was recovered in the interactome, it explained only 6.5% of the 4,369 edges composing the diatom network. The gap between the 20% of diatom-bacterium interactions in the
Tara Oceans interactome and only 4.8% of diatom-bacterium associations described in the literature highlights how little we know about host-associated microbiomes at this time. Most of the experimental studies focus on symbiosis with diazotrophs (
16) and dinoflagellates (
60) and the antibacterial activity of
Skeletonema against bacterial pathogens (
61). In many ways, this high proportion of unmatched interactions should be regarded as the “unknown” proportion of microbial diversity emerging from metabarcoding surveys. Part of it is truly unknown and new, part of it is due to biases in data gathering and processing, and part of it is due to the lack of an extensive reference database. Indeed, the current literature is biased toward model organisms and species that can be easily cultured as well as diatoms with biotechnological potential.
This study faces challenges regarding the computation, analysis, and interpretation of cooccurrence networks while suggesting their potential to uncover processes governing diatom-related microbial communities. Further studies should compare diatom networks using several cooccurrence methods (
62), taxonomic levels (
63), and theoretical frameworks (
47,
64,
65). Assigning biological interactions such as predation, parasitism, or symbiosis to correlations will require enhanced references of biotic interactions (
34), of which the open-source collaborative database provided in this paper is an addition that also highlights potential research avenues. Furthermore, a vast body of literature already exists in the field of ecological networks, traditionally focusing on observational noninferred data and the modeling of food webs and host-parasite and plant-pollinator networks (
66,
67). Various properties linked to the architecture of these antagonistic and mutualistic networks have been formalized, such as nestedness, modularity, or the impact of combining several types of interactions in a single framework (
68,
69). These works have inspired this study, and we envision that enhanced cross-fertilization between the disciplines of ecological networks and cooccurrence networks would highly benefit both communities, ultimately helping to understand the laws governing the “tangled bank” (
70).
Diatoms have undoubtedly succeeded in adapting to the ocean’s fluctuating environment, shown by recurrent, predictable, and highly diverse bloom episodes (
71). They are considered r-selected species with high growth rates under favorable conditions that range from nutrient-rich highly turbulent environments to stratified oligotrophic waters (
24,
72,
73). Their success has long been attributed to this ecological strategy; here, we suggest that abiotic factors alone are not sufficient to explain their ecological success. The present study shows that diatoms do not cooccur with potentially harmful organisms such as predators, parasites, and pathogens (
74), shedding light on the top-down forces that could drive diatom evolution and adaptation in the modern ocean.