Open access
Computational Biology
Research Article
15 May 2024

Integration of metataxonomic data sets into microbial association networks highlights shared bacterial community dynamics in fermented vegetables

ABSTRACT

The management of food fermentation is still largely based on empirical knowledge, as the dynamics of microbial communities and the underlying metabolic networks that produce safe and nutritious products remain beyond our understanding. Although these closed ecosystems contain relatively few taxa, they have not yet been thoroughly characterized with respect to how their microbial communities interact and dynamically evolve. However, with the increased availability of metataxonomic data sets on different fermented vegetables, it is now possible to gain a comprehensive understanding of the microbial relationships that structure plant fermentation. In this study, we applied a network-based approach to the integration of public metataxonomic 16S data sets targeting different fermented vegetables throughout time. Specifically, we aimed to explore, compare, and combine public 16S data sets to identify shared associations between amplicon sequence variants (ASVs) obtained from independent studies. The workflow includes steps for searching and selecting public time-series data sets and constructing association networks of ASVs based on co-abundance metrics. Networks for individual data sets are then integrated into a core network, highlighting significant associations. Microbial communities are identified based on the comparison and clustering of ASV networks using the “stochastic block model” method. When we applied this method to 10 public data sets (including a total of 931 samples) targeting five varieties of vegetables with different sampling times, we found that it was able to shed light on the dynamics of vegetable fermentation by characterizing the processes of community succession among different bacterial assemblages.

IMPORTANCE

Within the growing body of research on the bacterial communities involved in the fermentation of vegetables, there is particular interest in discovering the species or consortia that drive different fermentation steps. This integrative analysis demonstrates that the reuse and integration of public microbiome data sets can provide new insights into a little-known biotope. Our most important finding is the recurrent but transient appearance, at the beginning of vegetable fermentation, of amplicon sequence variants (ASVs) belonging to Enterobacterales and their associations with ASVs belonging to Lactobacillales. These findings could be applied to the design of new fermented products.

INTRODUCTION

Over the last 20 years, the development of low-cost sequencing technologies has led to the creation of a large number of microbiome data sets, mainly generated using metataxonomic analyses based on 16S rRNA metabarcoding technology. For example, the number of papers using metataxonomic or metagenomic approaches to study the microbial communities of food increased sixfold between 2015 and 2021 and currently exceeds 600 (1); similarly, within the NCBI database, the Taxonomy ID “Food metagenome” (NCBI: Tax id 870726) is associated with 770 BioProjects. In keeping with the principles of Open Science, most of these publication-associated data sets are available in public repositories such as SRA (the Sequence Read Archive of NCBI), ENA (the European Nucleotide Archive of EBI), or the DNA Data Bank of Japan. To promote the reuse of certain kinds of data sets, specialized databases have been developed, such as MGnify for microbiome data (2). Here, we focused on metataxonomic studies whose integration by a robust and efficient approach allows the identification of the key taxa found in a specific ecosystem. The availability of a vast amount of metataxonomic data sets provides an unprecedented opportunity to develop new integrative tools for comparing and better understanding taxa associations in various closely related microbial ecosystems. However, these efforts face numerous challenges related to data reusability (e.g., data availability, metadata quality, and data preprocessing) and the most appropriate ways of identifying biologically informative features in a collection of metataxonomic studies. In this work, we address these challenges by designing a workflow for exploring public data sets related to the microbiota of fermented vegetables and performing a meta-analysis (i.e., reusing independent data sets and integrating them into a larger analysis to generate new knowledge).
Our choice of ecosystem was motivated by current interest in the bacterial communities involved in the fermentation of vegetables (35). Plant-based fermented foods diversify human diets and possess interesting properties in terms of sustainability and nutritional quality. These products require little energy to produce and preserve, and their consumption confers several benefits to human health (6, 7). With this study, we wanted to assess whether public data sets that are already available for fermented vegetables could help to improve our knowledge on the ecological dynamics taking place in these products. Fermented vegetables are created through the (usually spontaneous) activity of heterofermentative and homofermentative lactic acid bacteria (LAB) naturally present in the raw material (8). In Europe, the most popular example of this kind of food is sauerkraut, for which the use of pre-selected starter strains remains uncommon even for large-scale production (9). A combination of low pH and the anaerobic conditions resulting from the fermentation process are the main factors that select for the beneficial anaerobic LAB essential in the production of good-quality fermented vegetables (3). These bacteria are a broad and diverse group of species classified in the phylum Firmicutes, class Bacilli, and order Lactobacillales and include representatives from the families Lactobacillaceae, Streptococcaceae, Enterococcaceae, Carnobacteriaceae, and Aerococcaceae (10).
It should be noted that, to date, most studies have focused on describing the microbial communities present at the end of the fermentation process (4, 5), while the dynamic succession of various microbial populations during fermentation has received little attention. This represents an important gap in knowledge, especially when compared, for example, to research on cheese microbial communities, which has revealed that the proper succession of microbial populations is important to the quality of the final product (11, 12). Two separate metataxonomic analyses have revealed important changes in microbial dynamics during vegetable fermentation. A study on carrot juice reported a succession process involving Enterobacteriaceae, Leuconostoc, and Lactobacillus, while work on Suan Cai (Chinese pickles) showed that the dominant species changed from early stages of fermentation (Leuconostoc mesenteroides) to later ones (Lactiplantibacillus plantarum) (13, 14). The little information that can be gathered on the subject does not allow us to identify species or consortia that might be responsible for controlling various stages of fermentation among different vegetables. In this context, the use of metataxonomic data to carry out meta-analysis could prove illuminating.
The use and comparison of amplicon data (such as the 16S-based data considered in the present work) raise certain difficulties. First, sequencing technology may vary among studies, as may the region amplified or PCR primers employed. Second, taxonomic assignment based on the 16S variable region is considered valid only at the genus level, limiting species-level interpretations (4). There are, therefore, two possibilities for carrying out a comparative study of multiple data sets: comparing genus-level taxonomic profiles or comparing exact sequences, specifically, amplicon sequence variants (ASVs). The advantages of the first approach include the ability to compare different sequenced regions and to reduce the sparsity of the count matrices, while the use of ASVs enables intra-genus diversity to be taken into account (15, 16). In both cases, the aim of this type of meta-analysis is often to identify core taxa based on criteria of abundance and prevalence (17).
The analytical design of such a study is also important. One promising approach for meta-analysis is the construction of microbial association networks, which provide additional and complementary information to classic analyses of alpha- and beta-diversity (18). Association networks enable the identification of hub species (19, 20), taxa clusters (21), and core networks, the last of which corresponds to the intersection of several microbial association networks and can be used to identify taxa and associations shared by most networks (22). Association networks were originally designed for macroscopic ecosystems and have only recently been adapted for the investigation of interactions within microbial assemblages (21). They are constructed using count data from the sequenced environment, which are compositional (23), high-dimensional, and in the form of sparse matrices, thus increasing the difficulty of analysis (21). However, compared to networks from other assemblages, the association networks in fermented ecosystems appear to be significantly smaller (16) (due to the decrease in microbial diversity over time), making them easier to construct, visualize, and compare. According to Chen et al. (24), association networks can be divided into four categories, which are built using different approaches: correlation networks [CoNet (25) and SparCC (26)], conditional correlation networks [SPIEC-EASI (27)], mixture networks [MixMPLN (28)], and differential networks (DCDTr). Due to the complexity of microbial interactions, all these approaches have important limitations, and no method has yet managed to capture all of the aspects of interest. Indeed, studies have even shown that classical measures such as Pearson and Spearman correlations can perform just as well as computationally time-consuming methods based on more sophisticated statistical models (29, 30). Integrating independent studies, thanks to network analysis, offers a comprehensive view of microbial communities, facilitating the discovery of core associations among taxa (22). By modeling sample composition as a function of covariates, linear mixed models serve as an alternative method for integrating metataxonomic data sets. However, they primarily focus on differential analysis and the discovery of population structures within samples, as exemplified in the investigation of conditions such as inflammatory bowel disease (31). Conversely, meta-analyses employing a network approach center on elucidating microbial associations and are intrinsically integrative. They can be used to visualize associations between taxa and eventually highlight those that are conserved [for instance, in a study revealing the stability of gut microbiota across diverse geographic populations (32)].
This study presents an integrative bioinformatics workflow based on a network approach for the meta-analysis of public amplicon data sets. The workflow includes steps designed to search for and select public time-series data sets and construct ASV association networks based on co-abundance metrics. Microbial communities are then analyzed by comparing and clustering the ASV networks. We applied this workflow to 10 publicly available data sets and 931 samples of different fermented vegetables with specific sampling times on the microbial assemblages of fermented vegetables. Here, we describe the value of this approach for discovering core bacterial taxa and core associations shared by different vegetables during the process of fermentation.

RESULTS

Design of a bioinformatics workflow for the integration of metataxonomic data sets

We designed a workflow aimed at selecting public metabarcoding studies, re-analyzing the samples, constructing ASV association networks, and finally integrating them by constructing a robust core network and ASV clusters. Figure 1 depicts the main steps of the bioinformatics workflow designed to analyze and integrate the amplicon data sets. The first step involved the careful selection of public data sets focused on the microbial communities of fermented vegetables. Next, ASV count tables were constructed for each of the selected studies. Using these count tables, we then produced ASV association networks for each study, which were based on four sensitive and computationally efficient metrics: Jaccard distance, Pearson and Spearman correlations between relative abundances, and a proportionality measure calculated from clr (centered log-ratio)-transformed abundances . The purpose of the networks was to help visualize how microbial communities interact and evolve dynamically. Finally, the various networks were integrated together. A core network was constructed that identified which bacterial ASVs were common to most fermentations and which associations between ASVs were significantly shared among networks. In addition, a multiple stochastic block model (SBM) clustering method was used to identify a set of ASVs that were associated with each other across the different networks.
Fig 1
Fig 1 Meta-analysis approach for integrating amplicon data sets into microbial association networks to compare microbial communities of fermented vegetables.

Selection of metataxonomic studies on fermented vegetables

Ten data sets meeting our selection criteria (see Materials and Methods) were obtained out of 1,443 studies from SRA (NCBI), 10 studies from MGnify (ENA), and 3 studies from FoodMicrobioNet. All data sets contained sequences of the V3–V4 or V4 hypervariable region, enabling ASV comparison. The selected data sets originated from studies on five different varieties of vegetables (cucumber, carrot, cabbage, pepper, and radish, used alone or in a mixture) and comprised between 18 and 310 samples each, for a total of 931 samples (Table 1). The time scales that were examined varied among studies, as the data sets included between 2 and 12 time points. Depending on the study in question, monitoring began between 0 and 30 days after the beginning of fermentation and ended between 3 and 720 days after. All studies were conducted on spontaneous fermentations, with the exception of PRJNA751723 and PRJNA662831, which included samples from spontaneous fermentations as well as samples inoculated with various LAB (Latilactobacillus curvatus, Leuconostoc gelidum, Latilactobacillus sakei, or Weissella koreensis). Data set PRJEB15657 contained data from two sets of experiments (samples from a laboratory experiment and samples from a citizen science experiment), which we divided into two subsets.
TABLE 1
TABLE 1 Publicly available metataxonomic studies on fermented vegetables used in the present worka
Study identificationAssociated publicationBioproject
identifier
Data origin
and type
No. of
samples
First day time monitoringLast day time monitoringNo. of
time points
MGnify
(MGYS00003504)
Wuyts et al. (13), “Carrot juice fermentations as man-made microbial ecosystems dominated by lactic acid bacteria."
https://doi.org/10.1128/AEM.00134-18
PRJEB15657
ERP017487
Carrot/V431016012
SRAStoll et al. (33), "Influence of salt concentration and iodized table salt on the microbiota of fermented cucumbers." https://doi.org/10.1016/j.fm.2020.103552PRJNA595462SRP237464Cucumber/V4600645
SRAZhang et al. (34), "Dynamics of physicochemical factors and microbial communities during ripening fermentation of Pixian Doubanjiang, a typical condiment in Chinese cuisine."
https://doi.org/10.1016/j.fm.2019.103342
PRJNA544161
SRP19928
Doubanjiang (red pepper, and meju)/V3–V428307203
SRAJung et al. (35), "Role of combinated lactic acid bacteria in bacterial, viral, and metabolite dynamics during fermentation of vegetable food, kimchi."
https://doi.org/10.1016/j.foodres.2022.111261
PRJNA751723
SRP330976
Kimchi (kimchi cabbage, garlic, ginger, and red pepper) + inoculation/V3–V41920306
SRASong et al. (36), "Microbial niches in raw ingredients determine microbial community assembly during kimchi fermentation."
https://doi.org/10.1016/j.foodchem.2020.126481
PRJNA564474
SRP221859
Kimchi (kimchi cabbage, garlic, ginger, and red pepper)/V3–V4521508
SRAWang et al. (37), "Effects of salt concentration on the quality of paocai, a fermented vegetable product from China."
https://doi.org/10.1002/jsfa.11271
PRJNA689239
SRP300069
Paocai (cabbage)/V3–V4181306
SRALi et al. (38), "Metagenomic insights into the changes in microbial community and antimicrobial resistance genes associated with different salt content of red pepper (Capsicum annuum L.) sauce."
https://doi.org/10.1016/j.fm.2019.103295
PRJNA473189
SRP149752
Red pepper/V480310910
FoodMicrobioNetPérez-Díaz et al. (39), “Modulation of the bacterial population in commercial cucumber fermentations by brining salt type.
https://doi.org/10.1111/jam.14597
PRJNA485506
SRP157093
Cucumber/V3–V4721147
SRAHe et al. (40), "Fermentation characteristics and bacterial dynamics during Chinese sauerkraut fermentation by Lactobacillus curvatus LC-20 under varied salt concentrations reveal its potential in low-salt suan cai production."
https://doi.org/10.1016/j.jbiosc.2021.03.009
PRJNA662831
SRP282266
Chinese sauerkraut + inoculation/V4453152
SRAZabat et al. (41), "The impact of vegan production on the kimchi microbiome."
https://doi.org/10.1016/j.fm.2018.04.001
PRJNA418790
SRP125207
Kimchi (ginger, scallion, onion, garlic, red pepper, sugar, radish, cabbage, miso paste, and fish sauce)/V474034
a
The main features of the 10 public metataxonomic studies retained for this analysis are described. The “study identification” column indicates the source that allowed the identification of the data set. The “associated publication” column indicates the first author, title, and DOI of the article that was used to complete the data set metadata. The “BioProject identifier” column indicates the BioProject SRA study identifier that was used to download the corresponding public data sets. The “data origin and type” column indicates the vegetable(s) of origin (with possible indication of additional inoculation, if applicable) of the data set and the sequenced 16S region. The last four columns indicate, respectively, the number of samples in the study, the first and last day of monitoring, and the total number of time points assessed.

Visualization of microbial succession during fermentation through the construction of association networks

Historically, bar graphs have been used to visualize changes in the taxonomic composition of bacterial communities between samples. Microbial association networks highlight taxa with similar changes in abundance and can visually present information that is complementary to bar graphs. For each of the 10 data sets, we built association networks, of which one is presented in Fig. 2A and B (study PRJNA689239, paocai fermentation over 30 days, captured at six time points). This network appeared to be composed of two subnetworks: one containing a high diversity of ASVs (including Pseudomonadales and Enterobacterales) with a weighted mean age (WMA) between 0 and 10 days, and the other containing a lower diversity of ASVs belonging to Enterobacterales and Lactobacillales, with a higher WMA (between 8 and 30 days). This corroborates that there is a shift during fermentation from a broad initial diversity of ASVs to an assemblage dominated by few taxa (in this case, LAB) as observed in many fermentations. Interestingly, we observed the same patterns in the PRJNA564474 study (Fig. 2D; kimchi fermentation over 50 days and eight time points). However, a notable difference from the paocai study was that the first subnetwork was present at WMAs ranging from 0 to 50 days, and the second, composed only of Lactobacillales ASVs, appeared at 10–50 days. This structure suggests that some of the samples failed to ferment, as observed for sample SRR10127549 in the bar graph.
Fig 2
Fig 2 Microbial association networks for studies PRJNA689239 and PRJNA564474 highlight the dynamic evolution of microbial communities during fermentation. (A) and (C) Barplots depicting relative abundances in each sample for studies PRJNA689239 and PRJNA564474, respectively. Samples are ordered by age (the sampling time in days is included in the sample name). A dark gray color indicates a taxonomic order other than Enterobacterales, Lactobacillales, and Pseudomonadales, and NA corresponds to ASVs with unknown taxonomic affiliation at the order level. (B) and (D) ASV association networks for studies PRJNA689239 and PRJNA564474, respectively. Each node represents an ASV; node size reflects its maximum relative abundance, and color represents its taxonomic order. The x-axis corresponds to the weighted mean age of the samples in which the ASV was detected, measured in days, and weighted by ASV relative abundance. An edge between two nodes indicates an association that was detected according to at least one metric.
A similar network pattern was observed for 8 of the 11 networks analyzed (Fig. S1 to S3). The overall pattern could be described as follows: samples initially contained a high diversity of ASVs (featuring Pseudomonadales, in particular) with a low WMA; then, as the WMA increased, nodes corresponding to ASVs from Enterobacterales and then Lactobacillales appeared, with numerous associations between them. However, we would like to emphasize a few points to keep in mind when interpreting these networks. The WMA of an ASV does not reflect the exact time point at which the ASV first appears. Indeed, during each of the vegetable fermentations, all ASVs were present from the beginning of the fermentation process. This measurement may also represent both living and dead bacterial populations because the DNA of dead bacteria may be recovered and sequenced as well. Hence, the use of WMA to organize an association network merely provides a general picture of the temporal dynamics of ASVs over a fermentation process, highlighting the main “peaks” of presence and potential species associations.
We also analyzed the three networks that did not exhibit this succession of communities (PRJNA473189, PRJNA662831, and PRJNA544161; see Fig. S1 to S3). A common feature of these three studies was a shift in timing compared to the others: more precisely, sampling did not start until 3 days after the onset of fermentation. Therefore, it is possible that the successional shift in microbial communities took place before the first sampling point. This hypothesis is supported by the observation that the taxonomic profile of the pepper and sauerkraut samples (PRJNA473189 and PRJNA662831) did not change over time. In the case of doubanjiang (PRJNA544161), a fermented product containing numerous ingredients (beans, soya, rice, and spices), ASVs belonging to Enterobacterales appeared to proliferate relatively late, as observed on the bar graph.

Comparison of association networks to identify a core network of bacterial communities

To integrate the 11 association networks, we constructed a core network, i.e., the intersection of several networks (Fig. 3). In Fig. 3A, it can be seen that the 11 networks shared three vertices (ASVs) overall, and pairwise analyses revealed between 10 and 58 vertices that were shared between a given pair of networks. Similarly, pairwise analyses detected between 3 and 296 edges that were shared by two networks, but no edges were shared by more than nine networks (Fig. 3B). To evaluate the statistical significance of the edge intersections, we compared them with a null model using a Kolmogorov-Smirnov test; the results rejected the null hypothesis that our set of networks followed the same distribution as the null model for intersections between two, three, four, five, or six networks (P-value < 0.05 for 100 cases). This means that those network subsets share associations in a significant way.
Fig 3
Fig 3 Core network and succession of bacterial communities. (A) Boxplot showing the number of vertices in the core networks built from the intersection of 2–11 networks. The dotted gray line corresponds to three vertices. (B) Boxplot showing the number of edges in the core networks built from the intersection of 2–11 networks. (C) Core network built from ASV associations found in at least three networks. The line type of an edge represents the number of times the ASV association was found. The node position on the x-axis is the mean scaled WMA. ASVs are colored by taxonomic order. (D) Boxplot showing the differences in mean scaled WMA between ASVs affiliated with the orders Pseudomonadales, Enterobacterales, and Lactobacillales. (E) Scatterplot of ASVs colored by taxonomic order, depicting their prevalence in relation to mean scaled WMA. (F) Boxplot showing the differences in mean scaled WMA among the genera in the family Lactobacillaceae. Each dot is an ASV.
We then constructed core networks based on the intersections between two and six networks (all shown in Fig. S2). The core network built using microbial associations present in at least three networks (Fig. 3C) included 97 ASVs (out of a total of 975 used to construct the 11 networks). Among them, 13 were affiliated with the order Pseudomonadales, 17 with Enterobacterales, and 25 with Lactobacillales. In representing the core network, we used the scaled WMA on the x-axis. The rationale of the scaled WMA was to normalize time data and establish a common time scale between the various studies. Indeed, the WMAs are not directly comparable between studies because the time points measured varied from one study to another.
Analysis of the different significant core networks revealed that, despite all of the differences between experiments (type of sequencing, fermentation conditions, and time scale), there appeared to be a common temporal structure in the microbial dynamics of fermented vegetables. In particular, after a mean scaled WMA of 0.5, Lactobacillales ASVs tended to predominate. Furthermore, we also observed a shift from the initial microbial population of vegetables to one dominated by Enterobacterales and then a second shift to Lactobacillales.
This observation was confirmed by a clear difference in scaled WMA among all ASVs corresponding to Pseudomonadales, Enterobacterales, and Lactobacillales, as shown in Fig. 3D. Figure 3E highlights this trend and also shows that the ASVs with the lowest and highest scaled WMAs were less often shared among studies (less than three graphs when WMA was lower than −1 or higher than 2) than those with median WMA values. This suggests that the initial community, as well as the LAB present mainly at the end of fermentation, tended to be more specific to a given experiment than other ASVs. Moreover, the ASVs belonging to Enterobacterales were more likely to be shared between networks than those corresponding to Lactobacillales (non-parametric Wilcoxon-Mann-Whitney test, P-value = 0.02). In fact, of the three ASVs that were detected in all experiments, all belonged to the Enterobacterales (Klebsiella, Pectobacterium, and an unidentified Enterobacterales).
Finally, we investigated distinctions between different genera within the family Lactobacillaceae [following the new taxonomy of Zheng et al. (42)] based on the type of fermentation performed. In the core network, ASVs belonging to genera that perform hetero-lactic fermentation were more numerous than those belonging to genera that perform homo-lactic fermentation. Moreover, most members of the Lactobacillales were found in only one graph (143 out of 208, i.e., 69%), and among those shared by more than two graphs, 18 perform heterofermentation and 7 perform homofermentation. We can therefore conclude that LABs are generally highly specific to a fermentation process, and the ASVs that are shared among different processes are mostly heterofermentative. There was no significant difference between the scaled WMA of heterofermentative and homofermentative genera, but we did detect some expected successional shifts in genera (Fig. 3F, Leuconostoc and Lactiplantibacillus, P-value = 0.02).

Multiple clustering to identify putative bacterial consortia shared among studies

To identify sets of ASVs that were connected in similar ways across the 11 microbial association networks, we applied the multiplex stochastic block model graph clustering method. Ten different clusters were identified, which varied in their size and the prevalence and taxonomy of their member ASVs. Clusters 1–5 contained few ASVs (between 5 and 45) that were shared between two or more networks, while clusters 6–10 contained many ASVs (between 94 and 463) that were mainly specific to one network (Fig. 4A). ASVs affiliated with Lactobacillales predominantly belonged to clusters 5, 9, and 10; this last group contained most of the Lactobacillales ASVs and those corresponding to the diverse initial microflora, i.e., those that were specific to each experiment.
Fig 4
Fig 4 Network clustering shows an association between highly prevalent ASVs from orders Lactobacillales and Enterobacterales. (A) Scatterplot of ASVs colored by cluster. The shape corresponds to the taxonomic order. (B) Core network built from ASV associations found in at least three networks, with ASVs colored by cluster. The shape corresponds to the taxonomic order.
Among the different clusters, clusters 1 and 5 were particularly interesting, as they included the majority of ASVs that were shared by more than five networks, and they were the predominant clusters in the core network (Fig. 4B). Moreover, ASVs in the two clusters differed significantly in their scaled WMA (P-value = 0.013). It is possible that the ASVs in these two clusters correspond to successive bacterial communities that are common in vegetable fermentation: cluster 1 appeared to be highly associated with ASVs in the orders Pseudomonadales and Enterobacterales, while cluster 5 represented an assemblage of Enterobacterales and Lactobacillales. This community of conserved ASVs could potentially represent a shared core consortia of early fermentation, including the genera Leuconostoc, Weissella, Latilactobacillus, Lactiplantibacillus, Enterococcus, Levilactobacillus, Lactococcus, Vagococcus, Pediococcus, Companilactobacillus, Brochothrix from Lactobacillales, and Yersinia, Citrobacter, Pseudocitrobacter, Erwinia, Serratia, and Providencia from Enterobacterales; its detailed composition is shown in Table S1.

DISCUSSION

This work presents an integrative bioinformatics approach that uses association networks to combine different independent data sets on the microbial dynamics of different vegetable fermentations. By using relevant network metrics and integration methods on metabarcoding data, we obtained valuable insights into bacterial community structure during different phases of fermentation. Historically, association networks have been used to detect potential inter-species interactions; here, we adapted this strategy to identify and visualize ASVs with similar temporal dynamics. To our knowledge, this work is the first to construct a core network representing the fermentation of different vegetables throughout time based on sequence data from multiple independent data sets. By integrating several public data sets together, we were able to characterize two successional shifts that were conserved among different fermentation ecosystems: the first from the initial microbial population of vegetables to Enterobacterales, and the second to an assemblage dominated by Lactobacillales. To test the significance of the core network we obtained, we used an approach based on comparison to a null model, which was similar to that developed by Röttjers et al. (22), with a sampling of random graphs similar to Doane et al. (43). Indeed, the identification of core networks is a more challenging task than the computation of the global intersection network (21). With these tests, we determined that some intersections between networks would not be expected by random chance, and thus that some edges may correspond to genuine ASV dynamics shared among several studies. Finally, we complemented this approach by using the SBM method for ASV clustering, which is a technique applicable to multiplexes (a type of multi-layer network) that does not require any a priori assumptions regarding connectivity patterns. The SBM model has been used for community detection in various fields, such as sociology. More recently, it has been applied to taxonomic profiling of the human microbiome in order to uncover patterns of community structure. Specifically, it was used as a bipartite model for clustering samples and taxa (44). In another study, the simple SBM enabled the detection of OTU clusters based on their connectivity patterns in a co-occurrence network (45). In the present work, we applied the multiplex version of this model to a collection of networks in order to identify clusters of ASVs that share similar patterns of associations across the different networks. We were able to identify 10 clusters of ASVs, which could be used to guide the exploration and delineation of new bacterial consortia in fermented vegetables (46).
With respect to the microbial ecology of fermented vegetables, our methodology not only confirms previously proposed hypotheses on bacterial succession from individual studies but also brings to light novel insights. Two expected successional shifts are observed, one from the initial very rich and diverse microbiota to Enterobacterales, and the second to an assemblage dominated by a few abundant Lactobacillales specific to the fermented product. However, contrary to expectations, we did not detect the anticipated succession between heterofermentative and homofermentative genera in our study. Our study also showed that Enterobacterales ASVs were more widely shared than Lactobacillales ASVs. Finally, our most important finding was the recurring and transient appearance, at the beginning of fermentation, of ASVs belonging to Enterobacterales and their association with ASVs affiliated with Lactobacillales. This raises the question of the ecological function of Enterobacterales in vegetable fermentation and their impact on the properties of the final product. Due to the small number of studies carried out on the subject and the extensive variability in the methodologies used, most reports have not generated convincing conclusions on the impact of Enterobacterales and their possible interactions with LAB. Nevertheless, based on the existing literature, several hypotheses can be put forward. Enterobacterales may have fermentative properties or they may participate in nutritional mutualism that is beneficial to the development of LAB. Indeed, certain trophic relationships between LAB and Enterobacteriaceae have already been described. For example, some LAB generate metabolic energy using an agmatine deiminase pathway that relies on agmatine produced by Enterobacteriaceae (47). In the wet coffee fermentation process, the first phase involves interactions between Enterobacteriaceae (with pectinolytic activity), acetic acid bacteria, and some yeasts (48). Enterobacteriaceae have also been found in two other studies on fermented vegetables (49, 50), of which the former hypothesizes that the presence of Erwinia sp. may reflect its ability to invade compromised plant tissues or its potential ability to ferment sugar.
This meta-analysis demonstrates the value of using public data sets in an Open Data and Open Science framework. The approach we designed is particularly well suited to fermented vegetable ecosystems: since these ecosystems are closed, contain relatively few taxa, and undergo a temporal succession of communities, the representation of ASV association networks is fairly easy to visualize and interpret. This approach could be easily applied to amplicon or shotgun metagenomic data on closed ecosystems with community shifts. One limitation of the present meta-analysis is that it was carried out on a relatively small scale (on 10 independent data sets including a total of 931 samples), due to the small number of reusable public metabarcoding data sets on fermented vegetables. Hence, biological results could be confirmed by investigating additional data sets. This limitation is mainly due to difficulties in accessing raw data (some samples are missing, some data are pre-processed, etc.) and metadata (sometimes incomplete and inconsistent, with manual extract from paper required). Indeed, these limitations were highlighted in a recent article (51), which recommended that data be deposited in public repositories together with assay metadata (technical features of the experiment) and biological metadata (environmental conditions of the biosamples). This, along with the adoption of other best practices, will enable wider reuse and integration of microbiome data sets on a broader scale.
This study is based on 16S metataxonomic data, more specifically, the V4 hypervariable region because it was used in the majority of the data sets found. This region is the most frequent target of studies focused on food ecosystems, along with the V3–V4 region of the 16S rRNA gene (1). Unfortunately, this gene region has poor discriminatory power; it is able to provide reliable taxonomic assignment at the genus level only and cannot be used to study species-level diversity [unlike, for instance, the V1–V3 region (52)]. Therefore, although it is interesting to discover ASVs that are shared between different studies, this approach is ill-suited for characterizing the species- and strain-level diversity of Lactobacillales and Enterobacterales. Furthermore, the read count tables obtained for the different studies can be shaped by many biases, including differences in sample collection and storage, DNA extraction method and primer choice, variation in the number of rRNA operons (52, 53), amplification of extracellular DNA, and errors in taxonomic affiliations. Therefore, the results of any individual ASV count table must be interpreted cautiously. However, in the context of our study, the use of ASVs enabled direct comparison of sequences between studies and reduced the influence of taxonomic misclassifications (54, 55). In addition, integrating ASVs into association networks allowed comparisons of similar dynamics between ASVs in different studies and limited the biases that might arise from direct comparison of relative abundances.
This work demonstrates the effectiveness of using association networks for temporal meta-analysis. The approach we developed could easily be applied to new data sets or extended to incorporate new tools for association network inference, core network detection, and clustering. In the future, it could be interesting to integrate additional sample metadata (such as temperature, lactic acid concentration, pH, and/or salinity) if they were available in a standardized format and could be easily integrated into an association network. Indeed, many factors can influence the composition and dynamics of the fermenting microbial community, as shown previously for salinity or temperature (3). This approach could lead to the design of ideal consortia that could make vegetable fermentation safer (56), more reproducible, and exploitable on a large scale (57).
Finally, the taxonomic profile inferred from 16S rRNA is not able to provide insights into the functional profile of bacterial communities or into the part(s) played by other microorganisms [even if their presence is minor, e.g., less than 5% relative abundance for fungi and Archaea in brine food according to Leech et al. (4)]. Ultimately, there is a need for complementary functional studies (shotgun metagenomics and metatranscriptomics) to improve our understanding of vegetable fermentation and assess the functional interactions taking place during this process between all microorganisms.

MATERIALS AND METHODS

Study selection

Data sets were obtained from three repositories: the MGnify database (on microbiome data), the FoodMicrobioNet database (on food ecosystems), and the NCBI SRA database. Studies focused on the microbial ecosystems in fermented vegetables were identified in MGnify by selecting the biome “Food production” and filtering with the term “Fermented vegetables,” while in FoodMicrobioNet, we used the spoilage filter “Fermented” from studies labeled with “Vegetables and vegetable products.” From NCBI/SRA, we retrieved studies with the Taxonomy IDs “Food metagenome” (870726), “Fermentation metagenome” (1326787), and “Food fermentation metagenome” (1154581). Of the resulting studies, the only ones that were considered were those whose “SRA Run Selector” metadata contained the words “day,” “week,” “month,” “hour,” or “time” and that had an associated publication on fermented vegetables.
We included only studies that examined at least two time points, contained more than 10 samples, and were associated with a publication (to ensure access to extensive metadata). Finally, we retained only studies that sequenced the V4 or V3–V4 hypervariable region of the 16S rRNA gene to permit comparisons of ASVs. Raw sequencing data of the resulting selected studies were retrieved from the NCBI SRA repository using homemade scripts.

Construction of ASV count tables

Sequencing data from each study were processed using the dada2 pipeline (54) for read quality control, read filtering and trimming (with parameters truncLen = 240 or 220 depending on read length, maxN = 0, maxEE = 2, truncQ = 2), error rate learning and ASV inference, paired read assembly (with parameter minOverlap = 3), chimera removal, and taxonomic assignment to kingdom, phylum, class, order, family, and genus (using Silva database nr 99 v 138. 1). For the five studies in which the V3–V4 region of the 16S rRNA gene was sequenced, only the V4 region was retained. The ASV count table for each sample, the ASV taxonomy table, and the sample metadata were combined into one phyloseq object (58) for each study. ASVs matching mitochondrial or chloroplast DNA and samples from negative fermentation controls were excluded from the count tables.

Inference of microbial association networks

For each study, a count table was filtered to create a microbial association network. Only non-control samples with more than 15,000 reads and ASVs found in at least three samples and with an average relative abundance greater than 1e-5 were included. We chose association metrics that take into account co-presence, with Jaccard distance, as well as co-abundance, with Pearson and Spearman correlations based on relative abundances. The proportionality measure proposed by Lovell et al. (59) (Φ(a,b)=var(clr(a)clr(b))var(clr(a)+clr(b))) was also used following centered log-ratio transformation, performed using the function aldex.clr from the package ALDEx2 (60). Edges were traced if at least one of these four measures reached a non-stringent threshold (0.4 for Jaccard distance and 0.5 for the three other measures). The thickness of each edge reflected the number of combined metrics supporting it. A force-driven algorithm (Fruchterman-Reingold) was used to calculate the layout of each association network. This layout was preserved on the y-axis, but the x-axis was modified: the position of each ASV was the mean age of the samples in which the ASV was present, weighted by its relative abundance (hereafter named WMA for weighted mean age).

Core network construction

The core network was constructed based on the intersections of the independent association networks created for each study. To account for the different sampling time points and fermentation rates among studies, the x-axis position of each ASV in the core network corresponded to the average of its centered and scaled positions in the original networks. A null-model statistical test was used to assess the significance of the core networks constructed from edges shared by a subset of networks or by all networks. First, we generated 100 sets of networks with the same nodes as the networks of interest but with random edges, using the “rewire” function of the igraph R package with prob = 1. Next, the distribution of edges shared by a given subset of networks or by all networks was compared between each null model and the studied set of networks with a Kolmogorov-Smirnov test.

SBM multiplex clustering

Multiplex networks refer to a collection of networks involving the same sets of nodes but originating from different types of relationships. Here, each network corresponded to a specific study and each node corresponded to an ASV. SBM clustering was applied to multiplex networks to assign each ASV to a community (or block) according to its connection patterns. The estimateMultiplexSBM function from the R package sbm (61) was used with a Poisson model describing the relationship between the nodes. The number of blocks was chosen using a penalized likelihood criterion ( integrated completed likelihood [ICL]), and the likelihood maximization was obtained via a variational version of the expectation-maximization algorithm.

Statistical analysis and figure construction

To compare WMA or the prevalence among studies of ASVs belonging to different groups (taxonomic rank or SBM cluster), the non-parametric Wilcoxon-Mann-Whitney test was performed. To create figures, the R packages ggplot2, viridis, ggpubr, and ComplexHeatmap were used (62, 63).

ACKNOWLEDGMENTS

We are grateful to INRAE MICA Division and the ENS Paris-Saclay for the funding of this PhD, INRAE MIGALE bioinformatics facility (MIGALE, INRAE, 2020. Migale bioinformatics Facility, doi:10.15454/1.5572390655343293E12) for providing computing and storage resources, and to Lindsay Higgins for proofreading the manuscript.
R.J., Conceptualization, Methodology, Writing – original draft, Writing – review and editing; F.V., Supervision, Writing – original draft, Writing – review and editing; M-Y.M., Supervision, Writing – original draft, Writing – review and editing; S.C., Conceptualization, Methodology, Supervision, Writing – original draft, Writing – review and editing; H.C., Conceptualization, Funding acquisition, Methodology, Supervision, Writing – original draft, Writing – review and editing.

SUPPLEMENTAL MATERIAL

Supplemental material - spectrum.00312-24-s0001.pdf
Table S1; Fig. S1-S4.
ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.

REFERENCES

1.
Parente E, Zotta T, Ricciardi A. 2022. Foodmicrobionet V4: a large, integrated, open and transparent database for food bacterial communities. Int J Food Microbiol 372:109696.
2.
Mitchell AL, Almeida A, Beracochea M, Boland M, Burgin J, Cochrane G, Crusoe MR, Kale V, Potter SC, Richardson LJ, Sakharova E, Scheremetjew M, Korobeynikov A, Shlemov A, Kunyavskaya O, Lapidus A, Finn RD. 2020. MGnify: the microbiome analysis resource in 2020. Nucleic Acids Res 48:D570–D578.
3.
Thierry A, Baty C, Marché L, Chuat V, Picard O, Lortal S, Valence F. 2023. Lactofermentation of vegetables: an ancient method of preservation matching new trends. Trends Food Sci Technol 139:104112.
4.
Leech J, Cabrera-Rubio R, Walsh AM, Macori G, Walsh CJ, Barton W, Finnegan L, Crispie F, O’Sullivan O, Claesson MJ, Cotter PD. 2020. Fermented-food metagenomics reveals substrate-associated differences in taxonomy and health-associated and antibiotic resistance determinants. mSystems 5:e00522-20.
5.
Xu M, Su S, Zhang Z, Jiang S, Zhang J, Xu Y, Hu X. 2022. Two sides of the same coin: meta-analysis uncovered the potential benefits and risks of traditional fermented foods at a large geographical scale. Front Microbiol 13:1045096.
6.
Lebeer S, Vanderleyden J, De Keersmaecker SCJ. 2008. Genes and molecules of lactobacilli supporting probiotic action. Microbiol Mol Biol Rev 72:728–764.
7.
Melini F, Melini V, Luziatelli F, Ficca AG, Ruzzi M. 2019. Health-promoting components in fermented foods: an up-to-date systematic review. Nutrients 11:1189.
8.
Di Cagno R, Coda R, De Angelis M, Gobbetti M. 2013. Exploitation of vegetables and fruits through lactic acid fermentation. Food Microbiol 33:1–10.
9.
Medina-Pradas E, Pérez-Díaz IM, Garrido-Fernández A, Arroyo-López FN. 2017. Chapter 9 - review of vegetable fermentations with particular emphasis on processing modifications, microbial ecology, and spoilage, p 211–236. In Bevilacqua A, Corbo MR, Sinigaglia M (ed), The microbiological quality of food. Woodhead Publishing Series in Food Science, Technology and Nutrition, Woodhead Publishing.
10.
Ashaolu TJ, Reale A. 2020. A holistic review on euro-asian lactic acid bacteria fermented cereals and vegetables. Microorganisms 8:1176.
11.
Parente E, Zotta T, Ricciardi A. 2022. A review of methods for the inference and experimental confirmation of microbial Association networks in cheese. Int J Food Microbiol 368:109618.
12.
Portik D, Dutton RJ, Ashby M. 2023. Longitudinal, multi-platform metagenomics yields a high-quality genomic catalog and guides an in vitro model for cheese communities. mSystems.
13.
Wuyts S, Van Beeck W, Oerlemans EFM, Wittouck S, Claes IJJ, De Boeck I, Weckx S, Lievens B, De Vuyst L, Lebeer S. 2018. Carrot juice fermentations as man-made microbial ecosystems dominated by lactic acid bacteria. Appl Environ Microbiol 84:e00134-18.
14.
Yang H, Wu H, Gao L, Jia H, Zhang Y, Cui Z, Li Y. 2016. Effects of Lactobacillus curvatus and Leuconostoc mesenteroides on Suan cai fermentation in northeast China. J Microbiol Biotechnol 26:2148–2158.
15.
Berry D, Widder S. 2014. Deciphering microbial interactions and detecting keystone species with co-occurrence networks. Front Microbiol 5:219.
16.
Parente E, Zotta T, Faust K, De Filippis F, Ercolini D. 2018. Structure of association networks in food bacterial communities. Food Microbiol 73:49–60.
17.
Holman DB, Brunelle BW, Trachsel J, Allen HK. 2017. Meta-analysis to define a core microbiota in the swine gut. mSystems 2:e00004-17.
18.
Lv X, Zhao K, Xue R, Liu Y, Xu J, Ma B. 2019. Strengthening insights in microbial ecological networks from theory to applications. mSystems 4:e00124-19.
19.
Banerjee S, Schlaeppi K, van der Heijden MGA. 2018. Keystone taxa as drivers of microbiome structure and functioning. Nat Rev Microbiol 16:567–576.
20.
Röttjers L, Faust K. 2019. Can we predict Keystones? Nat Rev Microbiol 17:193–193.
21.
Faust K. 2021. Open challenges for microbial network construction and analysis. ISME J 15:3111–3118.
22.
Röttjers L, Vandeputte D, Raes J, Faust K. 2021. Null-model-based network comparison reveals core associations. ISME Commun 1:36.
23.
Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ. 2017. Microbiome datasets are compositional: and this is not optional. Front Microbiol 8:2224.
24.
Chen L, Wan H, He Q, He S, Deng M. 2022. Statistical methods for microbiome compositional data network inference: a survey. J Comput Biol 29:704–723.
25.
Faust K, Raes J. 2016. Conet app: inference of biological association networks using cytoscape. F1000Res 5:1519.
26.
Friedman J, Alm EJ. 2012. Inferring correlation networks from genomic survey data. PLOS Comput Biol 8:e1002687.
27.
Kurtz ZD, Müller CL, Miraldi ER, von Mering C, Bonneau RA, Blaser MJ, Littman DR. 2015. Sparse and Compositionally robust inference of microbial ecological networks. PLOS Comput Biol 11:e1004226.
28.
Tavakoli S, Yooseph S. 2019. Learning a mixture of microbial networks using minorization–maximization. Bioinformatics 35:i23–i30.
29.
Hirano H, Takemoto K. 2019. Difficulty in inferring microbial community structure based on co-occurrence network approaches. BMC Bioinformatics 20:329.
30.
Weiss S, Van Treuren W, Lozupone C, Faust K, Friedman J, Deng Y, Xia LC, Xu ZZ, Ursell L, Alm EJ, Birmingham A, Cram JA, Fuhrman JA, Raes J, Sun F, Zhou J, Knight R. 2016. Correlation detection strategies in microbial data sets vary widely in sensitivity and precision. ISME J 10:1669–1681.
31.
Ma S, Shungin D, Mallick H, Schirmer M, Nguyen LH, Kolde R, Franzosa E, Vlamakis H, Xavier R, Huttenhower C. 2022. Population structure discovery in meta-analyzed microbial communities and inflammatory bowel disease using MMUPhin. Genome Biol 23:208.
32.
Jackson MA, Bonder MJ, Kuncheva Z, Zierer J, Fu J, Kurilshikov A, Wijmenga C, Zhernakova A, Bell JT, Spector TD, Steves CJ, et al. 2018. Detection of stable community structures within gut microbiota co-occurrence networks from different human populations. PeerJ 6:e4303.
33.
Stoll DA, Müller A, Meinhardt A-K, Dötsch A, Greiner R, Kulling SE, Huch M. 2020. Influence of salt concentration and Iodized table salt on the Microbiota of fermented cucumbers. Food Microbiology 92:103552.
34.
Zhang L, Che Z, Xu W, Yue P, Li R, Li Y, Pei X, Zeng P. 2020. Dynamics of Physicochemical factors and microbial communities during ripening fermentation of Pixian Doubanjiang, a typical condiment in Chinese cuisine. Food Microbiology 86:103342.
35.
Jung M-J, Kim J, Lee SH, Whon TW, Sung H, Bae J-W, Choi Y-E, Roh SW. 2022. Role of Combinated lactic acid bacteria in bacterial, viral, and metabolite Dynamics during fermentation of vegetable food, Kimchi. Food Research International 157:111261.
36.
Song HS, Whon TW, Kim J, Lee SH, Kim JY, Kim YB, Choi H-J, Rhee J-K, Roh SW. 2020. Microbial niches in raw ingredients determine microbial community assembly during Kimchi fermentation. Food Chemistry 318:126481.
37.
Wang X, Song G, He Z, Zhao M, Cao X, Lin X, Ji C, Zhang S, Liang H. 2021. Effects of salt concentration on the quality of Paocai, a fermented vegetable product from China. Journal of the Science of Food and Agriculture 101:6202–6210.
38.
Li Z, Dong L, Zhao C, Zhu Y. 2020. Metagenomic insights into the changes in microbial community and antimicrobial resistance genes associated with different salt content of red pepper (capsicum Annuum L.) sauce. Food Microbiology 85:103295.
39.
Pérez‐Díaz IM, Dickey AN, Fitria R, Ravishankar N, Hayes J, Campbell K, Arritt F. 2020. Modulation of the bacterial population in commercial cucumber fermentations by brining salt type. Journal of Applied Microbiology 128:1678–1693.
40.
He J, Li F, Wang Y, Wu H, Yang H. 2021. Fermentation characteristics and bacterial Dynamics during Chinese sauerkraut fermentation by Lactobacillus Curvatus LC-20 under varied salt concentrations reveal its potential in low-salt Suan cai production. Journal of Bioscience and Bioengineering 132:33–40.
41.
Zabat MA, Sano WH, Cabral DJ, Wurster JI, Belenky P. 2018. The impact of Vegan production on the Kimchi Microbiome. Food microbiology 74:171–178.
42.
Zheng J, Wittouck S, Salvetti E, Franz CMAP, Harris HMB, Mattarelli P, O’Toole PW, Pot B, Vandamme P, Walter J, Watanabe K, Wuyts S, Felis GE, Gänzle MG, Lebeer S. 2020. A Taxonomic note on the genus Lactobacillus: description of 23 novel genera, emended description of the genus Lactobacillus Beijerinck 1901, and union of Lactobacillaceae and Leuconostocaceae. Int J Syst Evol Microbiol 70:2782–2858.
43.
Doane MP, Reed MB, McKerral J, Farias Oliveira Lima L, Morris M, Goodman AZ, Johri S, Papudeshi B, Dillon T, Turnlund AC, Peterson M, Mora M, de la Parra Venegas R, Pillans R, Rohner CA, Pierce SJ, Legaspi CG, Araujo G, Ramirez-Macias D, Edwards RA, Dinsdale EA. 2023. Emergent community architecture despite distinct diversity in the global whale shark (Rhincodon typus) epidermal microbiome. Sci Rep 13:12747.
44.
Cobo-López S, Gupta VK, Sung J, Guimerà R, Sales-Pardo M. 2022. Stochastic block models reveal a robust nested pattern in healthy human gut microbiomes. PNAS Nexus 1:pgac055.
45.
Hall CV, Lord A, Betzel R, Zakrzewski M, Simms LA, Zalesky A, Radford-Smith G, Cocchi L. 2019. Co-existence of network architectures supporting the human gut microbiome. iScience 22:380–391.
46.
Ibrahim M, Raajaraam L, Raman K. 2021. Modelling microbial communities: harnessing consortia for biotechnological applications. Comput Struct Biotechnol J 19:3892–3907.
47.
Michael GG. 2015. Lactic metabolism revisited: metabolism of lactic acid bacteria in food fermentations and food spoilage. Curr Opinion Food Sci 2:106–117.
48.
Pothakos V, De Vuyst L, Zhang SJ, De Bruyn F, Verce M, Torres J, Callanan M, Moccand C, Weckx S. 2020. Temporal shotgun metagenomics of an ecuadorian coffee fermentation process highlights the predominance of lactic acid bacteria. Curr Res Biotechnol 2:1–15.
49.
Raghuvanshi R, Grayson AG, Schena I, Amanze O, Suwintono K, Quinn RA. 2019. Microbial transformations of organically fermented foods. Metabolites 9:165.
50.
Yasir M, Al-Zahrani IA, Bibi F, Abd El Ghany M, Azhar EI. 2022. New insights of bacterial communities in fermented vegetables from shotgun metagenomics and identification of antibiotic resistance genes and probiotic bacteria. Food Res Int 157:111190.
51.
Huttenhower C, Finn RD, McHardy AC. 2023. Challenges and opportunities in sharing microbiome data and analyses. Nat Microbiol 8:1960–1970.
52.
Poirier S, Rué O, Peguilhan R, Coeuret G, Zagorec M, Champomier-Vergès M-C, Loux V, Chaillou S. 2018. Deciphering intra-species bacterial diversity of meat and seafood spoilage Microbiota using gyrB Amplicon sequencing: A comparative analysis with 16S rDNA V3-V4 Amplicon sequencing. PLoS ONE 13:e0204629.
53.
Poirier S, Rué O, Coeuret G, Champomier-Vergès M-C, Loux V, Chaillou S. 2018. Detection of an amplification bias associated to leuconostocaceae family with a universal primer routinely used for monitoring microbial community structures within food products. BMC Res Notes 11:802.
54.
Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. 2016. DADA2: high-resolution sample inference from Illumina amplicon data. Nat Methods 13:581–583.
55.
Callahan BJ, McMurdie PJ, Holmes SP. 2017. Exact sequence variants should replace operational taxonomic units in marker-gene data analysis. ISME J 11:2639–2643.
56.
Capozzi V, Fragasso M, Romaniello R, Berbegal C, Russo P, Spano G. 2017 Spontaneous food fermentations and potential risks for human health. Fermentation 3:49.
57.
Torres S, Verón H, Contreras L, Isla MI. 2020. An overview of plant-autochthonous microorganisms and fermented vegetable foods. Food Sci Human Wellness 9:112–123.
58.
McMurdie PJ, Holmes S. 2013. Phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLOS ONE 8:e61217.
59.
Lovell D, Pawlowsky-Glahn V, Egozcue JJ, Marguerat S, Bähler J. 2015. Proportionality: a valid alternative to correlation for relative data. PLOS Comput Biol 11:e1004075.
60.
Quinn TP, Erb I, Gloor G, Notredame C, Richardson MF, Crowley TM. 2019. A field guide for the compositional analysis of any-omics data. Gigascience 8:giz107.
61.
Aubert J, Barbillon P, Donnet S. 2022. Using latent block models to detect structure in ecological networks. Stat Appro Hidden Variables Ecol:117–134.
62.
Gu Z. 2022. Complex heatmap visualization. iMeta 1.
63.
Wickham H. 2011. Ggplot2. WIREs Computational Stats 3:180–185.

Information & Contributors

Information

Published In

cover image Microbiology Spectrum
Microbiology Spectrum
Volume 12Number 64 June 2024
eLocator: e00312-24
Editor: Kate S. Howell, University of Melbourne, Parkville, Victoria, Australia
PubMed: 38747598

History

Received: 27 February 2024
Accepted: 26 March 2024
Published online: 15 May 2024

Keywords

  1. metataxonomic study
  2. data integration
  3. microbial association network
  4. fermented vegetables
  5. meta-analysis

Data Availability

No new research data was generated in the preparation of this article.

Contributors

Authors

MaIAGE, INRAE, Université Paris-Saclay, Jouy-en-Josas, France
Author Contributions: Conceptualization, Methodology, Writing – original draft, and Writing – review and editing.
STLO, Agrocampus Ouest, INRAE, Rennes, France
Author Contributions: Supervision, Writing – original draft, and Writing – review and editing.
MaIAGE, INRAE, Université Paris-Saclay, Jouy-en-Josas, France
Author Contributions: Supervision, Writing – original draft, and Writing – review and editing.
MICALIS, INRAE, Université Paris-Saclay, Jouy-en-Josas, France
Author Contributions: Conceptualization, Methodology, Supervision, Writing – original draft, and Writing – review and editing.
MaIAGE, INRAE, Université Paris-Saclay, Jouy-en-Josas, France
Author Contributions: Conceptualization, Funding acquisition, Methodology, Supervision, Writing – original draft, and Writing – review and editing.

Editor

Kate S. Howell
Editor
University of Melbourne, Parkville, Victoria, Australia

Notes

The authors declare no conflict of interest.

Metrics & Citations

Metrics

Note:

  • For recently published articles, the TOTAL download count will appear as zero until a new month starts.
  • There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.
  • Citation counts come from the Crossref Cited by service.

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. For an editable text file, please select Medlars format which will download as a .txt file. Simply select your manager software from the list below and click Download.

View Options

Figures

Tables

Media

Share

Share

Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy