INTRODUCTION
Microorganisms conduct key biogeochemical processes involved in the main fluxes of matter and energy on Earth. Most microbial diversity remains uncultured, and only analyses of environmental DNA samples have made it possible to unravel existing microbial diversity and to identify the main species involved (
1). Indeed, out of 112 known bacterial phyla in the Genome Taxonomy Database (GTDB) (
2), more than one half are still recognized only from environmental sequences (
3).
One of the phyla that was first identified using molecular phylogenetic methods was
Gemmatimonadota (also called
Gemmatimonadetes [
4]), which was originally established as the so-called BD group based on five 16S rRNA gene sequences, which originated from deep-sea sediments, soils, and reactor sludge (
5,
6). Independently, the group was also proposed as a candidate division KS-B based on three 16S rRNA gene sequences retrieved from coastal sediment samples from French Guiana (
7).
The first cultured strain T27 belonging to the BD/KS-B group was isolated from a wastewater treatment plant in Japan. The isolate was named
Gemmatimonas aurantiaca and established the new phylum
Gemmatimonadota, along with its first class
Gemmatimonadetes and genus
Gemmatimonas (
8). Subsequently, three more
Gemmatimonadota genera (
Gemmatirosa,
Longimicrobium, and
Roseisolibacter) with type strains were described from various soil environments (
9–11). Apart from class
Gemmatimonadetes, phylum
Gemmatimonadota consists of four more class-level groups which include class
Longimicrobia, two terrestrial groups (BD2-11 and S0134), and one marine benthic group (PAUC43f) (
10,
12). The four cultured representatives from the phyla were all chemo-organoheterotrophs that require oxygen and grow under fully aerobic or semiaerobic conditions (
8–11). An interesting metabolic potential and ecological role was reported for
G. aurantiaca, as this species has the ability to reduce the greenhouse gas N
2O (
13). However, with the discovery of
Gemmatimonas phototrophica, which contains photosynthetic reaction centers (
14,
15),
Gemmatimonadota were added to several bacterial phyla containing anoxygenic phototrophic species alongside
Proteobacteria,
Chlorobi (now included as a class-level lineage in
Bacteroidota [
2]),
Chloroflexota,
Firmicutes (
Bacillota),
Acidobacteriota, and the newly discovered phylum “
Candidatus Eremiobacterota” (WPS-2) (
14,
16,
17). Anoxygenic phototrophs, such as
G. phototrophica, are able to support their metabolism by harvesting light using bacteriochlorophylls; however, they require a supply of organic substrate for growth (
18). Another characteristic found in
G. phototrophica is the organization of its photosynthesis genes into a cluster called the photosynthesis gene cluster (PGC). Interestingly, the gene arrangement in the PGC of
G. phototrophica is very similar to the one found in
Proteobacteria, so it has been suggested that phototrophy in
Gemmatimonadota originates from an ancient horizontal gene transfer of the entire PGC from
Proteobacteria (
14). As yet, this is the only known case of horizontal transfer of an entire set of photosynthesis genes between distant bacterial phyla (
14,
19).
Information about the prevalence of
Gemmatimonadota in different habitats is continuously growing, although information about their ecology is scarce. Members of this phylum were found in many natural environments (
12,
20–22) and represent the eighth most abundant phylum in soils, accounting for about 1 to 2% of bacteria in soils worldwide (
23). Their highest contributions are typically found in fertile agricultural and forest soils (
20) but are also present in more unique soil environments, such as arid Antarctic Dry Valley soils (
24,
25). It has been suggested that
Gemmatimonadota may be relatively more abundant in dry soils (
26). On the other hand, from the available data, it is known that they are also present in aquatic environments, such as freshwater lakes (
27), sediments (
22,
28–30), and estuaries (
31,
32). In addition,
G. phototrophica was isolated from a freshwater lake in Inner Mongolia (
18). However, this organism does not grow in liquid culture and requires microaerophilic conditions, which are more typical for sediment-dwelling species. Thus, the data showing that
Gemmatimonadota prefer dry environments does not seem to be universal. There is probably a large ecological and functional diversity among the members of
Gemmatimonadota. The question remains whether
Gemmatimonadota-related sequences identified in lakes originate from strictly limnic species, or perhaps they are just a passive component that enters the lakes along with runoff waters from surrounding soil.
Current progress in sequencing technologies and bioinformatics has circumvented the necessity for cultivated representatives and allowed biological and ecological inferences to be drawn by using genomic data recovered directly from microbial communities. Over the past few years, the usage of metagenome-assembled genomes (MAGs) has allowed the description of many novel bacterial divisions and unearthed large radiations in the prokaryotic tree of life (
33). This approach has already led to the discovery of new phototrophic organisms belonging to the yet uncultured candidate phylum “
Ca. Eremiobacterota” (
16,
34). Also, two MAGs belonging to
Gemmatimonadota were recovered from Lake Baikal. One MAG was more similar to
Gemmatirosa kalamazoonensis found in soils and seemed more abundant at a depth of 5 m. The other MAG-encoded rhodopsin gene and was closely related to the phototrophic species
G. phototrophica and showed a higher abundance at 20 m (
35).
Therefore, in order to address the question whether there are any truly limnic
Gemmatimonadota and to investigate their diversity, we analyzed metagenome data from five freshwater lakes in Czechia and Switzerland. The lakes were chosen based on their trophic status and included a representative mesoeutrophic Římov Reservoir, a dystrophic Jiřická pond, oligomesotrophic Lake Zurich and Lake Constance, and an ultraoligotrophic Lake Thun (
Table 1). The metagenome sequences were collected over several years and seasons. The reconstructed MAGs were analyzed with the aim to identify the most common freshwater and photoheterotrophic
Gemmatimonadota and to analyze their spatiotemporal variability. Using catalyzed reporter deposition-fluorescence
in situ hybridization (CARD-FISH), cells of
Gemmatimonadota were visualized for the first time in their natural environment, and their association with other organisms was observed.
DISCUSSION
In the presented work, we clearly demonstrated that
Gemmatimonadota were permanently present in all of the studied lakes showing the ubiquitous nature of this phylum in contrasting freshwater lakes ranging from ultraoligotrophic to mesoeutrophic. The freshwater
Gemmatimonadota community followed seasonal patterns with water temperature as the main driving variable, especially in the epilimnion. This observation further supports the idea that the studied
Gemmatimonadota are truly limnic, since abundances of passively transferred microorganisms correlate with water mass movements rather than with temperature (
42). The stratification of the lake also seemed to have a significant effect on the
Gemmatimonadota community, as clusters inhabiting the epilimnion differed from those present in the hypolimnion, suggesting that they can be metabolically diverse and be specialized in different ecological roles. While
Gemmatimonadota were relatively stable in the hypolimnion, their numbers in the epilimnion varied with higher numbers in late summer and autumn and lower numbers in spring and early summer. The stratification effect was previously studied in Grand Lake (OK, USA) where the
Gemmatimonadota community was present in late summer in the free-living fraction within the thermocline and hypolimnion. The highest abundance in the hypolimnion at this time seemed to be associated with the sedimentation of organic matter, turbidity, and a lack of oxygen that developed in the thermocline and hypolimnion (
43). The
Gemmatimonadota community in Grand Lake was mostly composed of the genus
Gemmatimonas, which is suggested to participate in the degradation of organic matter after an algal bloom lysis (
44). In contrast, we observed that the genus
Gemmatimonas was present in both the epilimnion and hypolimnion, but it appeared with higher abundances in the epilimnion, while the hypolimnion community was dominated by other clusters, including phototrophic group 1. However, we could also correlate the highest abundance of the genus
Gemmatimonas in the epilimnion (15-Aug-16) with the highest abundance of the cyanobacterial community in the same metagenome sample (
41). The prokaryotic community of metagenomes from Římov Reservoir was analyzed previously (
41) showing
Actinobacteria as persistently abundant in Římov Reservoir through all seasons and in both the epilimnion and hypolimnion. Other more abundant groups in the epilimnion and the hypolimnion were
Alphaproteobacteria,
Bacteroidetes, and
Burkholderiales (previously
Betaproteobacteriales). Instead, while
Cyanobacteria were abundant in the epilimnion, they were recovered in smaller numbers, which the authors attribute to the method of sampling where most of the filamentous
Cyanobacteria are removed.
The clear difference between samples from the epilimnion and hypolimnion was also observed using epifluorescence microscopy. The highest number of small-sized hybridized cells was detected in the hypolimnion of Římov Reservoir. In contrast, the samples from the epilimnion contained many
Gemmatimonadota cells attached or associated with photosynthetic organisms: diatoms (
Fragilaria sp.) or cyanobacteria (
Microcystis sp.) (
Fig. 5a and
b and
Fig. 6). Since both phototrophic and heterotrophic
Gemmatimonadota require an organic source of carbon, they may benefit from the input of organic carbon from algae and cyanobacteria, providing in exchange inorganic nutrients acquired through mineralizing organic substances (
45). Additionally, cells from the epilimnion, where light is available, seemed to have a larger size. Some previous studies showed that aerobic anoxygenic photoheterotrophic bacteria can often increase carbon assimilation in the presence of light, which allows them to have larger cells (
46,
47). Subsequently, the distribution of
Gemmatimonadota in the epilimnion could be dependent upon the distribution and occurrence of diatoms like
Fragilaria sp. and cyanobacteria like
Microcystis sp., as their blooms can influence and alter bacterioplankton communities (
45,
48). Likewise, this dependence is a plausible explanation for the reason why photoheterotrophic members of this phylum are proving to be difficult to culture.
The presented 45 metagenome-assembled genomes of
Gemmatimonadota in this study represent the largest collection of freshwater metagenome-assembled genomes (MAGs) of this phylum so far available. The obtained MAGs further confirmed the limnic nature of
Gemmatimonadota. With recovery of several MAGs that represent the same species based on AAI (with 99 or 100% similarity), we show that
Gemmatimonadota MAGs are reassembled from metagenomes and recovered periodically from different years as well as different seasons. This confirms the strong resilience of their microdiversity in freshwater and cannot be taken as a random occurrence. Furthermore, 39 of the obtained MAGs are different from soil species (
Fig. 1; also see
Fig. S1 and
S4 in the supplemental material) and form distinct phylogenetic groups (with AAI between 45 and 65%) with both photoheterotrophic and heterotrophic representatives (
38). Six MAGs, gained from the hypolimnion of deep lakes, three from Lake Constance (CH-Jul18-bin44, CH-Jul18-bin76, and CH-Jul18-bin112), two from Lake Zurich (ZH-3nov15-207 and ZH-3nov15-212), and one from Lake Thun (TH-Jun18-bin75), (
Fig. 1 and
Fig. S1) are more closely related to genomes from soil and sediments. In the 16S rRNA phylogeny (
Fig. S1), two previously mentioned MAGs (CH-Jul18-bin112 and ZH-3nov15-212) clustered within the BD2-11 terrestrial group based on SILVA SSU v138 database taxonomy (
12). This is consistent with 16S rRNA gene abundance in Swiss lakes where we could episodically observe the occurrence of 16S rRNA sequences related with terrestrial groups, such as
Longimicrobia, BD2-11, and S0134 that has, as yet, been connected only with soil environments (
10). Environmental 16S rRNA gene sequences from freshwater have until now been associated only with the class
Gemmatimonadetes (
10–12), and all other groups were formed with soil and sediment representatives. While the six MAGs could represent new freshwater members of the soil-connected groups, due to the close phylogenomic similarity with soil representatives and not with other freshwater MAGs, it is more probable that they represent genomes recovered due to soil runoff. Nevertheless, the MAGs present in these different phylogenetic groups were assembled from metagenomes gained from different sampling times which shows
Gemmatimonadota were recovered repeatedly from freshwater environments. All this evidence shows that at least 39 of the identified MAGs represent truly limnic and planktonic species that do not come from soils. Furthermore, with the notable genomic diversity of limnic
Gemmatimonadota, we demonstrate the ecological relevance of this group, as different members are clearly able to persist in the water column, occupying different ecological niches as they occur both in the hypolimnion and in the epilimnion. Moreover, the distribution of
Gemmatimonadota in contrasting freshwater lakes showed they are adapted to different types of aquatic environments. The highest diversity in terms of different genus level groups or species of
Gemmatimonadota, especially photoheterotrophic representatives, was observed in Římov Reservoir. Despite this, we cannot associate the higher diversity to mesoeutrophic lakes, as the data set obtained from Římov Reservoir was substantially larger, allowing for an increased chance of recovering higher diversity. A higher trophic status of any lake is connected with higher phytoplankton productivity; therefore, lakes and reservoirs often show variations of microbial communities connected with phytoplankton productivity (
49,
50). Specifically, the connection of bacterial communities with changes in phytoplankton has already been recorded in Římov Reservoir (
51). Furthermore, anoxygenic phototrophic bacteria are often found in close association with algae (
45,
52,
53), and since they seem to follow seasonal phytoplankton blooms in freshwater lakes (
45,
48), it is suggested that they represent an important functional group in freshwater environments (
54).
Phototrophic gene, phylogenomic, and AAI analyses have shown that phototrophy in the
Gemmatimonadota spans multiple genera. The identified phototrophic
Gemmatimonadota represent three different groups but share the same set of phototrophic genes with
G. phototrophica. All identified PGCs share a very similar gene inventory (see
Table S4 in the supplemental material) and organization (
Fig. 3). It seems that the convergent orientation (→ ←) of superoperons
bchFNBHLM and
crtFbchCXYZ-puf is conserved among all the phototrophic
Gemmatimonadota, in contrast to phototrophic
Proteobacteria, where orientation of these superoperons may be divergent, convergent, or colinear (
55). Moreover, the split of the PGC with inserted genes in the type strain
G. phototrophica AP64 (
14) is also present in several MAGs from the epilimnion. Previously, the PGC of
G. phototrophica was compared with two PGCs reconstructed from the Odense wastewater metagenome (OdenseWW) and the Aalborg wastewater metagenome 2 (AalborgWW-2) which did not contain the insert of several hypothetical genes between two operons. The explanation proposed for the difference in having a split PGC or not was that it could represent different evolutionary history of phototrophic
Gemmatimonadota originating from different environments (
21). Apart from the split PGC present in some of the MAGs, we observed that members of phototrophic group 1 have coenzyme F420-reducing hydrogenase (
frhB gene) instead of bacteriochlorophyll synthase 4.5-kDa chain (
bch2), as part of the
bchP2G operon. Coenzyme F420-reducing hydrogenase enzyme seems to have an important role in energy conservation and methanogenesis from CO
2 (
56).
In support of the true photoheterotrophic nature of
Gemmatimonadota, a previous study conducted in Římov Reservoir, found active expression of their
pufM genes (a common molecular marker gene for aerobic anoxygenic phototrophs) (
27). Expression of the photosynthetic apparatus of
Gemmatimonadota showed that they are an active part of bacterial community and do not just passively contain the phototrophic genes (
27).
Recently, several
Gemmatimonadota MAGs that originate from sediments of a soda lake were reported to contain both phototrophic genes and genes related to the large subunit of ribulose 1,5-bisphosphate carboxylase/oxygenase enzyme (
57), suggesting that these soda lake MAGs represent the first photoautotrophic
Gemmatimonadota (
58). In contrast, some of our freshwater
Gemmatimonadota contain phototrophic genes as well as type IV RuBisCO (
Fig. S3), which is considered only a homologue of RuBisCO, since it does not have any carboxylation activity (
59,
60). Type IV RuBisCO genes are present in many microorganisms, including both phototrophic and heterotrophic bacteria and
Archaea and are thought to participate in some other metabolic pathways different from the Calvin-Benson cycle (
61,
62). Therefore, these freshwater
Gemmatimonadota MAGs are not photoautotrophs, rather have a photoheterotrophic metabolism, typical for aerobic anoxygenic photoheterotrophic bacteria. These bacteria do not fix inorganic carbon and need to rely on organic carbon produced by other organisms, so the ability to harvest light is used to supply energy for their mostly organoheterotrophic metabolism (
40,
63,
64).
In conclusion, with MAGs from these contrasting freshwater lakes, we not only reveal the existence of several new phototrophic species that differ phylogenetically from the already cultured and characterized G. phototrophica but also show the considerable diversity of both photoheterotrophic and heterotrophic Gemmatimonadota in freshwater.