INTRODUCTION
Freshwater lakes are disproportionally active sites of carbon cycling relative to the surface area that they cover due to strong linkages to the surrounding land, from which they receive inorganic nutrients as well as organic carbon (
1,
2). Of the estimated 1.9 Pg of terrestrial organic carbon that freshwater systems process per year, nearly half is respired by bacteria (
3–5). When soil dissolved organic carbon outgassing is included, net freshwater carbon emissions are of the same order of magnitude as net oceanic uptake (
2). While photochemical mineralization of organic carbon can predominate in lake habitats with high levels of photosynthetically active radiation (
6), bacterial contributions to dissolved organic matter (DOM) mineralization are important as well (
7).
Nevertheless, linkages between the metabolism of organic carbon and specific populations remain limited, particularly in the less-studied hypolimnia of lakes, even for ubiquitous and highly abundant taxa, due to challenges with the isolation of representatives of these taxa (
8). In recent years, the use of culture-independent methods has provided insights into the metabolic potential of some key heterotrophic freshwater lineages, such as LD12, the freshwater clade that is a sister to marine clade SAR11 (
Alphaproteobacteria [
9]), acI (
Actinobacteria [
10,
11]), and
Polynucleobacter (
Betaproteobacteria [
12,
13]). A common feature of these ubiquitous and abundant heterotrophic freshwater taxa is a streamlined genome whose sizes range from 1 to 2.5 Mbp and which contains a variety of genes encoding the ability to take up and metabolize DOM.
We currently have no information regarding the geochemical role of the CL500-11 lineage of the
Chloroflexi. Chloroflexi are rarely observed in freshwater pelagic zones when oxygen is plentiful (
8), but CL500-11-like populations are a notable exception and are emerging as a taxon restricted to lakes that are deep enough to maintain low temperatures (<10°C) in the hypolimnion after stratification (
14). This group was first observed in Crater Lake, OR, where it is abundant throughout the year, contributing up to 50% of all cells in the deep (
15,
16). Since its initial discovery, a similar predominance has been observed in deep lakes around the world, including in Western Europe, East Asia, and, most recently, two of the largest freshwater lakes in the world, Lake Superior and Lake Huron (
14,
17). It would be worthwhile to know more about the role of CL500-11 populations in deep-lake hypolimnia, as (i) a large proportion of all surface freshwater is contained in the hypolimnia of deep lakes, and (ii) CL500-11 populations contribute a large proportion of all bacterial biomass in these habitats through a combination of numerical dominance and large cell size.
In this study, we analyzed the occurrence of a CL500-11-lineage population along a near-shore to offshore transect in Lake Michigan based on sequencing of the V4 region of the 16S rRNA gene as well as catalyzed reporter deposition fluorescent in situ hybridization (CARD-FISH). To understand its success and role in the deep-lake habitat, we reconstructed a nearly complete genomic data set from Lake Michigan metagenomic data and identified functions in the CL500-11 genome that were overrepresented relative to their representation in the genomes of other ubiquitous freshwater heterotrophs with streamlined genomes. To gain further insights into the in situ functioning of this organism, we generated metatranscriptomic data and identified highly expressed functions, as well as differential expression patterns, based on season and location in the water column.
DISCUSSION
While high
Chloroflexi levels are rarely reported in oxygenated freshwater lakes (
8), the CL500-11 lineage of the
Chloroflexi is emerging as a highly abundant taxon in the hypolimnion of deep stratified lakes around the world (
14,
15). Our study adds Lake Michigan to this list of lakes, which means that CL500-11 has now been shown to reach relative abundances of up to 20% in the hypolimnia of all three upper Great Lakes (Lake Superior, Lake Huron, and Lake Michigan [
17]). Analysis of its morphology and comparative genomic and metatranscriptomic analyses reveal adaptations typical of both oligotrophic and copiotrophic lifestyles, in line with their occurrence across a broad productivity gradient (
14).
A recent summary of conditions under which CL500-11-lineage bacteria are found to be abundant indicates that they can thrive under both oligotrophic and mesotrophic conditions, as long as oxygen does not get depleted in the hypolimnion and hypolimnion temperatures remain below 10°C (see Table 1 in reference
14). The peak relative abundance of CL500-11 in Lake Michigan was similar to that in Lake Biwa, where the same CARD-FISH probe used in the present study was used, while it remained well below the 50% reported in Crater Lake (
15,
16). Total bacterial and CL500-11 cell numbers were about 1 order of magnitude lower in Lake Michigan than in Lake Biwa, most likely reflecting the lower levels of DOC and nutrients in Lake Michigan, which is oligotrophic (
Fig. 1); in contrast, Lake Biwa is mesotrophic (
14). CL500-11 remained present at high numbers at the end of the spring mixed period in Lake Michigan, while it became undetected in Lake Biwa (
14), potentially due to differences in the mixing regimes of these two lakes. All current observations of CL500-11 are consistent with its niche being oxygenated cold water of deep, though not necessarily large, lakes. The only findings that argue against cold adaptation were the identification of CL500-11-like sequences near hydrothermal vents in Lake Yellowstone. However, one study indicated multiple lines of evidence of dilution of the vent water samples with surrounding cold (<10°C) water, including the presence of multiple nonthermophilic microorganisms and the presence of significant levels of O
2 in the sample while the vent water was anoxic (
41). The other study observed CL500-11-like sequences only in the coldest (∼16°C) vent sample (
42).
Our metabolic analysis as well as recent findings of amino sugar incorporation by a CL500-11-like population points to a heterotrophic lifestyle (
43). In a previous study of Lake Biwa, it was shown that semilabile DOM produced by phytoplankton in the epilimnion is subsequently biologically remineralized in the hypolimnion (
44). Considering its abundance, CL500-11 may play an important role in this process. Comparisons of the reconstructed CL500-11-LM genome to the genomes of other abundant freshwater heterotrophs with streamlined genomes identified traits that support this function and may help explain its success in the deep-lake hypolimnion habitat. While some of the genomes compared are even more streamlined than the genome of CL500-11-LM (e.g., the LD12 lineage [
9] and the acI lineage [
10,
11]), many similarities and some contrasts can be found. Similarly to other freshwater heterotrophs, CL500-11 dedicates ∼3% of its genome to functions involved with the importation of diverse DOM substrates (
45), but it stands out as having the most diverse substrate spectrum, particularly for di- and oligopeptides. In line with this observation, the overrepresentation of genes encoding metabolic functions to metabolize these peptides was observed. The low number of carboxylic acid transporters relative to the numbers in LD12 and
Polynucleobacter could be a reflection of the preference of CL500-11 for water depths with lower PAR, where there is a lower availability of these compounds, which have been shown to be produced through the interaction between UV and DOC (
46).
In addition to the overrepresentation of genes for transporters, the disproportionate number of genes encoding transporters among highly expressed genes confirmed the importance to CL500-11-LM of transporters, particularly those involved with peptide transport. The prevalence of genes encoding transporters among highly expressed genes contrasts to the findings of a recent metatranscriptomic survey of a mesotrophic reservoir that showed few genes encoding transporters among the 5% most highly expressed genes (
47). However, our results are in line with community-level transcript measurements in coastal ocean systems, where up to 13% of all sequences assigned to COGs were involved in transport, and about half of those were involved in DOM transport (
48). In the most oligotrophic areas of the open ocean, proteomics experiments have indicated that two-thirds of all protein originated from transport systems in SAR11 (
49). Strikingly, the apparent importance of the CL500-11 lineage in organic nitrogen transformation matches past findings that in the hypolimnion of Lake Biwa, during the mineralization of DOM, N-rich compounds are preferentially converted (
50).
Carboxidovory and, likely, methylovory in CL500-11-LM allow additional energy generation through the oxidation of CO and methyl groups, respectively. Methylovory has been demonstrated for “
Candidatus Pelagibacter ubique” (
51), and we identified homologous genes in the SAR11 freshwater sister clade LD12. Carboxidovory in SAR11 has been suggested to be one mechanism by which it gains an edge in oligotrophic environments, such as the Sargasso Sea. Sargasso Sea DOC concentrations are 2 to 3 times lower than those in deep offshore Lake Michigan water, while Lake Michigan soluble reactive phosphorus levels are lower (
52,
53). A lack of data precludes a comparison of inorganic nitrogen levels, though freshwater systems are typically more P limited than N limited (
54). Evidence of methylovory (i.e., carbon monoxide dehydrogenase genes), on the other hand, was not identified in any of the other freshwater heterotrophs with streamlined genomes or SAR11. The gene for a carbon monoxide dehydrogenase is present in copiotrophic and often abundant freshwater
Limnohabitans species (
55) and has been observed to be highly expressed in a mesotrophic reservoir (
47). The one-carbon oxidation pathways may be beneficial in both oligotrophic and more productive systems, where CL500-11 can be similarly abundant (
14).
In addition to the large cell size, the overrepresentation of cell envelope biosynthesis genes and the energetically costly investment in motility are features that characterize a bacterial copiotrophic lifestyle (
56). However, the relatively small genome size, the limited investment in transcriptional regulation (3.5% of CL500-11 genes are protein-coding genes) and signal transduction (1.5%), and the small proportion of secreted (3.2%) or membrane-bound (19.4%) proteins are consistent with adaptation to an oligotrophic lifestyle for CL500-11 (
56). Genes for regulatory mechanisms are particularly reduced in number, even in comparison to the number of such genes in the genomes of other freshwater organisms with streamlined genomes. Relative to
A. thermophila UNI-1, CL500-11-LM had 60% and 40% fewer genes involved in signal transduction and transcription, respectively, whereas the predicted decrease is 20% fewer genes in both categories on the basis of the difference in the sizes of the genomes alone (3,166 and 2,153 protein-coding genes in
A. thermophila UNI-1 and CL500-11-LM, respectively) (
57). Overall, the limited conservation in genome content and sequence composition between CL500-11-LM and the filamentous thermophile
A. thermophila (
35) are in line with the large differences in their morphology and ecology.
The cell envelope provides the interface that helps determine the outcome of interactions with viruses and predatory grazers, primarily nanozooplankton (
58). Escape from grazers due to specific cell surface structures (S layer) has been shown for the acI lineage (
59). Multiple envelope structures, including S layers, have been inferred from genomic data for
Chloroflexi lineages (
60). Although the composition of the cell envelope can contribute to multiple traits, the overrepresentation of cell envelope biosynthesis genes in CL500-11-LM relative to their representation in all other genomes may underpin the surface structures that allow it to escape from grazing predators, allowing it to maintain high relative abundance levels. Independently of the cell envelope composition, increased grazing resistance may be conferred by the large cell size and curved morphology of CL500-11-LM (
61), as well as by the large proportion of cell pairs. It is notable that in filter-feeding experiments using water and invasive quagga mussel samples from Lake Michigan, the relative abundance of CL500-11-LM increased after 3.5 h of filter feeding (V. J. Denef, unpublished results).
The occurrence of CL500-11-LM in the surface waters in spring corresponded with increased cell size, which could explain its increased occurrence in the 3- to 20-μm fraction in spring. The larger cells and potentially increased abundance of cell pairs in the spring, which has previously been suggested to be indicative of cell division (
14), could indicate more favorable conditions and a higher growth rate in spring (
62). However, the metatranscriptomic data did not indicate increases in the numbers of genes for core functions, such as transcription, DNA replication, and cell division, but instead indicated exposure to stress in the surface waters. In particular, multiple genes indicated the presence of increased oxidative stress in surface water relative to that in deep water (the presence of the
suf operon [
63], the thioredoxin gene,
ahpC, genes for several proteases involved in damaged protein turnover [
64]), which is likely, considering the increased photosynthetically active radiation at the surface (
65). Oxidative stress response mechanisms are commonly found in the genomes of other freshwater heterotrophs with streamlined genomes as well (
10,
12), though different protein families appear to be involved, according to the genes found in CL500-11 and other freshwater organism genomes.
In light of the observed stress response, the higher level of expression of the gene for proteorhodopsin in the surface water, which is expected on the basis of the higher PAR, suggests that CL500-11 uses proteorhodopsin as a means of surviving under suboptimal conditions rather than as a means for increasing cell growth (
66). However, the upregulation of genes for electron chain components, glycolysis, and TCA cycle enzymes is similar to the response seen during proteorhodopsin-induced growth in a marine flavobacterium (
67). Considering its marginal classification as part of the CL500-11-LM bin, further verification of the presence of the gene for proteorhodopsin in the genome of CL500-11 and its ecological role will be required.
CL500-11-like populations predominate in the hypolimnia of at least three of the five Laurentian Great Lakes, which contain ∼20% of the world's surface freshwater. In addition, they have been identified in multiple deep and often large lakes around the world and can likely be found in most deep lakes that maintain a cold and oxygenated hypolimnion. Their numerical abundance, large cell size, and low within-clade sequence variation (
14) mean that populations with traits similar to those of CL500-11-LM likely contribute a significant proportion of the world's freshwater bacterial biomass. Our analyses suggest that they play an important role in the transformation of biologically derived organic matter, particularly nitrogen-rich DOM.