INTRODUCTION
Butyrate-producing bacteria are widespread and can be found in many environments (
1) but especially in host-associated sites, including the rumen (
2), the mouth (
3), and the large intestine (
4). Recently, butyrate gained attention, because of its proposed key role in maintaining gut homeostasis and epithelial integrity, since it serves as the main energy source for colonocytes, directly influences host gene expression by inhibiting histone deacetylases, and interferes with proinflammatory signals, such as NF-κB (
5,
6). A breakdown of epithelial integrity is associated with emerging diseases such as inflammatory bowel diseases and type II diabetes (
7,
8), and butyrate-producing members specifically are reduced in such patients (
9,
10).
Butyrate producers form a functional cohort rather than a monophyletic group, and members of
Lachnospiraceae and
Ruminococcaceae have received the most attention because they are very abundant in the human colon, comprising 10 to 20% of the total bacteria. Butyrate is synthesized via pyruvate and acetyl-coenzyme A (CoA), mostly by the breakdown of complex polysaccharides (e.g., starch and xylan) that escape digestion in the upper gastrointestinal tract and reach the colon (
11). Alternative substrates, particularly those derived from cross-feeding with other primary degraders and lactate-synthesizing bacteria, are described as well (
12). Acetyl-CoA is then converted to the intermediate butyryl-CoA in a four-step pathway closely related to the β-oxidation of fatty acids in prokaryotes and eukaryotes (
13,
14). It is postulated that butyrate producers can conserve energy during the conversion from crotonyl-CoA to butyryl-CoA, which creates a proton motive force via ferredoxin reduction by the butyryl-CoA dehydrogenase electron-transferring flavoprotein complex (
15). The final step from butyryl-CoA to butyrate is either catalyzed by butyryl-CoA:acetate CoA transferase (encoded by
but) or butyrate kinase (encoded by
buk; after phosphorylation of butyryl-CoA). Typically, these two genes are used as biomarkers for the identification/detection of butyrate-producing communities (
16,
17). However, direct functional predictions based on gene homology alone can commonly result in misannotations if genes with distinct function share regions of high similarity, as specifically described for both
but and
buk (
17). Furthermore, CoA transferases show activity with several different substrate combinations
in vitro (
18), and alternative terminal CoA transferases were proposed for this pathway (
19). Targeting the whole pathway for functional predictions is hence a robust way to circumvent difficulties associated with the analysis based on specific genes only. Additionally, there are other known butyrate-producing pathways, namely, the lysine, glutarate, and 4-aminobutyrate pathways, where amino acids serve as major substrates. These pathways are found in
Firmicutes as well as other phyla, such as
Fusobacteria and
Bacteroidetes (
20–22), but are traditionally neglected as potential butyrate-producing routes in enteric environments.
The availability of complete databases, including diverse candidates and pathways, is essential to investigate specific microbial functions in complex microbial communities, to assess their effects on the host, and to ultimately develop treatment strategies for functional dysbiosis. The aim of this study was to screen available genomes, many from the Human Microbiome Project (HMP) framework, for potential butyrate producers and to characterize their phylogeny, gene arrangements, and gene phylogeny. The resulting gene catalogue was then used to screen for butyrate synthesis pathways in metagenomic HMP data to reveal this important functional community within the healthy microbiota.
DISCUSSION
The established gene catalogue together with our metagenomic analysis allowed us to reveal microbial butyrate-producing communities in the healthy microbiota and their associated metabolic pathways. This metabolic framework is a critical step in investigating the role of this function in host health and disease. Although targeting complete pathways is a more robust way to predict function than single-gene analysis, their detection in genomes does not automatically imply functionality, since that must be done by specific biochemical testing. For several isolates, such as members of
Peptococcaceae and
Syntrophomonaceae, the detected ability to produce butyrate is doubtful, since they are known rather to oxidize butyrate for growth (see reference
28). This is also true for the majority of the
Proteobacteria shown in
Fig. 2, which belong to the delta class, that use anaerobic respiration for energy conservation, and butyrate consumption is documented for several isolates (e.g., see reference
29). In these taxa, pathway genes are often not in synteny and only distantly related to genes of confirmed butyrate producers (
Fig. 3), and terminal genes are missing in many strains. However, it cannot be excluded that certain environmental conditions, such as the absence of H
2-consuming bacteria or lack of appropriate inorganic electron acceptors, might trigger fermentative growth and the synthesis of butyrate in certain isolates. Furthermore, a few strains are known to generate butyrate as building blocks for secondary metabolites, such as salinosporamide B, produced by the actinobacterium
Salinispora tropica (
30).
Neighbor-joining trees revealed very consistent patterns for all genes of an individual pathway, indicating a high degree of coevolution. Nevertheless, clear HGT signatures were detected in isolates, especially for the acetyl-CoA pathway, confirming earlier findings (
31). However, our results indicate transfer of entire pathways rather than of single genes. The fast microbial turnover and enormous selective pressures in the colonic environment promote large-scale HGT (
32). Since the acetyl-CoA pathway was detected to be the dominant pathway, displaying the greatest diversity, observations of HGT signatures specifically for this pathway make sense. Furthermore, our metagenomic results also did not detect unknown “disconnected HGT” events, i.e., bacteria that acquired genes of the acetyl-CoA pathway from distinct precursors (representing unknown gene combinations). This supports the observed coevolutionary behavior of all genes in this pathway. However, for the lysine pathway, the presence of gene combinations that have not yet been captured in sequenced isolates was indicated.
Diet is a major external force shaping gut communities (
33). Good reviews of studies investigating the influence of diet on butyrate-producing bacteria exist (
11 and
34) and suggest that plant-derived polysaccharides such as starch and xylan, as well as cross-feeding mechanisms with lactate-producing bacteria, are the main factors governing their growth. Our metagenomic analysis supports the acetyl-CoA pathway as the main pathway for butyrate production in healthy individuals (
Fig. 4), implying that a sufficient polysaccharide supply is probably sustaining a well-functioning butyrate-producing community, at least in these North American subjects. However, the detection of additional amino acid-fed pathways, especially the lysine pathway, indicates that proteins could also play an important role in butyrate synthesis and suggests some flexibility of the microbiota to adapt to various nutritional conditions maintaining butyrate synthesis. Whether the prevalence of amino acid-fed pathway is associated with a protein-rich diet still needs to be assessed. It should be noted that those pathways are not restricted to single substrates, as displayed in
Fig. 1, i.e., glutarate and lysine, but additional amino acids, such as aspartate, can be converted to butyrate via those routes as well (
26). Furthermore, the acetyl-CoA pathway also can be supplied with substrates derived from proteins either by cross-feeding with the lysine pathway (as discussed above) or by direct fermentation of amino acids to acetyl-CoA (
35). However, whereas diet-derived proteins are probably important for butyrate synthesis in the ileum, where epithelial cells use butyrate as a main energy source as well (
36), it still needs to be assessed whether enough proteins reach the human colon to serve as a major nutrient source for microorganisms. Another possible colonic protein source could originate with lysed bacterial cells. Enormous viral loads have been detected in this environment, suggesting fast cell/nutrient turnover, which might explain the presence of corresponding pathways in both fecal isolates and metagenomic data (
Fig. 1,
4, and
5). Detailed investigations of butyrate-producing communities in the colon of carnivorous animals will add additional key information on the role of proteins in butyrate production in that environment. It should be noted that diet provides only a part of the energy/carbon sources for microbial growth in the colon, since host-derived mucus glycans serve as an important nutrient source as well. Several butyrate-producing organisms do specifically colonize mucus (
37), and for some, growth on mucus-derived substrates was shown (
38).
Systems biology together with metabolic modeling is a promising approach to handle complexities of nutrient fluxes within the gut microbiota and will eventually help in predicting functional performance (
39). This study provides an important step forward, since it enabled us to assess the butyrate-producing potential of complex microbial communities, including predictions of basic nutritional requirements for butyrate synthesis. However, next to substrate availability, additional factors, such as pH, were demonstrated to be important factors governing the successful competition of butyrate producers with other intestinal organisms (
11). Furthermore, the presence of butyrate-producing pathways alone might not allow optimal predictions of actual butyrate production, since the organisms involved show metabolic flexibility and diverse profiles of fermentation products. Butyrate synthesis was shown to be influenced by several factors, such as type of limiting substrate and growth rate (
40), oxygen concentration (
41), and growth style (attached versus unattached [
42]). Furthermore, both the presence of inorganic electron acceptors promoting anaerobic respiration and aceto-/methanogenesis lowering the H
2 partial pressure can lead to more oxidized fermentation products, especially acetate, at the expense of more reduced substances, such as butyrate (
40). Our metagenomic approach, in combination with additional “-omics”-based technologies, will help to improve functional predictions and to assess the resulting effects on the host.