Steroid transport.
Several TonB-dependent receptor proteins (COG category “inorganic ion transport and metabolism”) were among the membrane proteins with the highest increases in abundance, including Nov2c232 (gene cluster 2), Nov2c378 (near gene cluster 3), and Nov2c659 (near gene cluster 4). TonB-dependent outer membrane transporter proteins (called TonB-dependent receptor protein) require the accessory proteins TonB, ExbB, and ExbD in the inner membrane to form a functional TonB system (
32). These accessory proteins were identified in the membrane protein-enriched fractions of cells grown on all substrates with similar Mascot scores, indicating constitutive formation (Nov1c1853 to 1856). Transporters for the uptake of bile salts are not known in
Proteobacteria, but there are indications that TonB-dependent receptors could be involved. First, they are generally known to participate in the import of complex growth substrates, such as lignin degradation compounds (
33). Second, in
Novosphingobium tardaugens NBRC16725, which was recently renamed
Caenibius tardaugens NBRC16725 (
34), a TonB-dependent receptor was upregulated in estradiol-grown cells, and a corresponding deletion mutant showed reduced growth with estradiol (
35).
The TonB-dependent receptor Nov2c232 (28% identity to the TonB-dependent receptor that was deleted in
C. tardaugens NBRC16725) is encoded in the putative side chain degradation cluster in close vicinity to a gene encoding a transporter of the major facilitator superfamily (MFS) (Nov2c225), and both were more abundant in cholate- and deoxycholate-grown cells (
Fig. 4 and Table S1). These could be involved in transport of bile salts as well as early degradation intermediates such as Δ
4-3-ketocholate (II in
Fig. 1) or HOCDA (VII in
Fig. 1), which are found in strain Chol11 culture supernatants (
10,
29). Such transient extracellular accumulation of intermediates (
10) is common for bile salt-degrading
Proteobacteria and has been observed during bile salt degradation in soil (
36). Thus, the above transporters could alternatively be involved in intermediate efflux.
Phase 1: A-ring oxidation and B-ring dehydration (blue section in Fig. 3). (i) 3α-Hydroxysteroid dehydrogenase. The first step of bile salt degradation is the oxidation of 3-OH to a keto group by a 3α-hydroxy steroid dehydrogenase (3α-HSD) (
28,
37). A putative 3α-HSD (Nov2c6) is encoded in gene cluster 1 (
Fig. 4 and
5). This protein was present in lower abundances in glucose- versus steroid-grown cells (
Fig. 4). Additional putative 3α-HSDs, Nov2c397 and Nov2c683, are encoded close to gene clusters 3 and 4, respectively, and were not differentially expressed. This, as well as Nov2c6 being present in low abundances in glucose-adapted cells, is consistent with biotransformation experiments showing that oxidation of the 3-OH is constitutive in strain Chol11.
(ii) 5β-Δ4-3-Ketosteroid dehydrogenase. The next step is the introduction of a double bond in the A-ring by 5β-Δ
4-3-ketosteroid dehydrogenase (5β-Δ
4-KSTD, named 5β-Δ
4-KSTD1) (
29). 5β-Δ
4-KSTD1 is encoded in cluster 1 (Nov2c19) and is much more abundant in steroid- versus glucose-grown cells (>5.5-fold,
Fig. 4).
(iii) 7α-Hydroxysteroid dehydratase. The key enzyme of the Δ
4,6 pathway in strain Chol11, the 7α-hydroxysteroid dehydratase Hsh2 (Nov2c400), introduces a double bond in the B-ring by elimination of water (
28). The
hsh2 gene is located in close proximity to cluster 3. Hsh2 was shown previously to be active in glucose-grown cells (
29), in agreement with its similar abundance in all cells (
Fig. 4).
(iv) Δ1-3-Ketosteroid dehydrogenase. In the Δ
1,4 variant bile salt degradation pathway, the next step is the introduction of a second double bond in the A-ring at C1 by a Δ
1-3-ketosteroid dehydrogenase (Δ
1-KSTD), which is a structural prerequisite for subsequent cleavage of the B-ring. The formation of Δ
1,4,6 intermediates such as HATD (VIII in
Fig. 1) during growth with bile salts shows that this reaction also occurs in strain Chol11 (
10). Putative Δ
1-KSTDs (
38,
39) are Nov2c82 (encoded between gene clusters 1 and 2, not detected in any condition) and Nov2c695 (encoded close to gene cluster 4, about 2- to 3-fold increased abundance in steroid-grown cells) (
Fig. 4). This expression pattern is in line with the whole-cell biotransformation assays for testing inducibility of bile salt degradation in which no Δ
1-KSTD activity could be observed in glucose-adapted cells (Fig. S1). As no Δ
1,4,6 intermediates with a side chain were reported for strain Chol11 during growth on bile salts (
10,
28), this step might occur after side chain degradation in strain Chol11.
(v) Oxidation of the 12-OH-group. The previously observed formation of the 3,12-dioxo-chol-4,6-dienoate (DOCDA, XIII in
Fig. 3) (
10,
28) suggests the involvement of a 12α-dehydrogenase in the degradation of 12-hydroxy bile acids in strain Chol11. A similar reaction is catalyzed in
C. testosteroni by SteA, which is active on 12α-hydroxy steroids without a side chain (
25). In
C. testosteroni, the resulting 12-oxo-steroids are then reduced to the corresponding 12β-steroids by SteB (
Fig. 5B) before cleavage of the A-ring. The presence of a 12β-OH in the degradation intermediate HATD (VIII in
Fig. 1) indicates that the reduction to a 12β-OH also takes place in strain Chol11. Homologs to SteA and SteB are encoded in cluster 1 (Nov2c15 and Nov2c16, respectively) and were detected with higher Mascot scores in steroid-grown cells (
Fig. 4).
Phase 2: Side chain degradation (green section in Fig. 3). (i) CoA activation of the side chain. The steroid-C
24-CoA ligase SclA catalyzing the initial step of side chain degradation in strain Chol11 was previously described (
30) and is encoded in the putative side chain degradation gene cluster 2 (
Fig. 6A). SclA abundance, relative to that in glucose-grown cells, was 3.3-fold higher in cholate- and deoxycholate-grown cells and 1.3-fold higher in 12β-DHADD-grown cells (
Fig. 4).
(ii) Desaturation of the side chain. Previous enzyme assays with cell extract of strain Chol11 indicated that a double bond is introduced into the CoA activated side chain (
30). During the degradation of steroids, this dehydrogenation is catalyzed by α
2β
2-heterotetrameric acyl-CoA dehydrogenases (ACADs) (
40,
41) or by homodimeric ACADs in which the two subunits are fused (
42). Two predicted ACAD proteins in gene cluster 2 (Nov2c221 and Nov2c222) had 4- to 5-fold increased abundances in cholate- and deoxycholate-grown cells but not in 12β-DHADD-grown cells (
Fig. 4). This suggests that they comprise an ACAD involved in side chain degradation of bile salts, but the location of any double bond formed by this ACAD is unknown.
(iii) Further side chain degradation. The next step in bile salt side chain degradation is typically the addition of water to the double bond (
Fig. 6B). The enzymes catalyzing this hydration belong to the MaoC family, containing a hotdog fold domain, or the crotonase family (
19,
43–45). These hydratases consist either of a single protein, such as the C
5 side chain hydratases of
P. stutzeri Chol1 and
Mycobacterium tuberculosis (
19,
45), or of two subunits, such as the C
3 side chain hydratase of
M. tuberculosis (
43). Two proteins from the thioesterase superfamily with hotdog fold domains are encoded in cluster 2 (Nov2c219 and Nov2c220) adjacent to the ACAD genes. Nov2c219 was detected in only cholate- and deoxycholate-grown cells, and Nov2c220 was detected only in cholate-grown cells (
Fig. 4). However, these proteins show only very low similarity (less than 20% identity) to known side chain hydratases, which suggests a different function for these proteins.
Homologs to other known side chain degradation enzymes, such as thiolases and aldolases, are not encoded in cluster 2 (
30). In
P. stutzeri Chol1 and
R. jostii RHA1, the first cycle of side chain degradation leads to the release of acetyl-CoA and a shortened C
3 side chain (
17,
19). For the degradation of the C
3 side chain, a second cycle of aldolytic cleavage with similar steps, including introduction of a double bond, addition of water, and aldolytic cleavage, is necessary and encoded in the genomes of both organisms (
13,
16,
19,
23,
46,
47).
Our proteome and bioinformatic analyses of strain Chol11 revealed no enzymes that could potentially be involved in side chain cleavage or degradation of the C
3 side chain. This suggests that side chain degradation in strain Chol11 is a mechanism other than aldolytic or thiolytic cleavage. A so-far unknown alternative mechanism might involve other proteins encoded in cluster 2, including putative hydroxysteroid dehydrogenases, amidases, and a Rieske monooxygenase (
Fig. 6).
Phase 3: B-ring cleavage by the monooxygenase KshAB (yellow section in Fig. 3). The first step in the degradation of the steroid nucleus is the cleavage of the B-ring by the KshAB monooxygenase system (
9,
48). Five homologs of the oxygenase component KshA are encoded on chromosome 2 (Nov2c66, Nov2c228, Nov2c407, Nov2c430, and Nov2c440 with 27% to 32% identity to KshA of
P. stutzeri Chol1) (
Fig. 4). The numerous B-ring-cleaving KshA homologs in the genome of strain Chol11 strongly suggest that steroid nucleus degradation starts with 9,10-
seco cleavage, although the resulting 9,10
-seco-steroids have so far never been detected in cell-free supernatants of strain Chol11 cultures. Similar multiplicity of KshA homologs is also known from steroid-degrading
Rhodococci (
49,
50). Nov2c228, Nov2c407, Nov2c430, and Nov2c440 had increased abundances in steroid-grown cells and therefore are candidates for this reaction. However, Nov2c228 has the lowest similarity to KshA
Chol1, its encoding gene is localized in the side chain degradation gene cluster, and the abundance of Nov2c228 was increased only in bile salt-grown cells but not in 12β-DHADD-grown cells. Thus, a different role for this enzyme appears feasible. In this context, the similarities of the KshA oxygenases and Neverland oxygenases (22 to 24% identity of Nov2c228 and Nov2c407 to the Neverland oxygenases Nvd from
Drosophila melanogaster,
Caenorhabditis elegans,
Bombyx mori,
Xenopus laevis, and
Danio rerio), which are involved in the production of ecdysteroids in arthropods (
51,
52), could indicate a wider function for Rieske monooxygenases in steroid metabolism.
Interestingly, there are no distinct homologs of the reductase component KshB encoded in the genome of strain Chol11. This was also reported for
C. tardaugens NBRC16725 (
53). A flavodoxin reductase Novbp123 with 29% identity to KshB of
P. stutzeri Chol1 is encoded on plasmid pSb of strain Chol11 (
31). This enzyme was detected in the membrane protein-enriched fractions of all tested cells with similar Mascot scores, indicating that its synthesis is not regulated in response to steroid degradation. Novbp123 is encoded in a gene cluster together with a ferredoxin, a cytochrome c, and several exported and membrane proteins, suggesting that it is involved in membrane electron transport rather than bile salt degradation specifically (Fig. S3). This points at a different, KshB-independent electron shuttling mechanism for the KshA homologs in strain Chol11.
Phase 4: Complete degradation of the 9,10-seco intermediates (orange section in Fig. 3). Most proteins required for the degradation of the 9,10-
seco degradation intermediates, derived from both cholate and deoxycholate, and the respective HIP intermediates (
27) are encoded in gene cluster 3 (
Fig. 7A). All of these proteins are increased in abundance at least 1.5-fold in steroid-grown cells (
Fig. 4). This confirms that degradation of the steroid nucleus proceeds via the 9,10-
seco pathway. Cluster 3 is very similar to the cluster encoding testosterone degradation in
C. tardaugens NBRC16725 (
53). In both organisms, genes encoding homologs for the reductase component TesA1 of the 9,10-
seco-steroid monooxygenase, TesA1A2, are missing, which could be a further hint at a different electron shuttling mechanism. However, in cluster 3 of strain Chol11, a flavin reductase (Nov2c347) is encoded near the gene for the oxygenase component, TesA2 (Nov2c349), indicating that Nov2c347 could serve as a TesA1 substitute. Regarding HIP degradation, a homolog for the gene encoding HIP-CoA ligase ScdA (
54) is missing in strain Chol11, but the CoA transferase Nov2c359 could have this function.
(i) Fate of the 12-OH. In
P. stutzeri Chol1, the 12β-OH is removed during C- and D-ring degradation by the elimination of water catalyzed by Hsh1 and subsequent reduction of the resulting double bond by Sor1 (
23) (
Fig. 5C). In strain Chol11, homologs to Hsh1 and Sor1 are encoded in cluster 1 near
steA and
steB (
nov2c12 and
nov2c13, respectively;
Fig. 5A) and were found in increased abundances in steroid-grown cells (
Fig. 4). A gene cluster with the same order of genes for the 12-OH transforming enzymes SteA, SteB, Hsh1, and Sor1 is present in
C. tardaugens NBRC16725, and a similar cluster is present in
C. testosteroni CNB-2,
P. stutzeri Chol1, and
Azoarcus sp. strain Aa7 (
Fig. 5A), implicating a general role of these enzymes in bile salt degradation.
(ii) Channeling of 7-hydroxy and 7-deoxy bile salts into C- and D-ring degradation. During degradation of HIP intermediates (such as VI, XV, and XXII in
Fig. 7), the remainder of the B-ring is degraded via β-oxidation, which requires a hydroxy group at the former C
7 (
23) (
Fig. 7B). This hydroxy group is present in 7-hydroxy bile salts such as cholate, but during the degradation of 7-deoxy bile salts, such as deoxycholate, it has to be introduced into the propanoate side chain of the respective HIP intermediates. This is initiated by the introduction of a double bond by the heteromeric ACAD ScdC1C2 followed by the addition of water by the hydratase ScdD (
23,
55,
56).
In Δ
4,6 intermediates, the hydroxy group at C
7 is eliminated. Thus, bile salt degradation by the Δ
4,6 pathway variant results in HIP-like intermediates with a double bond in the propanoate side chain attached to ring C (XV in
Fig. 1). This is the same intermediate as that found during degradation of 7-deoxy bile salts after introduction of a double bond (
23), and the needed hydroxy group could be added by the hydratase, Nov2c364, which was detected in all steroid-grown cells in similar abundance (
Fig. 4). Although the ACAD reaction catalyzed by ScdC1C2 is not needed for the degradation of cholate via the Δ
4,6 variant, both subunits, Nov2c367 and Nov2c361, were found in all steroid-grown cells with similar Mascot scores (
Fig. 4). Thus, it is possible that the Δ
6 double bond had meanwhile been reduced, necessitating the ACAD.
As the hydroxy group eliminated by Hsh2 must again be added at the stage of HIP intermediates, the benefit of the elimination remains unclear. It might be related to the fact that many intermediates of bile salt degradation are excreted in significant amounts during growth of bile salt-degrading bacteria not only in laboratory cultures but also in soil samples (
10,
36). Other bile salt-degrading strains, such as
P. stutzeri Chol1, that exclusively use the Δ
1,4 degradation pathway are unable to utilize Δ
4,6 compounds as growth substrates (
10,
28). To this end, the dehydration might provide a way for strain Chol11 to secure the individual availability to these important carbon sources in natural habitats.
Widespread distribution of the Δ4,6 variant within the family Sphingomonadaceae.
(i) Prediction of the Δ4,6 variant pathway in steroid-degrading bacteria. Apart from strain Chol11, steroid degradation has been reported in other sphingomonads such as
C. tardaugens NBRC16725 (
53,
57) and
Sphingomonas sp. strain KC8 (
58). In addition,
Sphingomonas sp. strain Chol10 was also isolated as an HOCDA-degrading bacterium (
10).
To investigate the prevalence of the Δ
4,6 variant pathway within the family
Sphingomonadaceae, we searched all 398 complete and draft genomes from the genera
Sphingobium,
Novosphingobium, and
Sphingomonas available from the NCBI RefSeq database for the simultaneous presence of key steroid-degradation proteins and Hsh2, the key protein of the Δ
4,6 pathway. First, steroid-degrading bacteria were predicted using 23 hidden Markov models (HMMs) representing 10 key proteins of canonical steroid nucleus degradation (
59). Based on the presence of 7 out of 10 key proteins, including KshA and TesB, 53 genomes were predicted to encode steroid degradation. Second, Hsh2 orthologs were determined in these genomes by BLASTp analyses using Hsh2 of
Sphingobium sp. strain Chol11 (
28) as query. Thirty-nine genomes containing both the steroid-degradation genes and
hsh2 were found. To further confirm the prediction of these proteins being involved in steroid degradation in these organisms, a reciprocal BLASTp analysis was performed using steroid-degradation proteins from
P. stutzeri Chol1 and Hsh2
Chol11 as queries (
Fig. 8). Thirty-eight genomes were confirmed to encode Hsh2 orthologs, and all of them contained orthologs for the majority of key proteins for steroid nucleus degradation. However, only a small subset of side chain degradation proteins is encoded in most of the
Sphingobium and
Novosphingobium genomes. This was in contrast to the
Sphingomonas genomes where orthologs of the genes encoding CoA ligase, heteromeric ACAD, heteromeric hydratase, and aldolase for the degradation of the C
3 side chain from
P. stutzeri Chol1 are present.
This suggests that bile salt degradation via Δ4,6 intermediates is widely distributed among members of Sphingomonas, Sphingobium, and Novosphingobium, while the distinct side chain degradation mechanism proposed for strain Chol11 may occur in most members of Sphingobium and Novosphingobium.
(ii) Bile salt degradation in strains predicted to use the Δ4,6 variant. To investigate whether strains predicted to degrade bile salts via the Δ
4,6 pathway variant did so, a selection of type strains with complete genome sequences was analyzed for bile salt degradation.
Novosphingobium aromaticivorans F199,
Sphingobium herbicidovorans MH, and
C. tardaugens NBRC16725 were tested for the degradation of the 7α-hydroxy bile salts, cholate and chenodeoxycholate, and the 7-deoxy bile salt, deoxycholate (structures in
Fig. 9). Accumulation of Δ
4,6 intermediates was also monitored.
Strains MH, NBRC16725, and F199 grew on all tested bile salts (
Fig. 9B, D, and
F). While all bile salts were completely degraded, all three strains transiently accumulated the characteristic Δ
4,6 intermediates HOCDA (VII in
Fig. 1) and DOCDA (XIII in
Fig. 3) (
Fig. 9C, E, G, and
I). In addition, during degradation of the 7α-hydroxy bile salts, strain F199 formed the intermediates XXIV and XXV that also have a Δ
4,6 structure and lack a side chain according to their UV and mass spectra, respectively (
Fig. 9I). This supports our prediction that the Δ
4,6 variant is the predominant pathway for bile acid degradation in
Sphingobium and
Novosphingobium strains.
Like strain Chol11, strains NBRC16725, MH, and F119 were able to fully metabolize side chain-bearing bile salts, despite not encoding all side chain catabolic enzymes that are known from other bile salt-degrading bacteria. A BLASTp analysis revealed the presence of gene clusters with high similarity to gene cluster 2 of strain Chol11 in all three strains (
Fig. 6A), although the clusters from the
Novosphingobium strains, NBRC16725 and F119, were notably missing several genes. In particular, homologs of the thioesterase superfamily proteins Nov2c219 and Nov2c220, the only candidates for side chain hydratases in strain Chol11, were absent in the genome of NBRC16725. Homologs of SclA, the putative ACAD Nov2c221/Nov2c222, the Rieske monooxygenase Nov2c228, and the putative amidases Nov2c227 and Nov2c229 were found in the predicted side chain degradation clusters of all three strains.