Native C. phytofermentans inhibitor resistance.
We measured the growth of
C. phytofermentans in various concentrations of 12 lignocellulosic inhibitors (see Table S1 in the supplemental material) to gain a general understanding of the relative effects of aliphatic acids, furans, and phenolics (
Fig. 1; see also Fig. S1 in the supplemental material). Both aliphatic acids reduce growth; acetate (
Fig. 1A) was less toxic than formate (
Fig. 1B) on a mass per volume basis, but both acids had similar effects in terms of molarity (Fig. S1). At low furan concentrations, we observed normal growth rates after an extended lag phase (
Fig. 1C and
D), similar to other bacteria that reduce and detoxify furans (
15,
16). Growth lags are proposed to be due to alcohol dehydrogenase (ADH) reducing furan, causing NADH depletion and acetaldehyde accumulation (
17). Additionally, the ADH protein Cphy1179 shares 28% amino acid identity with a furfural-reducing, Zn-dependent ADH (
18). However, if
C. phytofermentans detoxifies furans, this mechanism is abruptly overwhelmed at concentrations above 2 g liter
−1 5-HMF and 1 g liter
−1 furfural.
We examined the toxicities of two types of phenolic acids: hydrocinnamic acids (
p-coumarate and ferulate) and hydroxybenzoic acids (vanillate and 4-hydroxybenzoic acid). We found that hydrocinnamic acids (
Fig. 1E and
F) are more toxic than hydroxybenzoic acids (
Fig. 1G and
H), which supports that the propionic group on the benzene ring in hydrocinnamic acids enhances toxicity, likely by affecting how the molecules partition into the membrane. Moreover, we found that phenolic acids are typically less toxic than the corresponding aldehydes (
Fig. 1I to
K) and catechol (
Fig. 1L). For example, vanillate (
Fig. 1G) is much less toxic than vanillin (
Fig. 1I), and 4-hydroxybenzoic acid (
Fig. 1H) is similarly less toxic than benzaldehyde (
Fig. 1J). The enhanced toxicity of aldehydes is likely due to their reactivity, resulting in formation of adducts with nucleophilic sites on DNA, proteins, and other macromolecules (
19).
Genome-wide mRNA expression during ferulate stress.
We quantified genome-wide mRNA expression changes at two time (
t) points (
t = 0.5 h and
t = 4 h) following supplementation of mid-log cultures with 2 g liter
−1 ferulate, which reduced growth (
Fig. 2A) similar to that of the initial growth screen (
Fig. 1F). Three to five million read pairs were aligned to the genome for each culture (see Table S3A in the supplemental material) to calculate gene expression levels (Table S3B). The number of differentially expressed genes (Table S3C to E) increased from 0 genes before ferulate addition (
Fig. 2B) to 78 genes after 30 min (
Fig. 2C) and then declined to 47 genes after 4 h (
Fig. 2D). The most abundant functional categories of differentially expressed genes at a
t of 0.5 h relate to the repression of energy production, coenzyme metabolism, and lipids (
Fig. 2E). The coenzyme-associated genes enable siroheme biosynthesis, which is repressed in clostridia in response to redox stress (
20). Lipid genes include the
fab gene cluster (
cphy0516-cphy0523) for fatty acid biosynthesis, which was strongly repressed at a
t of 0.5 h (
Fig. 2E) and recovered by a
t of 4 h. While cultures continued active growth after sampling, many of the differences between cultures with or without ferulate at a
t of 4 h indicate that the treatment without ferulate had depleted nutrients in the medium, triggering expression of genes to assimilate alternative carbohydrates (
Fig. 2F).
Gene expression at a
t of 0.5 h shows that abrupt ferulate stress induces expression of genes encoding the efflux pump
cphy1055-cphy1056, which is similar to
E. coli mdlAB conferring resistance to organic solvents (
21). Many of the genes upregulated at a
t of 0.5 h are colocated in two genomic regions. The first region encodes
tad (tight adherence)
cphy0029-cphy0040 genes for Flp-type type IV pilus assembly. Type IV pili are widespread in clostridia (
22) for adhesion to solid substrates to form protective biofilms (
23), reflecting how ferulate represses motility genes in
Clostridium beijerinckii (
24). The other cluster
cphy1838-cphy1845 includes genes for the flavin mononucleotide (FMN)-binding protein WrbA (
25) and two NADPH:FMN reductases. NADPH:FMN reductase inactivation confers ferulate resistance in
C. beijerinckii by an unknown mechanism (
26). While this appears to be in opposition to our data showing that NADPH:FMN reductases are upregulated by ferulate, both results support the importance of FMN-mediated oxidoreduction in ferulate resistance. This island also includes genes encoding an acetyltransferase and Cphy1845 that shares 41% amino acid identity and metal coordination with
E. coli YhhW, which cleaves the plant phenolic quercetin (
27).
C. phytofermentans may thus upregulate genes to transform or detoxify plant phenolics, similar to some ruminal clostridia (
28).
Selection and physiology of ferulate-resistant strains.
We selected
C. phytofermentans strains with increased ferulate resistance by cultivation in a GM3 automat, a dual-chamber, continuous-culture device that automates delivery of fresh medium and transfers the evolving cell suspension between twin growth chambers to prevent biofilm formation. During acclimation to increased ferulate in medium-swap mode, cell densities oscillated for 2 to 5 days because high densities triggered pulses of stressing medium (high ferulate) that reduced culture density, which in turn resulted in delivery of relaxing medium (low ferulate) that enabled recovery (
Fig. 3A). As such, the ferulate-based selection in medium-swap mode is modulated by the ratio of relaxing and stressing medium. Once cell densities stabilized in the stressing medium, the growth rate at the higher ferulate concentration was improved in turbidostat mode (
Fig. 3B). We initiated the growth selection with a stressing medium containing 1 g liter
−1 ferulate, the highest concentration at which we could establish a stable wild-type (WT) culture in the GM3. After 93 days (∼500 generations) of continuous, log-phase growth selection with incrementally higher ferulate, the culture grew with the same 3.75-h generation time in the 3 g liter
−1 ferulate medium as that of the WT in the absence of ferulate (
Fig. 3C). Clones isolated along the growth selection are progressively more ferulate resistant in batch culture (
Fig. 3D to
G); while no growth was observed above 2 g liter
−1 ferulate in the WT strain (
Fig. 3D), CFY3 clones grow robustly at the ferulate solubility limit (6 g liter
−1) (
Fig. 3G). We assessed the ferulate resistance of 2 clones from each of the CFY1 (CFY1A and CFY1B) and CFY2 (CFY2C and CFY2D) time points and 4 clones from the CFY3 time point (CFY3E to CFY3H). The duplicate CFY1 and CFY2 clones showed similar ferulate resistance, but CFY3H is much less ferulate resistant than the 3 other clones (see Fig. S2 in the supplemental material), showing that cells in the GM3 culture are heterogeneous with respect to ferulate resistance.
We examined whether selection for ferulate resistance in glucose medium resulted in physiological changes impacting cellulose fermentation and resistance to other inhibitors. CFY3 strains degrade cellulose similar to that in the WT (
Fig. 4A) and show accelerated cellulose degradation in medium supplemented with ferulate (
Fig. 4B), supporting that the evolved strains are potentially improved candidates for fermentation of lignocellulose. Moreover, the evolved resistance mechanisms extend to other biomass inhibitors, as CFY3 strains are also more resistant to vanillate and acetate (see Fig. S3 in the supplemental material), albeit with considerable variability between strains. We also used mass spectrometry to investigate if ferulate was consumed or transformed in WT and CFY3 cultures, revealing that the ferulate concentration was unaltered with no products corresponding to reduced, demethoxylated, or decarboxylated ferulate (see Fig. S4 in the supplemental material). Thus, even though
C. phytofermentans upregulates potential phenol-degrading enzymes in response to ferulate, the cell adapted to ferulate by reinforcing the cell or excluding this molecule rather than detoxifying it.
As the toxicity of aromatic molecules is often associated with disruption of the cell membrane, we profiled fatty acids (FAs) to determine if ferulate resistance is associated with altered membrane phospholipids (see Table S4 in the supplemental material). We found that when WT was exposed to ferulate, the plasmalogen (vinyl ether phospholipid) content in the membranes increased 18-fold. Moreover, CFY strains retained elevated plasmalogens even in the absence of ferulate (
Fig. 4C). In particular, the CFY1B plasmalogen content in the medium without ferulate was 185-fold higher than that of the WT. Related clostridia similarly increase plasmalogens in response to aliphatic alcohol stress (
29,
30), likely to fine tune membrane fluidity and protect from redox-mediated damage (
31). The distribution of FA chain lengths in WT cells (
Fig. 4D) is similar to that of other clostridia but with fewer unsaturated FAs and more cyclopropanes (
32), both of which reduce membrane fluidity to protect from solvent stress (
33). While the addition of ferulate had little immediate effect on the FA chains of WT cells (see Fig. S5A and B in the supplemental material), the CFY strains showed altered FAs relative to those of the WT in the absence of ferulate (
Fig. 4D). The CFY1B FA profile was the most perturbed with increased hydroxylated C
16 and unsaturated fatty acids, largely C
18:1, which is associated with increased ethanol tolerance in
E. coli (
34). CFY3F shifted to branched FA (especially C
15) and longer chain lengths (C
18, C
20), which increase membrane rigidity (
10) to potentially combat the membrane-fluidizing effects of ferulate.
C. phytofermentans fatty acids are decorated with a diversity of phospho, glyco, and amino head groups (Fig. S5C). While we did not detect changes in these head groups in the WT response to ferulate or in the CFY strains, we consider it likely that they participate in the response to solvents, similar to some other bacteria (
10).
Genomes of ferulate-resistant isolates.
We sequenced the genomes of eight CFY1 to CFY3 clones, giving between 106- and 705-fold coverage (see Table S5A in the supplemental material) to identify DNA variants relative to the wild type (Table S5B). Seven single-nucleotide variations (SNVs) and short insertions/deletions (indels) are present in all of the CFY genomes (
Fig. 5A), which likely fixed in the population during an early selective sweep. These variants caused nonsynonymous changes in 5 proteins, including a homolog of Cap5F (Cphy3503), a protein for biosynthesis of capsular polysaccharides (
35) that is associated with biofilm formation (
36) and stress resistance (
37). Strains subsequently accrued strain-specific mutations consistent with the population exploring alternative mutational pathways to improve ferulate resistance, particularly by modifying sensor kinases that can transduce signals associated with ferulate stress, fatty acid biosynthesis, and the surface layer (S-layer) (
Fig. 5A). For example, the CFY1 and CFY2 strains incurred coding variants in 3 genes putatively encoding fatty acid biosynthesis proteins: Cphy3113 for anaerobic synthesis of unbranched fatty acids (
38), the fatty acid dehydratase FabZ (Cphy0520), and the reductase FabV (Cphy1286) for the final step in fatty acid elongation. The genomes of CFY3E to CFY3G (high resistance) and CFY3H (low resistance) differ by variants in Cphy3510, the most highly expressed protein in the proteome that is proposed to form the S-layer (
39). The S-layer is a protein lattice that provides mechanical stabilization, sites for extracellular protein attachment, and a selective barrier for molecules (
40).
Intergenic changes that arose in the CFY genomes affect the expression levels of adjacent genes. For example, a 15-bp sequence between the first two genes of the ABC glucose transporter operon (
cphy2241-cphy2243) was duplicated in the CFY3 strains (see Fig. S6A and B in the supplemental material). The repeated sequence forms an inverted repeat (IR) similar to repeated extragenic palindrome (REP) sequences, a widespread mechanism in bacteria to tune gene expression by modulating the stability of different mRNA segments within an operon (
41). Duplication of this putative REP increases the mRNA secondary structure of the
cphy2243-cphy2242 intergenic region (Fig. S6B), supporting functions similar to those of REP that increase expression by forming stable stem-loop structures that protect mRNA from ribonucleases (
42,
43). Similarly, we found that mRNA expression of the two genes downstream of the insertion was elevated (
Fig. 5B), which may have increased fitness because the GM3 growth selections were done in glucose medium. The mRNA expression of genes in two colocated operons with upstream point mutations was upregulated in the CFY strains (
Fig. 5C). The A-to-G transition upstream of
cphy1464 created a TG dinucleotide 2 bp upstream of the Pribnow hexamer (Fig. S6A) that enhances transcription in other bacteria (
44,
45) and is present in the consensus −10 promoter sequence in
C. phytofermentans (
46). We propose that the upregulated operons
cphy1459-cphy1461 and
cphy1464-cphy1465 either enable increased production of malonyl coenzyme A (malonyl-CoA) for fatty acid biosynthesis or neutralize intracellular pH in response to ferulic acid stress through production of ammonium and lactate (
7) and bicarbonate buffering (Fig. S6C).
Adaptive function can be imparted by structural changes to the genome resulting from recombination and transposition of insertion sequences (IS elements).
C. phytofermentans encodes 31 IS elements (see Table S6 in the supplemental material), including 12 ISL
3 comprised of 2 isoforms—8 ISL
3-1 elements and 4 ISL
3-2 elements. IS elements inactivate genes through their transposition and act as the substrates for homologous recombination. In addition, the IRs of all 12 ISL
3 contain a 5′-TTGACA-3′ sequence matching an outward-facing, consensus −35 box from this organism (
46) (Table S6), suggesting that ISL
3 could activate expression of adjacent genes (
47). An ISL
3-1 was precisely deleted in all CFY genomes as evidenced by reduced read coverage (
Fig. 6A) as well as BioNano optical mapping and Sanger sequencing (see Fig. S7A and B in the supplemental material), showing that ISL
3 are active in
C. phytofermentans. Further, the CFY2C and CFY2D strains share a 333-kb duplication from
cphy2178 to
cphy2461 (276 genes) (
Fig. 6B), which we showed by PCR exists as a tandem duplication joined by a novel ISL
3-2 insertion (
Fig. 6C; Fig. S7C). We did not observe any extrachromosomal DNA by pulsed-field gel in the CFY2C strain, supporting that the
cphy2178-ISL
3-cphy2461 fragment did not excise as a circular molecule. Further, BioNano sequencing of DNA molecules greater than 400 kb spanning the junctions of the duplicated region localizes the rearrangement as a genomic, tandem duplication (
Fig. 6C). We quantified the relative abundance of cells bearing this duplication by quantitative PCR of the
cphy2178-ISL
3-cphy2461 fragment (
Fig. 6D). The duplication arose between day 13 and 20 and overtook the population to comprise 68% of cells by day 40; supporting it was the subject of positive selection. Subsequently, this variant declined in the population, representing 1% of cells at day 63, as it was gradually replaced by mutants with higher fitness; it was not present in any of the CFY3 genomes.