INTRODUCTION
Lignocellulosic biomass, including residue from agriculture and forestry, is an attractive alternative carbon feedstock for bioprocesses (
1). However, its inherent heterogeneity and recalcitrant nature pose a unique challenge to the efficient utilization of this abundant, renewable resource (
2). Integrating physicochemical pretreatment with microbial conversion of lignocellulose-derived sugars and aromatic compounds to fuels and chemicals has been proposed to overcome this challenge (
3,
4). Depolymerization of lignocellulose results in mixtures of hexose (glucose) and pentose (xylose, arabinose) sugars as well as aromatic compounds (
4). Most microbes grow on a subset of these compounds which limits their industrial use.
Pseudomonas alloputida KT2440 (previously known as
Pseudomonas putida KT2440), hereafter referred to as KT2440, is a promising industrial biocatalyst due to its metabolic versatility, ability to tolerate environmental stresses, and amenability to genetic manipulation (
5–8). However, KT2440 cannot natively utilize xylose (
9), which makes up a significant fraction of total plant sugars, preventing the complete utilization of lignocellulose sugars. Recently
P. putida strains capable of growing on pentose sugars were characterized by isolation, genomic sequencing, and phylogenetic analysis (
10). One of these strains,
Pseudomonas putida M2 (hereafter referred to as M2), is a newly isolated soil bacteria that grows on glucose, xylose, and arabinose as well as biomass-derived aromatic monomers, making it a promising host for plant biomass valorization (
10,
11).
To thrive in different nutritional environments with numerous and fluctuating amounts of carbon sources, Pseudomonads regulate carbon substrate use to optimize metabolism and growth. This complex regulation is known as carbon catabolite repression (CCR) (
12,
13). Previous studies have shown that Pseudomonads generally prefer organic acids and amino acids to glucose, which is preferred to hydrocarbons and aromatic compounds, with a few exceptions (
12,
14–16). For instance, KT2440 utilizes glucose and acetate in preference to aromatic compounds (
14,
17,
18). In Pseudomonads, Crc (catabolite repression control), a major regulatory protein, inhibits the translation of mRNAs responsible for the catabolism of non-preferred compounds in the presence of other preferred carbon sources (
12,
19). The RNA-binding Hfq protein facilitates the Crc-dependent inhibition of mRNA translation by recognizing the catabolite activity (CA) motif (AANAANAA) in the target mRNA (
20,
21). Furthermore, previous CCR studies have shown that small RNAs (sRNAs) that contain several CA motifs antagonize the regulatory effect of Crc by sequestering the Hfq protein, thus determining the strength of Crc-mediated CCR (
19,
22).
Depending on the type of lignocellulosic biomass and depolymerization method employed, a heterogeneous mixture of sugar and aromatic monomers is obtained. The conversion of these mixtures makes understanding of cellular physiology during mixed-substrate utilization of particular interest. To enable the conversion of lignocellulose-derived sugars and aromatic compounds with a single microbe, the microbial host must consume all the carbon sources present rapidly and simultaneously. However, preferential or incomplete consumption of carbon sources due to CCR hinders efficient valorization of lignocellulosic biomass-derived components (
14,
23) and negatively affects the economics of the bioconversion process. Accumulation of intermediates due to CCR creates challenges in the downstream processes used to extract the desired product. A systemic understanding of CCR in M2 is required to overcome this barrier and develop strategies for metabolic engineering to develop a robust microbial host. This work focused on characterizing and comparing the growth of M2 when both sugar and aromatic compound were simultaneously present in the growth medium. Furthermore, CRISPRi and sRNAs sequencing were employed to probe the role of
crc and sRNAs in CCR-mediated repression in M2.
DISCUSSION
Pseudomonads grow and compete in diverse habitats due to their flexible metabolic pathway and have a global regulatory mechanism referred to as CCR that ensures that the most energy-efficient carbon source is utilized from a mixture of nutrient sources. This study demonstrated that M2 can co-metabolize sugars and aromatic compounds; however, both sugars and aromatic compounds were not completely consumed, except for aromatic compounds in the presence of xylose. Unlike glucose-aromatic, the aromatic compounds were consumed rapidly, and the aromatic intermediates accumulated transiently in the xylose-aromatic mixtures. The proteomics results also showed that glucose repressed all the aromatic catabolic and β-ketoadipate proteins, while xylose inhibited only some of the aromatic catabolic proteins. Therefore, glucose exerted a stronger CCR effect over the consumption of aromatic compounds compared to xylose. The difference in CCR between glucose and xylose could be due to the difference in reducing power generated during their catabolism and their dependence on the aromatic catabolic pathways for their energy need. A multi-omics approach integrating proteomics, 13C-metabolomics, and quantitative flux analysis will provide a detailed understanding of carbon partitioning and the metabolic network during the co-utilization of sugars and aromatic compounds in M2.
In the case of glucose and aromatic compounds, there was a long lag phase after which consumption of both substrates commenced simultaneously and followed a similar trend indicating cross-catabolite repression. It should be noted that the wild-type M2 strain exhibited a similar phenotype of long lag phase and incomplete substrate consumption in both single (glucose alone) and dual (glucose +
pCA) conditions; however, the proteomics and CRISPRi results as discussed below consolidate the hypothesis regarding simultaneous cross-catabolite repression. This mode of metabolism differs from the CCR where one carbon is strictly preferred over the other as observed in other cases (
15,
31,
32). For example,
Pseudomonas putida CSV86 preferentially consumes aromatic compounds (naphthalene and benzyl alcohol) and organic acids over glucose where glucose metabolizing enzymes such as glucose-6-phosphate dehydrogenase and glucose transport protein were repressed during aromatic compounds or organic acids utilization (
31,
32). The phenomena of simultaneous CCR have been observed in KT2440 strains where glucose and toluene (
33) and glucose and benzoate (
34,
35) were simultaneously metabolized while repressing each other’s metabolism but in a manner that optimizes biomass growth. In KT2440 (pWW0), toluene-induced repression of glucose metabolism was directed to the glucokinase branch of glycolysis, while glucose inhibited TOL plasmid Pu promoter for toluene catabolism (
33). A segregated metabolic flux was observed in KT2440-JD1 grown on glucose and benzoate where glucose-derived carbons were compartmentalized in the upper Embden–Meyerhof–Parnas (EMP) and the pentose-phosphate (PP) pathways and the benzoate-derived carbons primarily populated the TCA cycle and the glyoxylate shunt (
34). In our study with M2, the proteome analysis demonstrated that the presence of sugars repressed both the upper aromatic catabolic pathway and β-ketoadipate proteins required for the aromatics assimilation including TCA and gluconeogenesis (in the case of xylose) proteins, while the effect of aromatic compounds was directed toward the Entner–Doudoroff (ED) branch (particularly the
eda gene) and the
xylX gene, required for glucose and xylose metabolism, respectively, as well as the amino acid biosynthesis and catabolic pathways.
In M2, aromatic monomers such as
pCA and VA are funneled into central intermediate PCA, which is further catabolized through the β-ketoadipate pathway to the TCA cycle to yield acetyl-CoA (
10). The conversions of 4-HBA and VA to PCA catalyzed by PobA and VanAB, respectively, are considered the rate-limiting steps in KT2440 and have been identified as targets for Crc regulation in KT2440 (
14). PobA and VanAB are the only enzymes in the aromatic catabolic pathway that require reducing equivalents for their activity; therefore, they may be subject to tighter regulation by Crc when selecting the most energy-efficient carbon substrate. Replacing the native PobA in KT2440 with PraI from
Paenibacillus sp. JJ-1b, which has a broader nicotinamide cofactor preference, relieved this bottleneck and improved muconate production from
pCA and glucose (
36). In M2, PobA and VanAB were also significantly downregulated; however, more work is needed to identify metabolic bottlenecks preventing the complete metabolism of aromatic compounds in the presence of glucose in M2.
The global regulatory protein Crc plays a key role in CCR in Pseudomonads where Crc has been shown to inhibit the translation of mRNA encoding the catabolic enzymes, transcriptional regulators, as well as transporters required for substrates to enter the cell (
37–39). The phenotypic characterization demonstrated that decreasing the expression of the
crc gene led to faster growth and metabolism of CRISPRi strains compared to the control during the co-utilization of glucose and
pCA or FA. The faster growth was further supported by evidence of upregulation of proteins involved in amino acid biosynthesis and transport, while the upregulation of Eda potentially led to faster glucose consumption. However, the CRISPRi strains still exhibited an accumulation of aromatic intermediates and incomplete substrate consumption which suggests there might be additional yet unknown regulatory elements that control CCR. While alleviating CCR by deleting the
crc gene has been shown to be beneficial to ensure efficient metabolism and improve product yield (
14,
29,
40,
41), there are also contrary results that showed no positive effect after
crc deletion (
18,
42–44). Castillo and Ramos (
33) showed that repression of glucose metabolism by toluene is mediated through the Crc protein, while glucose-mediated repression of toluene degradation requires PtsN protein. Besides Crc, some potential regulators related to catabolite repression in
P. putida are the cytochrome o ubiquinol oxidase (Cyo), the phosphoenolpyruvate: sugar phosphotransferase (Pts) system (
44–48). Therefore, CCR regulation is a complex phenomenon that can be carbon source- and strain-specific and more often involves more than one regulatory element. The mechanism of Crc-mediated CCR and other potential regulatory mechanisms are still unknown in M2 and need to be further investigated in future research. Besides alleviating CCR, it is also necessary to broaden the substrate spectrum, enhance substrate uptake, and improve tolerance to toxic compounds that might be present in biomass hydrolysates or lignolysates to create an efficient and robust single strain-based biocatalyst for upgrading lignocellulosic biomass (
49). Overexpressing gene encoding aromatic transporter [e.g. PcaK, the 4-HBA transporter (
50)] was shown to improve the bioconversion of
pCA to 2,4 pyridine dicarboxylic acid in engineered KT2440 (native
pcaHG was replaced with
ligAB from
Sphingobium sp. SYK-6) and avoid 4-HBA accumulation (
51). Future research should identify transporter genes and evaluate if overexpression or heterologous expression of transporter genes would improve bioconversion in M2.
The difference in sRNA abundances between different carbon sources in both KT2440 and M2 followed the same trend; however, the difference was more distinct in KT2440 compared to that in M2. While we have no experimental evidence to explain these differences, there may be other potential sRNAs in M2 that played an important role in CCR. Previous studies have used bioinformatics tools for sRNAs prediction (
52,
53) and experimental approaches involving shotgun cloning approach combined with Northern blotting and RT-PCR (
28,
54), to identify sRNAs involved in CCR in
Pseudomonas strains and can provide directions for future studies in M2.
As demonstrated in this study, M2 may be used as a platform biocatalyst to produce industrially relevant products from lignocellulosic biomass due to its versatile sugar and aromatic metabolism. Our findings provide important insights into the physiology and metabolism of M2 during sugar and aromatic compound co-utilization. However, incomplete consumption and accumulation of aromatic intermediates due to CCR may hinder its industrial application. Simultaneous and complete utilization of sugar and aromatic compound could also be beneficial for bioproduct formation, one substrate could be used for energy for cell growth, while the other substrate for product formation, a strategy already being used in KT2440 (
14). As M2 is amenable to genetic manipulation via CRISPRi as shown by our study and previous publications (
10,
25), this study provides a framework for the use of metabolic engineering tools to alleviate CCR without causing other metabolism imbalances.
MATERIALS AND METHODS
Culturing conditions for P. putida M2 and P. alloputida KT2440 and growth measurements
All microbial cultivation experiments were conducted in triplicates at 30°C using 10mL 0.2µm filter sterilized modified M9 mineral medium (final pH adjusted to 7) with the following composition: (NH4)2SO4 (1.0 g L−1), KH2PO4 (1.5 g L−1), Na2HPO4 (3.54 g L−1), ammonium ferric citrate (0.06 g L−1), MgSO4·7H2O (0.2 g L−1), CaCl2·2H2O (0.01 g L−1), and trace elements H3BO3 (0.3 mg L−1), CoCl2·6H2O (0.2 mg L−1), ZnSO4·7H2O (0.1 mg L−1), MnCl2·4H2O (0.03 mg L−1), NaMoO4·2H2O (0.03 mg L−1), NiCl26H2O (0.02 mg L−1), and CuSO4·5H2O (0.01 mg L−1). The M2 and KT2440 strains were inoculated from glycerol stocks onto an LB agar plate and a single colony from the plate was grown in 5mL of LB medium (first preculture) overnight with continuous shaking at 200rpm. The first preculture (100 µL) was inoculated into 5mL freshly prepared M9 minimal medium with a corresponding carbon source to prepare the second preculture and incubated overnight. Finally, the growth curve and growth kinetics experiments were started the following day from the second preculture at a desired starting optical density (0.025–0.05) in 50mL glass tubes and 48-well plates with 250µL cell culture in each well, respectively. The minimal media was supplemented with a mixture of sugar (glucose or xylose) and aromatic compound (pCA, 4-HBA, FA, or VA) as model lignocellulose-derived substrates at a concentration of~10 g L−1 (sugars) and~1 g L−1 (aromatic compounds). For the growth curve experiments, the cell culture samples were taken at different time points to determine growth spectrophotometrically (SpectraMax M2, Molecular Devices, San Jose, CA) by measuring absorbance at 600nm (OD600). Growth kinetics was done using a Synergy plate reader (BioTek Instruments, Winooski, VT, USA).
Quantification of sugars was performed using high performance liquid chromatography (HPLC) using Agilent 1260 Infinity system (Santa Clara, CA, USA) equipped with an Aminex HPX-87H (300 mm × 7.8 mm) column (Bio-Rad, Hercules, CA) and a refractive index detector heated at 35°C. An aqueous solution of 4 mM sulfuric acid was used as the mobile phase (0.4 mL min−1, column temperature of 25°C). The amount of monomeric aromatic compounds was analyzed using Agilent 1200 system with Eclipse Plus Phenyl-Hexyl (250 mm length, 4.6 mm diameter, 5 µm particle size) column kept at 50°C. The mobile phase consisted of binary system phases with 10 mM ammonium acetate in 0.07% formic acid (Solvent A) and 10 mM ammonium acetate in 90% acetonitrile and 0.07% formic acid (Solvent B). The mobile phase gradient method was as follows: 30% B (0 min; 0.5 mL min−1), 80% B (12 min; 0.5 mL min−1), 100% B (12.1 min; 0.5 mL min−1), 100% B (12.6 min; 1 mL min−1), 30% B (12.8 min; 1 mL min−1), and 30% B (15.6 min; 1 mL min−1). The metabolite concentration was quantified from peak areas using calibration curves generated by each metabolite standard.
Shotgun proteomics analysis using LC-MS
All strains were grown in three biological replicates at 30°C following a similar procedure as stated in the culturing conditions section above starting with an LB agar plate followed by the first and second preculture before growing them in 10mL M9 minimal media supplemented with corresponding carbon sources (Table S6) at 30°C. The cells were harvested by centrifugation at OD between 0.4 and 0.5 and stored at −80°C until sample preparation. Proteins from the samples were extracted using a previously described chloroform/methanol precipitation method (
55). Extracted proteins were resuspended in 100mM ammonium bicarbonate buffer supplemented with 20% methanol, and protein concentration was determined by the DC assay (BioRad). Protein reduction was accomplished using 5mM tris 2-(carboxyethyl)phosphine (TCEP) for 30min at room temperature, and alkylation was performed with 10mM iodoacetamide (IAM; final concentration) for 30min at room temperature in the dark. Overnight digestion with trypsin was accomplished with a 1:50 trypsin:total protein ratio. The resulting peptide samples were analyzed on an Agilent 1290 UHPLC system coupled to a Thermo scientific Obitrap Exploris 480mass spectrometer for the discovery proteomics (
56). Briefly, 20µg of tryptic peptides were loaded onto an Ascentis (Sigma–Aldrich) ES-C18 column (2.1 mm × 100mm, 2.7µm particle size, operated at 60°C) and were eluted from the column by using a 10-min gradient from 98% buffer A (0.1% FA in H2O) and 2% buffer B (0.1% FA in acetonitrile) to 65% buffer A and 35% buffer B. The eluting peptides were introduced to the mass spectrometer operating in positive-ion mode. Full MS survey scans were acquired in the range of 300–1200 m/z at 60,000 resolution. The automatic gain control (AGC) target was set at 3E06 and the maximum injection time was set to 60ms. The top 10 multiply charged precursor ions (
2–5) were isolated for higher-energy collisional dissociation (HCD) MS/MS using a 1.6-m/z isolation window and were accumulated until they either reached an AGC target value of 1e5 or a maximum injection time of 50ms. MS/MS data were generated with a normalized collision energy (NCE) of 30, at a resolution of 15,000. Upon fragmentation, precursor ions were dynamically excluded for 10s after the first fragmentation event. The acquired LCMS raw data were converted to mgf files and searched against the latest UniProt
P. alloputida KT2440 protein database or JGI
P. putida M2 protein database (IMG submission ID: 236130) with Mascot search engine version 2.3.02 (Matrix Science). The resulting search results were filtered and analyzed by Scaffold version 5.0 (Proteome Software Inc.). The normalized spectra counts of identified proteins were exported for relative quantitative analysis. The proteomics data were analyzed using a customized Python script to calculate log
2 FC values and identify proteins that were significantly downregulated in the dual substrate conditions compared to the single substrate condition. Only proteins with spectral counts above five were retained in the analysis. Proteins were identified to be significantly differentially expressed when they displayed a log
2 FC ≥ 0.5 across at least two out of three biological replicates with the
P-value cut-off of 0.05.
CRISPR interference-based crc gene repression
Escherichia coli TOP10 cells were used as the primary host for gene cloning and plasmid isolation. Kanamycin (50 µg mL
−1) was used for the selection of plasmid-harboring cells.
E. coli competent cell preparation and transformation were performed using standard molecular biology techniques (
57). Electroporation (1 mm gap cuvette, with parameters set at the resistance of 200 Ω, voltage 1.8 KV, capacitance of 25 μFD for 4.0–5.0 ms) was used for the transformation of competent M2 cells with selected plasmids. Cells were incubated in LB medium for 1.5 h at 30°C and 200 rpm on a shaker incubator and plated on LB plates with kanamycin for selection. Q5 High-Fidelity 2X and OneTaq 2X Master Mix (New England Biolabs; Ipswich, MA, USA) were used according to the manufacturer’s instructions for PCR amplification during gene cloning and colony PCR to confirm gene cloning, respectively. NEBuilder HiFi DNA Assembly Master Mix (New England Biolabs) was used to assemble plasmids. QIAprep Spin miniprep kit and QIAquick gel extraction kit (Qiagen, Hilden, Germany) were used according to the manufacturer’s protocol to isolate plasmids and to separate PCR products from agarose gels, respectively. The sequences of all plasmids and DNA constructs were confirmed by Sanger sequencing (Genewiz, CA, USA).
The CRISPR interference system utilizing the type II dCas9 homologs from
Streptococcus pyogenes (pRGPspdCas9bad,
Table 1) previously developed for KT2440 and M2 (
24,
25) was used as the backbone plasmid to develop a dual inducible CRISPR/dCas9 interference-based gene repression system in M2. Expression of spdCas9 was under the control of the P
tac [Isopropyl β-D-1-thiogalactopyranoside (IPTG)-inducible] promoter, while sgRNA expression was under the control of the P
araC/bad (arabinose-inducible) promoter. The wild-type M2 strain carrying an empty pRGPspdCas9bad plasmid served as the control. An open reading frame (Ga0436255_01_2263068_2263847) was found which had 95% sequence similarity to the KT2440
crc sequence in the M2 genome. Three sgRNAs (20 bp) targeting three different regions of the
crc gene (Fig. S10), resulting in three recombinant strains (crc-1, crc-2, and crc-3) were designed and cloned into the plasmid by PCR amplifications using primers listed in
Table 2. One sgRNA targeted a region close to the start codon at the 5′ end (crc-1) and two inside the ORF (crc-2 and crc-3) of the
crc gene. The constructed plasmids were electroporated into M2 to generate the CRISPRi strains. The recombinant CRISPRi strains were grown in minimal media supplemented with corresponding carbon sources (glucose +
pCA, glucose +FA, and xylose +
pCA), and both inducers (~3.5 g L
−1 arabinose and 1 mM IPTG) for phenotypic characterization (following a similar protocol as stated in the culturing conditions section above) and proteome comparison (glucose,
pCA, or glucose +
pCA as carbon sources) with the control strain.
The oligonucleotides were ordered from Integrated DNA Technologies (IDT, San Diego, CA, USA). All the bacterial strains, plasmids, and primers used in this study can be found in
Tables 1 and 2.
Small RNA sequencing
M2 and KT2440 cells were grown in minimal media containing either glucose, pCA, or glucose +pCA, and cell pellets were submitted to CD genomics (NY, USA) for total RNA extraction and sRNA sequencing. Briefly, total RNA was isolated using Qiagen miRNeasy mini kit (Hilden, Germany) following the manufacturer’s protocol, quantified by NanoDrop 2000 (Thermo Fisher Scientific, Waltham, MA, USA), and quality assessed with agarose gel electrophoresis. The preparation of the sRNA-enriched library was done with NEBNext Ultra II RNA kit (New England Biolabs, Ipswich, MA, USA) following the manufacturer’s recommendation which included cDNA synthesis, 3’ and 5’ adapter ligation, PCR enrichment of adapter-ligated RNA, and size fractionation (80–400 bp) with Pippin Prep to specifically enrich for sRNAs. The library quantity was measured by KAPA SYBR FAST qPCR, while the quality was evaluated by TapeStation D1000 ScreenTape (Agilent Technologies, CA, USA). Equimolar pooling of libraries was performed based on the QC values and sequenced on Illumina NovaSeq 6000 (Illumina, CA, USA) with a read length configuration of 150 paired-end for 10 million paired-end reads per sample.
Illumina adapter sequences and small RNA library adapter sequences were removed from raw sequencing reads using Fastp and Cutadapt, respectively. The coding mRNA sequences were removed by aligning the quality-filtered RNA-seq reads with the reference genome. We ran BLASTX against sequences similar to CrcY and CrcZ, the sRNAs found in KT2440 (
27), and determined their abundances in both M2 and KT2440 under different carbon conditions.