INTRODUCTION
Treponema pallidum subspecies
pallidum strain Nichols (hereafter
T. pallidum), a Gram-negative spirochete responsible for syphilis, presents significant research challenges due to its strict dependence on the human host and its extreme difficulty in cultivation under laboratory conditions. Despite being identified as the causative agent of syphilis over a century ago, the continuous culture of
T. pallidum outside a host environment remains elusive, thereby limiting investigations into how its metabolism influences its pathogenicity (
1). Genome sequencing revealed a streamlined genome of approximately 1.14 million base pairs, comprising 1,041 open reading frames (
2). This minimalistic architecture underscores
T. pallidum’s heavy reliance on host-derived resources, making it one of the most metabolically reduced human pathogens known (
3).
Early attempts to culture
T. pallidum in vitro date back to 1906, following its identification as the causative agent of syphilis (
4). Volpino and Fontana reported on the cultivation of the bacterium, but this initial effort faced significant challenges due to the organism’s fastidious nature and specific growth requirements (
5). In 1981, Fieldsteel et al. reported the short-term multiplication of
T. pallidum in a coculture system with rabbit epithelial cells, achieving up to 100-fold growth over 12 to 18 days (
6). However, this method was not sustainable for continuous culture. A pivotal advancement occurred in 2018, when Edmondson et al. developed a refined coculture system using a modified CMRL 1066 medium supplemented with 20% fetal bovine serum, allowing
T. pallidum to be cultivated for extended periods under controlled conditions (
7). This breakthrough addressed prior challenges with contamination from rabbit epithelial cell debris in generating proteomics data, thereby facilitating an improved understanding of
T. pallidum’s metabolic capability. Despite these advancements, implementing different synthetic biology tools (e.g., CRISPRi, gene over-expression, and gene knockouts) remains challenging in
T. pallidum, likely due to the bacterium’s sensitivity to genetic perturbation (
8). This limitation hinders functional studies of
T. pallidum’s metabolism and virulence mechanisms.
These challenges in cultivation, coupled with its streamlined genome, underscore the need for a deeper investigation into
T. pallidum’s metabolic strategies. Lacking a complete tricarboxylic acid (TCA) cycle,
T. pallidum instead appears to rely primarily on substrate-level phosphorylation and acetate overflow metabolism for ATP production (
9). This dependency on simplified energy generation schemes raises fundamental questions about how
T. pallidum meets the energetic demands required to sustain its high motility—an essential virulence factor that enables tissue invasion and immune evasion (
10). Recent studies suggest that
T. pallidum supplements ATP production through additional pathways, such as a flavin-dependent acetogenic energy conservation pathway involving D-lactate dehydrogenase (D-LDH) (
11). D-LDH oxidizes D-lactate to pyruvate, contributing to ATP generation. However, this mechanism introduces another question: how does
T. pallidum regenerate the necessary reducing power to support glycolytic fluxes?
Given the complexities of studying
T. pallidum’s metabolism and the unresolved questions about its redox strategies, we employed genome-scale metabolic models (GEMs). GEMs offer a comprehensive systems-level framework for simulating and analyzing metabolic networks, offering insights into the metabolic strategies of diverse organisms (
12). Their efficacy has been demonstrated in previous studies, particularly in elucidating key phenomena in bacteria associated with sexually transmitted diseases (
13,
14). Moreover, GEM can work as a unified platform to combine different “omics“ data to gain a holistic multi-omics understanding of an organism’s metabolism (
15). This approach is especially advantageous for
T. pallidum, where genetic perturbations remain difficult. By leveraging GEMs, we aim to provide mechanistic insights into
T. pallidum’s metabolic adaptations, including redox-balancing strategies essential for its survival and adaptation within the host environment (
16).
Accordingly, starting from an extensive literature review (
1,
2,
17), we reconstructed a comprehensive GEM for
T. pallidum, iTP251. The iTP251 achieved a MEMOTE score of 92%, demonstrating high-quality reconstruction and pathway coverage, with additional robustness assessment through FROG analysis (
18,
19). To validate iTP251, we conducted
in silico gene essentiality predictions, benchmarking them against data sets from
Escherichia coli, a model organism with extensive essentiality data, and
Neisseria gonorrhoeae, a human-obligate pathogen providing functional context (
20,
21). This comparative validation supported the reliability of iTP251 in capturing the essential metabolic functions necessary for
T. pallidum’s survival.
Building on this standard GEM, we developed an enzyme-constrained GEM (ecGEM), ec-iTP251, to incorporate enzyme capacity constraints not represented in traditional GEMs. While standard GEMs capture metabolism stoichiometrically, these lack representation of enzymatic limits, often leading to overestimated metabolic capacities (
16,
22). The ecGEM framework allows us to integrate enzyme efficiencies and abundance, which is essential for organisms with reduced genomes like
T. pallidum, where resource allocation is tightly regulated (
13). Furthermore, the recent availability of a high-resolution proteomic data set for
T. pallidum, covering 94% of its proteome under
in vitro conditions, provided an excellent benchmark for validating the ec-iTP251 (
23). Comparing the model’s predicted fractional enzyme allocations with the proteomic data revealed an 88% Pearson’s correlation within central carbon pathways, reaffirming the model’s accuracy. Additionally, ec-iTP251 accurately predicted experimentally observed growth rates of
T. pallidum for both glucose and pyruvate uptakes.
Therefore, in this study, we utilized ec-iTP251 to investigate
T. pallidum’s unique metabolic adaptations, focusing on ATP generation schemes and redox balancing strategies. We examined how this bacterium allocates proteins and generates ATP while utilizing experimentally verified carbon sources: glucose, pyruvate, and mannose (
24). Our findings revealed that protein allocation remains relatively low across these conditions, with the highest allocation observed for glucose at 29.6%, suggesting metabolic flexibility that facilitates lactate uptake. Additionally,
T. pallidum likely employs glycerol-3-phosphate dehydrogenase as an alternative electron sink during lactate uptake, enabling redox balance and contributing to ATP production. These adaptive strategies, combined with its reliance on motility and the absence of the TCA cycle, provide selective advantages for survival and pathogenicity within the host. By highlighting these metabolic features, ec-iTP251 offers a robust platform for future studies targeting bioenergetic pathways critical for
T. pallidum’s survival and pathogenicity.
MATERIALS AND METHODS
Genome-scale metabolic model reconstruction and refinement
We generated an initial draft of the genome-scale metabolic model for
T. pallidum (RefSeq:
NC_021490) using the KBase platform, where three reactions were added for gap-filling purposes. Given the obligate host dependency and energy limitations of
T. pallidum, extensive manual refinement was required to ensure that core metabolic pathways accurately reflected its unique physiology. Initial curation focused on central carbon metabolism, with adjustments to glycolytic and energy-conserving reactions. Specifically, the ATP-dependent phosphofructokinase reaction was replaced with a pyrophosphate-dependent phosphofructokinase, reflecting
T. pallidum’s reliance on PPi as an alternative to ATP (
1). This metabolic adaptation, shared with related spirochetes like
Borrelia burgdorferi, aligns with the bacterium’s strategy to conserve ATP in its energy-limited environment (
47). Additionally, 13 PTS reactions were removed, as experimental evidence confirms their absence in
T. pallidum’s metabolic repertoire (
48).
Targeted gap-filling, informed by high-resolution proteomic data, addressed additional gaps in the metabolic network. Eight reactions critical to
T. pallidum’s central and peripheral metabolic pathways were added, encompassing nucleotide biosynthesis, lipid biosynthesis, amino acid metabolism, and cofactor and energy metabolism (see
Table S4 for details). For nucleotide biosynthesis, reactions supporting pyrimidine and purine metabolism were incorporated to fulfill the demands of DNA replication and repair. Lipid biosynthesis pathways were expanded to include terpenoid backbone synthesis, reflecting
T. pallidum’s requirements for membrane components essential for cellular integrity and function. Enhancements to amino acid metabolism ensured the synthesis of alanine, aspartate, and glutamate, supporting basic protein synthesis and cellular maintenance. Cofactor and energy metabolism pathways, including thiamine, nicotinate, and NAD
+/NADH, were refined to maintain redox balance and energy production. One-carbon metabolism pathways were added to complete folate-related biosynthesis. Furthermore, D-lactate dehydrogenase was added to represent acetogenic ATP generation, enabling ATP synthesis independent of glycolysis.
Following these refinements, iTP251 was tested for its capacity to support growth on T. pallidum’s known carbon sources, including glucose, pyruvate, and mannose, to confirm biological accuracy. For pyruvate utilization, a transport reaction was added, while for mannose, both a transport reaction and a metabolic conversion from mannose-6-phosphate to fructose-6-phosphate were incorporated to enable substrate-specific growth.
Biomass composition and maintenance parameters
The biomass composition of iTP251 was adapted from the
Borrelia burgdorferi (iBB151) metabolic model to approximate the cellular macromolecular makeup of
T. pallidum (
30). The biomass was defined as 70% protein, 20% lipid, and 5% carbohydrate on a dry weight basis, consistent with published physiological data of this bacteria (
Table S1). To ensure accurate growth yield predictions, the biomass equation was standardized to achieve a molecular weight of 1 g/mmol.
Maintenance energy parameters, GAM and NGAM, were estimated to reflect T. pallidum’s unique physiology. Following Pirt’s equation was applied to calculate these parameters:
where
is the observed growth yield,
is the maximum growth yield,
is the specific growth rate, and
represents the maintenance coefficient. Theoretical parameters were initially derived from
Pseudomonas putida due to its well-characterized physiology and were then adjusted to fit the metabolic characteristics of
T. pallidum (
49). Using the repurposed Prit’s equation for
T. pallidum, we estimated GAM to be 48.69 mmol/gCDW/h (Text S1;
Table S2). This value was calibrated iteratively through flux balance simulations, ensuring agreement with its predicted growth characteristics. NGAM, representing baseline ATP requirements for essential cellular functions, was determined by simulating a zero-growth condition in the ec-iTP251 model. The glucose uptake rate (reaction “EX_cpd00027_e0”) was fixed at −0.0447 mmol/gDW/h, representing minimal glucose consumption for non-growth maintenance. The model’s objective was set to maximize flux through the ATPase reaction (“rxn05145_c0”), which yielded an NGAM value of 1.50 mmol ATP/gDW.
Both GAM and NGAM values were subsequently incorporated as fixed parameters, ensuring accurate representation of maintenance energy across simulated environments.
In silico reaction and gene essentiality analysis
Reaction and gene essentiality analyses were conducted using the COBRApy toolbox to evaluate the predictive capacity of the iTP251 model. Reaction essentiality was determined by simulating single-reaction knockouts, where a reaction was classified as essential if its deletion reduced biomass flux by more than 90% compared to the wild-type model. Gene essentiality was assessed by simulating single-gene knockouts under the same criteria. Experimentally validated essentiality data sets for
Escherichia coli and
Neisseria gonorrhoeae were obtained from the Online Gene Essentiality database (
50) for cross-species validation. Orthologous genes were identified using BLAST (
51,
52), with an E-value threshold of 1e-5. This dual analysis provided a comprehensive evaluation of essential metabolic functions in
T. pallidum while benchmarking the model’s predictions against established data sets for related organisms (
Table S5).
Enzyme-constrained model development and integration of enzyme kinetics
To enhance the physiological relevance of the iTP251 model, we developed ec-iTP251 by integrating
and molecular weights MW for reactions with GPR associations. The DLKcat tool predicted
values based on homology and structural properties for 391 out of the 471 GPR-associated reactions, thus providing estimates for reactions lacking direct experimental measurements (
Table S6). Additionally, molecular weights of enzymes were calculated from amino acid sequences obtained from the Kyoto Encyclopedia of Genes and Genomes database (
53–55).
For the 80 reactions lacking
data, a Monte Carlo approach was employed to simulate biological variability. This method generated 100 sets of
values by stochastically sampling from distributions derived from the DLKcat predictions (
Table S6). Each set was incorporated into the ec-iTP251 model, creating an ensemble of 100 models with diverse metabolic configurations.
To identify the most biologically relevant model, we calculated the total cellular protein demand () for each configuration, defined as:
The ensemble model with the lowest total protein demand was selected as the most protein-efficient representation of T. pallidum’s metabolism.
Validation against proteomic data
To assess the accuracy of ec-iTP251 in predicting protein allocation, we validated the model against a high-resolution proteomic data set. We first calculated the total protein associated with the central carbon pathway and then determined the fractional protein allocation required for each enzyme-associated reaction. Afterward, we log-transformed these fractional values. This process was conducted for both the ec-iTP251 predictions and the experimental proteomics data, enabling a direct comparison between the two. Pearson’s correlation coefficient (R) was then applied to quantify the degree of agreement, and the coefficient of determination (R2) was calculated to measure predictive accuracy. The validation yielded an R2 value of 0.876, indicating a strong correlation between predicted and observed enzyme allocations. These results confirm the reliability of ec-iTP251 in accurately simulating protein distribution.
Max-min driving force analysis
To evaluate the thermodynamic feasibility and compare the driving forces for lactate uptake versus lactate secretion, we applied MDF analysis. MDF identifies the minimal Gibbs free energy dissipation required to sustain flux directionality in a metabolic pathway, optimizing for the smallest driving force across all reactions. This approach is particularly useful in metabolic systems with limited metabolomic data, as it can be applied using plausible concentration ranges to ensure thermodynamically favorable reaction conditions.
The MDF formulation is outlined below:
Here,
represents the standard Gibbs free energy obtained using the eQuilibrator API (
56);
is the gas constant;
is the temperature (37°C);
is the vector of metabolite log-concentrations; and
(1 nM) and
(10 mM) define the lower and upper metabolite concentration bounds. The objective
represents the tightest lower bound on the driving force across all reactions, ensuring they operate as far from equilibrium as feasible within physiological constraints. Using this MDF framework, we conducted separate optimizations for both lactate uptake and lactate secretion conditions to capture and compare the thermodynamic driving force profiles.
Substrate-specific metabolic optimization under protein constraints
To analyze T. pallidum’s metabolic adaptations across the known carbon sources, substrate-specific optimizations were performed using the ec-iTP251 model. The objective was to identify reactions involved in ATP generation and NAD+ regeneration under protein-limited conditions.
For each substrate, specific constraints were implemented to account for its unique metabolic demands. For example, biomass production flux was fixed at 0.0231 h−1 and total cellular protein content () at 0.296 for glucose conditions. These values were readjusted for pyruvate and mannose to reflect their distinct metabolic profiles.
The optimization involved two steps. First, total protein allocation was minimized while satisfying biomass production requirements:
Subject to
Here, and are the sets of metabolites and reactions in the model, respectively. is the stoichiometric coefficient of metabolite in reaction , is the set of reactions for which GPR is available, and is the flux value of reaction . Parameters and denote the minimum and maximum allowable fluxes for reactions , respectively.
In the second stage, parsimonious flux balance analysis (
57) was performed with fixed
and
, minimizing enzyme usage while adhering to protein constraints. The resulting flux distributions were analyzed to identify substrate-specific differences in ATP generation and NAD
+ regeneration. Key reactions contributing to these processes are in
Table S3.
Software and computational tools
All computational analyses were performed using Python, specifically with the COBRApy 0.29.0 package for constraint-based modeling. The DLKcat tool was employed for predicting values, and MEMOTE was used for assessing model quality and consistency. All simulations and analyses were conducted on a Windows 11 Enterprise operating system (Version 22H2), running on a machine equipped with a 12th Gen Intel Core i7-12700H CPU at 2.30 GHz, 32.0 GB of RAM, and a 64-bit x64-based processor.