INTRODUCTION
Rothia mucilaginosa is a Gram-positive, encapsulated, non-motile, and non-spore-forming bacterium of the
Micrococcaceae family (
1,
2). While it is mainly aerobic, it may also grow anaerobically as it can switch to fermentation or other non-oxygen-involving pathways.
R. mucilaginosa is a common commensal of the normal oral, upper and lower respiratory tract, and part of the skin florae in humans (
1,
3–6). This means it coexists harmlessly within the host and may even provide benefits. Nonetheless, it can also act as an opportunistic pathogen, particularly in individuals with weakened immune systems, as an etiological agent of serious infections such as endocarditis, sepsis, and meningitis (
7). Janek et al. highlighted the high prevalence of
R. mucilaginosa within the nasal microbiome (
8). Moreover, they report its susceptibility to certain staphylococcal bacteriocins, indicating its major competition with the nasal staphylococci and the substantial impact of bacteriocins in shaping the nasal microbiota. In 2020, Uranga et al. revealed that
R. mucilaginosa produces the strongest Fe
3+-binding archetypal siderophore known, called enterobactin (
9). This attribute contributes to its competition with oral microbiota (the cariogenic
S. mutans,
A. timonensis, and
Streptococcus sp.) as well as four methicillin-resistant strains of
S. aureus (MRSA). Enterobactin is a type of siderophore produced by bacteria to scavenge, chelate, and transport ferric irons from their surrounding environment. These are essential for bacteria when iron is scarce as they facilitate their acquisition necessary for their growth and metabolic processes.
Prior metagenomic sequencing analyses have unveiled the prevalence of
R. mucilaginosa at high abundances and its enhanced metabolic activity in the lungs of cystic fibrosis (CF) patients (
10,
11). CF is caused by the hereditary mutation of the cystic fibrosis transmembrane conductance regulator (CFTR) gene that disrupts the transepithelial movement of ions, leading to an aberrant accumulation of thick and sticky mucus within the airways. The impaired immune clearance creates a hypoxic environment (
12) promoting the polymicrobial colonization of opportunistic microbes together with fungi and viruses, ultimately resulting in persistent and recurring infections (
13). Guss et al. and Bittar et al. declared
R. mucilaginosa as an emerging CF bacterium (
14,
15), while Lim et al. provided evidence supporting that
R. mucilaginosa is a frequently encountered and metabolically active inhabitant of CF airways (
16). Additionally, a study from 2018 shows that the opportunistic pathogen
Pseudomonas aeruginosa, which frequently causes infections in CF patients, builds essential primary metabolites, like glutamate, by utilizing compounds produced by
R. mucilaginosa (
17). This symbiotic interaction implies that
P. aeruginosa benefits from its neighboring microbes, which contributes to its pathogenesis in the CF lungs. On the other hand, Rigauts et al. revealed the anti-inflammatory activity of
R. mucilaginosa in the lower respiratory tract, which could impact the seriousness of chronic lung diseases (
18).
In systems biology, genome-scale metabolic models (GEMs) represent comprehensive reconstructions of organisms’ metabolic networks. They are built using genomic sequences and comprise all known biochemical reactions and associated genes. These models provide systems-level insights into cellular metabolism, allowing researchers to simulate and analyze the flow of metabolites through these networks (
19). The interactions among reactions and metabolites in a metabolic model are mathematically represented with a stoichiometric matrix (
20). In the past years, an array of
in silico methods has been developed to analyze GEMs and derive valuable hypotheses. Flux balance analysis (FBA) is such a powerful computational technique that operates on the principle of achieving a steady state by optimizing the flux (rate) of metabolites through reactions while accounting for various constraints such as stoichiometry, thermodynamics, and uptake/secretion boundaries (
21). Applying flux balance analysis on a GEM provides insights into the intricate biological system interactions. This analytical approach facilitates the prediction of cellular phenotypes and identification of promising drug targets and contributes to optimizing biotechnological processes (
22). Moreover, such models can guide genetic engineering by suggesting genetic modifications that could enhance desired product formation or cellular behavior. Further applications include ameliorating culture media by incorporating components that increase bacterial growth rates. So far, GEMs have been an invaluable resource in the systems biology field that helped untangle the metabolism of various organisms and especially of high-threat pathogens (
23,
24). As described above,
R. mucilaginosa has gained great interest in the context of polymicrobial CF environments. However, its metabolic capabilities and genotype-phenotype relationships in isolated monoculture settings remain largely unexplored.
Here, we present the first manually curated and high-quality GEM of
R. mucilaginosa,
iRM23NL, striving to understand its metabolism and unique phenotypes under diverse conditions. Our simulation-ready network accounts for thousands of reactions and is available in a standardized format following the community guidelines (
25). Through growth kinetic experiments and high-throughput phenotypic microarray assays, we validated
iRM23NL’s accuracy in predicting growth and substrate utilization patterns. We refined the reconstruction by comparing the
in vitro results to
in silico simulations, resulting in novel metabolic reactions and genes. To our knowledge, this is the first study presenting high-throughput nutrient utilization and comprehensive growth data for
R. mucilaginosa. Finally, we employed FBA to formulate novel gene essentiality hypotheses that could expedite the development of antimicrobial strategies.
Figure 1 summarizes the experimental and computational work presented here.
DISCUSSION
The metabolic phenome of
R. mucilaginosa, a bacterium with both beneficial and pathogenic behavior, remains still largely unexplored. Investigating its metabolic traits is of great importance as it holds the potential to unveil unique capabilities, including substrate utilization, byproduct production, and contributions to host-microbe interactions.
R. mucilaginosa is a versatile microbe found in humans’ oral, respiratory, and skin flora, where it coexists harmoniously. However, in immunocompromised individuals,
R. mucilaginosa can act as an opportunistic pathogen, causing severe infections. Our study focuses on the metabolic aspects of
R. mucilaginosa, particularly its behavior in isolated cultures. In 2019, a 17-species bacterial community model was reconstructed to simulate the polymicrobial community of the CF airways (
43). This model accurately predicted the abundance of specific bacteria within patients’ CF lung communities by linking metabolomics and 16S rRNA gene sequencing data. However, studying a bacterium’s metabolism and genotype-phenotype relationships in monoculture provides a more controlled knowledge base. This allows for the precise manipulation of variables, enhancing our understanding of its individual traits, genetic makeup, metabolic pathways, and responses to stimuli (
22,
23). Moreover, one can elucidate the bacterium’s unique contributions to nutrient uptake, substrate production, and growth dynamics, crucial for understanding its role in a broader ecosystem. Monoculture studies identify key genes and pathways, revealing how the bacterium functions autonomously. Such analysis serves as a valuable reference, differentiating inherent characteristics from those influenced by external interactions. To this end, we empirically analyzed the metabolic phenome of
R. mucilaginosa DSM20746 and developed the first high-quality strain-specific GEM of
R. mucilaginosa, called
iRM23NL. We considered literature and database organism-specific evidence to manually gap-fill the model and include highly relevant biochemical reactions. Phylogenetic analysis of further
Rothia species provided insights into the relationship and genetic diversity between these species and was utilized to extend the metabolic network’s completeness. Our model is simulation-ready, follows strictly community standards (
25), and exhibits a high content quality
memote score.
R. mucilaginosa is primarily aerobic and can perform oxic respiration by efficiently generating energy in the form of adenosine triphosphate (ATP) (
1). However, when oxygen is limited or absent,
R. mucilaginosa switches to anaerobic metabolism, which may involve fermentation or other alternative pathways to generate energy. As already mentioned,
R. mucilaginosa has been previously found to be metabolically active in CF lungs where the oxygen levels are notably restricted (
16). This indicates that the bacterium undergoes metabolic shift and can survive in microaerophilic environments. Various ROS products emerge as byproducts in the bacterial response to the fluctuating oxygen levels (
34). In more detail, the cascade of ROS is initiated by the formation of O
2− upon univalent oxygen reduction within the electron transport chain (ETC). Extreme oxygen fluctuations may be lethal and can ultimately damage cellular structure. The detoxifying pathway includes the enzymes superoxide dismutase (SOD), catalase, and peroxidase that break down lethal radicals to water and oxygen enabling the cell to neutralize the oxidative stress (
44) (see
Fig. 4). However, the exact anaerobic respiration mechanism of
R. mucilaginosa must be thoroughly examined in experimental settings.
Since
R. mucilaginosa’s metabolic behavior and adaptability are mainly yet unknown, we started by testing its growth behavior in various nutrient media. Exploring how bacteria react to various growth conditions within the human body is pivotal for understanding diseases and developing effective treatments. Moreover, they are essential for evaluating their evolution and adaptation to different environmental conditions, leading to new ecological niches in which the bacterium could be metabolically active. We ultimately validated
iRM23NL using our growth kinetics data in various growth media. Overall,
iRM23NL’s predictions were in line with the experimental observations.
R. mucilaginosa demonstrated higher experimental growth in nutrient-rich media. The model successfully simulated growth for most media, while no biomass production was achieved in the M9 pure medium. When comparing LB to RPMI, the simulated growth rate was higher in LB, while the empirical growth in RPMI was twice as high as that in LB. This can be attributed to the fact that computer models cannot mimic the entire experimental settings and lack kinetic parameters. As of September 2023, bacteria like
S. aureus,
B. subtilis, and
E. coli have been extensively researched for decades, with hundreds of thousands of PubMed (
45) entries since the early 1990s. In contrast,
R. mucilaginosa’s scientific prominence only began in the 21st century, with only 423 publications to date, indicating significant knowledge gaps crucial for metabolic reconstructions. More specialized BOF would enhance the predictive power and would reflect a more organism-specific metabolism. Therefore, this scarcity underscores the urgent need for further research efforts to uncover the hidden facets of
R. mucilaginosa’s metabolism and its significance. Notably, to simulate
in silico growth in RPMI and SCFM media, six metal ions needed to be supplemented. These metals have also been confirmed as essential for the
in silico growth of
S. aureus in RPMI (
41). According to the model’s predictions RPMI, supplementation with manganese, zinc, and molybdate was required. Transition metals could be highly toxic; however, in controlled levels are important in the survival of all living organisms (
46). For instance, they are involved in redox catalysis, needed for energy production through respiration, and in non-redox catalysis, necessary for many biosynthetic and metabolic processes. Additionally, transition metals are required for the activity of many enzymes, including those involved in genomic replication and repair and nitrogen fixation. However, since these compounds were absent from the providers’ medium formulation for RPMI, we speculate that the provided medium definition may not be exact. In all cases, the suggested metal co-factor promiscuity in
R. mucilaginosa by
iRM23NL needs to be examined to shed light on whether the bacterium could survive in the absence of one of the suggested metals.
Moreover, we experimentally characterized the strain’s ability to assimilate and utilize substrates using high-throughput phenotypic microarray assays. The utilization of various nitrogen sources did not result in active respiration, indicating that the bacterial genome lacks genes encoding for respective transporters. We used the phenotypic results to validate and extend our metabolic reconstruction,
iRM23NL. Inconsistencies between the model and the phenotypic microarray results served as a basis for further model refinement. We enriched the model with missing transport reactions and their respective GPRs by referring to the organism- and strain-specific BioCyc (
47) registry and the General Feature Format (GFF) annotation file. All in all, characterizing and determining the repertoire of nutrient sources a strain can use or assimilate is a critical factor of pathogenesis. It provides valuable insights into how pathogens adapt to host environments and evade host defenses. Our transporter-augmented model reflects a high accuracy degree with the experimental data regarding using carbon, nitrogen, phosphorus, and sulfur sources. Discrepancies between computational and empirical results highlight areas of current uncertainty knowledge regarding the metabolism of
R. mucilaginosa. They could be attributed to non-metabolic factors that fall beyond the metabolic models’ scope, including regulatory processes, gene expression, and signaling pathways. However, targeted experiments are needed to fill the remaining network gaps and reveal novel enzymatic processes.
Considering the predictive precision of our metabolic reconstruction, we utilized
iRM23NL to derive novel hypotheses. We examined the effects of condition-specific single gene knockouts on the bacterial capacity to produce biomass. Gene essentiality analysis is inherently contingent upon specific conditions. In the context of constraint-based metabolic modeling, a plethora of constraints are established, with the availability of nutrients, often the growth medium, being the most prevalent. By altering the availability of these nutrients, the environmental conditions are modified, consequently exerting a profound influence on the metabolic state and growth of an organism (
48,
49). However, the true strength and versatility of GEMs lie in their ability to rapidly generate condition-specific hypotheses on a large scale, circumventing the need for labor-intensive and expensive screenings that may not always yield direct success. Various models, spanning organisms like
A. baumannii,
E. coli,
S. cerevisiae,
P. falciparum, and
P. aeruginosa, demonstrated predictive accuracies ranging from 72% to 93% (
50–60). Additionally, gene essentiality analysis has been instrumental in identifying potential drug targets for diseases such as cancer and viral infections, aligning well with both
in vitro (
61–63) and
in vivo (
64) data. Therefore, we utlized our GEM and created a high-throughput
in silico-derived transposon mutant library considering two standard growth media, LB and M9, along with two growth media formulated to mimic the environment within the human body, SNM and SCFM. In this regard, we identified putative essential and partially essential genes and assessed their potential vulnerability under varying nutritional environments. With this, we opted for detecting candidate genes that could be considered in future antimicrobial and anti-inflammatory strategies in immunocompromised and CF patients. With this, we opted for identifying candidate genes for future research that hold promise for experimental validation. Determining which essential genes have human counterparts is of great importance for antibiotic drug development, as it helps assess potential side effects and cross-species effects on human genes targeted by antibiotics. Moreover, it provides insights into the molecular mechanisms of host-pathogen interactions, explaining how pathogens manipulate host cells and evade the immune system. Utilizing our GEM offers promising venues for future targeted engineering strategies without the need for laborious large-scale screening of knockouts and mutant libraries. This methodology would facilitate the rapid design of metabolic gene knockout strains by eliminating the associated reaction(s) from the model. Finally, CF lungs represent a highly dynamic environment (
65,
66). However, GEMs are adaptable and can be tailored to reflect the metabolic capabilities of bacteria across diverse environmental conditions.
The main objective in our endeavor to combat
R. mucilaginosa as an opportunistic pathogen causing infections (
7) is identifying essential genes, particularly those without human counterparts. Determining these essential genes is crucial as we aim to neutralize the pathogen without harming the host. Simultaneously, we are exploring
R. mucilaginosa as an agent with anti-inflammatory properties (
18). In this context, we opt for promoting
Rothia’s growth, focusing on modulating the environmental conditions that have been reported to do so. Once the key pathways involved in the beneficial functions of
R. mucilaginosa are known, our gene essentiality predictions can be exploited to boost activation of these pathways. Nonetheless, being aware of
R. mucilaginosa-specific essential genes is crucial to avoid inadvertently targeting them during therapeutic treatments, ensuring both the bacterium’s growth and its anti-inflammatory activities. With this dual perspective, we indicate
R. mucilaginosa’s therapeutic variety, including developing strategies to combat the bacterium, when it is detrimental while increasing cell biomass production when its anti-inflammatory properties are beneficial. The latter could benefit human health in the context of cystic fibrosis. However, these model-driven hypotheses need to be extensively validated via
in vitro and
in vivo experiments.
Altogether, creating a genome-scale metabolic network for R. mucilaginosa reveals insights that would have been resource-intensive to acquire using traditional wet-lab means. Understanding the metabolic complexities of R. mucilaginosa is essential for expanding our basic understanding of bacterium’s microbiology and would benefit various practical applications. In medicine, it could facilitate the development of strategies to deal with caused infections, while in biotechnology, it would allow us to use its metabolic abilities for bioprocessing and bioengineering purposes. Hence, our high-quality metabolic network, iRM23NL, could provide a systematic and detailed framework for analyzing R. mucilaginosa’s metabolism, yielding valuable insights with broad-reaching impacts.