INTRODUCTION
Many insects have established intimate relationships with bacterial symbionts that have helped them to exploit new niches. In many cases, intracellular symbionts provide nutrients to their insect host, while the host provides a stable environment favorable to symbiont growth (
1,
2). These symbiotic relationships are often obligate, as the host and its symbionts depend on products encoded in the genome of their partners. Moreover, obligate symbioses are known to involve host-symbiont complementarity, whereby each partner relies on metabolic products and enzymes of the other to complete some of their metabolic pathways.
One example is branched-chain amino acid (BCAA) biosynthesis in aphids and their endosymbiotic
Buchnera aphidicola (hereafter
Buchnera). Aphids depend on
Buchnera to supply many essential amino acids (EAAs) deficient in their phloem-sap diet (
3,
4). However, the genome of
Buchnera lacks the genes coding for branched-chain amino acid aminotransferase (
ilvE), the enzyme involved in the last step of BCAA biosynthesis (
5), which is present in the aphid genome (
6). The genomes of both symbiotic partners are therefore complementary, sometimes at the level of genes within pathways, and together encode all genes involved in EAA biosynthesis (
4,
7–9).
Such intimate metabolic interactions, which we term “collaborative pathways” (
10), are not unique to aphids and have also been found in several other host-symbiont systems (e.g.,
11,
12). Collaborative pathways typically evolve from hosts and symbionts endowed with complete functional pathways at the origin of symbiogenesis, but may depend on genes horizontally transferred from other bacteria to the host. In addition to the preexisting gene set of the symbiont pair, collaborative pathways require the evolution of unusual transport mechanisms for metabolites or enzymes between members of the pair. Collaborative pathways have been reported from obligate symbiotic systems that have a long history of coevolution (
10,
13).
To date, the existence of collaborative pathways between hosts and symbionts has only been investigated in sap-feeding insects (
13). Although a collaborative arginine biosynthetic pathway between carpenter ants and their obligate endosymbiont,
Blochmannia, has been proposed based on the gene content of the symbiont genome (
14), comparisons between the genomes of host insects with alternative diets and that of their symbionts have not yet been reported. A model for insect symbioses that do not involve sap-feeding taxa is that between Blattodea, which comprises cockroaches and termites (
15), and their intracellular symbiont
Blattabacterium cuenoti (hereafter
Blattabacterium). Blattodea have evolved a wide range of feeding habits and are sometimes known as “garbage collectors,” recycling dead plants, dead animals, and excrement (
16). Almost all cockroaches, as well as the most primitive termite
Mastotermes darwiniensis, host
Blattabacterium endosymbionts in specialized cells (bacteriocytes) located in their fat body.
Blattabacterium appears to have been lost in the lineage leading to all termites other than
M. darwiniensis, as it has not been found in these taxa. The association between Blattodea and
Blattabacterium is ancient and was established >200 million years ago (
17–19), which is comparable with the time frame of other symbiotic systems found in insects (e.g.160-280 MYA in aphids-
Buchnera, [
20]). Physiological experiments and genomic analyses have suggested that
Blattabacterium is involved in nitrogen recycling and essential amino acid biosynthesis (
21–23). Unlike sap feeders, cockroaches have adapted to various diets, and many species are known to engage in coprophagy (
16). Their diet is therefore more nutritionally balanced than plant sap. However, because their food sources are often transient (
16),
Blattabacterium is likely to contribute to host fitness through nitrogen recycling. Thus, metabolic collaborations between the two genomes are expected to be different from that of sap-feeding hosts and their symbionts.
The evolution of metabolic pathways in
Blattabacterium is largely unknown. In contrast to the endosymbionts belonging to the phylum
Pseudomonadata (previously
Proteobacteria), to which
Escherichia coli belongs, the enzyme repertoire of the order
Flavobacteriales (phylum
Bacteroidota), to which
Blattabacterium belongs, is not well-characterized. Unknown enzymes and pathways are still frequent in lineages that do not include model organisms, affecting prokaryote genome annotation (
24). Indeed, many bacteria have genomes annotated with incomplete amino acid biosynthetic pathways and are incorrectly assumed to be heterotrophic while they can be cultured without the given amino acids (
25). For this reason, it is unclear whether the apparent gaps present within the pathways of
Blattabacterium are genuine gaps, representing gene losses during the evolution of
Blattabacterium and complemented by their host Blattodea, or whether these genes are absent from the genomes of
Blattabacterium and their free-living relatives because of the existence of yet-uncharacterized genes replacing the functions. Genes absent in all
Blattabacterium genomes as well as the genomes of their free-living relatives are indicative of the presence of uncharacterized genes completing the pathways and are not expected to be compensated by the host genome.
In this study, we conducted a careful reconstruction of the metabolic pathways involved in amino acid and vitamin/cofactor biosynthesis of Blattabacterium and their host Blattodea. We investigated the genomic evidence of metabolic collaboration between host insects and Blattabacterium. To this end, we reconstructed the metabolic pathways of seven representative genomes of Blattabacterium and compared them to the metabolic pathways of other Flavobacteriales bacteria. This approach allowed us to distinguish genes that were lost in Blattabacterium and are complemented by their insect host from genes commonly absent in Flavobacteriales (whose inferred function is carried out by the products of yet-uncharacterized genes or pathways). We then inferred the metabolic pathways of Blattodea using the genome and transcriptome of 2 Blattodean species harboring Blattabacterium (Periplaneta americana and Cryptocercus punctulatus, respectively) and the genome of 1 Blattodean species devoid of Blattabacterium (Zootermopsis nevadensis). Using this approach, we obtained robust inference of the pathways common across Blattodea. We compared the inferred metabolic pathways of Blattodea and Blattabacterium and identified potential collaborative pathways. Finally, we compared the profile of collaborative pathways in Blattabacterium with those of other host-symbiont systems with variable diets to see how diet affects the profile of collaborative pathways.
DISCUSSION
In this study, we analyzed the amino acid and vitamin/cofactor metabolic pathways of
Blattabacterium endosymbionts from Blattodea and compared them to those of other
Flavobacteriales genomes. We found that pathways for EAA biosynthesis are enriched in
Blattabacterium genomes, while many non-EAA and vitamin/cofactor pathways have been lost. We further identified two types of gaps present in these pathways: (i) gaps shared by
Blattabacterium and other
Flavobacteriales strains, and (ii) gaps specific to
Blattabacterium. We then analyzed the metabolic pathways of 3 species of Blattodea and compared them to those of
Blattabacterium in order to examine potential metabolic collaborations in this symbiotic system. The product-level complementation (exchange of amino acids and vitamins between the host and its symbiont) shown in this study is consistent with results from previous studies, which performed genome-based flux balance analysis on
Blattabacterium (
70,
71). The pathways for several non-EAAs (Gln, Gly, Asn, and Pro) absent from
Blattabacterium genomes were present in the transcriptome of
B. germanica (
72). While we found several examples of product-level complementation, we only found 2 candidate examples of within-pathway mutual dependence (defined as “collaborative pathways” in this study), in the folate biosynthetic pathway for all strains (except for CPUbr and MADAR), and in the arginine biosynthetic pathway for CPUbr and MADAR. We also compared metabolic pathways across 11 insect-symbiont systems comprised of 55 genomes belonging to various taxonomic groups and having diverse diets (
Table 1, Table S3), and found that there is a clear pattern of putative collaboration related to the host diet. In addition to patterns related to diet, we also found that BCAA biosynthetic pathways have been retained by the endosymbionts of all sap-feeding insects examined here despite their significant levels of reductive genome evolution. In sharp contrast to this trend in sap-feeding insects, genome reduction in CPUbr and MADAR
Blattabacterium involves the loss of BCAA pathways.
We found several unique characteristics in
Blattabacterium metabolic pathways. Firstly, although urease is present in
Blattabacterium and in the pathogenic strains of the genera
Weeksella,
Ornithobacterium,
Riemerella, and
Chryseobacterium, it is generally absent from the genomes of other
Flavobacteriales genera. A previous study suggested that the presence of a urea transporter and urease accessory proteins is highly correlated with non-marine pathogenic
Flavobacteriales ecotypes (
73). Interestingly, our phylogenetic analysis of UreC protein suggests that
Blattabacterium has acquired urease via horizontal gene transfer while almost all ureases from other
Flavobacteriales strains are monophyletic (
Fig. 2). The universal presence of this gene among all known
Blattabacterium strains indicates it was present in their last common ancestor. The conversion of uric acid to usable nitrogen in the form of NH
3 occurs via collaboration between the host (which performs the first steps [
72]) and
Blattabacterium. This process is an essential trigger for the biosynthesis of many nutrients (
22,
23) and forms a key part of the symbiotic relationship between these partners. Other specific characteristics of the early stage of
Blattabacterium genome evolution include losses of several non-EAA pathways and the retention of EAA pathways, which suggest that
Blattabacterium has evolved with the strict requirement of supplying EAAs to their hosts.
Blattodea are known to contain a high level of cobalamin (B12) in their hindgut (
74,
75). Since both host and
Blattabacterium lack biosynthetic pathway of B12, the reported higher cobalamin concentration in the hindgut of Blattodea is likely due to the presence of B12-producing microbes in their hindgut, such as
Shimwellia blattae (
76). Unlike other animals, insects are not thought to require cobalamin as none of the cobalamin-dependent enzymes were detected from 19 publicly available genomes (
77). We searched for the 3 cobalamin-dependent eukaryotic enzymes and could not detect them in the Blattodea genomes used in this study. We expect that the high level of B12 in the hindgut of Blattodea is not required by the host or its
Blattabacterium.
Our study shows that the metabolic host-symbiont relationships in the Blattodea differ from those of other insects. With respect to collaborative pathways for amino acids biosynthesis, we propose 3 categories: (i) phloem-sap feeders, which commonly have collaborative BCAAs, phenylalanine, and methionine biosynthetic pathways; (ii) carpenter ants, which have a collaborative arginine biosynthetic pathway; (iii) and the 2 blattodeans
C. punctulatus and
M. darwiniensis and psyllids, which have another type of collaborative arginine biosynthesis pathway (
Fig. 5A). Blood-feeders do not have any biosynthetic pathways for EAAs. Interestingly, while phloem-sap feeders have evolved similar patterns of collaboration, xylem-sap feeders do not appear to possess any collaborative pathways. With respect to vitamin/cofactor pathways, we propose 3 categories: (i) aphids, which have a collaborative pantothenate biosynthetic pathway; (ii) carpenter ants and Blattodea, which have collaborative folate biosynthetic pathways; (iii) blood-feeders, which have collaborative pathways for both pantothenate and folate (
Fig. 5B).
A previous study suggested that host gene content can be a constraint for collaborative pathway evolution (
10). In the case of BCAA biosynthesis, the branched-chain-amino-acid transaminase (BCAT: EC 2.6.1.42) is ubiquitous in insects, indicating that collaborative BCAA biosynthesis has the potential to evolve. Similarly, the 2 collaborative pathways in vitamin/cofactor biosynthesis we describe here are supported by the universal presence of the beta-alanine biosynthetic pathway and the alkaline phosphatase gene in insect genomes. The type of cell that gave rise to bacteriocytes is also thought to potentially constrain the evolution of collaborative pathways because the cells require adequate levels of gene expression to complement gaps in the symbiont’s pathways (
10). The potential for forming collaborative pathways is thus expected to vary among different insect hosts, depending on gene expression levels in particular cells, even when the hosts possess the same set of genes. Previous studies have reported that within a host individual, different (primary and secondary) symbionts are harbored within morphologically distinct bacteriocytes (
57,
78,
79), and the bacteriocytes have distinct gene expression profiles (
80).
Although patterns of collaborative pathways across varying symbiotic systems are potentially explained by the “bacteriocyte cell type origin” hypothesis, our results suggest that collaborative pathways are driven by the host diet, and particularly the EAA and vitamin/cofactor profile required by the host. Previous studies have suggested that collaborative EAA biosynthesis is controlled by the host through control over amino donor supply (especially the supply of glutamate for BCAAs and phenylalanine) (
3). The production rate of BCAAs and phenylalanine in aphid bacteriocytes was estimated to exceed 60% of total amino acid production (
3). The glutamine synthetase/glutamine oxoglutarate aminotransferase (GS/GOGAT) cycle in the host cytoplasm is also thought to have a central role in efficient ammonia assimilation, which keeps toxic ammonia at low levels in bacteriocytes (
8). Generally, phloem-sap is rich in asparagine and sugars, and thus symbiotic systems can obtain sufficient ammonia and ATP to activate the GS/GOGAT cycle. In contrast, xylem-sap is generally poorer in both carbon and nitrogen sources than phloem-sap, which might explain why Auchenorrhyncha-
Sulcia systems lack collaborative pathways based on GS/GOGAT cycle. In this context, Blattodea do not need to use the GS/GOGAT cycle system because they use uric acid to store nitrogen wastes, and
Blattabacterium utilizes host-provided urea as a nitrogen source (
Fig. 5). Because Blattodea have various feeding habits, the EAA profile of their food source is less specific than plant sap. Therefore, control over the EAA production profile is likely to be less important in this system. The collaborative arginine biosynthetic pathway in the carpenter ant-
Blochmannia system is believed to be linked to the use of arginine as a nitrogen storage compound in ants (
14). This potentially explains the absence of the same type of collaborative arginine biosynthetic pathways found in carpenter ants and Blattodea. In termites, uric acid is transported into the gut via the malpighian tubules and degraded by gut microbiota (
81). This may explain the absence of uricolytic ability in the genome of the lower termite
Z. nevadensis, which has lost
Blattabacterium.
Our study has revealed several general trends of metabolic pathway evolution within and across insects and their symbionts. We found genomic evidence that symbiont partnerships vary in their amino acid and vitamin/cofactor biosynthetic collaborations. Experimental evidence, such as differential gene expression analysis or enzymatic assays, is required to identify the precise mechanisms by which these collaborations occur. For example, in the case of gaps unique to one or several strains of symbionts, complementation by unknown host enzymes or the emergence of new multifunctional enzymes should be considered as potential replacements (
69). Our analyses revealed several gaps specific to
Blattabacterium and several gaps common across
Flavobacteriales. However, since our approach focused on interpreting gaps rather than reconstructing whole system metabolic networks, it is unclear how the metabolic collaborations affect the total metabolic flux within the symbiotic system. Although multi-organism scale modeling methods are needed to answer such questions, these modeling methods require experimental evidence, such as metabolite analyses, chemically-defined diets, or differential gene expression data, to make accurate predictions. Future studies are needed to determine the precise mechanisms and adaptive significance of pathway complementation in insect-symbiont systems and to provide a clear picture of the evolution of their genomes.