A
Burkholderiaceae-related endofungal bacterium dwelling in
Mortierella parvispora E1425 was detected by sequencing the DNA fragment amplified by a 16S rRNA gene-targeted PCR with a template DNA extracted from the mycelia of E1425, and its intracellular feature was subsequently confirmed by fluorescence microscopic observation with a staining technique (
4). Before we attempted to cultivate the BRE, strain E1425 was sustained by subculture of the mycelia on a half-strength cornmeal-malt-yeast (CMMY) agar plate once for a month. A diagnostic PCR was carried out to check for the presence of BRE at 1 week after the subculture was made. In addition, before ultramicroscopic observation of strain E1425, the bacterium-like cells inside the hyphae were also detected by fluorescence
in situ hybridization with the universal bacterial 16S rRNA gene probe EUB338 (data not shown), as previously reported (
4). These continuous positive detections suggested a stable association of the endobacterium and
M. parvispora E1425.
Taxonomic assessment of strain B2-EB based on whole-genome sequencing.
Genomic DNA of strain B2-EB was separately sequenced on Illumina HiSeq2500 and PacBio RS II platforms, and 1.48 Gb of paired-end reads and 0.93 Gb of long reads were obtained, respectively. Hybrid assembly and read correction using SPAdes (version 3.10.1) and Pilon (version 1.22) resulted in a 1,876,900-bp circular chromosome with 1,117-fold genome coverage and 48.9 mol% GC content, which contained 2 rRNA operons (
rrn), 42 tRNA loci for 20 amino acids, and 1,627 protein-coding sequences (CDSs) with an average size of 1,027 bp (
Table 1). Of these CDSs, 1,319 were identified as single-copy CDSs accounting for 81% of total CDSs. In addition, 1,157 and 1,069 CDSs were assigned to COG and KEGG orthologs, respectively (
Table 1); 215 CDSs matched hypothetical proteins of unknown function, and 59 of them were less than 100 amino acid residues in size (data not shown). Compared with other BRE genomes, the B2-EB genome featured the lowest GC content, the highest proportion of single-copy genes, and the smallest genome size among the known cultured BRE.
A reconstructed maximum-likelihood (ML) tree based on concatenated amino acid sequences of 52 single-copy housekeeping genes conserved in the
Burkholderiaceae genomes showed that strain B2-EB was phylogenetically closest to the cluster of the species
M. cysteinexigens, forming an independent clade neighboring the “
Ca. Glomeribacter gigasporarum” clade (
Fig. 3). This result was consistent with previous analyses based on the 16S rRNA gene sequence (
4,
22). In addition, the whole-genome average nucleotide identity (ANI) between strains B2-EB and B1-EB
T (88.6%) was lower than the ANI cutoff value of same species (94 to 95%) (
33), suggesting that strain B2-EB can represent an undescribed species in the genus
Mycoavidus. Moreover, our phylogenetic analysis using 49 representative complete genomes in the family
Burkholderiaceae showed that
Mycetohabitans endosymbionts hosted by
Rhizopus microsporus were closer to bacteria with large and multipartite genomes in the genera
Caballeronia,
Paraburkholderia, and
Burkholderia than to
Mycoavidus and “
Ca. Glomeribacter gigasporarum” endosymbionts in the
Mortierella spp. and AMF, respectively (
Fig. 3). These results supported that
Mycetohabitans endosymbionts have an independent origin, sharing their most recent common ancestor with the free-living bacteria in these genera, whereas
Mycoavidus and “
Ca. Glomeribacter gigasporarum” endosymbionts seemed to originate from another common ancestor (
16). From the cultivability, translucent colony color, and ANI analysis results, strain B2-EB could represent an undescribed species in the genus
Mycoavidus.
Genomic comparison of Mycoavidus sp. B2-EB with Mycoavidus cysteinexigens B1-EBT and AG77.
Although the genomic features of
Mycoavidus sp. strain B2-EB, including GC content, number of
rrn and number of tRNA loci, were close to those of
M. cysteinexigens B1-EB
T, the other features of genome size, mobile genetic elements, and proportion of single-copy genes were strikingly different between these two cultured endofungal bacteria (
Table 1). Compared to the two complete genomes of
M. cysteinexigens B1-EB
T and AG77, strain B2-EB possessed an approximately ∼1.0-Mb-smaller genome with a paucity of transposable elements and a large proportion of single-copy genes (
Fig. 4A; see Fig. S1 in the supplemental material). However, phylogenetic orthology analysis using OrthoFinder showed that the B2-EB genome shared 91.3% of the orthologous groups (OGs) with the two
M. cysteinexigens genomes, which included 1,395 core OGs plus 13 and 33 OGs shared with the B1-EB
T and AG77 genomes, respectively (
Fig. 4B). These results indicated that although the genome size of strain B2-EB was reduced, its genetic elements were highly homologous with those of
M. cysteinexigens.
Further assessment of genomic variation on a population scale showed that the ratio of nonsynonymous to synonymous sites (
dN/
dS) for individual genes between the B2-EB and
M. cysteinexigens genomes displayed a leptokurtic distribution clustered around the mean value regardless of the
dN/
dS estimation method (
Fig. 4C). This pattern was similar to those found in nonessential heritable endosymbionts such as
Wolbachia,
Hamiltonella/
Regiella, and “
Ca. Glomeribacter,” as well as those of some free-living bacterial lineages such as
Burkholderia,
Prochlorococcus,
Bradyrhizobium, and
Bifidobacterium (
10). Additionally, the mean
dN/
dS ratio between B2-EB and
M. cysteinexigens estimated by three methods was in the range 0.118 to 0.152, suggesting strong purifying selection and large population size like those detected in some microbial endosymbionts (
10,
34). This tendency was even stronger than in the case of the “
Ca. Glomeribacter gigasporarum” endosymbionts (
dN/
dS, 0.149 to 0.198) estimated in the previous study (
10). Therefore,
Mycoavidus sp. B2-EB represents a model of nondegenerative genome evolution similar to that for “
Ca. Glomeribacter gigasporarum” endosymbionts. This also created a debate on whether the B2-EB genome is reduced or the B1-EB
T and AG77 genomes are extended (
35).
A syntenic comparison with
M. cysteinexigens B1-EB
T showed that 936 CDSs related mainly to a region for prophages and transposons and accounting for 40.4% of total CDSs in the B1-EB genome were missing in the B2-EB genome (
Fig. 5A). Even so, some gene loci within the translatable regions of the B1-EB
T and AG77 genomes were still found in the B2-EB genome (
Fig. 5A and
B).These residual genes indicate that genome reduction was likely to have resulted in the relatively small genome of strain B2-EB. Therefore, we proposed that the genome reduction occurring in strain B2-EB may be mediated by the repeated insertion-deletion of prophages and transposons, as previously reported (
31,
32,
36). The missing CDSs affiliated with the orthologous genes conserved in the genomes of
Burkholderiaceae-related bacteria seemed to be reduced from the B2-EB genome over evolution and were involved mainly in the secretion system, amino acid and glycolytic metabolism, and transcription (see Fig. S2 in the supplemental material). Notably, the other specific CDSs in the B1-EB
T genome might be acquired through horizonal gene transfers, such as genes encoding insecticidal toxins close to orthologs possessed by the genus
Xenorhabdus (
Gammaproteobacteria) living symbiotically with soil entomopathogenic nematodes (
37). However, the process of this gene gain event is still unclear.
On the other hand, 1,490 out of 1,627 CDSs in total in the B2-EB genome were well conserved in the B1-EB genome (
Fig. 5A), and 60.5% of these commonly conserved CDSs shared more than 90% amino acid identity (AAI) between these two strains (see Fig. S3 in the supplemental material). Orthologous genes sharing low AAI values of <80% accounted for 21.3% of the total common orthologous genes, which encoded mainly proteins involved in RNA processing and modification, transcription, and energy production and conversion. In the case of the obligate endosymbiont
Wolbachia associated with arthropod hosts, the specific genes for involvement in host interaction underwent elevated substitution rates that may be due in part to adaptations by
Wolbachia to a new host environment (
38). It is tempting to speculate that the genes with elevated mutation rates may represent specific gene elements in the two
Mycoavidus genomes for adapting to the different intracellular environments of the two
Mortierella species. Furthermore, a high degree of genomic rearrangement was identified in the 0.80- to 1.13-Mb downstream region of
dnaA, encoding the replication initiator, showing “symmetric inversions” with endpoints that were equally distant from the origin of chromosomal replication (
Fig. 5A and
B). Genomic rearrangement likely impacts gene expression and results in the loss of gene function when a breakpoint occurs inside a reading frame (
39,
40), which not only affects the organismal phenotype but also contributes to phylotype subdivision (
41). It has been reported that extensive genome rearrangements probably triggered or enhanced the
Wolbachia speciation event, because it could suppress homologous recombination in chromosomal regions that are not colinear (
42). Here, we showed that genomic arrangements occurred between the genomes of
Mycoavidus cysteinexigens and
Mycoavidus sp. B2-EB (the two closest species to one another), suggesting that they may play an important role in the
Mycoavidus speciation process.
Additionally, in spite of the high homology with the previously determined
M. cysteinexigens genomes, the B2-EB genome also possessed 137 unique CDSs (
Fig. 4B), including 89 and 48 CDSs encoding hypothetical and functional proteins, respectively. Most of the CDSs encoding functional proteins were involved in interactions with the host, mainly including 12 CDSs encoding lipopolysaccharide biosynthesis and secretion enzymes, 4 CDSs encoding secondary metabolite biosynthesis enzymes, and 4 CDSs encoding toxins or antitoxins (see Table S1 in the supplemental material). Such genes have been reported for the
Burkholderiaceae-related endofungal bacteria in previous studies (
22,
30,
43). The other functional CDSs specific to strain B2-EB were related to DNA replication and repair, nutrient metabolism, cell elongation and division, ribosome biogenesis, membrane transport, and signal transduction (Table S1). Notably, two of the unique CDSs (locus tags MPB2EB_0909 and MPB2EB_1044) were likely to be associated with bacterial survival inside the fungal cell, annotated as putative genes encoding
β-lactamase and E3 ubiquitin-protein ligase, respectively.
β-Lactamase can provide resistance to
β-lactam antibiotics produced by fungi, such as penicillins (
44), whereas the E3 ligases of some pathogenic bacteria mimic the activity of eukaryotic E3 ubiquitin ligases to benefit themselves (
45). However, the roles of the two genes in the endofungal lifestyle should be examined using molecular biological approaches in the future.
The genomic features required by cultivable BRE.
The smallest genomes possessed by
Buchnera aphidicola, an obligate intracellular endosymbiont in aphids, are shaped by the evolutionary force of genetic drift, in which slightly deleterious mutations fix rapidly, causing the loss of gene functions, even including DNA repair, which further accelerates the accumulation of mutations (
8). Notably, the proportion of transposable element accounted for only 2.4% of the total genome length in the B2-EB genome, versus 7.2 to 11.5% in known
Mycoavidus genomes. In contrast, the genome reduction of heritable endofungal bacteria is caused by limited recombination in a largely clonal population, which is well supported by genomic information on “
Candidatus Glomeribacter” (
10). In this study, we estimated the genome completeness across the family
Burkholderiaceae using the taxonomic-specific workflow of CheckM (
46). Surprisingly, the results indicated that
Mycoavidus endofungal bacteria dwelling in
Mortierella possessed 88.6% of the checked single-copy genes (
Fig. 6A), which was obviously more than those conserved in “
Ca. Glomeribacter gigasporarum” genomes (21.8 to 70.2%). Moreover, the full gene set of DNA repair mechanisms, including base excision repair, nucleotide excision repair, mismatch repair, and homologous recombination, were well conserved in the genomes of
Mycoavidus strains B1-EB
T and B2-EB and may be responsible for purging slightly deleterious mutations from the genomes. These findings suggested that nondegenerative genome reduction occurred during the evolution of
Mycoavidus-related endofungal bacteria adapting to the intracellular environments of fungal hosts.
The second surprising finding was that strain B2-EB represented the most streamlined genomic architecture in the
Burkholderiaceae bacteria with sequenced genomes, even beyond the streamlined genomes of
Polynucleobacter species (
47–49). The B2-EB genome was 1,877 kb with 1,627 CDSs and featured the highest proportion of single-copy genes in the genome (
Fig. 6B). Currently, the free-living bacterium with the smallest genome (1,304 kb) is an uncultured but abundant ocean methylotroph of the
Betaproteobacteria, HTCC2181, with 1,377 CDSs (
50), followed by the hyperthermophilic bacterium
Aquifex aeolicus with a slightly larger genome (1,591 kb and 1,613 genes) (
51). The smallest photosynthetic cell, represented by the oceanic cyanobacterium
Prochlorococcus marinus, has a 1,660-kb genome containing 1,765 genes (
52). In the family
Burkholderiaceae, the smallest genome is represented by
Polynucleobacter necessarius endosymbionts isolated from ciliates living in aquatic ecosystems (
53), with 1,560 kb and 1,279 CDSs (
47). However, the single-copy genes in this genome accounted for 64.3% of the total genes (
Fig. 6B), which was lower than the proportion in the B2-EB genome. Strain B2-EB has a small genome comparable to the minimal genomes of the above-mentioned free-living bacteria and possesses a radically streamlined structure. A reduction in genome size has previously been identified in the
M. cysteinexigens genomes (
22,
30), and our analysis here showed that the proportions of single-copy genes were 61.9% and 68.5% in the B1-EB
T and AG77 genomes, respectively (
Fig. 6B). Further comparisons with their free-living relatives (proportion of single-copy genes, 48.4% to 52.8%) (
Fig. 6B) suggested that all three of the
Mycoavidus genomes were streamlined, but the most streamlined was the B2-EB genome. Therefore, the genetic elements conserved in the B2-EB genome may represent the minimal genomic features required by a cultivable endofungal bacterium.
Compared with those of close relatives in another fungal hosts and free-living relatives in various habitats, the genomic features of
Mycoavidus were characterized by the harboring of more abundant genes responsible for functions including intracellular trafficking, secretion, vesicular transport, RNA processing and modification, and recombination (
Fig. 7). Furthermore, the streamlined genome of strain B2-EB harbored almost complete sets responsible for encoding the 30S and 50S ribosomal proteins, tRNA aminoacylation, rRNA and tRNA modifications, DNA replication and repair, transcription, translation, and protein folding (see Tables S2 to S9 in the supplemental material). Although some genes involved in amino acid metabolism (e.g.,
asnA,
asnB,
gltB, and
hutH) and secretion systems (e.g.,
yscF,
virB5,
gspC,
gspI,
gspL,
gspM,
vgrG,
hcp,
impL,
impK, and
vasG) were absent in the B2-EB genome (see Fig. S4 in the supplemental material), it was similar to the genomes of other BRE that possessed many of these genes (
6,
22,
29,
30), suggesting that they have an important role in the endofungal lifestyle. In addition, a large number of genes involved in fatty acid metabolisms were present in the B2-EB genome, indicating that fatty acids are likely to be a major carbon source for
Mycoavidus endofungal bacteria (Fig. S4). Together with other essential genes for life, such as those involved in tricarboxylic acid (TCA) cycling, energy generation, and various transport systems, the genetic elements harbored by strain B2-EB showed the minimal genomic features required by both an endofungal lifestyle and artificial culture.
Conclusion.
BRE have been detected in several fungal lineages, including
Gigaspora,
Racocetra, and
Cetraspora (Glomeromycotina),
Rhizopus (Mucoromycotina), and
Mortierella (Mortierellomycotina) (
10,
54). The sequenced BRE genomes have revealed common features such as reduced genome size, loss of genes encoding primary metabolism, and specialization in fungal metabolite uptake. In particular, the BRE affiliated with the
Mycoavidus-Glomeribacter clade were elucidated as having amino acid auxotrophies that have limited their artificial cultivation in pure cultures. Our previous efforts in reverse genomics have contributed to the isolation of
Mycoavidus cysteinexigens from the host fungus
Mortierella elongata by supplying exogenous cysteine to substitute for the genetic defect in bacterial cysteine biosynthesis (
19,
26).
In this study, we successfully isolated an endobacterium phylogenetically close to
M. cysteinexigens, named
Mycoavidus sp. B2-EB, from the host fungus
Mortierella parvispora. Moreover, the hybrid assembly of deep-sequencing data sets reconstructed a complete genome sequence of strain B2-EB, revealing a reduced genome size (1.88 Mb), which was less than those of
M. cysteinexigens strains (2.64 to 2.79 Mb) and comparable to those of “
Ca. Glomeribacter gigasporarum” genomes (1.34 to 2.36 Mb). Further assessment of genomic variation on a population scale indicated that
Mycoavidus sp. B2-EB possessed a nondegenerative genome comparable to that found in “
Ca. Glomeribacter gigasporarum” endosymbionts. In addition, comparative genomic analysis with other BRE and free-living relatives affiliated with the family
Burkholderiaceae showed that the B2-EB genome had a relatively high genome completeness and high proportion of single-copy genes but only a few transposable genetic elements. These results suggested that the B2-EB genome reached a state of relative evolutionary stability. The genetic elements conserved in the stabilized genome appear to have resulted from the adaption of endobacteria to the host metabolism. In our previous study (
30) and this study, we found no genes involved in invasion or lysis of fungal cells in the
Mycoavidus genome according to comparative genomic analysis between
Mycoavidus and
Mycetohabitans, indicating that strain B2-EB has already lost a way to get in and out of fungal cells. To the best of our knowledge, the B2-EB genome represents the smallest genome size among cultivable endofungal bacteria. Our findings are expected to fill a knowledge gap in the understanding of genome evolution of
Mycoavidus-related endofungal bacteria, which results in a minimal genome of a cultivable endosymbiont.
The information gleaned from this genome revealed the minimal genome features required by both an endofungal lifestyle and artificial cultivation, which furthers our understanding of genome reduction in fungal endosymbionts and extends the culture resources for the biotechnological development of engineering synthetic microbiomes.