Phylogenetic inferences.
A phylogenetic analysis based on a partial sequence of the 16S rRNA gene showed that the isolate OM4 formed a highly supported monophyletic group with strains Mesorhizobium delmotii STM4623T, Mesorhizobium prunaredense STM4891T, Mesorhizobium wenxiniae WYCCWR 10195T, Mesorhizobium muleiense CCBAU 83963T, Mesorhizobium robiniae CCNWYC115T, Mesorhizobium temperatum SDW018T, and Mesorhizobium mediterraneum NBRC 102497T (see Fig. S1 in the supplemental material). Analyses of the housekeeping genes recA and atpD revealed a close relationship between OM4 and M. prunaredense STM4891T with high branch support (Fig. S2A and B). In addition, whole-genome sequence analysis demonstrated a distant relationship between these strains (see below).
The 16S rRNA gene sequence comparison of isolate TH2 with related
Rhizobiaceae members suggested a close relationship with
Rhizobium alvei strain TNR-22
T (GenBank accession no.
HE649224), sharing 98.08% nucleotide identity for an alignment length of 1,405 bp. This was below the stringent cutoff of 98.7% 16S rRNA sequence identity and was proposed to delineate new species (
35). These two strains shared only 86.14% and 87.65% nucleotide identity for their partial
atpD and
recA gene sequences, respectively, suggesting their distinctiveness. The latter comparison was limited to 496- and 567-bp sequence lengths because only partial
atpD (GenBank accession no.
KX938336) and
recA (GenBank accession no.
KX938338) sequences for
R. alvei TNR-22
T were available. The 16S rRNA and
recA-based phylogenetic analyses demonstrated that the isolate TH2 and
R. alvei formed a monophyletic group with high support values (Fig. S3; Fig. S4A). The
atpD-based analysis resulted in a tree with different topology where TH2 did not cluster with
R. alvei but with other representatives of
Rhizobium,
Agrobacterium, and
Ciceribacter (Fig. S4B). Whole-genome analysis, however, showed a distant relationship between these strains (see below).
Core genome phylogeny, overall genome relatedness indices, and plasmid comparison.
Core genome phylogeny was determined for isolates OM4 and TH2 and 99 additional Rhizobiales strains, including representatives of Rhizobiaceae and Phyllobacteriaceae. The core genome of strains included in this analysis contained 180 homologous gene clusters. A phylogenomic tree was inferred from 118 top markers that were selected using GET_PHYLOMARKERS software.
The maximum-likelihood (ML) core genome phylogeny indicated that the isolate OM4 grouped within the genus
Mesorhizobium (
Fig. 1). It clustered with strains
M. delmotii STM4623
T and
M. temperatum SDW018
T as its closest relatives. Isolate OM4 exhibited the highest genomic relatedness to these two strains, as they shared ~94.8% average nucleotide identity based on BLAST (ANIb; Table S3). This was below the proposed threshold for species delineation, which ranges between 95 and 96% for ANI (
36). To clarify the taxonomic assignment of the isolate OM4, we calculated additional overall genome relatedness indices (OGRIs), in particular, orthoANIu and digital DNA-DNA hybridization (dDDH). Obtained values were also below the thresholds for species definition (Table S3). This suggests that isolate OM4 represents a novel
Mesorhizobium species for which we proposed the name
Mesorhizobium onobrychidis sp. nov. (see Appendix). The novelty of
M. onobrychidis strain OM4
T was also confirmed by Type (Strain) Genome Server (TYGS) analysis, suggesting that this strain does not belong to any species found in the TYGS database (data not shown).
Phylogenetic analysis assigned isolate TH2 to
Rhizobiaceae (
Fig. 1). It clustered independently and was distantly related to other
Rhizobiaceae genera described so far. Different OGRIs were computed to further assess the relationship of isolate TH2 to representatives of
Rhizobiaceae. Because of the distinctiveness of this isolate, the comparisons at the nucleotide level were not satisfactory, and only a limited proportion of the whole-genome DNA sequence could be used for the calculations. For instance, for ANIb, only ~12 to 26% of the whole-genome sequences were aligned and used for comparisons (data not shown). Therefore, we performed whole-proteome comparisons (whole-proteome average amino acid identity [wpAAI]) that offer a higher resolution. Isolate TH2 exhibited wpAAI values ranging from 61.5 to 67.5% with the representatives of
Rhizobiaceae included in the analysis (Table S2). The wpAAI values were notably low and supported the divergence of the isolate TH2, which was evidenced by the separate clustering of the strain on a wpAAI dendrogram (Fig. S5). Isolate TH2 exhibited the highest genomic relatedness to strain
Ensifer meliloti 1021 (67.5% wpAAI), although they were phylogenetically distantly related (
Fig. 1; Table S3). This value was lower than wpAAI values computed between representatives of defined and phylogenetically well-separated genera
Agrobacterium and
Rhizobium that ranged from 68.12 to 70.55% wpAAI. The core-proteome average amino acid identity (cpAAI) between the isolate TH2 and reference
Rhizobiaceae spp. was <76% (Table S4), which was below the threshold of ~86% for delimitation of
Rhizobiaceae genera proposed recently (
37). This suggested that isolate TH2 represents a new genus and species, described here as
Onobrychidicola muellerharveyae gen. nov. sp. nov. The separate taxonomic position of strain
O. muellerharveyae TH2
T was also confirmed by results of TYGS analysis (data not shown).
Plasmids of
M. onobrychidis OM4
T and
O. muellerharveyae TH2
T did not show high similarity to known plasmids based on mash analysis and pan-genome analysis and revealed a high proportion of unique genes (
Fig. 2, Fig. S6, and Text S7).
Genome sequencing and assembly.
Genomes of strains
M. onobrychidis OM4
T and
O. muellerharveyae TH2
T were sequenced and circularized employing PacBio and Illumina platforms upon completion. Basic genome assembly statistics of both strains are summarized in
Table 1.
For strains OM4
T and TH2
T, 80,208 and 80,011 postfiltered long reads with mean polymerase read lengths of 14,639 and 14,004 bp have been generated, respectively. Long-read assembly with a target genome size of 15 Mb resulted in 6 (OM4
T) and 2 (TH2
T) contigs, respectively, with 4 (OM4
T) and 2 (TH2
T) of them being circular and, as such, handled as final replicons. For short-read error correction, a total of 2 × 1,572,218 (OM4
T) and 2 × 2,942,298 (TH2
T) reads of 151 bp have been trimmed and mapped to the complete genome for error correction. No contamination has been detected based on the obtained replicons. Genome completeness has been evaluated using BUSCO (
https://usegalaxy.eu/) by manual selection of the
Rhizobiales lineage as reference (639 BUSCOs). Here, in both cases, 638 complete BUSCOs were detected, reaching completeness values of 99.9%.
The complete genome size of strain
M. onobrychidis OM4
T was 7.55 Mb, comprising the circular chromosome of 7.32 Mb and one circular plasmid of 227 kb, and is stored in the NCBI GenBank under accession numbers CP062229 and CP062230 (
Fig. 2). The G+C content of the total genome is 61.9%. Genome size and G+C content of strain
M. onobrychidis OM4
T are similar to other
Mesorhizobium spp. (Table S5), for instance, the type strain of this genus, strain
Mesorhizobium loti DSM 2626
T (GenBank accession no.
QGGH01000000).
The genome of strain
O. muellerharveyae TH2
T, composed of the circular chromosome (5.88 Mb) and three circular plasmids (98 kb, 223 kb, and 238 kb), was deposited under the accession numbers CP062231 to CP062234 at NCBI GenBank (
Fig. 2). The genome size and G+C content of the total genome were 6.44 Mb and 60.6%, respectively, which was similar to other representatives of
Rhizobiaceae (Table S5).
Two chromosomal rRNA (5S, 23S, and 16S) operons were identified in both strains OM4
T and TH2
T. In
O. muellerharveyae TH2
T, they were identical, while rRNA operons of strain
M. onobrychidis OM4
T differed in five single nucleotide polymorphisms (SNPs) located in the intergenic region. For
M. onobrychidis OM4
T, two phage-like particles (PLPs) were identified, while in
O. muellerharveyae TH2
T, one phage and three PLPs were found (
Fig. 2, circle 4).
Mesorhizobium onobrychidis OM4
T harbored 136 transposases and 33 recombinases/invertases, e.g.,
xerC and
xerD, whereas
O. muellerharveyae TH2
T revealed respective counts of 28 and 13 only (
Fig. 2, circle 6). Approximately half of the genes of both genomes were lacking meaningful annotations (hypothetical genes) according to homology-based alignment by PROKKA and KOfamKOALA hidden Markov model (HMM) searches. According to the genomic island prediction tool IslandViewer 4,
M. onobrychidis OM4
T, but not
O. muellerharveyae TH2
T, contains a very large genomic island on its chromosome, harboring 414 genes (
Fig. 2, circle 3; Fig. S8A). This is the only larger fragment of the
M. onobrychidis OM4
T chromosome with a high density of unique genes (Table S1). The genomic island on the
M. onobrychidis OM4
T chromosome is located next to unique genes that are enriched in particular functions such as catalyzing DNA exchange. The plant growth-promoting trait (PGPT) density of the genomic island is 75%, considerably higher than the average PGPT density of the entire chromosome, which is only 55% (see also
Table 1). Details regarding the differences of PGPTs between
M. onobrychidis OM4
T,
O. muellerharveyae TH2
T, and other closely related strains are provided below.
Comparative genomics and functional annotation.
Pan-genome analysis of strains
M. onobrychidis OM4
T and
O. muellerharveyae TH2
T revealed a large number of gene clusters, ranging from 36,631 for all
Mesorhizobium strains to 85,606 for all
Rhizobiaceae strains here analyzed (
Fig. 2, circle 2; Fig. S9A and B). The strain
M. onobrychidis OM4
T revealed 2,683 core, 1,151 accessory, 2,444 cloud, and 1,068 unique genes. While 428 unique genes could not be assigned to any KO number (KEGG annotations), 441 KO numbers were detected for
M. onobrychidis OM4
T as unique genes, with various gene copy numbers. Functions of unique genes were associated with, among others, prokaryotic cellular community, signal transduction, carbohydrate and amino acid metabolism, cofactor and vitamin biosynthesis, energy metabolism, membrane transport, and lipid metabolism (Fig. S10). In contrast, the putative novel genus comprising single strain
O. muellerharveyae TH2
T revealed only 1,107 core genes, while counts of 1,839 for accessory, 1,105 for cloud, and 2,261 for unique genes were scored. The results did not allow further pan-genomic analysis for
O. muellerharveyae TH2
T due to a distant phylogenetic relation between
O. muellerharveyae TH2
T and the strains here analyzed.
Overall, KEGG functional analysis and respective abundance clustering of all KEGG annotations confirmed that M. onobrychidis OM4T contained functional similarities to M. delmotii STM4623T and M. temperatum SDW018T. The analysis also supported the novelty of this species when considering only strain-specific enriched K numbers (Fig. S9C). Analyzing all K numbers for O. muellerharveyae TH2T resulted in a distinct clustering, which became more distinct when considering only enriched ones (Fig. S9D). Both patterns highly supported its status as a new genus.
The KEGG functional annotation for M. onobrychidis OM4T and O. muellerharveyae TH2T revealed two distinct clusters of level 2 and level 3 KEGG functions (Fig. S11; Text S8). Onobrychidicola muellerharveyae TH2T showed higher counts for genes related to membrane transport, cell motility, cell growth and death, antimicrobial drug resistance, signal transduction, and replication, repair, transport, and catabolism (Fig. S11A).
The detection of specific secondary metabolite biosynthesis gene clusters (BGCs) further confirmed the different lifestyles of strains OM4T and TH2T (Fig. S12; Text S9). The whole-genome alignment of Mesorhizobium spp. revealed 85 regions unique to M. onobrychidis OM4T, harboring at least 5 and up to 77 genes as one of its novel species characteristics (Fig. S13, Table S1; Text S10). Twenty-one regions could be assigned to 7 of the entire 11 BGCs of OM4T. Among them, 2 BGCs matched with the genomic island, which covers 63 unaligned regions, including 364 genes, all assigned as unique genes (Fig. S14; Table S2; Text S10).
Functional PGPT annotation.
The main genetic features and functional PGPT annotations, based on KOfam-KEGG for PGPT mapping of all 80 strains, are summarized in a heatmap (
Fig. 3). Detailed values are given in Table S1. The pattern of depleted (blue) and enriched (red) traits coincided with the phylogenetic clades apart from very few exceptions in clade C. Heatmap fractions belonging to the
Mesorhizobium clade (clade A),
Ensifer clade (clade D), and
Rhizobium clade (clade F) were dominated by PGPT classes of enriched gene counts.
Focusing on the Mesorhizobium clade, a fraction of depleted traits refers to three subclasses of the PGPT class “colonization,” namely, chemotaxis, flagellar system, and flagellum assembly, important for bacteria to migrate toward chemical stimuli. In total, 4,046 genes of M. onobrychidis OM4T could be allocated to PGPTs, compared to an average of 3,735 genes among other Mesorhizobium strains. A similar PGPT count was also found for its closest relative, M. delmotii STM4623T. In general, the newly described species, M. onobrychidis, is very similar to the other species of the genus Mesorhizobium. Among Mesorhizobium, M. onobrychidis OM4T is one of the strains with the highest number of genes in the following PGP level 2 classes: biofertilization, phytohormone, plant signal production, stress resistance, competitive exclusion, and plant immune response stimulation. PGPT counts of M. onobrychidis OM4T for the mentioned phytohormone and plant signals and plant immune system stimulation traits were higher than its two closest relatives, M. delmotii STM4623T and/or M. temperatum SDW018T (Table S1). In contrast, M. onobrychidis strain OM4T showed only an average amount of bioremediation genes, distinguishing it merely from M. delmotii STM4623T and M. temperatum SDW018T. Mesorhizobium onobrychidis OM4T harbored genes related to fixing carbon dioxide via ribulose 1,5-bisphosphate carboxylase/oxidase (Rubisco) as another highly plant-beneficial feature (data not shown). It comprised a versatile set of stress resistance and colonization genes; their abundance mostly coincided with its both closest relatives. Furthermore, it contained the genetic ability for nodulation, vitamin B3 and pilus-fimbriae biosynthesis, the use of plant-derived metabolites, e.g., amino acids, and the degradation of jasmonate/salicylic acid. Traits related to competitive exclusion showed a higher PGPT count for bacterial fitness than all other Mesorhizobium strains, especially for oxidative phosphorylation and resistance against plant antimicrobial compounds hydroxycinnamic acid and quinoline. The most significant differences between M. onobrychidis OM4T and other Mesorhizobium strains occurred in the number of transposases and xerC/xerD recombinases, which are important PGPTs related to colonization and competitive exclusion. Mesorhizobium onobrychidis OM4T has approximately 2.5 times as many genes belonging to these categories as the other Mesorhizobium strains on average (transposases, 136 compared to 57; recombinases, 33 compared to 13). Regarding secretion systems, M. onobrychidis OM4T encoded one type VI secretion system (T6SS), two T3SSs, and one T4SS (trb) on its chromosome, as well as one copy of the virB-specific T4SS on its plasmid. The PGPT distribution alternated in a shared pattern or highly varied between M. onobrychidis OM4T and its relatives M. delmotii STM4623T and/or M. temperatum SDW018T.
Onobrychidicola muellerharveyae TH2
T strongly differed in its overall PGPT abundancy profile from any other phylogenomic clades (
Fig. 3,
B1 to
B3 and
C to
J). It contained a rather low number of genes for biofertilization and bioremediation, while the classes phytohormone and plant signaling, stress resistance, colonization, and competitive exclusion were slightly above average.
Onobrychidicola muellerharveyae TH2
T is one of the
Rhizobiaceae strains with the highest phospholipid- and gibberellin-encoding PGPT count. In terms of stress resistance,
O. muellerharveyae TH2
T exceeds all other strains in the copy number of the gene for tabtoxin degradation (
ttr), which is produced by some plant pathogens. While most
Rhizobiaceae have one tabtoxin degradation gene (
Fig. 3),
O. muellerharveyae TH2
T contained four copies of this gene. In terms of competitive exclusion,
O. muellerharveyae TH2
T was superior to all other investigated strains concerning the enrichment of genes for toxin-antitoxin systems (TASs). This is the case also in biofilm secretion and resistance to antimicrobial compounds. In terms of (host) colonization,
O. muellerharveyae TH2
T was remarkable in the subclass “host invasion factors” and subclasses that enable target-oriented movement (chemotaxis, flagellar system, and flagellum assembly). However, it lacked the nodulation gene cluster despite possessing single nodulation-associated genes like
nolA and
nodD. It showed an exceptional higher gene count for plant-branching inhibition and embryogenesis compounds spermidine and putrescine that act as plant signals. Regarding secretion systems,
O. muellerharveyae TH2
T only harbored one T4SS (
virB) on plasmid 2 and one T2SS on plasmid 3.
Phenotypic characterization and fatty acid analysis.
Phenotypic characteristics of strains O. muellerharveyae TH2T and M. onobrychidis OM4T are summarized in Table S6. Differential characteristics of O. muellerharveyae TH2T and the type species from the other genera of the family Rhizobiaceae are indicated in Table S5. Phenotypic tests performed with the API 20NE system and Biolog GEN III microplates were assessed as unreliable since negative reactions were observed for the majority of tests (data not shown). This was likely because of the growth conditions that were inadequate for these strains. Therefore, most of the tests included in the API 20NE system were repeated as conventional biochemical assays in test tubes in order to facilitate monitoring of bacterial growth and result assessment. Although more satisfactory results were obtained this way, no bacterial growth was observed for some tests, i.e., in media containing l-tryptophane as a substrate (indole production test). For strain M. onobrychidis OM4T, no bacterial growth was observed in media containing esculin-ferric citrate (esculin activity test) and gelatin (esculin hydrolysis test) as the substrates.
The results of the fatty acid analysis are summarized in Table S7. The major fatty acids (>5%) of
O. muellerharveyae TH2
T are C
18:1 ω7c, C
19:0 cyclo ω7c, and C
16:0 and C
17:0 cyclo ω7c. Generally, as in other
Rhizobiaceae members, the dominant fatty acid in
O. muellerharveyae TH2
T was C
18:1 ω7c, which is, in some strains, comprised in summed feature 8 (C
18:1 ω7c/C
18:1 ω6c). Unlike other type species from the other genera of
Rhizobiaceae,
O. muellerharveyae TH2
T contained a relatively high (>5%) amount of C
17:0 cyclo ω7c. For
M. onobrychidis OM4
T, the major fatty acids (>5%) were C
18:1 ω7c, C
16:0, C
19:0 cyclo ω7c, and 11 methyl C
18:1 ω7c and C
18:0, similar to other
Mesorhizobium spp. (
38).