Description of novel taxa.
Functional annotation required extraction of both coding DNA sequences (CDS) and clustered regularly interspaced short palindromic repeats (CRISPR) via Prokka (v1.14) (
81). Protein sequences were checked against the Comprehensive Antibiotic Resistance Database (CARD) (downloaded January 2020) (
82) and for carbohydrate activate enzymes (CAZymes) (downloaded July 2019) (
83) using DIAMOND (>40% identity, >40% query and subject coverage) (
84). The Prokka annotations were mapped to KEGG (
85) using PROKKA2KEGG (
https://github.com/SilentGene/Bio-py/tree/master/prokka2kegg).
Taxonomic assignment was based on both 16S rRNA gene and genome sequences. Each isolate’s 16S rRNA gene sequence was compared to the Living Tree Project (LTP) (v132) (
86) and pairwise sequence identities were calculated (
69). The 50 best matches with a valid name and for which a genome exists were compared to the isolate using both average nucleotide identity (ANI) via FastANI (v1.3) (
87) and the percentage of conserved proteins (POCP) (
88). Phylogenomic trees were created using the genomes of the input isolate and its closest relatives with PhyloPhlan3 (v3.0.53) (
89). For confirmation of species delineation, the genome of target isolates was entered into the type-strain genome server (TYGS) to determine digital DNA:DNA hybridization (dDDH) values (
90).
For taxa delineation, 16S rRNA gene sequence identities of ≤98.7% and ≤94.5% were indicative for novel species and genera, respectively (
75). ANI values of <95% between genomes were considered an indication for separate species. If ANI values were close to 95% or inconsistent with the 16S rRNA assignment, dDDH was consulted with a species delineation value of ≤70%. A difference in G+C content of genomic DNA ≥1% was considered a further indication of species-level differentiation (
91). POCP values of ≤50% were indicative for the creation of a new genus (
88). The classification of isolates was further confirmed using GTDB-Tk (v1.2.0) (
92) and by constructing 16S rRNA gene-based and phylogenomic trees (
Fig. S2). A database of publicly available MAG data sets (
20,
24,
93–102) was compared to isolates using MASH (v2.2) (
103). Only those with a distance score of <0.05 were reported. All taxonomic and functional data used to create the protologues mentioned below are available at
https://github.com/thh32/Protologger within the data sets section.
(i) Description of Gallibacter gen. nov. Gallibacter (Gal.li.bac’ter. L. masc. n.
gallus, chicken; N.L. masc. n.
bacter, rod; N.L. masc. n.
Gallibacter, a chicken rod). The genus falls into the family
Eubacteriaceae (phylum
Firmicutes) according to 16S rRNA gene-based phylogeny (
Fig. S2A). The closest relatives are
Eubacterium brachy (93.3% sequence identity),
Eubacterium infirmum, and
Anaerovorax odorimutans (both 90.7%). POCP analysis suggests the input genome to belong to the genus
Eubacterium with a matching value of 54.8% to
Eubacterium brachy. However, the type species of this genus
Eubacterium limosum, shared a POCP value of 22.5% with the isolate. GTDB-Tk assigned the isolate to the family “
Anaerovoraceae” (not valid), genus
Eubacterium_M, showing that the current taxonomic status of the genus
Eubacterium is incongruent. Therefore, a novel genus with the name
Gallibacter is proposed to accommodate the isolate. The type species is
Gallibacter intestinalis.
(ii) Description of Gallibacter intestinalis sp. nov. Gallibacter intestinalis (in.tes.ti.na’lis. N.L. masc. adj.
intestinalis, pertaining to the intestine). The species has all features of the genus. Cells are approximately 1 μm long and 0.3 μm wide. They grow under strictly anaerobic conditions in GAM modified medium (DSMZ medium 1715) at 37°C. All functional attributes of this species can be found at
https://github.com/thh32/Protologger within the data sets section. The major cellular fatty acids were C
15:0 ISO (14.9%) and C
15:0 ANTEISO (14.7%). Other fatty acids included C
18:1 cis9 (7.4%), C
16:0 ISO DMA (7.4%), C
16:0 DMA (6.9%), C
18:2 cis9,12 (4.4%), C
15:0 ISO DMA (3.9%), C
17:0 ANTEISO DMA (2.8%), C
16:0 (2.7%), C
14:0 (2.5%), and C
16:1 cis9 (2.2%). The type strain Cla-CZ-54
T (= DSM 108706
T) was isolated from colon content of a free-range layer chicken. Its G+C content of genomic DNA is 40.9%.
(iii) Description of Gallalistipes gen. nov. Gallalistipes (Gall.a.li.sti'pes. L. masc. n.
gallus, chicken; N.L. masc. n.
Alistipes, a bacterial genus; N.L. masc. n.
Gallalistipes, a relative of
Alistipes from chicken). The isolate is phylogenetically placed into the family
Rikenellaceae (phylum
Bacteroidetes) with the closest neighbors being
Alistipes onderdonkii,
Alistipes finegoldii, and
Alistipes timonensis based on 16S rRNA sequence identities of 92.5%, 92.3%, and 92.2%, respectively (
Fig. S2B). POCP analysis suggests the genome to belong to the genus
Alistipes with a matching value of 56% to
Alistipes indistinctus. However, the 16S rRNA gene identity to
Alistipes indistinctus is, at 91.6%, even lower than to the close relatives aforementioned. POCP to the type species of the genus,
Alistipes putredinis, was 48.1%, suggesting the creation of a sister genus. This was also supported by GTDB-Tk being unable to assign the input genome to a sequenced genome at the genus and species levels and by the phylogenomic tree (
Fig. S2B). Hence, these data indicate that
Alistipes indistinctus may have to be reclassified in the future and that the creation of a novel genus is necessary to accommodate the isolate, for which the name
Gallalistipes is proposed. The type species is
Gallalistipes aquisgranensis.
(iv) Description of Gallalistipes aquisgranensis sp. nov. Gallalistipes aquisgranensis (a.quis.gra.nen’sis. M.L. masc. adj. a
quisgranensis, pertaining to Aachen (Germany), where the bacterium was isolated). The species has all features of the genus. Cell are small rods that are 1 to 2 μm long and 0.3 μm wide. They grow under strictly anaerobic conditions in GAM modified medium (DSMZ medium 1715) at 37°C. All functional attributes of this species can be found at
https://github.com/thh32/Protologger within the data sets section. The major cellular fatty acids were C
15:0 ISO (46.2%) and C
18:1 cis9 (37.9%). Other fatty acids included C
17:0 ISO 3OH (4.6%) and C
18:1 t9 (2.3%). The type strain Cla-CZ-119
T (= DSM 108975
T) was isolated from cecal content of an M11-layer chicken. Its G+C content of genomic DNA is 58%.
(v) Description of Gemmiger gallinarum sp. nov. Gemmiger gallinarum (gal.li.na’rum. L. fem. n.
gallina, hen; L. gen. fem. pl. n.
gallinarum, of hens). The closest relative is
Gemmiger formicilis, the type species of this genus, with a 16S rRNA gene sequence identity of 96.2%. Even though
Gemmiger formicilis is placed into the family
Hyphomicrobiaceae (phylum
Proteobacteria) according to its current taxonomic lineage, it clearly falls into the family
Oscillospiraceae based on phylogenetic trees (
Fig. S2C) (
104). Other relatives included
Fournierella massiliensis and
Subdoligranulum variabile within the family
Oscillospiraceae, with 94.8% and 94.7% sequence identities, respectively, which supports the accommodation of the isolate within this family. The highest ANI values were 81.7% to
Subdoligranulum variabile (GCF_000157955.1) and 79.7% to both
Gemmiger formicilis and
Butyricicoccus pullicaecorum, respectively. GTDB-Tk placed the genome of this isolate into the family
Oscillospiraceae and assigned it to the genus
Gemmiger but was unable to provide species-level identification. dDDH values to the closest species
Subdoligranulum variabile,
Gemmiger formicilis, and
Fournierella massiliensis were 25.3%, 22.0%, and 20.2%, respectively. Taken together, a novel species within the genus
Gemmiger is proposed to accommodate the isolate, supported by a POCP value of 61.2% to
Gemmiger formicilis. Cells are rod-shaped, 1 to 2 μm long and 0.4 μm wide, and grow under strictly anaerobic conditions in modified BHI medium (DSMZ medium 215c) at 37°C. All functional attributes of this species can be found at
https://github.com/thh32/Protologger within the data sets section. The major cellular fatty acids were C
16:0 DMA (37.3%) and C
16:0 (32.6%). Other fatty acids included C
14:0 (14.3%) and C
16:0 ALDEHYDE (7.7%). The type strain Cla-CZ-245
T (= DSM 109015
T) was isolated from cecal content of an M11-layer chicken. Its G+C content of genomic DNA is 59.2%.
(vi) Description of Olsenella gallinarum sp. nov. Olsenella gallinarum (gal.li.na’rum. L. fem. n.
gallina, hen; L. gen. fem. pl. n.
gallinarum, of hens). According to 16S rRNA gene-based phylogeny, this isolate is placed into the family
Atopobiaceae (phylum
Actinobacteria) (
Fig. S2D). The closest phylogenetic neighbors are
Olsenella umbonata,
Olsenella profusa, and
Olsenella uli (the type species of this genus) with 96.6%, 96.2%, and 95.8% sequence identities, respectively. The highest ANI value was 80.4% to
Olsenella scatoligenes (GCF_001494635.1), which is below the species delineation threshold of 95%. The POCP value to
Olsenella uli was 54.9%, indicating that the isolate represents a novel species within the genus
Olsenella, as confirmed by GTDB-Tk. Cell morphology varies from spherical cells with a diameter of approximately 1 μm to small rods that are approximately 1.6 μm long and 0.4 μm wide and form chains. The species grows under strictly anaerobic conditions in modified BHI medium (DSMZ medium 215c) at 37°C. All functional attributes of this species can be found at
https://github.com/thh32/Protologger within the data sets section. The major cellular fatty acids were C
14:0 (33.8%) and C
14:0 DMA (30.3%). Other fatty acids included C
14:0 ALDEHYDE (11.6%), C
12:0 (4.7%), C
16:0 DMA (4.3%), C
12:0 DMA (2.6%), and C
14:0 ALCOHOL (2.5%). The type strain Cla-CZ-62
T (= DSM 107455
T) was isolated from colon content of a free-range layer chicken. Its G+C content of genomic DNA is 68%.
(vii) Description of Pseudoflavonifractor gallinarum sp. nov. Pseudoflavonifractor gallinarum (gal.li.na’rum. L. fem. n.
gallina, hen; L. gen. fem. pl. n.
gallinarum, of hens). The 16S rRNA gene-based identification placed the isolate into the family
Oscillospiraceae, with highest sequence identity (98.2%) to
Pseudoflavonifractor capillosus, the type species of the genus (
Fig. S2E). Other relatives include
Flavonifractor plautii and
Intestinimonas butyriciproducens with 97.7% and 95.3% identities, respectively. The highest ANI value was 82.2% to
Pseudoflavonifractor capillosus (GCF_000169255.2). The highest POCP value of 56.9% to
Pseudoflavonifractor capillosus suggests that the isolate belongs to the genus
Pseudoflavonifractor. GTDB-Tk assigned the genome of this isolate to the genus
Flavonifractor and to the species
Flavonifractor sp900199495. In summary, the different taxonomic delineation values and the phylogeny are somewhat inconsistent for this isolate. We nonetheless propose the creation of a new species within the genus
Pseudoflavonifractor to accommodate it, but this group of bacteria (
Flavonifractor and
Pseudoflavonifractor spp.) will likely have to be reclassified in the near future once additional strains have been isolated. Cells grow as short rods (0.7 to 1.2 μm in length) with a width of approximately 0.4 μm under strictly anaerobic conditions in WCA medium (DSMZ medium 339a) at 37°C. All functional attributes of this species can be found at
https://github.com/thh32/Protologger within the data sets section. The major cellular fatty acids were C
14:0 (29.7%) and C
16:0 DMA (28.7%). Other fatty acids included C
14:0 DMA (9.5%), C
12:0 (8.6%), C
18:0 DMA (6.1%), C
16:0 ALDEHYDE (4.2%), C
16:0 (3.6%), and C
14:0 ALDEHYDE (2.2%). The type strain Cla-CZ-98
T (= DSM 107456
T) was isolated from cecal content of a free-range layer chicken. Its G+C content of genomic DNA is 59.9%.
(viii) Description of Ructibacterium gen. nov. Ructibacterium (Ruc.ti.bac.te’ri.um. L. masc. n.
ructus belch, burp; Gr. neut. n.
bakterion a small rod; N.L. neut. n.
Ructibacterium a belching, rod-shaped bacterium, referring to the production of gas in liquid culture). The genus is phylogenetically placed into the family
Oscillospiraceae (phylum
Firmicutes) based on 16S rRNA gene analysis (
Fig. S2F). The closest relatives are
Acetivibrio cellulolyticus, the type species of this genus,
Acetivibrio thermocellus, and
Acetivibrio straminisolvens, with 16S rRNA gene sequence identities of 89.7%, 89.6%, and 89.4%, respectively. POCP did not exceed 50% for any of the close relatives. GTDB-Tk placed the genome of this isolate into the family “
Monoglobaceae” (not validly published) but was unable to provide genus- or species-level assignment. A novel genus,
Ructibacterium, is proposed within the currently validly named family
Oscillospiraceae to accommodate the isolate. The type species is
Ructibacterium gallinarum.
(ix) Description of Ructibacterium gallinarum sp. nov. Ructibacterium gallinarum (gal.li.na’rum. L. fem. n.
gallina, hen; L. gen. fem. pl. n.
gallinarum, of hens). The species has all features of the genus. Cells are small cocci that are approximately 0.6 μm in diameter. They grow under strictly anaerobic conditions in GAM modified medium (DSMZ medium 1715) at 37°C. All functional attributes of this species can be found at
https://github.com/thh32/Protologger within the data sets section. The major cellular fatty acids were C
15:0 ISO (22.8%) and C
16:0 ISO (19.0%). Other fatty acids included C
15:0 ANTEISO (12.6%), C
15:0 ISO DMA (9.5%), C
17:0 ANTEISO (6.6%), C
16:0 ISO DMA (5.4%), C
14:0 ISO (3.5%), C
15:0 ANTEISO DMA (3.5%), and C
17:0 ISO (2.0%). The type strain Cla-CZ-49
T (= DSM 107454
T) was isolated from the cecal content of an M11-layer chicken. Its G+C content of genomic DNA is 43.5%.
(x) Description of Sellimonas monacensis sp. nov. Sellimonas monacensis (mo.na.cen’sis. M.L. neut. n.
Monacum, Munich, a German city; M.L. fem. adj.
monacensis, from/of Munich (Germany), referring to the city where the animal facility of the donor chicken was located). Based on 16S rRNA gene phylogeny, the isolate is placed into the family
Lachnospiraceae (phylum
Firmicutes) (
Fig. S2G). The closest relatives are
Faecalicatena contorta (the type species of the genus
Faecalicatena),
Faecalicatena orotica, and
Coprococcus comes, with sequence identities of 94.4%, 93.9%, and 93.9%, respectively. FastANI identified the genome as novel, with the best match of 78.3% to
Ruminococcus lactaris (GCF_000155205.1). POCP analysis demonstrates a genus separation from
Faecalicatena with a value of 44.8% to
Faecalicatena contorta. The highest POCP values were identified to
Dorea longicatena (56.4%) and to
Dorea formicigenerans (51.7%), the type species of the genus
Dorea, but also to
Clostridium scindens (54.7%),
Clostridium hylemonae (53.1%), and
Sellimonas intestinalis (52.9%). The 16S rRNA gene sequence identity between the isolate and the latter species was 93.7%. GTDB-Tk assigned the isolate to “
Dorea phocaeensis”; however, this species name is not valid. The highest accordance in dDDH value was for “
Lachnoclostridium phocaeense” Marseille-P3177 with 84.6% and a G+C difference of 0.3%. Although this bacterium was described in 2017 (
105), the name “
Lachnoclostridium phocaeense” is still not valid. Taken together, and also considering the topology of both the 16S rRNA gene-based and phylogenomic trees (
Fig. S2G), we suggest the creation of a novel species within the genus
Sellimonas to accommodate the isolate, for which the name
Sellimonas monacensis is proposed. Cells are approximately 1 to 2 μm long and 0.5 μm wide. They grow under strictly anaerobic conditions in GAM modified medium (DSMZ medium 1715) at 37°C. All functional attributes of this species can be found at
https://github.com/thh32/Protologger within the data sets section. The main cellular fatty acid was C
14:0 (17.5%). Other fatty acids included C
18:0 DMA (14.0%), C
16:0 DMA (12.3%), C
16:0 (11.6%), C
14:0 DMA (9.3%), C
12:0 (4.9%), C
18:1 cis11 DMA (3.9%), C
14:0 ALDEHYDE (3.8%), C
18:0 ALDEHYDE (2.9%), C
16:0 ALDEHYDE (2.5%), C
18:1 cis9 DMA (2.4%), and C
18:0 (2.2%). The type strain Cla-CZ-80
T (=DSM 108991
T) was isolated from cecal content of an M11-layer chicken sampled in Munich. Its G+C content of genomic DNA is 50.3%.