Open access
Bacteriology
Announcement
28 August 2024

Draft genome sequences of Butyrivibrio hungatei DSM 14810 (JK 615T) and Butyrivibrio fibrisolvens DSM 3071 (D1T)

ABSTRACT

Here, we report the draft genome sequences of two Butyrivibrio-type strains isolated from rumen fluid. The genome sequence of Butyrivibrio hungatei DSM 14810 was 3.3 Mb with 3,093 predicted genes, while the Butyrivibrio fibrisolvens DSM 3071 genome sequence was 4.8 Mb with 4,132 predicted genes.

ANNOUNCEMENT

Butyrivibrio is commonly found in the rumen of wild and domesticated ruminant animals (1, 2). Butyrivibrio spp. are anaerobic, Gram-negative curved rods with a single flagellum, and they ferment rumen oligosaccharides and monosaccharides into butyrate, an essential source of energy for the host (1, 3, 4). Butyrivibrio fibrisolvens DSM 3071 (D1T) was isolated from cow rumen in the 1950s in Maryland, USA (3), while Butyrivibrio hungatei DSM 14810 (JK 615T) was isolated from sheep rumen in the 1990s in Mnichovice, Czech Republic (1). Here, we report draft genome sequences of these two Butyrivibrio-type strains as part of the 1000 Microbial Genomes Project (5). These resources will improve our understanding of the complex metabolism and physiology of rumen microbiomes.
Leibniz Institute DSMZ cultured B. fibrisolvens DSM 3071 and B. hungatei DSM 14810 anaerobically at 37°C and 40°C, respectively, using DSMZ Medium 330 broth (https://mediadive.dsmz.de/medium/330) inoculated from an ampoule preserved by freeze-drying. The centrifuged cell pellet from an approximately 50-mL culture was used to extract genomic DNA using Epicentre’s MasterPure Gram-positive DNA Purification Kit. The Joint Genome Institute constructed an Illumina paired-end library with an average insert size of 270 bp. The library was quantified using KAPA Biosystem’s sequencing library qPCR kit and run on a Roche LightCycler 480 real-time PCR instrument. Several libraries were multiplexed and prepared for sequencing using a TruSeq paired-end cluster kit (v.3) and Illumina’s cBot instrument. Sequencing was performed on an Illumina HiSeq 2000 using TruSeq SBS sequencing kits (v.3) following a 2 × 150 indexed run protocol. A total of 8,786,458 reads totaling 1,318.0 Mbp were generated for B. fibrisolvens DSM 3071 and 9,376,984 reads totaling 1,406.5 Mbp were generated for B. hungatei DSM 14810. Reads were quality controlled and trimmed through DUK (v.1.0) (6). Filtered reads were assembled using Velvet (v.1.2.07) (velveth: 63 –shortPaired and velvetg: –very clean yes –exportFiltered yes –min contig lgth 500 –scaf- folding no –cov cutoff 10); 1- to 3-kb simulated paired-end reads were created from Velvet contigs using wgsim (v.1.0) (–e 0–1 100–2 100 –r 0 –R 0 –X 0); and these reads were assembled with simulated read pairs with Allpaths-LG (v.r46652) (PrepareAllpathsInputs: PHRED 64 = 0 PLOIDY = 1 FRAG COVERAGE = 125 JUMP COVERAGE = 25 LONG JUMP COV = 50, RunAllpath- sLG TARGETS = standard) (79). Gene annotation was performed using Prodigal (v.2.5), with manual curation via GenePRIMP (v.1.0) (10, 11). Predicted coding sequences were translated and used to search National Center for Biotechnology Information nr, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro (1218). tRNAScanSE (v.1.3.1) was used to identify tRNA genes, while rRNA genes were found using SILVA (v.123) (19, 20) (Table 1).
TABLE 1
TABLE 1 Genomic features of Butyrivibrio hungatei DSM 14810 and Butyrivibrio fibrisolvens DSM 3071a
Genome featureB. hungatei DSM 14810B. fibrisolvens DSM 3071
Length (bp)3,394,9474,837,257
Number of contigs2283
Contig N50 (bp)286,711119,711
Average fold coverage415×272×
GC content (%)b39.8639.66
Predicted genes3,0934,132
Predicted protein-coding genes3,0304,062
Predicted rRNAs911
Predicted tRNAs4644
GH1 domains (β-glucosidase)13
GH3 domains (β-glucosidase)615
GH5 domains (cellulase)25
GH8 domains (cellulase)10
GH9 domains (cellulase)03
GH44 domains (cellulase)00
GH48 domains (cellulase)00
Joint Genome Institute IMG/M taxon ID25825807262585428068
NCBI WGS accession numberGCA_900143205GCA_900129945
NCBI Bioproject accession numberPRJNA245645PRJNA245644
NCBI Sequence Read Archive accession numberSRR4096539SRR4096531
NCBI BioSample numberSAMN02745247SAMN02745229
a
NCBI, National Center for Biotechnology Information.
b
GC, Guanine/Cytosine.
Both B. fibrisolvens DSM 3071 and B. hungatei DSM 14810 metabolize cellobiose (2, 3), and accordingly, their genomes contain many predicted beta glucosidases (GH1 and GH3 domains). However, DSM 3071 can metabolize polysaccharides such as carboxymethyl cellulose, while DSM 14810 cannot (1). This activity is likely attributable to one or more of the eight predicted cellulases in the DSM 3071 genome, compared to three proteins with cellulase domains (GH5/8/9/44/48) in the DSM 14810 genome (Table 1).

ACKNOWLEDGMENTS

The work (proposal DOI: https://doi.org/10.46936/10.25585/60000886) was conducted by the US Department of Energy, Joint Genome Institute (JGI) (https://ror.org/04xm1d337), a DOE Office of Science User Facility that is supported by the Office of Science of the US Department of Energy operated under contract no. DE-AC02-05CH11231. This announcement was largely prepared by undergraduate students, and we gratefully acknowledge JGI for initiating and supporting it as an educational project (the “Adopt-a-Genome” Project).

REFERENCES

1.
Kopečný J, Zorec M, Mrázek J, Kobayashi Y, Marinšek-Logar R. 2003. Butyrivibrio hungatei sp. nov. and Pseudobutyrivibrio xylanivorans sp. nov., butyrate-producing bacteria from the rumen. Int J Syst Evol Microbiol 53:201–209.
2.
Palevich N, Kelly WJ, Leahy SC, Altermann E, Rakonjac J, Attwood GT. 2017. The complete genome sequence of the rumen bacterium Butyrivibrio hungatei MB2003. Stand Genomic Sci 12:72.
3.
Bryant MP, Small N. 1956. The anaerobic monotrichous butyric acid-producing curved rod-shaped bacteria of the rumen. J Bacteriol 72:16–21.
4.
Brown DW, Moore WEC. 1960. Distribution of Butyrivibrio fibrisolvens in nature. J Dairy Sci 43:1570–1574.
5.
Kyrpides NC, Woyke T, Eisen JA, Garrity G, Lilburn TG, Beck BJ, Whitman WB, Hugenholtz P, Klenk H-P. 2014. Genomic type strains, phase I: the one thousand microbial genomes (KMG-I) project. Stand Genomic Sci 9:1278–1284.
6.
Choi DH, Ahn C, Jang GI, Lapidus A, Han J, Reddy TBK, Huntemann M, Pati A, Ivanova N, Markowitz V, Rohde M, Tindall B, Göker M, Woyke T, Klenk H-P, Kyrpides NC, Cho BC. 2015. High-quality draft genome sequence of Gracilimonas tropica CL-CB462T (DSM 19535T), isolated from a Synechococcus culture. Stand Genomic Sci 10:98.
7.
Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18:821–829.
8.
Gnerre S, Maccallum I, Przybylski D, Ribeiro FJ, Burton JN, Walker BJ, Sharpe T, Hall G, Shea TP, Sykes S, Berlin AM, Aird D, Costello M, Daza R, Williams L, Nicol R, Gnirke A, Nusbaum C, Lander ES, Jaffe DB. 2011. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci U S A 108:1513–1518.
9.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup. 2009. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25:2078–2079.
10.
Pati A, Ivanova NN, Mikhailova N, Ovchinnikova G, Hooper SD, Lykidis A, Kyrpides NC. 2010. GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes. Nat Methods 7:455–457.
11.
Hyatt D, Chen G-L, Locascio PF, Land ML, Larimer FW, Hauser LJ. 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119.
12.
O’Leary NA, Wright MW, Brister JR, Ciufo S, Haddad D, McVeigh R, Rajput B, Robbertse B, Smith-White B, Ako-Adjei D, et al. 2016. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res 44:D733–D745.
13.
Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Natale DA, O’Donovan C, Redaschi N, Yeh L-SL. 2005. The universal protein resource (UniProt). Nucleic Acids Res 33:D154–D159.
14.
Haft DH, Loftus BJ, Richardson DL, Yang F, Eisen JA, Paulsen IT, White O. 2001. TIGRFAMs: a protein family resource for the functional identification of proteins. Nucleic Acids Res 29:41–43.
15.
Mistry J, Chuguransky S, Williams L, Qureshi M, Salazar GA, Sonnhammer ELL, Tosatto SCE, Paladin L, Raj S, Richardson LJ, Finn RD, Bateman A. 2021. Pfam: the protein families database in 2021. Nucleic Acids Res 49:D412–D419.
16.
Kanehisa M, Goto S. 2000. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28:27–30.
17.
Tatusov RL, Galperin MY, Natale DA, Koonin EV. 2000. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 28:33–36.
18.
Paysan-Lafosse T, Blum M, Chuguransky S, Grego T, Pinto BL, Salazar GA, Bileschi ML, Bork P, Bridge A, Colwell L, et al. 2023. InterPro in 2022. Nucleic Acids Res 51:D418–D427.
19.
Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964.
20.
Pruesse E, Quast C, Knittel K, Fuchs BM, Ludwig W, Peplies J, Glöckner FO. 2007. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res 35:7188–7196.

Information & Contributors

Information

Published In

cover image Microbiology Resource Announcements
Microbiology Resource Announcements
Volume 13Number 1010 October 2024
eLocator: e00517-24
Editor: Julie C. Dunning Hotopp, University of Maryland School of Medicine, Baltimore, Maryland, USA
PubMed: 39194265

History

Received: 14 May 2024
Accepted: 12 August 2024
Published online: 28 August 2024

Keywords

  1. Butyrivibrio
  2. rumen
  3. genomes

Contributors

Authors

Keely Berner
Department of Biological Sciences, California State University San Marcos, San Marcos, California, USA
Author Contributions: Formal analysis, Investigation, and Writing – original draft.
Michelle Zoza-Veloz
Department of Biological Sciences, California State University San Marcos, San Marcos, California, USA
Author Contributions: Formal analysis, Investigation, and Writing – original draft.
Matt Nolan
Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Data curation, Formal analysis, Investigation, Methodology, Software, Validation, and Visualization.
Danielle Graham
Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Data curation, Formal analysis, Investigation, Methodology, Software, Validation, and Visualization.
Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Supervision, and Validation.
Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, and Writing – review and editing.
Stefan Spring
Department Microorganisms, Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
Author Contributions: Methodology and Writing – review and editing.
Department of Biological Sciences, California State University San Marcos, San Marcos, California, USA
Author Contributions: Conceptualization, Formal analysis, Investigation, Supervision, Writing – original draft, and Writing – review and editing.

Editor

Julie C. Dunning Hotopp
Editor
University of Maryland School of Medicine, Baltimore, Maryland, USA

Notes

Keely Berner and Michelle Zoza-Veloz contributed equally to this article. They are listed alphabetically by surname.
The authors declare no conflict of interest.

Metrics & Citations

Metrics

Note:

  • For recently published articles, the TOTAL download count will appear as zero until a new month starts.
  • There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.
  • Citation counts come from the Crossref Cited by service.

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. For an editable text file, please select Medlars format which will download as a .txt file. Simply select your manager software from the list below and click Download.

View Options

Figures

Tables

Media

Share

Share

Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy