Open access
26 July 2018

Complete Genome Sequences of Streptococcus sobrinus SL1 (ATCC 33478 = DSM 20742), NIDR 6715-7 (ATCC 27351), NIDR 6715-15 (ATCC 27352), and NCTC 10919 (ATCC 33402)


The bacterium Streptococcus sobrinus causes tooth decay in humans. We present complete circularized genome sequences for four strains of S. sobrinus, type strain SL1, strain NIDR 6715-7 and the related NIDR 6715-15, and strain NCTC 10919. The finished genomes will enable genomic comparisons between S. sobrinus and other cariogenic microbes.


Tooth decay, or dental caries, occurs when acid produced by oral bacteria demineralizes the tooth structure. The primary causative agents of caries are the Gram-positive mutans streptococci, including Streptococcus sobrinus (1). S. sobrinus occurs in 10 to 14% of carious lesions (2) and is associated with the poorest clinical outcomes, especially among children (3, 4).
No complete or reference genome exists for any S. sobrinus strain. The current draft genome for the S. sobrinus type strain SL1 contains 54 contigs (5). There is no genome sequence information for strain NIDR 6715 or NCTC 10919 despite multiple phenotypic studies on these microbes (6, 7). We believe that the complete S. sobrinus genomes presented herein are an important step toward understanding the genetic basis of cariogenesis by S. sobrinus.
All strains were cultured overnight in Todd-Hewitt broth with 5% yeast extract (Sigma) and 5 μl/ml Oxyrase (Oxyrase, Inc.) in sealed 15-ml tubes. Genomic DNA was extracted using an UltraClean microbial DNA isolation kit (Mo Bio), with 15 min of incubation at 65°C following step 4. DNA quantity and quality were checked using a Qubit fluorimeter (Invitrogen) and gel electrophoresis.
Genome sequencing used both Illumina short-read and Oxford Nanopore long-read sequences. Libraries for the Illumina MiSeq instrument were generated with Nextera kits and sequenced using a paired end 2 × 250-bp version 2 flow cell yielding 27.4 million reads. FASTQ files containing reads were generated and demultiplexed using blc2fastq version (Illumina).
Nanopore sequencing libraries (8) were prepared for strains SL1, NIDR 6715-7, and NCTC 10919 using the rapid barcoding library kit SQK-RBK004 (Oxford Nanopore). Initial assembly of the Illumina reads revealed that strains NIDR 6717-7 and NIDR 6715-15 are nearly identical, so NIDR 6715-15 was not used for Nanopore sequencing. Libraries were sequenced on a single SpotON R9.4.1 FLO-MIN106 flow cell for 24 h using a GridIONx5 sequencer yielding 2.47 Gb across 537,986 reads (max length, 46.479 kb; mean length, 4.572 kb). Base calling was performed with Albacore 2.1.7 (Oxford Nanopore), and all reads shorter than 1 kb were discarded.
All Illumina short reads were trimmed to 150 bp and quality filtered using BBDuk version 37.09 (option “maq = 6”). Nanopore adapters were trimmed with PoreChop version 0.2.1 using the default settings. Illumina and Nanopore reads from strains SL1, 6715-7, and 10919 were assembled using Unicycler version 0.4.2 (9) with the default options. Each genome assembled to a single circularized contig. Hybrid assembly with SPAdes version 3.11.1 (10) resulted in 1, 1, and 3 contigs for strains SL1, NIDR 6715-7, and NCTC 10919, respectively. Assembly of the strain NIDR 6715-15 Illumina short reads with SPAdes version 3.11.1 (option “–careful”) generated 53 contigs. The contigs from strain NIDR 6715-15 were mapped to the complete NIDR 6715-7 assembly using Quast version 4.6.1 (11), revealing >99% identity between the two substrains. The NIDR 6715-15 contigs were scaffolded into a complete circularized assembly using the NIDR 6715-7 genome.
Genomes were annotated using the NCBI Prokaryotic Annotation Pipeline (12). The SL1 genome is 2.199 Mb (43.5% G+C content) with 2,094 predicted genes. The genomes for strains NIDR 6715-7 and NIDR 6715-15 are 2.145 Mb (43.7% G+C content), with 2,082 predicted genes for strain NIDR 6715-7 and 2,081 predicted genes for strain NIDR 6715-15. The NCTC 10919 genome is 2.145 Mb (43.6% G+C content), with 2,149 predicted genes.

Data availability.

This whole-genome shotgun project has been deposited in GenBank under the accession numbers CP029490 (SL1), CP029560 (NIDR 6715-7), CP029559 (NIDR 6715-15), and CP029491 (NCTC 10919).


We thank Chris Wright and Alvaro Hernandez for their assistance with library preparation and sequencing. Sequencing was performed at the Roy J. Carver Biotechnology Center at the University of Illinois at Urbana-Champaign (UIUC).
This work was supported by NIH National Institute of Dental and Craniofacial Research grant DE026817 and the Bioengineering M.Eng. program at the UIUC. M.J.S. was supported in part by an undergraduate research support grant. The NIDR 6715-7 and NIDR 6715-15 strains were provided by the ARS Culture Collection (NRRL B-14554 and B-14555).
We declare no financial or commercial conflicts of interest.


Loesche WJ. 1986. Role of Streptococcus mutans in human dental decay. Microbiol Rev 50:353–380.
Sánchez-Acedo M, Montiel-Company J-M, Dasí-Fernández F, Almerich-Silla J-M. 2013. Streptococcus mutans and Streptococcus sobrinus detection by polymerase chain reaction and their relation to dental caries in 12 and 15 year-old schoolchildren in Valencia (Spain). Med Oral Patol Oral Cir Bucal 18:e839–e845.
Okada M, Soda Y, Hayashi F, Doi T, Suzuki J, Miura K, Kozai K. 2005. Longitudinal study of dental caries incidence associated with Streptococcus mutans and Streptococcus sobrinus in pre-school children. J Med Microbiol 54:661–665.
Conrads G, de Soet JJ, Song L, Henne K, Sztajer H, Wagner-Döbler I, Zeng A-P. 2014. Comparing the cariogenic species Streptococcus sobrinus and S. mutans on whole genome level. J Oral Microbiol 6:26189.
Song L, Wang W, Conrads G, Rheinberg A, Sztajer H, Reck M, Wagner-Döbler I, Zeng A-P. 2013. Genetic variability of mutans streptococci revealed by wide whole-genome sequencing. BMC Genomics 14:430.
Nascimento MM, Lemos JAC, Abranches J, Gonçalves RB, Burne RA. 2004. Adaptive acid tolerance response of Streptococcus sobrinus. J Bacteriol 186:6383–6390.
Martinez AR, Abranches J, Kajfasz JK, Lemos JA. 2010. Characterization of the Streptococcus sobrinus acid-stress response by interspecies microarrays and proteomics. Mol Oral Microbiol 25:331–342.
Jain M, Olsen HE, Paten B, Akeson M. 2016. The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community. Genome Biol 17:239.
Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595.
Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477.
Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075.
Haft DH, DiCuccio M, Badretdin A, Brover V, Chetvernin V, O’Neill K, Li W, Chitsaz F, Derbyshire MK, Gonzales NR, Gwadz M, Lu F, Marchler GH, Song JS, Thanki N, Yamashita RA, Zheng C, Thibaud-Nissen F, Geer LY, Marchler-Bauer A, Pruitt KD. 2018. RefSeq: an update on prokaryotic genome annotation and curation. Nucleic Acids Res 46:D851–D860.

Information & Contributors


Published In

cover image Microbiology Resource Announcements
Microbiology Resource Announcements
Volume 7Number 326 July 2018
eLocator: 10.1128/mra.00804-18
Editor: Julia Maresca, University of Delaware


Received: 5 June 2018
Accepted: 21 June 2018
Published online: 26 July 2018



Mia J. Sales
Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
William G. Herbert
Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Yuting Du
Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Amitha S. Sandur
Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Naaman M. Stanley
Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA


Julia Maresca
University of Delaware


Address correspondence to Paul A. Jensen, [email protected].
Y.D., A.S.S., and N.M.S. contributed equally to this work.

Metrics & Citations



  • For recently published articles, the TOTAL download count will appear as zero until a new month starts.
  • There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.
  • Citation counts come from the Crossref Cited by service.


If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. For an editable text file, please select Medlars format which will download as a .txt file. Simply select your manager software from the list below and click Download.

View Options

Figures and Media






Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy