Open access
Announcement
3 September 2020

Genome Sequences of Four Potentially Therapeutic Bacteriophages Infecting Shiga Toxin-Producing Escherichia coli

ABSTRACT

Four phages infecting Shiga toxin-producing Escherichia coli (STEC) strains of different serotypes were isolated from wastewater samples. Their virion DNAs range from 51 to 170 kbp, are circularly permuted or have defined terminal repeats, and can encode 82 to 279 proteins. Despite their high similarity to other phages, only about 30% of their genes have a predicted function.

ANNOUNCEMENT

Shiga toxin-producing Escherichia coli (STEC) causes significant foodborne diseases in humans. Being generally nonpathogenic in ruminants, they use their gut as a natural reservoir. Transmission to humans occurs through the consumption of contaminated foods, such as raw or undercooked meat products, raw milk, and contaminated raw vegetables. Because fecal shedding is the major contamination source of carcasses, causing subsequent food recalls and human outbreaks, the role of the live animal in the production of a safe food product is critical. Here, we report the isolation of four broad STEC-infecting phages (vB_EcoM_Lutter [Lutter], vB_EcoM_Ozark [Ozark], vB_EcoM_Gotham [Gotham], and vB_EcoS_Chapo [Chapo]) isolated in Braga, Portugal.
Phages were isolated and produced as described previously (1). Briefly, sewage samples enriched with double-strength tryptic soy broth medium and STEC strains were grown overnight at 37°C with agitation. Filtered supernatants were spotted onto bacterial lawns, and collected phages were used for further purification.
Phage genomic DNA was extracted using phenol-chloroform-isoamyl alcohol extraction (2). Next, whole-genome libraries were constructed using a TruSeq Nano DNA library prep kit. The generated DNA fragments were multiplexed and sequenced in the same Illumina MiSeq run using 300-bp paired-end sequencing reads. The sequence reads were assembled in the Geneious Prime 2020 (Biomatters Ltd., New Zealand) de novo assembler (with medium-low sensitivity), yielding average coverages of 97× (61,819 reads), 20× (9,253 reads), 79× (31,782 reads), and 130× (19,306 reads) for Lutter, Ozark, Gotham, and Chapo, respectively. Quality control of the sequence reads was performed with FastQC v0.11.5 (3), while the assembly quality was verified with Geneious Prime (4). The assembled reads of Lutter, Ozark, and Chapo formed single contigs of overlapping ends with no regions of 2× increased coverage, as expected in the case of terminally redundant and circularly permuted sequences. Their starts were selected to align with the starts of the genomes of similar reference phages. The genomes were annotated using MyRAST (5), BLAST (6), tRNAscan-SE v2.0 (7), ARAGORN (8), PhagePromoter (9), and HHpred (10) (with default program parameters) and manually inspected. A summary of their basic characteristics is presented in Table 1.
TABLE 1
TABLE 1 Morphology and overall features of isolated Escherichia phages
Phage nameMorphology (family)Subfamily, genusGenome size (bp)Virion DNAPackaging strategyG+C content (%)No. of CDSaNo. of tRNAs
vB_EcoM_LutterMyoviridae (Myoviridae)Tevenvirinae, Tequatrovirus170,054Terminally redundant, circularly permutedHeadful packaging, preferred pac cuts between pos.b 97225 and 97248 of genomic sequence35.42798
vB_EcoM_OzarkMyoviridae (Myoviridae)Tevenvirinae, Tequatrovirus167,600Terminally redundant, circularly permutedHeadful packaging, preferred pac cuts between pos.b 94420 and 94443 of genomic sequence39.526810
vB_EcoM_GothamMyoviridae (Myoviridae)Vequintavirinae, Vequintavirus137,054With 459-bp terminal repeatsSame specific start sequence for packaging of all virions43.72146
vB_EcoS_ChapoSiphoviridae (Drexlerviridae)Tunavirinae, Tunavirus51,099Terminally redundant, circularly permutedHeadful packaging, pac cut at pos.b 68/69 of genomic sequence45.5820
a
CDS, coding DNA sequences.
b
pos., position(s).
Lutter was isolated using a STEC O104 strain. It is a myovirus with a 170,054-bp genome that can encode 279 putative proteins (only 120 with predicted function) and shares 90% overall nucleotide identity with the Escherichia phage teqhad (GenBank accession number MN895434). Ozark, isolated using a different STEC O29:H12 strain, is closely related to Lutter (97% overall nucleotide identity). They are both related to prototypical phage T4 and share the preferred 24-bp region of T4 DNA packaging. Gotham is a smaller myovirus with a 137,025-bp DNA molecule and 459-bp terminal repeats, sharing 90% overall nucleotide identity with several other Escherichia phages (e.g., vB_EcoM-ECP26, GenBank accession number MK883717). Chapo is a siphovirus isolated using the STEC O29:H12 strain and is related to phage T1. It has a 51,099-bp genome divided into oppositely transcribed halves and can encode 82 potential proteins (only 22 with predicted functions). The pac cut site of Chapo was localized between positions 68 and 69 of the genomic sequence pointed out by the identical ends in ∼20% of these region reads. All the genomes have defined modules coding different functions. In particular, the lysis cassettes of the myoviruses contain putative holin and endolysin genes that are separated, with the exception of Gotham, where the holin gene was not identified. Siphovirus Chapo is predicted to encode a holin, an endolysin, and u-spanin canonical genes.

Data availability.

The GenBank accession numbers are MT682713, MT682714, MT682715, and MT682716 for vB_EcoM_Ozark, vB_EcoM_Lutter, vB_EcoS_Chapo, and vB_EcoM_Gotham, respectively. The corresponding SRA data have been deposited in NCBI under BioProject accession number PRJNA646048.

ACKNOWLEDGMENTS

This study was supported by the Portuguese Foundation for Science and Technology (FCT) under the scope of the strategic funding of unit UIDB/04469/2020 and the BioTecNorte operation (NORTE-01-0145-FEDER-000004), funded by the European Regional Development Fund under the scope of Norte 2020–Programa Operacional Regional do Norte. This study was supported by grants PTDC/CVT-CVT/29628/2017 (POCI-01-0145-FEDER-029628) and POCI-01-0247-FEDER-033679.

REFERENCES

1.
Oliveira H, Pinto G, Oliveira A, Oliveira C, Faustino MA, Briers Y, Domingues L, Azeredo J. 2016. Characterization and genome sequencing of a Citrobacter freundii phage CfP1 harboring a lysin active against multidrug-resistant isolates. Appl Microbiol Biotechnol 100:10543–10553.
2.
Sambrook JR. 2001. Molecular cloning: a laboratory manual. Cold Spring Harbor Laboratory Press, New York, NY.
3.
Andrews S. 2010. FastQC: a quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc.
4.
Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A. 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28:1647–1649.
5.
Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. 2008. The RAST server: Rapid Annotations using Subsystems Technology. BMC Genomics 9:75.
6.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410.
7.
Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964.
8.
Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16.
9.
Sampaio M, Rocha M, Oliveira H, Dias O. 2019. Predicting promoters in phage genomes using PhagePromoter. Bioinformatics 35:5301–5302.
10.
Soding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248.

Information & Contributors

Information

Published In

cover image Microbiology Resource Announcements
Microbiology Resource Announcements
Volume 9Number 363 September 2020
eLocator: 10.1128/mra.00749-20
Editor: John J. Dennehy, Queens College

History

Received: 16 July 2020
Accepted: 10 August 2020
Published online: 3 September 2020

Contributors

Authors

Carla Dias
CEB–Centre of Biological Engineering, University of Minho, Braga, Portugal
Carina Almeida
INIAV, IP–National Institute for Agrarian and Veterinary Research, Vairão, Vila do Conde, Portugal
Department of Microbial Biochemistry, Institute of Biochemistry and Biophysics of the Polish Academy of Sciences, Warsaw, Poland
CEB–Centre of Biological Engineering, University of Minho, Braga, Portugal

Editor

John J. Dennehy
Editor
Queens College

Notes

Address correspondence to Hugo Oliveira, [email protected].

Metrics & Citations

Metrics

Note:

  • For recently published articles, the TOTAL download count will appear as zero until a new month starts.
  • There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.
  • Citation counts come from the Crossref Cited by service.

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. For an editable text file, please select Medlars format which will download as a .txt file. Simply select your manager software from the list below and click Download.

View Options

Figures

Tables

Media

Share

Share

Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy