ANNOUNCEMENT
In the barren polar desert of the McMurdo Dry Valleys, Antarctica, meltwater-driven supraglacial depressions (i.e., cryoconite holes) and streams such as those in the Canada Glacier basin are among the top refuges for microorganisms (
1 – 6). Cryoconite holes are cylindrical, water-filled depressions on glacial surfaces formed by the deposition and accumulation of aeolian material. The Canada Stream is an ephemeral stream, fed by runoff from the Canada Glacier.
While microbes thrive in cryoconite holes and the stream channel (
7 – 10), rapid response and adaptation are required to withstand frequent disturbances, freezing, desiccation, and transition to quiescence. Our goal is to understand the genetic traits linked to persistence under the multitude of environmental stressors in these systems.
Cryoconite and stream water samples from the Canada Glacier (77°37′S, 162°59′E) were collected in December 2009 and are described elsewhere (
9). Bacteria were isolated aerobically on R2A agar plates at 4°C. The cetyltrimethylammonium bromide procedure was used for extracting DNA from bacterial isolates grown to late exponential phase in R2A broth at 4°C while shaking at 150 rpm (
11). Standard quality shotgun libraries for bacterial strains CAN_C2, CAN_C3, CAN_C7, CAN_S1, CAN_S2, CAN_S4, and CAN_S7 were sequenced on the Illumina NovaSeq S4 platform (2 × 151 bp paired-end reads) (
12). Libraries were prepared on the PerkinElmer Sciclone NGS robotic liquid handling workstation using the Kapa Biosystems Library Kit. DNA was sheared to 459 bp using a Covaris LE220-focused ultrasonicator. DNA fragments were size selected by double solid phase reversible immobilization (SPRI). Fragments were end repaired, A tailed, and ligated with Illumina compatible sequencing adaptors. Raw Illumina sequences were quality filtered using BBTools v38.95 (QV ≥20; length ≥100 bases) (
13) per JGI SOP 1061 and assembled with SPAdes (≥version v3.14.1; –phred-offset 33 –cov-cutoff auto -t 16 m 64 –careful -k 25,55,95) (
14). Contigs with a length <1 kb (BBTools reformat.sh: minlength = 1,000 ow = t) were discarded.
For improved high-quality draft genomes, Pacbio SMRTbell libraries for bacterial strain CAN_C5 were sequenced on Pacific Biosciences (PacBio) Sequel platform (
15). PacBio sequencing generated 889,350 reads with an average length of 8,908 bp. DNA was sheared around 10 kb using Megaruptor 3 (Diagenode). Sheared DNA was treated with exonuclease, DNA repair enzyme mix, end-repair/A-tailing mix, and ligated with barcoded overhang adapters using SMRTbell Express Template Prep Kit 2.0 (PacBio). Libraries were purified with AMPure PB Beads (PacBio) and bound to Sequel II polymerase 2.0 using the Sequel II Binding Kit 2.0. Libraries were sequenced using tbd-sample-dependent sequencing primers, 8M v1 SMRT cells, and Version 2.0 sequencing chemistry with 1 × 900 sequencing movie run times. Reads >5 kb were assembled with hierarchical genome assembly process (HGAP) [smrtlink/8.0.0.80529, HGAP 4 (1.0)] using default settings (
16).
CRISPR elements were identified using the program CRT (CRISPR Recognition Tool) (
17) or PILER-CR (
18). All genomes were annotated in the Integrated Microbial Genomes (IMG) database (≥IMG Annotation Pipeline v.5.0.19) (
19). Genome completeness was estimated using checkM (
20). The genome-sequence-acquired 16S ribosomal RNA gene was queried in command line against the SILVA database 138.1 with blastn, from BLAST+ 2.13.0 for taxonomic classification (
21). Sequence details are given in
Table 1.
ACKNOWLEDGMENTS
This work was supported by the National Science Foundation under grant ANT-0838970 to C.M.F., NASA Fellowship NESSF Planet11R-0011 to H.J.S., and ANT-2037963 to H.J.S., M.D., and C.M.F. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation. The work conducted by the U.S. Department of Energy Joint Genome Institute, a DOE Office of Science User Facility, is supported under Contract No. DE-AC02-05CH11231. Data were generated for JGI Proposal #505037.