Open access
Genomics and Proteomics
Announcement
6 November 2023

Pelagic metagenome-assembled genomes from an estuarine salinity gradient in San Francisco Bay

This article has a companion.
VIEW THE COMPANION

ABSTRACT

San Francisco Bay (SFB) is a large and highly human-impacted estuarine system. We produced 449 metagenome-assembled genomes from SFB waters, collected along the salinity gradient, providing a rich data set to compare the metabolic potential of microorganisms from different salinity zones within SFB and to other estuarine systems.

ANNOUNCEMENT

Despite the ecological, cultural, and economic importance of San Francisco Bay (SFB), few studies have published microbial metagenome-assembled genomes (MAGs) from SFB (1, 2). We submitted eight bottom water samples from along the salinity gradient for metagenomic sequencing particularly to gain insights into nitrifier ecology in ammonia-rich SFB waters. Using a variety of methods, we created a library of representative MAGs from riverine to brackish to marine waters of SFB. We previously reported on ammonia-oxidizing archaea MAGs, including a massive bloom lineage captured by this data set (1), and the ecogenomics of Pelagibacterales (SAR11) subclade IIIa (2); however, most of the rich genomic data we generated remain to be explored.
Sample collection, DNA extraction, sequencing, and assembly methods were previously described (1). Briefly, 400–1,000 mL of bottom water (1 m above estuary floor) was collected from the SFB channel on 24th and 25th October 2013, at eight stations during United States Geological Survey (USGS) Water Quality monitoring cruises, including stations 657 (fresh, <0.5 PSU), 649 (oligohaline, 0.5–5 PSU), 3 (mesohaline, 5–18 PSU), 9 and 13 (polyhaline, 18–30 PSU), and 18, 27, and 34 (euhaline, 30–40 PSU). Microbial biomass was collected on 0.22-µm filters after passage through a 10-µm pore size pre-filter, and DNA was extracted using the FastDNA SPIN Kit for Soil (MP Biomedicals, Santa Ana, CA). Library preparation and metagenomic sequencing were performed by the DOE Joint Genome Institute (JGI) via a CSP project (Proposal ID 503022) to create Illumina fragments (300 bp) sequenced on an Illumina HiSeq 2500-1TB generating paired 150 bp length reads. For library preparation, 100 ng of DNA was sheared (to 479–610 bp) using the Covaris LE220 and size was selected using SPRI beads (Beckman Coulter). The fragments were treated with end-repair, A-tailing, and ligation of Illumina-compatible adapters (IDT, Inc.) using the KAPA-Illumina library creation kit (Kapa Biosystems). qPCR was used to determine the concentration of the libraries. We used quality-controlled and filtered metagenome data generated by JGI using their standard BBtools (v38.87) (3) pipeline (v3.7.3) for assembly, binning, and refining using the metaWRAP (v1.3.2) pipeline (4). Additionally, metagenomes were subset using Seqtk (https://github.com/lh3/seqtk) to 5%, 10%, 20%, and 50% of reads and co-assembled based on salinity zone. All metagenomes, subsets, and co-assemblies were assembled and binned as previously described (1) using metaSPAdes (v3.13.0) (5), MEGAHIT (v1.1.3) (6), MaxBin 2.0 (v2.2.6) (7), and MetaBAT2 (v2.12.1) (8). Bins were consolidated and filtered using metawrap bin_refinement to have >50% completeness and <10% contamination and then reassembled with metaSPAdes. MAGs were dereplicated using dRep (v2.3.2) (9) at 95% average nucleotide identity (ANI) and then taxonomically classified using the Genome Taxonomy Database toolkit (GTDB-tk v2.1.1; GTDB R07-RS207) (10).
Here, we present a total of 449 MAGs representing 17 phyla, including many MAGs from Proteobacteria (n = 178), Bacteroidota (n = 118), and Actinobacteriota (n = 73) (Fig. 1). Binning from subset metagenomes yielded MAGs from high-coverage genera such as Pelagibacter, Nitrosopumilus, or TMED189 (Actinobacteria) (Fig. 2). A median of 10.6× (range 4× to 103×) genome coverage was optimal for recovering genomes. An average of 24% (range 19%–28%) of metagenomic reads were competitively recruited back to the dereplicated MAG library using Bowtie2 (v2.4.2) (11) with the default minimum score threshold. This data set allows for comparisons of closely related microorganisms from different salinity zones and increases our understanding of estuarine pelagic microbial communities. These samples have corresponding 16S rRNA amplicon libraries (12).
Fig 1
Fig 1 Accession numbers, assembly statistics, taxonomic assignments, and sample data for MAGs also available at https://doi.org/10.6084/m9.figshare.24085767
Fig 2
Fig 2 Number of MAGs (x-axis) in data set by GTDB-tk classification (order grouped by phylum) on the y-axis. Bar color indicates the metagenome treatment from which a representative MAG was binned (either subset, all reads [100%], or co-assembly). Plot faceted by salinity zone and by Bay region (North and South).

ACKNOWLEDGMENTS

Thanks go to Julian Damashek for sample collection and Linta Reji for advice on metagenomic workflows. Thanks go to Jim Cloern and the Water Quality of San Francisco Bay monitoring group at USGS and the R/V Polaris crew for facilitating our participation in numerous cruises. Some of the computing for this project was performed on the Stanford University Sherlock cluster. We would like to thank Stanford University and the Stanford Research Computing Center for providing computational resources and support that contributed to these research results.
This work was supported in part by NSF CAREER grant OCE-0847266 from the Biological Oceanography program (to C.A.F.) and in part by fellowship support from the NSF GRFP and Stanford Data Science Scholars program (to A.N.R.). The work conducted by the U.S. Department of Energy Joint Genome Institute, a DOE Office of Science User Facility, is supported by the Office of Science of the U.S. Department of Energy under Contract no. DE-AC02-05CH11231. Sequencing thanks to JGI CSP project 503022 to C.A.F.

REFERENCES

1.
Rasmussen AN, Francis CA. 2022. Genome-resolved metagenomic insights into massive seasonal ammonia-oxidizing archaea blooms in San Francisco Bay. mSystems 7:e0127021.
2.
Lanclos VC, Rasmussen AN, Kojima CY, Cheng C, Henson MW, Faircloth BC, Francis CA, Thrash JC. 2023. Ecophysiology and genomics of the brackish water adapted Sar11 Subclade Iiia. ISME J 17:620–629.
3.
Bushnell B, Rood J, Singer E. 2017. Bbmerge – accurate paired shotgun read merging via overlap. PLOS One 12:e0185056.
4.
Uritskiy GV, DiRuggiero J, Taylor J. 2018. Metawrap—a flexible pipeline for genome-resolved metagenomic data analysis. Microbiome 6:158.
5.
Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. 2017. metaSPAdes: a new versatile Metagenomic assembler. Genome Res 27:824–834.
6.
Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. 2015. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de bruijn graph. Bioinformatics 31:1674–1676.
7.
Wu Y-W, Simmons BA, Singer SW. 2016. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics 32:605–607.
8.
Kang DD, Li F, Kirton E, Thomas A, Egan R, An H, Wang Z. 2019. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ7:e7359
9.
Olm MR, Brown CT, Brooks B, Banfield JF. 2017. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J 11:2864–2868.
10.
Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. 2020. GTDB-Tk: a toolkit to classify genomes with the genome taxonomy database. Bioinformatics 36:1925–1927.
11.
Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with bowtie 2. Nat Methods 9:357–359.
12.
Rasmussen AN, Damashek J, Eloe-Fadrosh EA, Francis CA. 2021. In-depth spatiotemporal characterization of planktonic archaeal and bacterial communities in North and South San Francisco Bay. Microb Ecol 81:601–616.

Information & Contributors

Information

Published In

cover image Microbiology Resource Announcements
Microbiology Resource Announcements
Volume 12Number 1214 December 2023
eLocator: e00800-23
Editor: Simon Roux, DOE Joint Genome Institute, Berkeley, California, USA
PubMed: 37929976

History

Received: 27 August 2023
Accepted: 23 September 2023
Published online: 6 November 2023

Keywords

  1. estuary
  2. metagenome-assembled genomes
  3. pelagic
  4. gradients

Data Availability

Raw metagenomes are available under NCBI BioProject no. PRJNA439806 to PRJNA439813 (NCBI SRA accession numbers: SRR7130817, SRR7130819, SRR7130820, SRR7130903, SRR7131305, SRR7131306, SRR7132116, SRR7132117). Quality controlled and filtered metagenomes are available from the JGI Genome Portal under accession numbers 3300021957 to 3300021964. MAG assemblies are deposited in NCBI BioProject no. PRJNA819083, as are links to BioSamples for the two ammonia-oxidizing archaea MAGs (assemblies are stored under PRJNA439808 and PRJNA439812) (Fig. 1, column Accession.NCBI). Corresponding 16S rRNA amplicon libraries are available in NCBI BioProject no. PRJNA577706. Figure 1 contains relevant MAG metadata (e.g., quality, taxonomy, accession numbers, etc.).

Contributors

Authors

Department of Earth System Science, Stanford University, Stanford, California, USA
Department of Earth System Science, Stanford University, Stanford, California, USA

Editor

Simon Roux
Editor
DOE Joint Genome Institute, Berkeley, California, USA

Notes

The authors declare no conflict of interest.

Metrics & Citations

Metrics

Note:

  • For recently published articles, the TOTAL download count will appear as zero until a new month starts.
  • There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.
  • Citation counts come from the Crossref Cited by service.

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. For an editable text file, please select Medlars format which will download as a .txt file. Simply select your manager software from the list below and click Download.

View Options

Figures and Media

Figures

Media

Tables

Share

Share

Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy