Open access
Biotechnology
Announcement
27 December 2024

Genome assembly and annotation of microalga Nannochloropsis oceanica C018

ABSTRACT

The microalga Nannochloropsis is an important organism for algae-based biocommodity production of food, feed, and fuel, among other products. Using PacBio Revio, we sequenced, assembled, and annotated a 26.41 Mbp Nannochloropsis oceanica C018 genome.

ANNOUNCEMENT

Microalgae-based technologies have emerged as sustainable approaches to help mitigate the impact of rising CO2 emissions while providing scalable solutions applicable in agriculture, medicine, and energy, among others. Nannochloropsis microalgae exhibit high productivity and tolerance for elevated CO2 concentrations, thrive in outdoor conditions, have potential for biofuel production, and have emerged as model organisms for applied phycology (1, 2). Additionally, the biochemical characteristics of Nannochloropsis make them a valuable candidate as a food source for both animals and humans. Nannochloropsis oceanica C018 was isolated by S Brown (University of Hawaii-Manoa) near the Island of Hawaii (19.666° -156.032°) using dilution to extinction with seawater-based F/2 medium (3).
Nannochloropsis oceanica C018 was grown in 20 L carboys with F/2 (3) on a 12-hour day/night light cycle (~400 µmol Q/m2/sec) at 25°C and bubbled with sterile air. Following cell harvesting, DNA was isolated with Quick-DNA/RNA Miniprep Kit D7001 (Zymo Research, USA). Total DNA was quantified using DeNovix DS-11 Series Spectrophotometer/Fluorometer (DeNovix Inc., USA) and normalized to 45 ng/µL prior to sequencing library preparation. DNA fragments between 15 and 20 kb were selected for sequencing, and libraries were prepared using Express Template Prep kit version 3 (Pacific Biosciences, USA) for standard HiFi library preparation. High-fidelity sequencing reads were obtained with Pacific Biosciences Revio system with version SMRT Link v13 and Instrument Control Software (ICS) v13. 1,906,268 total reads were generated with an N50 value of 5090 bp, and HiFi read yield was 11,001,781,445 bp.
Genome assembly followed a modified version of the VGP pipeline (4) in usegalaxy.org (5). Samtools v.1.15.1 (6) was used to extract fastq files from .bam sequence output files, and Cutadapt v.4.6 (7) was implemented to trim adapter sequences and filter low-quality reads. Hifiasm v.0.19.8 (8) was used to assemble contigs from high-fidelity reads with “light” purge level. The resulting assembly was decontaminated with kraken2 v.2.1.1 (9), and duplicates were purged with purge_dups v.1.2.6 (10). Completeness of the resulting genome was estimated with BUSCO v.5.5.0 (11) and quality assessed with QUAST v.5.2.0 (12). Default parameters were used except where otherwise noted.
The resulting genome assembly consisted of 30 contigs, 26.41 Mbp (415× coverage) with a GC content of 54.26%, and a L50 of 10. The genome was annotated following the JGI Annotation pipeline and PhycoCosm (12), with 80 million paired-end transcriptomic reads. Functional annotation of the 8,768 gene models was performed using the JGI annotation pipeline. Both genome assembly size and gene count are within the range of other published Nannochloropsis genomes (1315) (https://phycocosm.jgi.doe.gov/eustigmatophyta). This genome will spur the continued development of microalgae for use as a biofuel feedstock and provide prerequisite information needed for genetic manipulation and expression analyses.

ACKNOWLEDGMENTS

This work was supported by U.S. Department of Energy grant numbers DE-EE0007091, DE-EE0008518, and DE-EE0009278 and U.S. Department of Agriculture (through Cornell University) grant number 2019-69012-29905 to ZIJ and benefited from the entire “MAGIC” team. This project was also funded by the Duke Bass Connections program. AEG was supported by a U.S./Mexico Fulbright García-Robles award. The work (proposal: 10.46936/10.25585/60008191) conducted by the U.S. Department of Energy Joint Genome Institute (https://ror.org/04xm1d337), a DOE Office of Science User Facility, is supported by the Office of Science of the U.S. Department of Energy operated under Contract No. DE-AC02-05CH11231. Duke Sequencing and Genomic Technologies would like to acknowledge funding from NIH S10OD034222-01 for data obtained with Revio system.

REFERENCES

1.
Chiu SY, Kao CY, Tsai MT, Ong SC, Chen CH, Lin CS. 2009. Lipid accumulation and CO2 utilization of Nannochloropsis oculata in response to CO2 aeration. Bioresour Technol 100:833–838.
2.
Mimouni V, Couzinet-Mossion A, Ulmann L, Wielgosz-Collin G. 2018. Lipids from microalgae, p 109–131. In Microalgae in health and disease prevention. Academic Press.
3.
Guillard RR. 1975. Culture of phytoplankton for feeding marine invertebrates, p 29–60. In Culture of marine invertebrate animals: proceedings—1st conference on culture of marine invertebrate animals greenport. Springer US, Boston, MA.
4.
Larivière D, Abueg L, Brajuka N, Gallardo-Alba C, Grüning B, Ko BJ, Ostrovsky A, Palmada-Flores M, Pickett BD, Rabbani K, et al. 2024. Scalable, accessible and reproducible reference genome assembly and evaluation in Galaxy. Nat Biotechnol 42:367–370.
5.
Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, Čech M, Chilton J, Clements D, Coraor N, Eberhard C, Grüning B, Guerler A, Hillman-Jackson J, Von Kuster G, Rasche E, Soranzo N, Turaga N, Taylor J, Nekrutenko A, Goecks J. 2016. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Res 44:W3–W10.
6.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup. 2009. The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079.
7.
Martin M. 2011. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet j 17:10.
8.
Cheng H, Concepcion GT, Feng X, Zhang H, Li H. 2021. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods 18:170–175.
9.
Wood DE, Lu J, Langmead B. 2019. Improved metagenomic analysis with Kraken 2. Genome Biol 20:257.
10.
Guan D, McCarthy SA, Wood J, Howe K, Wang Y, Durbin R. 2020. Identifying and removing haplotypic duplication in primary genome assemblies. Bioinformatics 36:2896–2898.
11.
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. 2015. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212.
12.
Grigoriev IV, Hayes RD, Calhoun S, Kamel B, Wang A, Ahrendt S, Dusheyko S, Nikitin R, Mondo SJ, Salamov A, Shabalov I, Kuo A. 2021. PhycoCosm, a comparative algal genomics resource. Nucleic Acids Res 49:D1004–D1011.
13.
Ohan JA, Hovde BT, Zhang XL, Davenport KW, Chertkov O, Han C, Twary SN, Starkenburg SR. 2019. Nuclear genome assembly of the microalga Nannochloropsis salina CCMP1776. Microbiol Resour Announc 8:10–1128.
14.
Schwartz AS, Brown R, Ajjawi I, McCarren J, Atilla S, Bauman N, Richardson TH. 2018. Complete genome sequence of the model oleaginous alga Nannochloropsis gaditana CCMP1894. Genome Announc 6:10–1128.
15.
Corteggiani Carpinelli E, Telatin A, Vitulo N, Forcato C, D’Angelo M, Schiavon R, Vezzi A, Giacometti GM, Morosinotto T, Valle G. 2014. Chromosome scale genome assembly and transcriptome profiling of Nannochloropsis gaditana in nitrogen depletion. Mol Plant 7:323–335.

Information & Contributors

Information

Published In

cover image Microbiology Resource Announcements
Microbiology Resource Announcements
Volume 14Number 211 February 2025
eLocator: e00883-24
Editor: Jason E. Stajich, University of California Riverside, Riverside, California, USA
PubMed: 39727385

History

Received: 28 August 2024
Accepted: 9 December 2024
Published online: 27 December 2024

Keywords

  1. algae
  2. carbon capture
  3. biofuels
  4. next-generation sequencing
  5. biotechnology
  6. bioinformatics

Data Availability

This whole genome project has been deposited at GenBank under the accession JBEBFO000000000. The version described in this paper is JBEBFO010000000. Raw sequencing data has been deposited in the NCBI Sequence Read Archive (SRA) under the BioProject accession PRJNA1118351. Genome assembly and annotations are also accessible at the JGI algal portal PhycoCosm (12): phycocosm.jgi.doe.gov/NanspC018_1/.

Contributors

Authors

Marine Laboratory, Duke University, Beaufort, North Carolina, USA
Author Contributions: Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, and Writing – original draft.
Hari Koneru
Marine Laboratory, Duke University, Beaufort, North Carolina, USA
Author Contributions: Investigation and Writing – review and editing.
Jason Arnold
Department of Molecular Genetics and Microbiology, Duke Microbiome Center, Duke University, Durham, North Carolina, USA
Author Contributions: Conceptualization, Methodology, Supervision, and Writing – review and editing.
Sara Calhoun
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Formal analysis and Resources.
Igor V. Grigoriev
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, USA
Author Contributions: Funding acquisition and Writing – review and editing.
Marine Laboratory, Duke University, Beaufort, North Carolina, USA
Biology, Civil & Environmental Engineering and Duke Microbiome Center, Duke University, Durham, North Carolina, USA
Author Contributions: Conceptualization, Funding acquisition, Project administration, Resources, Supervision, and Writing – review and editing.

Editor

Jason E. Stajich
Editor
University of California Riverside, Riverside, California, USA

Notes

The authors declare no conflict of interest.

Metrics & Citations

Metrics

Note:

  • For recently published articles, the TOTAL download count will appear as zero until a new month starts.
  • There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.
  • Citation counts come from the Crossref Cited by service.

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. For an editable text file, please select Medlars format which will download as a .txt file. Simply select your manager software from the list below and click Download.

View Options

Figures

Tables

Media

Share

Share

Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy