Open access
Environmental Microbiology
Announcement
8 February 2024

Whole community shotgun metagenomes of two biological soil crust types from the Mojave Desert

ABSTRACT

We present six whole community shotgun metagenomic sequencing data sets of two types of biological soil crusts sampled at the ecotone of the Mojave Desert and Colorado Desert in California. These data will help us understand the diversity and function of biocrust microbial communities, which are essential for desert ecosystems.

ANNOUNCEMENT

Biological soil crusts (BSCs) cover approximately 30% of the global dryland surface area (1) and carry out important ecological services (2). As ecosystem pioneers, BSCs colonize bare soil surfaces in low-water, extreme-temperature environments. However, the underlying mechanisms for how microbes give rise to the structure and function of biocrusts and how they interact to provide ecosystem services are still not well understood.
Dry light cyanobacterial/algal crust (LAC) and cyano lichen crust (CLC) samples (~5 cm2 in size) were collected at Sheephole Valley Wilderness within the ecotone of the Mojave Desert and Colorado Desert of San Bernardino County, CA, on 9 September 2018 with each of three replicate samples taken 250 m apart from each other at GPS location 34.1736 N, 115.3888 W, and placed in sealed glass mason jars (cushioned by paper towels) for transport back to the laboratory. One hundred milligrams of each sample were randomly subsampled and used for DNA extraction using a MoBio PowerSoil DNA Isolation Kit (12888-50; QIAGEN, Germantown, MD) following the manufacturer’s instructions and sent to the DOE Joint Genome Institute (JGI) for library construction and sequencing. One hundred nanograms of extracted DNA were sheared to a target length of 656 bp using a Covaris LE220 Focused-ultrasonicator (Covaris LLC, Woburn, MA) and size selected using SPRI magnetic beads (Beckman Coulter Life Sciences, Indianapolis, IN). DNA fragments were treated with end-repair, A-tailing, and ligated to Illumina compatible adapters (Integrated DNA Technologies, Inc., Coralville, IA) using a KAPA-Illumina library kit (Roche Diagnostics Corporation, Indianapolis, IN). Libraries were quantified using a KAPA Biosystems next-generation sequencing library qPCR kit and a Roche LightCycler 480 real-time PCR instrument (Roche Diagnostics Corporation, Indianapolis, IN).
Sequencing was performed on an Illumina NovaSeq sequencer (Illumina, Inc., San Diego, CA) using XP V1 reagent kits, S4 flowcells, and a 2 × 151 indexed recipe. Reads were processed with the standard JGI Metagenome workflow (3). Briefly, BBDuk version 38.79 from the BBTools package (https://jgi.doe.gov/data-and-tools/bbtools/) was used to process raw reads, which were then corrected using BBCMS version 38.34 (k-mer count ≥2, high-count fraction ≥0.6) (4) and assembled using metaSPAdes version 3.13.0 (metagenome flag, no error correction, k-mer sizes of 33, 55, 77, 99, and 127) (5) and MEGAHIT version 1.2.9 with default settings (6). Processed reads were mapped to the final assembly, and coverage information was generated using BBMap version 38.34 (4).
Approximately 98% of raw reads passed trimming/quality control filters, and over 70% of processed reads were assembled into contigs. On average, more reads were assembled into contigs using MEGAHIT (75.8%) than metaSPAdes (72.7%) with an estimated metagenome coverage of ~13×. MetaSPAdes assemblies were more compact in terms of total assembly length, with generally longer maximum contig lengths than those of MEGAHIT, although MEGAHIT generated fewer contigs than metaSPAdes. The JGI Integrated Microbial Genomes analysis pipeline (https://genome.jgi.doe.gov/portal/ProMicSoilCrusts/ProMicSoilCrusts.info.html) revealed few eukaryotic microbes, likely due to poor DNA extraction yields from these microbes with the conventional commercial soil DNA isolation kit used. Nevertheless, these data can provide useful initial insights into the prokaryotic diversity of LAC and CLC biocrust types.
TABLE 1
TABLE 1 Accession numbers and characteristics of metagenome data from two biocrust types from the Sheephole Valley Wilderness (Mojave Desert, CA)a
MetagenomeNCBI BioSample IDNCBI
BioProject ID
No. of filtered readsAssembly
BioSample ID
No. of
contigs
No. of assembled (150 bp) readsMetagenome assembly length (bp)Metagenome coverageN50 (bp)Max contig length (kb)
LAC replicate 1SAMN14510913PRJNA620793215,587,996PRJNA1021617
PRJNA1021621
2,550,257
2,332,566
166,293,935
170,423,036
1,742,734,840
1,868,428,908
14.3×
13.7×
458,966
441,759
651.104
546.176
LAC replicate 2SAMN14510861PRJNA620794219,555,444PRJNA1021629
PRJNA1021637
2,663,237
2,365,106
162,376,141
166,589,100
1,700,000,691
1,813,819,568
14.3×
13.8×
513,765
479,343
1000.405
1000.030
LAC replicate 3SAMN14510934PRJNA620795208,714,792PRJNA1021639
PRJNA1021640
2,353,418
2,177,883
158,432,369
162,939,834
1,571,681,495
1,713,465,883
15.1×
14.3×
415,201
416,349
1301.000
541.813
CLC replicate 1SAMN14510746PRJNA620790191,564,444PRJNA1021643
PRJNA1021649
2,437,969
2,319,548
134,704,800
139,237,529
1,670,634,364
1,835,861,038
12.1×
11.4×
422,767
453,111
609.588
169.489
CLC replicate 2SAMN14510985PRJNA620791200,791,158PRJNA1021657
PRJNA1021659
2,812,278
2,605,836
142,285,895
146,893,754
1,913,908,387
2,049,607,909
11.2×
10.8×
508,862
526,639
756.686
384.386
CLC replicate 3SAMN14510745PRJNA620792209,432,468PRJNA1021660
PRJNA1021664
3,121,332
2,934,558
142,063,838
147,954,872
1,959,209,449
2,157,330,921
10.9×
10.3×
649,941
653,857
949.882
345.113
a
All the contigs are ≥0.1 kb. Assembly statistics for metaSPAdes and MEGAHIT assemblies are listed on the top and bottom (in italics) of each row, respectively.

ACKNOWLEDGMENTS

We thank the BLM Needles CA office for their assistance with permitting at the Sheephole Valley Wilderness and Dr. Hung Phan (Iowa State) for help with cleaning and submitting data to NCBI.
This work was performed and supported in part by the following: the Facilities Integrating Collaborations for User Science (FICUS) program (proposal: https://doi.org/10.46936/fics.proj.2018.50356/60000035) and used resources at the DOE Joint Genome Institute (JGI) (https://ror.org/04xm1d337) and the National Energy Research Scientific Computing Center (NERSC) (https://ror.org/05v3mvq14), which are DOE Office of Science User Facilities operated under contract no. DE-AC02-05CH11231; Bureau of Land Management (BLM) Cooperative Agreement L15AC00153 to N. Pietrasiak and BLM permit number 6850-CAD0000.06 to N. Pietrasiak and J.E.S.; the U.S. Department of Agriculture, National Institute of Food and Agriculture Hatch project CA-R-PPA-211–5062-H to N. Pombubpa and J.E.S.; a Royal Thai Government Scholarship to N. Pombubpa; and NSF GoLife grant DEB-1541538 and CAREER grant DEB-1846376 to E.F.Y.H. J.E.S. is a CIFAR fellow in the Fungal Kingdom: Threats and Opportunities program. This is UM’s Center for Biodiversity and Conservation Research Publication No. 40.

REFERENCES

1.
Guang S, Ying Z, Haotian Y, Xinrong L. 2023. Biocrust mediates the complexity and stability of bacterial networks in both biocrust and subsoil layers in the Tengger Desert. Plant Soil 490:217–237.
2.
Tian C, Pang J, Bu C, Wu S, Bai H, Li Y, Guo Q, Siddique KHM. 2023. The microbiomes in lichen and moss biocrust contribute differently to carbon and nitrogen cycles in arid ecosystems. Microb Ecol 86:497–508.
3.
Clum A, Huntemann M, Bushnell B, Foster B, Foster B, Roux S, Hajek PP, Varghese N, Mukherjee S, Reddy TBK, Daum C, Yoshinaga Y, O’Malley R, Seshadri R, Kyrpides NC, Eloe-Fadrosh EA, Chen I-MA, Copeland A, Ivanova NN. 2021. DOE JGI metagenome workflow. mSystems 6:e00804-20.
4.
Bushnell B. 2014. BBMap: a fast, accurate, splice-aware aligner. Lawrence Berkeley National Laboratory LBNL-7065E.
5.
Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. 2017. metaSPAdes: a new versatile metagenomic assembler. Genome Res 27:824–834.
6.
Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. 2015. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31:1674–1676.

Information & Contributors

Information

Published In

cover image Microbiology Resource Announcements
Microbiology Resource Announcements
Volume 13Number 312 March 2024
eLocator: e00980-23
Editor: Frank J. Stewart, Montana State University, Bozeman, Montana, USA
PubMed: 38329355

History

Received: 13 October 2023
Accepted: 23 January 2024
Published online: 8 February 2024

Keywords

  1. metagenomics
  2. soil crusts

Data Availability

Raw sequencing data and metagenome assemblies are available at the NCBI using the hyperlinked BioSample and BioProject ID numbers that are listed in Table 1. Data are also available from JGI’s genome portal (https://genome.jgi.doe.gov/portal/ProMicSoilCrusts/ProMicSoilCrusts.info.html) and GOLD database (https://gold.jgi.doe.gov/study?id=Gs0142145).

Contributors

Authors

Department of Biology and Center for Biodiversity and Conservation Research, University of Mississippi, University, Mississippi, USA
Author Contributions: Data curation, Formal analysis, Investigation, Methodology, Validation, Writing – original draft, and Writing – review and editing.
Department of Microbiology and Plant Pathology, University of California, Riverside, California, USA
Author Contributions: Investigation and Methodology.
Present address: Department of Microbiology, Faculty of Science, Chulalongkorn University, Bangkok, Thailand
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Formal analysis and Methodology.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Methodology and Supervision.
Brian Foster
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Formal analysis and Methodology.
Bryce Foster
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contribution: Methodology.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Supervision, Writing – review and editing, and Methodology.
Krishnaveni Palaniappan
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contribution: Methodology.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Formal analysis and Methodology.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contribution: Data curation.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contribution: Data curation.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Methodology and Supervision.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Methodology and Supervision.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Methodology and Supervision.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Methodology and Supervision.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Methodology and Supervision.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contribution: Project administration.
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Author Contributions: Supervision and Methodology.
School of Life Sciences, University of Nevada-Las Vegas, Las Vegas, Nevada, USA
Author Contributions: Conceptualization, Funding acquisition, Investigation, Methodology, Resources, Supervision, and Writing – review and editing.
Department of Microbiology and Plant Pathology, University of California, Riverside, California, USA
Author Contributions: Conceptualization, Funding acquisition, Investigation, Methodology, Resources, Supervision, and Writing – review and editing.
Department of Biology and Center for Biodiversity and Conservation Research, University of Mississippi, University, Mississippi, USA
Author Contributions: Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft, and Writing – review and editing.

Editor

Frank J. Stewart
Editor
Montana State University, Bozeman, Montana, USA

Notes

Thuy M. Nguyen and Nuttapon Pombubpa contributed equally to this article. NP performed the sampling and DNA isolation. TMN performed the analysis and wrote the manuscript.
The authors declare no conflict of interest.

Metrics & Citations

Metrics

Note:

  • For recently published articles, the TOTAL download count will appear as zero until a new month starts.
  • There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.
  • Citation counts come from the Crossref Cited by service.

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. For an editable text file, please select Medlars format which will download as a .txt file. Simply select your manager software from the list below and click Download.

View Options

Figures and Media

Figures

Media

Tables

Share

Share

Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy