INTRODUCTION
Some members of acidophilic microbial communities catalyze ferrous iron oxidation, accelerating ferric iron-mediated oxidative dissolution of sulfide minerals and thus formation of acid mine drainage (AMD) (
1,
2). Because of this capacity, acidophilic microbial communities are harnessed to release metals from sulfide minerals in biomining (reviewed in reference
3). Microbial communities, especially those adapted to very low-pH conditions (less than pH 2), are often dominated by
Leptospirillum bacteria of the phylum
Nitrospira (
4–6). To date, there are three recognized groups within the clade
Leptospirillum based on 16S rRNA phylogeny: group I (
Leptospirillum ferrooxidans), group II (
Leptospirillum ferriphilum and “
Leptospirillum rubarum”), and group III (
Leptospirillum ferrodiazotrophum) (
7–9). All three
Leptospirillum groups have been observed in 16S rRNA gene surveys and metagenomic studies from acidic and bioleaching environments worldwide (
5,
10–13). Based on isolate characterization studies, all are iron-oxidizing chemoautotrophs, and two groups (groups I and III) are reported to be capable of nitrogen fixation (
9,
14). Near-complete genomes for
Leptospirillum rubarum (UBA type), “
Leptospirillum group II 5wayCG” type, and
L. ferrodiazotrophum have been recovered from community genomic data sets (
15–17), and the complete genomes of
L. ferrooxidans and
L. ferriphilum isolates are now available (
18,
19). Much research has focused on floating biofilms sampled from the Richmond Mine at Iron Mountain, California; the biofilms are microns to hundred microns thick and are typically dominated by
Leptospirillum group II (
20,
21). Community genomics, proteomics, and transcriptomics have allowed for the cultivation-independent study of generally abundant members of these communities (
16,
22–25); however, the roles of lower-abundance community members have not been well studied. Here we describe unusual, near-centimeter-thick floating biofilm communities that grow within the Richmond Mine and report the partial genome of the new species “
Leptospirillum group IV UBA BS,” a representative of a new group in the
Leptospirillum clade, group IV. These bacteria comprise less than 3% of the sequenced community, so this study demonstrates the power of community genomics for achieving insight into the physiology of relatively low-abundance community members.
MATERIALS AND METHODS
Biofilm samples, which we refer to as “UBA BS,” were obtained from the A-drift tunnel from within the Richmond Mine, at Iron Mountain, California (40°40′ 38.42″ N and 122″ 31′ 19.90″ W, elevation of ∼900 m). Samples were collected in November 2005 (Nov05), August 2007 (Aug07), November 2007 (Island 2 and 3), June 2008 (Jun08), and December 2011 (Dec11). Sample Nov05 was subjected to Sanger sequencing, as previously described (
15,
22). Additionally, community proteomic data were obtained for Nov05 and Aug07 samples as described earlier (
16,
26). Briefly, proteins were released from biofilms via sonication, fractionated based on cellular location, denatured, and reduced with 6 M guanidine–10 mM dithiothreitol (DTT), digested using sequencing-grade trypsin (Promega, Madison, WI), desalted, and analyzed via two-dimensional nanoliquid chromatography electrospray ionization coupled to tandem mass spectrometry (nano-LC-ES-MS/MS) (linear ion trap Orbitrap; Thermo Fisher Scientific). For the Nov05 samples, two protein extraction methods were used as described in reference
26: the first method used an acidic buffer referred to as M2 buffer, and the second used a 0.1 M sodium acetate (pH 5.0) buffer (S buffer). The resultant MS/MS spectra from individual runs were then used to search a database of predicted proteins from AMD genomic sequences as well as common contaminants (trypsin/keratin) with SEQUEST and filtered with DTASelect (
26). For comparison of protein abundance levels for
Leptospirillum ferrodiazotrophum and the
Leptospirillum group IV UBA BS species, proteomics data were analyzed using clustered normalized spectral abundance factors (NSAFs), as described earlier (
27,
28).
Fluorescence
in situ hybridization (FISH) was done on samples Aug07, Island 2 and 3, Jun08, and Dec11 as described in reference
29 using the following probes: Eubmix (general bacteria), Arc415 (general archaea), Lf1252 (
Leptospirillum group III), and Lf288CG (
Leptospirillum group II 5wayCG). We have designed two probes that target the
Leptospirillum group IV UBA BS species 23S rRNA gene: LIV307 (5′-CCCTCTTTGGCGGACCTTTC-3′) and LIV1191 (5′-CACTCCAGGCCGAACGCTCC-3′). FISH was performed using 40% formamide concentration.
Community genomics data obtained from sample UBA (
15) and from the Nov05 biofilm (UBA BS biofilm) were used to assemble the partial genome of the
Leptospirillum group IV UBA BS species. Briefly, reads belonging to
L. rubarum and
L. ferrodiazotrophum were removed from both data sets, and the remaining reads were coassembled using Phred/Phrap/consed as described previously (
30). Contigs were binned by using ESOM (
31) and by comparing coverage and sequence similarity to
L. ferrodiazotrophum. Manual curation of the assembly was done using methods reported previously (
16). To confirm the accuracy of the binning, reads belonging to
L. rubarum and
L. ferrodiazotrophum were removed from an additional community genomic data set (5wayCG [
22]), and the remaining reads were mapped onto the assembled genome of the
Leptospirillum group IV UBA BS species using gsMapper (Roche/454) with 90% minimum sequence identity and 40-bp minimum overlap as the parameters. Automated annotation was done using an in-house pipeline, and gene calls were manually curated using BLASTX (
32) against the SwissProt database.
To assess abundance of organisms in the genomically characterized UBA BS Nov05 biofilm sample, reads were mapped to the available genomes of AMD organisms using gsMapper with 99% sequence identity and 40-bp minimum overlap as the parameters. The relative abundance of each organism was estimated based on coverage statistics. To calculate coverage, the number of reads mapping to all scaffolds binned to each organism was multiplied by the average read length (800 bp), and the result was divided by the genome size (cumulative length of scaffolds in each bin).
Biofilms for transcriptomic exploration were grown in laboratory bioreactors, as described in reference
33, in the dark, at pH 1 and 37°C, and harvested at early and mid stages of development. Briefly, bioreactors consist of a Teflon channel (30 cm long by 5 cm wide by 3 cm deep), which allows the acidic modified 9K medium to flow at a constant rate (
33). The medium was recycled through the reactor until oxidized (turning from bright green to red color), at which point the spent medium was replaced by fresh 9K medium.
Community transcriptomic data were obtained from eight environmental samples and five samples grown in a bioreactor. Biofilms were lysed using the MirVana lysis buffer (Ambion) and bead-beating. Total RNA was extracted using acid phenol-chloroform-isoamyl alcohol (Ambion), pelleted with cold isopropanol for about 1 h, and immediately purified using the RNEasy MinElute kit (Qiagen). The integrity of the RNA was confirmed using a Bioanalyzer 2100 (Agilent Technologies). An aliquot of good-quality RNA (RNA integrity number > 7) from six environmental samples and three bioreactor samples underwent rRNA depletion using the MicrobExpress kit (Ambion). Good-quality total RNA and rRNA-depleted RNA were converted to cDNA using Superscript III (Invitrogen) as described in reference
34, and the cDNA was fragmented with a Covaris S-system (Covaris, Inc.) to an average fragment size of 200 bp. Fragmented cDNA was sent to the University of California Davis sequencing facility for Illumina genomic library preparation and sequencing. Samples were indexed to sequence multiple samples in an Illumina lane.
To separate ribosomal from nonribosomal reads, transcriptomic reads were mapped to a modified Silva database containing rRNA genes from AMD bacteria and archaea (
35) using Bowtie (
36) with parameters –v 1 –best –
y. Nonribosomal reads from the nine rRNA-depleted samples were pooled with nonribosomal reads obtained from the corresponding total RNA. Nonribosomal reads from all 13 samples were then mapped to the partial genome of the
Leptospirillum group IV UBA BS species using Bowtie with parameters –v 1 –best –
y. Transcript abundance per gene was normalized by dividing the read counts by the gene length, and the resulting value was divided by the total sum of the length-normalized values in each sample.
The 16S rRNA phylogenetic tree was built using ARB (
37) with 1,000 bootstraps. Protein model predictions were done using the Phyre website (
38) and visualized using Pymol (PyMOL molecular graphics system, version 1.2r3pre; Schrödinger, LLC). Ligand binding was predicted using the 3DLigandSite webserver (
39). Clustering of NSAF values was done using the cluster v3.0 for Mac OSX, centering genes and samples by the median, using the Spearman rank correlation similarity matrix, and average linkage as the clustering method (
40). The heatmaps were visualized with the TreeView software (
41).
Nucleotide sequence accession number.
This Whole Genome Shotgun project has been deposited in DDBJ/EMBL/GenBank under the accession AURA00000000. The version described in this paper is version AURA01000000. The genome annotation and proteomics data sets are already publically available via the open knowledgebase ggKbase website (
http://genegrabber.berkeley.edu/amd/organisms/506).
DISCUSSION
Bacteria generally dominate Richmond Mine AMD biofilms, and only recently were
Archaea reported to dominate sunken biofilms degrading under microaerophilic and anaerobic conditions (
48). Here we described an unusual
Archaea-dominated acidophilic biofilm community that contains a novel bacterial species,
Leptospirillum group IV UBA BS. The identity of its 16S rRNA gene sequence compared to sequences of the other
Leptospirillum spp. is sufficiently low to warrant its designation as a distinct species (<98.7% identity, as recommended by Stackebrandt and Ebers [
49]). Enrichment of this species has not been observed in cultures from the Richmond Mine, possibly due to its habitat within an extraordinarily thick polymer environment and its association with uncultivated
Archaea. Notably,
Leptospirillum group IV UBA BS is most abundant in biofilms that contain only very low abundances of other cultivated
Leptospirillum spp. Consistent with its designation as a distinct species, among the leptospirilli, the
Leptospirillum group IV UBA BS species is unique in its capacity for mercury resistance, the presence of a multicopper oxidase, and many unique hypothetical proteins. As in
Leptospirillum ferrooxidans, it has the potential for hydrogen metabolism.
Its genomic content suggests that cells are likely motile, capable of both carbon and nitrogen fixation. It has been suggested that motility allows
L. ferrodiazotrophum to redistribute into microcolonies (
16). Motility may be particularly important for
Leptospirillum group IV UBA BS growing in the gel-like environment of the thick floating biofilms, allowing it to find a suitable habitat as conditions change during biofilm development.
Leptospirillum group IV UBA BS might be capable of anaerobic growth using H
2 as electron donor, as shown for
Acidithiobacillus ferrooxidans (
50), although experimental validation is required to confirm this function. Hydrogen metabolism could be associated with growth under anaerobic conditions predicted within AMD biofilms thicker than a few microns (
51).
Leptospirillum bacteria are known iron oxidizers, likely using cytochrome 572 (Cyt
572) to take electrons from Fe(II) (
52,
53). Cytochrome 579 (Cyt
579) is involved in electron transport (
54,
55). The presence of a multicopper oxidase, as well as Cyt
572 and Cyt
579, whose biochemical function was verified in
Leptospirillum group II from the Richmond Mine (
52,
54), provides strong evidence supporting the role of
Leptospirillum group IV UBA BS in iron oxidation. The
Leptospirillum group IV UBA BS species likely fixes CO
2 via the reverse tricarboxylic acid (TCA) cycle using pyruvate:ferredoxin oxidoreductase (PFOR), as shown in other
Leptospirillum spp. (reviewed in reference
56).
Community transcriptomics shows that the fraction of
Leptospirillum group IV UBA BS genes for which a transcript was detected was highest in the environmental sample A-drift GS0 (see Table S2 in the supplemental material). This early developmental stage biofilm, dominated by
L. ferrodiazotrophum, was collected from the A-drift tunnel in September 2010. The temperature at that location was 40°C, the pH was 1.27, and the solution contained unusually high concentrations of ferric iron (Shufen Ma, personal communication). The sample in which the lowest fraction of genes for which a transcript was detected was the environmental sample C75 GS1 (Table S2). This mid-developmental stage biofilm, collected from a pool in the C-drift tunnel with pH 0.86 and temperature 46°C, was dominated by
Leptospirillum rubarum (group II, UBA type), and
L. ferrodiazotrophum was detected at low abundance (
21). Therefore, both physiologically and in terms of certain environmental preferences, the
Leptospirillum group IV UBA BS species is most similar to
L. ferrodiazotrophum (group III), to which it is most closely related based on phylogenetic analysis. It is notable that expression of hydrogenases in the
Leptospirillum group IV UBA BS species was detected only in the sample in which the nitrogen fixation operon is expressed in
L. ferrodiazotrophum. H
2 is a by-product of nitrogen fixation, but there is no evidence that it can be used by
L. ferrodiazotrophum. Thus, consumption of H
2 may be the basis for cooperative interactions between the
Leptospirillum group IV UBA BS species and
L. ferrodiazotrophum.