INTRODUCTION
Just as early humans were hunter-gatherers with respect to food, modern humans remain hunter-gatherers with respect to energy. We drill, mine, or extract wherever we can find resources. To become a sustainable society, we must transition from this hunter-gatherer mode to one that provides a continuous, stable, and abundant supply of energy. Bio-oil derived from microalgae is one of the more promising forms of renewable energy, as algae can produce more oil per acre than land plants and can be grown nearly anywhere, including in degraded habitats and nonarable lands (
1,
2). Fatty acid methyl esters (FAMEs) are the main constituent of biodiesel produced by the extraction and transesterification of fatty acids from algal biomass. Thus, the key metrics of the quality of algal biomass for bio-oil production are the proportion of the biomass composed of fatty acids and the composition of different types of fatty acids. Also, high FAME yields lead to improved bio-oil yield and quality from newer more promising technologies such as hydrothermal liquefaction that transform the entire wet algal biomass into biocrude oil (
3).
A key bottleneck that limits the economic viability of energy cultivation by algae is our inability to maximize biomass quality, based on fatty acid content, while at the same time maximizing biomass quantity. The physiological reasoning behind this limitation is that organisms with finite resources choose either to use energy to grow or to store energy in the form of lipids for future growth (
4–6). One approach that has proven moderately successful has been to genetically engineer microalgal strains (
7), but these strains are expensive to produce and unlikely to thrive under the variable field conditions inherent in outdoor commercial-production facilities. However, there is an alternative approach that uses principles in ecology to engineer communities of multiple species (
1,
4,
8,
9). This approach is based on ecological research that has shown that multispecies ecosystems regularly outperform monocultures in many functions, including biomass production (
10–14).
These positive causal relationships between biodiversity and function are especially strong when considering multiple functions at once (
15). While simultaneously improving multiple functions would have clear benefits for the extraction of sustainable resources, our knowledge of biodiversity-ecosystem function relationships in natural systems has yet to be applied to commercial applications such as biofuels and bioproducts. For example, a biorefinery, in which plant biomass is grown and refined to yield multiple products such as biofuel and biochemicals, would benefit from simultaneous improvements of multiple functions (
16). Specific for biofuels, studies suggest that algal species richness may influence the quantity or quality of the biomass (
13,
17). However, the simultaneous effects of species richness on the biomass yield and biomass quality needed for improved bio-oil production, along with the genomic drivers underlying such a phenomenon, remain to be rigorously investigated.
Here, we incorporated new data and analyses into an experiment that was previously analyzed to test the effect of evolutionary relatedness and gene expression on community stability and coexistence (
18–20). In this experiment, we grew monocultures of eight species of green freshwater microalgae,
Chlorella sorokiniana,
Closteriopsis acicularis,
Cosmarium turpinii,
Pandorina charkowiensis,
Scenedesmus acuminatus,
Selenastrum capricornutum,
Staurastrum punctulatum, and
Tetraedron minutum, and all 28 of the possible species pairs under controlled laboratory conditions for 48 days until the majority of the communities had reached steady-state biomass. These species were specifically selected to span a large gradient in phylogenetic distances on our molecular phylogeny (
21). The charophytes included the two desmids
Cosmarium turpinii and
Staurastrum punctulatum, while the remaining species belong to the chlorophytes. At the end of the experiment (i.e., day 48), for each species, we quantified the total cell density and RNA transcription levels. For each culture, we measured the biomass quality as total fatty acid methyl ester (FAME) content, which consisted of a mixture of 28 different fatty acids. The proportions of these different fatty acids were used to calculate two important qualities of extracted total FAME, cetane number and higher heating value. Previously, these data were used to show that bicultures consisting of more closely related species generally exhibited more similar gene expression, weaker competitive interactions, and greater temporal stability of biomass (
19,
20). In our current analyses, we show that multispecies communities of freshwater microalgae outperform monocultures in producing a high quantity and quality of biomass representative of algal feedstock for bio-oil production. To assess the causes underlying our observation that species richness simultaneously improved the algal biomass yield and quality, we also measured gene expression in each algal population in each community. We used these data to contrast lipid metabolism, nutrient assimilation, and photosynthetic gene expression of each algal species alone in monoculture versus those in each biculture growing condition, and related genome-wide expression patterns to the biomass attained by each algal population and our metric for biomass quality, FAME.
RESULTS
We first tested whether algal bicultures outperformed monocultures in the production of bio-oil, considering the metrics of biomass quantity and quality. We considered these measures, on average, in bicultures versus monocultures, but also tested whether bicultures overyield with respect to monocultures. Overyielding refers to a biculture that significantly exceeds expectations on the basis of monoculture yields. Second, we evaluated differential expression in bicultures relative to that in their monocultures, focusing specifically on gene functions involved in lipid metabolism that we hypothesized would be important for the production of bio-oil. We first considered these genes categorized into broad Gene Ontology functional groups and then identified specific sets of orthologous genes. Lastly, we linked differential expression of genes involved in lipid metabolism with the overyielding and relative composition of fatty acids, which are key indicators of fuel quality.
We found that bicultures, on average, produced significantly higher levels of algal biomass and of FAMEs, which we measured in two ways: FAME yield as a weight percentage of the biomass, and the total yield of FAMEs calculated by multiplying FAME yield and algal biomass (
Fig. 1A to
C). We found that bicultures frequently overyielded with respect to their comprising monocultures in the individual function of either biomass yield or quality. Nearly half of all bicultures produced a biomass of significantly higher quality (i.e., FAME yields) relative to the quality of the biomass produced by their constituent monocultures (FAME overyielding occurred in 13 of 28 combinations). Additionally, five bicultures significantly overyielded in biomass. Further, bicultures generated greater biomass than monocultures in part through a greater utilization of nutrient pools from the growth medium. This trend was significant throughout the experiment (NO
3:
F1,34 = 6.42,
P = 0.012; PO
4:
F1,34 = 15.32,
P < 0.001), but was greater in magnitude toward the end of the experiment, when PO
4 levels were 51% lower and NO
3 levels were 35% lower in bicultures than in monocultures.
Furthermore, we found that bicultures significantly outperformed monocultures when simultaneously considering biomass yield and quality, as they were, on average, closer to the optimum than monocultures (nested linear model on the Euclidean distances from the 1:1 “optimum” point;
F1,34 = 2.9,
P < 0.01) (
Fig. 1D). The optimum was defined as the combination of the highest biomass observed across all experiments (i.e., biomass of the
Closteriopsis,
Staurastrum biculture) with the highest FAME yield observed (i.e., FAME yield of the
Chlorella,
Staurastrum biculture). This conclusion that bicultures outperformed monocultures was robust using either the “threshold” or “averaging” method for calculating multifunctionality (see Fig. S1 in the supplemental material). While bicultures, on average, significantly outperformed monocultures when considering multifunctionality, and the best biculture exceeded the function of the best monoculture when considering each function separately, the best biculture was not significantly better than the best monoculture when simultaneously considering both functions (Euclidean distance to optimum of the best monoculture fell within 0.27 standard deviations of a bootstrapped distribution of the top three bicultures).
We also found that biomass quality can be predicted by gene expression. Results from our genetic analyses in part explain how species richness promotes multifunctionality: 64% of all bicultures that overyielded in FAME also contained at least one algal species that was differentially expressing genes involved in lipid metabolism (
Fig. 2). The relative FAME yield was calculated as the mean FAME yield of A plus B biculture/(mean [mean yield of species A in monoculture and mean yield of species B in monoculture]). We indicate an overyielding in FAME only where the biculture yield was significantly higher than the average of the two monocultures. Differential expression indicates that the algal species expressed significantly greater or lower levels of the gene in the biculture growing condition relative to that in the monoculture growing condition. We identified candidate genes that we can infer are involved in lipid metabolism using a keyword search of lipid-relevant terms in the Gene Ontology annotation database. For conciseness, we refer to these genes as “lipid metabolism genes.” Importantly, one-half of the algal populations that were differentially expressing their lipid genes were also growing at biomass levels greater than those produced in monoculture (
Fig. 2). The differential expression of genes involved in lipid metabolism was predictive of FAME overyielding in bicultures relative to the expectations based on monoculture yields (logistic regression: relative FAME, χ
2 = 6.9,
P < 0.01). Furthermore, the differential expression of these genes was predictive of the total FAME yield in bicultures (logistic regression: total FAME, χ
2 = 4.8,
P = 0.027). We directly assessed the effect of species richness on lipid metabolism by measuring the gene expression in each algal population in each biculture compared to the expression by that species in monoculture. For example, a community can contain either two species that do not differentially express lipid genes (i.e.,
Chlorella and
Cosmarium), one species that does differentially express lipid genes and one that does not (i.e.,
Chlorella and
Tetraedron), or two species that both differentially express lipid genes (i.e.,
Chlorella and
Staurastrum). With this approach, we identified 10 occasions in which at least one of the two species in biculture differentially expressed genes with known lipid functions compared to all other genes (Kolmogorov-Smirnov tests, all false-discovery-rate-corrected
P values were <0.05) (
Fig. 2) and found that overyielding in FAME and/or biomass occurred in all 10 of these occasions. Furthermore, among the top 10 bicultures with the highest relative FAME yields, six bicultures had one or both species differentially regulating their lipid genes (
Fig. 2). Lastly, we confirmed the reproducible effects of these treatment conditions on algal gene expression by calculating Euclidean distances between biological replicates that were visualized with multidimensional scaling. We show that Euclidean distances among biological replicates within the same species combination were significantly less than the distances among biological replicates of different combinations (see Table S2).
We also found that the expression of genes known to regulate specific lipid functions was a significant predictor of overyielding in FAME (
Fig. 3). Specifically, overyielding bicultures diverged from nonoveryielding bicultures via the overexpression of genes regulating the saponifiable lipids, including glycerophospholipids and lipid A, as well as the regulation of dihydrolipoyllysine-residue succinyltransferase activity, which is essential for the production of coenzyme A and fatty acid synthesis (
22). Overyielding bicultures also diverged from nonoveryielding bicultures via the underexpression of genes regulating the metabolism of steroids, which are nonsaponifiable lipids (
Fig. 3, PC2 and PC3). This trend was common across multiple species, with overyielding bicultures diverging significantly from nonoveryielding bicultures in lipid gene expression, as summarized using the mean change in gene expression of 28 Gene Ontology terms involved in lipid metabolism (see principal-component analysis [PCA] in
Fig. 3) (logistic regression: χ
2 = 10.7,
P = 0.0047; multivariate analysis of variance:
F1,38 = 16.0,
P < 0.001) (see Table S3 for variable loadings and Fig. S2 for a PCA plot of PC1 versus PC2).
We then identified orthologous sets of genes across all eight species of microalgae by searching for best matches in
Chlorella variabilis, a more-well-studied species of algae. With this approach, we found that differential expression of orthologous sets of genes in the monoculture versus biculture growing conditions was predictive of the elevated FAME yields observed in bicultures relative to that in monocultures. Specifically, the upregulation of a set of orthologous genes involved in peroxisome biogenesis, where fatty acid oxidation occurs, was predictive of elevated FAME yields (
Fig. 4). Furthermore, the downregulation of a set of orthologous genes involved in the modification of phospholipids was predictive of an elevated FAME yield among bicultures (
Fig. 4). In addition to searching for orthologs in
Chlorella variabilis, we searched
Arabidopsis thaliana for two enzymes that were poorly described in algae but are especially notable for fatty acid biosynthesis (diacylglycerol acyltransferase [DGAT] and phospholipid diacylglycerol acyltransferase [PDAT]). Differential expression levels of DGAT and PDAT genes were not predictive of an elevated FAME yield (
Fig. 4). However, we found that the differential expression of several other orthologous sets of genes was predictive of the fold change in other important metrics of biomass quality measured in the monoculture versus biculture conditions. The differential expression among four orthologous sets of genes was predictive of the fold change in cetane number, a measure of fuel ignition delay (see Fig. S3), and differential expression among five orthologous sets of genes was predictive of the fold change in higher heating value, a measure of the quantity of heat released during fuel combustion (see Fig. S4). Cetane number and higher heating value are largely determined by the relative composition of fatty acids that differ in carbon chain length and saturation.
Overall, our differential expression analyses contribute to understanding the underlying mechanism by identifying which algal species within the community contributes to total fatty acid generation. Linking lipid gene expression to specific community members enabled us to merge these data with population densities to determine that lipid overyielding does not necessitate a reduction in growth. To further elucidate mechanistically how bicultures may be outperforming monocultures in fatty acid production, we evaluated whether differential expression among genes regulating photosynthesis and nutrient assimilation was associated with increased FAME production or total fatty acid production, which accounts for both increased biomass production and increased FAMEs as a weight percentage of the biomass. The genes we tested for their role in regulating photosynthesis included those encoding carbonic anhydrases, glutamate semialdehyde aminotransferases, and light harvesting chlorophyll a-b complexes. We tested iron permeases, nitrite transporters, nitrite reducers, nitrate transporters, nitrate reducers, and phosphate transporters for their roles in regulating nutrient assimilation. Although we found that 39% of bicultures contained at least one algal species upregulating photosynthesis genes (see Fig. S7), and 25% of bicultures contained at least one algal species upregulating nutrient assimilation genes (see Fig. S8), we found no systematic association between genes regulating photosynthesis or nutrient assimilation with the magnitude of FAME overyielding or production of total fatty acids (see Fig. S9).
DISCUSSION
Our results illustrate that in addition to improving each aspect of bio-oil production individually, species richness can simultaneously improve biomass yield and quality. Our findings are consistent with the majority of studies that have found that species richness enhances biomass yield (
10,
13,
14), though there are exceptions (
23). Our study is also consistent with a prior study that suggests richness may help improve biomass quality (
17). However, our study shows that species richness can help us accomplish both of these at once. Furthermore, our results demonstrate that key metrics of bio-oil quality have significant transcriptomic correlates that are common across the multiple species of algae studied here. We show that species richness directly affects the regulation of fatty acid metabolic processes that are known to be key for the production of high-quality bio-oil based on both a fundamental understanding of fatty acid biosynthesis (
22) and the validation here from finding correlations of gene expression with our measurements of bio-oil quality. By tracing lipid gene expression to individual populations of algae, we found that species richness can simultaneously improve both the biomass yield and the fatty acid content of algal biomass.
Our results add to several other studies that document numerous advantages of incorporating diversity. For example, compared to monocultures, multispecies crops are more stable, require less fertilizer, generate less nutrient pollution, and are more resistant to pest invasions (
23–26). While recent studies have used gene engineering approaches to improve fatty acid production under controlled laboratory conditions (
7), few monocultures thrive when exposed to the abiotic and biotic stressors of outdoor conditions that are essential for commercially scaled production. Our results show that ecological engineering is a promising approach that harnesses multispecies communities to improve the quantity, quality, and environmental sustainability of bio-oil production. However, while diversity improves the production of bio-oil on average, we did not find evidence that the best diverse crop significantly outperformed the best monoculture crop. An important next step is to apply our findings toward identifying whether diversity would improve yields among our best-performing monoculture crops, for example by using a more diverse group of algal species beyond green algae.
Lipid content is arguably the most important measure of quality for an algal feedstock, regardless of whether the biofuel industry relies on direct lipid extraction or adopts newer methods, such as hydrothermal liquefaction, that use the entire algal biomass. By experimentally introducing interactions among species, we show that algae frequently respond to interspecific interactions by differentially expressing genes that regulate lipid biosynthesis. Although it is challenging to definitively identify the functions of genes in nonmodel organisms, we show that the degree of differential expression of our genes that were purportedly involved in lipid metabolism was indeed significantly predictive of the degree of overyielding in FAME in bicultures relative to that in monocultures. We used a cautious approach for these transcriptomics analyses by focusing only on a subset of transcripts that were most taxonomically relevant and functionally informative. By retaining only those transcripts mapping to taxa in the Diaphoretickes (i.e., the Plants+Hacrobia+SAR [stramenopiles, alveolates, and Rhizaria] megagroup of photosynthetic organisms) that have Gene Ontology annotations, we avoided the possible inclusion of transcripts with unknown lipid functions in our “nonlipid” category of genes and minimized the possible misinterpretations from functions inferred from taxa outside the Diaphoretickes megagroup, as the algal cultures we used have associated bacteria and fungi. However, this approach could exclude genes that may be poorly annotated but biologically relevant, which could explain why certain bicultures overyielded in FAMEs but showed no underlying genomic signature (i.e., the
Cosmarium-Closteriopsis biculture). We note that our study did not exclude bacteria and fungi in the algal “phycosphere”: those microbes residing in the mucilaginous layer immediately surrounding the algal cell that are known to be facilitating or even essential for algal growth (
27–29). Further efforts to improve the cultivation of outdoor algal biofuels should test the effects of these bacteria and fungi, which will inevitable colonize ponds, on the outcomes of biofuel production.
We found that overyielding bicultures could be differentiated from nonoveryielding bicultures on the basis of the differential expression of genes that regulated the synthesis of different types of lipids. Genes regulating the saponifiable lipids, including glycerophospholipids and lipid A, tended to be upregulated among overyielding bicultures. While lipid A is best known from bacterial systems, studies have confirmed that plant mitochondrial membranes contain the full biosynthetic pathway for lipid A (
30,
31). Also notable is that overyielding and nonoveryielding bicultures diverged significantly in the expression of the enzyme dihydrolipoyllysine-residue succinyltransferase. This enzyme catalyzes the reaction generating coenzyme A, which is a key player in the synthesis of fatty acids (
22). Specifically, the amounts of coenzyme A and its close derivatives, such as malonyl-CoA, that have accumulated in plant plastids are directly predictive of the rate of fatty acid biosynthesis (
32). In contrast, overyielding bicultures also diverged from nonoveryielding bicultures via the underexpression of genes regulating the metabolism of steroids, which are nonsaponifiable lipids that are not convertible to FAMEs (
33).
Our results also identified several sets of orthologous genes for which differential expression caused by the diversity treatment was strongly predictive of improved biofuel quality metrics among bicultures. Interestingly, we found that the upregulation of peroxisome biogenesis protein 16 (PEX16) was positively associated with the overyielding of FAMEs. This is notable because the peroxisome and PEX genes are best described for their role in the β-oxidation or degradation of fatty acids, which is often induced in plants under stress from pathogens or herbivores (
34). However, PEX16 has been implicated in the synthesis of fatty acids in
Arabidopsis (
35). Lin et al. hypothesized that peroxisomes may therefore be involved in both the degradation and the synthesis of fatty acids, perhaps with some degradation necessary to sustain net synthesis (
35). We also identified genes involved in phospholipid metabolism that were predictive of fuel metrics; however, the directional effects were mixed. Phospholipids themselves, with only two fatty acid tails, yield less bio-oil than triglycerides. Resources dedicated toward modifications among the diverse forms of glycerophospholipids or the synthesis of additional phospholipids at the expense of triglycerides may decrease bio-oil yields. For example, triglycerides yield more than 99% bio-oil, while phospholipids typically yield less than 70% bio-oil (
36,
37). However, the genes involved in increasing glycerophospholipid transformations into free fatty acids, such as those encoding phospholipase A2s (PLA2s), may improve the bio-oil yield (
38). This may explain why some genes, such as those involved in phospholipid translocation, were positively associated with bio-oil production, while others were negatively associated with production. Lysophospholipid acyltransferase and CDP-diacylglycerol-glycerol-3-phosphate 3-phosphatidyltransferase, which are involved in modifications within phospholipids, were both negatively associated with bio-oil production. Such modifications to the phosphate head of the phospholipid that leave the fatty acid tails unchanged may still affect bio-oil yield by changing the proportional mass of the molecule comprising the fatty acids tails.
Lastly, we found that the differential expression of several orthologous sets of genes was associated with changes in cetane number and higher heating value. A main predictor of cetane number is the relative composition of saturated, monounsaturated, and polyunsaturated fatty acids. Our algal diversity treatment drove the differential expression of fatty acid desaturases, which significantly increased cetane number in bicultures. The upregulation of PEX16 also significantly predicted greater cetane number among bicultures. Several findings further suggested that algal diversity contributed to the long-term storage of lipids. Diversity caused an upregulation in lipid-droplet-associated hydrolases, which have been shown with experimental work in cell lines to increase triacylglycerol levels (
39). Diversity also upregulated very-long-chain 3-ketoacyl-coenzyme A synthases, which are involved in elongating lipids prior to storage (
40). The upregulation of these genes involved in lipid storage corresponded to increased cetane number and higher heating value metrics in bicultures relative to those in monocultures.
Combined, our ecological and transcriptomics analyses show that algal populations growing within overyielding communities are differentially regulating lipid metabolism while frequently maintaining growth at biomass levels above those attained in monoculture. This finding provides an underlying explanation for our observation of a beneficial effect of species richness on both biomass yield and quality, which is somewhat contrary to expectations from within-species tradeoffs. Indeed, attempts to improve multiple functions within monocultures have been met with failure due to an inability to overcome tradeoffs that are inherent to biological populations. One such tradeoff that has been proposed is between the biomass that a population of organisms produces versus the quality of this biomass for bio-oil production. Organisms with access to excess resources tend to grow, while those under nutrient stress instead tend to accumulate lipids that improve the quality of algal biomass (
5,
41,
42). It has therefore been argued that populations can increase either biomass quantity or quality or produce suboptimal levels of both. In contrast, our findings suggest that species consortia can better utilize nutrient resources for an improved production of biomass. While our data indeed showed that there was a greater utilization of available nutrients in bicultures, this was unlikely to be causing nutrient stress, because nutrient levels remained orders of magnitude higher than the levels typically reported as the lowest at which green algal species can maintain a stable population (resource ratio [R*] values) (
43,
44).
Thus, interspecific interactions evidently trigger the accumulation of fatty acids without necessarily compromising biomass yield. While tradeoffs may be an inherent aspect of biology at the species level, our results begin to show that we can harness what decades of research is seeming to prove as an inherent aspect of ecology: diversity improves functions, and quite often, improves functions simultaneously. By rooting our findings in the genomic foundation of species interactions, our results contribute to our understanding of the drivers of elevated function in nature and may increase our ability to both predict when and where to expect such patterns to occur and apply these phenomena to agricultural and biotechnological uses.
ACKNOWLEDGMENTS
This work was supported by the U.S. National Science Foundation DIMENSIONS of Biodiversity program (DEB-1046121) to B.J.C. and T.H.O. and Emerging Frontiers in Research and Innovation in Photosynthetic Biorefineries (EFRI-PSBR 1332343) to B.J.C., V.J.D., P.E.S., and T.H.O.
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
The authors declare no competing interests.
B.J.C. and T.H.O. conceived the ideas and developed them with P.E.S. and V.J.D. A.N. designed and carried out algal growth experiments. R.B.L. and D.C.H. measured algal fatty acid content. For transcriptomics analyses, B.B. completed quality control, library construction, and read mapping. S.L.J. completed gene annotation and differential expression analyses, analyzed data sets, and generated figures with advisement from V.J.D. and B.J.C. S.L.J. drafted the paper with conceptual and editorial input from all authors.