INTRODUCTION
Diet has a fundamental role in human metabolism and health. Furthermore, there is convincing evidence that various components of the diet, such as carbohydrates, fat, and fiber (
1,
2), influence the composition of the gut microbiota (
3). There are further factors that modulate metabolic health and the gut microbiota (i.e., the host’s nutritional status, such as overweightness and outright obesity, two conditions commonly observed today). In this respect, it has been hypothesized that the link between diet, gut microbiota and metabolic health may also be influenced by the host’s nutritional status.
The healthy human gut microbiota comprises a variety of distinct bacterial species and a reduced diversity has been associated with metabolic complications such as insulin resistance and dyslipidemia (
4). In a similar manner, a good dietary quality, in accordance with national recommendations, has been demonstrated to confer health benefits in reducing the risk of type 2 diabetes and cardiovascular diseases (
5,
6). Considerable work has been done to reveal this relationship, for example showing that excess adiposity of the host is linked with a range of adverse metabolic outcomes, as well as with deviations in gut microbiota composition (
7,
8). The gut microbiota is malleable to changes in environment and diet (
9). A recent review (
10) provided information regarding effects of various diets (such as animal-based diet, plant-based diet, Mediterranean diet) and dietary components (such as fiber, resistant starch, polyphenols) in modifying gut microbiota composition. Reports regarding diet quality and its association with gut microbiota composition and metabolic functions are few (
11–13). The evidence thus far indicates that a higher dietary quality associates with beneficial bacteria and its increased diversity (
11,
12). Some indication also exists for the influence of diet with gut microbial functional benefit (
13). This small (
n =144) longitudinal study reported that, metabolic pathways related to an increased alpha-diversity were associated with habitual diet. However, the association between the host’s nutritional status, diet and gut microbiota remains to be elucidated further.
There are a myriad of mechanisms through which the gut microbiota may mediate human health effects. One of the key routes is considered to be low-grade inflammation, which is also observed in obesity (
14,
15). A low bacterial gene richness has been associated with a more pronounced inflammatory phenotype (
4). In our previous study with overweight and obese pregnant women, a higher gut microbiota richness defined by 16S rRNA gene sequencing was related to lower levels of GlycA, a novel marker of low-grade inflammation (
16). As compared to 16S rRNA sequencing, the metagenomics approach provides a higher level of resolution at the strain- and species-level of microbiota as well as functional potential of microbiota (
17), which yields further benefit in evaluation of the diet-microbiota-metabolism associations.
Here, our first aim was to investigate the association between dietary quality (
18) and gut microbiota composition, analyzed by metagenomics. Our second aim was to examine the relationship between the composition of the gut microbiota and the metabolic and inflammatory status of the individual. We investigated whether the overweight and obese individuals would reveal distinct associations between dietary quality, gut microbiota composition and metabolic and inflammatory status. Finally, we also investigated the association between microbial functional pathways with dietary quality and body mass index (BMI) groups (overweight and obese).
DISCUSSION
We demonstrated here that there are partially distinct associations between microbiota, diet quality, and metabolic and inflammatory markers in overweight and obese pregnant women (please see a summary in
Fig. 1). Our findings suggest that the metabolic and inflammatory profiles of overweight women benefit from gut microbiota species richness and overall bacterial composition (beta diversity), but in obese women, the quality of diet may influence species diversity and modify the inflammatory profile, particularly the GlycA level.
Our findings show that the diversity of gut microbiota species reflects dietary quality in obese women in early pregnancy. As far as we are aware, this is the first study to have applied a metagenomics approach to explore the relationship between dietary quality and gut microbiota diversity, richness and function in pregnancy. The metagenomics approach provides a higher level of resolution at the strain- and species-level of microbiota compared to previous studies using 16S rRNA gene sequencing. Metagenomics allows us to explore more specific taxonomic and functional classifications for the microbial community (
17). In previous studies, two dietary measures, described as the healthy eating index (compliance with the U.S. Dietary Guideline for Americans) and the Mediterranean diet score (adherence to a Mediterranean diet) correlated with gut microbiota diversity indices in non-pregnant subjects (
19). In another study in women with gestational diabetes, no differences were observed in gut microbiota diversity between the women who adhered to the dietary recommendations and those not eating according to these guidelines (
20). In our smaller scale study of 84 women, the dietary quality correlated positively with gut microbiota diversity (
11). Unlike the previous studies, here we differentiated the participants according to their overweight and obesity status. Indeed, if the women were classified as either overweight or obese, this altered the correlation between dietary quality and gut microbiota species diversity (i.e., the relationship was only observed in obese women). The reason for this observation remains unclear. The dietary quality
per se did not differ between overweight and obese women, but the result suggests that the gut microbiota of obese individuals is more adaptable to dietary modifications.
Interestingly, species richness and community composition were related to both the metabolic and inflammatory profiles, but only in the overweight women. An inverse correlation was observed between the gut microbiota species richness and markers of glucose metabolism and inflammation, here assessed via the GlycA level. Further, the community composition was associated with the GlycA level. In the taxonomic analysis, only one association (i.e., a positive association was found between GlycA and
A. finegoldii and only in overweight women).
A. finegoldii is a bile resistant bacterium, and lower counts have been found in patients with nonalcoholic steatohepatitis (NASH) when compared to healthy adults (
21), and a higher abundance has been associated with elevated blood pressure (
22). GlycA is a novel marker for low grade inflammation which has been associated with metabolic complications including incident type 2 diabetes and cardiovascular diseases (
23,
24). The traditional marker, hsCRP, was related to neither of the richness indices. In a previous study with a subset of the same study population, where 16S rRNA gene sequencing was utilized to explore gut microbiota alpha diversity, a negative correlation was found with GlycA in a combined group of overweight and obese women (
16). Another study in normal weight, overweight, or obese pregnant women detected a negative correlation between the species diversity index and the level of hsCRP in the third trimester (
7), but unlike in our current study, the overweight and obesity status was not considered. Nonetheless, as the metabolic burden is increased in association with maternal BMI (
25), and in obese women compared to overweight women (
26), we propose that a higher gut microbiota species richness may benefit the overweight women by alleviating disturbances in glucose metabolism and reducing the extent of low-grade inflammation. On the contrary, this was not observed in obese women; in these individuals, a higher IDQ was related to higher species diversity, indicating that the obese women with a more disturbed metabolic phenotype may benefit from dietary modification. This is evidenced by the finding of our study; the dietary quality in obese women was reflected in the maternal inflammatory status, and this relationship was independent of gut microbiota taxonomic diversity, but rather it seemed to originate from the direct effects of the diet on the mother’s health.
We also explored associations of diet quality and the BMI groups (overweight and obese) with gut microbiota functional potential. Although such associations might exist (
11–13), we did not find any statistically significant associations. We observed one pathway, iron complex transport system that showed borderline negative correlation with diet quality, but its exact role is not yet understood. Reports investigating the relationships between diet quality and gut microbial function are all in all few (
11–13).
Our study has several strengths. Firstly, we applied high resolution (i.e., at species-level) metagenomics analytics to assay the gut microbiota’s richness and diversity as well as performed functional analyses. Secondly, we used a validated method (
18), for analyzing the dietary quality. Furthermore, due to the well-known effects of adiposity on dietary intake, gut microbiota and circulating metabolic profile, in this comprehensive approach, we took into account the possible influence of the prepregnancy BMI value. Another strength was that our study population was homogeneous with a similar age range and gender. The robustness of statistical analyses was ensured by applying multiple corrections. There are also limitations in our study. Even though the physiological alterations are mild in early pregnancy, it is not clear whether the study results are generalizable to non-pregnant conditions. It also needs to be clarified whether women in late pregnancy or normal weight pregnant subjects behave similarly. Although no statistically significant associations with species diversity and serum markers were observed, the complementary analyses using species richness indicate that such associations might well exist. These indices are complementary indicators and focus on the different aspects of the microbial community. These associations need to be confirmed in larger studies to determine the clear effects. In our study the group of obese women was smaller than that of overweight women which might limit observing statistically significant differences.
Conclusions.
Our results demonstrate that overweight and obese women may display partially distinct metabolic and inflammatory responses to the diet and that they exhibit differences in their gut microbiota species richness and diversity, and community composition. A higher dietary quality was reflected in a richer gut microbiota species diversity, but only in obese women. In overweight women, a higher gut microbiota species richness was related to lower levels of markers of glucose metabolism and both species richness and community composition and one species (i.e., A. finegoldii) associated with low grade inflammation. The different responses to diet and gut microbiota species richness and diversity observed in overweight and obese women may originate from the differences in the metabolic and inflammatory burden related to the excess adiposity. The fact that we observed no relationship between metabolic and inflammatory status with gut microbiota species diversity in obese women suggests that other factors, such as diet as observed in our study, could have regulatory effects of maternal metabolism and inflammation in obese women, although the findings will need to be verified in larger studies. We also propose that the dietary quality index may be used as a tool for controlling the dietary intake in gut microbiota studies, but at the same time, one needs to consider the adiposity of the participants.
MATERIALS AND METHODS
This was a cross-sectional study investigating overweight and obese pregnant women participating in a mother-infant dietary single-center intervention trial (ClinicalTrials.gov, NCT01922791) being conducted in Southwest Finland. The recruitment took place between 10/2013 and 7/2017. The inclusion criteria for the study were overweight (self-reported prepregnancy BMI ≥ 25 kg/m2) and early pregnancy (<18 weeks of gestation). The exclusion criteria were gestational diabetes diagnosed during the current pregnancy, multifetal pregnancy, and the presence of metabolic or inflammatory diseases, including type 1 and type 2 diabetes, celiac disease, and inflammatory bowel disease. The presence of allergy was allowed. Here we describe the interaction between dietary quality, fecal gut microbiota alpha diversity (species richness and Shannon diversity index), community composition i.e., beta diversity and species abundance, and serum markers of glucose metabolism and low-grade inflammation at baseline prior to the onset of the intervention. Women who had consumed antibiotics within 8 weeks before the study visit, and those who did not provide fecal or serum samples and did not fill in the dietary questionnaire were excluded, resulting in 386 out of 439 women being included in the analyses.
Prepregnancy BMI (kg/m
2) was calculated by dividing self-reported weight in kilograms, obtained from welfare women clinic records, by height measured in the study visit with a wall stadiometer to the nearest 0.1 cm. Other characteristics of the women (
Table 1), including age, education and smoking were collected by interviews and questionnaires.
This study was conducted according to the guidelines laid down in the Declaration of Helsinki as revised in 2013, and all procedures that involved human subjects were approved by the Ethics Committee of the Hospital District of Southwest Finland (permission number 115/180/2012). All participants provided written informed consent. The metadata are not publicly available due to their containing information that could compromise the privacy of research participants.
Quality of the diet.
The overall dietary quality with reference to that recommended was measured with the validated index of diet quality (IDQ) questionnaire (
18), which contains 18 questions regarding the frequency and amount of consumption of foods during the week preceding the study visit. The criteria for a health-promoting diet were derived from dietary recommendations and consisted of consumption of whole grain, vegetables, fruits and berries, dairy and choice of foods that will yield a good quality of dietary fat intake. The quality of the diet was defined as poor when index points were less than 10 out of the maximum 15 points and good when points were 10 or more out of 15 (
18). Here, the IDQ score was used both as a continuous variable and as dichotomized in accordance with the previous article describing score (
18).
DNA extraction.
Fecal samples were collected in sterile plastic pots on the morning of the study visit or the previous evening, delivered to the study unit and kept at −20°C until DNA extraction. DNA was extracted from 50 mg of homogenized feces using GTX stool extraction kit and fully automated GenoXTract machine (Hain Lifescience, Nehren, Germany) as previously described (
27). Prior to extraction, mechanical lysis was performed by bead-beating the samples in ceramic bead tubes with MOBIO PowerLyzerTM 24 Bench Top Bead-Based Homogenizer (MO BIO Laboratories, Inc., USA). The DNA concentrations were measured with Qubit 2.0 dsDNA HS assay kit (Life Technologies), after which the DNAs were stored at −80°C until sequencing.
Metagenomic sequencing.
The genomic DNA was randomly sheared into fragments of approximately 350 bp. The fragmented DNA was used for library construction using NEBNext Ultra II Library Prep Kit for Illumina (New England Biolabs). The prepared DNA libraries were evaluated using Qubit 2.0 fluorometer quantitation and Agilent 2100 Bioanalyzer for the fragment size distribution. Quantitative real-time PCR (qPCR) was used to determine the concentration of the final library before sequencing. The library was sequenced using 2 × 150 bp paired-end sequencing on an Illumina HiSeq platform.
Bioinformatics processing.
(i) Data preprocessing. The raw FASTQ files were quality controlled using KneadData (v.0.6.1) to remove low-quality bases and reads derived from the host genome as follows: Using Trimmomatic (v.0.36), the reads were quality trimmed by removing Nextera adapters, leading or trailing bases with a Phred score below 20, and trailing bases in which the Phred score over a window of size 4 drops below 20. Trimmed reads shorter than 100 bases were discarded. Reads that mapped to the human reference genome GRCh38 (with Bowtie2 v.0.2.3.2 using default settings) were also discarded. Read pairs in which both reads passed filtering were retained; these were classified as high-quality non-host (HQNH) reads.
(ii) Mapping reads to gene catalog. HQNH reads were mapped to the integrated gene catalog (IGC) (
28), using BWA mem (v.0.7.16a) with options to increase accuracy (-r 1 -D 0.3). PCR/optical duplicates were removed using samtools (v.1.6). A read pair where both reads had a mapping quality (MAPQ) ≥ 20 and an alignment of at least 100 bp and with ≥ 95% identity to a single IGC gene was considered mapped. However, the mapping was rejected if > 10 bases at either end of the read failed to align to an existing gene sequence (i.e., alignment beyond the IGC gene sequence was accepted). The read counts were used to estimate the abundance of the Clinical Microbiomics proprietary set of IGC metagenomic species (MGS) (
29) derived from abundance profiles across 3200 reference samples. For each MGS, the “core” genes have been defined as the 100 genes with the highest correlation of abundance across the reference samples. A table of MGS counts was created based on the total gene counts for the 100 core genes of each MGS. However, an MGS was considered detected only if the read pairs were mapped to at least three of the 100 core genes; MGSs that did not satisfy this criterion were set as zero counts. The relative abundance estimate of each MGS was made by normalizing the counts for gene lengths. Rarefied (downsampled) MGS abundance profiles were calculated by performing the above procedure on a rarefied gene counts table (generated by random sampling, without replacement, of HQNH read pairs).
The species diversity (Shannon index) and richness were calculated with
vegan R package (
30), from the number of metagenomics species (MGSs) that were detected and their relative abundances in the rarified (7,281,907 read pairs) data. Richness describes the number of species that is detected in a sample, whereas the Shannon diversity index also takes the relative abundance of the species into account. Shannon diversity is less sensitive to sampling errors than species richness, as it gives a higher weight to the more abundant species than to their rarer counterparts.
(iii) Functional annotation of gene catalog. Emapper software (v. 1.0.3, HMM mode) was used to compare each gene in the gene catalog to the EggNOG (v. 4.5) orthologous groups database (
http://eggnogdb.embl.de/). These genes were then mapped from EggNOG to Kyoto Encyclopedia of Genes and Genomes (KEGG) orthologies (KO) and modules (
http://www.genome.jp/kegg/kegg1.html) using MOCAT2 lookup tables (
http://mocat.embl.de/). KEGG pathways that had > 40% prevalence (i.e., pathways which were observed in more than 40% of study population) were selected. Altogether, 195 KEGG pathways were selected for microbial functional analyses out of a total of 530 pathways present in the data.
Sequencing statistics.
On average about 21% reads were discarded being low quality, 0.5% for mapped to the host genome, and 10% for not mapping to the gene catalog, leaving about 69% mapped reads (Table S1 in the supplemental material). Information about read counts (%) per sample is provided as a figure (Fig. S1).
Taxonomy reads annotation.
Reads with an unknown taxonomy were reported as unmapped (if reads not mapped to the gene catalog) and orphan genes (if mapped to the gene catalog but not attributed to an MGS). Mean relative abundances (%) were calculated for bacterial taxa for overweight and obese (Table S2). The differences in the relative abundances of bacterial taxa between overweight and obese were analyzed with Mann-Whitney test and were corrected with Benjamini-Hochberg method. Adjusted P-value < 0.05 was considered as significant.
Taxonomic and functional analyses.
Only those serum variables, which had significant associations with species richness i.e., insulin, HOMA2-IR and GlycA were selected for beta diversity (community composition) analysis. Associations between beta diversity and these continuous variables as well as association between beta diversity and dichotomized IDQ were quantified with PERMANOVA using the
adonis function from the
vegan R package. For beta diversity analysis reads were not rarefied. PERMANOVA was used to test the significance of association based on the Aitchison distance (centered log ratio (CLR) transformation of abundance with Euclidean distance); the
P-values were adjusted with the Bonferroni method. We quantified the association between the GlycA level and individual taxonomic groups (bacterial species) with DESeq2 (
31), separately for overweight and obese women. For DESeq2 analysis, original count data was used. The multiple hypothesis correction was done with Bonferroni method. Adjusted
P-value < 0.05 was considered as significant, whereas cases with adjusted
P-value < 0.1 were considered to be borderline significant. Associations of KEGG pathways with diet quality index (IDQ) and both BMI groups (overweight and obese) were quantified with Spearman correlation and Wilcoxon test, respectively. The multiple hypothesis correction was done with Benjamini-Hochberg method. The taxonomic and functional analyses were performed in R (v.3.6.3).
Markers of glucose metabolism and low-grade inflammation.
Fasting (10 h’ minimum) blood samples were drawn from the antecubital vein, and the serum was separated and analyzed for insulin, glucose, and hsCRP, and the rest of the samples were frozen in aliquots at −80°C until being analyzed for serum metabolomics. A high-throughput proton NMR metabolomics platform (Nightingale, Helsinki, Finland) was used to analyze (
32), the level of serum glycoprotein acetylation (GlycA), a novel marker of low-grade inflammation. GlycA consists of a complex heterogeneous nuclear magnetic resonance signal originating from the N-acetyl sugar groups present on multiple acute phase glycoproteins in the circulation; α1-acid glycoprotein, haptoglobin, α1-antitrypsin, α1-antichymotrypsin and transferrin (
33). The concentrations of glucose, insulin, and hsCRP were measured in an accredited Turku University Hospital Laboratory according to its quality control system. The glucose concentration was measured using an enzymatic method utilizing hexokinase (Cobas 8000 automatic c702-analyzer, Roche Diagnostics GmbH, Mannheim, Germany). Insulin concentrations were determined with an immunoelectrochemiluminometric assay (a modular E170 automatic analyzer, Roche Diagnostics GMbH, Mannheim, Germany). Insulin resistance, HOMA2-IR, was calculated from fasting plasma glucose and fasting insulin using HOMA calculator (
http://www.dtu.ox.ac.uk/). Insulin sensitivity, QUICKI, was calculated as = 1/[log(FastingInsulin) + log(FastingGlucose)] (
34). HsCRP levels were determined using an automated colorimetric immunoassay on the Dade Behring Dimension RXL autoanalyzer (Siemens Healthcare, Camberly, Surrey, UK). The lower limit of detection was 0.1 mg/L.
Statistics.
As not all variables were normally distributed (inspected using histograms and Kolmogorov-Smirnov test), non-parametric tests were used for all statistical tests. The relationship of IDQ and prepregnancy BMI with gut microbiota species richness and diversity with markers of glucose metabolism and low-grade inflammation was analyzed by Spearman’s correlation. The correlations of species richness and diversity with IDQ and serum variables were corrected for multiple testing. The differences in gut microbiota species richness and diversity between the highest and the lowest quartiles of IDQ, between the poor (IDQ < 10) and the good (IDQ ≥ 10) dietary quality and between overweight (prepregnancy BMI < 30) and obese status (prepregnancy BMI ≥ 30) and group difference between overweight and obese for baseline characteristics, were investigated using Mann-Whitney U test. The differences in markers of glucose metabolism and low-grade inflammation between the highest and the lowest species richness and diversity quartile were analyzed by Mann-Whitney U test. The P-values obtained were corrected for multiple testing. Statistical analyses were performed with SPSS for Windows, version 24 (IBM Corp., Armonk, NY) with adjusted P < 0.05 considered statistically significant, whereas cases with adjusted P-value < 0.1 were considered to be borderline significant.
Data availability.
The data sets are not available due to their containing information that could compromise participant privacy and consent.
ACKNOWLEDGMENTS
Clinical trial was supported by the State Research Funding for university-level health research in the Turku University Hospital Expert Responsibility Area, Academy of Finland (#258606 to K.L.), the Diabetes Research Foundation and the Juho Vainio Foundation. Funding to the University of Turku for the metabolomics and gut microbiota analyses and reporting was provided by Janssen Research & Development, LLC. Bioinformatics analyses and writing up the manuscript was supported by the Biocodex Foundation and the Finnish Foundation for Cardiovascular Research. L.L. acknowledges funding from the Academy of Finland (#295741). These funding sources had no role in the design, execution, analyses, interpretation of the data, or decision to submit these results.
K.L. and K.M. designed the research, K.L. and K.M. conducted research, E.M. supervised the fecal DNA extraction, E.K. and N.H. participated in sample and data collection, N.S. and H.B.N. performed the gut microbiota sequencing and bioinformatics, K.M., M.L., and K.L. wrote the paper, L.L. designed and supervised statistical analysis, M.L. and K.M. performed statistical and bioinformatics analyses. K.M., M.L, L.L., and K.L. interpreted the results, all authors revised the manuscript. K.L. had primary responsibility for the final content. All authors read and approved the final manuscript.
We declare no conflict of interest.