ABSTRACT

Increasing evidence indicates the significant racial difference in gut, vaginal, and skin microbiomes. However, little is known regarding the racial difference in the oral microbiome. In this study, deep sequencing of 16S rRNA genes was utilized to assess the oral microbiome in mouth rinse samples of 1,058 African-Americans (AAs) and 558 European-Americans (EAs) from the Southern Community Cohort Study. Generally, AAs had a higher species richness than EAs, with P = 5.28 × 10−14 (Wilcoxon rank sum test) for Faith’s phylogenetic diversity index. A significant difference in overall microbiome composition was observed between AAs and EAs, with P = 5.94 × 10−4 (MiRKAT) for the weighted UniFrac distance matrix. We also found 32 bacterial taxa showing a significant differential abundance or prevalence between the two racial groups at a Bonferroni-corrected P < 0.05 in linear or logistic regression analyses. Generally, AAs showed a higher abundance of Bacteroidetes and a lower abundance of Actinobacteria and Firmicutes. Interestingly, four periodontal pathogens, Porphyromonas gingivalis, Prevotella intermedia, Treponema denticola, and Filifactor alocis, were more prevalent among AAs than among EAs, with Bonferroni-corrected P values of 5.23 × 10−6, 4.47 × 10−6, 1.08 × 10−3, and 4.49 × 10−5, respectively. In addition, all of these 32 taxa were significantly correlated with the percentage of genetic African ancestry. These findings call for research to understand how the racial difference in oral microbiome influences the health disparity.
IMPORTANCE In this systemic investigation of racial differences in the oral microbiome using a large data set, we disclosed the significant differences in the oral microbial richness/evenness, as well as in the overall microbial composition, between African-Americans and European-Americans. We also found multiple oral bacterial taxa, including several preidentified oral pathogens, showing a significant different abundance or prevalence between African-Americans and European-Americans. Furthermore, these taxa were consistently found to be associated with the percentage of genetic African ancestry. Our findings warrant further research to understand how the racial difference in the oral microbiome influences the health disparity.

INTRODUCTION

It has been estimated that nearly 100 trillion microbes colonize in different human body habitats, collectively composing the microbiota (1). Host-microbiota interactions are deeply involved in various physiological and metabolic activities (2) and hosts’ health (3). The human mouth is heavily colonized by microorganisms (1) and acts as a portal for microbes to gain access to the respiratory and digestive tracts (4). It is well acknowledged that the oral microbiota affects two common oral diseases, dental caries and periodontal disease (5), while recent studies also indicate that the oral microbiota may play roles in maintaining systemic health through nutrition absorption, metabolism, and immune system regulation (4).
Studies have suggested that many host-related factors are associated with diversity and compositions of microbial communities, such as the host’s race/ethnicity (6), genetic background (7), and socioeconomic status (8). Increasing evidence suggests that there are racial differences in microbial profiles (6) of vaginal (9), gut (10), and skin (11) microbiomes. Two studies have investigated the differences in oral microbiome across racial groups (12, 13). In the earlier one (12), two strategies, terminal restriction fragment length polymorphism (t-RFLP) and 16S rRNA gene pyrosequencing, were used to assess microbiota in plaque and saliva samples from 192 individuals of four ethnic affiliations. They found that the oral microbiota of African-Americans (AAs) had lower alpha diversity than that of European-Americans (EAs), Chinese, and Latinos. In the more recent one (13), the 16S rRNA gene was sequenced for the saliva samples of 152 participants from three different climate zones (13). The authors found that both alpha and beta diversity differed significantly among populations from Alaska, Germany, and Africa. However, both studies had a small sample size, and/or a limited number of microorganisms were investigated. Here, we investigated the oral microbiota of 1,616 participants of the Southern Community Cohort Study (SCCS), including 1,058 AAs and 558 EAs.

RESULTS

Characteristics of the study participants.

Table 1 shows the general characteristics of the participants included in the present study. A total of 1,616 individuals were involved, including 1,058 AAs and 558 EAs. Overall, the study participants have low social economic status, with ∼57% of AAs and ∼35% of EAs having an annual household income of <$15,000, and ∼32% of AAs and ∼18% of EAs receiving less than 12 years of education. Among the AAs, ∼39% were current smokers and ∼25% were former smokers, while among EAs, ∼32% were current smokers and ∼38% were former smokers. The AAs had worse oral health status than the EAs, with ∼87% versus ∼75% having lost at least one tooth. Among these lifestyle factors, race, age, alcohol drinking, tooth loss, annual household income, and sequencing batch were associated with Faith’s phylogenetic diversity (PD) index, i.e., the overall phylogenetic richness and evenness of oral microbiota (P < 0.05; see Fig. S1 in the supplemental material).
TABLE 1
TABLE 1 Characteristics of the participants in the Southern Community Cohort Study
CharacteristicAfrican-Americans (n = 1,058)European-Americans (n = 558)
Age (yr), mean ± SD54.72 ± 8.7357.95 ± 8.41
Sex, no. (%)  
    Female549 (51.89)232 (41.58)
    Male509 (48.11)326 (58.42)
BMI (kg/m2), mean ± SD29.76 ± 7.3528.53 ± 6.79
Genetic African ancestry (%), mean ± SD87.39 ± 10.500.27 ± 0.75
Annual household income ($), no. (%)  
    <15,000593 (56.69)192 (35.23)
    15,000–24,999216 (20.65)74 (13.58)
    25,000–49,999150 (14.34)106 (19.45)
    ≥50,00087 (8.32)173 (31.74)
Highest level of education, no. (%)  
    <High school335 (31.72)99 (17.74)
    High/vocational school394 (37.31)175 (31.36)
    Some college191 (18.09)110 (19.71)
    ≥College136 (12.88)174 (31.18)
Alcohol consumption, no. (%)a  
    None492 (47.40)252 (46.32)
    Light295 (28.42)179 (32.90)
    Moderate120 (11.56)60 (11.03)
    Heavy131 (12.62)53 (9.74)
Tobacco smoking, no. (%)  
    Never381 (36.01)166 (29.75)
    Current414 (39.13)178 (31.90)
    Former263 (24.86)214 (38.35)
Tooth loss, no. (%) of participants  
    None85 (13.08)104 (25.12)
    1–10316 (48.62)182 (43.96)
    >10, not all149 (22.92)65 (15.70)
    All100 (15.38)63 (15.22)
a
Light, <1 drink/day; moderate, 1 to 2 drinks/day; heavy, >2 drinks/day.

Difference of overall microbial diversity and composition between AAs and EAs.

The data collection of the present study represented well for both alpha and beta diversity in both AAs and EAs. As shown in Fig. S2, within both AAs and EAs, Faith’s PD index (Fig. S2A and B) and weighted UniFrac distance (Fig. S2C and D) decreased along with the increase in samples size. In addition, when the sample size of AAs reached ∼800 (<final sample size of 1,058) and the sample size of EAs reached ∼450 (<final sample size of 558), both Faith’s PD index and weighted UniFrac distance did not change materially. In the present study, AAs showed higher alpha diversity, i.e., microbial richness and evenness, than EAs, with a P value of 2.83 × 10−15 for Faith’s PD index, as estimated by Wilcoxon rank sum test (Fig. 1A). The significantly higher Faith PD index among AAs than among EAs was consistently observed in the two sequencing batches, with P values (Wilcoxon rank sum test) of 7.68 × 10−4 and 1.52 × 10−4 for the first and the second batch, respectively (Fig. 1A). Along with the increase in hosts’ percentage of genetic African ancestry, the Faith PD index of oral microbiome increased (Fig. 1B). A significant difference in beta diversity was also found between AAs and EAs (Fig. 2A) with P values of 1.50 × 10−3, 1.17 × 10−7, and 3.16 × 10−5 for the weighted UniFrac distance, unweighted UniFrac distance, and Bray-Curtis dissimilarity matrices, respectively, as tested by MiRKAT. The significant differences in beta diversity were also consistent between the two sequencing batches (Fig. 2A), with P values (MiRKAT), for the weighted UniFrac distance, unweighted UniFrac distance, and Bray-Curtis dissimilarity matrices of 0.03, 1.91 × 10−4, and 8.88 × 10−4 for the first batch, and 4.28 × 10−3, 2.50 × 10−6, and 0.01 for the second batch, respectively. Figure 2B showed the F values of other factors on the weighted UniFrac distances in combined analyses and in stratified analyses by the two sequencing batches. In addition, beta diversity was significantly associated with hosts’ percentage of genetic African ancestry, with P values (MiRKAT) of 1.52 × 10−4 for weighted UniFrac distance, <2.20 × 10−16 for unweighted UniFrac distance, and 2.32 × 10−4 for Bray-Curtis dissimilarity distance. The changes in weighted UniFrac distance along with the increase in hosts’ percentage of genetic African ancestry are shown in Fig. 2C.
FIG 1
FIG 1 Significant differences of oral microbial richness and evenness between AAs and EAs. The oral microbial richness and evenness were estimated using Faith’s phylogenetic diversity (PD) index. P values were calculated using Wilcoxon rank sum test. (A) Significant differences of Faith’s PD index were consistently observed in combined analyses and in stratified analyses by the two sequencing batches. (B) Increases in Faith’s PD index along with the increase in hosts’ genetic African ancestry.
FIG 2
FIG 2 Significant differences of overall oral microbiome composition between AAs and EAs. The overall oral microbiome composition was estimated using the weighted UniFrac matrices, and the P values were calculated using MiRKAT. (A) Red circles and blue triangles represent AAs and EAs, respectively. The ellipses and centroids (diamond) of the first two principal-coordinate analyses of weighted UniFrac distances for AAs (red) and EAs (blue) were estimated using the “dataEllipse” function of the R package “car.” (B) F values of other factors on weighted UniFrac distance in combined analyses and in stratified analyses by the two sequencing batches. (C) Changes in the first two principal coordinates of weighted UniFrac distance along with the increase in hosts’ genetic African ancestry.

Common bacterial taxa with a significant differential abundance between AAs and EAs.

Table 2 and Fig. 3A present the 13 taxa with their relative abundances showing a significant racial difference in linear regression analyses after Bonferroni correction. Generally, AAs showed a higher abundance of Bacteroidetes but a lower abundance of Actinobacteria and Firmicutes than EAs. In Bacteroidetes, three taxa, including Porphyromonadaceae, Porphyromonas, and Prevotella denticola, showed a significantly higher relative abundance among AAs, with Bonferroni-corrected P values of 0.03, 0.05, and 1.56 × 10−3, respectively. The phylum Actinobacteria, along with three taxa within it, Micrococcaceae, Rothia, and Rothia mucilaginosa, were found to be significantly more abundant among EAs, with Bonferroni-corrected P values of 6.14 × 10−8, 1.81 × 10−6, 1.14 × 10−6, and 1.21 × 10−5, respectively. In Firmicutes, five taxa, Carnobacteriaceae, Granulicatella, Granulicatella adiacens, Streptococcus oligofermentans, and Streptococcus sp. oral taxon 057, were more abundant among EAs, with Bonferroni-corrected P values of 0.03, 0.02, 0.03, 7.88 × 10−5, and 0.03, respectively. Another taxon of this phylum, Peptostreptococcaceae, showed a higher abundance among AAs (Bonferroni-corrected P = 2.35 × 10−4). The effect sizes of participants’ lifestyle factors on these 13 significant associations are shown in Table S1 and Fig. S3. All of these 13 taxa were also associated with hosts’ percentage of genetic African ancestry at P < 0.05 (Table 2, Fig. 3B, and Fig. S4). In addition, 10 of these 13 taxa showed a consistent significant differential abundance between AAs and EAs in both sequencing batches (Table S2).
TABLE 2
TABLE 2 Significantly higher abundance of Bacteroidetes and lower abundance of Actinobacteria and Firmicutes among African-Americans
TaxonAmong all participants (n = 1,616)Genetic African ancestry percentage (n = 882)
% relative abundanceCoefficientaP valuebCoefficientaP valuea
European-
Americans
African-
Americans
Phylum Bacteroidetes      
    Family Porphyromonadaceae0.190.670.380.030.884.81 × 10−6
        Genus Porphyromonas0.180.640.370.050.851.91 × 10−5
    Species Prevotella denticola0.040.110.491.56 × 10−30.871.22 × 10−5
    
Phylum Actinobacteria11.0510.09−0.386.14 × 10−8−0.462.78 × 10−6
    Family Micrococcaceae5.775.47−0.411.81 × 10−6−0.398.65 × 10−4
        Genus Rothia5.775.47−0.421.14 × 10−6−0.391.04 × 10−3
            Species Rothia mucilaginosa4.804.65−0.411.21 × 10−5−0.355.91 × 10−3
    
Phylum Firmicutes      
    Family Carnobacteriaceae1.351.13−0.220.03−0.260.02
        Genus Granulicatella1.321.09−0.240.02−0.250.03
            Species Granulicatella adiacens1.281.04−0.230.03−0.250.03
    Species Streptococcus oligofermentans0.460.22−0.427.88 × 10−5−0.488.50 × 10−4
    Species Streptococcus sp. oral taxon 0579.898.85−0.170.03−0.292.77 × 10−4
    Family Peptostreptococcaceae0.090.200.412.35 × 10−40.665.48 × 10−6
a
For each sample, centered log-ratio transformation was used to normalize taxon read counts. The associations of taxon abundance with race/African ancestry percentage were evaluated using linear regression analyses.
b
Bonferroni-corrected P values, adjusted for 25 independent tests.
FIG 3
FIG 3 Thirteen common bacterial taxa showing a significant differential abundance between AAs and EAs in linear regression analyses. (A) The associations of taxon abundance and race were evaluated using linear regression analyses. *, **, and *** represent the Bonferroni-corrected P values <0.05, <0.01, and <0.001, respectively. (B) Changes in centered-log-ratio-transformed relative abundance of three common taxa along with the increase in hosts’ genetic African ancestry. Similar plots for other common taxa shown in Table 2 are presented in Fig. S4.

Rare bacterial taxa with a significant differential prevalence between AAs and EAs.

In total, 19 rare taxa showed a significant differential prevalence between AAs and EAs in logistic regression analyses, and all of them were more prevalent among AAs (Table 3 and Fig. 4A). Among them, interestingly, four periodontal pathogens, including Porphyromonas gingivalis, Prevotella intermedia, Treponema denticola, and Filifactor alocis, were significantly more prevalent among AAs than among EAs, with Bonferroni-corrected P values of 5.23 × 10−6, 4.47 × 10−6, 1.08 × 10−3, and 4.49 × 10−5, respectively. Among the remaining 15 taxa, two belonging to Bacteroidetes, including Porphyromonas sp. oral taxon 285 and Prevotella sp. oral taxon 526, were more prevalent among AAs with Bonferroni-corrected P values of 8.02 × 10−3 and 3.32 × 10−5, respectively. In Firmicutes, two families, four genera, and five species were more prevalent among AAs. Among them, the species Eubacterium minutum, carried by 32.1% of EAs and 53.9% of AAs, showed the most significant differential prevalence, with Bonferroni-corrected P = 2.36 × 10−6. The phylum SR1 and a species of the phylum Spirochaetes, Treponema medium, showed a significantly higher prevalence among AAs with Bonferroni-corrected P values of 0.02 and 6.32 × 10−4, respectively. The effect sizes of lifestyle factors, which were adjusted in logistic regression analyses, on these 19 significant associations are presented in Table S3 and Fig. S5. All of these 19 taxa were positively associated with the percentage of genetic African ancestry at P < 0.05 (Table 3, Fig. 4B, and Fig. S6). In addition, the significant higher prevalence of all these 19 taxa among AAs was also consistently observed in both sequencing batches (Table S4).
TABLE 3
TABLE 3 Taxa showing a significant different prevalence between African-Americans and European-Americans
TaxonAmong all participants (n = 1,616)Genetic African ancestry percentage (n = 882)
% prevalenceCoefficientaP valuebCoefficientaP valuea
European-
Americans
African-
Americans
Phylum Bacteroidetes      
    Species Porphyromonas gingivalis57.5376.090.755.23 × 10−61.204.05 × 10−7
    Species Porphyromonas sp. oral taxon 28556.4571.170.538.02 × 10−30.804.22 × 10−4
    Species Prevotella intermedia61.1176.090.794.47 × 10−60.824.95 × 10−4
        Species Prevotella sp. oral taxon 52627.7850.280.703.32 × 10−50.999.20 × 10−6
    
Phylum Firmicutes      
    Genus Peptococcus39.6155.010.666.30 × 10−51.152.52 × 10−7
    Family Peptoniphilaceae85.8492.160.770.020.860.01
    Species Filifactor alocis42.4764.180.674.49 × 10−50.878.28 × 10−5
    Genus Eubacterium31.0049.720.607.08 × 10−40.596.49 × 10−3
        Species Eubacterium saphenum29.5747.350.510.010.440.05
        Species Eubacterium minutum32.0853.880.752.36 × 10−60.756.91 × 10−4
    Genus Peptostreptococcus75.0984.780.859.94 × 10−61.331.35 × 10−6
        Species Peptostreptococcus stomatis74.7384.500.875.67 × 10−61.367.37 × 10−7
    Family Erysipelotrichaceae86.7492.340.760.031.317.50 × 10−4
    Genus Mycoplasma32.8051.510.500.020.831.64 × 10−4
        Species Mycoplasma faucium32.4450.760.480.030.765.64 × 10−4
    Species Veillonella sp. oral taxon 78069.8978.070.742.31 × 10−41.064.99 × 10−5
 
Phylum Spirochaetes      
    Species Treponema denticola46.2464.930.591.08 × 10−30.822.26 × 10−4
    Species Treponema medium42.4761.910.616.32 × 10−40.942.91 × 10−5
 
Phylum SR133.3348.200.480.020.921.86 × 10−5
a
For each taxon, individuals were categorized into carriers and noncarriers according to whether they carried the taxon or not. The associations of taxon prevalence with race/African ancestry percentage were evaluated using logistic regression analyses.
b
Bonferroni-corrected P values, adjusted for 71 independent tests.
FIG 4
FIG 4 Nineteen rare bacterial taxa showing a significant differential prevalence between AAs and EAs in logistic regression analyses. (A) The associations of taxon prevalence and race were evaluated using logistic regression analyses. *, **, and *** represent Bonferroni-corrected P values of <0.05, <0.01, and <0.001, respectively. (B) Increases in prevalence of four rare taxa along with the increase in hosts’ genetic African ancestry. Similar plots for other rare taxa shown in Table 3 are presented in Fig. S6.

DISCUSSION

Previous studies have demonstrated the racial differences in the human microbiome, with most studies focusing on the microbiota of the gastrointestinal tract (14, 15), skin (16), and vagina (9, 17). Several studies have also implied the racial differences in the oral microbiome (6, 18, 19). However, these studies had limited sample sizes and the oral microbiome of AAs was not well studied. In the study presented here, we observed significant differences in overall microbial diversity and composition between AAs and EAs and found multiple bacterial taxa, including several preidentified oral pathogens, that showed a significantly different abundance or prevalence between the two racial groups.
In the present study, significant differences in overall microbial composition were observed between AAs and EAs, which were consistent with the results from two previous studies (12, 13). In the first one (13), the saliva microbiome was profiled from 74 native Alaskans, 10 Germans, and 66 Africans. The authors found that Africans had a significant different microbial composition from native Alaskans and Germans. Similarly, in the other study (12) investigating the subgingival microbiome of AAs, EAs, Chinese, and Latinos, a significant difference in overall microbial composition was observed between AAs and EAs. We also found that AAs showed a higher alpha diversity than EAs. However, in both of those studies, AAs showed a lower alpha diversity. This inconsistency could have two potential explanations. First, both of those studies were conducted with very small sample sizes, including 10 to 74 individuals within each group, which is substantially smaller than that (1,058 AAs and 558 EAs) of the present study. Due to the very small sample size, many bacteria of low abundance/prevalence could not be detected, which affected the estimation of microbial diversity. On the other hand, in one study (13), the average sequencing depth was only ∼441 reads per sample, which is much lower than that of the present study (75,021 reads per sample). In the other study (12), microbiomes were profiled using two strategies, i.e., terminal restriction fragment length polymorphism (t-RFLP) and 16S rRNA gene pyrosequencing The latter one, which has higher resolution power, was used for only a portion of the participants, which would have affected the accuracy of taxonomic assignment and then the diversity estimation.
In addition to the difference in overall microbial composition, we found 13 common taxa showing a differential abundance between AAs and EAs. Especially, AAs had a higher abundance of Bacteroidetes and a lower abundance of Actinobacteria and Firmicutes. In the above-mentioned saliva microbiome study (13), several genera showed a significantly different abundance in comparing Africans with native Alaskans and Germans. Several of them, including the higher abundance of Porphyromonas and the lower abundance of Rothia and Granulicatella among Africans, were consistent with results of the present study. In addition, the lower abundance of Rothia among AAs was also reported by the above-mentioned subgingival microbiome study (12). No studies have investigated the racial differences of the remaining taxa; hence, a comparison could not be made. Among these 16 taxa, several have been associated with diseases. For example, Actinobacteria was reported to be associated with a decreased risk of type 2 diabetes (20). Granulicatella adiacens (21) and Streptococcus oligofermentans (22) were found to be associated with infective endocarditis.
We also found 19 rare taxa that showed a significantly higher prevalence among AAs. Among them, four species, Porphyromonas gingivalis, Prevotella intermedia, Treponema denticola, and Filifactor alocis, have been established to be involved in the pathogenesis of a variety of forms of periodontal diseases (23, 24). Studies have shown a racial disparity in periodontal disease, which is highly correlated with oral bacterial pathogens (25). Several studies have reported that older AAs have more missing and decayed teeth than EAs (26, 27). In addition, data from the National Health and Nutrition Examination Survey (NHANES) showed a 20% greater prevalence of periodontitis (28) and 25% higher rates of dental caries (29) among older AAs (aged 65 years or older) than among older EAs. The differential prevalence of these four oral pathogens may, to some extent, contribute to the disparity of oral health between AAs and EAs. In addition to these oral pathogens, another 15 rare taxa were more prevalent among AAs as well. An earlier study, using 16S rRNA gene cloning and sequencing, found several genera, including Peptostreptococcus, associated with periodontitis (30). In addition, one of the species of this genus, Peptostreptococcus stomatis, was observed in peri-implantitis by two recent studies (31, 32). Therefore, the overprevalence of these two taxa might also have contributed to the worse oral health status among AAs than among EAs. However, given that oral hygiene may also contribute to the oral health disparity between the two racial groups but oral hygiene data were not collected from study participants, we could not eliminate the possibility that the enrichment of these periodontal disease-related bacteria in AAs may be attributed to the differences in oral hygiene between AAs and EAs.
To the best of our knowledge, this study is the largest to explore racial differences in the oral microbiome. 16S rRNA gene sequencing was utilized to profile the oral microbiota, which has better resolution than traditional techniques, such as probe-based DNA-DNA hybridization, used in earlier studies. In addition, we adjusted for a variety of covariates among all statistical analyses, making the findings of this study reflect, to the greatest extent, the relationship between oral microbiota and racial affiliation. Further, the availability of genetic data for a portion of study participants made our study the first to evaluate the associations of hosts’ genetic African ancestry with the oral microbiome. A limitation of this study is that it lacks a comprehensive oral health assessment at the baseline examination during the enrollment. In addition, for each of the participants, only one mouth rinse sample was collected; hence, our findings may be impacted by the potential misclassification bias. Further, it is well acknowledged that though 16S rRNA sequencing can provide a stable and accurate resolution for microbiota at the genus level, the species-level profiling was not optimal. Future studies employing the shotgun metagenomic sequencing technology will be needed to fill this gap.
In summary, we found that there were significant differences of overall oral microbiota composition, as well as individual bacterial taxon abundance/prevalence, between AAs and EAs. These results suggest the potential role of oral microbiome in health disparity. The causal mechanisms and factors shaping this difference warrant further investigation in larger sample sizes and with better microbiome profiling techniques.

MATERIALS AND METHODS

Study population and data collection.

The SCCS is a prospective study designed to explore health disparities in low-income populations. Details of the study have been described elsewhere (33). Briefly, more than 85,000 adults, aged 40 to 70, were recruited during 2002 to 2009 from 12 states in the southeastern United States, with two-thirds of the participants being AAs. At the enrollment, mouth rinse samples were collected from ∼34,100 participants. Written informed consent was obtained from all study participants. The SCCS was reviewed and approved by Vanderbilt University Medical Center and Meharry Medical College.
During enrollment, the baseline survey was taken by all participants through the filing of a comprehensive questionnaire to gather individuals’ basic information, including age, race/ethnicity, sex, education level, income, lifestyle, anthropometric features, disease history, and so forth. After recruitment, study participants were followed up by using record linkage and mail- or telephone-based surveys. Health-related outcomes were determined from National Death Index mortality records and/or through linkage with state cancer registries.
The present study included participants who provided mouth rinse samples during the study enrollment and were involved in four nested case-control studies to investigate the oral microbiome and incident cases of colorectal cancer, type 2 diabetes, lung cancer, and upper aerodigestive tract cancer. All participants were free of any diseases at the time of mouth rinse sample donation. After excluding participants with a self-reported history of antibiotic usage during the year before biospecimen collection, 1,616 individuals were included in the present study.

16S rRNA gene sequencing.

DNA was extracted from mouth rinse samples using Qiagen’s QIAamp DNA kit (Qiagen Inc., Germantown, MD, USA). The NEXTflex 16S V4 Amplicon-Seq kit (Bioo Scientific, Austin, TX, USA) was used to build a library to sequence 253 bp of the V4 domain of the 16S rRNA gene. The data were generated in two batches. For the first batch, 150-bp paired-end sequencing was performed using the Illumina MiSeq 300 at the Vanderbilt Technologies for Advanced Genomics (VANTAGe) Core. For the second batch, 250-bp paired-end sequencing was conducted via the Illumina HiSeq System at BGI Americas (Cambridge, MA, USA). For both batches, each 96-well plate, including an additional negative-control sample and two duplicated quality control (QC) samples, was sequenced. All duplicated samples showed comparable microbial profiles. For example, for the overall microbial richness (alpha diversity measured by Faith’s phylogenetic diversity [PD] index), the coefficient of variability (CV) among the repeated QC samples is 1.7%. For the relative abundance of individual taxa, the median of the Spearman correlation coefficients between the duplicated QC samples is 98.6%.

Sequencing data processing and quality controls.

For 16S rRNA sequencing data, Sickle (v1.33), BayesHammer, and PANDAseq (v2.10) were used successively to perform low-quality read trimming and removal, sequencing error correction, and paired-end read assembly (34). Then, the merged high-quality reads were processed by Quantitative Insights Into Microbial Ecology (QIIME; v1.9.1). The Human Oral Microbiome Database (HOMD) was used as reference. UCLUST (v1.2.22q) was used for clustering with 97% sequence similarity as the threshold. Those operational taxonomic units (OTUs) observed in fewer than two samples were highly unreliable; hence, they were excluded. Then, the OTU table was summarized to microbial taxon levels.

Statistical analysis.

For the microbial richness, i.e., alpha diversity, Faith’s PD index was calculated. We first evaluated the associations of participants’ lifestyle factors with alpha diversity through linear regression analyses. Then, the difference of the alpha diversity between AAs and EAs was estimated by the Wilcoxon rank sum test. For the overall microbial composition, i.e., beta diversity, the weighted UniFrac distance, unweighted UniFrac distance, and Bray-Curtis dissimilarity matrices were generated. The beta diversity between AAs and EAs was evaluated through the regression-based kernel method, implemented in MiRKAT (35) (v0.02). We also evaluated whether our data collection was representative enough for both alpha and beta diversity through estimating the changes in alpha and beta diversity along with the increase in number of samples within AAs and EAs, respectively.
For individual taxa, we tested the difference of the relative abundance and/or prevalence at the phylum, family, genus, and species levels between AAs and EAs. First, we investigated the taxa with a relative abundance of >0.10% among AAs, namely, common taxa, including five phyla, 15 families, 16 genera, and 29 species. For each sample, centered log-ratio (clr) transformation was used to normalize taxon read counts. Then, linear regression analysis was conducted with transformed abundance data as outcome and race as independent variables. For those taxa with a relative abundance of ≤0.10% in AAs, namely, rare taxa, we tested their differential prevalence between AAs and EAs via logistic regression. Due to the limited power for the very rare taxa, only those with a prevalence of >30% (with a non-zero read count in >30% of the participants) among AAs were included in the analyses, including four phyla, 19 families, 42 genera, and 102 species.
Genome-wide single nucleotide polymorphism (SNP) array data were available for 397 of the 1,616 individuals, including 324 AAs and 73 EAs, and were used to estimate the percentage of genetic African ancestry for these 397 participants in our previous studies (3638). Briefly, autosomal common SNPs (minor allele frequency > 0.05) with low linkage disequilibrium (pairwise r2 < 0.10) were used to estimate the genetic African ancestry, utilizing ADMIXTURE (v1.3.0). We then evaluated the association of the genetic African ancestry percentage with taxon relative abundance or prevalence using linear regression. Among the 73 self-reported EAs, the average African ancestry was only 0.27%. Hence, we included the remaining 485 EAs without genetic data (assuming the African ancestry percentage being 0) in the analyses, resulting in 882 participants in total.
During all of the statistical analyses, we adjusted for the following variants: age, sex, body mass index (BMI), smoking, alcohol consumption, total energy intake, tooth loss, annual household income, state of enrollment, disease status during the first follow-up, and sequencing batch. Among them, BMI and age were treated as continuous variables, and all the other categorical factors were treated as dummy variables, including sex (men and women), smoking (current, former, and never-smoker), alcohol consumption (ever-drinker, never-drinker, and missing), total energy intake (first tertile, second tertile, third tertile, and missing), tooth loss (no teeth lost, loss of 1 to 10 teeth, loss of >10 but not all teeth, loss of all teeth, and missing), annual household income (<$15,000, $15,000 to $50,000, >$50,000, and missing), state of enrollment (12 U.S. states), disease status during the first follow-up (any diseases and no disease), and sequencing batch (first and second batch). Among the factors with missing values, the missing rate is high only for tooth loss (∼34%) and low for all of the others, i.e., ∼5% for total energy intake and ∼1% for annual household income. The microbial taxa at different taxonomic levels are highly correlated. Therefore, Bonferroni correction is too conservative to correct multiple testing. To address this, we used a method described by Galwey (39), implemented in the R package “poolR” (v0.1-0) (https://github.com/ozancinar/poolR/), to evaluate the number of effective tests for common taxa and rare taxa separately. All P values were then corrected for multiple testing based on the estimated number of effective tests. For the alpha diversity index, beta diversity matrices, and bacterial taxa that were significantly associated with race, we further conducted stratified analyses by sequencing batch to evaluate the consistency of these associations between batches. All analyses in the present study were carried out using R (v3.3.1) and Python (v2.7.8).

Data availability.

The 16S rRNA sequencing data used in the present study can be requested through the SCCS online request system (https://www.southerncommunitystudy.org/research-opportunities.html). The R scripts used for statistical analyses of the present study are available in GitHub (https://github.com/YaohuaYangVEC/Codes-for-the-mSystem-paper).

ACKNOWLEDGMENTS

We thank all of the individuals who took part in the study and all of the researchers, clinicians, technicians, and administrative staff who enabled this work to be carried out. We thank Regina Courtney, Jie Wu, Jing He, and Marshal Younger for their help with sample preparation, statistical analysis, and technical support for the project. The data analyses were conducted using the Advanced Computing Center for Research and Education (ACCRE) at Vanderbilt University.
Sample preparation was conducted at the Survey and Biospecimen Shared Resources, which is supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA68485). The SCCS was supported by NIH grant R01CA92447 and U01CA202979. This project was also supported by the development fund from the Department of Medicine at Vanderbilt University Medical Center and the NIH-supported grants R01CA207466, R01CA204113, and U54CA163072. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
The authors declare no conflict of interest.

Supplemental Material

File (msystems.00639-19-sf001.pdf)
File (msystems.00639-19-sf002.pdf)
File (msystems.00639-19-sf003.pdf)
File (msystems.00639-19-sf004.pdf)
File (msystems.00639-19-sf005.pdf)
File (msystems.00639-19-sf006.pdf)
File (msystems.00639-19-st001.pdf)
File (msystems.00639-19-st002.pdf)
File (msystems.00639-19-st003.pdf)
File (msystems.00639-19-st004.pdf)
ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.

REFERENCES

1.
Human Microbiome Project Consortium. 2012. Structure, function and diversity of the healthy human microbiome. Nature 486:207–214.
2.
Nicholson JK, Holmes E, Kinross J, Burcelin R, Gibson G, Jia W, Pettersson S. 2012. Host-gut microbiota metabolic interactions. Science 336:1262–1267.
3.
Cho I, Blaser MJ. 2012. The human microbiome: at the interface of health and disease. Nat Rev Genet 13:260–270.
4.
Wade WG. 2013. The oral microbiome in health and disease. Pharmacol Res 69:137–143.
5.
Aas JA, Paster BJ, Stokes LN, Olsen I, Dewhirst FE. 2005. Defining the normal bacterial flora of the oral cavity. J Clin Microbiol 43:5721–5732.
6.
Gupta VK, Paul S, Dutta C. 2017. Geography, ethnicity or subsistence-specific variations in human microbiome composition and diversity. Front Microbiol 8:1162.
7.
Bonder MJ, Kurilshikov A, Tigchelaar EF, Mujagic Z, Imhann F, Vila AV, Deelen P, Vatanen T, Schirmer M, Smeekens SP, Zhernakova DV, Jankipersadsing SA, Jaeger M, Oosting M, Cenit MC, Masclee AAM, Swertz MA, Li Y, Kumar V, Joosten L, Harmsen H, Weersma RK, Franke L, Hofker MH, Xavier RJ, Jonkers D, Netea MG, Wijmenga C, Fu J, Zhernakova A. 2016. The effect of host genetics on the gut microbiome. Nat Genet 48:1407–1412.
8.
Belstrøm D, Holmstrup P, Nielsen CH, Kirkby N, Twetman S, Heitmann BL, Klepac-Ceraj V, Paster BJ, Fiehn N-E. 2014. Bacterial profiles of saliva in relation to diet, lifestyle factors, and socioeconomic status. J Oral Microbiol 6:23609.
9.
Fettweis JM, Brooks JP, Serrano MG, Sheth NU, Girerd PH, Edwards DJ, Strauss JF, The Vaginal Microbiome Consortium, Jefferson KK, Buck GA. 2014. Differences in vaginal microbiome in African American women versus women of European ancestry. Microbiology 160:2272–2282.
10.
Deschasaux M, Bouter KE, Prodan A, Levin E, Groen AK, Herrema H, Tremaroli V, Bakker GJ, Attaye I, Pinto-Sietsma S-J, van Raalte DH, Snijder MB, Nicolaou M, Peters R, Zwinderman AH, Bäckhed F, Nieuwdorp M. 2018. Depicting the composition of gut microbiota in a population with varied ethnic origins but shared geography. Nat Med 24:1526–1531.
11.
Leung MH, Wilkins D, Lee PK. 2015. Insights into the pan-microbiome: skin microbial communities of Chinese individuals differ from other racial groups. Sci Rep 5:11845.
12.
Mason MR, Nagaraja HN, Camerlengo T, Joshi V, Kumar PS. 2013. Deep sequencing identifies ethnicity-specific bacterial signatures in the oral microbiome. PLoS One 8:e77287.
13.
Li J, Quinque D, Horz H-P, Li M, Rzhetskaya M, Raff JA, Hayes MG, Stoneking M. 2014. Comparative analysis of the human saliva microbiome from different climate zones: Alaska, Germany, and Africa. BMC Microbiol 14:316.
14.
De Filippo C, Cavalieri D, Di Paola M, Ramazzotti M, Poullet JB, Massart S, Collini S, Pieraccini G, Lionetti P. 2010. Impact of diet in shaping gut microbiota revealed by a comparative study in children from Europe and rural Africa. Proc Natl Acad Sci U S A 107:14691–14696.
15.
Sankaranarayanan K, Ozga AT, Warinner C, Tito RY, Obregon-Tito AJ, Xu J, Gaffney PM, Jervis LL, Cox D, Stephens L, Foster M, Tallbull G, Spicer P, Lewis CM. 2015. Gut microbiome diversity among Cheyenne and Arapaho individuals from western Oklahoma. Curr Biol 25:3161–3169.
16.
Blaser MJ, Dominguez-Bello MG, Contreras M, Magris M, Hidalgo G, Estrada I, Gao Z, Clemente JC, Costello EK, Knight R. 2013. Distinct cutaneous bacterial assemblages in a sampling of South American Amerindians and US residents. ISME J 7:85–95.
17.
Zhou X, Brown CJ, Abdo Z, Davis CC, Hansmann MA, Joyce P, Foster JA, Forney LJ. 2007. Differences in the composition of vaginal microbial communities found in healthy Caucasian and black women. ISME J 1:121–133.
18.
Nasidze I, Li J, Quinque D, Tang K, Stoneking M. 2009. Global diversity in the human salivary microbiome. Genome Res 19:636–643.
19.
McDonald D, Hyde E, Debelius JW, Morton JT, Gonzalez A, Ackermann G, Aksenov AA, Behsaz B, Brennan C, Chen Y, DeRight Goldasich L, Dorrestein PC, Dunn RR, Fahimipour AK, Gaffney J, Gilbert JA, Gogul G, Green JL, Hugenholtz P, Humphrey G, Huttenhower C, Jackson MA, Janssen S, Jeste DV, Jiang L, Kelley ST, Knights D, Kosciolek T, Ladau J, Leach J, Marotz C, Meleshko D, Melnik AV, Metcalf JL, Mohimani H, Montassier E, Navas-Molina J, Nguyen TT, Peddada S, Pevzner P, Pollard KS, Rahnavard G, Robbins-Pianka A, Sangwan N, Shorenstein J, Smarr L, Song SJ, Spector T, Swafford AD, Thackray VG, Thompson LR, Tripathi A, Vázquez-Baeza Y, Vrbanac A, Wischmeyer P, Wolfe E, Zhu Q, American Gut Consortium, Knight R. 2018. American Gut: an open platform for citizen science microbiome research. mSystems 3:e00031-18.
20.
Long J, Cai Q, Steinwandel M, Hargreaves MK, Bordenstein SR, Blot WJ, Zheng W, Shu XO. 2017. Association of oral microbiome with type 2 diabetes risk. J Periodontal Res 52:636–643.
21.
Adam EL, Siciliano RF, Gualandro DM, Calderaro D, Issa VS, Rossi F, Caramelli B, Mansur AJ, Strabelli T. 2015. Case series of infective endocarditis caused by Granulicatella species. Int J Infect Dis 31:56–58.
22.
Matta M, Gousseff M, Monsel F, Poyart C, Diebold B, Podglajen I, Mainardi J-L. 2009. First case of Streptococcus oligofermentans endocarditis determined based on sodA gene sequences after amplification directly from valvular samples. J Clin Microbiol 47:855–856.
23.
Newman MG, Takei H, Klokkevold PR, Carranza FA. 2011. Carranza’s clinical periodontology. Elsevier Health Sciences, New York, NY.
24.
Aruni AW, Mishra A, Dou Y, Chioma O, Hamilton BN, Fletcher HM. 2015. Filifactor alocis—a new emerging periodontal pathogen. Microbes Infect 17:517–530.
25.
Cruz GD, Chen Y, Salazar CR, Le Geros RZ. 2009. The association of immigration and acculturation attributes with oral health among immigrants in New York City. Am J Public Health 99:S474–S480.
26.
Quandt SA, Chen H, Bell RA, Anderson AM, Savoca MR, Kohrman T, Gilbert GH, Arcury TA. 2009. Disparities in oral health status between older adults in a multiethnic rural community: the rural nutrition and oral health study. J Am Geriatr Soc 57:1369–1375.
27.
Wu B, Liang J, Plassman BL, Remle RC, Bai L. 2011. Oral health among white, black, and Mexican‐American elders: an examination of edentulism and dental caries. J Public Health Dent 71:308–317.
28.
Eke PI, Dye BA, Wei L, Slade GD, Thornton-Evans GO, Borgnakke WS, Taylor GW, Page RC, Beck JD, Genco RJ. 2015. Update on prevalence of periodontitis in adults in the United States: NHANES 2009 to 2012. J Periodontol 86:611–622.
29.
Dye BA, Thornton-Evans G, Li X, Iafolla T. 2015. Dental caries and tooth loss in adults in the United States, 2011–2012. National Center for Health Statistics, Centers for Disease Control and Prevention, US Department of Health and Human Services, Atlanta, GA.
30.
Kumar PS, Griffen AL, Moeschberger ML, Leys EJ. 2005. Identification of candidate periodontal pathogens and beneficial species by quantitative 16S clonal analysis. J Clin Microbiol 43:3944–3955.
31.
Koyanagi T, Sakamoto M, Takeuchi Y, Maruyama N, Ohkuma M, Izumi Y. 2013. Comprehensive microbiological findings in peri‐implantitis and periodontitis. J Clin Periodontol 40:218–226.
32.
Koyanagi T, Sakamoto M, Takeuchi Y, Ohkuma M, Izumi Y. 2010. Analysis of microbiota associated with peri-implantitis using 16S rRNA gene clone library. J Oral Microbiol 2:5104.
33.
Signorello LB, Hargreaves MK, Blot WJ. 2010. The Southern Community Cohort Study: investigating health disparities. J Health Care Poor Underserved 21(1 Suppl):26–37.
34.
Schirmer M, Ijaz UZ, D’Amore R, Hall N, Sloan WT, Quince C. 2015. Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform. Nucleic Acids Res 43:e37.
35.
Zhao N, Chen J, Carroll IM, Ringel-Kulka T, Epstein MP, Zhou H, Zhou JJ, Ringel Y, Li H, Wu MC. 2015. Testing in microbiome-profiling studies with MiRKAT, the microbiome regression-based kernel association test. Am J Hum Genet 96:797–807.
36.
Long J, Zhang B, Signorello LB, Cai Q, Deming-Halverson S, Shrubsole MJ, Sanderson M, Dennis J, Michailidou K, Easton DF, Shu X-O, Blot WJ, Zheng W. 2013. Evaluating genome-wide association study-identified breast cancer risk variants in African-American women. PLoS One 8:e58350.
37.
Long J, Edwards T, Signorello LB, Cai Q, Zheng W, Shu X-O, Blot WJ. 2012. Evaluation of genome-wide association study-identified type 2 diabetes loci in African Americans. Am J Epidemiol 176:995–1001.
38.
Bhatia G, Tandon A, Patterson N, Aldrich MC, Ambrosone CB, Amos C, Bandera EV, Berndt SI, Bernstein L, Blot WJ, Bock CH, Caporaso N, Casey G, Deming SL, Diver WR, Gapstur SM, Gillanders EM, Harris CC, Henderson BE, Ingles SA, Isaacs W, De Jager PL, John EM, Kittles RA, Larkin E, McNeill LH, Millikan RC, Murphy A, Neslund-Dudas C, Nyante S, Press MF, Rodriguez-Gil JL, Rybicki BA, Schwartz AG, Signorello LB, Spitz M, Strom SS, Tucker MA, Wiencke JK, Witte JS, Wu X, Yamamura Y, Zanetti KA, Zheng W, Ziegler RG, Chanock SJ, Haiman CA, Reich D, Price AL. 2014. Genome-wide scan of 29,141 African Americans finds no evidence of directional selection since admixture. Am J Hum Genet 95:437–444.
39.
Galwey NW. 2009. A new measure of the effective number of tests, a practical tool for comparing families of non‐independent significance tests. Genet Epidemiol 33:559–568.

Information & Contributors

Information

Published In

cover image mSystems
mSystems
Volume 4Number 617 December 2019
eLocator: 10.1128/msystems.00639-19
Editor: Jian Xu, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences

History

Received: 2 October 2019
Accepted: 4 November 2019
Published online: 26 November 2019

Keywords

  1. oral microbiome
  2. racial difference

Contributors

Authors

Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, Tennessee, USA
Wei Zheng
Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, Tennessee, USA
Qiuyin Cai
Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, Tennessee, USA
Martha J. Shrubsole
Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, Tennessee, USA
Zhiheng Pei
Department of Pathology, New York University School of Medicine, New York, New York, USA
Rowland Institute, Harvard University, Cambridge, Massachusetts, USA
Mark Steinwandel
International Epidemiology Field Station, Vanderbilt University Medical Center, Rockville, Maryland, USA
Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, USA
Department of Pathology, Microbiology, and Immunology, Vanderbilt University, Nashville, Tennessee, USA
Zhigang Li
Department of Biostatistics, University of Florida, Gainesville, Florida, USA
William J. Blot
Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, Tennessee, USA
Xiao-Ou Shu
Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, Tennessee, USA
Jirong Long
Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, Tennessee, USA

Editor

Jian Xu
Editor
Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences

Notes

Address correspondence to Jirong Long, [email protected].

Metrics & Citations

Metrics

Note:

  • For recently published articles, the TOTAL download count will appear as zero until a new month starts.
  • There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.
  • Citation counts come from the Crossref Cited by service.

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. For an editable text file, please select Medlars format which will download as a .txt file. Simply select your manager software from the list below and click Download.

View Options

Figures and Media

Figures

Media

Tables

Share

Share

Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy