INTRODUCTION
Until recently, it has been generally held that the upper respiratory tract contains abundant bacteria while the lower respiratory tract is sterile when healthy (
1–4). In light of this belief and the invasive nature of lower respiratory tract sampling by bronchoalveolar lavage (BAL), the Human Microbiome Project (HMP) did not include the lower respiratory tract in its microbiome surveys (
5). However, the concept of lung sterility during health would be in sharp contrast to earlier evidence, some of which was described in the 1920s and was obtained by using radiotracers, that pharyngeal microaspiration is common in healthy individuals (
6–8). Previous studies based on culture-independent molecular techniques have analyzed the lung bacterial microbiomes of healthy adult subjects obtained via BAL and readily identified bacterial sequences in BAL fluid, with the most common bacterial phyla being
Bacteroidetes,
Firmicutes, and
Proteobacteria (
3,
9–14). The prominent genera in BAL fluid samples from healthy subjects include Prevotella, Veillonella, and Streptococcus. These genera are also detected in concurrently collected oral samples, but their relative abundance in BAL fluid generally differs from that in oral samples.
On the basis of accumulating data from many laboratories, we have recently proposed the concept that the entire respiratory tract should be considered a single ecosystem extending from the nasal and oral cavities to the alveoli, which includes gradients and niches that modulate microbiome dispersion, retention, survival, and proliferation (
2,
15). Thus, the composition of the lung microbiome will be determined by three factors: immigration, elimination, and the relative growth rates of its members. Anatomical and physiological considerations suggest that this model is also applicable to the microbiota of the stomach, with key differences. Just as the oral cavity is the site of the primary source community of the lung microbiota (acquired via microaspiration and inhalation), it is likewise the site of the primary source community of the gastric microbiota (acquired via swallowing). Direct mucosal dispersion of microbes from the nasal and oral cavities to both the lungs and the stomach, as parts of the contiguous aerodigestive tract, can occur (
Fig. 1). The epiglottis acts as a major barrier to prevent saliva, food, and liquid from flowing into the trachea, diverting it to the esophagus. In contrast, microaerosols generated in the oral cavity will pass the epiglottis and be inhaled. The total flow of saliva out of the mouth via swallowing (roughly a liter per day) greatly exceeds that of microaspiration during health. Thus, the rate of immigration of the microbiome from the mouth to the stomach should far exceed that from the mouth to the airways. Similarly, the volume of saliva produced in the mouth greatly exceeds the secretions produced by the nasal mucosa. Thus, the “downstream” communities of the lungs and stomach would be expected to more closely resemble the source community of the mouth than that of the nose.
Despite these theoretical models, no studies have examined the relationships within an individual subject among the mouth, nasopharynx, lung, and stomach microbiotas in terms of species richness and the relative abundance of the bacterial microbiotas at those sites. Because it is well documented that there is significant subject-to-subject variation in their microbiome (
3,
9–14), we collected (and compared) all of these samples from each study subject, thereby allowing each individual to serve as his/her own control. Our objective in performing this type of intrasubject and intersite analysis was to determine the contributions of the two upper mucosal sites (mouth and nose) as source communities to the bacterial microbiomes of two lower sites (lungs and stomach).
DISCUSSION
This study, which analyzed the upper aerodigestive tract of 28 healthy human subjects, is the first to compare the mouth, nasal, gastric, and lung bacterial microbiota within the same individual. We demonstrated that the bacterial communities of the healthy lung overlapped those found in the mouth but were found at lower concentrations, with lower membership and a different community composition. The nasal microbiome, which was distinct from the oral microbiome, appeared to contribute little to the composition of the lung microbiome in healthy subjects. The stomach microbiome was far more similar in all study subjects to that of the mouth in bacterial concentrations, membership/diversity, and composition, as expected, due to swallowing of saliva (and bacteria) from the mouth. Intrasubject analysis of Prevotella OTU abundances in the mouth compared to the lungs and stomach supports the concept that the mouth is a major source community for the lungs and stomach microbiomes (
Fig. 1); however, the lungs exhibit selective elimination of Prevotella species, one of the most abundant bacterial community members in the aerodigestive tracts of healthy individuals. Within an individual, the similarity between the oral and lung microbiomes varied widely, with some individuals having a lung microbiome that was almost identical to their oral microbiome, similar to a previous report (
9), while in others the two sites were markedly different. Our nearly bimodal data are consistent with the reports that approximately half of healthy test subjects aspirate oral/nasal secretions in a given 24-h window (
6–8). Overall, our data support the concept that microaspiration is common in healthy individuals and illustrate the microbiological continuity of the aerodigestive tract in healthy adults.
Neither the lung nor the stomach was included in the microbiome surveys of healthy individuals for the HMP (
5), but other studies have begun to fill in those gaps, focusing on either the lungs or the stomach (
1–4,
17–20). The detection of high levels of mouth-resident bacteria in the stomach is consistent with previous reports (
18–20). This is also the expected result of the swallowing of saliva (and bacteria) from the mouth. One limitation of culture-independent analysis is that it cannot distinguish live from recently killed bacteria. However, even with this caveat, our study lays the foundation for future investigations of anatomy and unique environmental niches of the upper aerodigestive tract as factors that shape the lung microbiota during health and disease.
Our comparison of nasal, oral, lung, and gastric microbiotas within the same individuals provides culture-independent microbiological support for the concept that microaspiration is common in healthy individuals. Huxley and colleagues used radioactive indium to study pharyngeal aspiration during sleep in 20 healthy subjects and 10 patients with depressed consciousness. Almost half of the healthy subjects and 70% of those with depressed consciousness aspirated during deep sleep. Those normal subjects who did not aspirate were noted to sleep poorly (
7). Similarly, Gleeson and colleagues performed a study in which a radioactive Tc tracer was deposited into the nasopharynxes of 10 healthy subjects through a small catheter during electroencephalogram-documented sleep and standard lung scans were conducted immediately following final awakening. They documented that microaspiration occurs commonly in healthy young men during sleep, is unrelated to sleep quality, and is variable within subjects who were studied on more than one occasion. They concluded that the “quantity aspirated is of an order of magnitude likely to contain bacterial organisms in physiologically significant quantities” (
6). Furthermore, as discussed by Quinn and Meyer, healthy human subjects, as well as other mammals, aspirate small amounts of liquids from the upper airways into the lower airways (
8).
We did not detect a significant contribution of the nasal microbiome to that of the lungs during health (of the three major OTUs in the nasal cavity, only low levels of the Propionibacterium OTU were readily detected). Staphylococcus, Corynebacterium, and Propionibacterium species are also dominant members of the skin microbiota and have been previously reported as members of the nasal cavity microbiota (
5,
21–23). Our study raises the possibilities that (i) the earlier tracer studies were far more sensitive for tracking nasal-oral-airway dispersion, (ii) nose-associated bacteria are more firmly attached than the intranasal liquid radiotracers, and (iii) the design of these studies, both of which involved the insertion of a catheter via the nose into the posterior nasopharynx, actually more closely simulates aspiration of saliva in the supine position. During health, there is a constant flow of liquid from the mouth (saliva) while there is much less liquid flow from the nose into the throat. During states of ill health, such as viral upper respiratory tract infections, allergies, or sinus infections, the liquid flow from the nasal cavity increases, and this clearly has the potential to affect the lung microbiome, consistent with studies of rhinovirus infection and asthmatics that highlighted changes in the lung microbiome under these conditions (
3,
11,
24,
25).
The stomach and lung present very different ecological niches for bacterial survival, yet both are highly restrictive. Helicobacter and Lactobacillus are two bacterial genera that are well adapted for growth at low pH and have routinely been reported by culture from healthy human stomachs (
18). Of note, we only observed Lactobacillus at a >0.5% abundance in the gastric samples of one of our subjects. This exact same protocol and primers can detect Lactobacillus in our mock-treated controls, as well as the mouse intestinal tract (
26). Thus, the lack of Lactobacillus likely reflects differences in sampling. The pH should exert some selective pressure, favoring the retention of acid-tolerant bacteria such as Lactobacillus; however, the fact that we did not observe such a selective pressure likely reflects the fact that immigration is constant and at a high level (swallowing of saliva), compared to the levels of indigenous colonization of the stomach and the gastric aspirate is largely sampling the upper compartment of the stomach. In contrast to the stomach, no such bacteria have been reported by culture for healthy human lungs. Culture-independent surveys of the stomach and lungs have revealed a microbiome that is much more diverse than that reported by culture alone (
1–4,
17–20). Identifying the reasons for this discrepancy between culture-based and culture-independent studies is an active area of investigation (
27) and very likely involves the adaptation of immigrant bacteria to the metabolically restricted and harsh environments of the stomach and lower airways.
The high shared membership of bacterial species between the lung microbiome and that of the mouth in our study and others, contrasting with that reported for air, suggests that microaspiration and direct mucosal translocation contribute more to microbial immigration than does inhalation of bacteria (
6–8,
28–31). Microbes are cleared from the respiratory tract via mucociliary clearance, coughing (frequent even among healthy subjects [
32]), and innate and adaptive immunity. The distal alveoli are bathed in pulmonary surfactant, which also has bacteriostatic activity against some bacterial strains, further creating selective pressure on reproducing communities (
33). Thus, the “steady state” of the lung microbiome during health is likely one of constant influx, constant elimination, and unfavorable growth conditions.
Our data support the concept that immigration is the significant driver of “maintenance” of the lung microbiota during health. Healthy individuals with very similar oral and BAL fluid microbiome community structures may simply have experienced recent aspiration events or may have some anatomical abnormalities. In addition to being strongly influenced by immigration and weakly influenced by the relative reproduction rates of community members, the community membership of the lung microbiome is strongly influenced by selective elimination, which is absent from the gastric microbiome. Our empirical evidence of these ecological determinants is consistent with the known anatomical, physiological, and immunological features of the respiratory tract.
Similar to the studies of Segal et al., we could identify individuals with high versus low Prevotella levels in the lungs (
13), and when our results are combined with those of the previous study, a model begins to emerge that suggests that subclinical lung inflammation may create an environment that promotes retention of Prevotella in the lungs. Alternatively, the presence of Prevotella may stimulate subclinical lung inflammation, as suggested by Larsen et al. (
34,
35). The latter investigators demonstrated that Haemophilus, Moraxella, and Prevotella species can all induce activation of human monocyte-derived dendritic cells (DCs), but asthma- and chronic obstructive lung disease (COPD)-associated pathogenic Haemophilus and Moraxella bacteria induce 3- to 5-fold higher production of the interleukin-23 (IL-23), IL-12p70, and IL-10 cytokines than the “commensal” Prevotella bacteria and also induce more pulmonary inflammation when introduced into mice. Interestingly, coculture experiments found that Prevotella species were able to reduce
Haemophilus influenzae-induced IL-12p70 but not IL-10 in DCs (
35). All together, these studies raise the possibility that subclinical lung inflammation associated with the presence of Prevotella may protect the lungs from overexuberant inflammatory responses.
MATERIALS AND METHODS
Subject demographics and sample collection.
The University of Michigan and Veterans Affairs Ann Arbor health care system institutional review boards approved this study. All subjects provided written consent. The subjects were clinically well and could not have received antibacterials or corticosteroids in the 3 months prior to sampling. Detailed exclusion criteria were described previously (
17). The demographics of the 28 subjects included in this study are presented in
Table 2 (FEV
1% = FEV
1/FVC ratio, where FEV
1 is the volume that has been exhaled at the end of the first second of forced expiration and FVC is the forced vital capacity [the vital capacity from a maximally forced expiratory effort]).
The same bottle of freshly opened sterile 0.9% saline (Baxter Healthcare Corporation) was used for all sample collection from a given participant. Two saline control samples were collected for each bronchoscopy in specimen cups: an aliquot of saline directly from the bottle (neat saline) and a 10-ml sample of saline that was collected after injection through the bronchoscope with a sterile syringe (scope saline).
Oral washes were performed at the start of the bronchoscopy study visit, before any topical anesthesia or sedation, by having the participant gargle for 30 s with 20 ml sterile saline and then immediately expectorate into a sterile specimen cup. The scope saline, neat saline, and oral wash samples were placed on ice. After the oral wash was completed, subjects were asked to gargle with 10 to 20 ml of Listerine antiseptic mouthwash for 20 s.
Bronchoscopy was performed under moderate conscious intravenous sedation conducted in accordance with standard safety and monitoring procedures for clinically indicated bronchoscopy, with some modifications to reduce possible contamination of the lower respiratory tract during sample collection. Participants were premedicated with diphenhydramine, midazolam, and fentanyl, followed by topical lidocaine. The bronchoscope was advanced via the mouth to a position just above the vocal cords. Lidocaine (4%) was slowly injected around the glottis as 4 aliquots of 1 ml each, attempting when possible to have the participant oppose the cords by phonating the letter E to maximize exposure to the vocal cords and minimize penetration of the medication below the cords. The bronchoscope was then advanced rapidly, without suction and as much as possible without additional lidocaine in the lower airways, into the wedged position in a subsegment of the right middle lobe and lingula.
BAL was performed by installation of saline in 30-ml aliquots that were immediately aspirated under gentle manual suction to a total of 300 ml (150 ml for each lung segment). After insertion of the bronchoscope past the epiglottis into the first lobe (usually the left one) for the first BAL, the bronchoscope was then retracted slightly without pulling the tip of the bronchoscope more proximal than the midtrachea and then repositioned into the other lobe for the second BAL. During BAL, the participant underwent gentle mechanical percussion with a pneumatic chest percussor to maximize the yield of alveolar cells for ancillary analysis of the host immune response. BAL fluid specimens from the two lung segments were collected in separate specimen cups and stored on ice.
After both BALs were completed and while the subject was still sedated, the bronchoscope was withdrawn to a position immediately superior to the glottis, where it was stabilized at the mouth with the vocal cords in constant view. An 18-gauge sterile gastric tube was introduced via the mouth, and the participant's head was flexed gently. Passage of the tube posterior to the larynx into the esophagus was observed directly to exclude entry into the trachea. The gastric tube was advanced to 40 to 45 cm, and its correct placement was confirmed by auscultation of air injected with a syringe. The bronchoscope was then withdrawn. Next, 50 ml saline was instilled through the tube and immediately aspirated with the syringe. The return was collected and placed on ice, and the gastric tube was removed.
Finally, while the participant was still sedated, the posterior nasopharynx was swabbed gently with a single sterile swab introduced via the nares without lubrication or topical anesthetic. The tip of the swab was cut with sterilized scissors and placed into a 2-ml UltraClean Fecal DNA Isolation kit (Mo Bio Laboratories, Inc.).
Sample processing and DNA isolation.
Samples were transferred to the laboratory on ice. For saline controls, oral wash samples, BAL fluid samples, and gastric aspirate samples, 1 ml was transferred to a dry bead tube (Mo Bio Laboratories, Inc.), the tubes were centrifuged for 2 min at ~16,000 × g, and the supernatant was removed. This was repeated until 5 ml saline control, 5 ml BAL fluid, 5 ml oral wash, or 10 ml gastric aspirate had been transferred to the dry bead tube. Samples were then stored at −80°C until DNA isolation.
For DNA isolation, 750 µl PowerSoil DNA kit bead solution (Mo Bio) and 60 µl PowerSoil DNA kit solution C1were added to each Dry Bead Tube (containing a nasal swab or the pellet from 5 ml saline control, 5 ml BAL fluid, 5 ml oral wash, or 10 ml gastric aspirate). Samples were bead beaten for 2 min at the Homogenize setting of a Mini-BeadBeater-8 (BioSpec Products) and centrifuged for 30 s at 10,000 × g. We then continued with the PowerSoil DNA Isolation kit protocol (Mo Bio) starting with step 7 (transfer supernatant to a clean 2-ml collection tube) or transferred samples into a 1-ml collection plate to continue with the PowerSoil-htp 96 Well Soil DNA Isolation kit protocol (Mo Bio) starting with step 10 (add 250 µl solution C2 to each well).
qPCR.
Quantification of bacterial 16S rRNA-encoding genes was performed by real-time PCR with TaqMan hydrolysis probes on a Roche 480 LightCycler. The primers used, targeting the V1 and V2 regions of the 16S rRNA-encoding gene, were 5′ AGAGTTTGATCCTGGCTCAG 3′ (forward) and 5′ CTGCTGCCTYCCGTA 3′ (reverse), and the probe used was 5′-FAM-TA+ACA+CATG+CA+AGTC+GA-BHQ1-3′ (where FAM is 6-carboxyfluorescein, locked nucleic acid nucleotides are indicated by preceding plus signs, and BHQ1 is Black Hole Quencher 1) (
9,
36). After an initial denaturation for 5 min at 95°C, 40 cycles of amplification for 30 s at 94°C, 30 s at 50°C, and 30 s at 72°C were performed. A final elongation step of 72°C for 2 min was performed. Final cooling was performed at 4°C.
Microbiota analysis by pyrosequencing.
Libraries of V3V5 16S rRNA gene amplicons were constructed on the basis of the HMP protocol (
http://www.hmpdacc.org/doc/16S_Sequencing_SOP_4.2.2.pdf) as described previously (
37,
38), except under the following PCR conditions. The PCR started with 2 min at 95°C, which was followed by (i) 20 cycles of a touchdown PCR for 20 s at 95°C, 30 s at the annealing temperature (which was 60°C in the first cycle and dropped 0.5°C with each cycle), and 5 min at 72°C and then (ii) 20 cycles of a standard PCR with 20 s at 95°C, 30 s at 50°C, and 5 min at 72°C. Large-volume Lib-L emPCRs (Roche 454) were performed, and 454 sequencing was performed at the University of Michigan with the GS FLX titanium platform (Roche) in accordance with the manufacturer's instructions.
Sequence processing and low-biomass controls.
Sequences were processed with mothur v.1.30.0 (
39,
40) as described previously (
37,
38,
41) through the step used for removal of sequences classified as chloroplast, mitochondria,
Archaea,
Eukaryota, or unknown kingdom. However, the fasta, names, and groups files (six of each, two from each sequencing run) were concatenated into a single fasta, names, and groups file, respectively, after the trim.seqs step. To control for equipment- and reagent-derived DNA that could contaminate the low-bacterial-biomass BAL fluid samples, we collected and sequenced controls from (i) scope saline, (ii) neat saline, and (iii) reagents used for DNA extraction. A customized R script based on the neutral model (
42) was then used to identify and remove individual BAL fluid sequences that that likely arose by contamination from neat saline, scope saline, DNA isolation reagents, or PCR reagents (
17). This model uses the relative abundance of a sequence read in the controls to calculate the probability of its detection in BAL fluid samples because of carryover. This analysis was performed prior to clustering of the sequences into OTUs at 97% similarity. Sequence reads falling within the confidence intervals of the model (neutrally distributed from controls) and outside the lower bounds of the confidence intervals (enriched in controls) were considered to be contaminating sequences and were removed from the analysis of all BAL fluid samples. Only sequences outside the upper bounds of the confidence interval with the controls as the source (overrepresented in BAL fluid samples) and unique to BAL fluid (not detected in controls) were retained for subsequent analyses.
Sequence analysis.
Sequences were clustered by mothur into OTUs defined as ≥3% difference between OTUs using the average-neighbor algorithm. Consensus taxonomic classifications were generated for each OTU based on modified Ribosomal Database Project reference files. The make.Shared command was used to produce a table (shared file) of the number of sequence reads assigned to each OTU in each sample. Seven hundred sequences from each sample were subsampled, and samples with fewer than 700 sequences were discarded. A complete sample set (BAL1, BAL2, oral wash, gastric aspirate, and nasal swab), with a minimum of 700 sequences/sample, was required for each subject included in the analysis. The shared file was used to calculate the θYC distance (1 − θYC) between bacterial communities.
For analyses in which BAL1 and BAL2 were combined (identified as BAL), the sequences of each OTU in the BAL1 and BAL2 samples were added together for each subject prior to subsampling at 700. Seven hundred sequences per sample were then subsampled from the shared file, and samples with fewer than 700 sequences were discarded. A complete sample set (BAL fluid, oral wash, gastric aspirate, and nasal swab), with a minimum of 700 sequences/sample, was required for each subject included in the analysis. This shared file was used to calculate the number of OTUs observed in each sample, AMOVA based on θYC, PERMANOVA, redundancy analysis (RDA), relative abundances of OTUs and the θYC distances (1 − θYC) between bacterial communities in all of the samples and within each subject.
Statistical analysis.
Statistical analyses were performed with Prism 5 (GraphPad Software) for analysis of variance (ANOVA),
t test, and regression analysis; mothur for AMOVA (
43); and vegan and R for all diversity, rank abundance, and ordination analyses. The θ
YC distance (1 − θ
YC) between bacterial communities measures differences in community structure (
44) and was selected because it weighs rare and abundant OTUs more evenly than other metrics such as Bray-Curtis or Morisita-Horn (
39). Using the shared file, we calculated θ
YC distances (1 − θ
YC) between communities with mothur (
http://www.mothur.org/wiki/Thetayc), where
ST is the total number of OTUs in samples P and Q,
pi is the relative abundance of OTU
i in sample P, and
qi is the relative abundance of OTU
i in sample Q. The θ
YC distance (1 − θ
YC) incorporates the relative abundances of shared and nonshared OTUs of two communities. For data display by ordination, RDA (the constrained form of principal-component analysis) was used to focused on the variation that can be explained by the question of interest (are the microbial communities different at different sampling locations). ANOVA-like permutation testing of constrained ordinations, including RDA, was performed with the anova.cca function in the R package vegan. Significant differences in community membership identified via constrained ordination were confirmed by PERMANOVA via the Adonis function in vegan.
BioProject accession numbers.
The sequences determined in this study are available at the NIH Sequence Read Archive (
http://www.ncbi.nlm.nih.gov/bioproject) under BioProject accession numbers PRJNA263948 and PRJNA269493.
ACKNOWLEDGMENTS
We thank Arvind Venkataraman for his contributions on the use of the neutral model for removing contaminating DNA sequence signals from low-biomass samples; Lisa McCloskey for her contributions in subject recruitment; and Kavitha Santhosh, Nicole Falkowski, and Zachary Britt for their technical contributions to this project.
This project has been supported in part though the following grants from the National Heart, Lung and Blood Institute: T32HL00774921 (C.B., R.P.D.), U01HL098961 (J.M.B., J.L.C., V.B.Y, T.M.S, G.B.H.), and R01HL114447 (G.B.H.). Additional support was provided by the Biomedical Laboratory and Clinical Science Research & Development Services, Department of Veterans Affairs (C.M.F., J.M.B., J.L.C.), the Host-Microbiome Initiative at the University of Michigan, and the Nesbitt Family Foundation (G.B.H.).