DISCUSSION
Close examination of several
B. fragilis cultivars isolated from longitudinal luminal samples combined with metagenomic read mapping demonstrated that each patient harbored a distinct
B. fragilis population. These potential pathobionts were present early after the creation of the pouch and persisted for at least 2 years, remaining at relatively high abundance during periods of health, inflammation, and postantibiotic treatment. Comparisons of luminal and mucosal samples from inflamed visits of the same patient detected two distinct
B. fragilis populations, one of which could be detected only in mucosal samples. Comparisons of genomes from these organisms revealed that their differences principally occurred within multiple capsular polysaccharide biosynthesis loci that contribute to microcapsule production and typically contain transcriptional regulatory elements (UpxY and UpxZ), coding regions for glycosyltransferase genes, UDP-
N-acetylglucosamine 2-epimerase (EC 5.1.3.14), and glucosamine-6-phosphate deaminase (EC 3.5.99.6). While
Bacteroides organisms are abundant members of the human intestinal tract, the relatively abundant species during inflammation in this study (
B. fragilis,
B. ovatus,
B. vulgatus, and
B. thetaiotaomicron) also commonly occur in association with anaerobic infection and IBD (
23–25).
These
Bacteroides represent known opportunistic pathogens that can reach relative abundance of >80% in IBD patients with cell numbers exceeding 10
9 cell/ml at the mucosal layer (
25,
26). The ability of
Bacteroides to maintain high abundance during pouch inflammation and normal epithelial function in the pouch likely reflects their ability to activate, suppress, or evade an overly active immune response during W, I, and A sampling visits, respectively. To maintain the abundance patterns observed in this study, members of the genus must exploit a mechanism to survive within each of these unique host environments. Several
Bacteroides species (
B. thetaiotaomicron,
B. ovatus,
B. fragilis, and
B. vulgatus) have evolved multiple distinct capsular polysaccharides through close interaction with host cells and other microbes within the human intestinal tract.
Bacteroides have evolved diverse capsules that include both lipopolysaccharide and polysaccharide (
25).
B. fragilis in this study contains at least eight unique CPS biosynthesis loci that correspond to zwitterionic polysaccharide A to polysaccharide H (PSA-PSH), which serve an essential role in the growth, virulence, and adaptation to variable conditions (
27). The inversion of the promoter region regulates each polysaccharide loci (
28), but the UpxY and UpxZ proteins dynamically control expression (
29). The expression of a single CPS operon and location of its polysaccharide product on the surface in any given cell lead to a population of antigenically diverse cells (
30). This population-level variability facilitates the survival of
B. fragilis in many environments and its ability to maintain significant relative abundance without causing inflammation. Although the flanking regions of the PSA-PSH are largely conserved, our mapping results indicate little to no homology between the PSA-PSH contained in the well-characterized
B. fragilis NCTC 9343 or among each of the three cultivars identified in this study. These results agree with other characterizations of
B. fragilis CPS diversity (
9).
Figure 4 reveals that the percent GC content of CPS loci differs from the surrounding genome, consistent with recent transfer events and/or recombination within the genome (
31,
32).
Some of the CPS loci in
Bacteroides MAGs contain genes (e.g.,
neuB,
wcbM, and
cpsM) that define virulence factors in other pathogens. For example, the protein product of
neuB catalyzes the final step in the biosynthesis of the sialic acid Neu5Ac (
33). The potential decoration of CPS with sialic acids in
Bacteroides cultivars from p_patients resembles mechanisms for avoiding or causing host innate immune response in other virulent organisms, e.g.,
Campylobacter jejuni,
Escherichia coli,
Neisseria meningitidis, and
Streptococcus suis (
34). Several
B. fragilis CPS loci in this study contain genes encoding other elements that also occur within the capsules of known virulent genomes, including
cpsM from the capsule of
Streptococcus pneumoniae and
Burkholderia pseudomallei wcbM (
35), which shares strong homology with
hddC from
C. jejuni (
36). The effect of
wcbM,
cpsM, and
neuB in
B. fragilis on virulence remains unclear, but their presence within the genome of other
Bacteroides in this study suggests that they may contribute to the cause and/or elevation of a host immune response.
Previous reports have reported horizontal gene transfer between
Bacteroides through pairwise comparison of genome sequences from isolates (
37,
38). Comparisons of the MAGs and cultivar genomes in this study provide suggestive evidence of gene transfer between
Bacteroides. For example, both a
B. fragilis MAG and the same genome from a cultivar draft genome from p-207 shared nearly 100% sequence identity with several genomic regions of a MAG for
B. vulgatus. The shared sequences spanned coding regions for several outer membrane receptors, site-specific recombinase, and integrative conjugative elements (ICEs) containing tetracycline resistance genes. The presence of these genes within independent blooms of two different species during inflammation suggests that they provide an adaptive advantage. We also detected genes shared among some
Bacteroides from all sample types for multiple patients, including ICEs, membrane-related proteins, transcriptional regulatory genes, mobile element proteins, outer membrane receptor proteins, and multidrug resistance genes that confer increased fitness to
Bacteroides. ICEs in particular can be modified by transposons, resulting in the insertion of genes from the recipient genome. This process can yield a customized ICE that benefits subsequent
Bacteroides colonization of the gut (
37). They exhibited high copy numbers (10 copies) in relation to the surrounding genome of conjugative transposons (CTn) and related genes common to CTn-like ICEs. The high copy numbers indicate that these elements excise from the genome and replicate prior to moving to a recipient genome within the pouch, consistent with previous work (
39). Because transfer events increase within inflamed environments, movement of these genes in the pouch environment may be elevated among
Bacteroides and has the potential to transfer virulence genes from the CPS of
B. fragilis to other species and possibly across genera.
While antibiotics seemed to reduce the severity of disease (
1,
2,
8), the abundance of
Bacteroides did not significantly decrease, and in three patients, the same
Bacteroides returned 1 month after antibiotic treatment was completed. The reestablishment of
Bacteroides following antibiotic treatment could reflect the presence of multiple antibiotic resistance genes (
40), which spread by conjugation and mobilization (
34). Alternatively, the same
Bacteroides might reestablish from external sources or from survivors of the original antibiotic treatment. Although
B. fragilis was undetected following antibiotic treatment in p-207, a previously undetected
B. vulgatus became dominant during refractory inflammation with functional similarity to a
B. vulgatus from patient p-208 and contained ICEs carrying genes necessary for tetracycline and clindamycin resistance. Ciprofloxacin was used to treat pouchitis in this study, and it is unknown whether tetracycline or clindamycin genes would confer resistance to this antibiotic.
The high coverage of marker gene analysis contributes to reduced statistical variability when profiling microbial communities and enables the detection of rare taxa. Yet, the reliance upon a single locus constrains the level of information for resolving differences between closely related taxa. MED’s ability to resolve amplicon sequences into oligotypes that differ by a single nucleotide (
11) offers increased resolution relative to taxonomy and clustering techniques (
41,
42), but neither MED nor the MG-RAST annotation of short metagenomic reads detected community shifts revealed through comparisons of the MAGs. Despite having identical oligotypes and essentially identical functional annotations, read mapping to cultivar draft genomes described differences between
B. fragilis populations from different patients and between luminal samples (M sample type) versus mucosal samples (GG sample type) taken during that same inflammation visit. When combined with longitudinal sampling of a patient, it became possible to track temporal changes in the abundance of closely related but distinct
Bacteroides populations. Abundant
Bacteroides species during inflammation were limited to
B. ovatus,
B. vulgatus, and
B. fragilis. Each of these
Bacteroides genomes was 100% covered over the length of all contigs at least once prior to inflammation. These mapping results provide evidence for the presence of those genomes in the patients’ microbiomes prior to the development of mucosal inflammation. In the case of p-214, the
B. fragilis genome appeared at least 1 year before inflammation, suggesting that inflammation results from a well-established but low-abundance microbe. Read mapping from individual patient visits revealed significant variation in response to antibiotics among
Bacteroides genomes. Several of the
Bacteroides reached 10% relative abundance following antibiotic treatment. In other patients, read mapping did not detect the abundant
Bacteroides after antibiotic treatment. In the case of p-207, read mapping demonstrated that after antibiotic treatment,
B. vulgatus replaced the
B. fragilis that dominated during inflammation.
B. vulgatus represented 23% of the p-207 microbiome at a second inflammation visit.
Large-scale 16S rRNA surveys that have included hundreds of IBD patients and healthy patient controls have contributed important information about dysbiosis variability but have not yet identified at the genome level specific microbes that could direct diagnosis or treatment of the disease. Although the number of samples in this study is small compared to other studies (
8,
43,
44), the longitudinal sampling and clinical data paired with cultivation, shotgun metagenomic assemblies and 16S amplicon sequencing provide unique insights into microbial influences on the development and course of pouchitis. We found a significant difference in the overall community structure between the p_patients and n_patients; however, 3% differences in overall community variation did not provide a useful clinical predictor of inflammation. The relative abundance of
Bacteroides increased significantly during inflammation, but differences in its relative abundance in p-patients versus n-patients were not significant. The absence of data related to the state of the immune system and other confounding factors may have compromised larger studies where
Bacteroides either decreased or weakly correlated with IBD (
43,
44). The lack of significant decreases in the relative abundance of
Bacteroides in the antibiotic-treated group highlights the ability of
Bacteroides to resist certain antibiotics (
40,
45). The level of interpersonal variation in overall community structure and the diversity of
Bacteroides are also important observations. The
Bacteroides oligotypes were largely stable for a particular individual and occurred across multiple individuals, suggesting that the same
B. fragilis occurred in multiple patients, yet the
B. fragilis MAGs that correlate with those operational taxonomic units differed significantly from each other at the nucleotide and functional levels. Similar results for
B. vulgatus provide further evidence for the lack of sensitivity of 16S rRNA surveys to identify the particular microbial organisms that may be related to the disease.
Despite the abundance of
Bacteroides and the presence of known virulence genes, low-abundance members of the community might elicit an inflammatory or synergistic interaction between multiple microbes and drive inflammation (
12). Two unrelated taxa that were dominant during this study,
Cetobacterium somerae and
Clostridium perfringens (
44,
45), each might contain genes that contribute to evading an immune response and/or eliciting an abnormal immune response. Different genera and/or variants within the same microbial species likely contribute to the etiology of IBD. Within closely related
Bacteroides, the patient-specific occurrence of dissimilar genome elements that define functionally related gene clusters have the potential to drive microbial community dynamics and inflammation.
Complex host-microbe relationships leading to pouchitis vary for individual patients. The biological variation may explain why it has been so difficult to find a consistent pathogen in IBD using conventional tools and cross-sectional study designs. Our results underscore the importance of performing prospective studies with highly curated clinical metadata and the application of multiple tools, including high-resolution genomics. The identification of five potentially virulent Bacteroides spp. provides specific targets for understanding the underlying mechanisms that lead to refractory pouchitis and potentially IBD. While these conclusions rely principally on abundance patterns from a small group of samples, the known virulence, unique capsule, and correlation with disease index strongly support a role for these species as potential causative agents of disease.