Helicobacter pylori is a gastric bacterial pathogen of humans that chronically infects an estimated 50% of the world's population (
19). The consequence of long-term infection with
H. pylori is typically asymptomatic gastritis, but ca. 10 to 15% of those infected will develop peptic ulcer disease or gastric adenocarcinoma (
19). Several bacterial virulence factors are found more commonly in
H. pylori strains that are associated with disease, the best studied of which are the cytotoxin-associated gene pathogenicity island (
cag PAI) and certain alleles of the vacuolating cytotoxin, VacA. Another bacterial factor that is associated with disease, rather than asymptomatic infection, is the capacity for adherence. Approximately 4% of the
H. pylori genome encodes a diverse repertoire of outer membrane proteins (OMPs), the largest of which is the 21-gene Hop family (
1). BabA is a member of the Hop family that binds Lewis b (Leb) and related terminal fucose residues found on blood group O (H antigen), A, and B antigens that are expressed on the gastric epithelium (
3,
6). Some studies suggest that patients infected with strains that express BabA are more likely to present with ulcer or gastric cancer, particularly if they also have the
cag PAI and the s1m1 allele of
vacA (
12,
35).
DNA sequence analysis suggests that among the genes encoding Hop family proteins there is considerable potential for both antigenic variation, in which combinatorial DNA shuffling creates antigenically distinct proteins, and phase variation, in which there is reversible on/off switching of gene expression. For example, there is extensive 5′ and 3′ homology between
babA and two other
H. pylori OMPs of unknown function called
babB and
babC (
1). The greater similarity among bab paralogues (within a genome) than among orthologues (across genomes) suggests that there is frequent recombination and concerted evolution among these genes (
27). Regions of dinucleotide CT repeats in the 5′ coding region of
babA,
babB, and other Hop genes may promote phase variation by slipped-strand mispairing during DNA replication. Finally, polynucleotide (A or T) tracts in
babA and other Hop promoters might alter expression by subtle changes in the spacing between −35 and −10 hexamers. BabA proteins are also highly polymorphic in the variable midregion (
26,
27), and they differ by more than 1,500-fold in their affinity for Leb (
3).
We recently showed that
H. pylori strains recovered from experimentally challenged rhesus macaques had lost expression of BabA and the capacity to attach to Leb (
33). In some cases the strains underwent a gene conversion event in which the
babA gene was replaced by a duplicate copy of
babB. In other cases the
babA gene was present in
H. pylori strains recovered from macaques, but it was not expressed due to phase variation that resulted from gain or loss of one CT repeat in the 5′ coding region of
babA. These results demonstrate that
H. pylori regulates OMP expression
in vivo using both antigenic variation and phase variation. Here we sought to extend these observations by comparison of different animal hosts, including primate, mouse, and gerbil, and different
H. pylori strains. The results indicate that loss of BabA expression and Leb attachment is a robust phenomenon that occurs via multiple genetic mechanisms in different
H. pylori strains and across diverse animal models. Furthermore, analysis of subtle BabA variants that emerge during colonization permitted the identification of six amino acid changes that are sufficient to eliminate binding to Leb.
DISCUSSION
Studies in different human populations in many geographic locations have demonstrated the remarkable genomic diversity of
H. pylori. Isolates from unrelated individuals represent a seemingly unlimited number of unique strains that differ in gene content, position, and allelic profile. Diversity is thought to develop through point mutation, genomic recombination, and genetic exchange, as well as phase variation, all of which are common in
H. pylori (
13,
29,
30), coupled with random genetic drift and selection for adaptation to particular environments. Animal models of
H. pylori infection permit experimental observation of diversity that develops
in vivo in real time after inoculation with a clonal population. We previously used the rhesus macaque model to analyze changes in the
H. pylori genome over time within the same animal host (
33). The results demonstrated that in rhesus macaques the expression of BabA was lost, either by phase variation or by nonreciprocal gene conversion, in which a duplicate copy of
babB replaced
babA.
Here we have replicated the analysis of BabA expression in macaques using a different
H. pylori strain and extended the results to the mouse and gerbil, which are commonly used small-animal models of
H. pylori. The results demonstrated the loss of BabA expression in macaques, mice, and gerbils, although the particular mechanisms differed. After challenge of mice with
H. pylori J166, which has 5′ CT repeats in the
babA gene, BabA expression, and Leb binding were lost by phase variation. In contrast, challenge of macaques with J99 yielded output strains that lost BabA expression either by selection and then expansion of a subpopulation of J99 that had a single base pair mutation that encoded a stop codon, or by nonreciprocal gene conversion of
babA with a duplicate copy of
babB as we observed previously (
33). Gene conversion also explained loss of Leb binding activity in the gerbil model after colonization with strain 7.13, although in this case the
babA gene that encoded Leb binding was converted by a nonbinding allele. This gene conversion was not present in strain 7.13, which was also recovered from a gerbil 3 weeks after challenge with strain B128 (
11). This is likely explained by the short duration of infection, since loss of BabA expression is not uniform until 8 weeks of colonization (
33).
Although most
H. pylori strains recovered from humans express BabA (
29,
35), strains similar to those recovered after animal challenge are sometimes seen in humans. For example, we previously examined 43 clinical isolates by PCR and DNA sequencing and showed that while most had
babA and
babB, in 16% of strains (7 of 43)
babA was absent and
babB was duplicated, like what is seen in macaques. Some strains had duplicate copies of
babA (like strain 7.13), one of which was typically out of frame, while others had still different bab genotypes, such as
babC (HopU, HP0317), at the locus where
babA or
babB are typically found (
7). Others have found similar diversity in bab genotypes and demonstrated remarkable heterogeneity in Leb binding (
3). Therefore, it is likely that whatever factors select for loss of BabA expression in animal models of
H. pylori may sometimes also be found in humans.
The particular selective pressures that modulate expression of BabA and related proteins are unknown. One possibility is that BabA (but not BabB) induces an adaptive immune response, so that BabA expression is lost while BabB is maintained. This seems unlikely because we have been unable to demonstrate antibodies to BabA or BabB in macaques (
33) and because other reports have failed to demonstrate BabA or BabB as immunodominant antigens in humans (
14,
18). Furthermore, in the case of the gerbil, Leb-binding BabA expression is lost, but BabA expression
per se is not. BabA expression is not lost in short-term infection with C3H/HeJ mice (Fig.
1), which are known to have a defect in TLR4 signaling, so innate immunity might also be important. The presence of Leb may also influence BabA expression. However, since Leb serves on the one hand as a cellular receptor for BabA and on the other hand as a decoy receptor that likely plays a role in innate immunity (
21), the relationship between BabA and Leb expression is likely complex. BabA metastability and heterogeneity likely contribute to
H. pylori fitness (
5) and so are tightly regulated, with BabA sometimes expressed by most cells in the population and other times archived as “memory cells” ready to expand when conditions change.
Although
H. pylori strains differ markedly in binding affinity to Leb, no consensus amino acid sequences have been identified that are conserved among strains that bind Leb (
3,
15). Since the strains recovered from gerbils did not bind Leb or gastric histosections and yet expressed BabA that differed at only six amino acids from the Leb-binding allele (Fig.
4), one or more of these six residues are likely important, either as binding sites or as key amino acids that alter the tertiary structure of BabA. The location of these residues within the midregion of BabA is consistent with this suggestion, since this portion of the protein is highly polymorphic (
26,
27), is thought to be surface exposed, and is likely responsible for Leb binding. In most cases, the differences in amino acids between BabA2 and BabA1 at these positions represent a change in charge or hydrophobicity, such as E218K (negative to positive charge), K252E (positive to negative charge) and S302G (hydrophilic to hydrophobic), which might be expected to change the tertiary structure of the BabA protein.
The receptor for BabA in the gerbil is unknown, and Leb has generally been thought to be absent in rodents. However, recent studies in mice with anti-Leb antibodies, corroborated by antibody inhibition using specific synthetic saccharides, suggest that rodents may in fact express Leb or related Lewis type 1 structures (
23). Loss of BabA-mediated binding to infected gerbil gastric tissue (Fig.
5) demonstrates that as
H. pylori undergoes modification in surface adhesins during infection; so, too, does the host play an active role, probably with changes in Leb or closely related glycans expressed on the epithelial surface. This is consistent with recent studies that have demonstrated that
H. pylori induces profound changes in surface glycoproteins, including the gastric mucins MUC5AC (
8,
21) and MUC1 (
22). We hypothesize that modification of BabA during
H. pylori infection is a mechanism to adapt to changing conditions of inflammation and glycan expression at the epithelial surface.