Research Article
1 February 2001

Allelic Variation within Helicobacter pylori babA and babB

ABSTRACT

Helicobacter pylori strains show both geographic and disease-associated allelic variation. We investigated the diversity present in two genes, babA and babB, which are members of a paralogous family of outer membrane proteins. Eleven family members within a single H. pylori strain, predicted to encode proteins with substantial N- and C-terminal similarity to each other, were classified as babA paralogues. In their central regions, most are less than 54% related to one another. Examining the babA and babB central regions in 42 H. pylori strains from different geographic locales, we identified five different allele groups of babA (AD1 to AD5) and three different allele groups of babB (BD1 to BD3). Phylogenetic analysis revealed that the allelic groupings ofbabA and babB are independent of one another and that, for both, geographic variation is present. Analysis of synonymous and nonsynonymous substitutions in these regions showed thatbabA is more diverse, implying an earlier origin than that of the same region of babB, but that the babAdiversity region may have more functional constraints. Although recombination has been central to the evolution of both genes, withbabA and babB showing low mean compatibility scores and homoplasy ratios of 0.71 and 0.67, respectively, recombination is not sufficient to obscure evidence of clonal descent. Despite the involvement of babA in binding to the host blood group antigen Lewis B, neither the presence of differentbabA allele groups nor that of different babBallele groups is a determining factor in Lewis B binding of H. pylori strains.
Helicobacter pylori is a microaerophilic, gram-negative bacterium that colonizes the human stomach (12). Colonization induces chronic gastritis and plays a role in the development of peptic ulcer disease and gastric adenocarcinoma (12, 44). H. pylori strains are highly diverse (22, 31), as evidenced by allelic variation within vacA (10, 11, 17, 42, 43); the presence of nonconserved DNA fragments among different strains, such as the cag pathogenicity island (4, 15, 16, 28, 41, 44); and the occupation of a single genomic site with different genes such as iceA1 andiceA2 (44). DNA fingerprinting and multilocus enzyme electrophoresis techniques have shown that there is greater total genetic diversity for H. pylori than for other bacteria that have been studied (2, 3, 20, 22, 32, 39).
H. pylori adhesion to the gastric epithelium is mediated, at least in part, through the Lewis B blood group antigen (13). Adherence to the epithelium is believed to help protect the bacteria from gastric acidity, as well as from displacement due to peristalsis. Two H. pylori genes, babA andbabB, were identified based on the N-terminal similarity of their products to the Lewis B binding protein (25), and it was determined that the babA gene product is necessary for Lewis B binding activity. The two gene products are members of a paralogous family of outer membrane proteins, in which the members have significant N- and C-terminal similarity (40).babA and babB are nearly identical in their 5′ and 3′ regions, with most of their sequence divergence being in their midregions (8, 25).
We sought to examine the relationships between babA and other members of this family of proteins and to examine the diversity that exists within babA and babB. Our specific goals were to determine whether there exists geographic variation or allele group differences among babA and babBsequences and to determine whether there is any association betweenbabA and babB diversity and Lewis B binding phenotypes of H. pylori strains. We also sought to examine the evolutionary relationships between babA andbabB and to determine whether their current gene structures have been largely shaped by recombination.

MATERIALS AND METHODS

Strains and growth conditions.

A total of 42 H. pylori strains were obtained from patients from several different locales and with differing clinical diagnoses (Table1). Strains were cultured at 37°C in a 5% CO2 atmosphere on Trypticase soy agar plates with 5% sheep blood (TSAB) and frozen at −70°C until used in this study.
Table 1.
Table 1. H. pylori strains used in this study
StrainavacAtypeiceA typeAllele groupClinical diagnosisc
smbabAbabB
J99s1bm12AD2BD2DU
96-74s1bm11AD2BD2DU
97-141s1bm11AD2BD2Gastritis
84-183s1bm12AD2BD3NA
96-71s1am12AD1BD2DU
95-76bs2m22AD1BD2Gastritis
98-292s1bm21AD1BD3NA
97-503s2m22AD1BD3Gastritis
95-56bs2m22AD2BD2Gastritis
96-68s1am11AD3BD2Gastritis
98-454s2m22AD3BD2NA
97-5s1bm21AD3BD3Gastritis
90-40s1am11AD2BD1NA
97-147bs2m21AD5BD2NA
97-780s2m22AD3BD2NA
95-97s1bm12AD3BD2Gastritis
26695s1am11AD2BD3Gastritis
96-28s1bm22AD3BD2DU
CCUG17875s1am11AD3BD3NA
95-23s1am11AD3BD2NA
95-10s1am11AD3BD1NA
88-28s1am22AD3BD3NA
98-10s1am11AD2BD1GU
98-16s1am11AD2BD2GU
98-24s1am12AD2BD1DU, GU
98-18s1am11AD2BD3GU
96-10s1am11AD3BD2DU
98-22s1am12AD3BD3Gastritis
98-26s1am11AD3BD2GU
98-32s1am12AD3BD3DU, GU
98-30s1am11AD3BD2GU
98-14s1am11AD2BD1DU
98-12s1am11AD4BD1DU
97-793s1am11AD2BD2Gastritis
97-776s1am21AD2BD2Gastritis
98-60s1am11AD4BD2Gastritis
98-344s1cm22AD3BD2Gastritis
98-77s1am11AD3BD2Gastritis
97-679s1am12AD4BD2Gastritis
97-723s1am22AD4BD3Gastritis
97-687s1am12AD3BD2Gastritis
98-53s1am22AD3BD2Gastritis
a
The geographic location of strains is outlined in Fig. 4.
b
cagA-negative strain.
c
NA, not available; DU, duodenal ulcer; GU, gastric ulcer.

Genotyping of H. pylori.

Genomic DNA was prepared from each strain after 48 to 72 h of growth on TSAB plates, as described previously (47). Strains were examined by PCR to determine vacA allelic types and the presence of cagA, iceA1, and iceA2, using established primers (10, 43, 48). PCR products were analyzed by agarose gel electrophoresis (10).

Analysis of babA homologs.

To compare sequences of babA and other homologs from strains J99 and 26695, nucleotide searches were performed using BLAST algorithms (9) from GenBank and The Institute for Genomic Research (http://www.tigr.org/tdb/mdb/hpdb/hpdb.html ) and Astra-Boston (http://scriabin.astrazeneca-boston.com/hpylori/ ) databases. Translations of nucleotide sequences were performed using GCG (Genetics Computer Group; Madison, Wis.) Translate, and pairwise similarities were determined using GCG Gap. Points of division were determined based on an alignment of the paralogues, where the first division point corresponds to the beginning of the HP722 open reading frame (ORF) (nucleotides 1 to 360 of 26695 babA), the second division point corresponds to the beginning of the HP229 ORF (nucleotides 361 to 760), and the third division point is based on the midpoint of the HP229 ORF (nucleotides 761 to 1510 and nucleotide 1511 to the end of each ORF). Secondary structure analysis was performed using Garnier's algorithm (21) athttp: //pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_gor.html, transmembrane domains were deduced using the TMPred server athttp://www.ch.embnet.org/software/TMPRED_form.html (24), and isoelectric points were deduced using the server at http://www.expasy.ch/tools/pi_tool.html (46).

Diversity region fragment amplification and sequence analysis.

babB fragments were amplified using primers AN5036 and AN5037 (Table 2).babA fragments were amplified primarily with primers C6678 and C6679; however, alternate combinations of forward and reverse primers among C6678, C6679, B1999, BA909, and B1998 were used to amplify fragments from strains that did not amplify with the primary primer set. PCR fragments were purified using the QiaQuick PCR purification kit and the QiaQuick Gel Extraction kit. PCR fragments were directly sequenced on both strands, using an automated Applied Biosystems, Inc., sequencer at the Vanderbilt Cancer Center sequencing core facility, and were analyzed using Sequencer 3.1.1 (Gene Codes Corp, Inc., Ann Arbor, Mich.).
Table 2.
Table 2. PCR primers used for this study
Primer nameGene or designationDirectionLocation in genecGenBank accession no.Sequence (5′-3′)b
C6778babA (HP1243)Forward570–588AE000629GCTTACCCGCGCTCAAAG
C6779babA (HP1243)Reverse1056–1076AE000629CTCCGTGAAAGGGTTGAAAG
B1999babA (HP1243)Reverse1088–1107AE000629GTTAAGCGAGCATGCCTTG
BA909babA (HP1243)Forward410–428AE000629CTTCAACCACCATCTTCA
B1998babA(95-76)aForward562–580NAaGCTTGCCAGCGCTCAACC
AN5036babB (HP896)Forward451–469AE000599ACCATCACTTGCAATTCG
AN5037babB (HP896)Reverse1042–1060AE000599GAGCGTTTTTGAGCATGC
B9617babA (HP1243)Forward1–23AE000629ATGAAAAAACACATCCTTTCAT
AN5954babA(JHP833)Reverse2213–2235AE001512TTAGTAAGCGAACACGTAATTC
C5773babB (HP896)Forward1–20AE000599ATGAAAAAAACCCTTTTAC
C5774babB (HP896)Reverse2138–2156AE000599TTAGTAAGCGAACACATA
AN5045JHP834Forward1–19AE001512GTGTGCGGCGTATTATCG
AN5043HP1244Forward1–21AE000629ATGGAAAGAAAACGCTATTC
C7491pUC4KForward369–401X06404GCTCTAGAGCTCACGACGTTGTAAAACGACGGCCAGTG
C7492pUC4KReverse1598–1638X06404GCTCTAGAGCGTTGTGTCTCAAAATCTCTGATGTTAC
AN5485pUC4KForward369–401X06404CGCGGATCCGCGTCACGACGTTGTAAAACGACGGCCAGTG
AN5486pUC4KReverse1598–1638X06404CGCGGATCCGCGGTTGTGTCTCAAAATCTCTGATGTTAC
a
NA, not available, unpublished data.
b
Underlined sequences correspond toXbaI and BamHI restriction sites.
c
Nucleotide positions.

Phylogenetic and nucleotide analyses.

Multiple alignments of nucleotide and predicted amino acid sequences were created using GCG Pileup, ClustalW, and GCG Pretty (Wisconsin Package version 9.1; Genetics Computer Group). Similarity plots were created using SimPlot 2.5 (http://www.welch.jhu.edu/ s̃ray/download). Phylograms of each nucleotide alignment were generated using both parsimony and distance matrix methods, using Paup 4.0b2 (Sinauer Associates, Sunderland, Mass.). All phylograms were displayed using Treeview (35) and Paup 3.1 (Illinois Natural History Survey, Champaign), using midpoint rooting. Transitions and transversions were calculated with MEGA 1.01 (Pennsylvania State University, University Park). Nucleotide and amino acid similarities of both babA and babB fragments were determined using BoxShade 3.2. Average GC content was calculated using GeneDoc 2.5 (K. B. Nicholas and H. B. Nicholas, Jr.). Synonymous and nonsynonymous substitution rates were generated using DnaSP 3.0 (37). The amount of homoplasy (H ratio) was determined as the mean of five independent tests using the homoplasy test (33), and all sites containing gaps were excluded from the analysis. The consistency index was determined using MacClade Version 3.0 (Sinauer Associates). Compatibility matrices were created using Reticulate (26). All means, standard deviations, and ratios were determined using Corel Quatro Pro (Corel Corp., Ottawa, Canada).

babA and babB mutant strains.

ThebabA and babB ORFs were amplified from H. pylori strain J99, using primer pairs B9617-AN5954 and C5773-C5774, respectively (Table 2). Each ORF was cloned into pGem-T Easy and transformed into Escherichia coli DH5α. A uniqueXbaI site was used to linearize the resulting plasmids containing babA, and a unique BamHI site was used to linearize babB-containing plasmids. The aphAcassette (kanamycin resistance) was amplified from plasmid pUC4K using primer pair C7491-C7492 or AN5485-AN5486 with either XbaI orBamHI restriction ends, respectively (Table 2). The resulting fragments were then cloned into the respective unique sites in babA and babB, creating plasmids pDP501 and pDP601. Strain J99 was transformed to Kan with both plasmids, and chromosomal DNA was isolated from each resulting strain. Insertion of the aphA cassette into babA or babBwas confirmed using primer pairs AN5045-AN5954 and AN5043-C5774, respectively.

Lewis B binding assay.

H. pylori strains were cultured for 48 h at 37°C in a 5% CO2 atmosphere on TSAB plates. Cells were harvested into 50 ml of brucella broth supplemented with 5% newborn calf serum and grown for another 48 h. Cells then were harvested and diluted to an optical density (OD) of 1.0 at 450 nm. One milliliter of each culture was then resuspended in a Phosphate-buffered saline (PBS) solution containing Thimersol and Tween 20 (PBSTT), plus 0.1% gelatin, and incubated at 37°C for 1 h. Cells were washed in PBSTT and incubated with or without 1 μg of Lewis B-human serum albumin glycocongugate (Isosep, Tullinge, Sweden) for 1 h at 37°C in PBS containing 0.5% bovine serum albumin. Cells were washed, followed by incubation for 1 h at 37°C with anti-Lewis B murine monoclonal antibody (Signet Pathology Laboratories, Inc., Dedham, Mass.) in PBS containing 0.1% gelatin and 0.5% bovine gamma globulin. Cells were washed, followed by incubation with anti-mouse immunoglobulin M horseradish peroxidase conjugate (Cappel Laboratories, Cochranville, Pa.) for 1 h at 37°C in a solution containing 0.1% bovine gamma globulin and 1% bovine serum albumin. Cells were washed twice and developed for 30 min in 500 μl of a solution containing 9.4 ml of 0.2 M Na2HPO4, 10.6 ml of 0.1 M citric acid, and 32 μl of H2O2. Results were expressed in relative units (OD × 1,000) as the OD difference between cells with Lewis B and those incubated without Lewis B. Each experiment was repeated on consecutive days.

Nucleotide sequence accession numbers.

babAGenBank accession numbers are AF277904 to AF277942, and babBGenBank accession numbers are AF277943 to AF277981 .

RESULTS

Classification of babA paralogues.

A GenBank BLAST search of the genome in H. pylori strain 26695 usingbabA (HP1243) nucleotides revealed 24 genes with scores of >100 and probabilities ranging from 0.0 to 0.998. For 11 genes, probabilities ranged from 0.0 × 10−118 to 1.9 × 10−118, whereas for the other 13 genes, probabilities were substantially lower. Using babA(JHP833) from H. pylori strain J99 to search that strain's ORFs, 22 genes were identified with scores of >100, with 10 having probabilities ranging from 0.0 × 10−95 to 7.5 × 10−95 and the other 12 genes having substantially lower scores. Therefore, our further analysis concentrated on the strongest paralogues in each strain.
All of the 11 genes from 26695 are members of a paralogous family, previously identified as outer membrane proteins through the characteristic C-terminal alternating hydrophobic residues (40), with at least one N-terminal domain of similarity and at least seven C-terminal domains of similarity. Two genes (HP25 and HP229) are predicted on the basis of sequence homology to encode porins (18, 19). These 11 genes (10 genes from J99) had significant 5′ similarity (over 350 nucleotides) but much greater 3′ similarity (over 750 to 1,500 nucleotides).
These 11 genes (10 genes in J99) share other properties. By Garnier's secondary structure analysis, each protein sequence was predicted to have parallel motifs at similar relative positions (data not shown). All have predicted basic pIs ranging from 8.47 to 9.24 (data not shown), except for two paralogues encoded by JHP212 and JHP1261 from strain J99 (both with pIs of 6.52). Each ORF has at least two predicted transmembrane domains (one each within the 3′ and 5′ regions), and all contain a predicted N-terminal signal peptidase I signal sequence (40). Based on the BLAST search probability scores and the similarity of the gene motifs and specific structures, we have classified these 11 genes (10 genes in J99) as babAparalogues.

Analysis of sequence similarities among the 26695 and J99babA paralogues.

Comparison of the 26695babA paralogues with reference gene babA (HP1243) from 26695 (data not shown) showed that the first segment (nucleotides 1 to 360) is well conserved among five genes; however, the other five genes either have a truncated version or do not possess this segment. Pairwise similarity comparisons show that HP227 and HP1342 are identical throughout their ORFs and that HP317 is nearly identical tobabA (HP1243) in this first segment; most of the paralogues have 61 to 100% similarity to each other in this same region (Table3). The next segment (nucleotides 361 to 760) is more variable in that none are >64% similar to one another, except for HP725 and HP722 (73% identical to each other), and the third segment (nucleotides 761 to 1510) also has substantial variability (Table 4). However, the fourth segment (nucleotide 1511 to the end of each ORF) showed similarities ranging from 67 to 100% among the different paralogues. These analyses indicate that there is substantial 5′ and 3′ conservation of these paralogues, with the greatest variation in the midregions. The mean (± standard deviation) (both calculated with Corel Quatro Pro) percent nucleotide similarities for the various regions of the babA paralogues were as follows: nucleotides 1 to 360, (68.0 ± 14.5)%; nucleotides 361 to 760, (42.9 ± 13.2)%; nucleotides 761 to 1510, (48.5 ± 13.7)%; and nucleotide 1511 to the end, (76.0 ± 8.11)%. A parallel analysis of the 10 babA paralogues in strain J99 identified similar relationships (data not shown).
Table 3.
Table 3. Pairwise nucleotide similarity matrix of the 26695babA paralogues for the 5′ regions
Paralogue (nucleotides 361 to 760)% Similarity for paralogue (nucleotides 1 to 360)a
HP1243HP317HP896HP25HP9HP227HP229HP722HP725HP1177HP1342
HP1243 9877767666b744366
HP31748 76767665744365
HP8964349 727470797370
HP25404664 6863834863
HP935363741 74414474
HP2274747545140 7461100
HP229 
HP722403636393636 
HP72536373741443873 3274
HP11774240343649384441 61
HP13424747545140100363838 
a
Percent similarity was determined by using the GCG gap program. Each value was rounded to the nearest integer, and values of >85% are represented in boldface.
b
—, no analysis was possible, since there was no specified sequence due to alignment of paralogues.
Table 4.
Table 4. Pairwise nucleotide similarity matrix of the 26695babA paralogues for the 3′ regions
Paralogue (nucleotide 7511 to end)% Similarity for paralogue (nucleotides 761 to 1510)a
HP1243HP317HP896HP25HP9HP227HP229HP722HP725HP1177HP1342
HP1243 62726056524038383852
HP317100 756163484037383948
HP8969797 7358533944443953
HP25727273 58533543423953
HP978778075 563841415456
HP2277878797279 38403838100
HP229696772727171 42384038
HP72269697474697668 893840
HP72569697474707667100 4138
HP1177717276747379707474 38
HP1342787879727910071767679 
a
Percent similarity was determined using the GCG gap program. Each value was rounded to the nearest integer, and values of >85% are represented in boldface.

Paralogue-flanking genes and intergenic regions.

Most of thebabA paralogues are in similar genomic locations in 26695 and J99, based on their flanking genes. Each is flanked by at least one gene that is identical between the strains with the exception of babA, babB, and HP25 (data not shown). Although the genes flanking babA in 26695 flankbabB in J99, the genes that flank babB in 26695 do not immediately flank babA in J99. The only significant identity in the upstream intergenic regions exists betweenbabA (HP1243) and HP317 (74% over 300 nucleotides), its most similar paralogue in strain 26695, and between HP722 and HP725 (88% over 290 nucleotides). In contrast, there is substantial (74 to 98%) identity in the intergenic regions downstream of babA, babB, and HP317. Although babA and HP317 are most similar, the regions downstream of babB and HP317 are 98% identical. Since HP317 and babB are flanked by identical downstream genes (HP316 and HP895), there is virtual identity for 900 bp downstream (data not shown).

Phylogenetic analysis of babA paralogues.

To better understand the origins of these genes, the phylogeny of the 21 paralogues from the two strains was determined (Fig.1). Each 26695 ORF is most closely related to an ORF in J99, indicating the substantial interstrain conservation of each paralogue. The identical ORFs in 26695 (HP227 and HP1342) are closely related to the identical ORFs in J99 (JHP212 and JHP1261). ORF HP317 has no J99 counterpart. The phylogeny confirms the expectation that babA, HP317, and babB all are highly related and that HP722 and HP725 are highly related to each other. All bootstrap values on the phylogram are ≥75, and essentially identical phylogenies were produced using maximum parsimony algorithms (data not shown).
Fig. 1.
Fig. 1. Nucleotide phylogram of babA paralogues fromH. pylori strains 26695 and J99. Sequences were obtained from The Institute for Genomic Research database (26695) and the database at Astra-Boston (J99), aligned using GCG Pileup, and analyzed using PAUP 4.0b2 neighbor-joining analysis based on Kimura's two-parameter model distance matrices. Bootstrap values (based on 500 replicates) are represented at each node, and the branch length index is represented below the tree. Strain names and gene numbers are indicated at the termination of each branch. The percent nucleotide similarity (determined using GCG Gap) of each pair of homologs is indicated in the right column. The brace is used to indicate the similarity between the two sets of paired ORFs from each strain. The GC content of each ORF is shown in the middle column. The mean GC content of the paralogues of the top (A) cluster of (41.4 ± 0.6)% is significantly lower than that for the bottom (B) cluster ([44.2 ± 1.0]%; P < 0.001) .
Although the genomes of 26695 and J99 have an overall 94% conservation at the nucleotide level (5), the 10 paired babAparalogues ([90.4 ± 2.4]%) are less highly related (Fig. 1). Levels of amino acid identity generally parallel the nucleotide identity for each pair of homologs (data not shown). The average GC content for the H. pylori genome is 39% (5, 40), while for the 11 paralogues in 26695, the mean GC content is (42.9 ± 1.6)% (range from 41 to 46%). For the J99 paralogues, the mean GC content is (43.1 ± 1.6)% (range from 41 to 45%). The GC content of the cluster (Fig. 1, cluster A) of paralogues including hopA is (41.4 ± 0.6)%, whereas for the cluster (Fig. 1, cluster B) of paralogues includingbabA it is (44.2 ± 1.0)%. This significant (P < 0.001, Student's t test) difference is further support for the validity of the phylogram.

Allelic diversity within babA.

Because of the segmental conservation and diversity among these paralogues, we next examined the best-characterized paralogues, babA andbabB, for their diversity among different H. pylori strains. From multiple alignments of the four and three sequences of babA and babB available from different strains, respectively (5, 25, 40; unpublished data), there is strong 5′ and 3′ conservation for both genes, with the greatest variation occurring in the midregions (data not shown). The greatest diversity occurs between nucleotides 612 and 1046 inbabA (86% mean identity) and between 488 and 1010 inbabB (91% mean identity). We then examined thesebabA and babB regions by directly sequencing PCR products from 42 H. pylori strains from around the world (Table 1). The babA sequences from the different strains are highly variable (Fig. 2); however, the segment including predicted amino acids 306 to 334 (nucleotides 915 to 999) shows five distinct families of variants, designated allele groups AD1 (babA diversity allele 1), AD2, AD3, AD4, and AD5 (Table1), as determined through similarity plot analysis (Fig.3A and data not shown). However, there exists a spectrum of similarity among the different babAallele groups, where groups AD3 and AD4 are most similar to one another, followed by group AD2, while groups AD1 and AD5 have little overall similarity to any of the other allele groups (Fig. 3B and data not shown). Phylogenetic analyses of nucleotides 915 to 999 also support the grouping of five separate babA variant families (Fig. 4, segment 1 phylogram), yielding results congruent with those of the similarity plots, while parallel analyses of the flanking nucleotides (830 to 914 and 1000 to 1084) do not show these allelic groupings (Fig. 4 and data not shown). All 42 strains belong to one of these allele groups, with AD5 strain 97-147 resembling each of the other four allele groups (Fig. 2).
Fig. 2.
Fig. 2. Amino acid alignment of 42 babA diversity region fragments. Amino acid translations of nucleotide sequences were performed using GCG Translate, and sequences were aligned using GCG Pileup. A consensus sequence (based on a plurality of 10) was determined using GCG Pretty, and all the sequences were displayed using Boxshade 3.2. The dots refer to sites where amino acids match those of the consensus sequence, the hyphens represent deletions, the asterisks represent sites where the consensus sequence is indeterminate, and the boxes are used to separate different allele groups among thebabA sequences. Each allele group is based on amino acids 306 to 334 and is represented at the right of the alignment. ThebabA diversity alleles 1 to 5 are represented as AD1 to AD5, respectively.
Fig. 3.
Fig. 3. Similarity plots of babA andbabB allele groups. Each plot was generated using 26 strains representing each of the babA and babB allele groups with a 100-bp window, a 10-bp step, and a Jukes-Cantor correction, using Simplot 2.5. Strains 96-71, 95-76, 98-292, 97-503, 97- 679, 97-723, 96-68, 98-12, 98-60, 96-68, 98-12, 98-60, 96-28, CCUG17875, 97-5, 97-687, 98-53, 98-344, 96-10, 95-10, 97-147, 98-18, 98-14, 98-24, 97-776, 98-10, 97-793, 84-183, and 26695 were used to represent the five different babA allele groups, and strains 95-10, 98-14, 98-10, 98-12, 98-24, 90-40, 98-292, 97-5, 84-183, 26695, CCUG17875, 97-503, 97-723, 98-18, 98-32, 98-22, 95-23, 97-776, 98-77, 98-16, 96-10, 98-30, 98-344, 98-454, and 98-60 were used to represent the three different babB allele groups. Allele groups AD1 and BD1 are depicted in black, groups AD2 and BD2 are depicted in gray, groups AD3 and BD3 are depicted in red, group AD4 is depicted in blue, and group AD5 is depicted in green. (A) Plot of babAsequences, using 96-71 (AD1) as the reference strain. (B) Plot ofbabA sequences, using 97-679 (AD4) as the reference strain. (C) Plot of babB sequences, using 95-10 (BD1) as the reference strain. (D) Plot of babB sequences, using 98-18 (BD3) as the reference strain.
Fig. 4.
Fig. 4. Phylograms of babA segment 1 (nucleotides 915 to 999 on the left) and babA segment 3 (nucleotides 1000 to 1084 on the right). Segment 1 includes the babA allele group-defining sequences, and segment 3 is the region downstream of segment 1. Regions from 42 strains underwent sequence analysis as previously described and were aligned using GCG Pileup, and specific regions were identified using GCG Pretty. The sequences then were realigned using ClustalW and subjected to phylogenetic analysis using Paup 4.0b2 neighbor-joining analysis based on Kimura's two-parameter model distance matrices. Strain number, geographic location, and allelic type (AD1 to AD5) are represented at the termination of each branch, and the branch length index is shown below each tree. Bootstrap values of >50 (based on 500 replicates) are shown at each node. Colors are used to represent strains from the United States (orange), Asia (other than India) (blue), India (green), and other countries or groups (black).

Allelic variation within babB.

Analysis of the translated babB sequences (nucleotides 488 to 1010 of 26695babB) suggests that babB is more conserved than is babA; however, the region between amino acids 216 and 252 (nucleotides 646 to 754) shows three families of variants, designated allele groups BD1 (babB diversity allele 1), BD2, and BD3 (Fig. 5 and Table 1), as initially discovered through similarity plot analysis (Fig. 3C and data not shown). As for babA, there also exists a spectrum of similarity, where allele groups BD2 and BD3 are most similar to one another, while group BD1 has little overall similarity to the other allele groups (Fig. 3D). Phylogenetic analysis also reinforces these groupings (Fig. 6, segment 1 phylogram), but these groupings are not present in the flanking (537 to 645 and 755 to 863) nucleotides. Importantly, strains that are clustered based on particular babA allelic types are not clustered based on their babB allelic types (Fig. 4 and 6).
Fig. 5.
Fig. 5. Amino acid alignment of babB diversity regions from 42 different H. pylori strains. Amino acid translations of nucleotide sequences were performed using GCG Translate, and sequences were aligned using GCG Pileup. A consensus sequence (based on a plurality of 10) was determined using GCG Pretty, and all the sequences then were displayed using Boxshade 3.2. The dots refer to sites where amino acids match those of the consensus sequence, the hyphens represent deletions, the asterisks represent sites where the consensus sequence is indeterminate, and the boxes are used to separate different allele groups among the babB sequences. Each allele group is based on amino acids 216 to 252 and is represented at the right of each alignment. The babB diversity alleles 1 to 3 are represented as BD1 to BD3, respectively.
Fig. 6.
Fig. 6. Phylograms of babB segment 1 (nucleotides 646 to 754 on the left) and babB segment 3 (nucleotides 755 to 863 on the right). Segment 3 is the region downstream of segment 1, which is the region from which the alleles of babB were described. Fragments from 42 strains underwent sequence analysis as previously described and were aligned using GCG Pileup, and specific regions were identified using GCG Pretty. The sequences were then realigned using ClustalW and subjected to phylogenetic analysis using Paup 4.0b2 neighbor-joining analysis based on Kimura's two-parameter model distance matrices. Strain numbers, geographic locations, and allelic types are represented at the termination of each branch, and the branch length index is shown below each tree. Bootstrap values of >50 (based on 500 replicates) are shown at each node. Colors are used to represent strains from the United States (orange), Asia (not including India) (blue), India (green), and other countries (black).

Geographic variation within babA andbabB.

Phylogenetic analysis of the regions 3′ (segment 3) of the babA and babB allele group regions demonstrates geographic clustering of the strains (Fig. 4 and 6). All but three of the U.S. strains cluster on the upper half of thebabA segment 3 phylogram, while two of the remaining three U.S. strains cluster near three of the four South American strains on the lower half. Most strains from Europe and Oceania cluster with the U.S. strains. Many of the Indian strains form a small cluster on the upper half, and all of the East Asian strains are found on the lower half of the phylogram. In the phylogram of babB segment 3, all of the Japanese strains are present on the upper branch, whereas all but one of the U.S. and South American strains are on the lower half. Thus, for both babA and babB, allele groupings (segment 1) that are not related by geographic origin of the strains and that are 5′ of regions (segment 3) that are largely related according to geographic origin are present. The regions 5′ of segment 1 for both genes also show evidence of geographic clustering (data not shown).

Comparison of babA and babB diversity regions.

Among the 42 strains studied, the babAdiversity region varied in length from 434 to 488 bp (>10%), whereas there was little babB length heterogeneity (514 to 526 bp). The babB diversity region also was much more conserved at both the nucleotide (87 versus 78%) and amino acid (84 versus 74%) levels than was the babA diversity region. Nearly everybabA and babB fragment was unique, except that Indian strains 98-344 and 97-679 had identical babBfragments while differing in other genotypes (Table 1). The GC contents for the fragments (46 and 43% for babA and babB, respectively) resembled those for the entire genes (Fig. 1). Both transitions and transversions occurred at high levels but more often inbabA than in babB (data not shown), further indicating the diversity in these regions.

Analysis of synonymous and nonsynonymous substitutions in thebabA and babB diversity regions.

To explore the evolutionary basis for variation in thebabA and babB diversity regions, we analyzedKs (mean number of synonymous substitutions per site), Ka (mean number of non synonymous substitutions per site), and theKa/Ks ratio for all possible pairs of aligned sequences. Since synonymous base changes do not alter amino acid translations, their usage is less constrained than that of nonsynonymous base changes, and thus Ksshould roughly reflect the divergence time between sequences. The greater value of Ks for babA than forbabB (0.33 versus 0.23) suggests a more distant common ancestor for babA. Since theKa/Ks ratio controls for differences in divergence time, the ratios' similarity forbabA and babB (0.42 versus 0.51) indicates that these two regions are subject to parallel functional constraints, which are less than previously determined within single alleles for the 5′ region of cagA (42) and the midregion ofvacA (11). Comparisons ofKa/Ks ratios among thebabA or babB allele groups indicate that the functional constraints for each allele group are similar (data not shown).

Homoplasy and recombination in the babA andbabB fragments.

Homoplasy is similarity within a locus that is not attributable to common ancestry and may reflect convergent, parallel, or reverse evolution (MacClade). In the absence of specific selective influences, recombination is believed to be the most important mechanism resulting in homoplasies on bacteria. The consistency index measures the amount of homoplasies in a population (from 0.0 to 1.0), where 1.0 indicates the absence of homoplasy. Phylograms of the babA and babB diversity regions have identical consistency indices of 0.34, indicating that there is substantial homoplasy among both these sequences. To determine whether the homoplasy is due to recombination, both the babA andbabB diversity region fragments then were examined by the homoplasy test (33); which measures the frequency with which the same nucleotide changes (homoplasies) occur in different branches of a maximum parsimony tree. The product, the Hratio, represents the frequency of observed synonymous homoplasies relative to the frequencies expected for the observed sequence variation under clonal descent (in the absence of recombination,H = 0) or free recombination (H = 1) (1, 33, 38). Several H. pylori housekeeping genes yield ratios ranging from 0.60 to 0.79, indicating high levels of recombination for these genes (1). AlthoughvacA gene fragments yield an H ratio of 0.69,cagA gene fragments yield H ratios of 0.15 to 0.17, among the lowest ratios yet observed for any species (38). babA and babB fragments yielded H ratios of 0.71 and 0.67, respectively, indicating that recombination is frequent in both of these loci and of a magnitude similar to that in the housekeeping genes.
As a further measure of the extent of recombination withinbabA and babB, compatibility matrices were used. In compatibility matrices, white spaces represent pairs of informative sites that are compatible with a maximal parsimony tree, whereas black spaces show pairs of sites that are incompatible. The mean compatibility score measures the average frequency with which pairs of informative sites are compatible. Both the babA and babB matrices are largely filled with black spaces (Fig. 7), and both yield similarly low mean compatibility scores (0.31 and 0.34, respectively), confirming that repeated recombination events have been involved in the evolution of both gene segments.
Fig. 7.
Fig. 7. Compatibility analysis of the babA (left) and the babB (right) diversity regions from 42 H. pylori strains. In the matrix, compatible sites are represented by white boxes and incompatible sites are represented as black boxes. Series of incompatible sites probably arose from repeated recombination, whereas compatible sites arose under clonal descent. Matrices were created using Reticulate (26).

Lewis B binding for babA and babB allele groups.

To determine if there is any association betweenbabA and babB allele groups and the ability of the strains to bind Lewis B, the Lewis B binding capacity of a group of strains representing each allele group was assessed. There was substantial variation from <10 to 658 units (mean, 346 ± 282) (see Materials and Methods for description of units) among these strains; however, there was no association between the allele group status of strains from each allele group and Lewis B binding (data not shown). Each babB allele group included strains that bound Lewis B well, or not at all, while strains from each babAallele group showed a broad range of values. Isogenic babAand babB mutants also were tested in the assay (Table5). The babA mutants failed to bind Lewis B, whereas the babB mutants bound to a similar degree as did the wild-type strains (Table 5). These results further indicate that, while babA is involved in binding to Lewis B (25), babB is not and that the particularbabA and babB allele groups are not determining factors in Lewis B binding.
Table 5.
Table 5. Lewis B binding of J99 wild-type and J99babA2 and babB mutant strains
J99 genotypeLewis B binding (relative OD units)b, c
Mean ± SDRange
Wild type653 ± 8647–658
babA2a8 ± 17<10–20
babBa596 ± 111517–674
a
Genes disrupted by insertion ofaphA (Kanr) cassette in specified ORF.
b
Values represent the means of four separate experiments.
c
Relative units are expressed as OD × 1,000 as described in Materials and Methods.

DISCUSSION

We classified the 11 (10 in J99) babA-related genes in 26695 as babA paralogues by several criteria, including the natural break found in the BLAST search probability scores in both strains J99 and 26695 and their substantial N- and C-terminal similarity. Both babA (HP1243) and hopZ (HP9) have been shown to be involved in H. pylori adherence to gastric cells (25, 36), and paralogues hopD(HP25) and hopA (HP229) appear to be porins, involved in molecular transport across the H. pylori membrane (40). These 11 paralogues do not constitute the fullH. pylori repertoire for adherence or molecular transport, since other members (HP912 and HP913) of the family of outer membrane proteins have been shown previously to be involved in adherence (34). HP912, HP913, and HP706 also appear to function as porins (19, 40).
The strong similarities at both the N and C termini of most of these paralogues imply their necessity for conserved functions, whereas the central variable regions likely encode unique functions. The predicted transmembrane domains in the conserved N- and C-terminal regions of each paralogue could serve as membrane anchors, with the variable regions forming extracellular loops involved in specific ligand binding, or other unique functions. The extensive 5′ and 3′ identities among the paralogues also could facilitate both intrastrain or interstrain recombination; interstrain recombination would be a powerful mechanism for increasing the functional repertoire of the recipient strains. That babA and babB are in opposite locations in relation to flanking genes in strains J99 and 26695 suggests that a reciprocal exchange could have occurred (8). The high (74 to 98%) level of identity between the downstream intergenic regions of HP317, babA, andbabB and the downstream gene identity for HP317 andbabB also could promote recombination.
Gene duplications among the babA paralogues have been observed for all three strains studied; strain CCUG17875 has two copies of babA (25), whereas strains 26695 and J99 have identical paralogues HP227 and HP1342 and JHP212 and JHP1261, respectively. Gene duplications potentially yield additional functional copies of the gene for enhanced expression of the product (14). The duplicate genes also may serve as the foci of gene conversion events that result in horizontal genetic movement between the copies (6, 7, 23). In strain CCUG17875, babA2 (but not babA1) is necessary for Lewis B binding (25). Thus, babA1 does not increase binding efficiency but may be useful for increasingbabA diversity through gene conversion.
Segments of DNA, such as pathogenicity islands, that differ in GC content from the surrounding chromosome are believed to result from relatively recent cross-species acquisitions (29). That the babA paralogues have a consistently higher GC content (mean, 43%) than does the H. pylori genome (39%) suggests that they may have been a relatively recent genomic acquisition, followed by gene duplication events leading to the presence of multiple paralogues. The presence of HP317 in strain 26695 (absent in J99) could be explained by an even more recent gene duplication event or conversely by its deletion in J99. That the amount of intergenic similarity for each of the babA paralogues (90.4%) in 26695 and J99 is significantly less than that for the entire genomes (94.0%) suggests that, on average, they are diversifying faster than the rest of the H. pylori genome.
Allelic variation within individual babA paralogues has been previously reported for HP9 (hopZ; with two distinct alleles) and for HP1342 and HP227 (27, 36). ForhopZ (36), the region of greatest diversity is located in nearly the analogous position to the babA andbabB diversity regions. Despite the substantial polymorphism that exists throughout the babA fragments, the 42 strains that we studied cluster phylogenetically almost exactly according to the diversity present in the 84-nucleotide allele group segment (data not shown); that this region dominates the phylogenetic structure suggests an important functional role. Strain 97-147 (AD5), which has features of each of the other babA allele groups, is most closely related to another South American strain (90-40, AD2). Its unique sequence indicates the possibility of recombination within the babA allele groups in the diversity region, paralleling (but less common than) recombination within the babB allele group region.
Most of the diversity in the babB fragments exists in a 108-nucleotide segment. Each of the different allele groups ofbabB contains greater diversity than that which exists within each babA allele group (Fig. 2, 3, and 5), which likely is due to interallelic recombination. In contrast tobabA, phylogenetic analyses of the entire babBfragments show that strains do not segregate according to the allele groups but rather based on the geographic origin of each strain (data not shown). That the babA and babB allele group regions are largely independent of geographic origin but are flanked by regions that show evidence of geographic variation implies that these allele groups have moved horizontally throughout the H. pylori population. In support of this hypothesis, the similarity plot analysis indicates that the borders of each of the allele groups may represent recombination breakpoints (Fig. 3).
Overall, for the sequences studied, babA shows much more variation in length and lower average similarity thanbabB. In the third codon position, transversions are more likely to change amino acids in coding sequences, while transitions almost always leave coding sequences intact (Met and Trp are the only exceptions); thus, transitional substitutions tend to predominate for most species (30). In H. pylori, transitions account for most interstrain diversity, accounting for a mean of 80% (range, 66 to 94%) of the polymorphisms (45). However, in both babA and babBsequences, transitions account for only 50% of the polymorphisms; thus, transversions are far more common than the typical H. pylori gene (45). Because both babA andbabB are outer membrane proteins, the diversity observed in this region in both genes may result from selection, possibly due to ongoing immune recognition.
Analysis of synonymous and nonsynonymous substitutions among thebabA and babB diversity region fragments indicates that the babB fragments share a more recent common ancestor and that, as measured by theKa/Ks ratio, both gene products are under similar functional constraints. That, for both genes, the Ka/Ks ratios for comparisons of sequences from specific allele groups are little different from one another (data not shown) and are considerably less than those observed for fragments within single alleles of eithercagA and vacA indicates that there are similar functional restrictions between these allele groups (11, 42).
The existence of substantial recombination in both the babAand babB fragments is supported by the consistency index, compatibility matrices, and the homoplasy test. Homoplasies can arise through recombination or independent mutations, and the Hratio gives a measure of the observed synonymous homoplasies relative to the observed sequence variation (1, 33). TheH ratios for babA and babB indicate that substantial recombination has likely occurred within both genes and at a level similar to that observed previously for several housekeeping genes (1). However, the presence of the allele groups and the geographic variation in both babA andbabB (Fig. 4 and 6) indicate that recombination has not been sufficient to totally obscure evidence of clonal descent and suggest specific functional differences among the different allele groups. The high frequency of recombination for both babA andbabB helps explain why the phylograms of the same strains are not congruent.
The presence of well-conserved allele groups in the babAand babB diversity regions implies an important functional role. babA has been shown previously to be responsible for Lewis B binding in H. pylori(25), which we now confirm. Because of the substantial similarity of babB and babA, babB might be involved in Lewis B binding as well; however, our data clearly indicate that neither the presence of babB allele groups nor the presence of babA allele groups is a determining factor in Lewis B binding.
In summary, both substantial conservation and variation exist among the babA paralogues. Two such paralogues,babA and babB, show both geographic and allelic group-associated variation in their predicted regions of maximum diversity. babB fragments appear to share a more recent common ancestor, but both the babA andbabB gene products are under similar functional constraints. Although recombination accounts for much of the variation inbabA and babB, as for vacA, it is not sufficient to obscure clonal structures present in both genes. Despite the involvement of babA in Lewis B binding, neither the babA allele groups nor the babBallele groups are determining factors in Lewis B binding. Whether the presence of different alleles of babA and babBhas other functional implications could aid in our understanding ofH. pylori-host ligand interactions.

ACKNOWLEDGMENTS

This work was supported in part by the Vanderbilt University Medical Scientist Training Program, the UNCF-Merck Science Initiative, and RO1 DK 53707 and the Cancer Center Core grant CA 68485 from the National Institutes of Health.
We thank Judith Romero and Margarita Carmolinga for the donation of DNA samples.

REFERENCES

1.
Achtman M., Azuma T., Berg D. E., Ito Y., Morelli G., Pan Z.-J., Suerbaum S., Thompson S. A., van der Ende A., and van Doorn L. Recombination and clonal groupings within Helicobacter pylori from different geographical regions.Mol. Microbiol.321999459-470
2.
Akopyanz N. S., Bukanov N. O., Westblom T. U., Kresovich S., and Berg D. E. DNA diversity among clinical isolates of Helicobacter pylori detected by PCR-based RAPD fingerprinting.Nucleic Acids Res.2019925137-5142
3.
Akopyanz N. S., Bukanov N. O., Westblom T. U., and Berg D. E. PCR-based RFLP analysis of DNA sequence diversity in the gastric pathogen Helicobacter pylori.Nucleic Acids Res.2019926221-6225
4.
Akopyants N. S., Clifton S. W., Kersulyte D., Crabtree J. E., Youree B. E., Reece C. A., Bukanov N. O., Drazek E. S., Roe B. A., and Berg D. E. Analyses of the cag pathogenicity island of Helicobacter pylori.Mol. Microbiol.28199837-54
5.
Alm R. A., Ling L.-S. L., Moir D. T., King B. L., Brown E. D., Doig P. C., Smith D. R., Noonan B., Guild B. C., deJonge B. L., Carmel G., Tummino P. J., Caruso A., Uria-Nickelsen M., Mills D. M., Ives C., Gibson R., Merberg D., Mills S. D., Jiang Q., Taylor D. E., Vovis G. F., and Trust T. J. Genomic-sequence comparison of two unrelated isolates of the human gastric pathogen Helicobacter pylori.Nature3971999176-180
6.
Alm R. A., Guerry P., and Trust T. J. Distribution and polymorphism of the flagellin genes from isolates of Campylobacter coli and Campylobacter jejuni.J. Bacteriol.17519933051-3057
7.
Alm R. A., Guerry P., and Trust T. J. Significance of duplicated genes in Campylobacter.J. Mol. Biol.2301993359-363
8.
Alm R. A., Bina J., Andrews B. M., Doig P., Hancock R. E. W., and Trust T. J. Comparative genomics of Helicobacter pylori: analysis of the outer membrane protein families.Infect. Immun.6820004155-4168
9.
Altschul S. F., Madden Y. L., Schaffer A. A., Zhang J., Zhang Z., Miller W., and Lipman D. J. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.Nucleic Acids Res.2519973389-3402
10.
Atherton J. C., Cao P., Peek R. M. J., Tummuru M. K., Blaser M. J., and Cover T. L. Mosaicism in vacuolating cytotoxin alleles of Helicobacter pylori. Association of specific vacA types with cytotoxin production and peptic ulceration.J. Biol. Chem.270199517771-17777
11.
Atherton J. C., Sharp P. M., Cover T. L., Gonzalez-Valencia G., Peek R. M., Thompson S. A., Hawkey C. J., and Blaser M. J. Vacuolating cytotoxin (vacA) alleles of Helicobacter pylori comprise two geographically widespread types, m1 and m2, and have evolved through limited recombination.Curr. Microbiol.391999211-218
12.
Blaser M. J. and Parsonet J. Parasitism by the “slow” bacterium Helicobacter pylori leads to altered gastric homeostasis and neoplasia.J. Clin. Investig.9419944-8
13.
Boren T., Falk P., Roth K. A., Larson G., and Normark S. Attachment of Helicobacter pylori to human gastric epithelium mediated by blood group antigens.Science26219931892-1895
14.
Brown C. J., Todd K. M., and Rosenzweig R. F. Multiple duplications of yeast hexose transport genes in response to selection in a glucose-limited environment.Mol. Biol. Evol.151998931-942
15.
Censini S., Lange C., Xiang Z., Crabtree J. E., Ghiara P., Borodovsky M., Rappuoli R., and Cocacci A. cag, a pathogenicity island of Helicobacter pylori, encodes type I-specific and disease-associated virulence factors.Proc. Natl. Acad. Sci. USA93199614648-14653
16.
Covacci A., Censini S., Bugnoli M., Petracca R., Burroni D., Macchia G., Massone A., Papini E., Xiang Z., and Figura N. Molecular characterization of the 128-kDa immunodominant antigen of Helicobacter pylori associated with cytotoxicity and duodenal ulcer.Proc. Natl. Acad. Sci. USA9019935791-5795
17.
Cover T. L., Tummuru M. K., Cao P., Thompson S. A., and Blaser M. J. Divergence of genetic sequences for the vacuolating cytotoxin among Helicobacter pylori strains.J. Biol. Chem.269199410566-10573
18.
Doig P., Exner M. M., Hancock R. E. W., and Trust T. J. Isolation and characterization of a conserved porin protein from Helicobacter pylori.J. Bacteriol.17719955447-5452
19.
Exner M. M., Doig P., Trust T. J., and Hancock R. E. W. Isolation and characterization of a family of porin proteins from Helicobacter pylori.Infect. Immun.6319951567-1572
20.
Fujimoto S., Marshall B., and Blaser M. J. PCR-based restriction fragment polymorphism typing of Helicobacter pylori.J. Clin. Microbiol.321994331-334
21.
Garnier J., Osguthorpe D. J., and Robson B. Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins.J. Mol. Biol.120197897-120
22.
Go M. F., Kapur V., Graham D. Y., and Musser J. M. Population genetic analysis of Helicobacter pylori by multilocus enzyme electrophoresis: extensive allelic diversity and recombinational population structure.J. Bacteriol.17819963934-3938
23.
Guerry P. R., Alm R. A., Power M. E., Logan S. M., and Trust T. J. Role of two flagellin genes in Campylobacter motility.J. Bacteriol.17319914757-4764
24.
Hofmann K. and Stoffel W. TMbase—a database of membrane spanning protein segments.Biol. Chem. Hoppe-Seyler3471993166
25.
Ilver D., Arnqvist A., Ögren J., Frick I.-M., Kersulyte D., Incecik E. T., Berg D. E., Covacci A., Engstrand L., and Borèn T. Helicobacter pylori adhesin binding fucosylated histo-blood group antigens revealed by retagging.Science2791998373-377
26.
Jacobson I. B and Easteal S. A program for calculating matrices as an aid in determining reticulate evolution in molecular sequences.Comput. Appl. Biosci.121996291-295
27.
Kersulyte D., Chalkauskas H., and Berg D. E. Emergence of recombinant strains of Helicobacter pylori during infection.Mol. Microbiol.31199931-43
28.
Kuipers E. J., Perez-Perez G. I., Meuwissen S. G., and Blaser M. J. Helicobacter pylori and atrophic gastritis: importance of the cagA status.J. Natl. Cancer Inst.8719951777-1780
29.
Lawrence J. G. and Ochman H. Amelioration of bacterial genomes: rates of change and exchange.J. Mol. Evol.441997383-397
30.
Li W.-H. Molecular evolution 1997 309 -334 Sinauer Associates, Inc., Publishers Sunderland, Mass
31.
Logan R. P. H. and Berg D. E. Genetic diversity of Helicobacter pylori.Lancet34819961462-1463
32.
Marshall D. G., Coleman D. C., Sullivan D. J., Xia H., O'Morain C. A., and Smyth C. J. Genomic DNA fingerprinting of clinical isolates of Helicobacter pylori using short oligonucleotide probes containing repetitive sequences.J. Appl. Bacteriol.811996509-517
33.
Maynard Smith J. and Smith N. H. Detecting recombination from gene trees.Mol. Biol. Evol.151998590-599
34.
Odenbreit S., Till M., Hofreuter D., Faller G., and Haas R. Genetic and functional characterization of the AlpAB gene locus essential for the adhesion of Helicobacter pylori to human gastric tissue.Mol. Microbiol.3119991537-1548
35.
Page R. D. M. TREEVIEW: an application to display phylogenetic trees on personal computers.Comput. Appl. Biosci.121996357-358
36.
Peck B., Ortkamp M., Diehl K. D., Hundt E., and Knapp B. Conservation, localization, and expression of hopZ, a protein involved in adhesion of Helicobacter pylori.Nucleic Acids Res.2719993325-3333
37.
Rozas J. and Rozas R. DnaSP version 3.0: an integrated program for molecular population genetics and molecular evolution analysis.Bioinformatics151999174-175
38.
Suerbaum S., Smith J. M., Bapumia K., Morelli G., Smith N. H., Kunstmann E., Dyrek I., and Achtman M. Free recombination within Helicobacter pylori.Proc. Natl. Acad. Sci. USA95199812619-12624
39.
Taylor N. S., Fox J. G., Akopyants N. S., Berg D. E., Thompson N., Shames B., Yan L., Fontham E., Janney F., Hunter F. M., and Correa P. Long-term colonization with single and multiple strains of Helicobacter pylori assessed by DNA fingerprinting.J. Clin. Microbiol.331995918-923
40.
Tomb J.-F., White O., Kerlavage A. R., Clayton R. A., Sutton G. G., Fleischman R. D., Ketchum K. A., Klenk H. P., Gill S., Dougherty B. A., Nelson K., Quackenbush J., Xhou L., Kirkness E. F., Peterson S., Loftus B., Richardson D., Dodson R., Khalak H. G., Glodek A., McKenney K., Fitzgerald L. M., Lee N., Adams M. D., Hickey E. K., Berg D. E., Gocayne J. D., Utterback T. R., Peterson J. D., Kelley J. M., Cotton M. D., Weidman J. M., Fujii C., Bowman C., Watthey L., Wallin E., Hayes W. S., Borodovsky M., Karp P. D., Smith H. O., Fraser C. M., and Venter J. C. The complete genome sequence of the gastric pathogen Helicobacter pylori.Nature3881997539-547
41.
Tummuru M. K., Cover T. L., and Blaser M. J. Cloning and expression of a high-molecular-mass major antigen of Helicobacter pylori: evidence of linkage to cytotoxin production.Infect. Immun.6119931799-1809
42.
van Doorn L.-J., Figueiredo C., Sanna R., Blaser M. J., and Quint W. G. V. Distinct variants of Helicobacter pylori cagA are associated with vacA subtypes.J. Clin. Microbiol.3719992306-2311
43.
van Doorn L.-J., Figueiredo C., Sanna R., Pena S., Midolo P., Ng E. K. W., Atherton J. C., Blaser M. J., and Quint W. G. V. Expanding allelic diversity of Helicobacter pylori vacA.J. Clin. Microbiol.3619982597-2603
44.
van Doorn L. J., Figueiredo C., Sanna R., Plaisier A., Schneeberger P., Boer W. D., and Quint W. G. V. Clinical relevance of the cagA, vacA, and iceA status of Helicobacter pylori.Gastroenterology115199858-66
45.
Wang G. E., Humayun M. Z., and Taylor D. E. Mutation as an origin of genetic variability in Helicobacter pylori.Trends Microbiol.71999488-493
46.
Wilkins M. R., Gasteiger E., Bairoch A., Sanchez J.-C., Williams K. L., Appel R. D., and Hochstrasser D. F. Protein identification and analysis in the ExPaSy Server 2-D proteome analysis protocols. Link A. J. 1998 531 -552 Humana Press, Inc. Totowa, N.J
47.
Wilson K. Preparation of genomic DNA from bacteria Current protocols in molecular biology Ausubel F. M., Brent R., Kingston R. E., Moore D. D., Seidman J. G., Smith J. A., and Struhl K. 1 1995 2.4.1-2.4.5 John Wiley & Sons, Inc. New York, N.Y
48.
Yamoaka Y., Kodama T., Gutierrez O., Kim J. G., Kashima K., and Graham D. Y. Relationship between Helicobacter pylori iceA, cagA, and vacA status and clinical outcome: studies in four different countries.J. Clin. Microbiol.3719992274-2279

Information & Contributors

Information

Published In

cover image Infection and Immunity
Infection and Immunity
Volume 69Number 21 February 2001
Pages: 1160 - 1171
Editor: A. D. O'Brien
PubMed: 11160014

History

Received: 3 July 2000
Returned for modification: 18 October 2000
Accepted: 17 November 2000
Published online: 1 February 2001

Permissions

Request permissions for this article.

Contributors

Authors

David T. Pride
Division of Infectious Diseases, Department of Medicine, Vanderbilt University School of Medicine and VA Medical Center, and Department of Microbiology and Immunology, Vanderbilt University, Nashville, Tennessee,1 and
Richard J. Meinersmann
USDA Agricultural Research Service, Athens, Georgia2
Martin J. Blaser
Division of Infectious Diseases, Department of Medicine, Vanderbilt University School of Medicine and VA Medical Center, and Department of Microbiology and Immunology, Vanderbilt University, Nashville, Tennessee,1 and

Editor

A. D. O'Brien

Metrics & Citations

Metrics

Note:

  • For recently published articles, the TOTAL download count will appear as zero until a new month starts.
  • There is a 3- to 4-day delay in article usage, so article usage will not appear immediately after publication.
  • Citation counts come from the Crossref Cited by service.

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. For an editable text file, please select Medlars format which will download as a .txt file. Simply select your manager software from the list below and click Download.

View Options

Figures

Tables

Media

Share

Share

Share the article link

Share with email

Email a colleague

Share on social media

American Society for Microbiology ("ASM") is committed to maintaining your confidence and trust with respect to the information we collect from you on websites owned and operated by ASM ("ASM Web Sites") and other sources. This Privacy Policy sets forth the information we collect about you, how we use this information and the choices you have about how we use such information.
FIND OUT MORE about the privacy policy